The instant application contains a Sequence Listing which has been submitted in 20 .xml format via Patent Center in accordance with 37 C.F.R. §§ 1.831 to 1.834, and is hereby incorporated by reference in its entirety for all purposes. The Sequence Listing written in .xml format is named 1369949.xml, was created on Jan. 19, 2023, and is 31,135 bytes in size.
Industrial production of chemicals has benefited tremendously and will continue to benefit from the incorporation of biocatalysis into its synthetic processes. Biocatalysis is environmentally friendly, sustainable, highly efficient, and selective for the desired molecular transformation. It has an especially high potential in industrial production of carbohydrates and glycoconjugates that are soluble in aqueous solution which is a preferred medium for enzyme-catalyzed reactions. The enzyme stabilization power of high concentration carbohydrate substrates has also been shown, adding another advantage for carbohydrate production by biocatalysis. With the growing demands for carbohydrates as pharmaceuticals, nutraceuticals, and ingredients for foods and health beneficial products, developing biocatalytic strategies for their synthesis will become increasingly important especially if their natural sources are limited and their chemical synthesis is challenging. Human milk oligosaccharides (HMOs) are among the well-suited attractive carbohydrate targets for industrial biocatalytic production.
In addition to lactose, lipids, and proteins, human milk oligosaccharides (HMOs) are a major component of human milk which is the sole food for breast-fed infants in the first few months of their lives. HMOs cannot be digested directly by infants but contribute significantly to the health benefits of breastfeeding. Their roles as prebiotics, decoys to pathogenic bacteria and bacteriostatic agents, gut epithelial cell maturation and immune regulators, nutrients for brain development and their applications as infant formula supplements, potential nutraceuticals and therapeutics are being continuously explored.
More than 150 HMOs structures are known. They are extended from a free reducing end disaccharide, lactose, with β1-3/6-linked N-acetyl-D-glucosamine (GlcNAc) and β1-3/4-linked D-galactose (Gal), and with or without α2-3/6-linked N-acetylneuraminic acid (Neu5Ac) and/or α1-2/3/4-linked L-fucose (Fuc). The progress in exploring the biological functions of HMOs and their applications is highly dependent on the access to structurally defined HMOs in sufficient quantity which is currently limited. However, only a portion of structurally characterized HMOs have been synthesized by chemical, enzymatic and chemoenzymatic, and fermentation approaches. HMOs and their sialylated and/or fucosylated derivatives including those with short chain and long-chain structures with or without branches are challenging targets for chemical synthesis or fermentation production but are well suited for biocatalytic production using purified or partially purified enzymes. With the identification and characterization of an increasing number of glycosyltransferases that are well suited for HMO synthesis and the advances in the development of enzymatic and chemoenzymatic methods, the number of HMOs that have been produced in preparative and multi-gram-scales is increasing. HMO product purification remains a bottleneck to quick accessing HMOs in even multi-gram quantities. Continuous innovation is necessary to overcome the challenges for large-scale production of HMOs.
Described herein are compounds of Formula I:
or a salt thereof, wherein R1 is selected from the group consisting of benzyl and fluorenylmethyl; R13 is selected from the group consisting of H and fucosyl; R26 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R32 is selected from the group consisting of H, acetyl, trifluoroacetyl, Boc, Fmoc; R36 is selected from the group consisting of H and N-acetylneuraminic acid; R33 and R34 are independently selected from the group consisting of H, fucosyl, and a galactosyl moiety A:
R42 is selected from the group consisting of H and fucosyl; R46 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R43 is selected from the group consisting of a H, a monosaccharide, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; provided that: (i) at least one of R33 and R34 is fucosyl or galactosyl moiety A; (ii) at least one of R42 and R43 is a monosaccharide or an oligosaccharide when (a) R26, R36, and R46 are H, and (b) R33 is galactosyl moiety A and R34 is H or fucosyl, or R34 is galactosyl moiety A and R33 is H or fucosyl; and (iii) R43 is a monosaccharide or an oligosaccharide when R26, R36, and R46 are H, R42 is fucosyl, R33 is galactosyl moiety A, and R34 is H.
Also described are compounds according to Formula II:
or a salt thereof, wherein: R1 is selected from the group consisting of benzyl and fluorenylmethyl; R13 is selected from the group consisting of H and fucosyl; R26 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R32 is selected from the group consisting of H, acetyl, trifluoroacetyl, Boc, Fmoc; R36 is selected from the group consisting of H and N-acetylneuraminic acid; R33 and R34 are independently selected from the group consisting of H, fucosyl, and a galactosyl moiety A:
R42 is selected from the group consisting of H and fucosyl; R46 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R43 is selected from the group consisting of a H, a monosaccharide, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid.
Also described are compounds according to Formula III:
or a salt thereof, wherein: each R2 is is independently selected from the group consisting of C1-6 alkyl and optionally substituted C6-14 aryl, or two R2 are taken together to form a C1-6 alkylene diradical; R13 is selected from the group consisting of H and fucosyl; R26 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R32 is selected from the group consisting of H, acetyl, trifluoroacetyl, Boc, Fmoc; R36 is selected from the group consisting of H and N-acetylneuraminic acid; R33 and R34 are independently selected from the group consisting of H, fucosyl, and a galactosyl moiety A:
R42 is selected from the group consisting of H and fucosyl; R46 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R43 is selected from the group consisting of a H, a monosaccharide, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid.
Methods for making and using compounds according to Formulas I, II and III and are also described herein.
Also described herein is an hST6GALNAC V variant comprising a polypeptide having at least 80% identity to the amino acid sequence of wild-type hST6GALNAC V. Optionally, the hST6GALNAC V variant comprises a mutation at one or more positions corresponding to G82, A84, V99, H107, G132, S140, Y176, Q184, H190, L191, A201, H207, Q218, T239, F275, and N306 in wild-type hST6GALNAC V. In some cases, the N-terminus of the polypeptide is fused to a maltose binding protein. In some cases, the C-terminus of the polypeptide is fused to a His6 peptide. Further described herein are hST6GALNAC V variants comprising a polypeptide having at least 80% identity to the amino acid sequence set forth in SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, or SEQ ID NO: 8. Further described herein is a polynucleotide encoding a hST6GALNAC V variant as described herein. A host cell comprising the polynucleotide is also provided herein. Additionally provided herein is a reaction mixture comprising an hST6GALNAC V variant as described herein.
Further provided herein is an hGCNT2-B variant comprising a polypeptide having at least 80% identity to the amino acid sequence set forth in SEQ ID NO: 10. Also provided herein is a polynucleotide encoding an hGCNT2-B variant as described herein. A host cell comprising the polynucleotide is also provided herein. Additionally provided herein is a reaction mixture comprising an hGCNT2-B variant as described herein.
The details of one or more embodiments are set forth in the drawings and the description below. Other features, objects, and advantages will be apparent from the description and drawings, and from the claims.
Innovation in process development is essential for applying biocatalysis in industrial and laboratory productions of organic compounds, including beneficial carbohydrates such as human milk oligosaccharides (HMOs). HMOs have attracted increasing attention for exploring their potential applications as key ingredients in products that can improve human health. As described in detail below, strategies that combine substrate and process engineering have now been developed to access HMOs via biocatalysis in an efficient manner. The strategies, referred to herein as multistep one-pot multienzyme (MSOPME) and stepwise one-pot multienzyme (StOPMe) processes, allow the access to target HMOs using a single C18-cartridge purification. Their efficiency has been demonstrated in high-yield (71-92%) synthesis of more than fifty tagged HMOs including fucosylated and/or sialylated HMOs short-chain and long-chain oligosaccharides up to nonaoses in one-pot from lactoside without the isolation of the intermediate oligosaccharides. Gram-scale synthesis of an important HMO, lacto-N-fucopentaose-I (LNFP-I), succeeded with 84% yield. Tag removal was achieved in high efficiency (90-97%) without the need for column purification to produce the desired natural HMOs with a free reducing end. The method can be readily adapted for automation to allow quick access of HMOs as well as other glycans and glycoconjugates.
The development of a glycosyltransferase acceptor substrate tagging strategy for highly efficient one-pot multienzyme (OPME) synthesis of sialosides allows facile purification of the more complex glycoside product from its reaction mixture in a single C18-cartridge-based process. The carboxybenzyl (Cbz) group is easy to install from an inexpensive commercially available material in a protection group-free fashion and is readily removable from the product to form oligosaccharides with a free reducing end. Described herein is the development of strategies that combine process engineering with the substrate-tagging (substrate engineering) and OPME chemoenzymatic synthetic systems for facile chemoenzymatic synthesis and purification of HMO targets including those with fucose and/or sialic acid those with long chain structures. Examples for preparative-scale synthesis of targets up to nonaoses are provided herein and gram-scale synthesis is demonstrated for lacto-N-fucopentaose-I (LNFP-I). The methods can be readily adapted for automation and is suitable for introducing isotope-labeled or structurally modified monosaccharides for the formation of HMOs, other glycans and glycoconjugates, as well as their derivatives.
As shown in
To further improve the efficiency, Stepwise One-Pot Multienzyme (StOPMe) strategy with pre-generation of sugar nucleotides is provided herein. In this strategy, sugar nucleotides including uridine 5′-diphosphate-N-acetylglucosamine (UDP-GlcNAc), uridine 5′-diphosphate-galatose (UDP-Gal), and guanosine 5′-diphosphate-L-fucose (GDP-Fuc) are pregenerated using sugar activation (SA) systems before stepwise adding of glycosyltransferases and inactivation of enzymes between glycosyltransferase-catalyzed reaction steps. Compared to MSOPME approach, the StOPMe strategy shortens the glycosyltransferase-catalyzed reaction times and minimize the lost of glycosyltransferase activity during extended reaction time. Enzyme Assembly Synthetic Maps (EASyMaps) are designed to guide the synthesis of these HMOs in a target orientated systematic manner.
As used herein, the term “monosaccharide” refers to a sugar having a six-membered carbon backbone (i.e., a hexose) or a nine-carbon backbone (i.e., a nonulosonic acid). Examples of monosaccharides include, but are not limited to, glucose (Glc), galactose (Gal), L-fucose (Fuc), mannose (Man), glucuronic acid (GlcA), iduronic acid (IdoA), and sialic acids such as N-acetylneuraminic acid (Neu5Ac), N-glycoylneuraminic acid (Neu5Gc), 3-keto-3-deoxy-D-glycero-D-galacto-nononic acid (Kdn), and others. Monosaccharides also include hexoses and nonulosonic acid substituted with hydroxy groups, oxo groups, amino groups, acetamido groups, and other functional groups. “Deoxy” monosaccharides refer to monosaccharides having carbon atoms one or more carbon atoms in the hexose backbone having only hydrogen substituents. Monosaccharides also include, but are not limited to, glucosamine (2-amino-2-deoxy-glucose; GlcN), N-acetylglucosamine (2-acetamido-2-deoxy-glucose; GlcNAc), galactosamine (2-amino-2-deoxy-galactose; GalN), N-acetylgalactosamine (2-acetamido-2-deoxy-galactose; GalNAc), mannosamine (2-amino-2-deoxy-mannose; ManN), N-acetylmannosamine (2-acetamido-2-deoxy-mannose; ManNAc), neuraminic acid (Neu), N-trifluoroacetyl neuraminic acid (Neu5AcTFA), and others.
As used herein, the term “oligosaccharide” refers to a compound containing at least two sugars covalently linked together. Oligosaccharides include disaccharides, trisaccharides, tetrasaccharides, pentasaccharides, hexasaccharides, heptasaccharides, octasaccharides, and the like. Covalent linkages for linking sugars generally consist of glycosidic linkages (i.e., C—O—C bonds) formed from the hydroxyl groups of adjacent sugars. Linkages can occur between the 1-carbon (the anomeric carbon) and the 4-carbon of adjacent sugars (i.e., a 1-4 linkage), the 1-carbon (the anomeric carbon) and the 3-carbon of adjacent sugars (i.e., a 1-3 linkage), the 1-carbon (the anomeric carbon) and the 6-carbon of adjacent sugars (i.e., a 1-6 linkage), or the 1-carbon (the anomeric carbon) and the 2-carbon of adjacent sugars (i.e., a 1-2 linkage). A sugar can be linked within an oligosaccharide such that the anomeric carbon is in the α- or β-configuration. The oligosaccharides prepared according to the methods of the invention can also include linkages between carbon atoms other than the 1-, 2-, 3-, 4-, and 6-carbons.
As used herein, the term “isomer” refers to a compound having the same bond structure as a reference compound but having a different three-dimensional arrangement of the bonds. An isomer can be, for example, an enantiomer or a diastereomer.
As used herein, the term “glycoside” refers to a saccharide compound having a moiety “—OR” replacing a hydroxyl group of the parent compound, wherein R is another saccharide (e.g., a monosaccharide, oligosaccharide, or polysaccharide) or a non-saccharide moiety (e.g., a lipid, a protein, a peptide, a linker moiety, a label moiety, etc.). In some embodiments, the moiety —OR in the glycoside replaces the hydroxyl group of the anomeric carbon at the reducing end of the parent saccharide.
A “galactoside” refers to a galactopyranose moiety or a galactofuranose moiety wherein one of the hydroxyl groups of the parent compound is replaced with a moiety —OR as described above. Galactosides include, for example, lactosides (i.e., β-D-galactopyranosyl-(1→4)-D-glucopyranoses).
A “sialoside” refers to a sialic acid moiety wherein one of the hydroxyl groups of the parent compound is replaced with a moiety —OR as described above. Sialic acid is a general term for N- and O-substituted derivatives of neuraminic acid, and includes, but is not limited to, N-acetyl (Neu5Ac) or N-glycolyl (Neu5Gc) substitutions, as well as O-substitutions including acetyl, lactyl, methyl, sulfate and phosphate, among others.
As used herein, the term “alkyl” refers to a straight or branched, saturated, aliphatic radical having the number of carbon atoms indicated. Alkyl can include any number of carbons, such as C1-2, C1-3, C1-4, C1-5, C1-6, C1-7, C1-8, C1-9, C1-10, C2-3, C2-4, C2-5, C2-6, C3-4, C3-5, C3-6, C4-5, C4-6 and C5-6. For example, C1-6 alkyl includes, but is not limited to, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, sec-butyl, tert-butyl, pentyl, isopentyl, hexyl, etc. Alkyl can also refer to alkyl groups having up to 20 carbons atoms, such as, but not limited to heptyl, octyl, nonyl, decyl, etc. The term “alkylene” refers to a divalent alkyl radical, wherein the two points of attachment to the diradical are on the same carbon atom or different carbon atoms.
As used herein, the terms “halo” and “halogen” refer to a fluorine, chlorine, bromine, or iodine atom.
As used herein, the term “CMP-sialic acid synthetase” refers to a polypeptide that catalyzes the synthesis of cytidine monophosphate sialic acid (CMP-sialic acid) from cytidine triphosphate (CTP) and sialic acid.
As used herein, the term “sialyltransferase” refers to an enzyme that catalyzes the transfer of a sialic acid to a monosaccharide, an oligosaccharide, or another glycosylated molecule.
As used herein, the term “sialidase” refers to an enzyme that catalyzes the cleavage of a terminal sialic acid from a sialylated target such as an oligosaccharide, a polysaccharide, or a glycosylated protein.
The term “variant,” or “mutant,” in the context of the enzymes in the present disclosure, means a polypeptide, typically recombinant, that comprises one or more amino acid substitutions relative to a corresponding, naturally-occurring or unmodified enzyme.
The term “amino acid” refers to any monomeric unit that can be incorporated into a peptide, polypeptide, or protein. Amino acids include naturally-occurring α-amino acids and their stereoisomers, as well as unnatural (non-naturally occurring) amino acids and their stereoisomers. “Stereoisomers” of a given amino acid refer to isomers having the same molecular formula and intramolecular bonds but different three-dimensional arrangements of bonds and atoms (e.g., an L-amino acid and the corresponding D-amino acid).
Naturally-occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, γ-carboxyglutamate and O-phosphoserine. Naturally-occurring α-amino acids include, without limitation, alanine (Ala), cysteine (Cys), aspartic acid (Asp), glutamic acid (Glu), phenylalanine (Phe), glycine (Gly), histidine (His), isoleucine (Ile), arginine (Arg), lysine (Lys), leucine (Leu), methionine (Met), asparagine (Asn), proline (Pro), glutamine (Gln), serine (Ser), threonine (Thr), valine (Val), tryptophan (Trp), tyrosine (Tyr), and combinations thereof. Stereoisomers of a naturally-occurring α-amino acids include, without limitation, D-alanine (D-Ala), D-cysteine (D-Cys), D-aspartic acid (D-Asp), D-glutamic acid (D-Glu), D-phenylalanine (D-Phe), D-histidine (D-His), D-isoleucine (D-Ile), D-arginine (D-Arg), D-lysine (D-Lys), D-leucine (D-Leu), D-methionine (D-Met), D-asparagine (D-Asn), D-proline (D-Pro), D-glutamine (D-Gln), D-serine (D-Ser), D-threonine (D-Thr), D-valine (D-Val), D-tryptophan (D-Trp), D-tyrosine (D-Tyr), and combinations thereof.
Unnatural (non-naturally occurring) amino acids include, without limitation, amino acid analogs, amino acid mimetics, synthetic amino acids, N-substituted glycines, and N-methyl amino acids in either the L- or D-configuration that function in a manner similar to the naturally-occurring amino acids. For example, “amino acid analogs” can be unnatural amino acids that have the same basic chemical structure as naturally-occurring amino acids (i.e., a carbon that is bonded to a hydrogen, a carboxyl group, an amino group) but have modified side-chain groups or modified peptide backbones, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. “Amino acid mimetics” refer to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally-occurring amino acid.
Amino acids may be referred to herein by either the commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, as described herein, may also be referred to by their commonly accepted single-letter codes.
With respect to amino acid sequences, one of skill in the art will recognize that individual substitutions, additions, or deletions to a peptide, polypeptide, or protein sequence which alters, adds, or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid. The chemically similar amino acid includes, without limitation, a naturally-occurring amino acid such as an L-amino acid, a stereoisomer of a naturally occurring amino acid such as a D-amino acid, and an unnatural amino acid such as an amino acid analog, amino acid mimetic, synthetic amino acid, N-substituted glycine, and N-methyl amino acid.
As used herein, the term “forming a reaction mixture” refers to the process of bringing into contact at least two distinct species such that they mix together and can react, either modifying one of the initial reactants or forming a third distinct species, i.e., a product. It should be appreciated, however, the resulting reaction product can be produced directly from a reaction between the added reagents or from an intermediate from one or more of the added reagents which can be produced in the reaction mixture.
The terms “expression” and “expressed” in the context of a gene refer to the transcriptional and/or translational product of the gene. The level of expression of a DNA molecule in a cell may be determined on the basis of either the amount of corresponding mRNA that is present within the cell or the amount of protein encoded by that DNA produced by the cell.
The term “recombinant” when used with reference, e.g., to a cell, or nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic acid or protein, or that the cell is derived from a cell so modified. For example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under-expressed, or not expressed at all.
The terms “amino acid modification” and “amino acid alteration” refer to a substitution, a deletion, or an insertion of one or more amino acids. For example, substitutions may be made wherein an aliphatic amino acid (e.g., G, A, I, L, or V) is substituted with another member of the group. Similarly, an aliphatic polar-uncharged group such as C, S, T, M, N, or Q, may be substituted with another member of the group; and basic residues, e.g., K, R, or H, may be substituted for one another. In some embodiments, an amino acid with an acidic side chain, e.g., E or D, may be substituted with its uncharged counterpart, e.g., Q or N, respectively; or vice versa. Each of the following eight groups contains exemplary amino acids that are conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)).
The terms “nucleic acid,” “nucleotide,” and “polynucleotide” refer to deoxyribonucleic acids (DNA) or ribonucleic acids (RNA) and polymers. The term includes, but is not limited to, single-, double-, or multi-stranded DNA or RNA, genomic DNA, cDNA, and DNA-RNA hybrids, as well as other polymers comprising purine and/or pyrimidine bases or other natural, chemically modified, biochemically modified, non-natural, synthetic, or derivatized nucleotide bases. Unless specifically limited, the term encompasses nucleic acids containing known analogs of natural nucleotides that have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), orthologs, and complementary sequences as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J Biol. Chem. 260:2605-2608 (1985); and Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)).
The terms “nucleotide sequence encoding a peptide” and “gene” refer to the segment of DNA involved in producing a peptide chain. In addition, a gene will generally include regions preceding and following the coding region (leader and trailer) involved in the transcription/translation of the gene product and the regulation of the transcription/translation. A gene can also include intervening sequences (introns) between individual coding segments (exons). Leaders, trailers, and introns can include regulatory elements that are necessary during the transcription and the translation of a gene (e.g., promoters, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions, etc.). A “gene product” can refer to either the mRNA or protein expressed from a particular gene.
“Percentage of sequence identity” is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the sequence (e.g., a peptide of the invention) in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence which does not comprise additions or deletions, for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
“Identical” and “identity,” in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same. Sequences are “substantially identical” to each other if they have a specified percentage of nucleotides or amino acid residues that are the same (e.g., at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. These definitions also refer to the complement of a nucleic acid test sequence.
“Similarity” and “percent similarity,” in the context of two or more polypeptide sequences, refer to two or more sequences or subsequences that have a specified percentage of amino acid residues that are either the same or similar as defined by a conservative amino acid substitutions (e.g., at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% similar over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Sequences are “substantially similar” to each other if, for example, they are at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, or at least 55% similar to each other.
For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters. For sequence comparison of nucleic acids and proteins, the BLAST and BLAST 2.0 algorithms and the default parameters discussed below are used.
Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted, which are useful for determining percent sequence identity and sequence similarity, are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al., (1990) J Mol. Biol. 215: 403-410 and Altschul et al. (1977) Nucleic Acids Res. 25: 3389-3402, respectively. Software for performing BLAST analyses is publicly available at the National Center for Biotechnology Information website, ncbi.nlm.nih.gov. The algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a word size (W) of 28, an expectation (E) of 10, M=1, N=−2, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a word size (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see, e.g., Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)).
The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul, Proc. Nat. Acad. Sci. USA, 90: 5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01, and most preferably less than about 0.001.
An indication that two nucleic acid sequences or peptides are substantially identical is that the peptide encoded by the first nucleic acid is immunologically cross reactive with the antibodies raised against the peptide encoded by the second nucleic acid. Thus, a peptide is typically substantially identical to a second peptide, for example, where the two peptides differ only by conservative substitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described below. Yet another indication that two nucleic acid sequences are substantially identical is that the same primers can be used to amplify the sequence.
The terms “transfection” and “transfected” refer to introduction of a nucleic acid into a cell by non-viral or viral-based methods. The nucleic acid molecules may be gene sequences encoding complete proteins or functional portions thereof. See, e.g., Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 18.1-18.88.
Expression of a transfected gene can occur transiently or stably in a cell. During “transient expression” the transfected gene is not transferred to the daughter cell during cell division. Since its expression is restricted to the transfected cell, expression of the gene is lost over time. In contrast, stable expression of a transfected gene can occur when the gene is co-transfected with another gene that confers a selection advantage to the transfected cell. Such a selection advantage may be a resistance towards a certain toxin that is presented to the cell.
The term “promoter,” as used herein, refers to a polynucleotide sequence capable of driving transcription of a coding sequence in a cell. Thus, promoters used in the polynucleotide constructs of the invention include cis-acting transcriptional control elements and regulatory sequences that are involved in regulating or modulating the timing and/or rate of transcription of a gene. For example, a promoter can be a cis-acting transcriptional control element, including an enhancer, a promoter, a transcription terminator, an origin of replication, a chromosomal integration sequence, 5′ and 3′ untranslated regions, or an intronic sequence, which are involved in transcriptional regulation. These cis-acting sequences typically interact with proteins or other biomolecules to carry out (turn on/off, regulate, modulate, etc.) gene transcription. A “constitutive promoter” is one that is capable of initiating transcription in nearly all tissue types, whereas a “tissue-specific promoter” initiates transcription only in one or a few particular tissue types. An “inducible promoter” is one that initiates transcription only under particular environmental conditions or developmental conditions.
A polynucleotide/polypeptide sequence is “heterologous” to an organism or a second polynucleotide/polypeptide sequence if it originates from a different species, or, if from the same species, is modified from its original form. For example, when a promoter is said to be operably linked to a heterologous coding sequence, it means that the coding sequence is derived from one species whereas the promoter sequence is derived another, different species; or, if both are derived from the same species, the coding sequence is not naturally associated with the promoter (e.g., is a genetically engineered coding sequence, e.g., from a different gene in the same species, or an allele from a different ecotype or variety).
An “expression cassette” refers to a nucleic acid construct, which when introduced into a host cell, results in transcription and/or translation of a RNA or polypeptide, respectively. Antisense constructs or sense constructs that are not or cannot be translated are expressly included by this definition. One of skill will recognize that the inserted polynucleotide sequence need not be identical, but may be only substantially similar to a sequence of the gene from which it was derived.
The terms “vector” and “recombinant expression vector” refer to a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular polynucleotide sequence in a host cell. An expression vector may be part of a plasmid, viral genome, or nucleic acid fragment. Typically, an expression vector includes a polynucleotide to be transcribed, operably linked to a promoter. Nucleic acid or amino acid sequences are “operably linked” (or “operatively linked”) when placed into a functional relationship with one another. For instance, a promoter or enhancer is operably linked to a coding sequence if it regulates, or contributes to the modulation of, the transcription of the coding sequence. Operably linked DNA sequences are typically contiguous, and operably linked amino acid sequences are typically contiguous and in the same reading frame. However, since enhancers generally function when separated from the promoter by up to several kilobases or more and intronic sequences may be of variable lengths, some polynucleotide elements may be operably linked but not contiguous. Similarly, certain amino acid sequences that are non-contiguous in a primary polypeptide sequence may nonetheless be operably linked due to, for example folding of a polypeptide chain.
As used herein, the term “glycosyltransferase” refers to a polypeptide that catalyzes the formation of an oligosaccharide from a nucleotide-sugar an acceptor sugar. Nucleotide-sugars include, but are not limited to, nucleotide diphosphate sugars (NDP-sugars) and nucleotide monophosphate sugars (NMP-sugars) such as a cytidine monophosphate sugar (CMP-sugar). In general, a glycosyltransferase catalyzes the transfer of the monosaccharide moiety of an NDP-sugar or CMP-sugar to a hydroxyl group of the acceptor sugar. The covalent linkage between the monosaccharide and the acceptor sugar can be a 1-3 linkage, a 1-4 linkage, a 1-6-linkage, a 1-2 linkage, a 2-3 linkage, a 2-6 linkage, a 2-8 linkage, or a 2-9 linkage as described above. The linkage may be in the α- or β-configuration with respect to the anomeric carbon of the monosaccharide. Other types of linkages may be formed by the glycosyltransferases in the methods of the invention. Glycosyltransferases include, but are not limited to, heparosan synthases (HSs), glucosaminyltransferases, N-acetylglucosaminyltransferases, glucosyltransferasess, glucuronyltransferases, and sialyltransferases.
As used herein, the term “oligosaccharide” refers to a compound containing at least two monosaccharides covalently linked together. Oligosaccharides include disaccharides, trisaccharides, tetrasaccharides, pentasaccharides, hexasaccharides, heptasaccharides, octasaccharides, and the like. Covalent linkages generally consist of glycosidic linkages (i.e., C—O—C bonds) formed from the hydroxyl groups of adjacent sugars. Linkages can occur between the 1-carbon and the 4-carbon of adjacent sugars (i.e., a 1-4 linkage), the 1-carbon and the 3-carbon of adjacent sugars (i.e., a 1-3 linkage), the 1-carbon and the 6-carbon of adjacent sugars (i.e., a 1-6 linkage), or the 1-carbon and the 2-carbon of adjacent sugars (i.e., a 1-2 linkage). Linkages can occur between the 2-carbon and the 3-carbon of adjacent sugars (i.e., a 2-3 linkage), the 2-carbon and the 6-carbon of adjacent sugars (i.e., a 2-6 linkage), the 2-carbon and the 8-carbon of adjacent sugars (i.e., a 2-8 linkage), or the 2-carbon and the 9-carbon of adjacent sugars (i.e., a 2-9 linkage). A sugar can be linked within an oligosaccharide such that the anomeric carbon is in the α- or β-configuration. The oligosaccharides prepared according to the methods of the invention can also include linkages between carbon atoms other than the 1-, 2-, 3-, 4-, and 6-carbons or the 2-, 3-, 6-, 8-, and 9-carbons.
“Acceptor glycoside” or “glycosylation acceptor” refers to a substance (e.g., a glycosylated amino acid, a glycosylated protein, an oligosaccharide, or a polysaccharide) containing a sphingosine moiety that accepts a sugar moiety from a donor substrate.
As used herein, the term “kinase” refers to a polypeptide that catalyzes the covalent addition of a phosphate group to a substrate. The substrate for a kinase used in the methods of the invention is generally a sugar as defined above, and a phosphate group is added to the anomeric carbon (i.e. the “1,” position) of the sugar. The product of the reaction is a sugar-1-phosphate. Kinases include, but are not limited to, N-acetylhexosamine 1-kinases (NahKs), glucuronokinases (GlcAKs), glucokinases (GlcKs), galactokinases (GalKs), monosaccharide-1-kinases, and xylulokinases. Certain kinases utilize nucleotide triphosphates, including adenosine-5′-triphosphate (ATP) as substrates.
As used herein, the term “dehydrogenase” refers to a polypeptide that catalyzes the oxidation of a primary alcohol. In general, the dehyrogenases used in the methods of the invention convert the hydroxymethyl group of a hexose (i.e. the C6-OH moiety) to a carboxylic acid. Dehydrogenases useful in the methods of the invention include, but are not limited to, UDP-glucose dehydrogenases (Ugds).
As used herein, the term “nucleotide-sugar pyrophosphorylase” refers to a polypeptide that catalyzes the conversion of a sugar-1-phosphate to a UDP-sugar. In general, a uridine-5′-monophosphate moiety is transferred from uridine-5′-triphosphate to the sugar-1-phosphate to form the UDP-sugar. Examples of nucleotide-sugar pyrophosphorylases include glucosamine uridylyltransferases (GlmUs) and glucose-1-phosphate uridylyltransferases (GalUs). Nucleotide-sugar pyrophosphorylases also include promiscuous UDP-sugar pyrophosphorylases, termed “USPs,” that can catalyze the conversion of various sugar-1-phosphates to UDP-sugars including UDP-Glc, UDP-GlcNAc, UDP-GlcNH2, UDP-Gal, UDP-GalNAc, UDP-GalNH2, UDP-Man, UDP-ManNAc, UDP-ManNH2, UDP-GlcA, UDP-IdoA, UDP-GalA, and their substituted analogs.
As used herein, the term “pyrophosphatase” (abbreviated as PpA) refers to a polypeptide that catalyzes the conversion of pyrophosphate (i.e., P2O74−, HP2O73−, H2P2O72−, H3P2O7−) to two molar equivalents of inorganic phosphate (i.e., PO43−, HPO42−, H2PO4−).
An amino acid residue “corresponding to an amino acid residue [X] in [specified sequence,” or an amino acid substitution “corresponding to an amino acid substitution [X] in [specified sequence]” refers to an amino acid in a polypeptide of interest that aligns with the equivalent amino acid of a specified sequence. Generally, as described herein, the amino acid corresponding to a position of a specified polypeptide sequence can be determined using an alignment algorithm such as BLAST.
Human milk oligosaccharides (HMOs) are the third major component of human milk. They contribute significantly to the health benefit of breast feeding. The potential of applying HMOs in the development of infant formula additives and therapeutics for treating infant and adult diseases is continuously explored. In addition to the challenges in quantitative analysis of HMOs, difficulties to access HMOs (especially in their structurally pure individual forms) in sufficient amounts hindered the progress of related research and practical applications of HMOs.
Accordingly, one aspect of the disclosure provides compounds of Formula I:
Accordingly, another aspect of the disclosure provides compounds of Formula I:
In some embodiments, compounds according to Formula I are provided as described above wherein R1 is benzyl.
In some embodiments, compounds according to Formula I are provided as described above wherein R33 is galactosyl moiety A.
In some embodiments, compounds according to Formula I are provided as described above wherein R42 is H and R43 is a monosaccharide or an oligosaccharide.
In some embodiments, compounds according to Formula I are provided as described above wherein R13 is H. In some embodiments, R13 is fucosyl.
In some embodiments, compounds according to Formula I are provided as described above wherein the compound according to Formula I is Galβ4GlcβNHCbz, GlcNAcβ3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcβNHCbz, GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ3GlcNAcβ3Galβ4GlcβNHCbz, Fucα2GalβGlcNAcβ3Galβ4GlcβNHCbz, GalβGlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Fucα2Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα3Gal β4GlcNAcβ3Galβ4GlcNAcβ3Gal β4GlcβNHCbz, Neu5Acα6Gal β4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, GalβGlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Fucα2GalβGlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Galβ(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, GalβGlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Fucα2GalβGlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Fucα2Galβ(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Fucα2GalβGlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ(Fucα4)GlcNAcβ3Galβ4GlcβNHCbz, Fucα2Galβ(Fucα4)GlcNAcβ3Galβ4GlcβNHCbz, Fucα2Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Fucα2Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Fucα2Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4Galβ1βNHCbz, Fucα2Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα3GalβGlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα6Gal β3GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα3GalβGlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα6Gal β3GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα3Galβ(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα3Galβ(Fucα4)GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα6Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα6Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα3Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz, Galβ(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz, Fucα2Galβ(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα3Galβ(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz, Neu5Acα3Galβ(Neu5Acα6)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Neu5Acα3Galβ3(Neu5Acα6) (Fucα4)GlcNAcβ3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3(GlcNAcβ6)Galβ4GlcβNHCbz Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, or any salt thereof.
In some embodiments, compounds according to Formula I are provided as described above wherein the compound according to Formula I is GlcNHTFAβ3Galβ4GlcβNHCbz, GlcNH2Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcNH2β3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcNHBocβ3Galβ4GlcβNHCbz, Galβ4GlcNAcβ3Galβ4GlcNHFmocβ3Galβ4GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNHBocβ3Galβ4(Fucα3)GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNHFmocβ3Galβ4(Fucα3)GlcβNHCbz, Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, GalβGlcNAcβ3Galβ4GlcNH2β3Galβ4GlcβNHCbz, GalβGlcNAcβ3Galβ4GlcNHBocβ3Galβ4GlcβNHCbz, GalβGlcNAcβ3Galβ4GlcNHFmocβ3Galβ4GlcβNHCbz, Galβ(Fucα4)GlcNAcβ3Galβ4GlcNHBocβ3Galβ4(Fucα3)GlcβNHCbz, Galβ(Fucα4)GlcNAcβ3Galβ4GlcNHFmocβ3Galβ4(Fucα3)GlcβNHCbz, Galβ(Fucα4)GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, or any salt thereof.
Another aspect of the disclosure provides compounds of Formula II:
In some embodiments, compounds according to Formula II are provided as described above wherein:
In some embodiments, compounds according to Formula II are provided as described above wherein R1 is benzyl.
In some embodiments, compounds according to Formula II are provided as described above wherein R33 is galactosyl moiety A.
In some embodiments, compounds according to Formula II are provided as described above wherein R42 is H and R43 is a monosaccharide or an oligosaccharide.
In some embodiments, compounds according to Formula II are provided as described above wherein R13 is H. In some embodiments, R13 is fucosyl.
Another aspect of the disclosure provides compounds of Formula III:
In some embodiments, compounds according to Formula III are provided as described above wherein R1 is benzyl.
In some embodiments, compounds according to Formula III are provided as described above wherein R33 is galactosyl moiety A.
In some embodiments, compounds according to Formula III are provided as described above wherein R42 is H and R43 is a monosaccharide or an oligosaccharide.
In some embodiments, compounds according to Formula III are provided as described above wherein R13 is H. In some embodiments, R13 is fucosyl.
Described herein, in some embodiments, are HMO compounds having a tetraose core (e.g., HMO compounds of Formula I). Human milk oligosaccharides (HMOs) containing a tetraose core such as lacto-N-tetraose (LNT), lacto-N-neotetraose (LNnT), lacto-N-fucopentaose I (LNFP-I) and II (LNFP-II), lacto-N-difuco-hexaose-I (LNDFH-I), and disialyllacto-N-tetraose (DSLNT) are among the top ten most abundant HMOs in human milk. Different from animals which may contain both N-acetylneuraminic acid (Neu5Ac) and N-glycolylneuraminic acid (Neu5Gc) as the sialic acid forms in their milk, human milk contains only Neu5Ac as the sialic acid form. Overall, a total of thirteen LNT-based and six LNnT-based HMOs have been reported.
Systematic synthesis of a complete library of HMOs containing a tetraose core such as LNT or LNnT had not been previously achieved. The highly efficient methods presented herein allow access, in a systematic manner, to the complete library of tetraose-core-based HMOs including those with different combination of fucosylation and/or sialylation. Described herein is the synthesis of these HMOs in a systematic target orientated manner using a Stepwise One-Pot Multienzyme (StOPMe) strategy with pre-generation of sugar nucleotides. Enzyme Assembly Synthetic Maps (EASyMaps) are designed to guide the synthesis. The challenging Neu5Acα2-6GlcNAc-containing HMOs including DSLNT and a complex disialyl Lewis a structure are accessed using E. coli-expressed human α2-6-sialyltransferase (hST6GALNAC V) and mutants thereof with improved activity.
Theoretically, a total of 33 HMOs (
In some embodiments, compounds described herein are HMOs with an LNT or LNnT core. In some embodiments, compounds described herein have a neutral tetraose core, as shown in
Although LNnT-based HMOs with a fucose α1-2-linked to the terminal Gal have not been previously identified or isolated, they are possible products of appropriate fucosyltransferases. Therefore, there are a total of sixteen possible neutral HMOs containing a tetraose core of either LNT or LNnT (
In some embodiments, HMO compounds described herein contain an LNT or an LNnT core, wherein the HMO compounds are negatively charged and monosialylated, and wherein the HMO compounds contain a Neu5Ac α2-3 (
In some embodiments, HMO compounds provided herein are LDNF-IIIβNHCbz, LNFP-VβNHCbz, LNDFH-IIβNHCbz, LNFP-IIβNHCbz, LNDFH-IβNHCbz, LNTβNHCbz, LNFP-IβNHCbz, or LNTFHepβNHCbz.
In some embodiments, HMO compounds provided herein are LNnFP-VβNHCbz, LNnDFH-IIIβNHCbz, LNnDFH-IIβNHCbz, LNFP-IIIβNHCbz, LNnDFH-IβNHCbz, LNnTβNHCbz, LNnFP-IβNHCbz, or LNnTFHepβNHCbz.
In some embodiments, the HMO compounds provided herein with a tetraose core may be sialylated. In some embodiments, the sialylated HMO compounds provided herein are Neu5Acα3LNTβNHCbz(LSTaβNHCbz), Neu5Acα6LNTβNHCbz, Neu5Acα3LNFP-VβNHCbz, Neu5Acα6LNFP-VβNHCbz, Neu5Acα3LNDFH-IIβNHCbz, Neu5Acα3LNFP-IIβNHCbz(S-LNF II or F-LSTa), Neu5Acα3LNnTβNHCbz, Neu5Acα6LNnTβNHCbz (LSTcβNHCbz), Neu5Acα3LNnFP-VβNHCbz, Neu5Acα6LNnFP-VβNHCbz(F-LSTcβNHCbz), Neu5Acα3LNnDFH-IIβNHCbz, or Neu5Acα3LNFP-IIIβNHCbz.
In some embodiments, the HMO compounds provided herein are negatively charged with an LNT core with a Neu5Acα2-6GlcNAc linkage. In some embodiments, the HMO compounds are LSTbβNHCbz, F-LSTbβNHCbz, DSLNTβNHCbz, FDS-LNT-IIβNHCbz, or FDS-LNT IβNHCbz.
In some embodiments, any one of the compounds provided herein may be obtained in isolated form. An isolated HMO compound as provided herein may be between 70% and 100% pure (e.g., 70%, 75%, 80%, 85%, 90%, 95%, or 100% purity).
In some embodiments, the compounds described herein are obtained in at least 50% yield and up to quantitative (100%) yield (e.g., 50% to 99%, 60% to 95%, 70% to 90% or 80 to 85% yield).
Described herein are multistep one-pot multienzyme (MSOPME) and stepwise one-pot multienzyme (StOPMe) strategies for enzymatic synthesis of HMO compounds from precursor materials, e.g., from monosaccharides and oligosaccharides. Monosaccharides may include, but are not limited to, GlcNAc, Gal, L-Fuc, Neu5Ac, ManNAc, Neu5Gc, ManNGc, mannose, and Kdn. Oligosaccharides may include, but are not limited to, Lac and LacβOR. As used herein, “one-pot” refers to performing one or multiple steps in the same reaction vessel. Optionally, the methods provided herein are performed without the purification of intermediate oligosaccharides. The methods described herein provide quick access to a variety of HMO compounds and derivatives.
For example, the methods and enzymes described herein can be applied to synthesizing a variety of HMO compounds, including short-chain and long-chain as well as linear and branched HMOs and their sialylated and/or fucosylated derivatives.
Accordingly, one aspect of the disclosure provides a method for preparing a compound according to Formula I, or salt thereof, the method comprising: forming a reaction mixture containing a protected lactosylamine; one or more monosaccharides selected from the group consisting of galactose, fucose, N-acetylglucosamine, and N-acetylneuraminic acid; one or more monosaccharide sugar activation enzymes; one or more nucleotide triphosphates; and one or more glycosyltransferases; and incubating the reaction mixture to form the compound according to Formula I;
wherein: R1 is selected from the group consisting of benzyl and fluorenylmethyl; R13 is selected from the group consisting of H and fucosyl; R26 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R32 is selected from the group consisting of H, acetyl, trifluoroacetyl, Boc, Fmoc; R36 is selected from the group consisting of H and N-acetylneuraminic acid; R33 and R34 are independently selected from the group consisting of H, fucosyl, and a galactosyl moiety A:
R42 is selected from the group consisting of H and fucosyl; R46 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R43 is selected from the group consisting of a H, a monosaccharide, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid.
In some embodiments, the the reaction mixture of the method for preparing a compound according to Formula I, or salt thereof, comprises the protected lactosylamine. Optionally, the protected lactosylamine is purified prior to introduction to the enzymatic reaction mixture. In some examples, the protected lactosylamine is purified by crystallization. The crystallization can be performed by (i) dissolving crude protected lactosylamine in a solvent mixture at an elevated temperature; (ii) cooling the combined mixture of protected lactosylamine in the solvent mixture to room temperature; (iii) adding pure protected lactosylamine; and (iv) incubating the mixture for a period of time to obtain crystals. Optionally, the solvent mixture comprises water and an alcohol (e.g., methanol, ethanol, propanol, 1-butanol, isobutanol, and the like). Optionally, the solvent mixture comprises a 5:1 to 1:5 ratio (e.g., a 2:1 to 1:2 ratio) of the water to the alcohol.
Another aspect described herein is a method for preparing a compound according to Formula II, or a salt thereof, the method comprising: forming a reaction mixture containing a protected lactosylthiazolidine; one or more monosaccharides selected from the group consisting of galactose, fucose, N-acetylglucosamine, and N-acetylneuraminic acid; one or more monosaccharide sugar activation enzymes; one or more nucleotide triphosphates; and one or more glycosyltransferases; and incubating the reaction mixture to form the compound according to Formula II;
wherein: R1 is selected from the group consisting of benzyl and fluorenylmethyl; R13 is selected from the group consisting of H and fucosyl; R26 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R32 is selected from the group consisting of H, acetyl, trifluoroacetyl, Boc, Fmoc; R36 is selected from the group consisting of H and N-acetylneuraminic acid; R33 and R34 are independently selected from the group consisting of H, fucosyl, and a galactosyl moiety A:
R42 is selected from the group consisting of H and fucosyl; R46 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R43 is selected from the group consisting of a H, a monosaccharide, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid.
Another aspect of the disclosure provides a method for preparing a compound according to Formula III,
or salt thereof, the method comprising: forming a reaction mixture containing a lactose dithioacetal, one or more monosaccharides selected from the group consisting of galactose, fucose, N-acetylglucosamine, and N-acetylneuraminic acid, one or more monosaccharide sugar activation enzymes, one or more nucleotide triphosphates, and one or more glycosyltransferases; and incubating the reaction mixture to form the compound according to Formula III; wherein each R2 is is independently selected from the group consisting of C1-6 alkyl and optionally substituted C6-14 aryl, or two R2 are taken together to form a C1-6 alkylene diradical; R13 is selected from the group consisting of H and fucosyl; R26 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R32 is selected from the group consisting of H, acetyl, trifluoroacetyl, Boc, Fmoc; R36 is selected from the group consisting of H and N-acetylneuraminic acid; R33 and R34 are independently selected from the group consisting of H, fucosyl, and a galactosyl moiety A
R42 is selected from the group consisting of H and fucosyl; R46 is selected from the group consisting of H, N-acetylglucosamine, N-acetylneuraminic acid, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid; R43 is selected from the group consisting of a H, a monosaccharide, and an oligosaccharide comprising 2-10 monosaccharides; and each monosaccharide is independently selected from the group consisting of glucose, galactose, L-fucose, N-acetylglucosamine, N-trifluoroacetylglucosamine, glucosamine, N-Fmoc-glucosamine, N-Boc-glucosamine, and N-acetylneuraminic acid.
In some embodiments, the incubation of the reaction mixture of the methods described herein comprises converting the nucleotide triphosphate and the monosaccharide to 1.1-2 molar equivalents of a nucleotide sugar, with respect to the protected lactosylamine, the protected lactosylthiazolidine, or the lactose dithioacetal.
In some embodiments, the reaction mixture of the methods described herein to prepare compounds of Formula I, Formula II, or Formula III, wherein the compounds of Formula I, Formula II, or Formula III comprise one or more fucosyl moieties, comprises one or more of the glycosyltransferases in the reaction mixture wherein the one or more glycosyltransferases are fucosyltransferases.
In some embodiments, two or more of the glycosyltransferase reactions of the methods described herein are conducted without purification and/or isolation of the oligosaccharide intermediates.
In some embodiments, the methods described herein include a purification step to isolate the compounds described herein. The purification step may include C18 cartridge purification. In some embodiments, the compounds described herein may be isolated via reverse-phase chromatography. In some embodiments, wherein the isolated compound is a compound of Formula I, the —NHC(O)OR1 moiety is converted to a —OH moiety in the isolated compound.
The methods described herein may include one or more sugar activation systems (SA), as shown in
In some embodiments, the enzymes for the formation of sugar nucleotide and the glycosyltransferases used herein are BLNahK, Bifidobacterium longum N-acetylhexosamine-1-kinase; PmGlmU, Pasteurella multocida N-acetylglucosamine 1-phosphate uridylyltransferase; PmPpA, Pasteurella multocida inorganic pyrophosphatase; NmLgtA, Neisseria meningitidis 01-3-N-acetylglucosaminyltransferase; Hpβ3GlcNAcT, Helicobacter pylori 01-3-N-acetylglucosaminyltransferase; β6GlcNAcT, β1-6-N-acetylglucosaminyltransferase; SpGalK, Streptococcus pneumoniae galactokinase; BLUSP, Bifidobacterium longum UDP-sugar synthase; Cvβ3GalT, Chromobacterium violaceum β1-3-galactosyltransferase; NmLgtB, Neisseria meningitidis β1-4-galactosyltransferase; BfFKP, a bifunctional enzyme from Bacteroides fragilis that has both L-fucokinase and GDP-fucose pyrophosphorylase activities; Hm2FT, Helicobacter mustelae α1-2-fucosyltransferase; Te2FT, Thermosynechococcus elongatus α1-2-fucosyltransferase; Hp3/4FT, Helicobacter pylori α1-3/4-fucosyltransferase; NmCSS, Neisseria meningitidis CMP-sialic acid synthetase; PmST3, Pasteurella multocida α2-3-sialyltransferase 3; PmST1_M144D, Pasteurella multocida α2-3-sialyltransferase 1 M144D mutant; Pd2,6ST, Photobacterium damselae α2-6-sialyltransferase (can add Neu5Ac to both terminal and internal Gal residues); Pd2,6ST_A200Y/S232Y, Photobacterium damselae α2-6-sialyltransferase A200Y/S232Y double mutant (a regio-selective α2-6-sialyltransferase for adding Neu5Ac to terminal Gal only); PmST1_β34H/M144V, PmST1 P34H/M144V double mutant (a regio-selective α2-6-sialyltransferase for adding Neu5Ac preferably to terminal Gal); hST6GALNAC V, recombinant human ST6GALNAC V expressed in E. coli; or a combination thereof.
Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application.
The examples below are intended to further illustrate certain aspects of the methods and compositions described herein, and are not intended to limit the scope of the claims.
Also described herein is an hST6GALNAC V variant comprising a polypeptide having at least 80% identity to the amino acid sequence of wild-type hST6GALNAC V. Optionally, the hST6GALNAC V variant comprises a mutation at one or more positions corresponding to G82, A84, V99, H107, G132, 5140, Y176, Q184, H190, L191, A201, H207, Q218, T239, F275, and N306 in wild-type hST6GALNAC V. In some cases, the N-terminus of the polypeptide is fused to a maltose binding protein. In some cases, the C-terminus of the polypeptide is fused to a His6 peptide. Further described herein are hST6GALNAC V variants comprising a polypeptide having at least 80% identity to the amino acid sequence set forth in SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, or SEQ ID NO: 8. Further described herein is a polynucleotide encoding a hST6GALNAC V variant as described herein. A host cell comprising the polynucleotide is also provided herein. Additionally provided herein is a reaction mixture comprising an hST6GALNAC V variant as described herein.
Further provided herein is an hGCNT2-B variant comprising a polypeptide having at least 80% identity to the amino acid sequence set forth in SEQ ID NO: 10. Also provided herein is a polynucleotide encoding an hGCNT2-B variant as described herein. A host cell comprising the polynucleotide is also provided herein. Additionally provided herein is a reaction mixture comprising an hGCNT2-B variant as described herein.
Provided below in the application are sequences corresponding to hST6GALNAC V and hGCNT2-B variants.
SEQ ID NO: 1 is the nucleic acid sequence of MBP-Δ50hST6GALNAC V-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 2 is the nucleic acid sequence of MBP-Δ50hST6GALNAC V_V99M-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 3 is the nucleic acid sequence of MBP-Δ50hST6GALNAC V_ext20-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 4 is the nucleic acid sequence of MBP-Δ50hST6GALNAC V_V99M_ext20-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 5 is the amino acid sequence of MBP-Δ50hST6GALNAC V-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 6 is the amino acid sequence of MBP-Δ50hST6GALNAC V_V99M-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 7 is the amino acid sequence of MBP-Δ50hST6GALNAC V_ext20-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 8 is the amino acid sequence of MBP-Δ50hST6GALNAC V_V99M_ext20-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 9 is the nucleic acid sequence of MBP-Δ25hGCNT2-B-His6, as provided in the Sequence Listing herein.
SEQ ID NO: 10 is the amino acid sequence of MBP-Δ25hGCNT2-B-His6, as provided in the Sequence Listing herein.
Materials. All chemicals were obtained from commercial suppliers and used without further purification. 1H NMR (600 or 800 MHz) and 13C NMR (150 or 200 MHz) spectra were recorded on a Bruker Avance-600 Spectrometer or a Avance-800 Spectrometer. High-resolution electrospray ionization (ESI) mass spectra were recorded using a Thermo Scientific Q Exactive HF Orbitrap Mass Spectrometer at the Mass Spectrometry Facilities in The University of California, Davis. Thin-layer chromatography (TLC, Sorbent Technologies) was performed on silica gel plates using anisaldehyde sugar stain for detection.
Expression and purification of enzymes involved in the synthesis. Recombinant enzymes were expressed and purified as described previously for NmLgtA, Hpβ3GlcNAcT, NmLgtB, Cvβ3GalT, PmST3, Pd2,6ST_A200Y/S232Y, Hm2FT, Hp3/4FT, Te2FT, SpGalK, BLUSP, BLNahK, PmGlmU, NmCSS, BfFKP, PmPpA. See, Lau, Yu, and X. Chen, et al. Chem. Commun. 2010, 46, 6066-6068; Li, Yu, and X. Chen, et al. Bioorg. Med. Chem. 2016, 24, 1696-1705; McArthur, Yu, and X. Chen, et al. ACS Catal. 2019, 9, 10721-10726; Thon, Yu, and X. Chen, et al. Appl. Microbiol. Biotechnol. 2012, 94, 977-985; Xu, Wang, Cao, and Cheng, et al. ACS Catal. 2018, 8, 7222-7227; Xiao and Wang, et al. J. Org. Chem. 2016, 81, 5851-5865; Yu and X. Chen, et al. Chem. Commun. 2017,53, 11012-11015; Zhao, Yu, and X. Chen, et al. Chem. Commun. 2016, 52, 3899-3902; M. Chen and Wang, et al. Carbohydr. Res. 2011, 346, 2421-2425; Muthana, Yu, and X. Chen, et al. Chem. Commun. 2012, 48, 2728-2730; Li, Yu, and X. Chen, et al. Molecules 2011, 16, 6396-6407; Y. Chen, Yu, and X. Chen, et al. Chem. Commun. 2011, 47, 10815-10817; Yu and X. Chen, et al. Bioorg. Med. Chem. 2004, 12, 6427-6435; Yi and X. Chen et al., Proc. Nat. Acad. Sci. U.S.A 2009, 106, 4207-4212; which are incorporated herein by reference in their entireties.
Briefly, E. coli BL21 (DE3) strains harboring the recombinant plasmid with the target gene was cultured in 50 mL Luria-Bertani (LB) media (10 g/L tryptone, 5 g/L yeast extract, and 10 g/L NaCl) containing 0.1 mg/mL ampicillin with rapid shaking (220 rpm) at 37° C. for overnight. Then 15 mL of the overnight cell culture was transferred into 1 L of LB media containing 0.1 mg/mL ampicillin and incubated at 37° C. When the OD600 nm of the cell culture reached 0.6-0.8, isopropyl-1-thio-β-D-galactopyranoside (IPTG, 0.1 mM) was added to induce the expression of the recombinant enzyme. The culture was then incubated at 20° C. with shaking (220 rpm) for 20 h. Cells were collected by centrifugation at 4392×g for 50 mn at 4° C. The cell pellet was re-suspended with lysis buffer (100 mM Tris-HCl buffer, pH 8.0, containing 0.1% Triton X-100). The cells were lysed by sonication with the following conditions: amplitude at 65%, 2 s pulse on and 3 s pulse off for 120 cycles. Cell lysate was obtained by centrifugation at 9016×g for 1 hour at 4° C. The supernatant was collected and loaded to a Ni2+-NTA affinity column pre-equilibrated with a binding buffer (50 mM Tris-HCl buffer, pH 7.5, 5 mM imidazole, 0.5 M NaCl). The column was washed with 10 column volumes of binding buffer and 10 column volumes of washing buffer (50 mM Tris-HCl buffer, pH 7.5, 10 mM imidazole, 0.5 M NaCl) and eluted using 10 column volumes of elution buffer (50 mM Tris-HCl buffer, pH 7.5, 200 mM imidazole, 0.5 M NaCl). Fractions containing the target protein were collected. PmST3, SpGalK, BLUSP, BLNahK, PmGlmU, NmCSS and PmPpA were dialyzed against a dialysis buffer (20 mM Tris-HCl buffer, pH 7.5, 10% glycerol) three times and stored at −20° C. 10% glycerol (by volume) was added to un-dialyzed NmLgtA, Hpβ3GlcNAcT, NmLgtB, Cvβ3GalT, Pd2,6ST_A200Y/S232Y, Hm2FT, Hp3/4FT, Te2FT and BfFKP, prior to storage at −20° C.
Mutagenesis to obtain Pd2,6ST_A200Y/S232Y. Plasmid in pET15b vector for expressing an N-terminal His6-tagged truncated Pd2,6ST (16-497 aa) was used as a mutagenesis template. See, Yu and X. Chen, et al. Angew. Chem. Int. Ed. Engl. 2006, 45, 3938-3944; See, Huynh, Yu, X. Chen, and Fisher, et al. FEBS Lett. 2014, 588, 4720-4729, which are incorporated herein by reference in their entireties. The plasmid was used as the template for polymerase chain reactions (PCRs) to introduce mutation sites following the QuikChange™ site-directed mutagenesis method (Stratagene; La Jolla, CA, USA). The single mutant Pd2,6ST_A200Y was obtained first which was then used as the template to obtain the double mutant Pd2,6ST_A200Y/S232Y. Primers used for generating the A200Y and S232Y mutations are shown below (with mutated sited underlined).
Polymerase chain reactions (PCRs) were each performed in a 50 μL reaction mixture containing 5 ng of template DNA, 1 M for each of the forward and reverse primers, 5 μL of 10× Phusion© HF buffer, 1 mM dNTP mixture, and 5 units (1 μL) of Phusion® HF DNA polymerase. The reaction mixtures were subjected to 30 cycles of amplifications with an annealing temperature of 55° C. The resulting PCR products were digested with DpnI, purified, and transformed into chemically competent E. coli DH5α cells. The plasmids of selected colonies were purified and sent to Genewiz (South Plainfield, NJ, USA) for sequencing. Positive plasmids were transformed into BL21 (DE3) chemically competent cells. Selected clones were grown for protein expression, purification, characterization, and application in chemoenzymatic synthesis.
Multigram-scale synthesis and purification of LacβNHCbz (1) from lactose. Lactose (5 g, 14.62 mmol) and ammonium bicarbonate (1.3 g, 16.45 mmol) was dissolved in ammonium hydroxide (25 mL) and the solution was incubated at 45-50° C. for 24 hours to form lactosylamine (LacβNH2). Solvent was removed in vacuo and the residue was dried under vacuum for 3-4 h. To obtain LacβNHCbz, LacβNH2 (5 g, 14.62 mmol) was dissolved in 650 mL of MeOH (methanol) and benzyl chloroformate (CbzCl, 9.5 g, 55.69 mmol) was added to the reaction mixture in a 1 L round bottom flask submerged in an ice-water bath. The pH of the mixture was adjusted and kept at 8.0-10.0 by adding N,N-diisopropylethylamine. The reaction mixture was then stirred at room temperature for 22 hours. LacβNHCbz was purified by ODS-SM column (140 g, 50 μm, 120 Å, Yamazen) on a CombiFlash® Rf 200i system. Consistently, 75-80% yields were obtained for 3-5 g-scale syntheses. The product formation was monitored by thin-layer chromatography (TLC) and by high-resolution mass spectrometry (HRMS). Ethyl acetate (EtOAc):methanol (MeOH):H2O=5:1.6:1 (by volume) was used as the developing solvent for TLC. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C20H29NNaO12 498.1587; found 498.1585.
Single C18-cartridge purification process for MSOPME-synthesized βNHCbz-tagged HMOs (2-22). After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes. The mixture was then cooled down to room temperature and centrifuged at 9016×g for 30 min at 4° C. The supernatant was collected. The precipitate was washed twice, each time with 3 mL of H2O, and the supernatants were combined. The combined supernatant was concentrated by rotavap to reduce the volume to about 3-5 mL which was purified by passing through a ODS-SM column (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash® Rf 200i system and monitored at 214 nm. The product was eluted with a mixed solvent of acetonitrile and water with a flow rate of 30 mL/min. The eluting program used was the following: Mobile phase A: water (v/v); Mobile phase B: acetonitrile (v/v); 0% B for 8 min followed by gradient 0% to 60% B over 25 min, gradient 60% to 100% B over 3 min, 100% B for 2 min, then 100% to 80% B over 2 min.
Preparative-scale synthesis of GlcNAcβ3Galβ4GlcβNHCbz (LNT-IIβNHCbz) (2) by OPME1a. LacβNHCbz (1, 100 mg, 0.21 mmol), GlcNAc (0.32 mmol), ATP (0.32 mmol), UTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). After the addition of BLNahK (1.5 mg), PmGlmU (1.2 mg), NmLgtA (1.8 mg), and PmPpA (1 mg) (OPME1a), water was added to bring the final volume to 21 mL and the concentration of LacβNHCbz (1) was 10 mM. The reaction mixture was incubated at 30° C. with agitation at 180 rpm in an incubator shaker for overnight. The product formation was monitored by thin-layer chromatography (TLC) and by high-resolution mass spectrometry (HRMS). Ethyl acetate (EtOAc):methanol (MeOH):H2O=5:1.6:1 (by volume) was used as the developing solvent for TLC. After the reaction was completed (22 h), the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes. The reaction mixture was then cooled down and the product was purified by following the single C18-cartridge purification procedures described above. LNT-IIβNHCbz (2) was obtained as a white powder (130 mg, 91% yield in 1 step from 100 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.46-7.38 (m, 5H), 5.17 (t, J=9.4 Hz, 2H), 4.81 (d, J=9.3 Hz, 1H), 4.67 (d, J=8.5 Hz, 1H), 4.43 (d, J=7.9 Hz, 1H), 4.13 (d, J=3.3 Hz, 1H), 3.93-3.84 (m, 2H), 3.80-3.52 (m, 12H), 3.49-3.34 (m, 3H), 2.02 (s, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.92, 158.08, 135.99, 128.76, 128.46, 127.80, 102.82, 102.81, 81.85, 81.62, 77.67, 76.41, 76.11, 75.62, 74.99, 74.85, 73.51, 71.37, 69.98, 69.65, 69.20, 68.33, 67.38, 60.94, 60.44, 59.83, 55.62, 22.12. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C28H42N2NaO17 701.2381; found 701.2383.
MSOPME preparative-scale synthesis of Galβ4GlcNAcβ3Galβ4GlcβNHCbz (LNnTβNHCbz) (3). The same procedure was carried out as described above (OPME1a) for the preparation of LNT-IIβNHCbz (2) from LacβNHCbz (1). After confirming the completion of the reaction for the formation of LNT-IIβNHCbz (2), the reaction mixture was incubated in a boiling water bath for 5 min and was then cooled down to room temperature. To the reaction mixture in the same tube, Gal (0.32 mmol), ATP (0.32 mmol), and UTP (0.32 mmol) were added and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. Enzymes including SpGalK (2.5 mg), BLUSP (2 mg), NmLgtB (3 mg), and PmPpA (1 mg) (OPME2a) were then added without increasing the volume of the reaction mixture significantly. The reaction mixture was incubated at 30° C. in an incubator shaker with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed (14 h), the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and then cooled down to room temperature before the product was purified. LNnTβNHCbz (3) was obtained as a white powder (152 mg, 86% yield for two steps from 100 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.45-7.38 (m, 5H), 5.17 (t, J=9.3 Hz, 2H), 4.81 (s, 1H), 4.69 (d, J=8.4 Hz, 1H), 4.46 (d, J=7.9 Hz, 1H), 4.43 (d, J=7.9 Hz, 1H), 4.14 (d, J=3.3 Hz, 1H), 3.96-3.87 (m, 3H), 3.86-3.61 (m, 16H), 3.60-3.50 (m, 3H), 3.43-3.33 (m, 1H), 2.02 (s, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.94, 158.15, 136.05, 128.83, 128.53, 127.87, 102.89, 102.78, 82.01, 81.70, 78.19, 77.74, 76.48, 76.18, 75.38, 75.06, 74.91, 74.59, 72.53, 72.20, 71.44, 71.00, 70.01, 69.27, 68.58, 68.39, 67.45, 61.06, 61.01, 59.89, 55.22, 22.21. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C34H52N2NaO22 863.2909; found 863.2896.
MSOPME preparative-scale synthesis of GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc-βNHCbz (GlcNAc-LNnTβNHCbz) (4). LacβNHCbz (1, 50 mg, 0.105 mmol), GlcNAc (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (200 mM, pH 7.5) and MgCl2 (40 mM). After the addition of BLNahK (1.2 mg), PmGlmU (1 mg), NmLgtA (1.5 mg), and PmPpA (0.5 mg) (OPME1a), water was added to bring the final volume to 5 mL, resulting in a solution containing 20 mM LacβNHCbz (1). The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. Reaction progress was monitored by TLC and by HRMS as described above for LNT-IIβNHCbz (2). After incubating the reaction mixture in a boiling water bath for 5 min and then cooled down, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), NmLgtB (1.8 mg), and PmPpA (0.5 mg) (OPME2a) were then added and the reaction mixture was incubated at 30° C. for overnight with agitation at 180 rpm. Reaction progress was monitored by TLC and by HRMS. Once the reaction was completed (14 h), the mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, GlcNAc (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BLNahK (1 mg), PmGlmU (0.8 mg), Hpβ3GlcNAcT (1.5 mg), and PmPpA (0.5 mg) (OPME1b) were then added. The final concentration of the acceptor in the reaction mixture was kept at around 10 mM and the final volume of the reaction mixture was kept at around 10.5 mL by adding Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The reaction mixture was incubated at 30° C. for overnight with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed (16 h), the reaction mixture was incubated in a boiling water bath for 5 min, cooled down, and purified. GlcNAc-LNnTβNHCbz (4) was obtained as a white powder (91 mg, 83% yield for three steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.39 (m, 5H), 5.18 (d, J=3.5 Hz, 2H), 4.82 (d, J=9.2 Hz, 1H), 4.69 (dd, J=11.7, 8.4 Hz, 2H), 4.47 (d, J=7.9 Hz, 1H), 4.44 (d, J=7.9 Hz, 1H), 4.15 (t, J=3.0 Hz, 2H), 3.97-3.87 (m, 3H), 3.85-3.65 (m, 18H), 3.58 (ddt, J=11.8, 10.4, 5.9 Hz, 4H), 3.51-3.40 (m, 3H), 2.04 (s, 3H), 2.03 (s, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.93, 174.87, 158.08, 135.99, 128.77, 128.47, 128.41, 127.81, 127.67, 102.87, 102.83, 102.71, 81.97, 81.94, 81.62, 78.18, 77.71, 76.13, 75.64, 75.01, 74.86, 74.84, 74.53, 73.54, 72.14, 69.98, 69.95, 69.66, 68.34, 68.29, 67.39, 60.95, 60.93, 60.46, 59.85, 55.64, 55.13, 22.15. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C42H65N3NaO27 1066.3703; found 1066.3708.
MSOPME preparative-scale synthesis of Galβ4GlcNAcβ3Galβ4(Fucα3)Glc-βNHCbz (LNnFP-VβNHCbz) (5). LacβNHCbz (1, 50 mg, 0.105 mmol), GlcNAc (0.16 mmol), ATP (0.16 mmol), UTP (0.16 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (200 mM, pH 7.5) and MgCl2 (40 mM). After the addition of BLNahK (1.2 mg), PmGlmU (1 mg), NmLgtA (1.5 mg), and PmPpA (0.5 mg) (OPME1a), water was added to bring the final volume to 5 mL with the concentration of LacβNHCbz (1) of 20 mM. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. Reaction was monitored as described above for preparation of LNT-IIβNHCbz (2). After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.16 mmol), ATP (0.16 mmol), and GTP (0.16 mmol) were added, and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BfFKP (1.5 mg), Hp3/4FT (1 mg), and PmPpA (0.5 mg) (OPME3c) were then added. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. After reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), NmLgtB (1.8 mg), and PmPpA (0.5 mg) (OPME2a) were then added. The final concentration of the acceptor was kept at around 10 mM by adding Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM) to bring the final volume of the reaction to around 10.5 mL. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. LNnFP-V βNHCbz (5) was obtained as a white powder (83 mg, 80% yield for three steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.39 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 4.82 (d, J=5.7 Hz, 1H), 4.80 (s, 1H), 4.70 (d, J=8.3 Hz, 1H), 4.48 (d, J=7.8 Hz, 1H), 4.42 (d, J=7.7 Hz, 1H), 4.09 (d, J=3.5 Hz, 1H), 3.98-3.90 (m, 4H), 3.90-3.42 (m, 23H), 2.03 (s, 3H), 1.16 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.91, 158.18, 136.09, 128.84, 128.53, 127.85, 127.74, 102.91, 102.74, 101.74, 98.53, 81.88, 81.54, 78.25, 77.67, 76.80, 75.39, 74.57, 74.53, 73.54, 73.15, 72.55, 72.17, 72.07, 71.96, 71.01, 70.68, 69.26, 68.59, 68.31, 68.02, 67.44, 66.57, 61.52, 61.06, 59.91, 59.66, 55.20, 22.20, 15.25. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C40H62N2NaO261009.3488; found 1009.3410.
MSOPME preparative-scale synthesis of Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)-GlcβNHCbz (LNnDFH-IIβNHCbz) (6). LNnTβNHCbz (3) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a and OPME2a) similar to that described above for the preparation of GlcNAc-LNnTβNHCbz (4). After the reaction for the formation of LNnTβNHCbz (3) was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.3 mmol), ATP (0.32 mmol), and GTP (0.32 mmol) were added. The pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BfFKP (4 mg), Hp3/4FT (1.5 mg), and PmPpA (1 mg) (OPME3c) were then added. The final concentration of the acceptor was kept at around 10 mM, and the final volume of the reaction was kept at around 10.5 mL by adding Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The reaction mixture was incubated at 30° C. in an incubator shaker for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.6:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. LNnDFH-IIβNHCbz (6) was obtained as a white powder (94 mg, 80% yield for three steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.39 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 5.14 (d, J=4.0 Hz, 1H), 4.84 (d, J=7.2 Hz, 2H), 4.71 (d, J=8.4 Hz, 1H), 4.47 (d, J=7.8 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.09 (d, J=3.6 Hz, 1H), 4.01-3.55 (m, 29H), 3.50 (ddd, J=10.6, 5.4, 3.3 Hz, 2H), 2.02 (s, 3H), 1.20-1.12 (m, 6H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.64, 158.11, 136.02, 128.78, 128.47, 127.78, 127.68, 102.48, 101.73, 101.67, 98.56, 98.46, 81.82, 81.48, 77.60, 76.73, 75.06, 74.89, 74.73, 74.47, 73.02, 72.46, 71.99, 71.89, 71.03, 70.64, 69.18, 68.33, 68.24, 67.96, 67.68, 67.38, 66.67, 66.50, 61.49, 61.46, 59.62, 55.93, 22.22, 15.29, 15.18. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C46H72N2NaO30 1155.4068; found 1155.4071.
MSOPME preparative-scale synthesis of GalβGlcNAcβ3Galβ4GlcβNHCbz (LNTβNHCbz) (7). LNT-IIβNHCbz (2) was formed as an intermediate from 100 mg LacβNHCbz (1) (OPME1a) similar to that described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.32 mmol), ATP (0.32 mmol), and UTP (0.32 mmol) were added, and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. SpGalK (2.5 mg), BLUSP (2 mg), Cvβ3GalT (4 mg), and PmPpA (1 mg) (OPME2b) were then added and the volume of the reaction mixture was kept at around 21 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. LNTβNHCbz (7) was obtained as a white powder (155 mg, 88% yield for two steps from 100 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.45-7.39 (m, 5H), 5.17 (t, J=9.4 Hz, 2H), 4.81 (d, J=9.1 Hz, 1H), 4.72 (d, J=8.5 Hz, 1H), 4.43 (dt, J=7.8, 1.3 Hz, 2H), 4.14 (d, J=3.3 Hz, 1H), 3.92-3.87 (m, 4H), 3.82-3.61 (m, 14H), 3.60-3.50 (m, 3H), 3.47 (ddd, J=10.0, 5.1, 2.3 Hz, 1H), 3.42-3.33 (m, 1H), 2.01 (s, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 175.00, 158.15, 136.06, 128.83, 128.52, 127.87, 103.52, 102.88, 102.60, 82.07, 81.95, 81.84, 81.70, 77.76, 77.33, 76.48, 76.18, 75.31, 75.22, 75.06, 74.92, 72.49, 71.44, 70.71, 70.05, 69.27, 68.56, 68.48, 68.38, 67.45, 61.07, 61.01, 60.53, 59.90, 54.73, 22.26. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C34H52N2NaO22 863.2909; found 863.2911.
MSOPME gram-scale synthesis of Fucα2Galβ3GlcNAcβ3Galβ4GlcβNHCbz (LNFP-IβNHCbz) (8). LacβNHCbz (1, 1 g, 2.1 mmol), GlcNAc (3.2 mmol), ATP (3.2 mmol), UTP (3.2 mmol) were dissolved in water in a 250 mL wide-mouth bottle. Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM) were then added. After BLNahK (8 mg), PmGlmU (9 mg), NmLgtA (12 mg), and PmPpA (3 mg) (OPME1a) were added, water was added to bring the final concentration of LacβNHCbz (1) to 25 mM. The reaction mixture was incubated at 30° C. in an incubator shaker for 3 days with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:1.6:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, Gal (3.2 mmol), ATP (3.2 mmol), UTP (3.2 mmol) were added and the pH was adjusted to 7.5. SpGalK (16 mg), BLUSP (16.5 mg), Cvβ3GalT (14 mg), and PmPpA (3 mg) (OPME2b) were added and the concentration of the acceptor was kept at 25 mM. The reaction mixture was incubated at 30° C. for 2 days with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, Fuc (3.2 mmol), ATP (3.2 mmol), and GTP (3.2 mmol) were added and the pH was adjusted to 7.5. BfFKP (30 mg), Te2FT (34 mg), and PmPpA (3 mg) (OPME3b) were then added and the concentration of the acceptor was kept at 25 mM. The reaction mixture was incubated at 30° C. for 5 days with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and then centrifuge at 9016×g for 30 min at 4° C. The supernatant was concentrated and purified by ODS-SM column (140 g, 50 μm, 120 Å, Yamazen) using CombiFlash® Rf 200i system, with a flow rate of 40 mL/min and a gradient elution of 0-100% acetonitrile in water for 60 min. Mobile phase A: water (v/v); Mobile phase B: acetonitrile (v/v); Gradient: 0% B for 15 min, 0% to 100% B over 40 min, 100% B for 2 min, then 100% to 80% B over 3 min. LNFP-IβNHCbz (8) was obtained as a white powder (1.73 g, 84% yield for three steps from 1 g of LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.53-7.48 (m, 5H), 5.25 (m, J=5.7 Hz, 3H), 4.92-4.83 (m, 1H), 4.70 (d, J=7.7 Hz, 1H), 4.69 (d, J=8.4 Hz, 1H), 4.49 (d, J=7.9 Hz, 1H), 4.35 (q, J=6.7 Hz, 1H), 4.21-4.17 (m, 1H), 4.05 (dd, J=10.5, 8.6 Hz, 1H), 4.00-3.68 (m, 21H), 3.66-3.54 (m, 4H), 3.47 (dt, J=10.0, 8.1 Hz, 1H), 2.12 (s, 3H), 1.29 (d, J=6.6 Hz, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.27, 159.29, 136.08, 128.85, 128.54, 128.48, 127.88, 127.75, 103.27, 102.95, 100.30, 99.56, 81.65, 81.59, 78.10, 77.76, 77.35, 77.28, 76.71, 76.51, 76.22, 75.32, 75.13, 75.07, 74.87, 73.56, 71.91, 71.48, 70.26, 69.51, 69.33, 69.19, 68.64, 68.63, 68.55, 68.12, 67.46, 66.54, 62.56, 61.19, 61.01, 60.62, 60.50, 59.96, 55.02, 22.22, 15.32. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C40H62N2NaO261009.3488; found 1009.3459.
MSOPME preparative-scale synthesis of GalβGlcNAcβ3Galβ4(Fucα3)Glc-βNHCbz (LNFP-VβNHCbz) (9). LacβNHCbz (1, 50 mg, 0.105 mmol), GlcNAc (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (200 mM, pH 7.5) and MgCl2 (40 mM). BLNahK (1.2 mg), PmGlmU (1 mg), NmLgtA (1.5 mg), and PmPpA (0.5 mg) (OPME1a) were then added and water was added to bring the total volume to 5 mL and the concentration of LacβNHCbz (1) to 20 mM. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. Reaction was monitored similar to that described for preparation of LNT-IIβNHCbz (2). After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.16 mmol), ATP (0.16 mmol), and GTP (0.16 mmol) were added and the pH was adjusted to 7.5 by adding 4 M NaOH. BfFKP (1.5 mg), Hp3/4FT (1 mg), and PmPpA (0.5 mg) (OPME3c) were then added and the acceptor concentration was around 15 mM. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added and the pH was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), Cvβ3GalT (2.5 mg), and PmPpA (0.5 mg) (OPME2b) were then added and the final volume of the reaction was kept at around 10.5 mL with the final concentration of the acceptor at around 10 mM. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. LNFP-VβNHCbz (9) was obtained as a white powder (84 mg, 81% yield for three steps from 50 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.45-7.41 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (d, J=5.5 Hz, 2H), 4.82 (s, 1H), 4.80 (s, 1H), 4.72 (d, J=8.5 Hz, 1H), 4.44 (d, J=7.7 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.09 (d, J=3.5 Hz, 1H), 3.95-3.68 (m, 19H), 3.65-3.46 (m, 8H), 2.02 (s, 3H), 1.16 (d, J=6.7 Hz, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.98, 158.17, 136.08, 128.84, 128.83, 128.53, 127.84, 127.74, 103.50, 102.56, 101.72, 98.53, 82.01, 81.87, 81.48, 77.67, 76.80, 76.49, 75.32, 75.19, 74.54, 73.14, 72.50, 72.07, 71.95, 70.73, 70.72, 69.28, 69.25, 68.57, 68.48, 68.30, 68.02, 67.44, 66.56, 61.52, 61.08, 60.53, 59.67, 54.73, 29.64, 22.25, 15.25. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C40H62N2NaO261009.3488; found 1009.3409.
MSOPME preparative-scale synthesis of Galβ33(Fucα4)GlcNAcβ3Galβ4(Fucα3)-GlcβNHCbz (LNDFH-IIβNHCbz) (10). LNFP-VβNHCbz (9) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME3c, and OPME2b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.16 mmol), ATP (0.16 mmol), and GTP (0.16 mmol) were added and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BfFKP (1.5 mg), Hp3/4FT (0.8 mg), and PmPpA (0.5 mg) (OPME3c) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.6:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. LNDFH-IIβNHCbz (10) was obtained as a white powder (95 mg, 80% yield for four steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.39 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 5.03 (d, J=4.0 Hz, 1H), 4.88 (q, J=6.7 Hz, 1H), 4.83-4.80 (m, 1H), 4.76 (s, 1H), 4.69 (d, J=8.5 Hz, 1H), 4.51 (d, J=7.7 Hz, 1H), 4.42 (d, J=7.7 Hz, 1H), 4.10-4.05 (m, 2H), 3.98-3.67 (m, 22H), 3.65-3.46 (m, 8H), 2.03 (s, 3H), 1.17 (dd, J=9.1, 6.6 Hz, 6H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.71, 158.11, 136.01, 128.77, 128.46, 127.77, 127.67, 102.81, 102.56, 101.68, 98.46, 97.97, 81.80, 81.50, 77.59, 76.73, 75.89, 75.16, 74.78, 74.47, 73.07, 72.28, 72.09, 72.00, 71.92, 71.89, 70.62, 70.55, 70.48, 69.19, 69.11, 68.33, 68.23, 67.95, 67.77, 67.37, 66.82, 66.50, 61.63, 61.46, 59.58, 55.84, 22.25, 15.34, 15.18. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C46H72N2NaO30 1155.4068; found 1155.4080.
MSOPME preparative-scale synthesis of Galβ4GlcNAcβ3Galβ4GlcNAcβ3Gal-β4GlcβNHCbz (pLNnHβNHCbz) (11). GlcNAc-LNnTβNHCbz (4) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, and OPME1b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), NmLgtB (1.8 mg), and PmPpA (0.5 mg) (OPME2a) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 16 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.8:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. pLNnHβNHCbz (11) was obtained as a white powder (100 mg, 79% yield for four steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.39 (m, 4H), 5.18 (d, J=3.6 Hz, 2H), 4.82 (d, J=9.5 Hz, 1H), 4.70 (d, J=8.4 Hz, 2H), 4.46 (dt, J=15.5, 7.8 Hz, 3H), 4.15 (dd, J=5.2, 3.3 Hz, 2H), 3.94 (tt, J=10.9, 4.7 Hz, 4H), 3.87-3.63 (m, 24H), 3.61-3.50 (m, 5H), 3.42 (t, J=8.9 Hz, 1H), 2.03 (s, 6H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.87, 158.08, 136.00, 128.78, 128.47, 127.82, 127.68, 102.87, 102.84, 102.73, 102.71, 82.05, 81.95, 81.65, 78.18, 78.15, 78.04, 77.71, 76.12, 75.33, 75.01, 74.85, 74.53, 72.49, 72.16, 72.14, 72.03, 71.40, 70.94, 69.96, 69.94, 68.53, 68.33, 68.29, 67.40, 63.02, 62.46, 61.00, 60.95, 60.93, 59.84, 56.34, 55.17, 55.13, 22.16. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C48H75N3NaO32 1228.4231; found 1228.4226.
MSOPME preparative-scale synthesis of GlcNAcβ3Galβ4GlcNAcβ3Gal-β4GlcNAcβ3Galβ4GlcβNHCbz (GlcNAc-pLNnHβNHCbz) (12). pLNnHβNHCbz (11) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2a) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, GlcNAc (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BLNahK (1 mg), PmGlmU (0.8 mg), Hpβ3GlcNAcT (1.5 mg), and PmPpA (0.5 mg) (OPME1b) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:3.2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. GlcNAc-pLNnHβNHCbz (12) was obtained as a white powder (114 mg, 77% yield for five steps from 50 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.46-7.39 (m, 5H), 5.18 (d, J=6.1 Hz, 2H), 4.69 (d, J=8.4 Hz, 2H), 4.67 (d, J=8.4 Hz, 1H), 4.46 (d, J=7.9 Hz, 2H), 4.43 (d, J=7.9 Hz, 1H), 4.14 (t, J=3.6 Hz, 3H), 3.92 (ddd, J=43.6, 12.5, 2.2 Hz, 4H), 3.85-3.64 (m, 26H), 3.60-3.54 (m, 6H), 3.48-3.39 (m, 3H), 2.03 (s, 3H), 2.02 (s, 6H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.92, 174.87, 158.07, 136.00, 128.77, 128.46, 128.41, 127.81, 127.67, 102.86, 102.83, 102.72, 102.71, 82.03, 81.96, 81.94, 81.64, 78.14, 77.69, 76.12, 75.63, 75.00, 74.86, 74.84, 74.52, 73.53, 72.14, 71.39, 69.97, 69.95, 69.93, 69.65, 68.33, 68.29, 67.39, 60.94, 60.93, 60.45, 59.83, 55.63, 55.12, 29.57, 22.15. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C56H88N4NaO37 1431.5025; found 1431.5033.
MSOPME preparative-scale synthesis of Fucα2Galβ4GlcNAcβ3Galβ4GlcNAc-β3Galβ4GlcβNHCbz (F-pLNnH-IβNHCbz) (13). pLNnHβNHCbz (11) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2a) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.16 mmol), ATP (0.16 mmol), and GTP (0.16 mmol) were added and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BfFKP (1.5 mg), Hm2FT (1.5 mg), and PmPpA (0.5 mg) (OPME3a) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:3:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. F-pLNnH-Iβ3NHCbz (13) was obtained as a white powder (108 mg, 76% yield for five steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.38 (m, 5H), 5.31 (d, J=3.0 Hz, 1H), 5.18 (d, J=3.8 Hz, 2H), 4.82 (s, 1H), 4.70 (d, J=8.4 Hz, 2H), 4.55 (d, J=7.7 Hz, 1H), 4.47 (d, J=8.1 Hz, 1H), 4.44 (d, J=7.9 Hz, 1H), 4.22 (d, J=6.7 Hz, 1H), 4.15 (d, J=3.3 Hz, 2H), 4.01-3.62 (m, 33H), 3.62-3.52 (m, 3H), 3.49-3.38 (m, 2H), 2.04 (s, 3H), 2.03 (s, 3H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.88, 158.07, 135.98, 128.77, 128.47, 128.40, 127.81, 127.67, 102.86, 102.83, 102.75, 102.70, 100.22, 99.39, 81.98, 81.94, 78.16, 77.69, 76.43, 76.12, 75.84, 75.23, 75.07, 75.00, 74.84, 74.82, 74.52, 73.50, 72.14, 72.04, 71.64, 71.37, 69.96, 69.59, 69.09, 68.34, 68.28, 68.18, 67.39, 66.92, 61.10, 60.95, 60.89, 59.84, 55.36, 55.13, 22.18, 22.15, 15.29. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C54H85N3NaO36 1374.4810; found 1374.4807.
MSOPME preparative-scale synthesis of Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)-GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (TF-pLNnHβNHCbz) (14). pLNnHβNHCbz (11) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2a) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.53 mmol), ATP (0.53 mmol), and GTP (0.53 mmol) were added and the pH of the reaction was adjusted to 7.5 by adding 4 M NaOH. BfFKP (5.5 mg), Hp3/4FT (2.5 mg), and PmPpA (1 mg) (OPME3c) were then added and the final volume was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. TF-pLNnHβNHCbz (14) was obtained as a white powder (131 mg, 76% yield for five steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.49-7.39 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 5.13 (dd, J=8.3, 4.0 Hz, 2H), 4.84 (d, J=6.6 Hz, 1H), 4.82 (d, J=8.1 Hz, 2H), 4.81-4.80 (m, 1H), 4.70 (d, J=8.4 Hz, 2H), 4.49-4.39 (m, 3H), 4.09 (dd, J=6.0, 3.4 Hz, 2H), 3.99-3.55 (m, 43H), 3.49 (ddt, J=9.6, 7.9, 4.9 Hz, 3H), 2.07-1.98 (m, 6H), 1.20-1.12 (m, 9H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.64, 174.62, 158.10, 136.01, 128.77, 128.46, 127.77, 127.67, 102.46, 101.72, 101.67, 98.66, 98.56, 98.45, 81.80, 81.59, 81.45, 77.59, 76.72, 75.04, 74.89, 74.73, 74.45, 74.41, 73.08, 73.00, 72.74, 72.45, 71.97, 71.88, 71.83, 71.03, 70.64, 70.50, 69.17, 69.14, 68.33, 68.22, 67.95, 67.68, 67.61, 67.37, 66.66, 66.49, 61.48, 61.44, 59.61, 55.92, 23.24, 22.21, 15.29, 15.25, 15.18. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C66H105N3NaO44 1666.5969; found 1666.5840.
MSOPME preparative-scale synthesis of Neu5Acα3Galβ4GlcNAcβ3Gal-β4GlcNAcβ3Galβ4GlcβNHCbz (Neu5Acα2-3pLNnHβNHCbz) (15). pLNnHβNHCbz (11) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2a) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Neu5Ac (0.16 mmol) and CTP (0.26 mmol) were added and the pH was adjusted to 8.5 by adding 4 M NaOH. NmCSS (0.5 mg) and PmST3 (1.5 mg) (OPME4a) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 14 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. Neu5Acα2-3pLNnHβNHCbz (15) was obtained as a white powder (123 mg, 78% yield for five steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.48-7.39 (m, 5H), 5.18 (d, J=3.7 Hz, 2H), 4.82 (s, 1H), 4.70 (dd, J=8.5, 2.3 Hz, 2H), 4.56 (d, J=7.9 Hz, 1H), 4.45 (dd, J=15.4, 7.8 Hz, 2H), 4.15 (dd, J=5.6, 3.3 Hz, 2H), 4.12 (dd, J=9.9, 3.1 Hz, 1H), 3.99-3.38 (m, 40H), 2.76 (dd, J=12.4, 4.6 Hz, 1H), 2.03 (s, 9H), 1.80 (t, J=12.1 Hz, 1H). 13C NMR (150 MHz, D2O, 30° C.) δ 175.06, 174.94, 174.92, 173.90, 158.15, 136.06, 128.84, 128.54, 127.88, 127.74, 102.92, 102.85, 102.79, 102.60, 99.86, 82.12, 82.02, 81.72, 78.24, 78.05, 77.79, 76.19, 75.54, 75.22, 75.08, 74.94, 74.92, 74.60, 72.94, 72.21, 72.19, 71.82, 71.47, 70.02, 69.43, 68.39, 68.35, 68.14, 67.53, 67.46, 62.63, 61.08, 61.05, 61.02, 59.91, 59.69, 55.23, 55.19, 54.41, 51.73, 42.59, 39.69, 22.23, 22.09, 12.19. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C59H91N4O40 1495.5215; found 1495.5212.
MSOPME preparative-scale synthesis of Neu5Acα6Galβ4GlcNAcβ3Gal-β4GlcNAcβ3Galβ4GlcβNHCbz (Neu5Acα2-6pLNnHβNHCbz) (16). pLNnHβNHCbz (11) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2a) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Neu5Ac (0.13 mmol) and CTP (0.21 mmol) were added and the pH of the reaction was adjusted to 8.5 by adding 4 M NaOH. NmCSS (0.5 mg) and Pd2,6ST_A200Y/S232Y (1.5 mg) (OPME4b) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 14 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. Neu5Acα2-6pLNnHβNHCbz (16) was obtained as a white powder (118 mg, 75% yield for five steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.49-7.38 (m, 5H), 5.23-5.15 (m, 2H), 4.83 (s, 1H), 4.73 (d, J=7.7 Hz, 1H), 4.70 (d, J=8.3 Hz, 1H), 4.50-4.40 (m, 3H), 4.15 (t, J=4.0 Hz, 2H), 4.05-3.36 (m, 40H), 2.72-2.62 (m, 1H), 2.05 (s, 3H), 2.03 (s, 6H), 1.72 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.89, 173.50, 158.09, 136.00, 128.77, 128.47, 127.81, 103.44, 102.84, 102.73, 102.56, 100.11, 82.00, 81.96, 81.64, 80.46, 78.17, 77.72, 76.12, 75.00, 74.86, 74.53, 74.24, 73.68, 72.52, 72.40, 72.21, 72.15, 71.69, 71.39, 70.71, 69.95, 68.38, 68.34, 68.28, 68.19, 67.39, 63.33, 62.64, 60.95, 60.12, 59.83, 59.29, 55.11, 54.92, 51.86, 40.05, 22.27, 22.15, 22.01. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C59H91N4O40 1495.5215; found 1495.5140.
MSOPME preparative-scale synthesis of GalβGlcNAcβ3Galβ4GlcNAcβ3Gal-β4GlcβNHCbz (pLNHβNHCbz) (17). GlcNAc-LNnTβNHCbz (4) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, and OPME1b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), Cvβ3GalT (2.5 mg), and PmPpA (0.5 mg) (OPME2b) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.8:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. pLNHβNHCbz (17) was obtained as a white powder (101 mg, 80% yield for four steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.50-7.39 (m, 5H), 5.19 (t, J=3.4 Hz, 2H), 4.82 (d, J=9.8 Hz, 1H), 4.73 (d, J=8.4 Hz, 1H), 4.70 (d, J=8.3 Hz, 1H), 4.47 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.8 Hz, 2H), 4.15 (t, J=3.8 Hz, 2H), 3.98-3.88 (m, 4H), 3.87-3.33 (m, 30H), 1.97-2.08 (m, 6H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.94, 174.88, 158.09, 136.00, 128.78, 128.47, 127.82, 127.68, 103.47, 102.84, 102.73, 102.71, 102.55, 82.05, 82.00, 81.95, 81.65, 78.18, 78.15, 77.71, 76.12, 75.33, 75.26, 75.16, 75.01, 74.85, 74.53, 72.49, 72.45, 72.14, 71.40, 70.94, 70.66, 69.96, 68.51, 68.42, 68.34, 68.28, 67.40, 63.02, 61.01, 60.95, 60.93, 60.47, 59.84, 55.17, 55.13, 54.68, 22.21, 22.15. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C48H75N3NaO32 1228.4231; found 1228.4233.
MSOPME preparative-scale synthesis of Fucα2GalβGlcNAcβ3Galβ4GlcNAc-β3Galβ4GlcβNHCbz (F-pLNH-IβNHCbz) (18). pLNHβNHCbz (17) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.16 mmol), ATP (0.16 mmol), and GTP (0.16 mmol) were added and the pH was adjusted to 7.5 by adding 4 M NaOH. BfFKP (1.5 mg), Hm2FT (1.5 mg), and PmPpA (0.5 mg) (OPME3a) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:3:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. F-pLNH-Iβ3NHCbz (18) was obtained as a white powder (105 mg, 74% yield for five steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.44 (d, J=4.4 Hz, 5H), 5.18 (dd, J=5.7, 3.7 Hz, 3H), 4.81 (d, J=9.4 Hz, 1H), 4.69 (d, J=8.3 Hz, 1H), 4.64 (d, J=7.7 Hz, 1H), 4.61 (d, J=8.4 Hz, 1H), 4.44 (dd, J=7.9, 6.2 Hz, 2H), 4.29 (q, J=6.7 Hz, 1H), 4.14 (q, J=3.3 Hz, 2H), 4.01-3.86 (m, 5H), 3.85-3.63 (m, 24H), 3.61-3.33 (m, 9H), 1.98-2.07 (m, 6H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.87, 174.20, 158.09, 135.99, 128.76, 128.46, 127.80, 103.22, 102.88, 102.83, 102.70, 100.21, 99.48, 81.93, 81.76, 81.58, 78.04, 77.67, 77.26, 77.12, 76.63, 76.41, 76.11, 75.21, 75.04, 74.99, 74.83, 74.78, 74.53, 73.45, 72.10, 71.82, 71.70, 71.38, 70.16, 69.94, 69.40, 69.20, 69.09, 68.53, 68.42, 68.33, 68.01, 67.38, 66.46, 61.12, 60.91, 60.50, 60.35, 59.82, 55.11, 54.94, 22.14, 22.11, 15.23. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C54H85N3NaO36 1374.4810; found 1374.4860.
MSOPME preparative-scale synthesis of Galβ3(Fucα4)GlcNAcβ3Galβ4(Fucα3)-GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (TF-pLNH-IIIβNHCbz) (19). pLNHβNHCbz (17) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, and OPME2b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, L-fucose (0.53 mmol), ATP (0.53 mmol), and GTP (0.53 mmol) were added and the pH was adjusted to 7.5 by adding 4 M NaOH. BfFKP (5.5 mg), Hp3/4FT (2.5 mg), and PmPpA (1 mg) (OPME3c) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. TF-pLNH-IIIβNHCbz (19) was obtained as a white powder (124 mg, 72% yield for five steps from 50 mg LacβNHCbz). 1H NMR (600 MHz, D2O, 30° C.) δ 7.46-7.39 (m, 5H), 5.43 (d, J=4.0 Hz, 1H), 5.18 (d, J=6.4 Hz, 2H), 5.12 (d, J=4.0 Hz, 1H), 5.02 (d, J=4.0 Hz, 1H), 4.87 (q, J=6.7 Hz, 1H), 4.84-4.79 (m, 3H), 4.69 (t, J=7.8 Hz, 2H), 4.51 (d, J=7.6 Hz, 1H), 4.43 (dd, J=16.3, 7.8 Hz, 2H), 4.11-4.04 (m, 3H), 3.98-3.66 (m, 33H), 3.65-3.44 (m, 12H), 2.02 (s, 3H), 2.01 (s, 3H), 1.19-1.12 (m, 9H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.69, 174.64, 158.11, 136.01, 128.76, 128.46, 127.77, 127.67, 102.82, 102.55, 102.45, 101.69, 98.66, 98.45, 97.97, 81.79, 81.63, 81.44, 77.58, 76.72, 75.89, 75.15, 75.03, 74.78, 74.72, 74.45, 74.42, 73.07, 72.76, 72.27, 72.07, 71.96, 71.92, 71.87, 71.83, 70.63, 70.46, 69.18, 69.14, 69.10, 68.32, 68.22, 67.94, 67.75, 67.61, 67.37, 66.82, 66.65, 66.49, 61.63, 61.44, 59.59, 55.92, 55.84, 22.24, 22.20, 15.34, 15.25, 15.17. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C66H105N3NaO44 1666.5969; found 1666.5909.
MSOPME preparative-scale synthesis of Galβ4GlcNAcβ3Galβ4GlcNAcβ3Gal-β4GlcNAcβ3Galβ4GlcβNHCbz (pLNnOβNHCbz) (20). GlcNAc-pLNnHβNHCbz (12) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME1b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), NmLgtB (1.8 mg), and PmPpA (0.5 mg) (OPME2a) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. pLNnOβNHCbz (20) was obtained as a white powder (119 mg, 72% yield for six steps from 50 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.46-7.38 (m, 5H), 5.18 (dd, J=11.8, 6.8 Hz, 2H), 4.81 (d, J=10.4 Hz, 1H), 4.69 (dd, J=8.4, 2.5 Hz, 3H), 4.48-4.44 (m, 3H), 4.43 (d, J=7.8 Hz, 1H), 4.14 (dt, J=5.3, 2.9 Hz, 3H), 3.95-3.85 (m, 5H), 3.85-3.61 (m, 32H), 3.60-3.50 (m, 7H), 3.36-3.44 (m, 1H), 2.00-2.03 (m, 9H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.86, 135.98, 128.76, 128.46, 128.40, 127.80, 127.66, 102.85, 102.82, 102.73, 102.71, 82.03, 81.94, 81.63, 78.11, 78.01, 77.67, 76.11, 75.32, 74.99, 74.83, 74.51, 72.47, 72.14, 72.13, 71.38, 70.93, 69.94, 69.92, 68.51, 68.32, 68.28, 67.39, 61.00, 60.94, 60.92, 59.81, 55.15, 55.11, 22.14. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C62H98N4NaO42 1593.5553; found 1593.5419.
MSOPME preparative-scale synthesis of GalβGlcNAcβ3Galβ4GlcNAcβ3Gal-β4GlcNAcβ3Galβ4GlcβNHCbz (pLNOβNHCbz) (21). GlcNAc-pLNnHβNHCbz (12) was prepared as an intermediate from 50 mg LacβNHCbz (1) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME1b) as described above. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and then cooled down. In the same reaction tube without workup or purification, Gal (0.16 mmol), ATP (0.16 mmol), and UTP (0.16 mmol) were added, and the pH was adjusted to 7.5 by adding 4 M NaOH. SpGalK (1.5 mg), BLUSP (1.2 mg), Cvβ3GalT (2.5 mg), and PmPpA (0.5 mg) (OPME2b) were then added and the final volume of the reaction was kept at around 10.5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 18 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. pLNOβNHCbz (21) was obtained as a white powder (117 mg, 71% yield for six steps from 50 mg LacβNHCbz). 1H NMR (800 MHz, D2O, 30° C.) δ 7.47-7.38 (m, 5H), 5.17 (d, J=6.2 Hz, 2H), 4.81 (d, J=9.1 Hz, 1H), 4.69 (dd, J=8.4, 2.2 Hz, 3H), 4.46 (dd, J=11.2, 7.9 Hz, 3H), 4.43 (d, J=7.9 Hz, 1H), 4.14 (dt, J=6.0, 2.6 Hz, 3H), 3.95-3.87 (m, 6H), 3.85-3.74 (m, 12H), 3.73-3.68 (m, 16H), 3.66-3.62 (m, 3H), 3.59-3.51 (m, 7H), 3.40 (t, J=9.1 Hz, 1H), 2.00-2.03 (m, 9H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.86, 158.08, 135.98, 128.75, 128.46, 127.80, 103.45, 102.85, 102.82, 102.72, 102.71, 102.54, 82.02, 81.93, 81.62, 78.11, 77.66, 76.11, 75.31, 75.24, 75.14, 74.99, 74.83, 74.51, 72.46, 72.43, 72.12, 71.37, 70.93, 70.64, 69.96, 69.94, 69.92, 68.51, 68.49, 68.40, 68.32, 68.28, 67.39, 60.99, 60.93, 60.92, 60.45, 59.81, 55.15, 55.11, 22.19, 22.14. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C62H98N4NaO42 1593.5553; found 1593.5554.
Preparative-scale synthesis of Fucα2Galβ3GlcNAcβ3Galβ4GlcNAcβ3Gal-β4GlcNAcβ3Galβ4GlcβNHCbz (Fucα2pLNOβNHCbzβNHCbz) (22) from 21 by OPME3a. pLNOβNHCbz (21, 20 mg, 0.0127 mmol), L-fucose (0.019 mmol), ATP (0.019 mmol), and GTP (0.019 mmol) were dissolved in a small amount of water. Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM) were added. After the addition of BfFKP (0.8 mg), Hm2FT (0.5 mg), and PmPpA (0.1 mg) (OPME3a), water was added to bring the final concentration of pLNOβNHCbz (21) to 10 mM. The reaction mixture was incubated at 30° C. in an incubator shaker for 12 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=4:3:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down, and purified. Fucα2pLNOβNHCbzβNHCbz (22) was obtained as a white powder (20 mg, 92% yield from 20 mg of pLNOβNHCbz 21). 1H NMR (600 MHz, D2O, 30° C.) δ 7.52-7.36 (m, 5H), 5.19 (d, J=4.0 Hz, 3H), 4.84-4.81 (m, 1H), 4.70 (d, J=8.4 Hz, 2H), 4.63 (dd, J=15.8, 8.0 Hz, 2H), 4.48-4.42 (m, 3H), 4.29 (q, J=6.5 Hz, 1H), 4.15 (q, J=3.6 Hz, 3H), 4.01-3.88 (m, 6H), 3.86-3.64 (m, 34H), 3.61-3.48 (m, 8H), 3.41 (q, J=8.2, 7.5 Hz, 1H), 2.06 (s, 3H), 2.03 (s, 3H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.95, 174.27, 136.07, 128.84, 128.54, 128.48, 127.88, 127.74, 103.29, 102.93, 102.90, 102.82, 102.79, 100.29, 99.56, 99.46, 82.10, 82.03, 78.21, 78.10, 77.77, 77.20, 76.70, 76.49, 76.18, 75.91, 75.29, 75.13, 75.07, 74.91, 74.59, 73.57, 72.21, 72.11, 71.90, 71.71, 71.46, 70.23, 70.02, 70.00, 69.66, 69.48, 69.16, 68.60, 68.51, 68.40, 68.36, 68.25, 68.09, 67.46, 66.99, 66.53, 61.16, 60.99, 60.43, 60.04, 59.90, 55.42, 55.19, 55.02, 22.24, 22.22, 22.19, 15.36, 15.30. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C65H108N4NaO46 1739.6132; found 1739.6113.
Hydrogenation procedures. To remove the Cbz tag from the obtained glycosides (2-22), a catalytic amount (10-20%) of palladium on charcoal (Pd/C) (2 mg) was added to a solution containing 20 mg of each compound in H2O:MeOH=1:1 (by volume) (3 mL). The mixture was stirred at room temperature under a hydrogen atmosphere with a balloon. The reaction was monitored by high-resolution mass spectrometry (“HRMS”). When the reaction was completed (2-12 hours (h)), the mixture was passed through a 0.45 μm syringe filter to remove palladium and charcoal. The solvent was removed in vacuo. The residue obtained was dissolved in 3 mL of H2O and the mixture was incubated at 37° C. with 120 rpm agitation in an incubator shaker. The reaction was monitored by HRMS until the β-glycosylamine was completely converted to the target HMO with a free reducing end (4-5 days for HMOs containing an L-fucose linked to the reducing end Glc residue or 20-48 hours for other HMOs). The pure HMO was obtained by lyophilization without further purification.
GlcNAcβ3Galβ4Glc (LNT-II, 2a). White powder, 15.4 mg, 95% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.21 (d, J=3.7 Hz, 0.4H), 4.67 (d, J=8.5 Hz, 1H), 4.65 (d, J=8.0 Hz, 0.6H), 4.42 (dd, J=7.9, 1.4 Hz, 1H), 4.14 (d, J=3.4 Hz, 1H), 3.95-3.68 (m, 9H), 3.68-3.22 (m, 7H), 2.02 (s, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.92, 102.89, 102.85, 102.81, 95.70, 91.77, 81.91, 81.89, 78.32, 78.22, 75.65, 75.62, 74.86, 74.76, 74.31, 73.97, 73.75, 73.52, 71.36, 71.09, 70.08, 70.00, 69.97, 69.66, 69.65, 68.34, 68.31, 60.94, 60.93, 60.44, 60.04, 59.91, 55.68, 55.62, 22.14, 22.12. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C20H35NNaO16 568.1854; found 568.1889.
Galβ4GlcNAcβ3Galβ4Glc (LNnT, 3a). White powder, 16.5 mg, 95% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.22 (d, J=3.8 Hz, 0.4H), 4.73-4.71 (d, 1H), 4.67 (d, J=7.9 Hz, 0.6H), 4.48 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.8 Hz, 1H), 4.16 (d, J=3.3 Hz, 1H), 3.99-3.92 (m, 3H), 3.90-3.71 (m, 13H), 3.69-3.53 (m, 7H), 3.28 (t, J=8.5 Hz, 0.6H), 2.04 (s, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.88, 102.91, 102.87, 102.84, 102.72, 95.72, 91.79, 82.01, 78.36, 78.26, 78.16, 75.33, 74.86, 74.78, 74.54, 74.33, 73.77, 72.49, 72.16, 71.38, 71.11, 70.95, 70.10, 69.95, 68.53, 68.32, 61.01, 60.95, 60.06, 59.93, 59.85, 55.18, 22.16. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C26H45NNaO21 730.2382; found 730.2371.
GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (4a). White powder, 17.2 mg, 95% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.21 (d, J=3.8 Hz, 0.4H), 4.69 (dd, J=8.4, 3.0 Hz, 1H), 4.67 (d, J=8.5 Hz, 1H), 4.65 (d, J=8.0 Hz, 0.6H), 4.45 (d, J=7.9 Hz, 1H), 4.42 (d, J=7.9, 1.4 Hz, 1H), 4.14 (d, J=3.3 Hz, 2H), 3.91 (ddd, J=44.3, 12.1, 2.0 Hz, 3H), 3.84-3.54 (m, 23H), 3.47-3.26 (m, 2H), 2.04-2.00 (m, 6H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.91, 174.86, 102.89, 102.85, 102.83, 102.70, 95.70, 91.77, 81.99, 81.97, 81.96, 78.32, 78.21, 78.13, 75.62, 74.85, 74.83, 74.76, 74.51, 74.31, 73.75, 73.52, 72.13, 71.36, 71.09, 70.08, 69.96, 69.93, 69.63, 68.33, 68.30, 68.28, 60.93, 60.92, 60.43, 60.03, 59.90, 59.82, 55.62, 55.11, 22.14, 22.12. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C34H58N2NaO26 933.3175; found 933.3238.
Galβ4GlcNAcβ3Galβ4(Fucα3)Glc (LNnFP-V, 5a). White powder, 17.1 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.41 (d, J=4.0 Hz, 0.4H), 5.36 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=3.8 Hz, 0.3H), 4.80 (d, J=6.8 Hz, 1H), 4.69 (dd, J=8.4, 2.9 Hz, 1H), 4.64 (d, J=8.0 Hz, 0.4H), 4.47 (d, J=7.8 Hz, 1H), 4.40 (d, J=8.2 Hz, 1H), 4.08 (d, J=3.4 Hz, 1H), 3.93 (tdd, J=12.1, 6.2, 2.7 Hz, 4H), 3.86-3.65 (m, 18H), 3.58-3.43 (m, 5H), 2.01-2.03 (m, 3H), 1.15 (dd, J=6.7, 4.6 Hz, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.83, 102.82, 102.66, 101.69, 98.52, 98.41, 95.80, 92.08, 81.53, 81.46, 78.14, 76.96, 75.50, 75.33, 75.31, 74.65, 74.48, 74.46, 72.64, 72.46, 72.34, 72.24, 72.09, 71.89, 71.88, 70.93, 70.91, 70.63, 70.58, 69.24, 69.18, 68.51, 68.25, 68.22, 67.99, 67.96, 66.46, 66.42, 61.46, 61.43, 60.99, 59.82, 59.75, 59.68, 55.13, 22.12, 15.19, 15.17. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C32H55NNaO25 876.2961; found 876.2956.
Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)Glc (LNnDFH-II, 6a). White powder, 17.3 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.41 (d, J=4.0 Hz, 0.4H), 5.36 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=3.8 Hz, 0.4H), 5.12 (d, J=4.1 Hz, 1H), 4.82 (d, J=6.7 Hz, 1H), 4.80 (d, 1H), 4.69 (dd, J=8.5, 2.8 Hz, 1H), 4.64 (d, J=8.0 Hz, 0.4H), 4.45 (d, J=7.8 Hz, 1H), 4.40 (d, J=7.7 Hz, 1H), 4.08 (d, J=3.4 Hz, 1H), 3.97-3.83 (m, 11H), 3.81-3.41 (m, 20H), 1.99-2.02 (m, 3H), 1.18-1.13 (m, 6H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.62, 102.47, 101.71, 101.69, 98.55, 98.51, 98.40, 95.80, 92.09, 81.52, 81.45, 76.95, 75.50, 75.33, 75.04, 74.88, 74.72, 74.64, 74.45, 72.99, 72.88, 72.64, 72.43, 72.31, 72.21, 71.89, 71.87, 71.01, 70.91, 70.66, 70.61, 69.23, 69.18, 69.15, 68.31, 68.24, 68.21, 67.99, 67.96, 67.66, 66.65, 66.45, 66.41, 61.47, 61.43, 59.76, 59.69, 59.59, 55.91, 22.20, 15.27, 15.18, 15.17. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C38H65NNaO29 1022.3540; found 1022.3510.
Galβ3GlcNAcβ3Galβ4Glc (LNT, 7a). White powder, 15.6 mg, 95%. 1H NMR (800 MHz, D2O, 30° C.) δ 5.28 (d, J=3.8 Hz, 0.3H), 4.80 (d, J=4.9 Hz, 1H), 4.72 (d, J=7.9 Hz, 0.5H), 4.50 (dd, J=7.8, 3.0 Hz, 2H), 4.21 (d, J=3.3 Hz, 1H), 4.03-3.75 (m, 15H), 3.72-3.32 (m, 8H), 2.09 (s, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.99, 103.52, 102.97, 102.93, 102.56, 95.79, 91.86, 82.18, 82.03, 82.00, 81.82, 78.70, 78.52, 78.42, 75.33, 75.25, 74.94, 74.85, 74.41, 73.85, 72.54, 71.46, 71.20, 70.74, 70.18, 70.09, 70.06, 68.59, 68.53, 68.39, 68.37, 61.07, 61.01, 60.99, 60.59, 60.18, 60.05, 54.76, 22.30. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C26H45NNaO21 730.2382; found 730.2361.
Fucα2Galβ3GlcNAcβ3Galβ4Glc (LNFP-I, 8a). White powder, 16.6 mg, 96% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.22 (d, J=3.6 Hz, 0.4H), 5.19 (d, J=4.1 Hz, 1H), 4.68-4.61 (m, 2.6H), 4.42 (d, J=7.9 Hz, 1H), 4.29 (q, J=6.6 Hz, 1H), 4.14 (d, J=3.3 Hz, 1H), 4.02-3.45 (m, 27H), 3.28 (t, J=8.5 Hz, 0.5H), 2.06 (s, 3H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.22, 103.21, 102.93, 102.89, 100.22, 99.49, 95.87, 95.71, 92.05, 91.78, 81.54, 78.21, 78.12, 77.13, 76.64, 75.91, 75.71, 75.22, 75.04, 74.79, 74.30, 74.09, 73.76, 73.46, 72.72, 71.83, 71.34, 71.10, 70.18, 70.12, 69.60, 69.56, 69.40, 69.10, 68.56, 68.45, 68.02, 66.47, 61.12, 60.93, 60.70, 60.54, 60.37, 60.05, 59.92, 54.95, 22.11, 15.24.
Galβ33GlcNAcβ3Galβ4(Fucα3)Glc (LNFP-V, 9a). White powder, 16.8 mg, 95% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.41 (d, J=4.0 Hz, 0.5H), 5.36 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=3.8 Hz, 0.4H), 4.81 (d, J=1.3 Hz, 1H), 4.71 (dd, J=8.5, 3.2 Hz, 1H), 4.64 (d, J=8.0 Hz, 0.5H), 4.43 (d, J=7.7 Hz, 1H), 4.40 (d, J=7.9 Hz, 1H), 4.08 (d, J=3.4 Hz, 1H), 3.97-3.67 (m, 24H), 3.63-3.37 (m, 9H), 2.00-2.02 (m, 3H), 1.15 (dd, J=6.7, 4.6 Hz, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.91, 103.42, 102.48, 101.68, 98.52, 98.41, 95.85, 95.80, 92.08, 81.93, 81.47, 81.39, 76.96, 75.70, 75.50, 75.33, 75.24, 75.11, 74.65, 74.47, 72.64, 72.41, 72.34, 72.24, 71.89, 71.87, 71.82, 70.91, 70.69, 70.63, 69.54, 69.23, 69.18, 68.49, 68.39, 68.24, 68.20, 67.99, 67.96, 66.45, 66.41, 61.45, 61.43, 61.00, 60.69, 60.44, 59.76, 59.69, 54.66, 22.17, 15.18. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C32H55NNaO25 876.2961; found 876.2968.
Galβ33(Fucα4)GlcNAcβ3Galβ4(Fucα3)Glc (LNDFH-II, 10a). White powder, 17.5 mg, 97% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.42 (d, J=4.0 Hz, 0.4H), 5.36 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=3.8 Hz, 0.4H), 5.02 (d, J=4.0 Hz, 1H), 4.87 (q, J=6.8 Hz, 1H), 4.83-4.80 (m, 1H), 4.68 (d, J=8.4 Hz, 1H), 4.64 (d, J=8.0 Hz, 0.5H), 4.51 (d, J=7.7 Hz, 1H), 4.41 (d, J=7.8 Hz, 1H), 4.10-4.03 (m, 2H), 3.99-3.65 (m, 22H), 3.65-3.42 (m, 8H), 2.05-2.00 (m, 3H), 1.16 (dd, J=10.9, 6.8 Hz, 6H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.71, 102.81, 102.55, 101.71, 98.52, 98.41, 97.97, 95.81, 92.09, 81.56, 81.49, 76.96, 75.89, 75.51, 75.34, 75.15, 74.77, 74.65, 74.47, 72.65, 72.35, 72.27, 72.08, 71.91, 70.92, 70.64, 70.59, 70.46, 69.24, 69.18, 69.10, 68.32, 68.20, 68.00, 67.97, 67.75, 66.81, 66.46, 66.42, 61.62, 61.44, 59.76, 59.56, 55.83, 22.24, 15.33, 15.17. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C38H65NNaO29 1022.3540; found 1022.3551.
Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (pLNnH, 11a). White powder, 17.4 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.21 (d, J=3.8 Hz, 0.3H), 4.69 (d, J=8.4 Hz, 2H), 4.65 (d, J=8.0 Hz, 0.5H), 4.46 (dd, J=10.2, 7.9 Hz, 2H), 4.42 (d, J=7.9 Hz, 1H), 4.14 (t, J=3.8 Hz, 2H), 3.95-3.90 (m, 4H), 3.88-3.69 (m, 21H), 3.67-3.51 (m, 9H), 2.02 (s, 6H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.86, 102.89, 102.85, 102.82, 102.72, 102.70, 95.70, 91.77, 82.03, 81.99, 81.97, 78.32, 78.21, 78.12, 78.10, 75.31, 74.83, 74.76, 74.51, 74.31, 73.74, 72.46, 72.14, 72.13, 71.36, 71.09, 70.92, 70.08, 69.95, 69.92, 68.51, 68.30, 68.27, 60.99, 60.93, 60.92, 60.03, 59.81, 55.15, 55.11, 22.14. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C40H68N2NaO31 1095.3704; found 1095.3761.
GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (GlcNAc-pLNnH, 12a). White powder, 17.7 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.21 (d, J=3.8 Hz, 0.3H), 4.70-4.66 (m, 3H), 4.65 (d, J=7.9 Hz, 0.6H), 4.45 (d, J=7.9 Hz, 2H), 4.42 (dd, J=7.9, 1.4 Hz, 1H), 4.14 (t, J=2.7 Hz, 3H), 3.95-3.68 (m, 29H), 3.64-3.54 (m, 8H), 3.47-3.41 (m, 2H), 3.29-3.24 (m, 0.5H), 2.00-2.05 (m, 9H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.91, 174.86, 102.89, 102.85, 102.83, 102.72, 102.70, 95.70, 91.77, 82.02, 81.99, 81.97, 81.95, 78.32, 78.21, 78.11, 75.61, 74.85, 74.83, 74.76, 74.51, 74.31, 73.75, 73.52, 72.13, 71.36, 71.09, 70.08, 69.96, 69.92, 69.63, 68.30, 68.28, 60.92, 60.43, 60.03, 59.90, 59.81, 55.62, 55.11, 22.14. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C48H81N3NaO36 1298.4497; found 1298.4452.
Fucα2Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (F-pLNnH-I, 13a). White powder, 17.6 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.29 (d, J=3.4 Hz, 1H), 5.20 (d, J=3.7 Hz, 0.4H), 4.70-4.67 (m, 2H), 4.65 (d, J=8.0 Hz, 0.6H), 4.53 (d, J=7.7 Hz, 1H), 4.45 (d, J=7.8 Hz, 1H), 4.42 (dd, J=7.9, 1.4 Hz, 1H), 4.20 (q, J=6.7 Hz, 1H), 4.13 (t, J=3.5 Hz, 2H), 3.94 (ddd, J=12.5, 8.6, 2.2 Hz, 3H), 3.88-3.43 (m, 35H), 3.26 (dd, J=9.2, 7.9 Hz, 0.6H), 2.05-2.00 (m, 6H), 1.21 (d, J=6.6 Hz, 3H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.88, 174.87, 102.89, 102.85, 102.84, 102.75, 102.70, 100.21, 99.37, 95.70, 91.77, 81.99, 81.97, 78.32, 78.21, 78.12, 76.40, 75.82, 75.22, 75.05, 74.83, 74.81, 74.76, 74.51, 74.31, 73.75, 73.49, 72.13, 72.03, 71.63, 71.36, 71.09, 70.08, 69.95, 69.93, 69.57, 69.08, 68.32, 68.30, 68.27, 68.16, 66.91, 61.09, 60.93, 60.92, 60.89, 60.03, 59.96, 59.90, 59.81, 55.34, 55.12, 22.16, 22.13, 15.28. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C46H78N2NaO35 1241.4283; found 1241.4262.
Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)Glc (TF-pLNnH, 14a). White powder, 17.9 mg, 95% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.41 (d, J=4.0 Hz, 0.4H), 5.36 (d, J=4.0 Hz, 0.3H), 5.17 (d, J=3.8 Hz, 0.3H), 5.12 (d, J=4.0 Hz, 1H), 5.11 (d, J=4.2 Hz, 1H), 4.82 (d, J=6.7 Hz, 1H), 4.81 (d, J=6.8 Hz, 2H), 4.69 (d, J=8.4 Hz, 2H), 4.64 (d, J=8.0 Hz, 0.4H), 4.45 (d, J=7.8 Hz, 1H), 4.43 (d, J=7.8 Hz, 1H), 4.40 (d, J=7.8 Hz, 1H), 4.08 (t, J=4.1 Hz, 2H), 3.98-3.81 (m, 18H), 3.79-3.44 (m, 28H), 2.02-1.98 (m, 6H), 1.17-1.11 (m, 9H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.64, 174.61, 102.46, 101.70, 98.65, 98.55, 81.58, 76.95, 75.51, 75.33, 75.03, 74.88, 74.72, 74.45, 74.41, 72.98, 72.88, 72.72, 72.43, 71.87, 71.82, 71.62, 71.01, 70.49, 69.23, 69.15, 69.13, 68.31, 68.21, 67.99, 67.66, 67.60, 66.66, 66.45, 61.47, 61.43, 59.59, 55.92, 22.20, 15.47, 15.28, 15.24, 15.17. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C58H98N2NaO43 1533.5441; found 1533.5476.
Neu5Acα3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (Neu5Acα2-3pLNnH, 15a). White powder, 17.1 mg, 94% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.28 (d, J=3.8 Hz, 0.4H), 4.76 (d, J=4.2 Hz, 2H), 4.72 (d, J=8.0 Hz, 0.7H), 4.61 (d, J=7.8 Hz, 1H), 4.53 (d, J=7.9 Hz, 1H), 4.50 (d, J=7.9 Hz, 1H), 4.21 (t, J=4.1 Hz, 2H), 4.17 (dd, J=9.9, 3.2 Hz, 1H), 4.04-3.99 (m, 4H), 3.96-3.75 (m, 25H), 3.73-3.61 (m, 11H), 3.34 (t, J=8.5 Hz, 0.6H), 2.82 (dd, J=12.5, 4.7 Hz, 1H), 2.12-2.07 (m, 9H), 1.86 (t, J=12.1 Hz, 1H). 13C NMR (200 MHz, D2O, 30° C.) δ 175.08, 174.92, 173.86, 102.98, 102.94, 102.81, 102.74, 102.63, 99.88, 95.79, 91.86, 88.76, 82.12, 82.08, 82.05, 78.51, 78.41, 78.34, 78.15, 75.57, 75.23, 74.95, 74.93, 74.85, 74.62, 74.41, 73.86, 72.95, 72.23, 72.21, 71.81, 71.45, 71.20, 70.18, 70.03, 69.43, 68.41, 68.38, 68.35, 68.18, 67.55, 62.67, 61.08, 61.01, 61.00, 60.17, 60.04, 59.94, 59.63, 55.25, 55.22, 51.76, 39.72, 22.24, 22.10. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H84N3O39 1362.4687; found 1362.4706.
Neu5Acα6Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (Neu5Acα2-6pLNnH, 16a). White powder, 17.1 mg, 94% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.33 (d, J=3.8 Hz, 0.4H), 4.85 (d, J=7.6 Hz, 1H), 4.83 (dd, J=8.4, 3.0 Hz, 1H), 4.77 (s, 0.5H), 4.61-4.54 (m, 3H), 4.26 (t, J=3.6 Hz, 2H), 4.12-4.03 (m, 5H), 4.01-3.81 (m, 22H), 3.79-3.64 (m, 14H), 3.39 (dd, J=9.0, 7.9 Hz, 0.6H), 2.79 (dd, J=12.4, 4.7 Hz, 1H), 2.18-2.13 (m, 9H), 1.82 (t, J=12.2 Hz, 1H). 13C NMR (200 MHz, D2O, 30° C.) δ 175.05, 174.99, 174.98, 173.57, 103.53, 103.05, 103.01, 102.78, 102.63, 100.30, 95.86, 91.93, 82.14, 82.12, 80.56, 78.64, 78.54, 78.48, 74.99, 74.91, 74.69, 74.48, 74.43, 73.94, 73.82, 72.67, 72.59, 72.35, 72.31, 71.84, 71.52, 71.28, 70.88, 70.24, 70.13, 70.10, 68.57, 68.52, 68.44, 68.41, 68.32, 63.47, 62.83, 61.06, 60.34, 60.26, 60.05, 55.28, 55.13, 54.51, 52.03, 40.22, 22.43. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H84N3O39 1362.4687; found 1362.4661.
Galβ3GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (pLNH, 17a). White powder, 17.4 mg, 96% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.22 (d, J=3.8 Hz, 0.3H), 4.74-4.68 (m, 2H), 4.47 (d, J=7.9, 2.2 Hz, 0.5H), 4.44 (dd, J=7.8, 2.4 Hz, 2H), 4.15 (t, J=2.6 Hz, 2H), 3.95 (d, J=12.3 Hz, 2H), 3.92-3.87 (m, 3H), 3.85-3.69 (m, 20H), 3.65-3.46 (m, 9H), 3.28 (t, J=8.5 Hz, 0.5H), 2.01-2.04 (m, 6H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.94, 174.88, 103.46, 102.90, 102.86, 102.71, 102.55, 95.71, 91.79, 82.04, 82.00, 78.35, 78.24, 78.16, 75.26, 75.16, 74.85, 74.77, 74.52, 74.32, 73.76, 72.44, 72.15, 71.37, 71.10, 70.65, 70.09, 69.97, 68.50, 68.42, 61.00, 60.93, 60.46, 60.05, 59.83, 55.13, 54.67, 54.33, 22.20, 22.15. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C40H68N2NaO31 1095.3704; found 1095.3770.
Fucα2Galβ3GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (F-pLNH-I, 18a). White powder, 17.1 mg, 95% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.22 (d, J=3.6 Hz, 0.4H), 5.19 (d, J=4.1 Hz, 1H), 4.70 (dd, J=8.3, 2.2 Hz, 1H), 4.64 (ddd, J=15.9, 10.7, 8.2 Hz, 2.6H), 4.44 (t, J=8.2 Hz, 2H), 4.29 (q, J=6.5 Hz, 1H), 4.14 (dd, J=6.3, 3.3 Hz, 2H), 4.01-3.87 (m, 6H), 3.85-3.70 (m, 21H), 3.69-3.63 (m, 3H), 3.61-3.38 (m, 8H), 3.28 (dd, J=9.1, 8.0 Hz, 0.6H), 2.09-2.00 (m, 6H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D20, 30° C.) δ 174.88, 174.21, 103.23, 102.89, 102.70, 100.22, 99.49, 95.87, 95.71, 92.05, 91.78, 82.00, 81.60, 78.34, 78.24, 78.05, 77.13, 76.63, 75.91, 75.71, 75.22, 75.05, 74.84, 74.78, 74.54, 74.32, 74.09, 73.76, 73.46, 72.72, 72.12, 71.83, 71.44, 71.37, 71.10, 70.16, 70.09, 69.94, 69.56, 69.41, 69.10, 68.54, 68.44, 68.32, 68.02, 66.47, 61.12, 60.92, 60.70, 60.36, 60.05, 59.83, 55.13, 54.95, 22.15, 22.12, 15.23. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C46H78N2NaO35 1241.4283; found 1241.4254.
Galβ33(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)Glc (TF-pLNH-III, 19a). White powder, 17.8 mg, 97% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.43 (d, J=4.0 Hz, 0.5H), 5.37 (d, J=4.0 Hz, 0.4H), 5.18 (d, J=3.8 Hz, 0.4H), 5.12 (d, J=4.0 Hz, 1H), 5.03 (d, J=4.0 Hz, 1H), 4.88 (q, J=6.7 Hz, 1H), 4.81 (d, J=6.6 Hz, 2H), 4.70 (t, J=8.0 Hz, 2H), 4.65 (d, J=8.0 Hz, 0.5H), 4.51 (d, J=7.7 Hz, 1H), 4.45 (d, J=7.9 Hz, 1H), 4.41 (d, J=7.8 Hz, 1H), 4.10 (t, J=4.0 Hz, 2H), 4.06 (d, J=9.7 Hz, 1H), 3.98-3.83 (m, 16H), 3.82-3.67 (m, 18H), 3.64-3.44 (m, 11H), 2.03 (s, 3H), 2.02 (s, 3H), 1.21-1.13 (m, 9H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.69, 174.64, 102.82, 102.55, 102.45, 101.70, 98.66, 97.97, 95.81, 92.10, 81.64, 81.52, 76.97, 75.90, 75.52, 75.34, 75.16, 75.04, 74.79, 74.73, 74.43, 72.77, 72.66, 72.34, 72.29, 72.08, 71.92, 71.84, 70.93, 70.63, 70.47, 69.55, 69.20, 69.15, 69.11, 68.33, 68.21, 67.97, 67.76, 67.62, 66.82, 66.66, 66.46, 61.63, 61.44, 59.58, 55.93, 55.84, 22.25, 22.21, 15.34, 15.25, 15.18. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C58H98N2NaO43 1533.5441; found 1553.5425.
Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (pLNnO, 20a). White powder, 17.6 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.21 (d, J=3.8 Hz, 0.3H), 4.68 (dd, J=8.3, 2.3 Hz, 3H), 4.65 (d, J=8.0 Hz, 0.7H), 4.46 (dd, J=10.8, 7.8 Hz, 3H), 4.42 (d, J=7.9 Hz, 1H), 4.14 (q, J=3.2, 2.4 Hz, 3H), 3.93 (td, J=13.1, 12.7, 2.8 Hz, 5H), 3.85-3.69 (m, 29H), 3.68-3.61 (m, 3H), 3.59-3.51 (m, 8H), 3.29-3.24 (m, 0.5H), 2.02 (s, 9H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.86, 102.89, 102.85, 102.82, 102.72, 102.70, 95.70, 91.77, 82.03, 81.99, 81.97, 78.32, 78.21, 78.11, 75.31, 74.83, 74.76, 74.51, 74.31, 73.75, 72.46, 72.14, 72.13, 71.36, 71.09, 70.93, 70.08, 69.95, 69.92, 68.51, 68.30, 68.28, 60.99, 60.93, 60.92, 60.03, 59.90, 59.81, 55.15, 55.11, 22.14. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C54H91N3NaO41 1460.5026; found 1460.5006.
Galβ3GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (pLNO, 21a). White powder, 17.8 mg, 96% yield. 1H NMR (800 MHz, D2O, 30° C.) δ 5.28 (d, J=3.8 Hz, 0.3H), 4.80 (s, 1H), 4.76 (d, J=6.4 Hz, 2H), 4.72 (d, J=8.0 Hz, 0.7H), 4.53 (dd, J=9.7, 7.8 Hz, 3H), 4.50 (dd, J=7.8, 1.7 Hz, 1H), 4.21 (d, J=3.1 Hz, 3H), 4.02-3.76 (m, 34H), 3.73-3.53 (m, 11H), 3.34 (t, J=8.5 Hz, 0.6H), 2.07-2.11 (m, 9H). 13C NMR (200 MHz, D2O, 30° C.) δ 174.93, 103.52, 102.98, 102.95, 102.93, 102.75, 102.74, 102.58, 95.79, 91.86, 85.43, 82.18, 82.11, 82.07, 78.50, 78.40, 78.32, 75.40, 75.33, 75.25, 74.92, 74.85, 74.61, 74.41, 73.86, 72.58, 72.54, 72.23, 71.45, 71.20, 71.02, 70.74, 70.18, 70.05, 70.02, 68.61, 68.52, 68.38, 68.36, 61.07, 60.99, 60.17, 59.96, 55.26, 55.22, 54.75, 22.29, 22.24. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C54H91N3NaO41 1460.5026; found 1460.4964.
Fucα2Galβ3GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc (Fucα1-2pLNO, 22a). White powder, 17.5 mg, 95% yield. 1H NMR (600 MHz, D2O, 30° C.) δ 5.21 (d, J=3.8 Hz, 0.3H), 5.18 (d, J=4.1 Hz, 1H), 4.69 (d, J=8.3 Hz, 2H), 4.65 (dd, J=9.9, 7.8 Hz, 1.5H), 4.61 (d, J=8.4 Hz, 1H), 4.47-4.42 (m, 3H), 4.28 (q, J=6.7 Hz, 1H), 4.14 (dd, J=7.4, 3.2 Hz, 3H), 4.01-3.86 (m, 7H), 3.85-3.69 (m, 32H), 3.68-3.62 (m, 4H), 3.61-3.46 (m, 10H), 3.27 (t, J=8.5 Hz, 0.5H), 2.05 (s, 3H), 2.02 (s, 6H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O, 30° C.) δ 174.87, 174.20, 103.22, 102.88, 102.86, 102.71, 100.22, 99.48, 98.39, 82.03, 81.58, 78.02, 77.12, 76.63, 75.21, 75.04, 74.83, 74.78, 74.76, 74.52, 74.32, 73.75, 73.45, 72.14, 71.37, 71.10, 69.93, 69.40, 69.12, 69.09, 68.53, 68.43, 68.31, 68.01, 66.85, 66.46, 62.44, 61.12, 60.92, 60.35, 59.81, 55.12, 54.94, 22.14, 22.11, 15.23. HRMS (ESI-Orbitrap) m/z: [M+Na]+ calculated for C60H101N3NaO45 1606.5605; found 1606.5542.
The efficiency of the MSOPME strategy presented herein demonstrated high-yield (71-91%) preparative-scale (83-155 mg) synthesis of more than twenty tagged HMOs (2-21) including fucosylated or sialylated long-chain oligosaccharides up to nonaoses (
HMOs are extended from lactose, which is a low-cost starting material that can be used directly for enzymatic synthesis of HMOs by reactions catalyzed by glycosyltransferases or mutant glycosidases. Nevertheless, lactose is a disaccharide with a free reducing terminus which forms a mixture of α and β-anomers as well as a small percentage of open-chain structure on its glucose unit in aqueous solutions. This property leads to complications in product characterization and purification. To simplify the reaction monitoring and product purification processes, carboxybenzyl (Cbz) protected β-lactosylamine (LacβNHCbz, 1) was designed herein. The Cbz tag is an ultraviolet (UV)-active hydrophobic tag that can be easily installed using commercially available materials. The Cbz tag can also be easily removed by simple catalytic hydrogenation and hydrolysis without the need for a chromatography purification process. As shown in
HMOs are complex glycans that are extended from lactose with N-acetyl-D-glucosamine (GlcNAc), D-galactose (Gal), L-fucose (Fuc), and/or N-acetyl-D-neuraminic acid (Neu5Ac) units. One-pot multienzyme (OPME) systems, each containing a suitable glycosyltransferase and a monosaccharide sugar activation (SA) enzymes and reagents, are designed to form the target HMOs with in situ generation of sugar nucleotide donor of the glycosyltransferase from inexpensive monosaccharide and nucleoside triphosphates such as adenosine 5′-triphosphate (ATP), uridine 5′-triphosphate (UTP), and/or cytidine 5′-triphosphate (CTP). As shown in
Three enzymes including Bifidobacterium longum strain ATCC55813 N-acetylhexosamine-1-kinase (BLNahK), Pasteurella multocida N-acetylglucosamine uridylyltransferase (PmGlmU), and Pasteurella multocida inorganic pyrophosphatase (PmPpA) were chosen for GlcNAc activation SA1. BLNahK was found to be highly active in catalyzing the direct phosphorylation of GlcNAc (kcat/Km=18.3 s−1 mM−1) using ATP for the formation of GlcNAc-1-phosphate. It had a high expression level in E. coli (185 mg/L culture). PmGlmU, with an expression level of 170 mg/L culture, catalyzed highly efficient synthesis of UDP-GlcNAc from GlcNAc-1-P and UTP. PmPpA, with an expression level of 488 mg/L culture, was used to break down the pyrophosphate formed in the PmGlmU reaction to inorganic phosphate to shift the coupled enzymatic reactions towards the formation of UDP-GlcNAc. Two β1-3-N-acetylglucosaminyltransferases (β3GlcNAcTs) from Neisseria meningitidis (NmLgtA) and Helicobacter pylori (Hpβ3GlcNAcT), respectively, were selected. NmLgtA (10 mg/L) with a better expression level than Hpβ3GlcNAcT (1 mg/L), was used to add β1-3-linked GlcNAc to short acceptor substrates such as LacβNHCbz (1) while Hpβ3GlcNAcT was used to add β1-3-linked GlcNAc to longer acceptor substrates which were less efficient for NmLgtA-catalyzed reactions.
Three enzymes including Streptococcus pneumoniae TIGR4 galactokinase (SpGalK), Bifidobacterium longum UDP-sugar pyrophosphorylase (BLUSP), and PmPpA were chosen for Gal activation SA2. SpGalK has an expression level of 100 mg/L culture and BLUSP has an expression level of 170 mg/L culture. Two bacterial galactosyltransferases Chromobacterium violaceum β1-3-galactosyltransferase (Cvβ3GalT) and β1-4-galactosyltransferase (NmLgtB) were chosen for the addition of a Gal with β1-3- and β1-4-linkages, respectively, to the GlcNAc-terminated glycans. Cvβ3GalT has an expression level of 40 mg/L culture and NmLgtB has an expression level of 12.5 mg/L culture. Cvβ3GalT is a highly efficient β1-3-galactosyltransferase which was used previously to synthesize LNT successfully in multi-gram scale. NmLgtB is an efficient 1-4-galactosyltransferase for synthesizing LacNAc-containing structures.
Two enzymes including a bifunctional L-fucokinase/GDP-fucose pyrophosphorylase Bacteroides fragilis NCTC9343 (BfFKP) and PmPpA were chosen for L-fucose activation SA3. BfFKP has an expression level of 20 mg/L culture. The bifunctional BfFKP contains a C-terminal fucokinase domain and an N-terminal GDP-fucose pyrophosphorylase domain. Three bacterial fucosyltransferases, including Helicobacter mustelae α1-2-fucosyltransferase (Hm2FT), Thermosynechococcus elongatus α1-2-fucosyltransferase (Te2FT), and Helicobacter pylori α11-3/4 fucosyltransferase (Hp3/4FT), were chosen for the addition of an α1-2-, or one or more α11-3/4-linked L-fucose residues. Hm2FT, with an expression level of 10 mg/L culture, catalyzed the transfer of L-fucose to both β1-3 and β1-4-linked galactosides while Te2FT, with an expression level of 15 mg/L culture, showed high selectivity towards β1-3-linked galactosides. Te2FT was chosen for the synthesis of LNFP-I. Hp3/4FT, with an expression level of 30 mg/L culture, catalyzed the transfer of L-fucose to form α1-3 and α1-4-fucosylated products. Hp3/4FT was shown herein to be a powerful enzyme for α1-3/4-fucosylation of the GlcNAc and Glc residues in HMOs.
Sialic acid was shown herein to be activated by Neisseria meningitidis CMP-sialic acid synthetase (NmCSS) (SA4) and transferred by Pasteurella multocida α2-3-sialyltransferase 3 (PmST3) or Photobacterium damselae α2-6-sialyltransferase (Pd2,6ST_A200Y/S232Y) (OPME4). NmCSS showedan expression level of 100 mg/L culture. PmST3, with an expression level of 29 mg/L culture, showed a preference for β1-4-over β1-3-linked galactoside and was used for sialylating β1-4-linked terminal galactose. Pd2,6ST_A200Y/S232Y, with an expression level of 32 mg/L culture, showed high selectivity in sialylating the terminal galactose in HMOs with an α2-6-linkage.
To assemble desired oligosaccharides, a previous common practice was to perform a product purification process after every glycosyltransferase-catalyzed reaction. Numerous methods have been used to facilitate the product purification of these enzyme-catalyzed glycosylation reactions, including solid phase immobilization, tagging with a thermosensitive polymer, a hydrophobic or an anionic tail, etc. Precipitation or solid phase extraction was used to purify every oligosaccharide intermediate before it was used as the acceptor substrate for the next glycosylation reaction.
As provided herein, the isolation and purification of oligosaccharide intermediates was not necessary and can be bypassed by using the methods provided herein. This bypassing of isolating and purifying intermediates allowed one-pot access to the desired compound from LacβNHCbz (1) with a single purification process for the final product despite its length and complexity. Such a process can be readily adaptable for automation and is a step forward towards industrial biocatalytic production of HMOs using purified or partially purified enzymes.
To avoid uncontrolled polymerization when both OPME2a containing a 1-4-galactosyltransferase, and OPME1a or OPME1b containing a β1-3-GlcNAc transferase, are present, the reaction mixture was incubated in a boiling water bath for 5 minutes to deactivate the enzymes after each OPME glycosylation is completed as indicated by the disappearance of the acceptor substrate by mass spectrometry analyses and/or thin-layer chromatography (TLC) assays. The reaction mixture was then cooled down and used as the acceptor substrate for the glycosyltransferase in the next OPME reaction without any workup or purification processes. This process is referred to herein as the multistep sequential one-pot multienzyme (MSOPME) strategy.
As shown in Table 1, LNT-IIβNHCbz (2) was synthesized from 100 mg LacβNHCbz (1) using OPME1a and purified with a 91% yield. Two-step sequential OPME reactions followed by a single C18-cartridge purification produced 152 mg LNnTβNHCbz (3) (OPME1a and OPME2a) or 155 mg LNTβNHCbz (7) (OPME1a and OPME2b) from 100 mg of LacβNHCbz (1) in 86-88% yield. Three-step sequential OPME reactions with a single C18-cartridge purification produced 83-91 mg of each of the three pure pentasaccharides GlcNAc-LNnTβ3NHCbz (4) (OPME1a, OPME2a, and OPME1b), LNnFP-Vβ3NHCbz (5) ((OPME1a, OPME3c, and OPME2a), and LNFP-VβNHCbz (9) (OPME1a, OPME3c, and OPME2b) as well as 94 mg pure hexasaccharide LNnDFH-IIβNHCbz (6) (OPME1a, OPME2a, and OPME3c) in 80% yield from 50 mg LacβNHCbz (1). Four-step sequential OPME reactions with a single C18-cartridge purification produced 95-101 mg of each of the three pure hexasaccharides LNDFH-IIβNHCbz (10) (OPME1a, OPME3c, OPME2b, and OPME3c), pLNnHβNHCbz (11) (OPME1a, OPME2a, OPME1b, and OPME2a), and pLNHβNHCbz (17) (OPME1a, OPME2a, OPME1b, and OPME2b) in 79-80% yield from 50 mg LacβNHCbz (1).
Similar processes also worked well for more complex longer chain glycans. Five-step sequential OPME reactions with a single C18-cartridge purification produced 105-123 mg of each of the pure heptasaccharides including GlcNAc-pLNnHβNHCbz (12) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME1b), F-pLNnH-IβNHCbz (13) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME3a), Neu5Acα2-3pLNnHβNHCbz (15) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME4a), Neu5Acα2-6pLNnHβNHCbz (16) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME4b), and F-pLNH-IβNHCbz (18) (OPME1a, OPME2a, OPME1b, OPME2b, and OPME3a) in 74-78% yield, as well as 124-131 mg of each of the two pure nonasaccharides TF-pLNnHβNHCbz (14) (OPME1a, OPME2a, OPME1b, OPME2a, and OPME3c) and TF-pLNH-IIIβNHCbz (19) (OPME1a, OPME2a, OPME1b, OPME2b, and OPME3c) in 72-76% yield from 50 mg of LacβNHCbz (1). Six-step sequential OPME reactions with a single C18-cartridge purification produced 117-119 mg of each of the two pure octasaccharides pLNnOβNHCbz (20) (OPME1a, OPME2a, OPME1b, OPME2a, OPME1b, and OPME2a) and pLNOβNHCbz (21) (OPME1a, OPME2a, OPME1b, OPME2a, OPME1b, and OPME2b) in 71-72% yield from 50 mg of LacβNHCbz (1).
Purified octasaccharide pLNOβNHCbz (21) was used as the acceptor substrate for Hm2FT in OPME3a for the synthesis of Fucα1-2pLNOβNHCbz (22) which was obtained as a pure compound in 92% yield after a C18-cartridge purification.
Mono-fucosylated pentasaccharide LNnFP-VβNHCbz (5) was formed from LacβNHCbz (1) by adding a β1-3-linked GlcNAc using OPME1a, followed by α1-3-fucosylation with OPME3c and 1-4-galactosylation with OPME2a. Altering the order of the latter two OPME steps led to the formation of difucosylated hexasaccharide LNnDFH-IIβNHCbz (6) (OPME1a, OPME2a, and OPME3c) when 2.86 equivalents of L-fucose and 3.05 equivalents of ATP and UTP were used in OPME3c. See, Yu, X. Chen, et al. Chem. Commun. 2017, 53, 11012-11015, which is incorporated herein by reference in its entirety. Mono-fucosylated pentasaccharide LNFP-VβNHCbz (9) was formed similarly by extending LacβNHCbz (1) with a β1-3-linked GlcNAc using OPME1a, followed by α1-3-fucosylation with OPME3c and 1-3-galactosylation with OPME2b. On the other hand, to ensure the installation of two L-fucose residues to form the difucosylated hexasaccharide LNDFH-IIβNHCbz (10), a four-step OPME process was used by adding an additional α1-3/4-fucosylation OPME3c to the three-step OPME preparation of LNFP-VβNHCbz (9). Installation of all three L-fucose residues in trifucosylated nonasaccharides TF-pLNnHβNHCbz (14) and TF-pLNH-IIIβNHCbz (19) was achieved in one-step OPME3c al-3/4-fucosylation process from pLNnHβNHCbz (11) and pLNHβNHCbz (17) intermediates, respectively, formed from four-step OPME glycosylation of LacβNHCbz (1).
As provided herein, if the glycosyltransferases used for the MSOPME synthesis of the target glycans are selective for the acceptor substrates without the complication of polymerization, the MSOPME can be carried out without the heat inactivation of the enzymes in the intermediate OPME steps. Such an example was demonstrated for gram-scale synthesis of LNFP-I (8). The LNFP-I (8) was successfully synthesized in 1.74 grams in 84% yield after three OPME steps.
The Cbz tag in the obtained glycosides (2-47) was readily removed by catalytic hydrogenation using palladium and charcoal in methanol and H2O. The glycosylamine formed was partially converted to the target HMO with a free reducing end during the hydrogenation process. After the palladium and charcoal were removed by filtering the mixture using a 0.45 μm syringe filter, the solvent was removed by rotavap and the complete conversion to the desired HMO with a free reducing end was achieved by dissolving the residue in water and incubating the mixture at 37° C. As shown in
Thus, described herein is an efficient process engineering strategy that can be combined with the substrate engineering strategy and OPME systems for multistep one-pot multienzyme (MSOPME) synthesis of target tagged glycans from a simple lactoside without the purification of intermediate oligosaccharides. A single C18-cartridge-based purification of the final tagged product followed by catalytic hydrogenation and spontaneous hydrolysis led to the formation of the desired HMO with a naturally occurring free reducing end. The strategy has been demonstrated for facile synthesis of twenty-one HMOs including those with or without sialic acid or L-fucose and a size up to nonasaccharide with high yields using only a single C18-cartridge purification process for any given target. Gram-scale synthesis has also been demonstrated for an important HMO, lacto-N-fucopentaose-I (LNFP-I). The method can be readily adapted for automation to allow quick access of HMOs as well as other glycans and glycoconjugates.
About 60-77% of HMOs are fucosylated and many of these fucosylated HMOs have more than one fucosyl residues on the structure. While the Fucα1-3Glc linkage on the Lac core is common for some short length HMOs, it is usually missing from the longer or branched-chain HMOs. Both Helicobacter pylori α1-3-fucosyltransferase (Hp3FT) and Helicobacter pylori α1-3/4-fucosyltransferase (Hp3/4FT) showed preference towards LacNAc or Galβ1-3GlcNAc type acceptors and selective fucosylation of GlcNAc over Glc by these fucosyltransferases can be achieved for some short chain simple acceptors (e.g., tetrasaccharides LNT or LNnT) by controlling the donor and acceptor ratios, the amount of the fucosyltransferase used, and the time of the reaction. However, the selective fucosylation cannot be achieved for some longer chain or more complex substrates, is not precise, and will be extremely challenging for large-scale synthesis.
In order to achieve a precise control of fucosylating the GlcNAc, but not the core Glc residue, in the HMOs, provided herein is benzyl protected L-cysteine (F6.3) (
Similarly, as shown in
To precisely control selective fucosylation of internal GlcNAc residues in long-chain HMO backbones, GlcNTFA was designed herein as a chemoenzymatic synthon for the GlcNAc that is not intended for enzymatic fucosylation (
To further improve the protection of the selective internal GlcNAc residue from enzymatic fucosylation, the GlcNH2-containing glycans obtained from the GlcNTFA/GlcNH2 synthon strategies illustrated in
Synthesis of (2R, 2′R)-dibenzyl 3,3′-disulfanediylbis(2-((tert-butyloxycarbonyl)amino)propanoate) (F6.2). To a solution of (2R, 2′R)-3,3′-disulfanediylbis(2-((tert-butyloxycarbonyl)amino)propanoic acid) (F6.1) (1.0 g, 2.27 mmol) in DMF (12 mL), benzyl bromide (0.59 mL, 4.99 mmol, 2.2 equiv) and K2CO3 (0.721 g, 5.22 mmol, 2.3 equiv) were added. The reaction mixture was then stirred for 12 h. After the reaction was completed as monitored by TLC, the crude mixture was poured in an ice-water (100 mL) solution and the precipitate was filtered. The filtrate was washed with water and was dried to obtain the crude. The crude was subjected to silica gel column purification and eluted with a mixed solvent of 10% ethyl acetate/n-hexane to obtain the desired compound F6.2 with 85% yield. 1H NMR (300 MHz, CDCl3) δ 7.48-7.30 (s, 10H), 5.61-5.27 (s, 2H), 5.25-5.14 (s, 4H), 4.78-4.50 (d, J=6.8, 2H), 3.26-3.02 (d, J=5.3, 4H), 1.53-1.40 (s, 18H); 13C NMR (100 MHz, CDCl3) δ 170.56, 155.05, 135.05, 128.64, 80.31, 67.57, 53.02, 41.19, 28.31 ppm.
Synthesis of R-benzyl 2-((tert-butyloxycarbonyl)amino)-3-mercaptopropanoate F6.3. Compound F6.2 (0.5 g, 0.8 mmol) was dissolved in THF (10 mL), tri-n-butylphosphine (0.2 mL, 0.9 mmol) was added and the reaction mixture was stirred at room temperature for 30 min. Then H2O (0.1 mL, 4.0 mmol) was added and the reaction mixture was stirred at room temperature for 1 h. After the reaction was completed as monitored by TLC, the reaction mixture was extracted with ethyl acetate (150 mL) followed by water (2×50 mL) wash. The organic layer was dried over Na2SO4, evaporated under vacuum to produce the crude. The crude was subjected to silica gel column purification and eluted with a mixed solvent of 5-10% ethyl acetate/n-hexane to produce targeted compound F6.3 with 94% yield as a transparent liquid. 1H NMR (400 MHz, CDCl3) δ=7.63-7.33 (s, 5H), 5.56-5.39 (d, J=8.0, 1H), 5.43-5.03 (m, 2H), 4.80-4.52 (m, 1H), 3.13-2.69 (m, 2H), 1.85-1.50 (m, 1H), 1.38-1.64 (m, 9H).
Synthesis of Lac-thiazolidine (F6.4). (R)-Benzyl-2-((tert-butyloxycarbonyl)amino)-3-mercaptopropanoate 3 (0.2 g, 0.6 mmol) was dissolved in a mixed solvent of TFA:DCM (1:1, by volume) (10 mL) and stirred at room temperature for 2 h. After the reaction was completed as monitored by TLC, solvent was evaporated at room temperature under vacuum, the resulting residue was dried for 1 h under high vacuum. The obtained crude was dissolved in water (10 mL), neutralized with K2CO3 until neutral pH. The solution was extracted with ethyl acetate (3×20 mL) and the combined extracts were dried over Na2SO4 and concentrated to obtain the crude. The crude was directly used for the next step reaction without further purification. The crude (0.12 g, 0.56 mmol, 1.0 equiv.) was dissolved in 15 mL of acetonitrile:phosphate buffer (pH=4)=1:1 (by volume). Then D-lactose (0.19 g, 0.56 mmol, 1.0 equiv.) was added. The clear reaction mixture at pH 4 was stirred at room temperature for 36 h. The reaction was monitored (Rf=0.4) by TLC analysis using ACN:MeOH:H2O=6:1:1 (by volume) as a developing solvent. Once the reaction was completed, solvent of the reaction mixture was completely evaporated at room temperature to produce the crude thiazolidine derivative. Then, C18 column (water/CH3CN) purification was performed to obtain the pure Lac-thiazolidine (F6.4, 76 mg, 25%, white solid) as two diastereomers (R and S configurations). 1H NMR (400 MHz, D2O) R & S mixture δ 7.42-7.29 (m, 5H), 5.22-5.06 (m, 2H), 4.80-4.74 (dd, J=7.0, 4.0, 1H), 4.50-4.37 (m, 1H), 4.29-4.18 (d, J=7.5, 1H), 4.16-4.04 (dd, J=6.9, 3.5, 1H), 3.99-3.54 (m, 9H), 3.54-3.38 (m, 2H), 3.38-3.30 (m, 1H), 3.24-3.09 (dd, J=11.0, 3.3, 1H), 3.07-2.81 (m, 1H). 13C NMR (100 MHz, D2O) δ 172.6, 172.0, 135.35, 135.35, 135.1, 128.88, 128.85, 128.7, 128.3, 128.3, 103.0, 102.9, 78.73, 78.73, 75.2, 75.0, 72.5, 72.4, 71.7, 71.6, 71.1, 71.0, 70.98, 70.93, 69.7, 69.3, 68.8, 68.5, 67.7, 67.6, 65.0, 64.4, 62.2, 61.9, 61.1, 60.9, 36.0, 35.7. HRMS calculated for C22H33NO12S ([M+H]+) 536.1797, found 536.1781.
Materials. All chemicals were obtained from commercial suppliers and used without further purification. 1H NMR and 13C NMR spectra were recorded on a Bruker 400 MHz, a 600 MHz or a 800 MHz Bruker Avance III Spectrometer at the University of California, Davis (UC Davis) NMR Facility. High-resolution electrospray ionization (ESI-Orbitrap) mass spectra were obtained from a Thermo Scientific Q Exactive HF Orbitrap Mass Spectrometer at the Mass Spectrometry Facilities in the University of California, Davis. Thin-layer chromatography (TLC, Sorbent Technologies) was performed on silica gel plates using anisaldehyde sugar stain for detection.
Bacterial strains, plasmids, materials and general methods. E. coli chemically competent BL21 (DE3) and Origami B (DE3) cells were purchased from Invitrogen (Carlsbad, CA) and Takara Bio USA Inc. (San Jose, CA), respectively. Synthetic gene for hST6GALNAC V with codon optimized for E. coli system was synthesized by Genewiz, LLC (South Plainfield, NJ, USA). Vector plasmid pMAL-c4X and Q5 mutagenesis kit were purchased from New England Biolabs, Inc. (Beverly, MA, USA). GeneJET Plasmid Miniprep kit was from Thermo scientific (San Diego, CA, USA). NmLgtA, Hpβ3GlcNAcT, NmLgtB, Cvβ3GalT, PmST3, Pd2,6ST_A200Y/S232Y, Pd2,6ST, PmST1_M144D, SpNanA, Hm2FT, Hp3/4FT, Te2FT, SpGalK, BLUSP, BLNahK, PmGlmU, NmCSS, BfFKP, and PmPpA have been cloned in other reports.
Cloning of MBP-Δ50hST6GALNAC V-His6. Δ50hST6GALNAC V-His6 with an N-terminal truncation was cloned as a C-terminal His6-tagged and N-terminal MBP fusion protein in pMAL-c4X vector using a synthetic gene with codon optimization for E. coli expression. Plasmids containing MBP-Δ50hST6GALNAC V-His6 were transformed to the E. coli BL21 (DE3) and Origami B (DE3) cells.
Cloning of V99M, ext20, and the combination mutants of MBP-Δ50hST6GALNAC V-His6. The V99M and ext20 mutants were cloned using the plasmid containing MBP-Δ50hST6GALNAC V-His6 in the pMAL-c4X vector as the template for polymer chain reactions (PCR). The primers used for the V99M mutant were forward: 5′-TTGCGCGCTGGGCACCAGTAGCG-3′ (SEQ ID NO: 15) and reverse: 5′-TCACGGCAGTGCATTTTCAGC-3′ (SEQ ID NO: 16). The primers used for the ext20 mutant were forward: 5′-CGATACAGCTTGCACCAGACTCCGAGACATCATCATCATCATCATCACTGATC-3′ (SEQ ID NO: 17) and reverse: 5′-AGGGTTACAGTCGGGCATTGACATTCCTTGGGATCCGAAAACCGGTTTG-3′(SEQ ID NO: 18). To clone the combination mutant, the plasmid for the V99M mutant was used as the template for PCR using the same primers to clone the ext20 mutant. Cloning was performed by following the procedures described in the manual of Q5® Site-Directed Mutagenesis Kit. Briefly, PCR was carried out in a total volume of 25 μL containing 12.5 μL of Q5 Hot Start High-Fidelity 2× Master Mix, 10 ng (1 μL) of plasmid pMAL-c4X-MBP-Δ50hST6GALNAC V-His6 as the template, 0.5 M (1.25 μL) of each primer, and 9 μL of nuclease-free water. Thirty cycles of PCR were completed (30 seconds of annealing with an annealing temperature of 69° C. for the V99M mutant and 61° C. for the ext20 mutant, 3.5 minutes of extension at 72° C.) followed by two more minutes of extension at 72° C. and incubation at 4° C. After a standard KLD reaction by preparing a solution of 10 μL each containing 1 μL of PCR product, 5 μL of 2×KLD Reaction Buffer, 1 μL of 10×KLD Enzyme Mix, and 3 μL of Nuclease-free Water, and incubation at room temperature for 5 minutes, the mixture was transformed to DH5α competent cell prepared by Mix and Go!™ Competent Cells protocol (Zymo Research). The plasmids with target mutants were confirmed by DNA sequencing.
Protein expression in E. coli BL21(DE3) cells. E. coli BL21 (DE3) cells harboring the recombinant plasmid containing the target gene were cultured in 50 mL Luria-Bertani (LB) media (10 g L−1 tryptone, 5 g L−1 yeast extract, and 10 g L−1 NaCl) containing 0.1 mg mL−1 ampicillin and incubated at 37° C. for overnight with shaking at 220 rpm in a C25KC incubator shaker (New Brunswick Scientific, Edison, NJ). Then 15 mL of the overnight cell culture was transferred to 1 L of LB media containing 0.1 mg mL−1 ampicillin and the resulting culture was incubated at 37° C. When the OD600nm of the cell culture reached 0.6-0.8 (usually reach in 2-3 h), isopropyl-1-thio-β-D-galactopyranoside (IPTG, 0.1 mM) was added to induce the expression of the recombinant enzyme. The culture was then incubated at 20° C. for 20 h with shaking at 220 rpm.
Protein expression in E. coli Origami B(DE3) cells. E. coli Origami B(DE3) cells with pGro7 harboring the recombinant plasmid containing the target gene were cultured at 37° C. in 50 mL LB media containing 0.05 mg mL-1 ampicillin, 0.025 mg mL−1 kanamycin, 0.0175 mg mL−1 chloramphenicol, 0.01 mg mL−1 tetracycline with rapid shaking at 220 rpm. Then 15 mL of the overnight cell culture was transferred to 1 L of LB media containing 0.05 mg mL-1 ampicillin, 0.025 mg mL-1 kanamycin, 0.0175 mg mL−1 chloramphenicol, 0.01 mg mL−1 tetracycline, and L-arabinose 0.5 mg mL−1 and incubated at 37° C. When the OD600 nm of the cell culture reached 0.5-0.6, IPTG (0.1 mM) was added to induce the expression of the recombinant enzyme followed by incubation at 16° C. for 48 h with shaking at 220 rpm in a C25KC incubator shaker (New Brunswick Scientific, Edison, NJ).
Protein purification. Bacterial cells were harvested by centrifugation at 4° C. in a Thermo Lynx 6000 centrifuge with Rotor Lynx F9-6×1000 at 4,392×g for 30 min. The cell pellet was re-suspended with lysis buffer (Tris-HCl buffer, 100 mM, pH 8.0 containing 0.1% Triton X-100) (20 mL for cells collected from 1 L cell culture). The cells were lysed by homogenization using an Avestin EmulsiFlex-C3 homogenizer or sonication using a Sonics Vibra-Cell system with the following protocol: 10 s (sonication)/30 s (rest) for a total of 6 min on ice. Cell lysate was obtained by centrifugation at 4° C. with Sorvall ST16R, HIGHConic II 6×94 mL fixed angle rotor at 9,016×g for 1 h and the supernatant (lysate) was collected. Protein purification was carried out using 5 mL Bio-Scale Mini Profinity IMAC Cartridges in a Bio-Rad NGC 100 Medium-Pressure Chromatography System with a flow rate of 5 mL min−1. The supernatant was loaded to the Ni2+-NTA column pre-equilibrated with 10 column volumes of a binding buffer (5 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl buffer, pH 7.5). The column was washed with 10 column volumes of the binding buffer followed by 10 column volumes of a washing buffer (10 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl buffer, pH 7.5). The target protein was eluted with 10 column volumes of a elution buffer (200 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl buffer, pH 7.5). Fractions containing the target protein were collected and stored at 4° C. The flow rate was 5 mL min−1 for the supernatant loading and all other steps during the purification. PmST3, SpGalK, BLUSP, BLNahK, PmGlmU, NmCSS, and PmPpA were dialyzed against 20 mM Tris-HCl buffer, pH 7.5, lyophilized, and stored at −20° C. NmLgtA, NmLgtB, PmST3, PmST1_M144D Pd2,6ST_A200Y/S232Y, Pd2,6ST, Hp3/4FT, BfFKP, SpNanA, SpGalK, BLUSP, BLNahK, PmGlmU, NmCSS, and PmPpA were dialyzed against a dialysis buffer (10% glycerol, 20 mM Tris-HCl buffer, pH 7.5) and stored at 4° C. The purified Hpβ3GlcNAcT, Cvβ3GalT, Hm2FT, and Te2FT without dialysis were added with 10% glycerol and stored at −20° C. Mutants of MBP-Δ50hST6GALNAC V-His6 were dialyzed in a dialysis buffer (10% glycerol, 50 mM Tris-HCl buffer, pH 7.5, 250 mM NaCl) and stored at −20° C.
Acceptor substrate specificity of MBP-Δ50hST6GALNAC V-His6 and mutants. Reactions were performed in a reaction mixture (10 L) containing pH 7.5 Tris-HCl (100 mM), CMP-Neu5Ac (2 mM), one acceptor (1 mM), MgCl2 (20 mM), and MBP-Δ50hST6GALNAC V-His6 (0.22 mU or 2.2 mU). The reactions were allowed to proceed at 37° C. for 30 min and 20 h, respectively. The reactions were stopped by adding 10 μL of cold ethanol to each reaction mixture, followed by incubation on ice for 30 min and centrifugation at 11,337 g for 10 min. Chromatographic separation and detection were achieved with an Infinity 1290 II HPLC (UHPLC) equipped with 1260 Infinity II Diode Array Detector WR (monitored at 215 nm, Agilent Technologies, CA) with a ZORBAX Bonus-RP (1.8 m particle, 2.1×150 mm, Agilent Technologies, CA) and an gradient flow (0.25 mL/min) of a mixed solvent (5% to 30% acetonitrile, and 95% to 70% of water with 0.1% TFA).
Ten gram-scale synthesis of LacβNHCbz (1) with an improved procedure. D-Lactose monohydrate (10.0 g, 27.8 mmol) was dissolved in 60 mL NH40H in a 500 mL round bottom flask, NH4HCO3 (3 g, 38 mmol) was then added. The reaction was heated to 45-47° C. in an oil bath using water condenser for 20 h to form LacβNH2. The solvent was removed by rotary evaporation, and LacβNH2 was re-dissolved in 20% Na2CO3 solution (65 mL) in a 1 L round bottom flask. The solution was incubated in an ice bath. Benzyl chloroformate (9.86 mL, 69.4 mmol, 2.5 equiv.) was dissolved in ethanol (“EtOH”) (15 mL) in a 50 mL centrifuge tube, and the mixture was added drop-wisely to the LacβNH2 in aqueous Na2CO3 solution. Another 70 mL of EtOH was added to the reaction. The reaction pH was adjusted by adding 20% Na2CO3 (−35 mL) solution and was kept at pH 8-9. The reaction was then removed from the ice bath and kept at room temperature for 4 hours. Furthermore, another batch of benzyl chloroformate (11.8 mL, 83.2 mmol, 3.0 equiv.) was dissolved in EtOH (20 mL) and added to the reaction mixture, followed by addition of EtOH (−100 mL) to prevent precipitation. Again, to maintain the pH, 20% sodium carbonate (−50 mL) was added drop wisely. Once the pH is adjusted, the reaction was stirred for 20 h. After the reaction was completed as monitored by TLC with EA:MeOH:H2O=7:2:1 (by volume) as the developing solvent, EtOH was removed from the reaction by rotary evaporation. To the resulting residue in a 1 L round bottom flask, 600 mL of a mixed solvent H2O and ethyl acetate (1:1 by volume) was added. The mixture was transferred to a separation funnel to extract the crude product in the aqueous layer. The ethyl acetate (referred to herein as “EA” or EtOAc) layer was washed with 60 mL of water each time for three times. The aqueous layer samples were combined, the solvent was removed by rotary evaporation, and the crude product (55 g) was collected. For crystallization, 275 mL of a mixed solvent (5 mL for each gram of crude) of H2O and 1-butanol (1:1 by volume) was transferred to a 1 μL Erlenmeyer flask. It was heated to almost boiling. To the solvent, 55 g crude was added and completely dissolved. After cooling the mixture to room temperature, a small amount of pure compound was added for seeding. The mixture was incubated at 4° C. for 24 h and the crystals were filtered and collected. The flow-through was collected, dried and used for recrystallization by following a procedure similar to that described above. A total of 9.6 g (73% yield) pure crystals were obtained from two rounds of crystallization procedures.
Single C18-cartridge purification process for StOPMe-synthesized βNHCbz-tagged HMOs. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes. The mixture was then cooled down to room temperature and centrifuged at 9016×g at 4° C. for 30 min. The supernatant was collected. The precipitate was washed twice, each time with H2O (3 mL), and the supernatants were combined. The combined supernatant was concentrated by rotavap to reduce the volume to about 3-5 mL which was purified by passing through a ODS-SM column (51 g, 50 μm, 120 Å, Yamazen) pre-equilibrated with three column volumes of mobile phase A (water) on a CombiFlash® Rf 200i system and monitored at 214 nm. The product was eluted with a mixed solvent of acetonitrile and water with a flow rate of 20 mL min1. The eluting program used was the following: Mobile phase A: water (v/v); Mobile phase B: acetonitrile (v/v); 0% B for 8 min followed by gradient 0% to 40% B over 25 min, gradient 40% to 100% B over 3 min, 100% B for 2 min, then 100% to 80% B over 2 min.
GalβGlcNAcβ3Galβ4GlcβNHCbz (LNTβNHCbz, 7). GlcNAc (0.32 mmol), Gal (0.32 mmol), ATP (0.55 mmol), UTP (0.55 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2 mg), PmGlmU (1.5 mg), SpGalK (2 mg), BLUSP (2 mg), and PmPpA (2 mg) to generate the glycosyltransferase donors UDP-GlcNAc and UDP-Gal. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added to the reaction mixture to bring the concentration of LacβNHCbz to around 25 mM. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by thin-layer chromatography (TLC) with ethyl acetate (EtOAc): Methanol (MeOH): H2O=5:1.6:1 (by volume) as the developing solvent and by high-resolution mass spectrometry (HRMS). After the trisaccharide formation process went to completion, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Cvβ3GalT (4.5 mg) was added and the reaction volume was kept at around 20 mM. The reaction mixture was incubated at 30° C. for 10 h with agitation at 180 rpm. The product formation was monitored with TLC EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the tetrasaccharide formation process was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, and then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (162 mg, 90%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Fucα2Galβ3GlcNAcβ3Galβ4GlcβNHCbz (LNFP-IβNHCbz, 8). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.32 mmol), ATP (0.82 mmol), UTP (0.55 mmol) and GTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2 mg), BLUSP (2 mg), BfFKP (2 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNTβNHCbz (7) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature, Te2FT (4 mg) was added to the same reaction tube. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc: MeOH: H2O=5:2.2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (176 mg, 85%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Fucα2Galβ3(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNTFHepβNHCbz, 23). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.82 mmol), ATP (1.37 mmol), UTP (0.55 mmol), and GTP (0.88 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (4 mg), and PmPpA (3.5 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added, and the process was the same as for the preparation of LNFP-IβNHCbz (8) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature, Hp3/4FT (3 mg) was added to the same reaction tube. The reaction mixture was incubated at 30° C. for 8 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (224 mg, 84%). 1H NMR (600 MHz, D2O) δ 7.49-7.39 (m, 5H), 5.43 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 5.15 (d, J=4.1 Hz, 1H), 5.03 (d, J=3.9 Hz, 1H), 4.88 (t, J=6.7 Hz, 1H), 4.82 (s, 1H), 4.66 (d, J=7.7 Hz, 1H), 4.60 (d, J=8.5 Hz, 1H), 4.41 (d, J=7.8 Hz, 1H), 4.35 (q, J=6.7 Hz, 1H), 4.13 (dd, J=10.6, 9.2 Hz, 1H), 4.07 (d, J=3.4 Hz, 1H), 3.93 (dq, J=10.7, 3.5 Hz, 4H), 3.89-3.67 (m, 20H), 3.64-3.56 (m, 5H), 3.51 (dt, J=9.8, 2.9 Hz, 1H), 3.46 (dd, J=9.8, 7.8 Hz, 1H), 2.06 (s, 3H), 1.27 (dd, J=10.3, 6.6 Hz, 6H), 1.16 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.16, 158.11, 136.01, 128.77, 128.46, 127.77, 103.19, 101.64, 100.58, 99.52, 98.55, 97.74, 81.83, 81.10, 77.72, 76.76, 76.44, 75.08, 74.72, 74.44, 74.38, 73.59, 73.05, 72.01, 71.97, 71.91, 71.80, 71.73, 70.85, 69.38, 69.20, 69.08, 68.71, 68.53, 68.12, 67.93, 67.78, 67.38, 67.00, 66.49, 66.21, 61.58, 61.48, 59.61, 59.42, 55.79, 22.15, 15.35, 15.33, 15.26. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C52H83N2O34 1279.4827; found 1279.4829.
Galβ3GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNFP-VβNHCbz, 9). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.32 mmol), ATP (0.82 mmol), UTP (0.55 mmol), and GTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (2 mg), and PmPpA (2 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added into the reaction mixture to bring the concentration of LacβNHCbz to around 20 mM. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:1.6:1 (by volume) as the developing solvent and by HRMS. After the formation of the trisaccharide was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature, Hp3/4FT (1.5 mg) was added to the same reaction tube and the reaction concentration was kept around 20 mM. The reaction mixture was incubated at 30° C. for 5 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Cvβ3GalT (4 mg) was added, and the reaction concentration was kept around 15 mM. The reaction mixture was incubated at 30° C. for 10 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (183 mg, 88%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Galβ33(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNDFH-IIβNHCbz, 10). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.55 mmol), ATP (1.1 mmol), UTP (0.55 mmol), and GTP (0.59 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (3 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNFP-VβNHCbz (9) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature, Hp3/4FT (1.5 mg) was added to the same reaction tube. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (210 mg, 88%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Fucα2Galβ3GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNDFH-IIIβNHCbz, 24). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.59 mmol), ATP (1.1 mmol), UTP (0.55 mmol), and GTP (0.59 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (3 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNFP-VβNHCbz (9) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. Te2FT (5 mg) was added to the same reaction tube. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (220 mg, 92%). 1H NMR (600 MHz, D2O) δ 7.48-7.38 (m, 5H), 5.43 (d, J=4.0 Hz, 1H), 5.25-5.15 (m, 3H), 4.81 (d, J=6.7 Hz, 1H), 4.63 (dd, J=17.3, 8.0 Hz, 2H), 4.41 (d, J=7.8 Hz, 1H), 4.29 (q, J=6.6 Hz, 1H), 4.08 (d, J=3.4 Hz, 1H), 3.99 (dd, J=10.5, 8.5 Hz, 1H), 3.95-3.64 (m, 22H), 3.61-3.45 (m, 6H), 2.05 (s, 3H), 1.24 (d, J=6.6 Hz, 3H), 1.16 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.19, 158.12, 136.01, 128.77, 128.46, 127.77, 103.20, 101.64, 100.22, 99.49, 98.54, 81.82, 81.11, 77.72, 77.13, 76.75, 76.65, 75.16, 75.03, 74.39, 73.46, 73.05, 71.90, 71.86, 71.81, 70.91, 69.38, 69.19, 69.10, 68.50, 68.45, 67.93, 67.37, 66.49, 66.44, 61.46, 61.13, 60.36, 59.61, 55.01, 22.11, 15.28, 15.25. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C46H73N2O30 1133.4248; found 1133.4214.
Galβ33(Fucα4)GlcNAcβ3Galβ4GlcβNHCbz (LNFP-IIβNHCbz, 25). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.32 mmol), ATP (0.82 mmol), UTP (0.55 mmol), and GTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (2 mg), and PmPpA (2 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added into the reaction mixture containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), and water was added to bring the concentration of LacβNHCbz (1) to around 20 mM. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:1.6:1 (by volume) as the developing solvent and by HRMS. After getting the trisaccharide the reaction was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, CTP (0.53 mmol) and Neu5Ac (0.32 mmol) were added. The pH was adjusted to pH 7.5 by adding 4 M NaOH. NmCSS (1 mg) and Pd2,6ST (1.2 mg) were then added to the reaction tube. The reaction mixture was incubated at 30° C. for 4 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the trisaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Cvβ3GalT (4 mg) was added and the reaction concentration was kept at around 15 mM. The reaction mixture was incubated at 30° C. for 20 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.2:1 (by volume) as the developing solvent and by HRMS. After the pentasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hp3/4FT (1.5 mg) was added. The reaction mixture was incubated at 30° C. for 8 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. Then the sialidase SpNanA (0.3 mg) was added to catalyze the cleavage of Neu5Ac. The reaction was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (203 mg, 85%). 1H NMR (600 MHz, D2O) δ 7.50-7.39 (m, 5H), 5.23-5.16 (m, 2H), 5.03 (d, J=4.0 Hz, 1H), 4.88 (d, J=6.7 Hz, 1H), 4.83 (s, 1H), 4.70 (d, J=8.4 Hz, 1H), 4.51 (d, J=7.7 Hz, 1H), 4.44 (d, J=7.9 Hz, 1H), 4.15 (d, J=3.3 Hz, 1H), 4.08 (t, J=9.7 Hz, 1H), 3.98-3.85 (m, 6H), 3.82-3.52 (m, 18H), 3.49 (dd, J=9.8, 7.7 Hz, 1H), 3.42 (t, J=8.9 Hz, 1H), 2.03 (s, 3H), 1.18 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.74, 158.09, 136.00, 128.77, 128.47, 127.82, 102.84, 102.83, 102.60, 98.00, 81.98, 81.64, 77.71, 76.12, 75.88, 75.20, 75.00, 74.85, 74.79, 72.29, 72.11, 71.92, 71.39, 70.48, 69.95, 69.11, 68.33, 68.31, 67.77, 67.40, 66.83, 61.63, 60.96, 59.84, 59.58, 55.85, 22.26, 15.35. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C40H63N2O26 987.3669; found 987.3668.
Fucα2Galβ3(Fucα4)GlcNAcβ3Galβ4GlcβNHCbz (LNDFH-IβNHCbz, 26). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.55 mmol), ATP (1.1 mmol), UTP (0.55 mmol), and GTP (0.59 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (3 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the formation of the Neu5Ac-protected LNTβNHCbz pentasaccharide intermediate during the preparation of LNFP-IIβNHCbz (25) described above. The reaction mixture containing the Neu5Ac-protected LNTβNHCbz pentasaccharide intermediate was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Te2FT (4 mg) was added and the reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. In the same reaction tube, Hp3/4FT (1.5 mg) was added. The reaction mixture was incubated at 30° C. for 8 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. Then the sialidase SpNanA (0.3 mg) was added to catalyze the cleavage of Neu5Ac and the reaction was monitored by HRMS. Once the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (193 mg, 80%). 1H NMR (600 MHz, D2O) δ 7.47-7.39 (m, 5H), 5.25-5.17 (m, 2H), 5.15 (d, J=4.1 Hz, 1H), 5.02 (d, J=3.9 Hz, 1H), 4.87 (t, J=6.7 Hz, 1H), 4.82 (d, J=9.5 Hz, 1H), 4.66 (d, J=7.7 Hz, 1H), 4.60 (d, J=8.4 Hz, 1H), 4.42 (d, J=7.9 Hz, 1H), 4.34 (q, J=6.7 Hz, 1H), 4.17-4.10 (m, 2H), 3.92 (tt, J=6.8, 3.7 Hz, 3H), 3.88-3.50 (m, 25H), 3.41 (t, J=8.9 Hz, 1H), 2.09-2.04 (m, 3H), 1.27 (dd, J=9.3, 6.6 Hz, 6H). 13C NMR (150 MHz, D2O) δ 174.16, 158.09, 135.99, 128.77, 128.47, 127.81, 103.21, 102.87, 100.61, 99.54, 97.75, 81.64, 81.45, 77.57, 76.46, 76.14, 75.15, 74.98, 74.78, 74.73, 74.45, 73.60, 71.97, 71.95, 71.75, 71.39, 70.15, 69.42, 69.08, 68.72, 68.62, 68.24, 67.78, 67.40, 67.01, 66.22, 61.57, 60.95, 59.83, 59.44, 55.73, 22.15, 15.34, 15.30. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C46H73N2O30 1133.4248; found 1133.4211.
Galβ4GlcNAcβ3Galβ4GlcβNHCbz (LNnTβNHCbz, 3). GlcNAc (0.32 mmol), Gal (0.32 mmol), ATP (0.55 mmol), UTP (0.55 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2 mg), PmGlmU (1.5 mg), SpGalK (2 mg), BLUSP (2 mg), and PmPpA (2 mg) to generate UDP-GlcNAc and UDP-Gal. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added and the concentration of LacβNHCbz was around 25 mM. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:1.6:1 (by volume) as the developing solvent and by HRMS. After the trisaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, NmLgtB (0.5 mg) was added and the reaction concentration was kept at around 20 mM. The reaction mixture was incubated at 30° C. for 4 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the tetrasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (161 mg, 91%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Fucα2Galβ4GlcNAcβ3Galβ4GlcβNHCbz (LNnFP-IβNHCbz, 27). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.32 mmol), ATP (0.82 mmol), UTP (0.55 mmol), and GTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2 mg), BLUSP (2 mg), BfFKP (2 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNnTβNHCbz (3) described above without the final centrifugation and purification steps. After the tetrasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hm2FT (3 mg) was added. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (191 mg, 91%). 1H NMR (600 MHz, D2O) δ 7.48-7.39 (m, 5H), 5.31 (d, J=3.2 Hz, 1H), 5.18 (d, J=3.7 Hz, 2H), 4.70 (d, J=8.4 Hz, 1H), 4.54 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.8 Hz, 1H), 4.22 (q, J=6.7 Hz, 1H), 4.14 (d, J=3.4 Hz, 1H), 3.98-3.86 (m, 4H), 3.79 (ddt, J=12.5, 8.3, 5.1 Hz, 9H), 3.70 (dddd, J=27.1, 13.8, 6.8, 3.9 Hz, 10H), 3.61-3.57 (m, 1H), 3.48-3.39 (m, 2H), 2.04 (s, 3H), 1.23 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.89, 158.09, 136.00, 128.77, 128.47, 127.82, 102.82, 102.73, 100.23, 99.39, 81.88, 81.64, 77.70, 76.42, 76.12, 75.87, 75.23, 75.07, 75.00, 74.83, 73.51, 72.03, 71.65, 71.40, 69.98, 69.59, 69.09, 68.32, 68.18, 67.40, 66.92, 61.09, 60.92, 59.99, 59.84, 55.36, 22.17, 15.29. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C40H63N2O26 987.3669; found 987.3652.
Fucα2Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNnTFHepβNHCbz, 28). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.82 mmol), ATP (1.37 mmol), UTP (0.55 mmol), and GTP (0.88 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (4 mg), and PmPpA (3.5 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNnFP-IβNHCbz (27) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. In the same reaction tube, Hp3/4FT (4 mg) was added. The reaction mixture was incubated at 30° C. for 10 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (223 mg, 83%). 1H NMR (600 MHz, D2O) δ 7.49-7.38 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.28 (d, J=3.5 Hz, 1H), 5.18 (s, 2H), 5.12 (d, J=4.0 Hz, 1H), 4.88 (t, J=6.7 Hz, 1H), 4.71 (d, J=8.5 Hz, 1H), 4.51 (d, J=7.8 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.25 (q, J=6.7 Hz, 1H), 4.08 (d, J=3.4 Hz, 1H), 4.01-3.90 (m, 6H), 3.89-3.56 (m, 25H), 3.52-3.43 (m, 2H), 2.02 (s, 3H), 1.25 (dd, J=16.6, 6.6 Hz, 6H), 1.16 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.65, 158.11, 136.02, 128.77, 128.46, 127.77, 102.42, 101.66, 100.17, 99.39, 98.57, 98.46, 81.81, 81.37, 77.59, 76.73, 76.33, 75.30, 74.84, 74.78, 74.44, 73.54, 73.03, 71.99, 71.92, 71.88, 71.68, 70.69, 69.71, 69.19, 69.15, 68.73, 68.25, 68.22, 67.95, 67.67, 67.38, 66.90, 66.76, 66.50, 61.46, 61.42, 59.76, 59.61, 56.04, 22.24, 15.44, 15.41, 15.18. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C52H83N2O34 1279.4827; found 1279.4783.
Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNnFP-VβNHCbz, 5). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.32 mmol), ATP (0.82 mmol), UTP (0.55 mmol), and GTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (2 mg), and PmPpA (2 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added into the reaction mixture to bring the concentration of LacβNHCbz to around 20 mM. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:1.6:1 (by volume) as the developing solvent and by HRMS. After the trisaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hp3/4FT (1.5 mg) was added and the reaction volume was kept to maintain the acceptor substrate concentration to around 20 mM. The reaction mixture was incubated at 30° C. for 5 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, NmLgtB (0.5 mg) was added and the reaction acceptor concentration was kept to around 15 mM. The reaction mixture was incubated at 30° C. for 4 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (183 mg, 88%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNnDFH-IIβNHCbz, 6). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.55 mmol), ATP (1.1 mmol), UTP (0.55 mmol), and GTP (0.59 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (3 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNnFP-VβNHCbz (5) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hp3/4FT (1.5 mg) was added and the reaction was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (207 mg, 87%). NMR data were consistent with those described herein using the MSOPME strategy described herein.
Fucα2Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (LNnDFH-IIIβNHCbz, 29). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.59 mmol), ATP (1.1 mmol), UTP (0.55 mmol), and GTP (0.59 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (3 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the preparation of LNnFP-VβNHCbz (5) described above without the final centrifugation and purification steps. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hm2FT (3 mg) was added to the reaction. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (202 mg, 85%). 1H NMR (600 MHz, D2O) δ 7.48-7.39 (m, 5H), 5.44 (d, J=3.9 Hz, 1H), 5.31 (d, J=3.2 Hz, 1H), 5.18 (s, 2H), 4.82 (d, J=7.0 Hz, 2H), 4.70 (d, J=8.4 Hz, 1H), 4.55 (d, J=7.7 Hz, 1H), 4.43 (d, J=7.8 Hz, 1H), 4.22 (q, J=6.6 Hz, 1H), 4.08 (d, J=3.4 Hz, 1H), 3.99-3.91 (m, 3H), 3.90-3.56 (m, 25H), 3.54-3.44 (m, 2H), 2.04 (s, 3H), 1.23 (d, J=6.6 Hz, 3H), 1.17 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.85, 158.11, 136.02, 128.78, 128.47, 127.78, 102.67, 101.67, 100.23, 99.38, 98.46, 81.81, 81.37, 77.60, 76.74, 76.41, 75.88, 75.23, 75.04, 74.45, 73.52, 73.08, 72.03, 72.00, 71.89, 71.65, 70.65, 69.59, 69.19, 69.10, 68.23, 68.18, 67.96, 67.38, 66.92, 66.50, 61.42, 61.10, 59.97, 59.60, 55.32, 22.16, 15.29, 15.19. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C46H73N2O30 1133.4248; found 1133.4250.
Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (LNFP-IIIβNHCbz, 30). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.32 mmol), ATP (0.82 mmol), UTP (0.55 mmol), and GTP (0.32 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (2 mg), and PmPpA (2 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added to the reaction mixture containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). Water was added to bring the concentration of LacβNHCbz to around 20 mM. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:1.6:1 (by volume) as the developing solvent and by HRMS. After the trisaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, CTP (0.53 mmol) and Neu5Ac (0.32 mmol) were added. The pH was adjusted to pH 7.5 by adding 4 M NaOH. NmCSS (1 mg) and Pd2,6ST (1.2 mg) were then added. The reaction mixture was incubated at 30° C. for 4 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2:1 (by volume) as the developing solvent and by HRMS. After the tetrasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, NmLgtB (0.5 mg) was added and the acceptor concentration was kept around 15 mM. The reaction mixture was incubated at 30° C. for 4 h with agitation at 180 rpm. The product formation was monitored by TLC EtOAc:MeOH:H2O=5:2.2:1 (by volume) and HRMS. After the pentasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hp3/4FT (1.5 mg) was added. The reaction mixture was incubated at 30° C. for 8 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. Then the sialidase SpNanA (0.3 mg) was added to cleave Neu5Ac and the reaction was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (195 mg, 82%). 1H NMR (600 MHz, D2O) δ 7.47-7.41 (m, 5H), 5.22-5.16 (m, 2H), 5.13 (d, J=4.0 Hz, 1H), 4.84 (d, J=13.6 Hz, 1H), 4.72 (d, J=8.4 Hz, 1H), 4.45 (dd, J=14.5, 7.8 Hz, 2H), 4.15 (d, J=3.3 Hz, 1H), 3.99-3.85 (m, 8H), 3.81-3.56 (m, 17H), 3.50 (dd, J=9.8, 7.8 Hz, 1H), 3.42 (t, J=9.0 Hz, 1H), 2.03 (s, 3H), 1.18 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.68, 158.08, 136.00, 128.77, 128.47, 127.81, 102.84, 102.52, 101.75, 98.58, 81.98, 81.65, 77.69, 76.12, 75.10, 75.01, 74.89, 74.84, 74.72, 73.04, 72.45, 71.89, 71.40, 71.03, 69.96, 69.18, 68.33, 67.68, 67.40, 66.67, 61.48, 60.96, 59.85, 59.62, 55.94, 22.23, 15.29. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C40H63N2O26 987.3669; found 987.3663.
Fucα2Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (LNnDFH-IβNHCbz, 31). GlcNAc (0.32 mmol), Gal (0.32 mmol), L-fucose (0.59 mmol), ATP (1.1 mmol), UTP (0.55 mmol), and GTP (0.59 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (20 mM), BLNahK (2.5 mg), PmGlmU (2 mg), SpGalK (2.5 mg), BLUSP (2.5 mg), BfFKP (3 mg), and PmPpA (3 mg) to generate UDP-GlcNAc, UDP-Gal, and GDP-Fuc. The reaction mixture was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were then added and the process was the same as for the formation of the Neu5Ac-protected LNnTβNHCbz pentasaccharide intermediate during the preparation of LNFP-IIIβNHCbz (30) described above. The reaction mixture containing the Neu5Ac-protected LNnTβNHCbz pentasaccharide intermediate was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Hm2FT (3 mg) was added. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. In the same reaction tube, Hp3/4FT (1.5 mg) was added. The reaction mixture was incubated at 30° C. for 8 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. Then the sialidase SpNanA (0.3 mg) was added to cleave Neu5Ac and the reaction was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (197 mg, 83%). 1H NMR (600 MHz, D2O) δ 7.51-7.36 (m, 5H), 5.28 (d, J=3.5 Hz, 1H), 5.18 (d, J=3.6 Hz, 2H), 5.12 (d, J=3.9 Hz, 1H), 4.88 (d, J=20.2 Hz, 1H), 4.83 (s, 1H), 4.72 (d, J=8.6 Hz, 1H), 4.51 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.8 Hz, 1H), 4.25 (q, J=6.6 Hz, 1H), 4.14 (d, J=3.3 Hz, 1H), 4.05-3.90 (m, 5H), 3.87-3.58 (m, 22H), 3.48-3.38 (m, 2H), 2.03 (s, 3H), 1.25 (dd, J=16.9, 6.6 Hz, 6H). 13C NMR (150 MHz, D2O) δ 174.70, 158.08, 135.99, 128.77, 128.47, 127.81, 102.82, 102.45, 100.19, 99.40, 98.57, 81.90, 81.65, 77.68, 76.35, 76.12, 75.35, 75.00, 74.84, 74.82, 74.73, 73.54, 73.05, 71.92, 71.68, 71.39, 69.99, 69.71, 69.15, 68.73, 68.30, 68.26, 67.67, 67.40, 66.90, 66.77, 61.45, 60.92, 59.85, 59.77, 56.09, 22.25, 15.44, 15.42. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C46H73N2O30 1133.4248; found 1133.4210.
Neu5Acα3Galβ3GlcNAcβ3Galβ4GlcβNHCbz (LSTaβNHCbz, 32). The same procedure described above for the formation of LNTβNHCbz (7) from LacβNHCbz (100 mg, 0.21 mmol) was followed without the final centrifugation and purification steps. After the tetrasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the reaction mixture in the same reaction tube, Neu5Ac (0.32 mmol), CTP (0.42 mmol), PmST1_M144D (2 mg), and NmCSS (1.2 mg) were added. The pH of the reaction mixture was adjusted to pH 7.5 by adding 4 M NaOH. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (211 mg, 89%). 1H NMR (600 MHz, D2O) δ 7.51-7.36 (m, 5H), 5.18 (d, J=3.7 Hz, 2H), 4.82 (d, J=9.4 Hz, 1H), 4.74 (d, J=8.5 Hz, 1H), 4.51 (d, J=7.8 Hz, 1H), 4.44 (dd, J=7.9, 3.8 Hz, 1H), 4.15 (d, J=3.3 Hz, 1H), 4.09 (dd, J=9.8, 3.2 Hz, 1H), 3.94-3.40 (m, 29H), 2.76 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (s, 6H), 1.79 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.95, 174.89, 173.87, 136.00, 128.77, 128.47, 128.41, 127.82, 127.67, 103.35, 102.84, 102.47, 99.62, 82.13, 81.83, 81.65, 78.08, 77.76, 76.12, 75.59, 75.19, 75.07, 75.01, 74.87, 72.78, 71.82, 71.40, 70.00, 69.07, 68.46, 68.36, 68.32, 68.03, 67.41, 67.23, 62.45, 61.01, 60.96, 60.50, 59.85, 59.64, 54.56, 51.64, 39.76, 23.67, 22.28, 22.02. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C45H68N3O30 1130.3893; found 1130.3874.
Neu5Acα3Galβ4GlcNAcβ3Galβ4GlcβNHCbz (38). The same procedure described above for the formation of LNnTβNHCbz (3) from LacβNHCbz (100 mg, 0.21 mmol) was followed without the final centrifugation and purification steps. After the tetrasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. In the same reaction tube, Neu5Ac (0.32 mmol), CTP (0.42 mmol), PmST3 (2 mg), and NmCSS (1.2 mg) were added. The pH of the reaction mixture was adjusted to pH 7.5 by adding 4 M NaOH. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.4:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (219 mg, 92%). HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C45H69N3O30 1130.3893; found 1130.3874. 1H NMR (600 MHz, D2O) δ 7.49-7.38 (m, 5H), 5.18 (d, J=3.7 Hz, 2H), 4.70 (d, J=8.3 Hz, 1H), 4.56 (d, J=7.9 Hz, 1H), 4.44 (d, J=7.9 Hz, 1H), 4.15 (d, J=3.3 Hz, 1H), 4.12 (dd, J=9.8, 3.2 Hz, 1H), 3.99-3.62 (m, 26H), 3.58 (qd, J=9.9, 9.3, 4.7 Hz, 4H), 3.42 (t, J=8.9 Hz, 1H), 2.76 (dd, J=12.4, 4.6 Hz, 1H), 2.03 (s, 6H), 1.80 (t, J=12.1 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.99, 174.87, 173.76, 158.09, 136.00, 128.77, 128.47, 127.82, 102.85, 102.78, 102.52, 99.75, 81.97, 81.65, 77.99, 77.74, 76.13, 75.47, 75.15, 75.01, 74.87, 74.54, 72.88, 72.12, 71.72, 71.40, 69.96, 69.36, 68.33, 68.30, 68.07, 67.46, 62.57, 61.01, 60.97, 59.84, 59.30, 55.16, 51.66, 46.65, 39.60, 22.16, 22.02, 19.13, 12.80, 8.21. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C45H68N3O30 1130.3893; found 1130.3873.
Neu5Acα3Galβ3(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz (46). GlcNAc (35 mg, 0.16 mmol), Gal (29 mg, 0.16 mmol), ATP (185 mg, 0.34 mmol), and UTP (178 mg, 0.34 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5), MgCl2 (100 mM), BLNahK (1.5 mg), PmGlmU (1.2 mg), SpGalK (1.6 mg), BLUSP (1.6 mg), and PmPpA (1.6 mg) to generate UDP-GlcNAc and UDP-Gal. The reaction mixture in a 3 mL volume was incubated at 30° C. for 14 h with agitation at 180 rpm. LacβNHCbz (1) (50 mg, 0.11 mmol) and NmLgtA (2.5 mg) were added. Then water was added to bring the concentration of LacβNHCbz to around 20 mM. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. When all LacβNHCbz was consumed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. To the same reaction tube, Cvβ3GalT (2.5 mg) was added and the reaction volume was around 7 mL. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. When the tetrasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. CTP (94 mg, 0.18 mmol) and Neu5Ac (49 mg, 0.16 mmol) were added and the pH of the solution was adjusted to pH 8.0 by adding 4 M NaOH. NmCSS (0.6 mg) and CjCst-I (2.0 mg) were added and the reaction mixture was incubated at 30° C. for 24 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, CTP (94 mg, 0.18 mmol) and Neu5Ac (49 mg, 0.16 mmol) were added and the pH of the solution was adjusted to 8.0 using 4 M NaOH. NmCSS (0.6 mg) and MBP-Δ50hST6GALNAC V_Design7_ext20-His6 (6.0 mg) were added and the reaction mixture with a 10 mL volume was incubated at 30° C. with agitation at 180 rpm. The product formation was monitored by HRMS. After 20 h, additional 5 mg of MBP-Δ50hST6GALNAC V_Design7_ext20-His6 was added and the reaction was incubated at 30° C. for another 24 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (131 mg, 85%). 1H NMR (800 MHz, D2O) δ 7.46-7.37 (m, 5H), 5.17 (t, J=9.3 Hz, 2H), 4.68 (d, J=8.5 Hz, 1H), 4.48 (d, J=7.8 Hz, 1H), 4.42 (d, J=7.9 Hz, 1H), 4.15 (d, J=3.3 Hz, 1H), 4.06 (dd, J=9.8, 3.1 Hz, 1H), 3.95 (dd, J=11.0, 5.3 Hz, 1H), 3.40 (t, J=9.1 Hz, 1H), 2.73 (td, J=13.0, 4.7 Hz, 2H), 2.04-1.96 (m, 9H), 1.77 (t, J=12.2 Hz, 1H), 1.67 (t, J=12.1 Hz, 1H). 13C NMR (200 MHz, D2O) δ 174.97, 174.90, 174.80, 173.87, 173.39, 135.98, 128.75, 128.44, 127.79, 103.38, 102.82, 102.52, 100.16, 99.58, 82.15, 81.77, 77.79, 76.11, 75.56, 75.04, 75.00, 74.95, 73.69, 72.74, 72.46, 71.79, 71.66, 71.36, 69.94, 69.06, 68.38, 68.30, 68.19, 68.14, 68.01, 67.37, 67.23, 62.83, 62.53, 62.40, 61.12, 61.09, 61.01, 59.83, 59.48, 54.58, 51.81, 51.61, 40.05, 39.75, 22.27, 22.01, 21.98. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C56H85N4O38 1421.4847; found 1421.4830.
Neu5Acα3Galβ3(Neu5Acα6)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (FDS-LNT-IIβNHCbz, 47). The same procedure described above for the formation of LNFPVβNHCbz (5) from LacβNHCbz (100 mg, 0.21 mmol) was followed without the final centrifugation and purification steps. After the pentasaccharide formation was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature, CTP (0.42 mmol) and Neu5Ac (0.32 mmol) were added. NmCSS (1.5 mg) and PmST1_M144D (3 mg) were then added and the pH of the reaction was kept at pH 7.5. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes and cooled down to room temperature. CTP (0.42 mmol) and Neu5Ac (0.32 mmol) were added. NmCSS (1.5 mg) and MBP-Δ50hST6GALNAC V-His6 purified from 4 L culture were added. The reaction mixture was incubated at 30° C. for 30 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (132 mg, 40%). 1H NMR (800 MHz, D2O) δ 7.46-7.24 (m, 5H), 5.43 (d, J=4.0 Hz, 1H), 5.17 (s, 2H), 4.82-4.80 (m, 1H), 4.69 (d, J=8.8 Hz, 1H), 4.50 (d, J=8.0 Hz, 1H), 4.42 (d, J=8.0 Hz, 1H), 4.11-3.46 (m, 41H), 2.77-2.71 (m, 2H), 2.02 (s, 6H), 2.01 (s, 3H), 1.77 (t, J=12.0 Hz, 1H), 1.68 (t, J=12.0 Hz, 1H), 1.15 (d, J=6.4 Hz, 3H). 13C NMR (200 MHz, D2O) δ 175.00, 174.92, 174.79, 173.88, 173.39, 158.10, 136.00, 128.75, 128.44, 127.75, 120.35, 103.37, 102.38, 101.64, 100.14, 99.59, 98.45, 81.75, 77.60, 76.75, 75.57, 75.05, 74.56, 73.68, 73.09, 72.75, 72.45, 72.02, 71.87, 71.81, 71.66, 70.59, 69.15, 69.07, 68.42, 68.38, 68.32, 68.21, 68.06, 68.02, 67.95, 67.35, 67.24, 66.51, 62.91, 62.55, 62.41, 61.62, 61.02, 60.04, 59.55, 54.59, 51.84, 51.63, 40.08, 39.76, 22.27, 22.02, 22.00, 15.20. HRMS (ESI-Orbitrap) m/z: [M−2H]2− calculated for C62H96N4O42 783.2677; found 783.2671.
Neu5Acα3Galβ3GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (34). LNFP-VβNHCbz (9) (50 mg, 0.051 mmol), CTP (0.13 mmol), and Neu5Ac (0.076 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.4 mg) and PmST1 M144D (0.4 mg) were added and the final concentration of LNFP-V βNHCbz (9) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (59 mg, 91%). 1H NMR (600 MHz, D2O) δ 7.49-7.38 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 4.81 (s, 2H), 4.74 (d, J=8.5 Hz, 1H), 4.52 (d, J=7.8 Hz, 1H), 4.43 (d, J=7.8 Hz, 1H), 4.09 (dt, J=7.4, 3.4 Hz, 2H), 3.96-3.77 (m, 15H), 3.75-3.46 (m, 20H), 2.76 (dd, J=12.4, 4.6 Hz, 1H), 2.03 (d, J=3.3 Hz, 6H), 1.79 (t, J=12.1 Hz, 1H), 1.16 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.94, 174.85, 173.87, 158.10, 136.02, 128.77, 128.46, 127.78, 103.31, 102.39, 101.64, 99.61, 98.46, 82.05, 81.81, 81.32, 77.60, 76.74, 75.58, 75.16, 75.06, 74.48, 73.10, 72.78, 72.00, 71.88, 71.82, 70.69, 69.18, 69.06, 68.44, 68.36, 68.22, 68.03, 67.96, 67.38, 67.22, 66.51, 62.45, 61.46, 61.28, 61.01, 60.49, 59.60, 59.40, 54.56, 51.64, 39.77, 22.26, 22.02, 15.18. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H78N3O34 1276.4472; found 1276.4448.
Neu5Acα3Galβ3(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (36). LNDFH-IIβNHCbz (10) (50 mg, 0.044 mmol), CTP (0.11 mmol), and Neu5Ac (0.066 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.3 mg) and PmST1 M144D (0.4 mg) were added and the final concentration of LNDFH-IIβNHCbz (10) was about 20 mM. The reaction mixture was incubated at 30° C. for 16 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (41 mg, 65%). 1H NMR (400 MHz, D2O) δ 7.56-7.31 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 5.01 (t, J=3.7 Hz, 1H), 4.93-4.79 (m, 3H), 4.83 (d, J=1.9 Hz, OH), 4.69 (dd, J=11.8, 8.5 Hz, 2H), 4.53 (dd, J=21.5, 7.7 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.10-4.03 (m, 2H), 3.97-3.76 (m, 17H), 3.71-3.47 (m, 17H), 2.76 (dt, J=12.6, 5.2 Hz, 1H), 2.03 (dd, J=2.8, 1.0 Hz, 6H), 1.74 (dt, J=21.7, 12.2 Hz, 1H), 1.17 (td, J=3.7, 1.9 Hz, 6H). 13C NMR (100 MHz, D2O) δ 174.92, 173.92, 173.46, 136.02, 135.30, 128.78, 128.47, 128.03, 127.78, 102.80, 102.43, 101.68, 100.79, 99.36, 98.46, 77.59, 76.74, 75.59, 75.22, 75.16, 74.48, 73.53, 73.10, 72.72, 72.17, 72.02, 71.88, 69.19, 69.06, 68.26, 68.15, 67.96, 67.85, 67.79, 67.40, 66.95, 66.80, 66.73, 66.51, 65.26, 62.64, 61.64, 61.48, 60.93, 59.66, 59.59, 59.50, 55.79, 22.00, 15.18.
Neu5Acα3Galβ3(Fucα4)GlcNAcβ3Galβ4GlcβNHCbz (37). LNFP-IIβNHCbz (25) (50 mg, 0.051 mmol), CTP (0.13 mmol), and Neu5Ac (0.076 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.4 mg) and PmST1 M144D (0.4 mg) were added into the reaction and the final concentration of LNFP-IIβNHCbz (25) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (56 mg, 86%). 1H NMR (600 MHz, D2O) δ 7.54-7.26 (m, 5H), 5.19 (d, J=3.7 Hz, 2H), 5.01 (d, J=4.0 Hz, 1H), 4.87 (d, J=6.8 Hz, 1H), 4.71 (d, J=8.5 Hz, 1H), 4.55 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.9 Hz, 1H), 4.15 (d, J=3.3 Hz, 1H), 4.12-4.01 (m, 2H), 3.97-3.58 (m, 28H), 3.55-3.49 (m, 3H), 3.42 (t, J=9.0 Hz, 1H), 2.77 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (d, J=5.2 Hz, 6H), 1.77 (t, J=12.2 Hz, 1H), 1.18 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.92, 174.59, 173.91, 158.09, 136.01, 128.77, 128.47, 127.82, 102.86, 102.73, 102.52, 99.37, 97.98, 81.94, 81.65, 77.76, 76.12, 75.87, 75.60, 75.25, 75.01, 74.86, 74.73, 72.73, 72.03, 71.92, 71.85, 71.39, 69.95, 69.09, 68.78, 68.40, 68.31, 68.01, 67.78, 67.40, 66.92, 66.82, 62.31, 61.64, 60.97, 59.83, 59.60, 55.80, 51.66, 39.98, 22.38, 22.03, 15.32. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H78N3O34 1276.4472; found 1276.4432.
Neu5Acα3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (40). LNnFP-VβNHCbz (5) (50 mg, 0.051 mmol), CTP (0.13 mmol), and Neu5Ac (0.076 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.4 mg) and PmST3 (0.4 mg) were added and the final concentration of LNnFP-VβNHCbz (5) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (60 mg, 92%). 1H NMR (600 MHz, D2O) δ 7.48-7.39 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 4.82 (d, J=6.9 Hz, 2H), 4.69 (d, J=8.3 Hz, 1H), 4.56 (d, J=7.9 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.13-4.08 (m, 2H), 3.99-3.68 (m, 25H), 3.66-3.55 (m, 8H), 3.52-3.48 (m, 1H), 2.76 (dd, J=12.4, 4.6 Hz, 1H), 2.07-2.00 (m, 6H), 1.80 (t, J=12.2 Hz, 1H), 1.17 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.99, 174.83, 173.83, 136.02, 128.77, 128.46, 127.78, 102.72, 102.53, 101.67, 99.78, 98.46, 81.81, 81.51, 78.02, 77.60, 76.74, 75.47, 75.15, 74.50, 74.48, 73.09, 72.87, 72.06, 72.01, 71.89, 71.74, 70.61, 69.36, 69.18, 68.33, 68.24, 68.07, 67.97, 67.45, 67.37, 66.51, 62.56, 61.48, 61.32, 61.01, 59.83, 59.60, 59.37, 55.13, 51.66, 39.62, 22.14, 22.02, 15.19. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H78N3O34 1276.4472; found 1276.4436.
Neu5Acα3Galβ3(Fucα4)GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (42). LNnDFH-IIβNHCbz (6) (50 mg, 0.044 mmol), CTP (0.11 mmol), and Neu5Ac (0.066 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.3 mg) and PmST1 M144D (0.4 mg) were added and the final concentration of LNnDFH-IIβNHCbz (6) was about 20 mM. The reaction mixture was incubated at 30° C. for 10 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (57 mg, 91%). 1H NMR (400 MHz, D2O) δ 7.49-7.39 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 5.13 (d, J=4.0 Hz, 1H), 4.84-4.81 (m, 2H), 4.70 (d, J=8.4 Hz, 1H), 4.53 (d, J=7.9 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.13-4.06 (m, 2H), 3.99-3.77 (m, 18H), 3.74-3.47 (m, 19H), 2.82-2.70 (m, 1H), 2.11-1.98 (m, 6H), 1.79 (t, J=12.1 Hz, 1H), 1.16 (dd, J=6.7, 3.2 Hz, 6H). 13C NMR (100 MHz, D2O) δ 175.01, 174.63, 174.37, 173.84, 128.78, 128.47, 127.78, 102.52, 101.68, 101.55, 99.64, 98.54, 98.47, 81.82, 81.54, 77.60, 76.73, 75.66, 74.98, 74.90, 74.63, 74.48, 73.10, 73.02, 72.90, 71.98, 71.89, 71.84, 70.63, 69.25, 69.17, 68.29, 68.24, 68.08, 67.96, 67.69, 67.40, 67.37, 67.29, 66.64, 66.51, 62.58, 61.48, 61.01, 59.63, 55.93, 51.68, 22.22, 22.01, 15.26, 15.18.
Neu5Acα3Gal β4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (43). LNFP-IIIβNHCbz (30) (50 mg, 0.051 mmol), CTP (0.13 mmol), and Neu5Ac (0.076 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.4 mg) and PmST1 M144D (0.4 mg) were added and the final concentration of LNFP-IIIβNHCbz (30) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (59 mg, 90%). 1H NMR (400 MHz, D2O) δ 7.56-7.35 (m, 5H), 5.19 (s, 2H), 5.12 (d, J=4.0 Hz, 1H), 4.83 (s, 2H), 4.71 (d, J=8.3 Hz, 1H), 4.53 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.7 Hz, 1H), 4.16 (d, J=3.2 Hz, 1H), 4.09 (dd, J=9.8, 3.2 Hz, 1H), 3.99-3.85 (m, 11H), 3.83-3.75 (m, 5H), 3.73-3.64 (m, 14H), 3.62-3.49 (m, 5H), 3.41 (t, J=8.8 Hz, 1H), 2.76 (dt, J=12.6, 4.5 Hz, 1H), 2.10-1.98 (m, 6H), 1.75 (dt, J=37.4, 12.1 Hz, 1H), 1.17 (d, J=6.4 Hz, 3H). 13C NMR (100 MHz, D2O) δ 175.01, 174.66, 173.83, 128.77, 128.47, 127.81, 102.85, 102.56, 101.54, 99.64, 98.55, 82.02, 81.67, 77.73, 76.12, 75.65, 75.01, 74.87, 73.03, 71.84, 71.39, 69.95, 69.25, 69.16, 68.29, 68.09, 67.69, 67.40, 67.30, 66.65, 66.60, 62.58, 61.47, 60.98, 59.85, 59.56, 55.94, 51.68, 22.23, 22.01, 15.26.
Neu5Acα6Galβ3GlcNAcβ3Galβ4GlcβNHCbz (LSTcβNHCbz, 33). LNTβNHCbz (7) (50 mg, 0.06 mmol), CTP (0.15 mmol), and Neu5Ac (0.09 mm ol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.5 mg) and Pd2,6ST A200Y/S292Y (0.4 mg) were added and the final concentration of LNTβNHCbz (7) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (62 mg, 92%). 1H NMR (600 MHz, D2O) δ 7.49-7.39 (m, 5H), 5.18 (d, J=3.8 Hz, 2H), 4.82 (d, J=9.7 Hz, 1H), 4.74 (d, J=8.5 Hz, 1H), 4.45 (d, J=7.9 Hz, 1H), 4.39 (d, J=7.8 Hz, 1H), 4.16 (d, J=3.3 Hz, 1H), 4.00-3.70 (m, 19H), 3.69-3.49 (m, 12H), 3.42 (t, J=8.9 Hz, 1H), 2.70 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (d, J=4.5 Hz, 6H), 1.70 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.95, 174.88, 173.47, 158.08, 136.00, 128.77, 128.47, 127.81, 103.86, 102.84, 102.52, 100.11, 83.63, 81.89, 81.64, 77.76, 76.13, 75.28, 75.02, 74.90, 73.58, 72.43, 72.36, 71.80, 71.39, 70.51, 70.01, 68.64, 68.44, 68.39, 68.31, 68.28, 67.40, 63.51, 62.61, 61.17, 60.98, 60.63, 59.85, 59.48, 54.43, 51.78, 40.13, 22.19, 22.03. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C45H68N3O30 1130.3893; found 1130.3860.
Neu5Acα6Galβ3GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (35) LNFP-VβNHCbz (9) (50 mg, 0.051 mmol), CTP (0.13 mmol), and Neu5Ac (0.076 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.4 mg) and Pd2,6ST A200Y/S292Y (0.4 mg) were added and the final concentration of LNFP-VβNHCbz (9) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (60 mg, 93%). 1H NMR (600 MHz, D2O) δ 7.51-7.37 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 4.82 (d, J=7.0 Hz, 2H), 4.74 (d, J=8.5 Hz, 1H), 4.43 (d, J=7.8 Hz, 1H), 4.39 (d, J=7.8 Hz, 1H), 4.10 (d, J=3.4 Hz, 1H), 4.00-3.69 (m, 23H), 3.68-3.49 (m, 12H), 2.71 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (d, J=6.9 Hz, 6H), 1.70 (t, J=12.2 Hz, 1H), 1.16 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.93, 174.88, 173.47, 158.11, 136.02, 128.78, 128.46, 127.78, 103.85, 102.46, 101.65, 100.12, 98.46, 83.54, 81.81, 81.42, 77.60, 76.74, 75.25, 74.50, 73.57, 73.09, 72.44, 72.35, 72.01, 71.89, 71.80, 70.68, 70.51, 69.18, 68.63, 68.44, 68.39, 68.28, 68.22, 67.97, 67.38, 66.51, 63.51, 62.61, 61.48, 61.06, 60.62, 59.56, 54.43, 51.78, 40.13, 22.18, 22.03, 15.19. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H78N3O34 1276.4472; found 1276.4432.
Neu5Acα6Galβ4GlcNAcβ3Galβ4GlcβNHCbz (39). LNnTβNHCbz (3) (50 mg, 0.06 mmol), CTP (0.15 mmol), and Neu5Ac (0.09 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.5 mg) and Pd2,6ST A200Y/S292Y (0.4 mg) were added and the final concentration of LNnTβNHCbz (3) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (63 mg, 94%). 1H NMR (600 MHz, D2O) δ 7.48-7.39 (m, 5H), 5.18 (d, J=3.6 Hz, 2H), 4.83 (s, 1H), 4.73 (d, J=7.6 Hz, 1H), 4.45 (dd, J=7.9, 5.1 Hz, 2H), 4.15 (d, J=3.3 Hz, 1H), 4.02-3.51 (m, 32H), 3.42 (t, J=9.0 Hz, 1H), 2.67 (dd, J=12.4, 4.6 Hz, 1H), 2.04 (d, J=14.9 Hz, 6H), 1.72 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.90, 173.52, 158.09, 136.01, 128.77, 128.47, 127.82, 103.43, 102.85, 102.55, 100.12, 81.92, 81.65, 80.46, 77.75, 76.12, 75.01, 74.87, 74.25, 73.68, 72.52, 72.41, 72.21, 71.69, 71.40, 70.72, 69.96, 68.38, 68.34, 68.32, 68.19, 67.40, 63.32, 62.64, 61.08, 60.97, 60.13, 59.83, 59.55, 54.92, 51.87, 40.06, 22.26, 22.01. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C45H68N3O30 1130.3893; found 1130.3865.
Neu5Acα6Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (41). LNnFP-VβNHCbz (5) (50 mg, 0.051 mmol), CTP (0.13 mmol), and Neu5Ac (0.076 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.4 mg) and Pd2,6ST_A200Y/S292Y (0.4 mg) were added and the final concentration of LNnFP-VβNHCbz (5) was about 20 mM. The reaction mixture was incubated at 30° C. for 6 h with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (60 mg, 92%). 1H NMR (600 MHz, D2O) δ 7.51-7.38 (m, 5H), 5.44 (d, J=4.0 Hz, 1H), 5.18 (s, 2H), 4.84-4.80 (m, 2H), 4.73 (d, J=7.6 Hz, 1H), 4.44 (dd, J=16.8, 7.9 Hz, 2H), 4.10 (d, J=3.4 Hz, 1H), 4.02-3.77 (m, 17H), 3.76-3.49 (m, 18H), 2.67 (dd, J=12.4, 4.7 Hz, 1H), 2.04 (d, J=11.5 Hz, 6H), 1.72 (t, J=12.2 Hz, 1H), 1.17 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.89, 174.86, 173.52, 158.10, 136.02, 128.78, 128.46, 127.78, 103.45, 102.51, 101.68, 100.12, 98.46, 81.81, 81.48, 80.47, 77.59, 76.74, 74.48, 74.22, 73.68, 73.10, 72.52, 72.40, 72.19, 72.01, 71.88, 71.69, 70.72, 70.60, 69.18, 68.38, 68.34, 68.23, 68.18, 67.97, 67.37, 66.52, 63.32, 62.63, 61.48, 61.13, 60.13, 59.59, 59.51, 54.88, 51.87, 40.06, 22.24, 22.01, 15.19. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H78N3O34 1276.4472; found 1276.4433.
Galβ33(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz (LSTbβNHCbz, 44). LNTβNHCbz (7) (15 mg, 0.018 mmol), CTP (0.045 mmol), and Neu5Ac (0.027 mmol) were dissolved in water in a 15 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.3 mg) and MBP-Δ50hST6GALNAC V-His6 (1/3 of the purified enzyme from 2 L culture) were added and the concentration of LNTβNHCbz (7) was about 10 mM. The reaction mixture was incubated at 30° C. for 42 h with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (18 mg, 89%). 1H NMR (800 MHz, D2O) δ 7.48-7.37 (m, 5H), 5.16 (d, J=6.3 Hz, 2H), 4.68 (d, J=8.5 Hz, 1H), 4.42 (dd, J=7.8, 2.7 Hz, 2H), 4.16 (d, J=3.3 Hz, 1H), 3.96 (dd, J=10.9, 5.2 Hz, 1H), 3.93-3.48 (m, 29H), 3.40 (t, J=9.1 Hz, 1H), 2.73 (dd J=12.4, 4.7 Hz, 1H), 2.01 (dd, J=9.0, 0.8 Hz, 6H), 1.67 (t, J=12.2 Hz, 1H). 13C NMR (200 MHz, D2O) δ 174.99, 174.86, 173.38, 158.08, 135.99, 128.75, 128.45, 127.80, 103.38, 102.81, 102.60, 100.15, 82.19, 81.58, 77.75, 76.11, 75.23, 75.00, 74.95, 73.67, 72.47, 72.44, 71.67, 71.35, 70.65, 69.94, 68.54, 68.31, 68.28, 68.19, 68.14, 67.37, 62.79, 62.55, 61.08, 61.00, 59.83, 54.68, 51.81, 40.05, 22.18, 21.98. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C45H68N3O30 1130.3893; found 1130.3869.
Fucα2Galβ3(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz ((F-LSTbβNHCbz, 45). LSTbβNHCbz (44) (30 mg, 0.027 mmol), L-fucose (0.041 mmol), GTP (0.041 mmol) were dissolved in water in a 15 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). BfFKP (1 mg), Hm2FT (2 mg), and PmPpA (0.5 mg) were added and the concentration of LSTbβ3NHCbz (44) was about 10 mM. The reaction mixture was incubated at 30° C. for 3 days with agitation at 180 rpm. The product formation was monitored by TLC with EtOAc:MeOH:H2O=5:2.2:1 (by volume) as the developing solvent and by HRMS. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (33 mg, 98%). 1H NMR (800 MHz, D2O) δ 7.47-7.24 (m, 5H), 5.20-5.15 (m, 3H), 4.82-4.80 (m, 1H), 4.63 (d, J=8.8 Hz, 1H), 4.58 (d, J=8.0 Hz, 1H), 4.42 (d, J=8.0 Hz, 1H), 4.33-3.39 (m, 41H), 2.77-2.71 (m, 2H), 2.02 (s, 6H), 2.01 (s, 3H), 1.77 (t, J=12.0 Hz, 1H), 1.68 (t, J=12.0 Hz, 1H), 1.15 (d, J=6.4 Hz, 3H). 2.73 (dd, J=12.4, 4.8 Hz, 1H), 2.04 (s, 3H), 2.02 (s, 3H), 1.65 (t, J=12.0 Hz, 1H), 1.24 (d, J=6.4, 2H). 13C NMR (200 MHz, D2O) δ 175.03, 174.16, 173.40, 173.36, 158.09, 135.99, 128.76, 128.45, 127.79, 103.29, 103.27, 102.87, 100.29, 100.21, 100.14, 99.50, 81.84, 81.62, 81.54, 77.65, 77.01, 76.68, 76.34, 76.14, 75.37, 75.09, 75.00, 74.90, 73.80, 73.52, 72.50, 71.82, 71.68, 71.38, 70.19, 70.13, 69.42, 69.39, 69.13, 69.01, 68.45, 68.39, 68.33, 68.27, 68.20, 68.05, 67.39, 66.50, 62.72, 62.56, 61.09, 61.07, 60.97, 60.77, 59.84, 59.76, 54.98, 51.86, 51.76, 40.08, 22.12, 21.99, 15.29. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C51H78N3O34 1276.4472; found 1276.4471.
Neu5Acα3Galβ3(Neu5Acα6)GlcNAcβ3Galβ4GlcβNHCbz (46). Neu5Acα2-3LNTβNHCbz (32) (100 mg, 0.087 mmol), CTP (77 mg, 0.15 mmol), and Neu5Ac (40 mg, 0.13 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). NmCSS (0.8 mg) and MBP-Δ50hST6GALNAC V-His6_PROSS V7 (4.5 mg) were then added. The reaction mixture (8.6 mL) was incubated at 30° C. with agitation at 180 rpm. The product formation was monitored by HRMS. After the reaction went completion (30 h), the reaction mixture was incubated in a boiling water bath for 5 min to denature the enzymes, cooled down to room temperature, then centrifuged at 8000 rpm for 30 min at 4° C. The supernatant was concentrated and purified by a C18 cartridge (51 g, 50 μm, 120 Å, Yamazen) on a CombiFlash system to obtain the pure product as a white powder (124 mg, 98%). NMR data were consistent with those obtained for DSLNTβNHCbz (46) obtained by the StOPMe process described above.
Hydrogenation procedures. To remove the Cbz tag from the obtained glycosides, a catalytic amount (10-20%) of palladium on charcoal (Pd/C) (1.5 mg) was added to a solution containing 15 mg of each compound with a 20 mM final concentration in H2O:MeOH=1:1 (by volume). The mixture was stirred at room temperature under a hydrogen atmosphere with a balloon for 2-3 hours. The reaction was monitored by HRMS analyses. When the reaction was completed, the mixture was passed through a 0.45 m syringe filter to remove palladium and charcoal. The solvent was removed in vacuo. The residue obtained was dissolved in H2O and the concentration was kept at 20 mM. The mixture was incubated at 37° C. with 160 rpm agitation in an incubator shaker. The reaction was monitored by HRMS analyses until the β-glycosylamine was completely converted to the target HMO with a free reducing end (4-5 days for HMOs containing an L-fucose linked to the reducing end Glc residue or 20-48 hours for other HMOs). The pure HMO was obtained by lyophilization without further purification. NMR data of 3a and 5a-10a were consistent with those described herein prepared by the MSOPME strategy described herein.
Fucα2Galβ3(Fucα4)GlcNAcβ3Galβ4(Fucα3)Glc (23a) (LNTFHep, 12.2 mg, 91%). 1H NMR (400 MHz, D2O) δ 5.42 (d, J=4.1 Hz, 0.4H), 5.36 (d, J=4.0 Hz, 0.3H), 5.18 (d, J=3.8 Hz, 0.4H), 5.15 (d, J=4.0 Hz, 1H), 5.03 (d, J=3.9 Hz, 1H), 4.87 (d, J=6.8 Hz, 1H), 4.65 (dd, J=7.9, 4.8 Hz, 1.4H), 4.60 (dt, J=8.4, 1.9 Hz, 1H), 4.40 (d, J=7.8 Hz, 1H), 4.35 (q, J=6.6 Hz, 1H), 4.16-4.11 (m, 1H), 4.07 (d, J=3.5 Hz, 1H), 3.93 (tdd, J=8.5, 4.7, 1.5 Hz, 4H), 3.90-3.67 (m, 20H), 3.65-3.43 (m, 6H), 2.06 (d, J=4.5 Hz, 3H), 1.33-1.10 (m, 9H). 13C NMR (100 MHz, D2O) δ 174.16, 103.19, 101.68, 100.59, 99.52, 97.74, 96.20, 92.12, 81.13, 77.09, 76.43, 75.49, 75.09, 74.71, 74.44, 73.59, 72.89, 72.62, 71.97, 71.73, 70.84, 69.37, 69.21, 69.08, 68.71, 68.54, 68.11, 67.78, 67.00, 66.42, 66.21, 62.45, 61.57, 61.49, 59.43, 55.79, 22.15, 15.33. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C44H76NO33 1146.4300; found 1146.4315.
Fucα2Galβ3GlcNAcβ3Galβ4(Fucα3)Glc (24a) (LNDFH-III, 12 mg, 90%). 1H NMR (400 MHz, D2O) δ 5.42 (d, J=4.0 Hz, 0.4H), 5.36 (d, J=4.0 Hz, 0.3H), 5.18 (dd, J=3.9, 2.2 Hz, 1.4H), 4.67-4.61 (m, 2.4H), 4.40 (d, J=7.8 Hz, 1H), 4.29 (q, J=6.5 Hz, 1H), 4.08 (d, J=3.5 Hz, 1H), 4.02-3.66 (m, 21H), 3.61 (ddt, J=17.3, 7.7, 3.5 Hz, 3H), 3.5-3.44 (m, 3H), 2.05 (d, J=4.7 Hz, 3H), 1.33-1.08 (m, 6H). 13C NMR (100 MHz, D2O) δ 174.20, 103.19, 101.68, 100.21, 99.49, 98.50, 96.20, 81.87, 81.45, 81.16, 77.14, 76.65, 75.17, 75.04, 74.64, 74.40, 73.46, 72.90, 72.62, 72.17, 72.07, 71.86, 71.64, 71.42, 71.11, 70.96, 70.90, 69.39, 69.26, 69.22, 69.11, 68.67, 68.46, 67.94, 66.45, 61.45, 61.13, 61.05, 60.36, 59.79, 55.01, 22.11, 15.48, 15.26. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C38H66NO29 1000.3721; found 1000.3728.
Galβ3(Fucα4)GlcNAcβ3Galβ4Glc (25a) (LNFP-II, 12.1 mg, 94%). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 5.03 (d, J=4.0 Hz, 1H), 4.88 (t, J=6.7 Hz, 1H), 4.70 (dd, J=8.5, 2.7 Hz, 1H), 4.66 (d, J=8.0 Hz, 0.5H), 4.50 (d, J=7.6 Hz, 1H), 4.43 (d, J=7.8 Hz, 1H), 4.15 (d, J=3.3 Hz, 1H), 4.08 (t, J=9.7 Hz, 1H), 3.95 (ddd, J=10.0, 5.1, 2.2 Hz, 3H), 3.90-3.70 (m, 15H), 3.66-3.52 (m, 7H), 3.48 (dd, J=9.8, 7.7 Hz, 1H), 3.31-3.24 (m, 0.5H), 2.03 (s, 3H), 1.18 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.74, 102.91, 102.87, 102.83, 102.60, 97.99, 95.71, 91.78, 82.05, 78.35, 78.25, 75.89, 75.20, 74.86, 74.79, 74.77, 74.32, 73.76, 72.29, 72.10, 71.92, 71.37, 71.10, 70.47, 70.10, 69.94, 69.11, 68.33, 68.29, 67.76, 66.82, 61.63, 60.94, 60.05, 59.92, 59.58, 55.85, 22.26, 15.34. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C32H56NO25854.3141; found 854.3152.
Fucα2Galβ3(Fucα4)GlcNAcβ3Galβ4Glc (26a) (LNDFH-I, 12.1 mg, 90%). 1H NMR (400 MHz, D2O) δ 5.24-5.19 (m, 0.3H), 5.15 (d, J=3.9 Hz, 1H), 5.02 (d, J=3.9 Hz, 1H), 4.87 (d, J=6.7 Hz, 1H), 4.66 (dd, J=7.9, 1.5 Hz, 1.5H), 4.60 (dd, J=8.4, 1.5 Hz, 1H), 4.42 (d, J=7.9 Hz, 1H), 4.34 (q, J=6.7 Hz, 1H), 4.18-4.08 (m, 2H), 3.99-3.88 (m, 3H), 3.91-3.67 (m, 19H), 3.68-3.48 (m, 7H), 3.34-3.23 (m, 0.5H), 2.06 (s, 3H), 1.26 (t, J=6.5 Hz, 6H). 13C NMR (100 MHz, D2O) δ 174.16, 112.71, 103.21, 102.95, 100.61, 99.54, 97.76, 95.71, 81.52, 78.13, 76.47, 75.15, 74.79, 74.73, 74.46, 74.30, 73.77, 73.61, 71.97, 71.75, 70.13, 69.41, 69.08, 68.72, 68.25, 67.78, 67.00, 66.22, 61.57, 60.94, 59.44, 55.73, 22.15, 15.34, 15.30. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C38H66NO29 1000.3721; found 1000.3730.
Fucα2Galβ4GlcNAcβ3Galβ4Glc (27a) (LNnFP-I, 12.3 mg, 95% yield). 1H NMR (600 MHz, D2O) δ 5.31 (t, J=2.5 Hz, 1H), 5.22 (t, J=2.8 Hz, 0.3H), 4.70 (dt, J=8.4, 2.5 Hz, 1H), 4.66 (dd, J=8.0, 2.1 Hz, 0.6H), 4.55 (dd, J=7.8, 2.2 Hz, 1H), 4.46-4.41 (m, 1H), 4.25-4.19 (m, 1H), 4.14 (t, J=2.7 Hz, 1H), 3.95 (tq, J=8.0, 3.2, 2.7 Hz, 2H), 3.88 (dq, J=12.0, 2.7 Hz, 2H), 3.84-3.76 (m, 9H), 3.75-3.63 (m, 9H), 3.62-3.56 (m, 2H), 3.46 (ddd, J=10.1, 5.1, 2.4 Hz, 1H), 3.30-3.24 (m, 0.5H), 2.04 (d, J=2.1 Hz, 3H), 1.23 (dd, J=6.7, 2.1 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.89, 102.89, 102.73, 100.22, 99.38, 95.71, 91.78, 81.94, 78.35, 78.24, 76.41, 75.86, 75.23, 75.07, 74.83, 74.77, 74.32, 73.76, 73.51, 72.04, 71.64, 71.37, 71.10, 70.10, 69.97, 69.59, 69.09, 68.30, 68.17, 66.92, 61.09, 60.90, 60.05, 59.98, 59.92, 55.36, 22.17, 15.29. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C32H56NO25854.3141; found 854.3126.
Fucα2Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)Glc (28a) (LNnTFHep, 12.4 mg, 93%). 1H NMR (400 MHz, D2O) δ 5.35 (d, J=4.1 Hz, 0.4H), 5.29 (d, J=4.1 Hz, 0.3H), 5.20 (d, J=3.1 Hz, 1H), 5.10 (d, J=3.7 Hz, 0.4H), 5.04 (t, J=3.5 Hz, 1H), 4.80 (d, J=6.8 Hz, 1H), 4.64 (s, 1H), 4.57 (d, J=8.0 Hz, 0.6H), 4.43 (d, J=7.7 Hz, 1H), 4.34 (d, J=7.8 Hz, 1H), 4.17 (q, J=6.6 Hz, 1H), 4.00 (d, J=3.4 Hz, 1H), 3.93-3.51 (m, 30H), 3.45-3.34 (m, 2H), 1.94 (d, J=2.5 Hz, 3H), 1.20-1.07 (m, 9H). 13C NMR (100 MHz, D2O) δ 174.65, 102.41, 101.70, 100.18, 99.39, 98.57, 96.20, 95.81, 76.97, 76.33, 75.52, 75.30, 74.83, 74.79, 74.65, 74.45, 73.54, 73.03, 72.90, 72.34, 72.26, 71.92, 71.67, 71.64, 71.42, 70.93, 70.84, 69.71, 69.25, 69.24, 69.16, 68.73, 68.25, 68.01, 67.98, 67.67, 66.90, 66.76, 66.46, 66.42, 61.45, 59.76, 59.72, 22.24, 15.41, 15.18. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C44H76NO33 1146.4300; found 1146.4318.
Fucα2Galβ4GlcNAcβ3Galβ4(Fucα3)Glc (29a) (LNnDFH-III, 12.1 mg, 90%). 1H NMR (400 MHz, D2O) δ 5.43 (d, J=4.0 Hz, 0.5H), 5.37 (d, J=4.1 Hz, 0.4H), 5.31 (d, J=2.8 Hz, 1H), 5.18 (d, J=3.8 Hz, 0.4H), 4.73-4.69 (m, 1H), 4.65 (d, J=8.0 Hz, 0.4H), 4.55 (dd, J=7.8, 1.1 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.22 (q, J=6.6 Hz, 1H), 4.09 (d, J=3.3 Hz, 1H), 3.99-3.67 (m, 26H), 3.60 (dt, J=6.2, 2.9 Hz, 1H), 3.53-3.45 (m, 2H), 2.04 (d, J=3.0 Hz, 3H), 1.24-1.15 (m, 6H). 13C NMR (100 MHz, D2O) δ 174.86, 102.70, 101.72, 100.23, 99.38, 95.82, 76.99, 76.41, 75.88, 75.35, 75.24, 75.05, 74.47, 73.53, 72.90, 72.67, 72.40, 72.02, 71.90, 71.65, 70.93, 70.85, 70.69, 69.60, 69.25, 69.10, 68.18, 68.02, 66.93, 66.47, 61.11, 60.01, 55.34, 22.16, 15.29, 15.20. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C38H66NO29 1000.3721; found 1000.3732.
Galβ4(Fucα3)GlcNAcβ3Galβ4Glc (30a) (LNFP-III, 12.3 mg, 95%). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 5.13 (d, J=4.0 Hz, 1H), 4.84 (d, J=6.8 Hz, 1H), 4.72 (dd, J=8.5, 2.5 Hz, 1H), 4.66 (d, J=8.0 Hz, 0.5H), 4.45 (dd, J=16.2, 7.8 Hz, 2H), 4.15 (d, J=3.3 Hz, 1H), 4.00-3.93 (m, 4H), 3.92-3.56 (m, 21H), 3.50 (dd, J=9.8, 7.8 Hz, 1H), 3.32-3.24 (m, 0.5H), 2.02 (s, 3H), 1.17 (d, J=6.6 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.67, 102.90, 102.87, 102.52, 101.74, 98.58, 95.71, 91.78, 82.04, 78.33, 78.23, 75.10, 74.89, 74.85, 74.77, 74.72, 74.32, 73.76, 73.04, 72.45, 71.89, 71.37, 71.10, 71.03, 70.09, 69.95, 69.17, 68.32, 68.29, 67.68, 66.67, 61.48, 60.94, 60.05, 59.92, 59.61, 55.94, 22.23, 15.29. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C32H56NO25854.3141; found 854.3127.
Fucα2Galβ4(Fucα3)GlcNAcβ3Galβ4Glc (31a) (LNnDFH-I, 12.2 mg, 92% yield). 1H NMR (600 MHz, D2O) δ 5.27 (d, J=3.5 Hz, 1H), 5.22 (d, J=3.8 Hz, 0.3H), 5.11 (d, J=4.0 Hz, 1H), 4.88 (d, J=6.7 Hz, 1H), 4.72 (d, J=10.4 Hz, 1H), 4.68-4.63 (m, 0.6H), 4.51 (d, J=7.8 Hz, 1H), 4.44 (d, J=7.8 Hz, 1H), 4.25 (q, J=6.7 Hz, 1H), 4.16-4.12 (m, 1H), 4.03-3.89 (m, 5H), 3.88-3.55 (m, 23H), 3.48-3.40 (m, 1H), 3.31-3.23 (m, 0.5H), 2.03 (s, 3H), 1.25 (dd, J=17.6, 6.6 Hz, 6H). 13C NMR (150 MHz, D2O) δ 174.70, 102.88, 102.46, 100.18, 99.40, 98.57, 95.71, 91.78, 81.96, 78.33, 78.23, 76.34, 75.35, 74.83, 74.77, 74.32, 73.76, 73.53, 73.05, 71.92, 71.67, 71.37, 71.10, 70.10, 70.01, 69.98, 69.70, 69.15, 68.73, 68.25, 67.67, 66.90, 66.77, 61.45, 60.91, 60.05, 59.76, 56.09, 22.25, 15.44, 15.41. HRMS (ESI-Orbitrap) m/z: [M+H]+ calculated for C38H66NO29 1000.3721; found 1000.3731.
Neu5Acα3Galβ3GlcNAcβ3Galβ4Glc (32a) (LSTa, 12.4 mg, 94% yield). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 4.74 (dd, J=8.4, 2.4 Hz, 1H), 4.67 (d, J=8.0 Hz, 0.6H), 4.51 (d, J=7.8 Hz, 1H), 4.44 (dd, J=7.9, 1.2 Hz, 1H), 4.15 (d, J=3.4 Hz, 1H), 4.09 (dd, J=9.8, 3.2 Hz, 1H), 4.00-3.52 (m, 28H), 3.48 (ddd, J=9.9, 5.1, 2.4 Hz, 1H), 3.28 (dd, J=9.3, 8.1 Hz, 0.5H), 2.76 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (s, 6H), 1.79 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.94, 174.88, 173.87, 103.35, 102.90, 102.47, 99.62, 95.71, 91.79, 82.13, 81.89, 78.41, 78.29, 75.58, 75.19, 75.07, 74.88, 74.78, 74.34, 73.76, 72.78, 71.82, 71.38, 71.10, 70.09, 69.99, 69.06, 68.45, 68.36, 68.29, 68.02, 67.23, 62.44, 61.01, 60.95, 60.50, 60.05, 59.88, 54.56, 51.64, 39.76, 22.28, 22.02. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C37H61N2O29997.3365; found 997.3340.
Neu5Acα3Galβ3GlcNAcβ3Galβ4(Fucα4)Glc (34a) (12.6 mg, 94%). 1H NMR (600 MHz, D2O) δ 5.44 (dd, J=13.0, 4.0 Hz, 1H), 5.37 (d, J=4.0 Hz, 0.4H), 5.18 (d, J=3.8 Hz, 0.4H), 4.75-4.72 (m, 1H), 4.65 (d, J=8.0 Hz, 0.5H), 4.52 (d, J=7.8 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.12-4.07 (m, 2H), 3.99-3.93 (m, 3H), 3.92-3.46 (m, 32H), 2.76 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (d, J=3.1 Hz, 6H), 1.78 (t, J=12.1 Hz, 1H), 1.16 (dd, J=6.7, 3.4 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.94, 174.85, 173.86, 103.31, 102.39, 101.68, 99.61, 98.41, 95.81, 82.05, 81.38, 76.98, 75.57, 75.53, 75.35, 75.16, 75.06, 74.48, 72.77, 72.35, 71.88, 71.82, 70.93, 70.68, 69.19, 69.06, 68.43, 68.36, 68.20, 68.02, 67.22, 66.48, 62.44, 61.56, 61.45, 61.00, 60.49, 54.56, 51.63, 39.76, 22.26, 22.02, 15.18. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3901.
Neu5Acα3Galβ3(Fucα4)GlcNAcβ3Galβ4(Fucα3)Glc (36a) (12.4 mg, 91%). 1H NMR (800 MHz, D2O) δ 5.42 (d, J=4.0 Hz, 0.6H), 5.37 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=4.0 Hz, 0.4H), 5.01-5.00 (m, 1H), 4.89-4.80 (m, 2H), 4.70 (d, J=8.0, 0.6H), 4.67 (d, J=8.0, 0.6H), 4.65 (d, J=8.0, 0.4H), 4.55 (d, J=8.0 Hz, 1H), 4.50 (d, J=8.0 Hz, 1H), 4.41 (d, J=8.0 Hz, 1H), 4.13-3.40 (m, 36H), 2.78-2.73 (m, 1H), 2.03-2.01 (m, 6H), 1.77-1.68 (m, 1H), 1.17-1.14 (m, 6H). 13C NMR (200 MHz, D2O) δ 175.04, 174.91, 174.72, 174.55, 173.92, 173.45, 102.79, 102.72, 102.59, 102.43, 101.71, 100.79, 99.35, 98.52, 98.41, 98.10, 97.96, 95.81, 92.10, 81.56, 81.46, 76.96, 76.00, 75.90, 75.58, 75.52, 75.35, 75.21, 75.13, 74.71, 74.65, 74.48, 72.72, 72.69, 72.67, 72.38, 72.34, 72.25, 72.16, 72.04, 72.01, 71.92, 71.89, 71.87, 71.85, 71.80, 70.93, 70.84, 70.68, 70.64, 70.61, 70.39, 69.24, 69.19, 69.08, 69.05, 68.77, 68.40, 68.25, 68.15, 68.01, 67.98, 67.92, 67.83, 67.77, 66.90, 66.81, 66.72, 66.48, 66.44, 63.40, 62.62, 62.30, 61.64, 61.49, 61.46, 60.52, 59.76, 59.68, 59.60, 59.51, 55.79, 51.80, 51.64, 40.14, 39.97, 22.36, 22.27, 22.02, 21.99, 15.41, 15.31, 15.18. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C49H81N2O371289.4524; found 1289.4521.
Neu5Acα3Galβ3(Fucα4)GlcNAcβ3Galβ4Glc (37a) (12.4 mg, 92% yield). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 5.02 (t, J=4.4 Hz, 1H), 4.88 (t, J=6.7 Hz, 1H), 4.68 (dd, J=17.6, 8.5 Hz, 2H), 4.55 (d, J=7.7 Hz, 0.5H), 4.43 (dd, J=13.8, 7.9 Hz, 1H), 4.17 (dd, J=16.3, 3.4 Hz, 1H), 4.13-4.03 (m, 2H), 3.99-3.92 (m, 4H), 3.91-3.47 (m, 29H), 3.29 (dt, J=13.4, 8.3 Hz, 0.5H), 2.74 (ddd, J=36.3, 12.4, 4.7 Hz, 1H), 2.03 (d, J=6.1 Hz, 6H), 1.75 (dt, J=22.8, 12.2 Hz, 1H), 1.18 (dd, J=6.7, 2.8 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.89, 174.70, 174.59, 173.91, 173.46, 102.84, 102.73, 102.52, 100.26, 99.37, 98.00, 95.71, 95.61, 91.79, 82.21, 82.00, 79.62, 75.60, 75.20, 74.87, 74.79, 74.73, 74.64, 74.60, 73.76, 73.25, 72.73, 72.51, 72.29, 72.14, 72.03, 71.93, 71.85, 71.75, 71.60, 71.38, 71.10, 70.48, 69.95, 69.11, 68.78, 68.40, 68.34, 68.15, 68.01, 67.78, 66.92, 66.81, 62.62, 62.31, 61.63, 60.96, 59.60, 55.84, 51.77, 51.66, 40.08, 22.39, 22.27, 22.03, 15.35, 15.32. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3897.
Neu5Acα3Galβ4GlcNAcβ3Galβ4Glc (38a) (12.6 mg, 95%). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 4.70 (dd, J=8.3, 2.3 Hz, 1H), 4.66 (d, J=7.9 Hz, 0.6H), 4.56 (d, J=7.9 Hz, 1H), 4.44 (d, J=7.9 Hz, 1H), 4.16 (d, J=3.3 Hz, 1H), 4.12 (dd, J=9.9, 3.1 Hz, 1H), 3.95 (tq, J=5.6, 3.5, 2.9 Hz, 3H), 3.92-3.67 (m, 18H), 3.69-3.54 (m, 10H), 3.32-3.25 (m, 0.6H), 2.76 (dd, J=12.4, 4.6 Hz, 1H), 2.03 (s, 6H), 1.80 (t, J=12.1 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.99, 174.87, 173.83, 102.90, 102.77, 102.52, 99.78, 95.71, 91.78, 82.02, 78.37, 78.27, 77.98, 75.47, 75.15, 74.87, 74.78, 74.53, 74.33, 73.76, 72.86, 72.12, 71.74, 71.37, 71.10, 70.10, 69.95, 69.36, 68.32, 68.06, 67.45, 62.56, 61.01, 60.96, 60.05, 59.92, 59.82, 55.16, 51.66, 39.61, 22.15, 22.02. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C37H61N2O29997.3365; found 997.3329.
Neu5Acα3Galβ4GlcNAcβ3Galβ4(Fucα3)Glc (40a) (12.6 mg, 94%). 1H NMR (600 MHz, D2O) δ 5.44 (dd, J=12.9, 4.0 Hz, 0.6H), 5.37 (d, J=4.0 Hz, 0.3H), 5.18 (d, J=3.8 Hz, 0.3H), 4.81 (s, 1H), 4.70 (dd, J=8.3, 2.0 Hz, 1H), 4.65 (d, J=8.0 Hz, 0.4H), 4.56 (d, J=7.8 Hz, 1H), 4.41 (dd, J=7.8, 5.2 Hz, 1H), 4.14-4.08 (m, 2H), 4.00-3.44 (m, 33H), 2.76 (dd, J=12.4, 4.6 Hz, 1H), 2.03 (d, J=3.5 Hz, 6H), 1.80 (t, J=12.1 Hz, 1H), 1.25-1.12 (m, 3H). 13C NMR (150 MHz, D2O) δ 174.99, 174.83, 173.83, 102.71, 102.52, 101.70, 99.78, 98.42, 95.81, 81.57, 78.01, 76.98, 75.46, 75.35, 75.15, 74.67, 74.50, 72.87, 72.67, 72.36, 72.07, 71.90, 71.74, 70.93, 69.36, 69.19, 68.33, 68.07, 67.99, 67.45, 66.47, 62.56, 61.49, 61.01, 60.51, 59.82, 55.14, 51.66, 39.62, 22.14, 22.01, 15.20. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3905.
Neu5Acα3Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)Glc (42a) (12.5 mg, 92%). 1H NMR (800 MHz, D2O) δ 5.42 (d, J=4.0 Hz, 0.6H), 5.36 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=4.0 Hz, 0.4H), 5.11-5.10 (m, 1H), 4.83-4.79 (m, 2H), 4.69 (d, J=8.0, 1H), 4.64 (d, J=8.0, 0.4H), 4.52 (d, J=8.0, 1.0H), 4.40 (d, J=8.0 Hz, 1H), 4.10-3.44 (m, 37H), 2.76 (dd, J=12.0, 4.8 Hz, 1H), 2.03-2.01 (m, 6H), 1.78 (t, J=12.0 Hz, 1H), 1.17-1.14 (m, 6H). 13C NMR (200 MHz, D2O) δ 175.00, 174.62, 173.84, 102.51, 101.70, 101.52, 100.68, 99.63, 98.53, 98.41, 95.81, 92.10, 81.58, 81.51, 76.97, 75.64, 75.52, 75.35, 74.97, 74.89, 74.66, 74.62, 74.48, 73.00, 72.89, 72.66, 72.33, 72.23, 71.88, 71.83, 70.92, 70.66, 70.61, 69.24, 69.18, 69.14, 68.28, 68.21, 68.08, 68.01, 67.98, 67.77, 67.69, 67.29, 66.63, 66.47, 66.43, 62.56, 61.49, 61.47, 60.42, 59.77, 59.50, 55.92, 51.67, 39.76, 22.21, 22.01, 21.99, 15.37, 15.25, 15.19. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C49H81N2O371289.4524; found 1289.4523.
Neu5Acα3Galβ4(Fucα3)GlcNAcβ3Galβ4Glc (43a) (12.5 mg, 93%). 1H NMR (800 MHz, D2O) δ 5.21 (d, J=4.0 Hz, 0.4H), 5.12-5.08 (m, 1H), 4.83-4.81 (m, 1H), 4.71-4.69 (m, 1H), 4.65 (d, J=8.0, 0.6H), 4.52 (d, J=8.0, 0.6H), 4.42 (d, J=8.0, 1H), 4.16-3.24 (m, 36H), 2.77-2.72 (m, 1H), 2.03-2.01 (m, 6H), 1.80-1.67 (m, 1H), 1.17-1.14 (m, 6H). 13C NMR (200 MHz, D2O) δ 175.00, 174.65, 173.83, 102.93, 102.90, 102.87, 102.55, 101.76, 101.54, 100.68, 99.64, 99.63, 98.55, 95.71, 91.78, 82.07, 82.04, 78.35, 78.24, 75.64, 75.15, 75.01, 74.89, 74.87, 74.86, 74.85, 74.77, 74.62, 74.61, 74.33, 73.75, 73.01, 72.89, 71.87, 71.83, 71.81, 71.72, 71.37, 71.09, 70.95, 70.09, 69.96, 69.93, 69.24, 69.15, 68.33, 68.31, 68.29, 68.28, 68.26, 68.23, 68.15, 68.07, 67.77, 67.69, 67.28, 66.64, 62.60, 62.56, 61.48, 61.47, 60.97, 60.95, 60.83, 60.81, 60.79, 60.77, 60.76, 60.05, 59.91, 59.49, 55.93, 51.81, 51.67, 39.76, 39.74, 22.24, 22.22, 22.01, 21.99, 15.36, 15.25, 15.24, 15.22. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3951.
Neu5Acα6Galβ3GlcNAcβ3Galβ4Glc (33a) (LSTc, 12.2 mg, 93%). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.6 Hz, 0.3H), 4.74 (d, J=8.5 Hz, 1H), 4.66 (d, J=8.0 Hz, 0.6H), 4.45 (d, J=7.9 Hz, 1H), 4.39 (d, J=8.0 Hz, 1H), 4.16 (d, J=3.3 Hz, 1H), 4.01-3.70 (m, 18H), 3.69-3.49 (m, 13H), 3.28 (t, J=8.5 Hz, 0.5H), 2.70 (dd, J=12.4, 4.7 Hz, 1H), 2.08-1.98 (m, 6H), 1.70 (t, J=12.1 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.95, 174.88, 173.47, 103.85, 102.89, 102.52, 100.11, 95.71, 91.78, 83.62, 81.94, 78.40, 78.29, 75.27, 74.90, 74.78, 74.34, 73.76, 73.58, 72.43, 72.36, 71.80, 71.38, 71.10, 70.51, 70.10, 70.00, 68.63, 68.43, 68.39, 68.28, 63.51, 62.61, 61.11, 60.96, 60.63, 60.06, 59.92, 54.43, 51.78, 40.12, 22.19, 22.03. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C37H61N2O29 997.3365; found 997.3322.
Neu5Acα6Galβ3GlcNAcβ3Galβ4(Fucα3)Glc (35a) (12.2 mg, 91%). 1H NMR (600 MHz, D2O) δ 5.36 (dd, J=13.1, 4.0 Hz, 0.6H), 5.29 (d, J=4.0 Hz, 0.3H), 5.10 (d, J=3.8 Hz, 0.3H), 4.74 (d, J=6.3 Hz, 1H), 4.66 (d, J=8.5 Hz, 1H), 4.57 (d, J=8.0 Hz, 0.5H), 4.35 (d, J=7.8 Hz, 1H), 4.31 (d, J=7.9 Hz, 1H), 4.02 (s, 1H), 3.92-3.37 (m, 37H), 2.62 (dd, J=12.4, 4.7 Hz, 1H), 1.95 (d, J=7.2 Hz, 6H), 1.62 (t, J=12.1 Hz, 1H), 1.08 (dd, J=6.8, 3.3 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.93, 174.88, 173.47, 103.84, 102.45, 101.68, 100.12, 98.42, 95.82, 92.10, 83.53, 81.46, 81.40, 76.99, 75.53, 75.35, 75.25, 74.67, 74.51, 73.57, 72.67, 72.43, 72.35, 72.27, 71.90, 71.80, 70.67, 70.50, 69.24, 69.19, 68.63, 68.44, 68.38, 68.28, 67.99, 66.47, 63.51, 62.61, 61.49, 61.38, 60.62, 54.43, 51.78, 40.12, 22.17, 22.03, 15.19. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3903.
Neu5Acα6Galβ4GlcNAcβ3Galβ4Glc (39a) (12.2 mg, 92% yield). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 4.75-4.72 (m, 1H), 4.67 (d, J=7.9 Hz, 0.6H), 4.45 (t, J=7.8 Hz, 2H), 4.16 (d, J=3.3 Hz, 1H), 4.02-3.92 (m, 4H), 3.91-3.52 (m, 28H), 3.31-3.25 (m, 0.6H), 2.67 (dd, J=12.4, 4.6 Hz, 1H), 2.04 (d, J=15.3 Hz, 6H), 1.72 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.89, 173.52, 103.44, 102.91, 102.55, 100.11, 95.71, 91.78, 81.97, 80.45, 78.40, 78.28, 74.87, 74.78, 74.33, 74.25, 73.76, 73.67, 72.52, 72.40, 72.22, 71.69, 71.38, 71.10, 70.71, 70.09, 69.95, 68.38, 68.34, 68.30, 68.19, 63.32, 62.63, 61.36, 60.95, 60.13, 60.04, 59.91, 54.92, 51.87, 40.06, 22.26, 22.01. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3907.
Neu5Acα6Galβ4GlcNAcβ3Galβ4(Fucα3)Glc (41a) (12.6 mg, 94%). 1H NMR (600 MHz, D2O) δ 5.44 (dd, J=12.9, 4.0 Hz, 0.6H), 5.37 (d, J=4.0 Hz, 0.3H), 5.18 (d, J=3.8 Hz, 0.3H), 4.82 (s, 1H), 4.65 (d, J=8.0, 0.5H), 4.44 (dd, J=21.4, 7.8 Hz, 2H), 4.10 (d, J=3.4 Hz, 1H), 4.02-3.92 (m, 5H), 3.91-3.44 (m, 30H), 2.67 (dd, J=12.4, 4.7 Hz, 1H), 2.09-1.99 (m, 6H), 1.72 (t, J=12.1 Hz, 1H), 1.17 (dd, J=6.7, 3.4 Hz, 3H). 13C NMR (150 MHz, D2O) δ 174.89, 174.86, 173.52, 103.45, 102.50, 101.71, 100.12, 98.41, 95.81, 92.10, 81.53, 80.48, 76.97, 75.53, 75.36, 74.48, 74.22, 73.68, 72.68, 72.52, 72.40, 72.28, 72.19, 71.89, 71.69, 70.93, 70.72, 70.59, 69.19, 68.38, 68.34, 68.17, 67.99, 66.48, 63.32, 62.63, 61.48, 60.77, 60.13, 54.88, 51.87, 40.05, 22.24, 22.01, 15.20. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3912.
Galβ(Neu5Acα6)GlcNAcβ3Galβ4Glc (44a) (LSTb, 12.5 mg, 95% yield). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.8 Hz, 0.3H), 4.70 (dd, J=8.5, 2.7 Hz, 1H), 4.66 (d, J=7.9 Hz, 0.5H), 4.44 (d, J=7.8 Hz, 2H), 4.18 (d, J=3.2 Hz, 1H), 4.02-3.50 (m, 30H), 3.33-3.25 (m, 0.5H), 2.75 (dd, J=12.4, 4.7 Hz, 1H), 2.03 (d, J=7.0 Hz, 6H), 1.69 (t, J=12.2 Hz, 1H). 13C NMR (150 MHz, D2O) δ 175.02, 174.88, 173.40, 103.40, 102.89, 102.61, 100.18, 95.72, 82.28, 82.24, 81.63, 78.43, 78.34, 75.25, 74.97, 74.78, 74.34, 73.74, 73.69, 72.49, 72.46, 71.69, 71.38, 71.09, 70.67, 70.10, 69.98, 69.95, 68.55, 68.35, 68.30, 68.21, 68.14, 62.82, 62.57, 61.09, 61.01, 60.08, 54.70, 51.84, 40.07, 22.21, 22.00. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C37H61N2O29997.3365; found 997.3325.
Fucα2Galβ3(Neu5Acα6)GlcNAcβ3Galβ4Glc (45a) (F-LSTb, 12 mg, 96% yield). 25 1H NMR (800 MHz, D2O) δ 5.21 (d, J=4.0 Hz, 0.4H), 5.18 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=4.0 Hz, 1H), 4.83-4.80 (m, 1H), 4.67-4.40 (m, 4H), 4.30-3.28 (m, 33H), 2.75-2.70 (m, 1H), 2.06-2.01 (m, 6H), 1.71-1.63 (m, 1H), 1.26-1.21 (m, 3H). 13C NMR (201 MHz, D2O) δ 175.03, 174.15, 173.36, 103.29, 102.93, 102.89, 100.33, 100.28, 100.21, 100.14, 99.50, 99.43, 95.70, 91.78, 87.37, 81.91, 81.86, 78.28, 78.20, 77.01, 76.68, 75.09, 74.91, 74.90, 74.80, 74.32, 73.81, 73.74, 73.57, 73.53, 72.50, 71.82, 71.68, 71.35, 71.09, 70.20, 70.16, 70.12, 69.39, 69.13, 68.45, 68.40, 68.36, 68.33, 68.27, 68.20, 68.05, 68.03, 66.50, 62.72, 62.61, 62.56, 61.09, 61.06, 60.35, 60.06, 59.92, 54.99, 51.86, 51.76, 40.13, 40.08, 22.15, 22.12, 22.03, 21.99, 15.29, 15.22. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C43H71N2O331143.3945; found 1143.3957.
Neu5Acα3Galβ3(Neu5Acα6)GlcNAcβ3Galβ4Glc (46a) (DSLNT, 12.5 mg, 92% yield). 1H NMR (600 MHz, D2O) δ 5.22 (d, J=3.7 Hz, 0.3H), 4.72-4.69 (m, 1H), 4.66 (dd, J=8.0, 1.3 Hz, 0.7H), 4.50 (d, J=7.8 Hz, 1H), 4.46-4.43 (m, 1H), 4.17 (d, J=3.3 Hz, 1H), 4.11-4.06 (m, 1H), 4.00-3.51 (m, 36H), 3.32-3.24 (m, 0.5H), 2.75 (ddd, J=12.1, 9.8, 4.6 Hz, 2H), 2.14-1.94 (m, 8H), 1.78 (t, J=12.1 Hz, 1H), 1.69 (t, J=12.1 Hz, 1H). 13C NMR (150 MHz, D2O) δ 174.99, 173.88, 173.41, 103.41, 102.89, 102.53, 100.18, 99.60, 95.71, 82.23, 81.81, 78.37, 75.58, 75.06, 74.97, 74.78, 74.35, 73.74, 72.76, 72.48, 71.81, 71.67, 69.95, 69.08, 68.39, 68.32, 68.22, 68.03, 67.25, 62.86, 62.56, 62.42, 61.10, 61.03, 60.75, 54.60, 51.83, 51.63, 40.06, 39.76, 22.29, 22.02, 22.00. HRMS (ESI-Orbitrap) m/z: [M−H]− calculated for C48H78N3O37 1288.4320; found 1288.4264.
Neu5Acα3Galβ3(Neu5Acα6)GlcNAcβ3Galβ4(Fucα3)Glc (47a) (FDS-LNT-II, 12.0 mg, 96%). 1H NMR (800 MHz, D2O) δ 5.42 (d, J=4.0 Hz, 0.6H), 5.36 (d, J=4.0 Hz, 0.4H), 5.17 (d, J=4.0 Hz, 1H), 4.83-4.81 (m, 1H), 4.70 (d, J=8.0, 1H), 4.64 (d, J=8.0, 0.6H), 4.50 (d, J=8.0, 0.6H), 4.41 (d, J=8.0, 0.6H), 4.11-3.44 (m, 40H), 2.77-2.71 (m, 2H), 2.02-2.00 (m, 9H), 1.78 (t, J=12.0 Hz, 1H), 1.68 (t, J=12.0 Hz, 1H), 1.16-1.14 (m, 3H). 13C NMR (200 MHz, D2O) δ 175.00, 174.92, 174.79, 173.88, 173.39, 103.37, 102.37, 101.68, 101.66, 100.15, 99.59, 98.52, 98.41, 95.81, 92.10, 81.80, 81.73, 76.99, 75.57, 75.53, 75.36, 75.05, 74.67, 74.57, 73.69, 72.75, 72.68, 72.46, 72.38, 72.29, 71.88, 71.81, 71.66, 70.93, 70.63, 70.59, 69.22, 69.17, 69.07, 68.42, 68.39, 68.33, 68.22, 68.09, 68.05, 68.02, 68.01, 67.98, 67.24, 66.48, 66.44, 62.91, 62.55, 62.41, 61.61, 61.02, 59.76, 54.60, 51.84, 51.63, 40.08, 39.77, 22.28, 22.02, 22.00, 15.20. HRMS (ESI-Orbitrap) m/z: [M−2H]2-calculated for C54H89N3O41 716.7413; found 716.7424.
Provided herein is an efficient strategy to access target HMOs shown in
CMP-Neu5Ac is formed from Neu5Ac and CTP via SA4 containing Neisseria meningitidis CMP-sialic acid synthetase (NmCSS). As provided herein, these SA systems were combined with suitable GTs for highly efficient synthesis of target HMOs. The GTs provided herein include a β1-3-N-acetylglucosaminyltransferase (03GlcNAcT) GT1a; a β1-3-galactosyltransferase (β3GalT) GT2a and a β1-4-galactosyltransferase (β4GalT) GT2b; an α1-2-fucosyltransferase (α2FucT) GT3a and an α1-3/4-fucosyltransferase (α3/4FucT) GT3b; as well as an α2-3-sialyltransferase (α3SiaT) GT4a and several α2-6-sialyltransferases (α6SiaTs) GT4b-GT4e.
HMOs were produced efficiently in a target-oriented chemoenzymatic synthetic process from LacβNHCbz (1) using a Stepwise One-Pot Multienzyme (StOPMe) strategy with a donor pre-generated from the corresponding monosaccharides and nucleoside triphosphates using related sugar activation (SA) enzymes. To guide target-oriented synthesis, enzymatic assembly synthetic maps (EASyMaps) were created herein to access the sixteen possible neutral HMOs containing the LNT (
Every NHCbz-tagged HMO target was synthesized from LacβNHCbz (1) in one-pot in a stepwise procedure. Enzymes were deactivated after each glycosylation step by incubating the reaction mixture in a boiling water bath for 5 min and cooling down. The reaction mixture was then used for the next step without intermediate product purification. An α2-6-sialylation protection strategy was used to achieve regioselective α1-3/4-fucosylation of the internal GlcNAc but not the Glc at the reducing end. In this strategy, trisaccharide was used as the acceptor substrate for Pd2,6ST-catalyzed OPME sialylation to add an Neu5Ac α2-6-linked to the internal Gal on the trisaccharide which protected the Glc at the reducing end from α1-3/4-fucosylation. The resulting tetrasaccharide can be extended with a β1-3 or a β1-4-linked Gal. After regioselective α1-3/4-fucosylation was completed, the undesired α2-6-linked Neu5Ac was removed using a sialidase-catalyzed reaction.
HMOs with a tetraose core all share a common trisaccharide GlcNAcβ3Lac. Therefore, the fist glycosylation step for the synthesis of target HMOβNHCbz was the extension of LacβNHCbz (1) to GlcNAcβ3LacβNHCbz by an NmLgtA (GT1a)-catalyzed glycosylation reaction with the in situ generation of UDP-GlcNAc from GlcNAc using SA1. Before this glycosylation step, except for CMP-Neu5Ac (generated by SA4), all other sugar nucleotides needed for future glycosylation steps were generated by adding all components needed for the corresponding sugar activation systems (SA1-SA3), shown in
By following the bottom route in the map
By following the top route in the map
By following the middle in the map
With the access to all sixteen neutral HMOs containing a tetraose core, sialylation was carried out for those do not contain an α1-2-linked fucose at the terminal Gal (eight HMOβNHCbz targets marked with a four-point star in
The library of HMOs with an LNT core also include five compounds containing a quite unique Neu5Acα2-6GlcNAc linkage that has been found in several colon cancer cell lines. For example, a complex disialyl Lewis a-containing lacto-series ganglioside was shown to be most likely contributed by the combined functions of FUT-3 and ST6GALNAC VI, but not ST6GALNAC V. On the other hand, human ST6GALNAC V (hST6GALNAC V) was shown to be responsible mainly for the production of ganglio-series GD1a ganglioside by sialylating GM1b for the formation of Neu5Acα2-6GalNAc linkage. The sialyltransferase that is responsible for forming the Neu5Acα2-6GlcNAc linkage in HMOs is not clear. Nevertheless, a recombinant hST6GALNAC V expressed in HEK293 cells has been used for catalyze the addition of a Neu5Ac α2-6-linked to the GlcNAc or the GalNAc in the acceptor substrate for the synthesis of a DSLNT derivative and a disialyl Gb5 (DSGb5) ganglioside glycan, respectively. Its expression in E. coli, as provided herein, had not been reported previously.
In order to synthesize target Neu5Acα2-6GlcNAc-containing HMOs, an N-terminal truncated recombinant hST6GALNAC V fused with an N-terminal maltose binding protein (MBP) and a C-terminal His6-tag was cloned using a codon optimized synthetic gene. The resulting MBP-Δ50hST6GALNAC V-His6 was successfully expressed as a soluble and active fusion protein in E. coli Origami B (DE3) cells containing pGro7. Its expression level was 440 mU per liter culture when Neu5Acα2-3LNTβNHCbz was used as the acceptor, 1 U=1 μmol min1 at 37° C., pH 7.5. Acceptor substrate specificity was carried out and the results showed that the recombinant MBP-Δ50hST6GALNAC V-His6 is promiscuous and is capable of using different HMOβNHCbz as acceptor substrates.
Furthermore, both V99M and stop-loss (which lead to the addition of 20 amino acids at the C-terminus) mutations of ST6GALNAC V have shown to enhance its catalytic activity and to correlate to the genetic cause of coronary artery disease (CAD). Both mutants and a combination mutant containing both mutations were constructed. They were expressed in E. coli and their expression levels and activities were compared. The soluble expression levels of both MBP-Δ50hST6GALNAC V_V99M-His6 and MBP-Δ50hST6GALNAC V_ext20-His6 were higher than that of the wild-type recombinant enzyme. The combination mutant MBP-Δ50hST6GALNAC V_V99M_ext20-His6 was also successfully expressed as a soluble active enzyme. Furthermore, several PROSS designed mutants, shown in Table 2, expressed in an E. coli host were tested. Among them, MBP-Δ50hST6GALNAC V_Design7_ext20-His6, shown in Table 2, was shown to have a high soluble expression level and good catalytic activity. It was used for highly efficient synthesis of DSLNTβNHCbz (46) from LSTaβNHCbz (32) using a OP2E sialylation reacton (
With active hST6GALNAC V and mutants in hand and an understanding of their acceptor substrate specificities, EASyMap (
The preparation of compounds 44-46 shares the common first two glycosylation steps. Again, sugar nucleotides including UDP-GlcNAc, UDP-Gal, and/or GDP-Fuc were generated in the first step followed by NmLgtA (GT1a)-catalyzed extension of LacβNHCbz (1) with a 1-3-linked GlcNAc with UDP-GlcNAc generated from GlcNAc via SA1 in the first step, enzyme deactivation, and Cvβ3GalT (GT2a)-catalyzed extension with a β1-3-linked Gal using UDP-Gal generated from Gal via SA2 in the first step, and enzyme deactivation. From there for the top route, hST6GALNAC V (GT4e)-catalyzed addition of Neu5Ac α2-6-linked to the internal GlcNAc with in situ generation of CMP-Neu5Ac formed LSTbβNHCbz (44). On one hand, deactivating the enzymes in the resulting reaction mixture followed by a single C18-cartridge purification produced pure LSTbβNHCbz (44). On the other hand, the reaction mixture after deactivation of enzymes was used directly for the next step to add a terminal α1-2-linked L-fucose by Hm2FT (GT3a)-catalyzed reaction with GDP-Fuc generated from L-fucose via SA3 in the first step formed F-LSTbβNHCbz (45) which was readily purified by deactivation of enzymes followed by a C18-cartridge purification. For the middle route, after the first two glycosylation steps for the formation of the LNTβNHCbz core, PmST1_M144D (GT4a)-catalyzed α2-3-sialylation of the terminal Gal with in situ generation of CMP-Neu5Ac via SA4, enzyme deactivation, followed by hST6GALNAC V (GT4e)-catalyzed addition of Neu5Ac α2-6-linked to the internal GlcNAc with in situ generation of CMP-Neu5Ac via SA4 formed DSLNTβNHCbz (46) which was readily purified by deactivation of enzymes followed by a C18-cartridge purification.
The production of FDS-LNT-IIβNHCbz (47) with a fucose α1-3-linked to the inner most glucose residue shared the same first step glycosylation reaction by NmLgtA (GT1a)-catalyzed extension of LacβNHCbz (1) with a β1-3-linked GlcNAc with in situ generation of UDP-GlcNAc from GlcNAc via SA1 and the generation of other sugar nucleotides needed in the future steps. After deactivation of the enzymes in the reaction mixture, the bottom route in the EASyMap
For the synthesis of DSLNTβNHCbz (46), it was found that synthesizing it from LSTaβNHCbz (32) using OP2E sialylation reacton (
MBP-Δ50hST6GALNAC V_V99M-His6, MBP-Δ50hST6GALNAC V_ext20-His6, and the combination mutants MBP-Δ50hST6GALNAC V_V99M_ext20-His6 were shown to be able to catalyze the sialylation of monosialyl Lewis a structure Neu5Acα3LNFP-IIβNHCbz (S-LNF II or F-LSTa) (37) to form the complex disialyl Lewis a structure FDS-LNT IβNHCbz (48) (
Assays for synthesizing FDS-LNTβNHCbz (48) from S-LNF-IIβNHCbz (37). Assays were carried out at 30° C. in a total volume of 10 μL in Tris-HCl buffer (100 mM, pH 7.5) containing (10 mM), CMP-Neu5Ac (15 mM), MgCl2 (20 mM), MBP-Δ50hST6GALNAC V_ext20-Hiss (16 g), MBP-Δ50hST6GALNAC V_V99M-His6, or MBP-Δ50hST6GALNAC V_V99M_ext20-His6. The reactions were analyzed by HRMS at 24 hours and 72 hours. The formation of FDS-LNTβNHCbz (48) was observed (
The gene encoding full length human GCNT2-B (hGCNT2-B, GenBank ID: NP_001482), a β1-6-GlcNAcT, was codon optimized for E. coli expression, synthesized, and cloned into pET28a(+) by Twist Bioscience to obtain pET28a(+)-hGCNT2-B plasmid. The gene was subcloned into pMAL-c2X to obtain plasmid pMAL-c2X-hGCNT2-His6. In addition, the plasmid containing the N-terminal 25 amino acid-truncated version of hGCNT2-B was also constructed for expressing MBP-Δ25hGCNT2-B-His6 with an N-terminal MBP fusion component and a C-terminal His6 tag. Origami B (DE3) transformed with pGro7 harboring chaperon GroEL-GroES was used as host cell for expressing MBP-Δ25hGCNT2-B-His6. Its activity was confirmed using LNnTβNHCbz as the acceptor substrate and UDP-GlcNAc as the donor substrate in both small-scale (10 μL) reactions and preparative-scale synthesis with NMR characterization.
Enzyme cloning and truncation. The synthetic gene hGCNT2-B in the pET28a(+)-hGCNT2-B plasmid was amplified via polymer chain reactions (PCR) using forward primer: 5′- GACCGAATTCATGCCCTTATCGATGAGATATCTC-3′ with EcoRI restriction site (underlined) and reverse primer: 5′-CAGCAAGCTTTAGTGGTGGTGATGATGATGGAAGTACCATGATGGCTGAATG-3′ with HindIII restriction site (underlined). The PCR was performed in a reaction mixture (50 μL) containing the template plasmid DNA (10 ng), forward and reverse primer (0.2 M each), 1× Phusion HF buffer, dNTP mixture (0.2 mM each), and 1 U (0.5 L) of Phusion® High-Fidelity DNA Polymerase. The reaction mixture was subjected to 30 cycles of amplification at an annealing temperature of 62° C. The resulting PCR product was gel purified and double digested with EcoRI and HindIII restriction enzymes. The digested and purified PCR product was inserted into pMAL-c2X vector predigested with the same restriction enzymes via T4 ligase and transformed into E. coli DH5α Z-competent cells. Selected clones were grown for plasmid minipreps, and the nucleic acid sequence was confirmed by customer sequencing by Genewiz.
The truncation was carried out using Q5® Site-Directed Mutagenesis kit according to the procedures provided by the manufactory. The plasmid pMAL-c2X-hGCNT2-B-His6 was used as the PCR template and the primers used were forward primer: 5′-TTCGGCGGAGATCCTTCT-3′ and reverse primer: 5′-CATGAATTCTGAAATCCTTCCC-3′. Briefly, a reaction mixture (25 L) containing 12.5 μL of Q5 Hot Start High-Fidelity 2× Master Mix, 10 ng (1 μL) plasmid as the template, 0.5 μM (1.25 μL) for each of forward and reverse primers, Nuclease-free Water (9 μL) was subjected to 30 cycles of amplification with 30 seconds of annealing (61° C.), 3.5 minutes of extension at 72° C. for each cycle, and 2 more minutes extension after the amplification. The standard KLD reaction (10 μL) containing 5 μL of 2×KLD Reaction Buffer, 1 μL of 10×KLD Enzyme Mix and 3 μL Nuclease-free Water was carried out by incubating the mixture at room temperature for 5 minutes. The products were then transformed to DH5α competent cell prepared by Mix and Go!™ Competent Cells (Zymo Research). The plasmids with target mutants were confirmed by DNA sequencing by Genewiz. The nucleic acid and the amino acid sequences of MBP-Δ25hGCNT2-B-His6 are shown in SEQ ID NO: 9 and SEQ ID NO: 10, respectively.
Enzyme expression and purification. E. coli Origami B(DE3) cells with pGro7 harboring the recombinant plasmid containing the target gene were cultured at 37° C. in 50 mL LB media containing 0.05 mg mL−1 ampicillin, 0.025 mg mL−1 kanamycin, 0.0175 mg mL−1 chloramphenicol, 0.005 mg mL−1 tetracycline with rapid shaking at 220 rpm. Then 15 mL of the overnight cell culture was transferred to 1 L of LB media containing the same antibiotics and 0.5 g L−1 L-arabinose and incubated at 37° C. When the OD600 nm of the cell culture reached 0.6-0.8, IPTG (0.1 mM) was added to induce the expression of the recombinant enzyme followed by incubation at 15° C. for 24 h with shaking at 220 rpm in a C25KC incubator shaker (New Brunswick Scientific, Edison, NJ).
Bacterial cells were harvested by centrifugation at 4° C. in a Thermo Lynx 6000 centrifuge with Rotor Lynx F9-6×1000 at 4,392×g for 30 min. The cell pellet was re-suspended with lysis buffer (Tris-HCl buffer, 100 mM, pH 8.0 containing 0.1% Triton X-100) (20 mL for cells collected from 1 L cell culture). The cells were lysed by homogenization using an Avestin EmulsiFlex-C3 homogenizer or sonication using a Sonics Vibra-Cell system with the following protocol: 10 s (sonication)/59 s (rest) for a total of 7 min on ice. Cell lysate was obtained by centrifugation at 4° C. with Sorvall ST16R, HIGHConic II 6×94 mL fixed angle rotor at 9016×g for 1 hr and the supernatant (lysate) was collected. Protein purification was carried out using 5 mL Bio-Scale Mini Profinity IMAC Cartridges in a Bio-Rad NGC 100 Medium-Pressure Chromatography System with a flow rate of 5 mL min−1. The supernatant was loaded to the Ni2+-NTA column pre-equilibrated with 10 column volumes of a binding buffer (5 mM imidazole, 0.5 mM NaCl, 50 mM Tris-HCl buffer, pH 7.5). The column was washed with 10 column volumes of the binding buffer followed by 10 column volumes of a washing buffer (10 mM imidazole, 0.5 mM NaCl, 50 mM Tris-HCl buffer, pH 7.5). The target protein was eluted with 10 column volumes of a elution buffer (200 mM imidazole, 0.5 mM NaCl, 50 mM Tris-HCl buffer, pH 7.5). Fractions containing the target enzyme were collected and concentrated via MilliporeSigma™ Amicon™ Ultra-15 Centrifugal Filter Units with MWCOs of 10 kDa, 10% glycerol was added before stored at −20° C.
SDS-PAGE analysis results for enzyme expression and purification of MBP-Δ25hGCNT2-B-His6 are shown in
Enzyme activity test and synthesis. The MBP-Δ25hGCNT2-B-His6 enzyme showed good activity with incubated at 30° C. for overnight in a reaction mixture (10 L) (
Preparative-scale synthesis was also successfully carried out using LNnTβNHCbz as the acceptor substrate using a one-pot multienzyme (OPME) process (
The StOPMe strategy with pre-generation of sugar nucleotides has also been successfully applied for synthesizing other HMOβNHCbz compounds including those containing the hexoase pLNnH-core. Some examples are described below with the guidance of the EASyMap designed (
StOPMe synthesis of Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz (pLNnHβNHCbz) (11). Galactose (0.504 mmol), GlcNAc (0.504 mmol), ATP (1.26 mmol), and UTP (1.26 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The pH of the mixture was adjusted to 7.5 by adding 5 M NaOH. After the addition of BLNahK (4 mg), PmGlmU (4 mg), SpGalK (3.5 mg), BLUSP (3 mg), and PmPpA (2 mg), water was added to bring the final volume to 5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. Reaction progress was monitored by HRMS analysis. After incubating the reaction mixture in a boiling water bath for 5 min then cooled down, LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added to synthesize GlcNAcβ3Galβ4GlcβNHCbz. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min then cooled down and moved to the next step without purification. For each step, pH was checked and adjusted to 7.5 using 5 M NaOH before adding enzymes. NmLgtB (0.6 mg) was then added into reaction mixture to obtain Galβ4GlcNAcβ3Galβ4GlcβNHCbz. Once the reaction was completed (8 h), the reaction was stopped by the same workup method using boiling water bath. To synthesize GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, Hpβ3GlcNAcT (4 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After workup, NmLgtB (0.6 mg) was added to synthesize Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz (11). The reaction was incubated at 30° C. in an incubator shaker for 8 h followed by workup with boiling water bath and cooled down to room temperature. The final volume was around 20 mL with final product concentration of around 10 mM. Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz (11) was obtained by single C18-cartridge purification process as a white powder (200 mg, 79% yield for 4 steps from 100 mg LacβNHCbz). The NMR data were consistent with those data described above using the MSOPME strategy.
StOPMe synthesis of Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (50). Galactose (0.504 mmol), GlcNAc (0.504 mmol), L-fucose (0.504 mmol), ATP (1.89 mmol), UTP (1.26 mmol), and GTP (0.63 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The pH of the mixture was adjusted to 7.5 by adding 5 M NaOH. After the addition of BLNahK (4 mg), PmGlmU (4 mg), SpGalK (3.5 mg), BLUSP (3 mg), BfFKP (5 mg), and PmPpA (3 mg), water was added to bring the final volume to 5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. Reaction progress was monitored by HRMS. After incubating the reaction mixture in a boiling water bath for 5 min then cooled down to room temperature, LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added to synthesize GlcNAcβ3Galβ4GlcβNHCbz. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min then cooled down to room temperature and moved to the next step without purification. For each step, pH was checked and adjusted to 7.5 by adding 5 M NaOH before adding enzymes. To synthesize GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz, Neu5Ac (0.315 mmol), CTP (0.42 mmol), NmCSS (1 mg), Pd2,6ST (3 mg) were added into the reaction mixture. The reaction was completed in 6 h by incubating the reaction mixture at 30° C. in an incubator shaker with agitation at 180 rpm, followed by incubation in a boiling water bath for 5 min. and cooled down to room temperature. NmLgtB (0.6 mg) was added to produce Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. Once the reaction was completed (8 h), the reaction was stopped by incubating in a boiling water bath for 5 min and cooled down to room temperature. To synthesize GlcNAcβ3Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz, Hpβ3GlcNAcT (4 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed. The reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. Hp3/4FT (1.5 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm to obtain GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. Then the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. NmLgtB (0.6 mg) was added to synthesize Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. The reaction was incubated at 30° C. in an incubator shaker for 8 h followed by incubating in a boiling water bath for 5 min. and cooled down to room temperature. Hp3/4FT (1.5 mg) was added and reaction mixture was incubated at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm to obtain Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. The reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. To remove Neu5Ac and obtain the final product Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz, SpNanA (1.5 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min and cooled down to room temperature. The final volume was around 26 mL with a final product concentration of about 7.7 mM. Galβ4(Fucα3)GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (50) was obtained by a single C18-cartridge purification process as a white powder (236 mg, 75% yield for 8 steps from 100 mg LacβNHCbz). 1H NMR (800 MHz, D2O) δ 1.16 (dd, J=21.8, 6.6 Hz, 6H), 2.02 (d, J=2.2 Hz, 6H), 3.42 (t, J=9.1 Hz, 1H), 3.50 (dt, J=10.0, 7.2 Hz, 2H), 3.58 (dtd, J=11.9, 6.3, 2.7 Hz, 5H), 3.67-3.63 (m, 3H), 3.71-3.68 (m, 7H), 3.73 (dt, J=12.8, 4.5 Hz, 4H), 3.79 (dt, J=12.5, 3.4 Hz, 4H), 3.88 (dddd, J=32.4, 16.0, 11.3, 3.8 Hz, 8H), 3.99-3.93 (m, 6H), 4.10 (d, J=3.7 Hz, 1H), 4.15 (d, J=3.6 Hz, 1H), 4.48-4.42 (m, 3H), 4.71 (t, J=8.3 Hz, 2H), 4.84 (s, 3H), 5.13 (dd, J=16.7, 4.0 Hz, 2H), 5.19 (p, J=12.9, 11.8 Hz, 2H), 7.45 (d, J=5.7 Hz, 5H); 13C NMR (200 MHz, D2O) δ 15.2, 15.3, 22.2, 22.2, 55.9, 59.6, 59.8, 60.9, 60.9, 61.4, 61.4, 61.5, 61.5, 66.7, 66.7, 67.3, 67.6, 67.7, 68.2, 68.3, 68.3, 69.1, 69.2, 70.0, 70.5, 71.0, 71.4, 71.8, 71.9, 72.4, 72.8, 73.0, 74.4, 74.7, 74.8, 74.9, 74.9, 75.0, 75.0, 75.1, 76.1, 77.7, 81.6, 82.0, 98.6, 98.7, 101.7, 101.7, 102.5, 102.5, 102.8, 127.8, 127.8, 128.5, 128.8, 128.8, 136.0, 158.1, 174.6, 174.7; HRMS (ESI-Orbitrap) m/z: [M+H]+ Calcd for C60H96N3O40 1498.5565; found 1498.5543.
StOPMe synthesis of Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (51). Galactose (0.504 mmol), GlcNAc (0.504 mmol), L-fucose (0.252 mmol), ATP (1.575 mmol), UTP (1.26 mmol), and GTP (0.315 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The pH of the mixture was adjusted to 7.5 by adding 5 M NaOH. After the addition of BLNahK (4 mg), PmGlmU (4 mg), SpGalK (3.5 mg), BLUSP (3 mg), BfFKP (3 mg), and PmPpA (2 mg), water was added to bring the final volume to 5 mL. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. Reaction progress was monitored by HRMS analysis. After incubating the reaction mixture in a boiling water bath for 5 min then cooled down, LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added to synthesize GlcNAcβ3Galβ4GlcβNHCbz. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min then cooled down and moved to the next step without purification. For each step, pH was checked and adjusted to 7.5 using 5 M NaOH before adding enzymes. To synthesize GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz, Neu5Ac (0.315 mmol), CTP (0.42 mmol), NmCSS (1 mg), Pd2,6ST (3 mg) were added into the reaction mixture. The reaction was completed in 6 h by incubating the reaction mixture at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm, followed by boiling water bath workup and cooling down. NmLgtB (0.6 mg) was added into reaction mixture to obtain Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. Once the reaction was completed (8 h), the reaction was stopped by the same workup method using boiling water bath. To synthesize GlcNAcβ3Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz, Hpβ3GlcNAcT (4 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After workup, Hp3/4FT (1.5 mg) was added and reaction mixture was incubated at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm to obtain GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. Then the reaction mixture was incubated in a boiling water bath for 5 min and cooled down. NmLgtB (0.6 mg) was added to synthesize Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. The reaction was incubated at 30° C. in an incubator shaker for 8 h followed by workup with boiling water bath and cooled down. To remove the sialic acid and obtain the final product Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz, SpNanA (1.5 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm, followed by workup and cooling down. The final volume was around 24 mL with a final product concentration of about 8.3 mM.
Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4GlcβNHCbz (51) was obtained by single C18-cartridge purification process as a white powder (210 mg, 74% yield for 7 steps from 100 mg LacβNHCbz). 1H NMR (800 MHz, D2O) δ 1.15 (d, J=6.6 Hz, 3H), 2.03 (d, J=5.4 Hz, 6H), 3.42 (t, J=9.1 Hz, 1H), 3.53 (ddd, J=19.0, 9.9, 7.8 Hz, 2H), 3.58 (td, J=10.0, 9.3, 3.6 Hz, 4H), 3.67 (ddd, J=16.5, 8.4, 4.9 Hz, 5H), 3.70 (q, J=4.3 Hz, 4H), 3.76-3.71 (m, 7H), 3.81-3.76 (m, 5H), 3.85 (dd, J=12.4, 4.8 Hz, 2H), 3.91-3.87 (m, 2H), 3.95-3.91 (m, 3H), 3.98-3.95 (m, 3H), 4.10 (d, J=3.7 Hz, 1H), 4.15 (d, J=3.6 Hz, 1H), 4.49-4.42 (m, 3H), 4.70 (d, J=8.3 Hz, 3H), 5.12 (d, J=4.1 Hz, 1H), 5.21-5.16 (m, 2H), 7.45 (d, J=5.9 Hz, 5H); 13C NMR (200 MHz, D2O) δ 15.3, 22.1, 22.2, 55.1, 55.9, 59.6, 59.8, 61.0, 61.0, 61.0, 61.4, 66.7, 67.4, 67.6, 68.2, 68.3, 68.5, 69.2, 70.0, 70.5, 70.9, 70.9, 71.4, 71.8, 72.1, 72.5, 72.8, 74.4, 74.7, 74.8, 75.0, 75.1, 75.3, 75.3, 76.1, 77.7, 78.2, 81.6, 82.0, 98.7, 101.7, 102.5, 102.7, 102.8, 102.8, 127.8, 127.8, 128.5, 128.8, 128.8, 136.0, 158.1, 174.7, 174.8. HRMS (ESI-Orbitrap) m/z: [M+H]+ Calcd for C54H86N3O36 1352.4986; found 1352.4970.
StOPMe synthesis of Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz (52). Galactose (0.504 mmol), GlcNAc (0.504 mmol), Fucose (0.252 mmol), ATP (1.575 mmol), UTP (1.26 mmol) and GTP (0.315 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The pH of the mixture was adjusted to 7.5 by adding 5 M NaOH. After the addition of BLNahK (4 mg), PmGlmU (4 mg), SpGalK (3.5 mg), BLUSP (3 mg), BfFKP (3 mg) and PmPpA (2 mg), water was added to bring the final volume to 5 ml. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. Reaction progress was monitored by HRMS analysis. After incubating the reaction mixture in a boiling water bath for 5 min then cooled down, LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added to synthesize GlcNAcβ3Galβ34Glcβ3NHCbz. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min then cooled down and moved to the next step without purification. For each step, pH was checked and adjusted to 7.5 using 5 M NaOH before adding enzymes. To synthesize GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz, Neu5Ac (0.315 mmol), CTP (0.42 mmol), NmCSS (1 mg), Pd2,6ST (3 mg) were added into the reaction mixture. The reaction was completed in 6 h by incubating the reaction mixture at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm, followed by boiling water bath workup and cooling down. NmLgtB (0.6 mg) was added into reaction mixture to obtain Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. Once the reaction was completed (8 h), the reaction was stopped by the same workup method using boiling water bath. To synthesize GlcNAcβ3Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz, Hpβ3GlcNAcT (4 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After workup, Neu5Ac (0.315 mmol), CTP (0.42 mmol), NmCSS (1 mg), Pd2,6ST (3 mg) were added into the reaction mixture to obtain GlcNAcβ3(Neu5Acα6)Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. The reaction mixture was then incubated in a boiling water bath for 5 min then cooled down. NmLgtB (0.6 mg) was added to synthesize Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. The reaction was incubated at 30° C. in an incubator shaker for 8 h followed by workup with boiling water bath and cooled down.Hp3/4FT (1.5 mg) was added and reaction mixture was incubated at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm to obtain Galβ4(Fucα3)GlcNAcβ3(Neu5Acα6)Galβ4GlcNAcβ3(Neu5Acα6)Galβ4GlcβNHCbz. To remove the sialic acids and obtain the final product Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz, SpNanA (2 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm, followed by workup and cooling down. The final volume was around 26 mL with a final product concentration of about 7.7 mM.
Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNAcβ3Galβ4GlcβNHCbz (52) was obtained by single C18-cartridge purification process as a white powder (204 mg, 72% yield for 8 steps from 100 mg LacβNHCbz). 1H NMR (800 MHz, D2O) δ 1.17 (d, J=6.6 Hz, 3H), 2.02 (d, J=6.4 Hz, 6H), 3.52-3.47 (m, 1H), 3.58 (q, J=9.0 Hz, 5H), 3.65 (d, J=9.3 Hz, 3H), 3.72 (pd, J=19.2, 18.2, 10.3 Hz, 14H), 3.81-3.76 (m, 5H), 3.83 (dd, J=12.5, 5.7 Hz, 1H), 3.94-3.85 (m, 5H), 3.95 (t, J=9.7 Hz, 4H), 4.15 (d, J=8.4 Hz, 2H), 4.45 (dd, J=18.9, 7.8 Hz, 4H), 5.13 (d, J=5.8 Hz, 2H), 5.18 (s, 3H), 7.45 (d, J=5.7 Hz, 5H); 13C NMR (200 MHz,) δ 15.3, 22.2, 22.2, 55.1, 55.9, 59.6, 59.8, 60.9, 61.0, 61.5, 61.5, 66.7, 67.4, 67.7, 68.3, 68.3, 68.3, 69.2, 69.9, 70.0, 71.0, 71.4, 71.9, 72.1, 72.5, 73.0, 74.5, 74.7, 74.8, 74.8, 74.9, 75.0, 75.1, 76.1, 77.7, 78.1, 81.6, 82.0, 82.1, 98.6, 101.7, 102.5, 102.7, 102.8, 102.9, 127.7, 127.8, 128.5, 128.8, 128.8, 136.0, 158.1, 174.7, 174.9. HRMS (ESI-Orbitrap) m/z: [M+H]+ Calcd for C54H86N3O36 1352.4986; found 1352.4960.
StOPMe synthesis of Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (53). Galactose (0.504 mmol), GlcNAc (0.504 mmol), Fucose (0.252 mmol), ATP (1.575 mmol), UTP (1.26 mmol) and GTP (0.315 mmol) were dissolved in water in a 50 mL centrifuge tube containing Tris-HCl buffer (100 mM, pH 7.5) and MgCl2 (20 mM). The pH of the mixture was adjusted to 7.5 by adding 5 M NaOH. After the addition of BLNahK (4 mg), PmGlmU (4 mg), SpGalK (3.5 mg), BLUSP (3 mg), BfFKP (3 mg) and PmPpA (2 mg), water was added to bring the final volume to 5 ml. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. Reaction progress was monitored by HRMS analysis. After incubating the reaction mixture in a boiling water bath for 5 min then cooled down, LacβNHCbz (100 mg, 0.21 mmol) and NmLgtA (3 mg) were added to synthesize GlcNAcβ33Galβ34Glcβ3NHCbz. The reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After the reaction was completed, the reaction mixture was incubated in a boiling water bath for 5 min then cooled down and moved to the next step without purification. For each step, pH was checked and adjusted to 7.5 using 5 M NaOH before adding enzymes. Hp3/4FT (1.5 mg) was added and reaction mixture was incubated at 30° C. in an incubator shaker for 6 h with agitation at 180 rpm to obtain GlcNAcβ3Galβ4(Fucα3)Glcβ3NHCbz. After incubating the reaction mixture in a boiling water bath for 5 min then cooled down, NmLgtB (0.6 mg) was added into reaction mixture to obtain Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz. Once the reaction was completed (8 h), the reaction was stopped by the same workup method using boiling water bath. To synthesize GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz, Hpβ3GlcNAcT (4 mg) was added and the reaction mixture was incubated at 30° C. in an incubator shaker for 20 h with agitation at 180 rpm. After workup, NmLgtB (0.6 mg) was added to synthesize Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz. The reaction was incubated at 30° C. in an incubator shaker for 8 h followed by workup with boiling water bath and cooled down. The final volume was around 22 mL with a final product concentration of about 9.1 mM. Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)GlcβNHCbz (53) was obtained by single C18-cartridge purification process as a white powder (216 mg, 76% yield for 5 steps from 100 mg LacβNHCbz). 1H NMR (800.15 MHz, D2O) δ 1.17 (d, J=6.6 Hz, 3H), 2.03 (d, J=6.0 Hz, 6H), 3.50 (t, J=8.9 Hz, 1H), 3.54 (d, J=10.0 Hz, 1H), 3.62-3.55 (m, 6H), 3.66-3.62 (m, 1H), 3.67 (dd, J=10.0, 3.6 Hz, 1H), 3.70 (d, J=3.2 Hz, 1H), 3.73 (tdd, J=11.8, 8.2, 4.5 Hz, 12H), 3.84-3.75 (m, 10H), 3.89-3.84 (m, 3H), 3.94 (ddd, J=13.5, 9.6, 3.8 Hz, 5H), 4.09 (d, J=3.7 Hz, 1H), 4.16 (d, J=3.6 Hz, 1H), 4.42 (d, J=7.8 Hz, 1H), 4.49-4.46 (m, 2H), 5.18 (s, 2H), 5.44 (d, J=4.2 Hz, 1H), 7.47-7.41 (m, 5H); 13C NMR (201.22 MHz, D2O) δ 15.2, 22.1, 22.2, 55.1, 55.2, 59.6, 59.8, 60.9, 61.0, 61.5, 66.5, 67.4, 68.0, 68.2, 68.3, 68.5, 69.2, 69.9, 70.6, 70.9, 71.9, 72.0, 72.1, 72.2, 72.5, 73.1, 74.5, 74.5, 74.5, 74.8, 75.3, 76.7, 77.6, 78.1, 78.2, 81.5, 81.8, 82.0, 98.5, 101.7, 102.7, 102.7, 102.7, 102.8, 102.9, 127.8, 127.8, 128.5, 128.8, 128.8, 136.0, 158.1, 174.8, 174.9. HRMS (ESI-Orbitrap) m/z: [M+H]+ Calcd for C54H86N3O36 1352.4986; found 1352.4957.
ATGAAAATCGAAGAAGGTAAACTGGTAATCTGGATTAACGGCGATAAAGGCTATAACGGTCT
CGCTGAAGTCGGTAAGAAATTCGAGAAAGATACCGGAATTAAAGTCACCGTTGAGCATCCGG
ATAAACTGGAAGAGAAATTCCCACAGGTTGCGGCAACTGGCGATGGCCCTGACATTATCTTC
TGGGCACACGACCGCTTTGGTGGCTACGCTCAATCTGGCCTGTTGGCTGAAATCACCCCGGA
CAAAGCGTTCCAGGACAAGCTGTATCCGTTTACCTGGGATGCCGTACGTTACAACGGCAAGC
TGATTGCTTACCCGATCGCTGTTGAAGCGTTATCGCTGATTTATAACAAAGATCTGCTGCCG
AACCCGCCAAAAACCTGGGAAGAGATCCCGGCGCTGGATAAAGAACTGAAAGCGAAAGGTAA
GAGCGCGCTGATGTTCAACCTGCAAGAACCGTACTTCACCTGGCCGCTGATTGCTGCTGACG
GGGGTTATGCGTTCAAGTATGAAAACGGCAAGTACGACATTAAAGACGTGGGCGTGGATAAC
GCTGGCGCGAAAGCGGGTCTGACCTTCCTGGTTGACCTGATTAAAAACAAACACATGAATGC
AGACACCGATTACTCCATCGCAGAAGCTGCCTTTAATAAAGGCGAAACAGCGATGACCATCA
ACGGCCCGTGGGCATGGTCCAACATCGACACCAGCAAAGTGAATTATGGTGTAACGGTACTG
CCGACCTTCAAGGGTCAACCATCCAAACCGTTCGTTGGCGTGCTGAGCGCAGGTATTAACGC
CGCCAGTCCGAACAAAGAGCTGGCAAAAGAGTTCCTCGAAAACTATCTGCTGACTGATGAAG
GTCTGGAAGCGGTTAATAAAGACAAACCGCTGGGTGCCGTAGCGCTGAAGTCTTACGAGGAA
GAGTTGGTGAAAGATCCGCGGATTGCCGCCACTATGGAAAACGCCCAGAAAGGTGAAATCAT
GCCGAACATCCCGCAGATGTCCGCTTTCTGGTATGCCGTGCGTACTGCGGTGATCAACGCCG
CCAGCGGTCGTCAGACTGTCGATGAAGCCCTGAAAGACGCGCAGACTAATTCGAGCTCGAAC
AACAACAACAATAACAATAACAACAACCTCGGGATCGAGGGAAGGATTTCAGAATTCGAATT
C
CATATGGCCAGTGCCACCGGTAGTAGTCAGCCAGCCGCGGAAAGCAGCACCCAGCAACGCC
ATGAAAATCGAAGAAGGTAAACTGGTAATCTGGATTAACGGCGATAAAGGCTATAACGGTCT
CGCTGAAGTCGGTAAGAAATTCGAGAAAGATACCGGAATTAAAGTCACCGTTGAGCATCCGG
ATAAACTGGAAGAGAAATTCCCACAGGTTGCGGCAACTGGCGATGGCCCTGACATTATCTTC
TGGGCACACGACCGCTTTGGTGGCTACGCTCAATCTGGCCTGTTGGCTGAAATCACCCCGGA
CAAAGCGTTCCAGGACAAGCTGTATCCGTTTACCTGGGATGCCGTACGTTACAACGGCAAGC
TGATTGCTTACCCGATCGCTGTTGAAGCGTTATCGCTGATTTATAACAAAGATCTGCTGCCG
AACCCGCCAAAAACCTGGGAAGAGATCCCGGCGCTGGATAAAGAACTGAAAGCGAAAGGTAA
GAGCGCGCTGATGTTCAACCTGCAAGAACCGTACTTCACCTGGCCGCTGATTGCTGCTGACG
GGGGTTATGCGTTCAAGTATGAAAACGGCAAGTACGACATTAAAGACGTGGGCGTGGATAAC
GCTGGCGCGAAAGCGGGTCTGACCTTCCTGGTTGACCTGATTAAAAACAAACACATGAATGC
AGACACCGATTACTCCATCGCAGAAGCTGCCTTTAATAAAGGCGAAACAGCGATGACCATCA
ACGGCCCGTGGGCATGGTCCAACATCGACACCAGCAAAGTGAATTATGGTGTAACGGTACTG
CCGACCTTCAAGGGTCAACCATCCAAACCGTTCGTTGGCGTGCTGAGCGCAGGTATTAACGC
CGCCAGTCCGAACAAAGAGCTGGCAAAAGAGTTCCTCGAAAACTATCTGCTGACTGATGAAG
GTCTGGAAGCGGTTAATAAAGACAAACCGCTGGGTGCCGTAGCGCTGAAGTCTTACGAGGAA
GAGTTGGTGAAAGATCCGCGGATTGCCGCCACTATGGAAAACGCCCAGAAAGGTGAAATCAT
GCCGAACATCCCGCAGATGTCCGCTTTCTGGTATGCCGTGCGTACTGCGGTGATCAACGCCG
CCAGCGGTCGTCAGACTGTCGATGAAGCCCTGAAAGACGCGCAGACTAATTCGAGCTCGAAC
AACAACAACAATAACAATAACAACAACCTCGGGATCGAGGGAAGGATTTCAGAATTCGAATT
C
CATATGGCCAGTGCCACCGGTAGTAGTCAGCCAGCCGCGGAAAGCAGCACCCAGCAACGCC
ATGAAAATCGAAGAAGGTAAACTGGTAATCTGGATTAACGGCGATAAAGGCTATAACGGTCT
CGCTGAAGTCGGTAAGAAATTCGAGAAAGATACCGGAATTAAAGTCACCGTTGAGCATCCGG
ATAAACTGGAAGAGAAATTCCCACAGGTTGCGGCAACTGGCGATGGCCCTGACATTATCTTC
TGGGCACACGACCGCTTTGGTGGCTACGCTCAATCTGGCCTGTTGGCTGAAATCACCCCGGA
CAAAGCGTTCCAGGACAAGCTGTATCCGTTTACCTGGGATGCCGTACGTTACAACGGCAAGC
TGATTGCTTACCCGATCGCTGTTGAAGCGTTATCGCTGATTTATAACAAAGATCTGCTGCCG
AACCCGCCAAAAACCTGGGAAGAGATCCCGGCGCTGGATAAAGAACTGAAAGCGAAAGGTAA
GAGCGCGCTGATGTTCAACCTGCAAGAACCGTACTTCACCTGGCCGCTGATTGCTGCTGACG
GGGGTTATGCGTTCAAGTATGAAAACGGCAAGTACGACATTAAAGACGTGGGCGTGGATAAC
GCTGGCGCGAAAGCGGGTCTGACCTTCCTGGTTGACCTGATTAAAAACAAACACATGAATGC
AGACACCGATTACTCCATCGCAGAAGCTGCCTTTAATAAAGGCGAAACAGCGATGACCATCA
ACGGCCCGTGGGCATGGTCCAACATCGACACCAGCAAAGTGAATTATGGTGTAACGGTACTG
CCGACCTTCAAGGGTCAACCATCCAAACCGTTCGTTGGCGTGCTGAGCGCAGGTATTAACGC
CGCCAGTCCGAACAAAGAGCTGGCAAAAGAGTTCCTCGAAAACTATCTGCTGACTGATGAAG
GTCTGGAAGCGGTTAATAAAGACAAACCGCTGGGTGCCGTAGCGCTGAAGTCTTACGAGGAA
GAGTTGGTGAAAGATCCGCGGATTGCCGCCACTATGGAAAACGCCCAGAAAGGTGAAATCAT
GCCGAACATCCCGCAGATGTCCGCTTTCTGGTATGCCGTGCGTACTGCGGTGATCAACGCCG
CCAGCGGTCGTCAGACTGTCGATGAAGCCCTGAAAGACGCGCAGACTAATTCGAGCTCGAAC
AACAACAACAATAACAATAACAACAACCTCGGGATCGAGGGAAGGATTTCAGAATTCGAATT
C
CATATGGCCAGTGCCACCGGTAGTAGTCAGCCAGCCGCGGAAAGCAGCACCCAGCAACGCC
ACATCATCATCATCATCATCACTGA
ATGAAAATCGAAGAAGGTAAACTGGTAATCTGGATTAACGGCGATAAAGGCTATAACGGTCT
CGCTGAAGTCGGTAAGAAATTCGAGAAAGATACCGGAATTAAAGTCACCGTTGAGCATCCGG
ATAAACTGGAAGAGAAATTCCCACAGGTTGCGGCAACTGGCGATGGCCCTGACATTATCTTC
TGGGCACACGACCGCTTTGGTGGCTACGCTCAATCTGGCCTGTTGGCTGAAATCACCCCGGA
CAAAGCGTTCCAGGACAAGCTGTATCCGTTTACCTGGGATGCCGTACGTTACAACGGCAAGC
TGATTGCTTACCCGATCGCTGTTGAAGCGTTATCGCTGATTTATAACAAAGATCTGCTGCCG
AACCCGCCAAAAACCTGGGAAGAGATCCCGGCGCTGGATAAAGAACTGAAAGCGAAAGGTAA
GAGCGCGCTGATGTTCAACCTGCAAGAACCGTACTTCACCTGGCCGCTGATTGCTGCTGACG
GGGGTTATGCGTTCAAGTATGAAAACGGCAAGTACGACATTAAAGACGTGGGCGTGGATAAC
GCTGGCGCGAAAGCGGGTCTGACCTTCCTGGTTGACCTGATTAAAAACAAACACATGAATGC
AGACACCGATTACTCCATCGCAGAAGCTGCCTTTAATAAAGGCGAAACAGCGATGACCATCA
ACGGCCCGTGGGCATGGTCCAACATCGACACCAGCAAAGTGAATTATGGTGTAACGGTACTG
CCGACCTTCAAGGGTCAACCATCCAAACCGTTCGTTGGCGTGCTGAGCGCAGGTATTAACGC
CGCCAGTCCGAACAAAGAGCTGGCAAAAGAGTTCCTCGAAAACTATCTGCTGACTGATGAAG
GTCTGGAAGCGGTTAATAAAGACAAACCGCTGGGTGCCGTAGCGCTGAAGTCTTACGAGGAA
GAGTTGGTGAAAGATCCGCGGATTGCCGCCACTATGGAAAACGCCCAGAAAGGTGAAATCAT
GCCGAACATCCCGCAGATGTCCGCTTTCTGGTATGCCGTGCGTACTGCGGTGATCAACGCCG
CCAGCGGTCGTCAGACTGTCGATGAAGCCCTGAAAGACGCGCAGACTAATTCGAGCTCGAAC
AACAACAACAATAACAATAACAACAACCTCGGGATCGAGGGAAGGATTTCAGAATTCGAATT
C
CATATGGCCAGTGCCACCGGTAGTAGTCAGCCAGCCGCGGAAAGCAGCACCCAGCAACGCC
ACATCATCATCATCATCATCACTGA
HMKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDII
FWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLL
PNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVD
NAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTV
LPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYE
EELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGR
Q
TVDEALKDAQTNSSS
NNNNNNNNNNLGIEGRISEFEFHM
ASATGSSQPAAESSTQQRPGVPAGPRPLDGYLGVADHK
MKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIF
WAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP
NPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDN
AGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVL
PTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEE
ELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTNSSSN
NNNNNNNNNLGIEGRISEF
EFHMASATGSSQPAAESSTQQRPGVPAGPRPLDGYLGVADHKP
MKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIF
WAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP
NPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDN
AGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVL
PTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEE
ELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTNSSSN
NNNNNNNNNLGIEGRISEF
EFHMASATGSSQPAAESSTQQRPGVPAGPRPLDGYLGVADHKP
MKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIF
WAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP
NPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDN
AGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVL
PTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEE
ELVKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTNSSSN
NNNNNNNNNLGIEGRISEF
EFHMASATGSSQPAAESSTQQRPGVPAGPRPLDGYLGVADHKP
ATGAAAATCGAAGAAGGTAAACTGGTAATCTGGATTAACGGCGAT
AAAGGCTATAACGGTCTCGCTGAAGTCGGTAAGAAATTCGAGAAA
GATACCGGAATTAAAGTCACCGTTGAGCATCCGGATAAACTGGAA
GAGAAATTCCCACAGGTTGCGGCAACTGGCGATGGCCCTGACATT
ATCTTCTGGGCACACGACCGCTTTGGTGGCTACGCTCAATCTGGC
CTGTTGGCTGAAATCACCCCGGACAAAGCGTTCCAGGACAAGCTG
TATCCGTTTACCTGGGATGCCGTACGTTACAACGGCAAGCTGATT
GCTTACCCGATCGCTGTTGAAGCGTTATCGCTGATTTATAACAAA
GATCTGCTGCCGAACCCGCCAAAAACCTGGGAAGAGATCCCGGCG
CTGGATAAAGAACTGAAAGCGAAAGGTAAGAGCGCGCTGATGTTC
AACCTGCAAGAACCGTACTTCACCTGGCCGCTGATTGCTGCTGAC
GGGGGTTATGCGTTCAAGTATGAAAACGGCAAGTACGACATTAAA
GACGTGGGCGTGGATAACGCTGGCGCGAAAGCGGGTCTGACCTTC
CTGGTTGACCTGATTAAAAACAAACACATGAATGCAGACACCGAT
TACTCCATCGCAGAAGCTGCCTTTAATAAAGGCGAAACAGCGATG
ACCATCAACGGCCCGTGGGCATGGTCCAACATCGACACCAGCAAA
GTGAATTATGGTGTAACGGTACTGCCGACCTTCAAGGGTCAACCA
TCCAAACCGTTCGTTGGCGTGCTGAGCGCAGGTATTAACGCCGCC
AGTCCGAACAAAGAGCTGGCAAAAGAGTTCCTCGAAAACTATCTG
CTGACTGATGAAGGTCTGGAAGCGGTTAATAAAGACAAACCGCTG
GGTGCCGTAGCGCTGAAGTCTTACGAGGAAGAGTTGGCGAAAGAT
CCACGTATTGCCGCCACTATGGAAAACGCCCAGAAAGGTGAAATC
ATGCCGAACATCCCGCAGATGTCCGCTTTCTGGTATGCCGTGCGT
ACTGCGGTGATCAACGCCGCCAGCGGTCGTCAGACTGTCGATGAA
GCCCTGAAAGACGCGCAGACTAATTCGAGCTCGAACAACAACAAC
AATAACAATAACAACAACCTCGGGATCGAGGGAAGGATTTCAGAA
TTC
ATGTTCGGCGGAGATCCTTCTTTCCAACGCTTAAATATTAGC
SEQ ID NO: 9 shows the nucleic acid sequence of MBP-Δ25hGCNT2-B-His6. The sequences shown in italics and underlined were from the pMAL-c2X vector.
MKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLE
EKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKL
YPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPA
LDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIK
DVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAM
TINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAA
SPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKD
PRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDE
ALKDAQTNSSSNNNNNNNNNNLGIEGRISEF
MFGGDPSFQRLNIS
SEQ ID NO: 10 shows the amino acid sequence of MBP-Δ25hGCNT2-B-His6. The sequences shown in italics and underlined were from the pMAL-c2X vector.
SEQ ID NO: 11 is the forward primer sequence for generating the A200Y mutation as described herein.
SEQ ID NO: 12 is the reverse primer sequence for generating the A200Y mutation as described herein.
SEQ ID NO: 13 is the forward primer sequence for generating the S232Y mutation as described herein.
SEQ ID NO: 14 is the reverse primer sequence for generating the S232Y mutation as described herein.
SEQ ID NO: 15 is the forward primer sequence for generating the V99M mutant as described herein.
SEQ ID NO: 16 is the reverse primer sequence for generating the V99M mutant as described herein.
SEQ ID NO: 17 is the forward primer sequence for generating the ext20 mutant as described herein.
SEQ ID NO: 18 is the reverse primer sequence for generating the ext20 mutant as described herein.
Although the foregoing has been described in some detail by way of illustration and example for purposes of clarity and understanding, one of skill in the art will appreciate that certain changes and modifications can be practiced within the scope of the appended claims. In addition, each reference provided herein is incorporated by reference in its entirety to the same extent as if each reference was individually incorporated by reference.
The present application claims priority to U.S. Provisional Pat. Appl. No. 63/300,989, filed on Jan. 19, 2022, which application is incorporated herein by reference in its entirety.
This invention was made with Government support under Grant No. U01GM120419 and Grant No. R01GM148568, awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/060916 | 1/19/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63300989 | Jan 2022 | US |