The present invention reveals novel methods for producing novel carbohydrate compositions, glycomes, from animal tissues. The tissue substrate materials can be total tissue samples and fractionated tissue parts, or artificial models of tissues such as cultivated cell lines. The invention is further directed to the compositions and compositions produced by the methods according to the invention. The invention further represent methods for analysis of the glycomes, especially mass spectrometric methods. The invention is further directed to novel human glycomes comprising novel human oligosaccharides. Human glycomes comprising novel human oligosaccharides such as i) human LEC14 and/or LEC18 structures ii) Galβ1-4(Fucα1-3)GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc. Furthermore the present invention revealed novel human glycomes with specific ranges of glycan groups.
Multiple methods to produce and analyze oligosaccharides from isolated glycoproteins are known. The present invention is directed to specific methods to release and purify total oligosaccharide pools quantitatively from tissues. The invention is specifically directed to methods using very low amounts of tissues. It is realized that purification of an oligosaccharide mixture from complex tissue samples to level of purity useful for analysis is more complex task than isolation of the oligosaccharides from purified proteins. It is further realized that the purification methods are novel and useful for the effective analysis of protein derived glycans. Human glycomes comprising novel human oligosaccharides such as i) human LEC14 and/or LEC18 structures or ii) Galβ1-4(Fucα1-3)GlcNAcβ1-3(Galβ-GlcNAcβ1-6)Galβ1-4Glc has not been known. Furthermore the present invention revealed novel human glycomes with specific ranges of glycan groups such ranges have not been known previously. The invention is directed to the analysis of O-linked, N-linked and glycolipid derived glycomes.
The present invention reveals novel methods for producing novel carbohydrate compositions, glycomes from animal tissues, preferably from vertebrates, more preferably human and mammalian tissues. The tissue substrate materials can be total tissue samples and fractionated tissue parts, such as serums, secretions and isolated differentiated cells from the tissues, or artificial models of tissues such as cultivated cell lines. In a preferred embodiment the invention is directed to special methods for the analysis of the surfaces of tissues. The invention is further directed to the compositions and compositions produced by the methods according to the invention. The invention further represent preferred methods for analysis of the glycomes, especially mass spectrometric methods.
The invention represents effective methods for purification of oligosaccharide fractions from tissues, especially in very low scale. The prior art has shown analysis of separate glycome components from tissues, but not total glycomes. It is further realized that the methods according to the invention are useful for analysis of glycans from isolated proteins or peptides. The invention represents effective methods for the practical analysis of glycans from isolated proteins especially from very small amounts of samples.
The invention is further directed to novel quantitative analysis methods for glycomes. The glycome analysis produces large amounts of data. The invention reveals methods for the analysis of such data quantitatively and comparison of the data between different samples. The invention is especially directed to quantitative two-dimensional representation of the data.
The present invention is specifically directed to glycomes of tissues according to the invention comprising glycan material with monosaccharide composition for each of glycan mass components according to the Formula Mn:
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6)]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNβ4([{Fucα6}]mGNyR2)z
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m, and z are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0;
y is anomeric linkage structure a and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein;
[ ] and ( ) indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
{ } indicates a branch in the structure,
with the provision that z is 0 indicating soluble mannose-GlcNAc1-glycome or “low mannose glycome, wherein there is 5, more preferably 4 or less mannose residues or m is 1 and there is 6 or less mannose units.
The soluble mannose-GlcNAc1-glycome or “soluble (N-)glycome” has not been quantitatively known in context of regular N-glycan glycome with two core GlcNAc-units from cell or tissue materials. The present invention revealed that there is substantial amount of soluble glycans comparable to amounts of releasable neutral N-glycomes. Typical glycomes comprise of subgroups of glycans, including N-glycans, O-glycans, glycolipid glycans, and neutral and acidic subglycomes.
The preferred analysis method includes:
1) Preparing a tissue sample containing glycans for the analysis
2) Releasing total glycans or total glycan groups from a tissue sample, or extracting free glycans from a tissue sample
3) Optionally modifying glycans
4) Purification of the glycan fraction/fractions from biological material of the sample
5) Optionally modifying glycans
6) Analysis of the composition of the released glycans preferably by mass spectrometry
7a) Optionally presenting the data about released glycans quantitatively and
7b) Comparing the quantitative data set with another data set from another tissue sample or
8) Comparing data about the released glycans quantitatively or qualitatively with data produced from another tissue sample
The invention is directed to diagnosis of clinical state of tissue samples, based on analysis of glycans present in the samples. The invention is especially directed to diagnosing cancer and the clinical state of cancer.
The invention is further directed to structural analysis of glycan mixtures present in tissue samples.
Glycomes—Novel Glycan Mixtures from Tissue Samples
The present invention reveals novel methods for producing novel carbohydrate compositions, glycomes from animal tissues, preferably from vertebrates, more preferably human and mammalian tissues. The tissue substrate materials can be total tissue samples and
fractionated tissue parts, such as serums, secretions and isolated differentiated cells from the tissues, or
artificial models of tissues such as cultivated cell lines.
The invention revealed that the glycan structures on cell surfaces vary between the various tissues and same tissues under changing conditions.
The glycan structures on cell surfaces in general have been known to have numerous biological roles. Thus the knowledge about exact glycan mixtures from cell or tissue surfaces is important for knowledge about the status of cells. The invention revealed that multiple conditions affect the cells and cause changes in their glycomes.
The inventors were able to release or isolate various glycan fractions from tissue materials, which are useful for the characterization of the cellular material. The glycans or major part thereof are released preferably from glycoproteins or glycolipids of tissue samples. The invention is specifically directed to such glycan fractions.
The glycan fractions of tissue samples comprise typically multiple, at least about 10 “glycan mass components” typically corresponding at least ten glycans and in most cases clearly more than 10 glycan structures.
The glycan mass components correspond to certain molecular weights observable by mass spectrometry and further correspond to specific monosaccharide composition or monosaccharide compositions. Each monosaccharide component is normally present in a glycan as glycosidically linked monosaccharide residue in the nonreducing end part of glycan and the reducing end monosaccharide may be in free alditol form or modified for example by reduction or conjugated to an reducing end modifying reagent well known in the art or to one, two or several amino acids in case of glycopeptides. Monosaccharide composition can be obtained from molecular mass in a mass spectrum (glycan mass component) after correcting potential effect of the ion forms observable by the specific mass spectrometry technologue such as protonation/deprotonation, Na+, K+, Li+, or other adduct combinations, or isotope pattern derived effects. The monosaccharide compositions are calculated by fitting mixtures of individual monosaccharide (residue) masses and modification groups to corrected molecular mass of glycan mass component. Typically the molecular mass of fitting composition and the experimental mass correspond to each other very closely with similar first and even second decimals with optimal calibration.
The fitting may be further checked by measuring the experimental mass difference from the smaller and/or larger glycan mass component next in the putative biosynthetic serie of a glycan type and comparing the difference with the exact molecular mass of corresponding monosaccharide unit (residue), typically the mass differences of fitting components in a good quality mass spectrum and with correct marking of peaks in decimals, preferably in second or third decimal of the mass number depending on the resolution of the specific mass spectrometric method. For optimal mass accuracy, an internal calibration may be used, where two or more known component's mass peaks are used to re-calculate masses for each components in the spectrum. Such calibration components are preferably selected among the most abundant glycan signals present in the glycan profiles, in the case of human or other animal cell derived glycan profiles most preferably selected among the most abundant glycan signals present in Figures described in the present invention.
The monosaccharide composition includes monosaccharide component names and number, typically as subscript, indicating how many of the individual mass components is present in the monosaccharide composition; and names of assigned modifying groups and numbers indicating their abundance.
It is further realized that the masses of glycan mass component may be obtained as exact monoisotopic mass of usually smallest isotope of the glycan mass component or as an average mass of the isotope distribution of the glycan mass component. Exact mass is calculated form exact masses of individual mass components and average from masses average masses of individual mass components. Person skilled in art can recognize from the peak shapes (i.e. by the resolution obtained) in the mass spectrum whether to use monoisotopic or average masses to interpret the spectra. It is further realized that average and exact masses can be converted to each other when isotope abundances of molecules are known, typically natural abundance without enrichment of isotopes can be assumed, unless the material is deliberately labelled with radioactive or stable isotopes.
It is further realized that specific rounded mass numbers can be used as names for glycan mass components. The present invention uses preferably mass numbers rounded down from the exact mass of the monosaccharide composition (and usually observable or observed mass) to closest integer as names of glycan mass components.
The masses of gylcan mass components are obtained by calculating molecular mass of individual monosaccharide components (Hex, HexNAc, dHex, sialic acids) from the known atom compositions (for example hexose (Hex) corresponds to C6H12O6) and subtracting for water in case of monosaccharide residue, followed by calculating the sum of the monosaccharide components (and possible modifications such as SO3 or PO3H) It is further realized that molecular masses of glycans may be calculated from atomic compositions or any other suitable mass units corresponding molecular masses of these. The molecular masses and calculation thereof are known in the art and masses of monosaccharide components/residues are available in tables with multiple decimals from various sources.
It is further realized that many of the individual monosaccharide compositions described in the present invention further correspond to several isomeric individual glycans. In addition, there exist also monosaccharide compositions that have nearly equal masses, for example dHex2 and NeuAc monosaccharide residues that have nearly equal masses, and other examples can be presented by a person skilled in the art. It is realized that the ability to differentiate compositions with nearly equal masses depends on instrumentation, and the present method is especially directed to a possibility to select also such compositions in place of proposed compositions.
The preferred glycans in glycomes comprise at least two of following monosaccharide component residues selected from group: Hexoses (Hex) which are Gal, Glc and Man; N-acetylhexosamines (HexNAc) which are GlcNAc and GalNAc; pentose, which is Xyl; Hexuronic acids which are GleA and IdoA; deoxyhexoses (dHex), which is fucose and sialic acids which are NeuAc and/NeuGc; and further modification groups such as acetate (Ac), sulphate and phosphate forming esters with the glycans. The monosaccharide residues are further grouped as major backbone monosaccharides including GlcNAc, HaxA, Man and Gal; and specific terminal modifying monosaccharide units Glc, GalNAc, Xyl and sialic acids.
The present invention is directed to analyzing glycan components from biological samples, preferably as mass spectrometric signals. Specific glycan modifications can be detected among the detected signals by determined indicative signals as exemplified below. Modifications can also be detected by more specific methods such as chemical or physical methods, for example mass spectrometric fragmentation or glycosidase detection as disclosed in the present invention. In a preferred form of the present method, glycan signals are assigned to monosaccharide compositions based on the detected m/z ratios of the glycan signals, and the specific glycan modifications can be detected among the detected monosaccharide compositions.
In a further aspect of the present invention, relative molar abundances of glycan components are assigned based on their relative signal intensities detected in mass spectrometry as described in the Examples, which allows for quantification of glycan components with specific modifications in relation to other glycan components. The present method is also directed to detecting changes in relative amounts of specific modifications in cells at different time points to detect changes in cell glycan compositions.
The invention is specifically directed to glycan compositions, which further comprise at least one monosaccharide component in free form, preferably a preferred monosaccharide component described above. The monosaccharide comprising compositions are in a preferred embodiment derived from a cell material or released glycomes, which has been in contact with monosaccharide releasing chemicals or enzymes, preferably with exoglycosidase enzymes or chemicals such as oxidating reagents and/or acid, more preferably with a glycosidase enzyme. The invention is further directed to compositions comprising a specific preferred monosaccharide according to the invention, an exoglycosidase enzyme capable releasing all or part of the specific monosaccharide and an glycan composition according to the invention from which at least part of the terminal specific monosaccharide has been released.
It is further realized that by increasing the sensitivity of detection the number of glycan mass components can be increased. The analysis according to the invention can be in most cases performed from major or significant components in the glycome mixture. The present invention is preferably directed to detection of glycan mass components from a high quality glycan preparation with optimised experimental condition, when the glycan mass components have abundance at least higher than 0.01% of total amount of glycan mass components, more preferably of glycan mass components of abundance at least higher than 0.05%, and most preferably at least higher than 0.10% are detected. The invention is further directed practical quality glycome compositions and analytic process directed to it, when glycan mass components of at least about 0.5%, of total amount of glycan mass components, more preferably of glycan mass components of abundance at least higher than 1.0%, even more preferably at least higher than 2.0%, most preferably at least higher than 4.0% (presenting lower range practical quality glycome), are detected. The invention is further directed to glycomes comprising preferred number of glycan mass components of at least the abundance of observable in high quality glycomes, and in another embodiment glycomes comprising preferred number of glycan mass components of at least the abundance of observable in practical quality glycomes.
It further realized that fractionation or differential specific release methods of glycans from glycoconjugates can be applied to produce subglycomes containing part of glycome.
The subglycomes produced by fractionation of glycomes are called “fractionated subglycomes”.
The glycomes produced by specific release methods are “linkage-subglycomes”. The invention is further directed to combinations of linkage-subglycomes and fractionated subglycomes to produce “fractionated linkage-subglycomes”, for example preferred fractionated linkage-subglycomes includes neutral O-glycans, neutral N-glycans, acidic O-glycans, and acidic N-glycans, which were found very practical in characterising target material according to the invention.
The fractionation can be used to enrich components of low abundance. It is realized that enrichment would enhance the detection of rare components. The fractionation methods may be used for larger amounts of cell material. In a preferred embodiment the glycome is fractionated based on the molecular weight, charge or binding to carbohydrate binding agents.
These methods have been found useful for specific analysis of specific subglycomes and enrichment more rare components. The present invention is in a preferred embodiment directed to charge based separation of neutral and acidic glycans. This method gives for analysis method, preferably mass spectroscopy material of reduced complexity and it is useful for analysis as neutral molecules in positive mode mass spectrometry and negative mode mass spectrometry for acidic glycans.
Differential release methods may be applied to get separately linkage specific subglycomes such as O-glycan, N-glycan, glycolipid or proteoglycan comprising fractions or combinations thereof. Chemical and enzymatic methods are known for release of specific fractions, furthermore there are methods for simultaneous release of O-glycans and N-glycans.
It is realized that at least part of the glycomes have novelty as novel compositions of very large amount of components. The glycomes comprising very broad range substances are referred as complete glycomes.
Preferably the composition is a complete composition comprising essentially all degrees of polymerisation in general from at least about disaccharides, more preferably from trisaccharides to at least about 25-mers in a high resolution case and at least to about 20-mers or at least about 15-mer in case of medium and practical quality preparations. It is realized that especially the lower limit, but also upper limit of a subglycome depend on the type of subglycome and/or method used for its production. Different complete ranges may be produced in scope of general glycomes by fractionation, especially based on size of the molecules.
Novel Compositions with New Combinations of Subglycomes and Preferred Glycan Groups
It is realized that several glycan types are present as novel glycome compositions produced from the tissue samples. The invention is specifically directed to novel mixture composition comprising different subglycomes and preferred glycan groups
It is realised that the glycome composition a as described in examples represent quantitatively new data about glycomes from the preferred tissue sample types. The proportions of various components cannot be derived from background data and are very useful for the analysis methods according to the invention. The invention is specifically directed to glycome compositions according to the examples when the glycan mass components are present in essentially similar relative amounts.
The present invention is specifically directed to glycomes of tissue samples according to the invention comprising glycan material with monosaccharide composition for each of glycan mass components according to the Formula I:
NeuAcmNeuGcnHexoHexNAcpdHexqHexArPensActModXx, (I)
where m, n, o, p, q, r, s, t, and x are independent integers with values ≧0 and less than about 100,
with the proviso that for each glycan mass components at least two of the backbone monosaccharide variables o, p, or r is greater than 0, and
ModX represents a modification (or N different modifications Mod1, Mod2, . . . , ModN), present in the composition in an amount of x (or in independent amounts of x1, x2, . . . , xN), Preferably examples of such modifications (Mod) including for example SO3 or PO3H indicating esters of sulfate and phosphate, respectively
and the glycan composition is preferably derived from isolated human tissue samples or preferred subpopulations thereof according to the invention.
It is realized that usually glycomes contain glycan material for which the variables are less much less than 100, but large figures may be obtained for polymeric material comprising glycomes with repeating polymer structures, for example ones comprising glycosaminoglycan type materials. It is further realized that abundance of the glycan mass components with variables more than 10 or 15 is in general very low and observation of the glycome components may require purification and enrichment of larger glycome components from large amounts of samples.
In a preferred embodiment the invention is directed to broad mass range glycomes comprising polymeric materials and rare individual components as indicated above. Observation of large molecular weight components may require enrichment of large molecular weight molecules comprising fraction. The broad general compositions according to the Formula I are as described above,
with the proviso that
m, n, o, p, q, r, s, t, and x are independent integers with preferable values between 0 and 50, with the proviso that for each glycan mass components at least two of o, p, or r is at least 1, and the sum of the monosaccharide variables; m, n, o, p, q, r, and s, indicating the degree of polymerization or oligomerization, for each glycan mass component is less than about 100 and the glycome comprises at least about 20 different glycans of at least disaccharides.
In a preferred embodiment the invention is directed to practical mass range and high quality glycomes comprising lower molecular weight ranges of polymeric material. The lower molecular weight materials at least in part and for preferred uses are observable by mass spectrometry without enrichment.
In a more preferred general composition according to the Formula I as described above, m, n, o, p, q, r, s, t, and x are independent integers with preferable values between 0 and about 20, more preferably between 0 and about 15, even more preferably between 0 and about 10, with the proviso that at least two of o, p, or r is at least 1, and the sum of the monosaccharide variables; m, n, o, p, q, r, and s, indicating the degree of polymerization or oligomerization, for each glycan mass component is less than about 50 and more preferably less than about 30,
and the glycome comprises at least about 50 different glycans of at least trisaccharides.
In a preferred embodiment the invention is directed to practical mass range high quality glycomes which may comprise some lower molecular weight ranges of polymeric material. The lower molecular weight materials at least in part and for preferred uses are observable by mass spectrometry without enrichment.
In a more preferred general composition according to the Formula I as described above, m, n, o, p, q, r, s, t, and x are independent integers with preferable values between 0 and about 10, more preferably between 0 and about 9, even more preferably, between 0 and about 8, with the proviso that at least two of o, p, or r is at least 1,
and the sum of the monosaccharide variables; m, n, o, p, q, r, and s, indicating the degree of polymerization or oligomerization, for each glycan mass component is less than about 30 and more preferably less than about 25,
and the glycome comprises at least about 50 different glycans of at least trisaccharides.
The practical mass range glycomes may typically comprise tens of components, for example in positive ion mode MALDI-TOF mass spectrometry for neutral subglycomes it is usually possible to observe even more than 50 molecular mass components, even more than 100 mass component corresponding to much larger number of potentially isomeric glycans. The number of components detected depends on sample size and detection method.
The present invention is specifically directed to subglycomes of tissue sample glycomes according to the invention comprising glycan material with monosaccharide compositions for each of glycan mass components according to the Formula I and as defined for broad and practical mass range glycomes. Each subglycome has additional characteristics based on glycan core structures of linkage-glycomes or fractionation method used for the fractionated glycomes. The preferred linkage glycomes includes:
Protein N-glycosidase releases N-glycans comprising typically two N-acetylglycosamine units in the core, optionally a core linked fucose unit and typically then 2-3 hexoses (core mannoses), after which the structures may further comprise hexoses being mannose or in complex-type N-glycans further N-acetylglycosamines and optionally hexoses and sialic acids.
N-glycan subglycomes released by protein N-glycosidase comprise N-glycans containing N-glycan core structure and are releasable by protein N-glycosidase from cells.
The N-glycan core structure is Manβ4GlcNAcβ(Fucα6)n4GlcNAc, wherein n is 0 or 1 and the N-glycan structures can be elongated from the Manβ4 with additional mannosyl residues. The protein N-glycosidase cleaves the reducing end GlcNAc from Asn in proteins. N-glycan subglycomes released by endo-type N-glycosidases cleaving between GlcNAc units contain Manβ4GlcNAcβ-core, and the N-glycan structures can be elongated from the Manβ4 with additional mannosyl residues.
In case the Subglycome and analysis representing it as Glycan profile is formed from N-glycans liberated by N-glycosidase enzyme, the preferred additional constraints for Formula I are:
p>0, more preferably 1≦p≦100, typically p is between 2 and about 20, but polymeric structures containing glycomes may comprise larger amounts of HexNAc and it is realized that in typical core of N-glycans indicating presence of at least partially complex type structure
when p≧3 it follows that o≧1.
In case the Subglycome and analysis representing it as Glycan profile is formed from lipid-linked glycans liberated by endoglycoceramidase enzyme, the preferred additional constraints for Formula I are:
o≦0, more preferably 1≦o≦100, and when p≧1 it follows that o≧2.
Typically glycolipids comprise two hexoses (alactosylresidue) at the core. The degree of oligomerization in a usual practical glycome from glycolipids is under about 20 and more preferably under 10. Very large structures comprising glycolipids, polyglycosylceramides, may need enrichment for effective detection.
Most preferred fractionated Subglycomes includes 1) subglycome of neutral glycans and 2) subglycome of acidic glycans. The major acidic monosaccharide unit is in most cases a sialic acid, the acidic fraction may further comprise natural negatively charged structure/structures such as sulphate(s) and/phosphate(s).
In case the Subglycome and analysis representing it as Glycan profile is formed from sialylated glycans, the preferred additional constraints for Formula I are:
(m+n)>0, more preferably 1≦(m+n)≦100.
Large amounts of sialic acid in a glycan mass component would indicate presence of polysailic acid type structures. Practical and high resolutions acidic glycomes usually have m+n values for individual major glycan mass components with preferred abundance between 1 and 10, more preferably and of the between 1-5 and most preferably between 1-4 for a usual glycomes according to the invention. For neutral glycans, (m+n)=0, and they do not contain negatively charged groups as above.
The present invention is specifically directed to the glycomes of tissue samples according to the invention comprising as major components at least one of structure groups selected from the groups described below.
According to the present invention, the Glycan signals are optionally organized into Glycan groups and Glycan group profiles based on analysis and classification of the assigned monosaccharide and modification compositions and the relative amounts of monosaccharide and modification units in the compositions, according to the following classification rules:
1° The glycan structures are described by the formulae:
HexmHexNAcndHexoNeuAcpNeuGcqPenrMod1sMod1Mod2sMod2 . . . ModXsModX,
wherein m, n, o, p, q, individual sMod, and X, are each independent variables, and Mod is a functional group covalently linked to the glycan structure.
2° Glycan structures in general are classified as follows:
a. Structures (p,q=0) are classified as “non-sialylated”,
b. Structures (p,q>0) are classified as “sialylated”,
c. Structures (q>0) are classified as “NeuGc-containing”,
d. Relation [2 (p+q):(m+n)] describes the general sialylation degree of a glycan structure,
e. In the case of mammalian glycans, structures (o=0) are classified as “non-fucosylated”,
f. In the case of mammalian glycans, structures (o>0) are classified as “fucosylated”,
g. Structures (Mod=Ac and sAc>0) are classified as ‘acetylated’,
h. Structures (Mod=SO3 and sSO3>0) are classified as ‘sulfated’, and
i. Structures (Mod=PO3H and sPO3H>0) are classified as ‘phosphorylated’.
3° N-glycan glycan structures, generated e.g. by the action of peptide-N-glycosidases, are classified as follows:
a. Structures (n=2 and m>0 and p,q=0) are classified as “mannose-terminated N-glycans”,
b. Structures (n=2 and m≧5 and o,p,q=0) are classified as “high-mannose N-glycans”,
c. Structures (n=2 and m≧5 and o>0 and p,q=0) are classified as “fucosylated high-mannose N-glycans”,
d. Structures (n=2 and 4≧m≧1 and p,q=0) are classified as “low-malmose N-glycans”,
e. Structures (n=2 and 4≧m≧1 and o>0 and p,q=0) are classified as “fucosylated low-mannose N-glycans”,
f. Structures (n=3 and m≧2) are classified as “hybrid-type or monoantennary N-glycans”,
g. Structures (n≧4 and m≧3) are classified as “complex-type N-glycans”,
h. Structures (n≧m≧2) are classified as “N-glycans containing non-reducing terminal N-acetylhexosamine”,
i. Structures (n=m≧5) are classified as “N-glycans potentially containing bisecting N-acetylglucosamine”,
j. In the case of mammalian N-glycans, structures (o≧2) are classified as “N-glycans containing α2-, α3-, or α4-linked fucose”,
k. Relation [2(p+q):(m+n−5)] describes the “overall sialylation degree” of a sialylated N-glycan structure, and
l. Specifically, sum (p+q) describes the “sialylation degree” of a sialylated hybrid-type or monoantennary N-glycan structure.
4° Mucin-type O-glycan structures, generated e.g. by alkaline β-elimination, are classified as follows:
a. Structures (n=m), with (n=n=m), are classified as “Type N O-glycans”,
b. More specifically, structures (n=m=1) are classified as “Type 1 O-glycans”,
c. More specifically, structures (n=m=2) are classified as “Type 2 O-glycans”,
d. More specifically, structures (n=m=3) are classified as “Type 3 O-glycans”,
e. Relation [2 (p+q):(m+n)] describes the overall sialylation degree of a sialylated N-glycan structure, and
f. Specifically, relation [(p+q):N] describes the sialylation degree of a sialylated Type N O-glycan structure.
Lipid-linked can also be classified into structural groups based on their monosaccharide compositions, as adopted from the classifications above according to the invention.
For example, glycan signal corresponding to a tissue sample N-glycan structure:
Hex5HexNAc4dHex2NeuAc1Ac1,
is classified as belonging to the following Glycan Groups:
The present invention revealed novel unexpected components among in the glycomes studied. The present invention is especially directed to glycomes comprising such unusual materials
It is further realized that the glycans may be derivatized chemically during the process of release and isolation. Preferred modifications include modifications of the reducing end and or modifications directed especially to the hydroxyls- and/or N-atoms of the molecules. The reducing end modifications include modifications of reducing end of glycans involving known derivatization reactions, preferably reduction, glycosylamine, glycosylamide, oxime (aminooxy-) and reductive amination modifications. Most preferred modifications include modification of the reducing end. The derivatization of hydroxyl- and/or amine groups, such as produced by methylation or acetylation methods including permethylation and peracetylation has been found especially detrimental to the quantitative relation between natural glycome and the released glycome.
In a preferred embodiment the invention is directed to non-derivatized released glycomes. The benefit of the non-derivatized glycomes is that less processing needed for the production. The non-derivatized released glycomes correspond more exactly to the natural glycomes from which these are released. The present invention is further directed to quantitative purification according to the invention for the non-derivatized releases glycomes and analysis thereof.
The present invention is especially directed to released glycomes when the released glycome is not a permodified glycome such as permethylated glycome or peracetyated glycome. The released glycome is more preferably reducing end derivatized glycome or a non derivatized glycome, most preferably non-derivatized glycome.
The present invention is further directed to novel total compositions of glycans or oligosaccharides referred as glycomes and in a more specific embodiment as released glycomes observed from or produced from the target material according to the invention. The released glycome indicates the total released glycans or total specific glycan subfractions released from the target material according to the invention. The present invention is specifically directed to released glycomes meaning glycans released from the target material according to the invention and to the methods according to the invention directed to the glycomes.
The present invention preferably directed to the glycomes released as truncated and/or non-truncated glycans and/or derivatized according to the invention.
The invention is especially directed to N-linked and/or O-linked and/or Lipid linked released glycomes from the target material according to the invention. The invention is more preferably directed to released glycomes comprising glycan structures according to the invention, preferably glycan structures as defined in formula I. The invention is more preferably directed to N-linked released glycomes comprising glycan structures according to the invention, preferably glycan structures as defined in formula I.
In a preferred embodiment the invention is directed to non-derivatized released cell surface glycomes. The non-derivatized released cell surface glycomes correspond more exactly to the fractions of glycomes that are localized on the cell surfaces, and thus available for biological interactions. These cell surface localized glycans are of especial importance due to their availability for biological interactions as well as targets for reagents (e.g. antibodies, lectins etc. . . . ) targeting the cells or tissues of interest. The invention is further directed to release of the cell surface glycomes, preferably from intact cells by hydrolytic enzymes such as proteolytic enzymes, including proteinases and proteases, and/or glycan releasing enzymes, including endo-glycosidases or protein N-glycosidases. Preferably the surface glycoproteins are cleaved by proteinase such as trypsin and then glycans are analysed as glycopeptides or preferably released further by glycan releasing enzyme.
Analysis of the glycan mixtures by physical means, preferably by mass spectrometry The present invention is directed to analysis of glycan mixtures present in tissue samples.
The invention is directed to novel methods for qualitative analysis of glycome data. The inventors noticed that there are specific components in glycomes according to the invention, the presence or absence of which are connected or associated with specific cell type or cell status. It is realized that qualitative comparison about the presence of absence of such signals are useful for glycome analysis. It is further realized that signals either present or absent that are derived from a general glycome analysis may be selected to more directed assay measuring only the qualitatively changing component or components optionally with a more common component or components useful for verification of data about the presence or absence of the qualitative signal.
The present invention is further specifically directed to quantitative analysis of glycan data from tissue samples. The inventors noted that quantitative comparisons of the relative abundances of the glycome components reveal substantial differences about the glycomes useful for the analysis according to the invention.
The process contains essential key steps which should be included in every process according to the present invention.
The essential key steps of the analysis are:
1. Release of total glycans or total glycan groups from a tissue sample
2. Purification of the glycan fraction/fractions from biological material of the sample, preferably by a small scale column array or an array of solid-phase extraction steps
3. Analysis of the composition of the released glycans, preferably by mass spectrometry
In most cases it is useful to compare the data with control sample data. The control sample may be for example from a healthy tissue or cell type and the sample from same tissue altered by cancer or another disease. It is preferable to compare samples from same individual organism, preferably from the same human individual.
Comparative Analysis
The steps of a comparative analysis are:
1. Release of total glycans or total glycan groups from tissue sample
2. Purification of the glycan fraction/fractions from biological material of the sample, preferably by a small scale column array or an array of solid-phase extraction steps
3. Analysis of the composition of the released glycans, preferably by mass spectrometry
4. Comparing data about the released glycans quantitatively or qualitatively with data produced from another tissue sample
It may be useful to analyse the glycan structural motifs present in the sample, as well as their relative abundances. The ability to elucidate structural motifs results from the quantitative nature of the present analysis procedure, comparison of the data to data from previously analyzed samples, and knowledge of glycan biosynthesis.
The glycome analysis may include characterization of structural motives of released glycans. The structural motif analysis may be performed in combination with structural analysis. Preferred methods to reveal specific structural motifs include
a) direct analysis of specific structural modifications of the treatment of glycans preferably by exo- or endoglycosidases and/or chemical modification or
b) indirect analysis by analysis of correlating factors for the structural motives for such as mRNA-expression levels of glycosyltransferases or enzymes producing sugar donor molecules for glycosyltransferases.
The direct analyses are preferred as they are in general more effective and usually more quantitative methods, which can be combined to glycome analysis.
In a preferred embodiment the invention is directed to combination of analysis of structural motifs and glycome analysis.
1. Release of total glycans or total glycan groups from a tissue sample
2. Purification of the glycan fraction/fractions from biological material of the sample, preferably by a small scale column array or an array of solid-phase extraction steps
3. Analysis of the composition of the released glycans, preferably by mass spectrometry
4. Analysis of structural motifs present in of the glycan mixture, and optionally their relative abundancies
5. Optionally, comparing data about the glycan structural motifs with data produced from another tissue sample
The steps 3 and 4 may be combined or performed in order first 4 and then 3.
More detailed preferred analysis method include following analysis steps:
1. Preparing a tissue sample containing glycans for the analysis
2. Release total glycans or total glycan groups from a tissue sample
3. Optionally modifying glycans or part of the glycans.
4. Purification of the glycan fraction/fractions from biological material and reagents of the sample by a small scale column array
5. Optionally modifying glycans and optionally purifying modified glycans
6. Analysis of the composition of the released glycans preferably by mass spectrometry using at least one mass spectrometric analysis method
7. a) Optionally presenting the data about released glycans quantitatively and
7. b) Comparing the quantitative data set with another data set from another tissue sample and/or alternatively to 7a) and 7b)
8. Comparing data about the released glycans quantitatively or qualitatively with data produced from another tissue sample
The present methods further allow the possibility to use part of the non-modified material or material modified in step 3 or 5 for additional modification step or step and optionally purified after modification step or steps, optionally combining modified samples, and analysis of additionally modified samples, and comparing results from differentially modified samples.
As mentioned above, It is realized that many of the individual monosaccharide compositions in a given glycome further corresponds to several isomeric individual glycans. The present methods allow for generation of modified glycomes. This is of particular use when modifications are used to reveal such information about glycomes of interest that is not directly available from a glycan profile alone (or glycome profiles to compare). Modifications can include selective removal of particular monosaccharides bound to the glycome by a defined glycosidic bond, by degradation by specific exoglycosidases or selective chemical degradation steps such as e.g. periodic acid oxidation. Modifications can also be introduced by using selective glycosyltransferase reactions to label the free acceptor structures in glycomes and thereby introduction of a specific mass label to such structures that can act as acceptors for the given enzyme. In preferred embodiment several of such modifications steps are combined and used to glycomes to be compared to gain further insights of glycomes and to facilitate their comparison.
The present invention is specifically directed to quantitative presentation of glycome data.
The quantitative presentation means presenting quantitative signals of components of the glycome, preferably all major components of the glycome, as a two-dimensional presentation including preferably a single quantitative indicator presented together with component identifier.
The preferred two dimensional presentations includes tables and graphs presenting the two dimensional data. The preferred tables list quantitative indicators in connection with, preferably beside or under or above the component identifiers, most preferably beside the identifier because in this format the data comprising usually large number of component identifier—quantitation indicator pairs.
The quantitation indicator is a value indicating the relative abundance of the single glycome component with regard to other components of total glycome or subglycome. The quantitation indicator can be directly derived from quantitative experimental data, or experimental data corrected to be quantitative.
The quantitation indicator is preferably a normalized quantitation indicator. The normalized quantitation indicator is defined as the experimental value of a single experimental quantitation indicator divided by total sum of quantitation indicators multiplied by a constant quantitation factor.
Preferred quantitation factors includes integer numbers from 1-1000 0000 000, more preferably integer numbers 1, 10 or 100, and more preferably 1 or 100, most preferably 100.
The quantitation number one is preferred as commonly understandable portion from 1 concept and the most preferred quantitation factor 100 corresponds to common concept of percent values.
The quantitation indicators in tables are preferably rounded to correspond to practical accuracy of the measurements from which the values are derived from. Preferred rounding includes 2-5 meaningful accuracy numbers, more preferably 2-4 numbers and most preferably 2-3 numbers.
The preferred component indicators may be experimentally derived component indicators. Preferred components indicators in the context of mass spectrometric analysis includes mass numbers of the glycome components, monosaccharide or other chemical compositions of the components and abbreviation corresponding to thereof, names of the molecules preferably selected from the group: disruptive names and abbreviations; chemical names, abbreviations and codes; and molecular formulas including graphic representations of the formulas. It is further realized that molecular mass based component indicators may include multiple isomeric structures. The invention is in a preferred embodiment directed to practical analysis using molecular mass based component indicators. In more specific embodiment the invention is further directed to chemical or enzymatic modification methods or indirect methods according to the invention in order to resolve all or part of the isomeric components corresponding to a molecular mass based component indicators.
The present invention is directed to a method of accurately defining the molecular masses of glycans present in a sample, and assigning monosaccharide compositions to the detected glycan signals.
The Glycan signals according to the present invention are glycan components characterized by:
1° mass-to-charge ratio (m/z) of the detected glycan ion,
2° molecular mass of the detected glycan component, and/or
3° monosaccharide composition proposed for the glycan component.
The present invention is further directed to a method of describing mass spectrometric raw data of Glycan signals as two-dimensional tables of:
1° monosaccharide composition and
2° relative abundance,
which form the Glycan profiles according to the invention. Monosaccharide compositions are as described above. For obtaining relative abundance values for each Glycan signal, the raw data is recorded in such manner that the relative signal intensities of the glycan signals represent their relative molar proportions in the sample. Methods for relative quantitation in MALDI-TOF mass spectrometry of glycans are known in the art (Naven & Harvey, 1 gxx; Papac et al., 1996) and are described in the present invention. However, the relative signal intensities of each Glycan signal are preferably corrected by taking into account the potential artifacts caused by e.g. isotopic overlapping, alkali metal adduct overlapping, and other disturbances in the raw data, as described below.
By forming these Glycan profiles and using them instead of the raw data, analysis of the biological data carried by the Glycan profiles is improved, including for example the following operations:
1° identification of glycan signals present in the glycan profile,
2° comparison of glycan profiles obtained from different samples,
3° comparison of relative intensities of glycan signals within the glycan profile, and
4° organizing the glycan signals present in the glycan profile into subgroups or subprofiles.
Glycan signals and their associated signals may have overlapping isotope patterns. Overlapping of isotope patterns is corrected by calculating the experimental isotope patterns and subtracting overlapping isotope signals from the processed data.
Glycan signals may be associated with signals arising from multiple adduct ions in positive ion mode, e.g. different alkali metal adduct ions. Different Glycan signals may give rise to adduct ions with similar m/z ratios: as an example, the adduct ions [Hex+Na]+ and [dHex+K]+ have m/z ratios of 203.05 and 203.03, respectively. Overlapping of adduct ions is corrected by calculating the experimental alkali metal adduct ion ratios in the sample and using them to correct the relative intensities of those Glycan signals that have overlapping adduct ions in the experimental data. Preferably, the major adduct ion type is used for comparison of relative signal intensities of the Glycan signals, and the minor adduct ion types are removed from the processed data. The calculated proportions of minor adduct ion types are subtracted from the processed data.
Also in negative ion mode mass spectrometry, Glycan signals may be associated with signals arising from multiple adduct ions. Typically, this occurs with Glycan signals that correspond to multiple acidic group containing glycan structures. As an example, the adduct ions [NeuAc2−H+Na]− at m/z 621.2 and [NeuAc2−H+K]− at m/z 637.1, are associated with the Glycan signal [NeuAc2−H]− at m/z 599.2. These adduct ion signals are added to the Glycan signal and thereafter removed from the processed data. In cases where different Glycan signals and adduct ion signals overlap, this is corrected by calculating the experimental all-ali metal adduct ion ratios in the sample and using them to correct the relative intensities of those Glycan signals that have overlapping adduct ions in the experimental data.
Glycan signals may be associated with signals, e.g. elimination of water (loss of H2O), or lack of methyl ether or ester groups (effective loss of CH2), resulting in experimental m/z values 18 or 14 mass units smaller than the Glycan signal, respectively. These signals are not treated as individual Glycan signals, but are instead treated as associated signals and removed from the processed data.
Classification of Glycan Signals into Glycan Groups
According to the present invention, the Glycan signals are optionally organized into Glycan groups and Glycan group profiles based on analysis and classification of the assigned monosaccharide and modification compositions and the relative amounts of monosaccharide and modification units in the compositions, according to the classification rules described above:
To generate Glycan group profiles, the proportions of individual Glycan signals belonging to each Glycan group are summed. The proportion of each Glycan group of the total Glycan signals equals its prevalence in the Glycan profile. The Glycan group profiles of two or more samples can be compared. The Glycan group profiles can be further analyzed by arranging Glycan groups into subprofiles, and analyzing the relative proportions of different Glycan groups in the subprofiles. Similarly formed subprofiles of two or more samples can be compared.
The present invention is especially useful when low sample amounts are available. Practical cellular or tissue material may be available for example for diagnostic only in very small amounts.
The inventors found surprisingly that glycan fraction could be produced and analysed effectively from samples containing low amount of material, for example 100 000-1 000 000 cells or a cubic millimetre (microliter) of the cells.
The combination of very challenging biological samples and very low amounts of samples forms another challenge for the present analytic method. The yield of the purification process must be very high. The estimated yields of the glycan fractions of the analytical processes according to the present invention varies between about 50% and 99%. Combined with effective removal of the contaminating various biological materials even more effectively over the wide preferred mass ranges according to the present invention show the ultimate performance of the method according to the present invention.
The present invention is directed to a method of preparing an essentially unmodified glycan sample for analysis from the glycans present in a given sample.
A preferred glycan preparation process consists of the following steps:
1° isolating a glycan-containing fraction from the sample,
2° . . . Optionally purification the fraction to useful purity for glycome analysis
The preferred isolation method is chosen according to the desired glycan fraction to be analyzed. The isolation method may be either one or a combination of the following methods, or other fractionation methods that yield fractions of the original sample:
1° extraction with water or other hydrophilic solvent, yielding water-soluble glycans or glycoconjugates such as free oligosaccharides or glycopeptides,
2° extraction with hydrophobic solvent, yielding hydrophilic glycoconjugates such as glycolipids,
3° N-glycosidase treatment, especially Flavobacterium meningosepticum N-glycosidase F treatment, yielding N-glycans,
4° alkaline treatment, such as mild (e.g. 0.1 M) sodium hydroxide or concentrated ammonia treatment, either with or without a reductive agent such as borohydride, in the former case in the presence of a protecting agent such as carbonate, yielding β-elimination products such as O-glycans and/or other elimination products such as N-glycans,
5° endoglycosidase treatment, such as endo-β-galactosidase treatment, especially Escherichia freundii endo-β-galactosidase treatment, yielding fragments from poly-N-acetyllactosamine glycan chains, or similar products according to the enzyme specificity, and/or
6° protease treatment, such as broad-range or specific protease treatment, especially trypsin treatment, yielding proteolytic fragments such as glycopeptides.
The released glycans are optionally divided into sialylated and non-sialylated subfractions and analyzed separately. According to the present invention, this is preferred for improved detection of neutral glycan components, especially when they are rare in the sample to be analyzed, and/or the amount or quality of the sample is low. Preferably, this glycan fractionation is accomplished by graphite chromatography.
According to the present invention, sialylated glycans are optionally modified in such manner that they are isolated together with the non-sialylated glycan fraction in the non-sialylated glycan specific isolation procedure described above, resulting in improved detection simultaneously to both non-sialylated and sialylated glycan components. Preferably, the modification is done before the non-sialylated glycan specific isolation procedure. Preferred modification processes include neuraminidase treatment and derivatization of the sialic acid carboxyl group, while preferred derivatization processes include amidation and esterification of the carboxyl group.
The preferred glycan release methods include, but are not limited to, the following methods:
Free glycans—extraction of free glycans with for example water or suitable water-solvent mixtures.
Protein-linlced glycans including O- and N-linked glycans-alkaline elimination of protein-linked glycans, optionally with subsequent reduction of the liberated glycans.
Mucin-type and other Ser/Thr O-linked glycans-alkaline β-elimination of glycans, optionally with subsequent reduction of the liberated glycans.
N-glycans—enzymatic liberation, optionally with N-glycosidase enzymes including for example N-glycosidase F from C. meningosepticum, Endoglycosidase H from Streptomyces, or N-glycosidase A from almonds.
Lipid-linked glycans including glycosphingolipids-enzymatic liberation with endoglycoceramidase enzyme; chemical liberation; ozonolytic liberation.
Glycosaminoglycans—treatment with endo-glycosidase cleaving glycosaminoglycans such as chondroinases, chondroitin lyases, hyalurondases, heparanases, heparatinases, or keratanases/endo-beta-galactosidases; or use of O-glycan release methods for O-glycosidic Glycosaminoglycans; or N-glycan release methods for N-glycosidic glycosaminoglycans or use of enzymes cleaving specific glycosaminoglycan core structures; or specific chemical nitrous acid cleavage methods especially for amine/N-sulphate comprising glycosaminoglycans
Glycan fragments—specific exo- or endoglycosidase enzymes including for example keratanase, endo-β-galactosidase, hyaluronidase, sialidase, or other exo- and endoglycosidase enzyme; chemical cleavage methods; physical methods
The invention describes special purification methods for glycan mixtures from tissue samples. Previous glycan sample purification methods have required large amounts of material and involved often numerous chromatographic steps and even purification of specific proteins. It is known that protein glycosylation varies protein specifically and single protein specific data can thus not indicate the total tissue level glycosylation. Purification of single protein is a totally different task than purifying the glycan fraction according to the present invention.
When the purification starts from a tissue or cells, the old processes of prior art involve often laborious homogenisation steps affecting the quality of the material produced. The present purification directly from a biological sample such as cell or tissue material, involves only a few steps and allows quick purification directly from the biological material to analysis preferably by mass spectrometry.
Purification from Cellular Materials of Cells and/or Tissues
The cellular material contains various membranes, small metabolites, various ionic materials, lipids, peptides, proteins etc. All of the materials can prevent glycan analysis by mass spectrometry if these cannot be separated from the glycan fraction. Moreover, for example peptide or lipid materials may give rise to mass spectrometric signals within the preferred mass range within which glycans are analysed. Many mass spectrometric methods, including preferred MALDI-mass spectrometry for free glycan fractions, are more sensitive for peptides than glycans. With the MALDI method peptides in the sample may be analysed with approximately 1000-fold higher sensitivity in comparison to methods for glycans. Therefore the method according to the present invention should be able to remove for example potential peptide contaminations from free glycan fractions most effectively. The method should remove essential peptide contaminations from the whole preferred mass range to be analysed.
The inventors discovered that the simple purification methods would separate released glycans from all possible cell materials so that
1) The sample is technically suitable for mass spectrometric analysis.
This includes two major properties,
a) the samples is soluble for preparation of mass spectrometry sample and
b) does not have negative interactions with chemicals involved in the mass spectrometric method, preferably the sample dries or crystallizes properly with matrix chemical used in MALDI-TOF mass spectrometry
When using MALDI-technologies, the sample does not dry or crystallize properly if the sample contains harmful impurity material in a significant amount.
2) The purity allows production of mass spectrum of suitable quality.
a) The sample has so low level impurities that it gives mass spectrometric signals. Especially when using MALDI-TOF mass spectrometry, signals can be suppressed by background so that multiple components/peaks cannot be obtained.
b) the sample is purified so that there is no major impurity signals in the preferred mass ranges to be measured.
Preferably the present invention is directed to analysis of unusually small sample amounts. This provides a clear benefit over prior art, when there is small amount of sample available from a small region of diseased tissue or diagnostic sample such as tissue slice produced for microscopy or biopsy sample. Methods to achieve such purity (purity being a requirement for the sensitivity needed for such small sample amounts) from tissue or cell samples (or any other complex biological matrices e.g. serum, saliva) has not been described in the prior art.
In a preferred embodiment the method includes use of non-derived glycans and avoiding general derived glycans. There are methods of producing glycan profiles including modification of all hydroxyl groups in the sample such as permethylation. Such processes require large sample amounts and produces chemical artifacts such as undermethylated molecules lowering the effectively of the method. These artifact peaks cover all minor signals in the spectra, and they can be misinterpreted as glycan structures. It is of importance to note that in glycome analyses the important profile-to profile differences often reside in the minor signals. In a specific embodiment the present invention is directed to site specific modification of the glycans with effective chemical or enzyme reaction, preferably a quantitative reaction.
Preferred analytical technologies for glycome analysis
The present invention is specifically directed to quantitative mass spectrometric methods for the analysis of glycomes. Most preferred mass spectrometric methods are MALDI-TOF mass spectrometry methods.
The inventors were able to optimise MALDI-TOF mass spectrometry for glycome analysis. The preferred mass spectrometric analysis process is MALDI-TOF mass spectrometry, where the relative signal intensities of the unmodified glycan signals represent their relative molar proportions in the sample, allowing relative quantification of both neutral (Naven & Harvey, 19xx) and sialylated (Papac et al., 1996) glycan signals. Preferred experimental conditions according to the present invention are described under Experimental procedures of Examples listed below.
For MALDI-TOF mass spectrometry of unmodified glycans in positive ion mode, optimal mass spectrometric data recording range according to the present invention is over m/z 200, more preferentially between m/z 200-10000, or even more preferably between m/z 200-4000 for improved data quality. In the most preferred form according to the present invention, the data is recorded between m/z 700-4000 for accurate relative quantification of glycan signals.
For MALDI-TOF mass spectrometry of unmodified glycans in negative ion mode, optimal mass spectrometric data recording range according to the present invention is over m/z 300, more preferentially between m/z 300-10000, or even more preferably between m/z 300-4000 for improved data quality. In the most preferred forms according to the present invention, the data is recorded between m/z 700-4000 or most preferably between m/z 800-4000 for accurate relative quantification of glycan signals.
Practical mz-Ranges
The practical ranges comprising most of the important signals, as observed by the present invention may be more limited than these. Preferred practical ranges includes lower limit of about m/z 400, more preferably about m/z 500, and even more preferably about m/z 600, and most preferably m/z about 700 and upper limits of about m/z 4000, more preferably m/z about 3500 (especially for negative ion mode), even more preferably m/z about 3000 (especially for negative ion mode), and in particular at least about 2500 (negative or positive ion mode) and for positive ion mode to about m/z 2000 (for positive ion mode analysis). The preferred range depends on the sizes of the sample glycans, samples with high branching or polysaccharide content or high sialylation levels are preferably analysed in ranges containing higher upper limits as described for negative ion mode. The limits are preferably combined to form ranges of maximum and minimum sizes or lowest lower limit with lowest higher limit, and the other limits analogously in order of increasing size
The inventors were able to show effective quantitative analysis in both negative and positive mode mass spectrometry.
The inventors developed optimised sample handling process for preparation of the samples for MALDI-TOF mass spectrometry.
The glycan purification method according to the present invention consists of at least one of purification options, preferably in specific combinations described below, including the following purification options:
3) Hydrophobic interaction;
4) Hydrophilic interaction; and
5) Affinity to graphitized carbon.
1) Precipitation-extraction may include precipitation of glycans or precipitation of contaminants away from the glycans. Preferred precipitation methods include:
1. Glycan material precipitation, for example acetone precipitation of glycoproteins, oligosaccharides, glycopeptides, and glycans in aqueous acetone, preferentially ice-cold over 80% (v/v) aqueous acetone; optionally combined with extraction of glycans from the precipitate, and/or extraction of contaminating materials from the precipitate;
2. Protein precipitation, for example by organic solvents or trichloroacetic acid, optionally combined with extraction of glycans from the precipitate, and/or extraction of contaminating materials from the precipitate;
3. Precipitation of contaminating materials, for example precipitation with trichloroacetic acid or organic solvents such as aqueous methanol, preferentially about ⅔ aqueous methanol for selective precipitation of proteins and other non-soluble materials while leaving glycans in solution;
2) Ion-exchange may include ion-exchange purification or enrichment of glycans or removal of contaminants away from the glycans. Preferred ion-exchange methods include:
1. Cation exchange, preferably for removal of contaminants such as salts, polypeptides, or other cationizable molecules from the glycans; and
2. Anion exchange, preferably either for enrichment of acidic glycans such as sialylated glycans or removal of charged contaminants from neutral glycans, and also preferably for separation of acidic and neutral glycans into different fractions.
3) Hydrophilic interaction may include purification or enrichment of glycans due to their hydrophilicity or specific adsorption to hydrophilic materials, or removal of contaminants such as salts away from the glycans. Preferred hydrophilic interaction methods include:
1. Hydrophilic interaction chromatography, preferably for purification or enrichment of glycans and/or glycopeptides;
2. Adsorption of glycans to cellulose in hydrophobic solvents for their purification or enrichment, preferably to microcrystalline cellulose, and even more preferably using an n-butanol:methanol:water or similar solvent system for adsorption and washing the adsorbed glycans, in most preferred system n-butanol:methanol:water in relative volumes of 10:1:2, and water or water:ethanol or similar solvent system for elution of purified glycans from cellulose.
4) Affinity to graphitized carbon may include purification or enrichment of glycans due to their affinity or specific adsorption to graphitized carbon, or removal of contaminants away from the glycans. Preferred graphitized carbon affinity methods include porous graphitized carbon chromatography.
Preferred purification methods according to the invention include combinations of one or more purification options. Examples of the most preferred combinations include the following combinations:
1) For neutral underivatized glycan purification: 1. cation exchange of contaminants, 2. hydrophobic adsorption of contaminants, and 3. graphitized carbon affinity purification of glycans.
1) For sialylated underivatized glycan purification: 1. cation exchange of contaminants, 2. hydrophobic adsorption of contaminants, 3. adsorption of glycans to cellulose, and 4. graphitized carbon affinity purification of glycans.
The present invention is directed to analysis of released glycomes by spectrometric method useful for characterization of the glycomes. The invention is directed to NMR spectroscopic analysis of the mixtures of released glycans. The inventors showed that it is possible to produce a released glycome from tissue samples in large scale enough and useful purity for NMR-analysis of the glycome.
In a preferred embodiment the NMR-analysis of the tissue glycome is one dimensional proton NMR-analysis showing structural reporter groups of the major components in the glycome. The present invention is further directed to combination of the mass spectrometric and NMR analysis of small scale tissue samples.
The inventors further realized major glycome differences between samples from the same species. The invention is specifically directed to analysis of individual differences between animals. The invention is further directed to the use of the information in breeding of animals, especially production animals.
The inventors further realized major glycome differences between samples from animals related to the status of the animal. The invention is especially directed to the analysis of biological status related changes of animal.
The inventors further noticed major species specific differences in the total released glycomes analysed. It is realized that species specific glycome differences are useful for analysis of effects of glycosylations when animal materials from different species are in contact with each other.
In a preferred embodiment the glycosylation is analysed when one animal is consumed as food or feed of another and the analysis is directed to potential allergic or immunogenic effects in the animal consuming the other animal. Preferably the invention is directed to the use of analysis of animal derived human feeds and feeds derived from other animals.
In another embodiment the invention is directed to analysis the invention is directed to situations when one animal species is exposed to material from another animal in air, especially in context of allergy inducing air mediated animal contacts.
The invention revealed that glycome oligosaccharide mixtures can be produced effectively from eukaryotic species especially animal tissues. Plant and insect differentiated cells are separately preferred eukaryotic materials.
Preferred animals include vertebrate animals, more preferably mammals, more preferably domestic or farm animals or human, analysis of human samples is most preferred. The preference of animals is based on similarity of sample compositions and availability animal materials and presence of individual, species and status related changes.
Most preferred domestic or farm animals includes pets such as cat, dog, pet rodents (such as mouse, hamster, rat) and production/farm animals such as animals selected from the group: pig, ruminants (especially ones producing milk such as cow, buffalo); avian production animals (such as hen (chicken), turkey, duck), and horse.
The invention is especially directed to analysis of specific tissues of animal in context of breeding of animals especially production animals, horse and cats or dogs. The invention is especially directed to analysis of specific tissues of animal in context of breeding of especially production animals, and major pets under extensive breeding preferably cats or dogs.
The invention is further directed to analysis of species specific differences between the preferred domestic or farm animals in two preferred contexts either in context when the animal is context of food contact with human or in context of air contact with human. The preferred animal in food contact are major production/farm animals, which are also preferred in air contact with animals as well as major pets cats and dogs.
The invention is further directed to analysis of domestic and farm animals in context of the status of the animal. It is realized that it is useful to analyze of status of the cells, when the health or physiological status of the animal is needed to revealed.
The invention is in a preferred embodiment directed to analysis of human type primates such as monkeys especially apes (examples include chimpanzee, pygmy chimpanzee, gorilla, orangutan) and human, the preference is based on close similarity of primates and human on genetic and cell biological level, providing similarity for samples to be analysed and scientifically important evolution based glycosylation changes between similar species.
The invention is further directed to analysis of animals useful for development of pharmaceutical and therapeutic materials. The preferred animals include rodents (such as mouse, hamster, rat) and human type primates. It is further preferred to analyze these animals in context of air mediated contact with human or other animals.
The invention is further directed to animals involved in sports, especially horses and dogs. It is realized that development of animals in sports involves especially analysis of individual and animal status related changes. It is further preferred to analyze these animals in context of air mediated contact with human or other animals.
The present invention refers as “tissue materials” all preferred target tissue related material including for example tissues, secretions and cultivated differentiated cells
The present invention is preferably directed to specific tissue types for the analysis according to the invention. The tissue type are found to be very suitable and feasible for the analysis according to the invention. The analysis is especially directed to analysis of
1) tissues of gastrointestinal track, preferably mouth, larynx, stomach, large and small intestine
2) internal organs such as ovarian tissue, liver, lungs, or kidney
3) tissues of circulatory system, especially blood
4) cultivated cell line models of the differentiated tissues
The present invention is preferably directed to specific parts of tissue for the analysis according to the invention. The inventors realized that it is possible perform glycomics analysis of specific parts of tissues and reveal differences useful for studies of diseases and disease induced changes and other changes or presence of receptor structures on specific subtissues. Preferred subtissues includes
1) tissues surfaces, especially epithelia of gastrointestinal tract and cell surfaces and
2) components of circulatory system, preferably serum/plasma, and blood cells, especially red cells and white blood cells
The invention is further directed to material produced by tissues.
Preferably the invention is directed to the analysis of secretions of tissues, preferably liquid secretions of tissues, preferably milk, saliva or urine. It is realized that liquid secretions form a specific group of tissue derived materials found especially useful for the glycome analysis methods according to the invention.
Milk is especially preferred as a food material consumed by animals and human and analysis with regard to each of individual specific, animal status specific and species specific differences.
The invention is under separate preferred embodiment directed to the analysis of specific conjugated glycomes such as protein or lipid derived glycomes, from the secretions and in another preferred embodiment free soluble glycomes of the secretions.
Soluble Glycome Materials: Tissue and/or Secretion Materials, Especially with High Protein Content
The invention is in a preferred embodiment directed to specific methods developed for the analysis of soluble glycome material from tissues and secretions. This group includes background for purification different from solid tissue and cell derived materials. The group includes tissue solutions such as blood serum/plasma and liquid secretions such as milk, saliva and urine.
The invention is further directed to the soluble glycome materials with high protein content including preferably milk and serum/plasma. The materials are especially directed as specific target of technologies directed to analysis of high protein content solutions, in separate preferred embodiments the technologies are directed to analysis human serum or bovine milk.
The invention is directed to enriched milk derive oligosaccharide or glycopeptides fractions including N-glycan, O-glycan, glycolipid oligosaccharide fraction produced by methods according to the invention preferably method involving graphitized carbon chromatography and optionally another purification step according to the invention, preferably for the quantitative MALDI-TOF analysis methods and other preferred methods according to the invention. Preferred milks are human milk and animal milks such as ruminant milk (most preferably bovine or buffalo). The invention is especially directed to novel human milk oligosaccharide compositions and analysis and further fractionation of these by mass spectrometry, dionex chromatography, glycosidases such as β-galactosidase or α-fucosidase and methods for comparing milk products or composition with regard to the presence of the new compositions and components thereof.
The present invention is specifically directed to glycome analysis of milks form human and other animals, preferably from ruminant animals such as bovine, buffalo, sheep, goat, and camel, the most common milk production animals bovine and buffalo are preferred. Most preferably the common bovine milk is analysed.
The invention is specifically directed to analysis of colostrums and regular milks of ruminant milks. The invention represent novel total glycomics analysis methods for both secreted and conjugated glycomes. The invention is further directed to glycome analysis according to the invention to food production fractions and specific milk products of ruminant milks.
The invention is especially directed to novel total glycomics analysis methods for both secreted and conjugated glycomes from bovine milks. The invention specifically represents novel glycomes released from proteins of bovine milks.
The invention is further directed to glycomes released from proteins from regular milk and in a separate embodiment to glycomes released from proteins of bovine colostrums. The invention is further directed to glycome analysis according to the invention to food production fractions and specific milk products of bovine milk such whey, low fat milk, or buttermilk.
The invention is specifically directed to total glycomics analysis methods for secreted glycomes from human milk such as protein derived glycomes and/or free oligosaccharide glycomes.
Novel compositions were revealed from human milk oligosaccharides by the inventors, as described in Example 8 and Example 24. The invention specifically represents novel glycomes comprising human milk oligosaccharides with molecular weight of neutral and sialylated oligosaccharides up to 3700 Da and 2600 Da, respectively, with structures (GalmGlcNAcmFucn)Galβ1-4Glc, where n≦m and 1≦m≦8, when n>0, and m≦9 when n=0; and (GalmGlcNAcmFucnNeuAco)Galβ1-4Glc, where n≦m and o≦m and 1≦m≦4 when either n>0 or o<1, and m≦5 when n=0 and o=1. The present invention is further directed to practical enrichment and/or use of oligosaccharide fractions containing such components, preferably comprising the revealed high-molecular weight components. The present invention is further specifically directed to analysis and/or enrichment and/or use of oligosaccharide fractions comprising Galβ1-4(Fucα1-3)GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc, in one embodiment also comprising neutral and/or sialylated oligosaccharides according to the structures described above, more preferably enriched with fucosylated heptasaccharides with structures (Gal2GlcNAc2Fucα)Galβ1-4Glc, preferably including other branched and/or linear monofucosylated hexasaccharides such type II N-acetyllactosamine structures selected from the group {Galβ4GlcNAcβ3[Galβ4(Fucα3)GlcNAcβ6]Galβ4Glc,Galβ4(Fucα3)GlcNAcβ3Galβ4GlcNA c33Galβ4Glc,Galβ4GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4Glc, Galβ4GlcNAcβ3Galβ4GlcNAc3Galβ4(Fucα3)Glc} and type I structures from the group {Galβ3GlcNAcβ3[Galβ4(Fucα3)GlcNAcβ6]Galβ4Glc,Gal3 (Fucα4)GlcNAcβ3Galβ4GlcNA cβ3Galβ4Glc,Galβ3GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4Glc, Galβ3GlcNAcβ3Galβ4(Fucα3)GlcNAcβ3Galβ4Glc, Galβ3GlcNAcβ3Galβ4GlcNAcβ3Galβ4(Fucα3)Glc} and/or isomeric Fucα2-structures on terminal non-reducing end Galβ of I) either branch of Galβ4GlcNAcβ3[Galβ4GlcNAcβ6]Galβ4Glc or Galβ3GlcNAcβ3[Galβ4GlcNAcβ6]Galβ4Glc or II) at terminal Gal of linear oligosaccharides Galβ4GlcNAcβ3Galβ4GlcNAcβ3Galβ4Glc, and Galβ3GlcNAcβ3Galβ4GlcNAcβ33Galβ4Glc; most preferably as described in Example 8. Such compositions are preferably isolated from milk, more preferably from human milk.
It is realized that the novel heptasaccharide was characterized for the first time from any natural material derived or produced by cells, preferably by eukaryotic cells, the preferred milk is animal milk, more preferably mammalian milk, most preferably human milk. It is realized that natural cells such as eukaryotic cells and corresponding artificial cellular systems can produce the novel human milk oligosaccharide. The invention is directed to the novel human milk oligosaccharide produced by a cell comprising corresponding nucleotide sugar synthesis, and transport and glycosyltransferases needed for the human milk oligosaccharide backbone synthesis and at least one α3-fucosyltransferase, preferably fucosyltransferase III, IV, V or VI, VII or IX, more preferably III, IV, V or VI, or IX, and even more preferably effective α3-fucosylating transferase IV, VI, or IX and most preferably a transferase specific for synthesis of non-sialylated Lewis x structure: IV or IX. The invention is specifically directed to the artificial cellular glycan expression system producing the novel oligosaccharide. The preferred artificial eukaryotic cell systems for the production of human milk oligosaccharides includes transgenic dairy animals (e.g. animals developed by Abbott, producing such oligosaccharides or in another embodiment microorganism cells producing such oligosaccharides.
The invention is directed to novel oligosaccharide fraction comprising Galβ1-4(Fucα1-3)GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc, wherein the oligosaccharide is derived from natural or artificial cellular or tissue material or tissue secretion. The invention is further directed to oligosaccharide or oligosaccharide fraction comprising Galβ1-4(Fucα1-3)GlcNAcβ1-3 (Galβ1-4GlcNAcβ1-6)Galβ1-4Glc, wherein the oligosaccharide fraction is enriched from milk.
The novel glycan structure is preferred for use in analysis of mammalian or human secretions or corresponding artificial cell expression systems, preferably human milk, more preferably as a characteristic component for human milk from person lacking either secretor gene and at least part of terminal Fucα2-structures and/or Lewis blood Fucα4-structures (lack of functional fucosylatransferase genes FUTIII and/or FUTV or activity of the corresponding enzymes). The invention is further directed to analysis of the glycan from a novel glycome composition or fraction thereof derived from a secretion, when the novel glycome comprises the novel milk oligosaccharide Galβ4(Fucα3)GlcNAcβ3(Galβ4GlcNAcβ6)Galβ4Glc in amount of at least 0.1%, more preferably at least 1%, even more preferably at least 5%, even more preferably at least 10% and even more preferably at least 20% of the total monofucosylated heptasaccharides as described above. The invention is also directed to the oligosaccharide fraction wherein the oligosaccharide is used for recognition of specific fucosylation status of the biological material.
Subcomponents of Glycomes, Especially from Secreted Proteins Such as Milks
The invention is further directed to methods for selecting specific components of glycomes and searching enriched fractions such as specific protein fraction comprising the specific glycome components.
As a specific example and embodiment of a purified glycome component the invention is directed to protein linked glycome, referred as mannose protein or mannose protein linked glycans isolated from ruminant milk, preferably bovine milk. It is realized that most secreted glycoproteins carry complex type glycans with non-reducing end terminal sialic acid, galactose, or fucose residues and proteins carrying exclusively non-reducing end terminal Man are quite rare.
The invention is especially directed to bovine protein, in a preferred embodiment bovine milk protein, preferably lactoferrin carrying almost exclusively mannose glycans, preferably carrying more than 70% of glycans as mannose glycans, more preferably more than 85%, even more preferably more than 95%, even more preferably more than 98% of Mannose glycans. It was further revealed that the lactoferrin is expressed only in certain milk batches. The invention is directed to method of analysing presence of mannose glycans and/or specific mannose proteins from multiple strains or individuals of mammals, preferably ruminants and preferably use of the strain or individual with high levels of the glycans or glycoproteins a) for production of raw material for isolation of mannose protein or mannose glycans derived thereof and/or b) breeding of a mammal or mammals producing more of producing more mannose glycans and/or proteins. In a preferred embodiment the invention is directed to analysis of mannose glycans of total proteins from animal tissue, preferably mammalian tissue, more preferably total proteins milk. The invention is further directed to use of a mammal with high expression level for production of mannose protein or glycans and/or for breeding,
The present invention is further directed to analysis of milks to reveal specific animals and conditions for effective production of mannose proteins, especially mannose lactoferrin. The invention is further directed to single step chromatographic purification of the mannose lactoferrin.
The inventors found efficient methods for producing mannose-enriched free protein glycan and/or glycopeptide fractions from milk, and specific analysis of these fractions to reveal their mannose glycan content, as described in Example 9. The present invention is specifically directed to production and/or analysis of enriched milk derived free protein glycans or glycopeptides. The invention is further directed to milk derived glycome compositions enriched with mannose glycopeptides and/or free protein glycans.
The invention is further directed to the production of mannose glycans from enriched mannose proteins by release of a) free glycans chemically or enzymatically b) release of glycopeptides by protease treatment. The invention is further directed to enrichment and/or purification of the glycopeptide fraction by gel filtration.
In a preferred embodiment the invention is directed to special methods for the analysis of the surfaces of tissues.
The preferred tissue surfaces includes
1) epithelia of the preferred gastrointestinal tract tissues and
2) surfaces of cells according to cells on surface of tissues or separable homogeneously from tissue, such as blood cells and
3) surfaces of cultivated cells which may be used as models for differentiated tissues.
In a preferred embodiment the invention is directed to non-derivatized released cell surface glycomes. The non-derivatized released cell surface glycomes correspond more exactly to the fractions of glycomes that are localized on the cell surfaces, and thus available for biological interactions. These cell surface localized glycans are of especial importance due to their availability for biological interactions as well as targets for reagents (e.g. antibodies, lectins etc. . . . ) targeting the cells or tissues of interest. The invention is further directed to release of the cell surface glycomes, preferably from intact cells by hydrolytic enzymes such as proteolytic enzymes, including proteinases and proteases, and/or glycan releasing enzymes, including endo-glycosidases or protein N-glycosidases. Preferably the surface glycoproteins are cleaved by proteinase such as trypsin and then glycans are analysed as glycopeptides or preferably released further by glycan releasing enzyme.
The invention is further directed to cultured cells corresponding to differentiated cells. Such cells may be used as models for differentiated cells. The differentiated cells include differentiated cell models of cancer.
Stably growing differentiated cultured cell lines are also used in production mammalian proteins and for other biotechnical production for example in cell therapies.
It is realized that the differentiated cells would need to be controlled with regard to cells status and individual cell line specific differences. It is realized that cell status would need to be checked with regard to numerous factors related to cell status. It is further realized that there is differences between individual cell lines when these are derived from different animal individuals.
In case the differentiated cells would be used in context of contact with other animals from different species than from which the cell line is derived there is need for controlling species specific differences.
It is especially realized that it would be useful to check status of differentiated cells used in production of biotechnical products, such as recombinant therapeutic proteins such as antibodies, growth factors and recombinant receptors including recombinant TNF-alpha receptors. The invention is especially directed to the analysis of differentiated cell lines producing recombinant proteins.
The invention is further directed to the compositions and compositions produced by the methods according to the invention. The invention further represent preferred methods for analysis of the glycomes, especially mass spectrometric methods.
The invention is specifically directed to released glycomes derived conjugated glycans from preferred tissue materials and cell models of differentiated tissues.
The invention represents effective methods for purification of oligosaccharide fractions from tissues, especially in very low scale. The prior art has shown analysis of separate glycome components from tissues, but not total glycomes. It is further realized that the methods according to the invention are useful for analysis of glycans from isolated proteins or peptides.
The invention is further directed to novel quantitative analysis methods for glycomes. The glycome analysis produces large amounts of data. The invention reveals methods for the analysis of such data quantitatively and comparison of the data between different samples. The invention is especially directed to quantitative two-dimensional representation of the data.
The invention is further directed to integrated glycomics or glycome analysis process including
1) Optional release of glycans from tissues
2) isolation/purification of glycans from sample,
3) analysis of the glycome
4) quantitative presentation of the data
The first step is optional as the method is further directed to analysis of known and novel secretion derivable soluble glycomes.
The invention represents effective methods for the practical analysis of glycans from isolate proteins especially from very small amounts of samples. The invention is especially directed to the application of the methods for the analysis of proteins using the purification method, analysis methods and/or integrated glycome analysis.
The present invention is specifically directed to the glycan fraction produced according to the present invention from the pico scale tissue material sample according to the present invention. The preferred glycan fraction is essentially devoid of signals of contaminating molecules within the preferred mass range when analysed by MALDI mass spectrometry according to the present invention.
Preferred Uses of Glycomes and Analysis Thereof with Regard to Status of Cells
In the present invention the word cell refer to cells of tissue material according to the invention, especially cultivated differentiated cells.
The present invention is specifically directed to the glycan fraction produced according to the present invention from the pico scale tissue material sample according to the present invention. The preferred glycan fraction is essentially devoid of signals of contaminating molecules within the preferred mass range when analysed by MALDI mass spectrometry according to the present invention.
The glycome products from tissue samples according to present invention are produced preferably directly from complete tissue material cells or membrane fractions thereof, more preferably directly from intact cells as effectively shown in examples. In another preferred embodiment the glycome fractions are cell surface glycomes and produced directly from surfaces of complete tissue material cells, preferably intact or essentially intact cells of tissue materials or surfaces of intact tissues according to the invention. In another embodiment the glycome products according to the invention are produced directly from membrane fraction
Preferred Uses of Glycomes and Analysis Thereof with Regard to Status of Cells
It is further realized that the analysis of glycome is useful for search of most effectively altering glycan structures in the tissue materials for analysis by other methods.
The glycome component identified by glycome analysis according to the invention can be further analysed/verified by known methods such as chemical and/or glycosidase enzymatic degradation(s) and further mass spectrometric analysis and by fragmentation mass spectrometry, the glycan component can be produced in larger scale by know chromatographic methods and structure can be verified by NMR-spectroscopy.
The other methods would preferably include binding assay using specific labelled carbohydrate binding agents including especially carbohydrate binding proteins (lectins, antibodies, enzymes and engineered proteins with carbohydrate binding activity) and other chemicals such as peptides or aptamers aimed for carbohydrate binding. It is realized that the novel marker structure can be used for analysis of cells, cell status and possible effects of contaminants to cell with similar indicative value as specific signals of the glycan mass components in glycome analysis by mass spectrometry according to the invention.
The invention is especially directed to search of novel carbohydrate marker structures from cell/tissue surfaces, preferably by using cell surface profiling methods. The cell surface carbohydrate marker structures would be further preferred for the analysis and/or sorting of cells.
Species specific, tissue specific, and individual specific differences in glycan structures are known. The difference between the origin of the cell material and the potential recipient of transplanted material may cause for example immunologic or allergic problems due to glycosylation differences. It is further noticed that culture of cells may cause changes in glycosylation. When considering human derived cell materials according to the present invention, individual specific differences in glycosylation are a potent source of harmful effects.
The present invention is directed to control of glycosylation of cell populations to be used in therapy.
The present invention is specifically directed to control of glycosylation of cell materials, preferably when
1) there is difference between the origin of the cell material and the potential recipient of transplanted material. In a preferred embodiment there are potential inter-individual specific differences between the donor of cell material and the recipient of the cell material. In a preferred embodiment the invention is directed to animal or human, more preferably human specific, individual person specific glycosylation differences. The individual specific differences are preferably present in tissue materials. The invention is preferably not directed to observation of known individual specific differences such as blood group antigens changes on erythrocytes.
2) There is possibility in variation due to disease specific variation in the materials. The present invention is specifically directed to search of glycosylation differences in the tissue materials according to the present invention associated with infectious disease, inflammatory disease, or malignant disease. Part of the inventors have analysed numerous types of early human cells and observed similar glycosylation types in certain cancers and tumors.
3) There is a possibility of specific inter-individual biological differences in the animals, preferably humans, from which the cell are derived for example in relation to species, strain, population, isolated population, or race specific differences in the cell materials.
Furthermore during long term cultivation of cells spontaneous mutations may be caused in cultivated cell materials. It is noted that mutations in cultivated cell lines often cause harmful defects on glycosylation level.
It is further noticed that cultivation of cells may cause changes in glycosylation. It is realized that minor changes in any parameter of cell cultivation including quality and concentrations of various biological, organic and inorganic molecules, any physical condition such as temperature, cell density, or level of mixing may cause difference in cell materials and glycosylation. The present invention is directed to monitoring glycosylation changes according to the present invention in order to observe change of cell status caused by any cell culture parameter affecting the cells.
The present invention is in a preferred embodiment directed to analysis of glycosylation changes when the density of cells is altered. The inventors noticed that this has a major impact of the glycosylation during cell culture.
In case there is heterogeneity in cell material this may cause observable changes or harmful effects in glycosylation.
Furthermore, the changes in carbohydrate structures, even non-harmful or functionally unknown, can be used to obtain information about the exact genetic status of the cells.
The present invention is specifically directed to the analysis of changes of glycosylation, preferably changes of sialylation according to the present invention in order to observe changes of cell status during cell cultivation.
The inventors further revealed conditions and reagents inducing harmful glycans to be expressed by cells with same associated problems as the contaminating glycans. The inventors found out that several reagents used in a regular cell purification process caused changes cells being purified.
It is realized, that the materials during cell handling may affect the glycosylation of cell materials. This may be based on the adhesion, adsorption, or metabolic accumulation of the structure in cells under processing.
In a preferred embodiment the cell handling reagents are tested with regard to the presence glycan component being antigenic or harmful structure such as cell surface NeuGc, Neu-O-Ac or mannose structure.
The inventors note effects of various effector molecules in cell culture on the glycans expressed by the cells if absorption or metabolic transfer of the carbohydrate structures have not been performed. The effectors typically mediate a signal to cell for example through binding a cell surface receptor.
The effector molecules include various cytokines, growth factors, and their signalling molecules and co-receptors. The effector molecules may be also carbohydrates or carbohydrate binding proteins such as lectins.
Controlled Cell Isolation/Purification and Culture Conditions to Avoid Contaminations with Harmful Glycans or Other Alteration in Glycome Level
It is realized that cell handling including isolation/purification, and handling in context of cell storage and cell culture processes are not natural conditions for cells and cause physical and chemical stress for cells. The present invention allows control of potential changes caused by the stress. The control may be combined by regular methods may be combined with regular checking of cell viability or the intactness of cell structures by other means.
Examples of Physical and/or Chemical Stress in Cell Handling Step
Washing and centrifuging cells cause physical stress which may break or harm cell membrane structures. Cell purifications and separations or analysis under non-physiological flow conditions also expose cells to certain non-physiological stress. Cell storage processes and cell preservation and handling at lower temperatures affects the membrane structure. All handling steps involving change of composition of media or other solution, especially washing solutions around the cells affect the cells for example by altered water and salt balance or by altering concentrations of other molecules effecting biochemical and physiological control of cells.
The inventors revealed that the method according to the invention is useful for observing changes in cell membranes which usually effectively alter at least part of the glycome observed according to the invention. It is realized that this related to exact organization and intact structures cell membranes and specific glycan structures being part of the organization.
The present invention is specifically directed to observation of total glycome and/or cell surface glycomes, these methods are further aimed for the use in the analysis of intactness of cells especially in context of stressful condition for the cells, especially when the cells are exposed to physical and/or chemical stress. It is realized that each new cell handling step and/or new condition for a cell handling step is useful to be controlled by the methods according to the invention. It is further realized that the analysis of glycome is useful for search of most effectively altering glycan structures for analysis by other methods such as binding by specific carbohydrate binding agents including especially carbohydrate binding proteins (lectins, antibodies, enzymes and engineered proteins with carbohydrate binding activity).
Controlled Cell Preparation (Isolation or Purification) with Regard to Reagents
The inventors analysed process steps of common cell preparation methods. Multiple sources of potential contamination by animal materials were discovered.
The present invention is specifically directed to carbohydrate analysis methods to control of cell preparation processes. The present invention is specifically directed to the process of controlling the potential contaminations with animal type glycans, preferably N-glycolylneuraminic acid at various steps of the process.
The invention is further directed to specific glycan controlled reagents to be used in cell isolation
The glycan-controlled reagents may be controlled on three levels:
1. Reagents controlled not to contain observable levels of harmful glycan structure, preferably N-glycolylneuraminic acid or structures related to it
2. Reagents controlled not to contain observable levels of glycan structures similar to the ones in the cell preparation
3. Reagent controlled not to contain observable levels of any glycan structures. The control levels 2 and 3 are useful especially when cell status is controlled by glycan analysis and/or profiling methods. In case reagents in cell preparation would contain the indicated glycan structures this would make the control more difficult or prevent it. It is further noticed that glycan structures may represent biological activity modifying the cell status.
The present invention is further directed to specific cell purification methods including glycan-controlled reagents.
It was realized that storage of the cell materials may cause harmful changes in glycosylation or changes in cell status observable by glycosylation analysis according to the present invention.
The inventors discovered that keeping the cells in lower temperatures alters the status of cells and this is observable by analysing the chemical structures of cells, preferably the glycosylation of the cells. The lower temperatures usually vary between 0-36 degrees of Celsius including for example incubator temperature below about 36 degrees of Celsius more preferably below 35 degrees of Celsius, various room temperatures, cold room and fridge temperatures typically between 2-10 degrees of Celsius, and temperatures from incubation on ice close to 0 degrees of Celsius typically between 0-4 degrees of Celsius. The lowered temperatures are typically needed for processing of cells or temporary storage of the preferred cells.
The present invention is specifically directed to analysis of the status of cells kept in low temperatures in comparison to natural body temperatures. In a preferred embodiment the control is performed after certain time has passed from process in lower temperature in order to confirm the recovery of the cells from the lower temperature. In another preferred embodiment the present invention is directed to development of lower temperature methods by controlling the chemical structures of cells, preferably by controlling glycosylation according to the present invention.
The inventors discovered that cryopreservation alters the status of cells and this observable analysing the chemical structures of cells, preferably the glycosylation of the cells. The present invention is specifically directed to analysis of the status of cryopreserved cells. In a preferred embodiment the control is performed after certain time has passed from preservation in order to confirm the recovery of the cells from the cryopreservation. In another preferred embodiment the present invention is directed to development of cryopreservetion methods by controlling the chemical structures of cells, preferably by controlling glycosylation according to the present invention.
Contaminations with Harmful Glycans Such as Antigenic Animal Type Glycans
Several glycans structures contaminating cell products may weaken the biological activity of the product.
The harmful glycans can affect the viability during handling of cells, or viability and/or desired bioactivity and/or safety in therapeutic use of cells.
The harmful glycan structures may reduce the in vitro or in vivo viability of the cells by causing or increasing binding of destructive lectins or antibodies to the cells. Such protein material may be included e.g. in protein preparations used in cell handling materials. Carbohydrate targeting lectins are also present on human tissues and cells, especially in blood and endothelial surfaces. Carbohydrate binding antibodies in human blood can activate complement and cause other immune responses in vivo. Furthermore immune defence lectins in blood or leukocytes may direct immune defence against unusual glycan structures.
Additionally harmful glycans may cause harmful aggregation of cells in vivo or in vitro. The glycans may cause unwanted changes in developmental status of cells by aggregation and/or changes in cell surface lectin mediated biological regulation.
Additional problems include allergenic nature of harmful glycans and misdirected targeting of cells by endothelial/cellular carbohydrate receptors in vivo.
Contaminations from Reagents
The present invention is specifically directed to control of the reagents used to prevent contamination by harmful glycan structures. The harmful glycan structures may originate from reagents used during cell handling processes such as cell preservation, cell preparation, and cell culture.
Preferred reagents to be controlled according to the present invention include cell blocking reagents, such as antibody receptor blocking reagents, washing solutions during cell processing, material blocking reagents, such as blocking reagents for materials like for example magnetic beads. Preferably the materials are controlled:
1. so that these would not contain a contaminating structure, or more specifically preferred glycan structure according to the invention
2. so that the materials contain very low amounts or do not contain any potentially harmful structures according to the invention.
Structural Features Derived from the Glycome Compositions
Novel Glycomes from Tissues
The present invention revealed that it is possible to profile tissues by released glycomes from tissues. The invention revealed novel glycan groups which haven't been observed in any similar composition derived from biological materials.
The novel glycan groups include
The invention is directed to glycome compositions, derived from tissue material according to the invention, wherein the glycome composition contain at least one of the preferred novel glycan groups in combination with other glycan groups; such as neutral glycans including high mannose glycan, GlcNAcβ2Man-glycans, complex type-glycans, hybrid type glycans acidic glycans; according to the invention obtainable from tissue materials according to the invention.
Most preferred novel glycan group degraded mannose glycan. The most preferred mannose glycans includes Mannose type glycans containing less than 6 mannose units including low mannose glycans, fucosylated low mannose (up to 4-5 mannose residues) or fucosylated high mannose structures (4-5 mannose residues) and soluble mannose-GlcNAc1-glycome Most preferred Mannose type glycan including, high- and low mannose type structures are according to the Formula Mn:
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6)]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNβ4([{Fucα6}]mGNyR2)z
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m, and z are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0;
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein;
[ ] and ( ) indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
{ } indicates a branch in the structure, with the provision that is 0 indicating soluble mannose-GlcNAc1-glycome or there is 5, more preferably 4 or less mannose residues or m is 1 and there is 6 or less mannose units.
The invention revealed individual glycan structures and structure groups, which are novel markers for the cell materials according to the invention. The present invention is directed to the use of the marker structures and their combinations for analysis, for labelling and for cell separation, as modification targets and for other methods according to invention.
The present invention revealed large groups of glycans, which can be derived from cells according to the invention. The present invention is especially directed to release of various protein or lipid linked oligosaccharide and/or polysaccharide chains as free glycan, glycan reducing end derivative or glycopeptide fractions referred as glycomes from the cell material according to the invention. The glycans can be released separately from differently linked glycan groups on proteins and or glycolipids or in combined process producing several isolated glycome fractions and/or combined glycome fractions, which comprise glycans released at least from two different glycomes. The relative amounts of various components/component groups observable in glycan profiling as peaks in mass spectra and in quantitative presentations of glycan based profiling information, especially in analysis of mass spectrometric and/or NMR-data were revealed to be characteristic for individual cell types. The glycomes was further revealed to contain glycan subgroups or subglycomes which are very useful for characterization of the cell materials according to the invention.
The invention revealed four major glycome types based on the linkage structures. Two protein linked glycomes are N-linked glycomes and O-linked glycomes. The majority of the glycosaminoglycan (gag) glycomes (gagomes) are also linked to certain proteins by specific core and linkage structures. The glycolipid glycome is linked to lipids, usually sphingolipids.
The invention has revealed specific glycan core structures for the specific subglycomes studied. The various structures in specific glycomes were observed to contain common reducing end core structures such as N-glycan and O-glycan, Glycosaminoglycan and glycolipid cores. The cores are elongated with varying glycan chains usually comprising groups of glycans with different chain length. The presence of a core structures is often observably as a characteristic monosaccharide composition as monosaccharide composition of the core structure causing different relation of monosaccharide residues in specific glycan signals of glycomes when profiled by mass spectrometry according to the invention. The present invention further revealed specific non-reducing end terminal structures of specific marker glycans. Part of the non-reducing end terminal structures are characteristic for several glycomes, for example N-acetylactosamine type terminal structures, including fucosylated and sialylated variants were revealed from complex N-glycans, O-glycan and Glycolipid glycomes. Part of the structures are specific for glycomes such terminal Man-structures in Low-mannose and High-mannose N-glycans.
The invention revealed similar structures on protein and lipid linked glycomes in the cell materials according to the invention. It was revealed that combined analysis of the different glycomes is useful characterization of specific cell materials according to the invention. The invention specifically revealed similar lactosamine type structures in glycolipid and glycoprotein linked glycomes.
The invention further revealed glycosaminoglycan glycome and glycome profile useful for the analysis of the cell status and certain synergistic characteristics glycosaminoglycan glycomes and other protein linked glycomes such as non-sialic acid containing acidic structures in N-liked glycomes. The biological roles of glycosaminoglycans and glycolipids in regulation of cell biology and their biosynthetic difference and distance revealed by glycome analysis make these a useful combination for analysis of cell status. It is further realized that combination of all glycomes including O-glycan and N-glycan glycomes, glycolipid glycome and glycosaminoglycan glycome are useful for analysis of cells according to the invention. The invention further revealed common chemical structural features in the all glycomes according invention supporting the effective combined production, purification and analysis of glycomes according to the invention.
In a preferred embodiment the invention is directed to combined analysis of following glycome combinations, more preferably the glycomes are analysed from same sample to obtain exact information about the status of the cell material:
The invention further revealed effective methods for the analysis of different glycomes. It was revealed that several methods developed for sample preparation are useful for both lipid and protein linked glycomes, in a preferred embodiment proteolytic treatment is used for both production of protein linked glycome and a lipid linked glycome, especially for production of cell surface glycomes. For production of Total cell glycomes according to the invention the extraction of glycolipids is preferably used for degradation of cells and protein fraction obtained from the lipid extraction is used for protein linked glycome analysis. The invention is further directed to the chemical release of glycans, preferably for simultaneous release of both O-linked and N-linked glycans. Glycolipid and other glycomes, especially N-linked glycome, can be effectively released enzymatically, the invention is directed to sequential release of glycans by enzymes, preferably including step of inactivating enzymes between the treatments and using glycan controlled enzymes to avoid contamination or controlling contamination of glycans originating from enzymes.
The present invention reveals useful glycan markers for tissue materials and combinations thereof and glycome compositions comprising specific amounts of key glycan structures. The invention is furthermore directed to specific terminal and core structures and to the combinations thereof.
The preferred glycome glycan structure(s) and/or glycomes from cells according to the invention comprise structure(s) according to the formula C0:
R1Hexβz{R3}n1Hex(NAc)n2XyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon,
z is linkage position 3 or 4, with the provision that when z is 4 then HexNAc is GlcNAc and then Hex is Man or Hex is Gal or Hex is GlcA, and
when z is 3 then Hex is GlcA or Gal and HexNAc is GlcNAc or GalNAc;
n1 is 0 or 1 indicating presence or absence of R3;
n2 is 0 or 1, indicating the presence or absence of NAc, with the proviso that n2 can be 0 only when Hexβz is Galβ4, and n2 is preferably 0, n2 structures are preferably derived from glycolipids;
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures or nothing;
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein, or natural serine or threonine linked O-glycoside derivative such as serine or threonine linked O-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein, or when n2 is 1 R2 is nothing or a ceramide structure or a derivative of a ceramide structure, such as lysolipid and amide derivatives thereof;
R3 is nothing or a branching structure representing a GlcNAcβ6 or an oligosaccharide with GlcNAcβ6 at its reducing end linked to GalNAc (when HexNAc is GalNAc); or when Hex is Gal and HexNAc is GlcNAc, and when z is 3 then R3 is Fucα4 or nothing, and when z is 4 R3 is Fucα3 or nothing.
The preferred disaccharide epitopes in the glycan structures and glycomes according to the invention include structures Galβ4GlcNAc, Manβ4GlcNAc, GlcAβ4GlcNAc, Galβ3GlcNAc, Galβ3GalNAc, GlcAβ3GlcNAc, GlcAβ3GalNAc, and Galβ4Glc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues and is in a separate embodiment branched from the reducing end residue. Preferred branched epitopes include Galβ4(Fucα3)GlcNAc, Galβ3(Fucα4)GlcNAc, and Galβ3(GlcNAcβ6)GalNAc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues.
The two N-acetyllactosamine epitopes Galβ4GlcNAc and/or Galβ3GlcNAc represent preferred terminal epitopes present on tissue materials or backbone structures of the preferred terminal epitopes for example further comprising sialic acid or fucose derivatizations according to the invention. In a preferred embodiment the invention is directed to fucosylated and/or non-substituted glycan non-reducing end forms of the terminal epitopes, more preferably to fucosylated and non-substituted forms. The invention is especially directed to non-reducing end terminal (non-substituted) natural Galβ4GlcNAc and/or Galβ3GlcNAc-structures from human tissue material glycomes. The invention is in a specific embodiment directed to non-reducing end terminal fucosylated natural Galβ4GlcNAc and/or Galβ3GlcNAc-structures from human tissue material glycomes.
The preferred fucosylated epitopes are according to the Formula TF:
(Fucα2)n1Galβ3/4(Fucα4/3)n2GlcNAcβ-R
n1 is 0 or 1 indicating presence or absence of Fucα2;
n2 is 0 or 1, indicating the presence or absence of Fucα4/3 (branch), and
R is the reducing end core structure of N-glycan, O-glycan and/or glycolipid.
The preferred structures thus include type 1 lactosamines (Galβ3GlcNAc based):
Galβ3(Fucα4)GlcNAc (Lewis a), Fucα2Galβ3GlcNAc H-type 1, structure and,
type 2 lactosamines (Galβ4GlcNAc based):
Galβ4(Fucα3)GlcNAc (Lewis x), Fucα2Galβ4GlcNAc H-type 2, structure and,
The type 2 lactosamines (fucosylated and/or terminal non-substituted) form an especially preferred group in context of tissue materials. Type 1 lactosamines (Galβ3GlcNAc-structures) are especially preferred in context of tissue materials.
The lactosamines form a preferred structure group with lactose-based glycolipids. The structures share similar features as products of β3/4Gal-transferases. The β3/4 galactose based structures were observed to produce characteristic features of protein linked and glycolipid glycomes.
The invention revealed that furthermore Galβ3/4GlcNAc-structures are a key feature of glycolipids of human cells. Such glycolipids comprise two preferred structural epitopes according to the invention. The most preferred glycolipid types include thus lactosylceramide based glycosphingolipids and especially lacto-(Galβ3GlcNAc), such as lactotetraosylceramide Galβ3GlcNAcβ3Galβ4GlcβCer, preferred structures further including its non-reducing terminal structures selected from the group: Galβ3(Fucα4)GlcNAc (Lewis a), Fucα2Galβ3GlcNAc (H-type 1), structure and, Fucα2Galβ3(Fucα4)GlcNAc (Lewis b) or sialylated structure SAα3Galβ3GlcNAc or SAα3Galβ3(Fucα4)GlcNAc, wherein SA is a sialic acid, preferably Neu5Ac preferably replacing Galβ3GlcNAc of lactotetraosylceramide and its fucosylated and/or elongated variants such as preferably according to the Formula:
(Sacα3)n5(Fucα2)n1Galβ3(Fucα4)n3GlcNAcβ3[Galβ3/4(Fucα4/3)n2GlcNAcβ3]n4Galβ4GlcβCer
wherein
n1 is 0 or 1, indicating presence or absence of Fucα2;
n2 is 0 or 1, indicating the presence or absence of Fucα4/3 (branch),
n3 is 0 or 1, indicating the presence or absence of Fucα4 (branch)
n4 is 0 or 1, indicating the presence or absence of (fucosylated) N-acetyllactosamine elongation;
n5 is 0 or 1, indicating the presence or absence of Sacα3 elongation;
Sac is terminal structure, preferably sialic acid, with α3-linkage, with the proviso that when
Sac is present, n5 is 1, then n1 is 0
and
neolacto (Galβ4GlcNAc)-comprising glycolipids such as neolactotetraosylceramide Galβ4GlcNAcβ3Galβ4GlcβCer, preferred structures further including its non-reducing terminal Galβ4(Fucα3)GlcNAc (Lewis x), Fucα2Galβ4GlcNAc H-type 2, structure and, Fucα2Galβ4(Fucα3)GlcNAc (Lewis y)
and
its fucosylated and/or elongated variants such as preferably
(Sacα3/6)n5(Fucα2)n1Galβ4(Fucα3)n3GlcNAcβ3[Galβ4(Fucα3)n2GlcNAcβ3]n4Galβ4GlcβCer
n1 is 0 or 1 indicating presence or absence of Fucα2;
n2 is 0 or 1, indicating the presence or absence of Fucα3 (branch),
n3 is 0 or 1, indicating the presence or absence of Fucα3 (branch) n4 is 0 or 1, indicating the presence or absence of (fucosylated) N-acetyllactosamine elongation,
n5 is 0 or 1, indicating the presence or absence of Sacα3/6 elongation;
Sac is terminal structure, preferably sialic acid (SA) with α3-linkage, or sialic acid with α6-linkage, with the proviso that when Sac is present, n5 is 1, then n1 is 0, and when sialic acid is bound by α6-linkage preferably also n3 is 0.
The inventors were able to describe human cell glycolipid glycomes by mass spectrometric profiling of liberated free glycans, revealing about 80 glycan signals from different cell types. The proposed monosaccharide compositions of the neutral glycans were composed of 2-7 Hex, 0-5 HexNAc, and 0-4 dHex. The proposed monosaccharide compositions of the acidic glycan signals were composed of 0-2 NeuAc, 2-9 Hex, 0-6 HexNAc, 0-3 dHex, and/or 0-1 sulphate or phosphate esters. The present invention is especially directed to analysis and targeting of such human cell glycan profiles and/or structures for the uses described in the present invention with respect to human cells. The present invention is further specifically directed to glycosphingolipid glycan signals specific to human cell types.
Terminal glycan epitopes that were demonstrated in the present experiments in human cell glycosphingolipid glycans are useful in recognizing cells or specifically binding to the cells via glycans, and other uses according to the present invention, including terminal epitopes: Gal, Galβ4Glc (Lac), Galβ4GlcNAc (LacNAc type 2), Galβ3, Non-reducing terminal HexNAc, Fuc, α1,2-Fuc, α1,3-FucαFucα2Gal, Fucα2Galβ4GlcNAc (H type 2), Fucα2Galβ4Glc (2′-fucosyllactose), Fucα3GlcNAc, Galβ4(Fucα3)GlcNAc (Lex), Fucα3Glc, Galβ4(Fucα3)Glc (3-fucosyllactose), Neu5Ac, Neu5Acα2,3, and Neu5Acα2,6. The present invention is further directed to the total terminal epitope profiles within the total human cell glycosphingolipid glycomes and/or glycomes.
The present invention revealed characteristic variations (increased or decreased expression in comparison to similar control cell or a contaminating cell or like) of both structure types in various tissue and cell materials according to the invention. The structures were revealed with characteristic and varying expression in three different glycome types: N-glycans, O-glycans, and glycolipids. The invention revealed that the glycan structures are a characteristic feature of tissue materials and are useful for various analysis methods according to the invention. Amounts of these and relative amounts of the epitopes and/or derivatives varies between tissue materials or between cells exposed to different conditions during growing, storage, or induction with effector molecules such as cytokines and/or hormones.
The preferred glycome glycan structure(s) and/or glycomes from cells according to the invention comprise structure(s) according to the formula C1:
R1Hexβz{R3}n1HexNAcXyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon,
z is linkage position 3 or 4, with the provision that when z is 4 then HexNAc is GlcNAc and then Hex is Man or Hex is Gal or Hex is GlcA, and
when z is 3 then Hex is GlcA or Gal and HexNAc is GlcNAc or GalNAc,
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein, or natural serine or threonine linked O-glycoside derivative such as serine or threonine linked O-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
R3 is nothing or a branching structure representing a GlcNAcβ6 or an oligosaccharide with GlcNAcβ6 at its reducing end linked to GalNAc (when HexNAc is GalNAc) or when Hex is Gal and HexNAc is GlcNAc the then when z is 3 R3 is Fucα4 or nothing and when z is 4 R3 is Fucα3 or nothing.
The preferred disaccharide epitopes in the glycan structures and glycomes according to the invention include structures Galβ4GlcNAc, Manβ4GlcNAc, GlcAβ4GlcNAc, Galβ3GlcNAc, Galβ3GalNAc, GlcAβ3GlcNAc and GlcAβ3GalNAc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues and is separate embodiment branched from the reducing end residue. Preferred branched epitopes include Galβ4(Fucα3)GlcNAc, Galβ3(Fucα4)GlcNAc, Galβ3(GlcNAcβ6)GalNAc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues.
The preferred disaccharide epitopes of glycoprotein or glycolipid structures present on glycans of human cells according to the invention comprise structures based on the formula C2:
R1Hexβ4GlcNAcXyR2,
Wherein Hex is Gal OR Man and when Hex is Man then X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and when Hex is Gal then X is β3GalNAc of O-glycan core or β2/4/6Manα3/6 terminal of N-glycan core (as in formula NC3)
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon,
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
when Hex is Gal preferred R1 groups include structures SAα3/6, SAα3/6Galβ4GlcNAcβ3/6,
when Hex is Man preferred R1 groups include Manα3, Manα6, branched structure Manα3 {Manα6} and elongated variants thereof as described for low mannose, high-mannose and complex type N-glycans below,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein, or natural serine or threonine linked O-glycoside derivative such as serine or threonine linked O-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
The inventors revealed that the N-glycans released by specific N-glycan release methods from the cells according to the invention, and preferred cells according to the invention, comprise mostly a specific type of N-glycan core structure.
The preferred N-glycan structure of each cell type is characterised and recognized by treating cells with a N-glycan releasing enzyme releasing practically all N-glycans with core type according to the invention. The N-glycan releasing enzyme is preferably protein N-glycosidase enzyme, preferably by protein N-glycosidase releasing effectively the N-glycomes according to the invention, more preferably protein N-glycosidase with similar specificity as protein N-glycosidase F, and in a specifically preferred embodiment the enzyme is protein N-glycosidase F from F. meningosepticum. Alternative chemical N-glycan release method was used for controlling the effective release of the N-glycomes by the N-glycan releasing enzyme.
The inventors used the NMR glycome analysis according to the invention for further characterization of released N-glycomes from small cell samples available. NMR spectroscopy revealed the N-glycan core signals of the preferred N-glycan core type of the cells according to the invention.
The present invention is directed to glycomes derived from cells and comprising a common N-glycosidic core structures. The invention is specifically directed to minimum formulas covering both GN1-glycomes and GN2-glycomes with difference in reducing end structures.
The minimum core structure includes glycans from which reducing end GlcNAc or Fucα6GlcNAc has been released. These are referred as GN1-glycomes and the components thereof as GN1-glycans. The present invention is specifically directed to natural N-glycomes from cells comprising GN1-glycans. In a preferred embodiment the invention is directed to purified or isolated practically pure natural GN1-glycome from human cells. The release of the reducing end GlcNAc-unit completely or partially may be included in the production of the N-glycome or N-glycans from cells for analysis. The invention is specifically directed to soluble high/low mannose glycome of GN2-type.
The glycomes including the reducing end GlcNAc or Fucα6GlcNAc are referred as GN2-glycomes and the components thereof as GN2-glycans. The present invention is also specifically directed to natural N-glycomes from cells and tissues comprising GN2-glycans. In a preferred embodiment the invention is directed to purified or isolated practically pure natural GN2-glycome from cells.
The preferred N-glycan core structure(s) and/or N-glycomes from cells according to the invention comprise structure(s) according to the formula NC1:
R1Mβ4GNXyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
It is realized that when the invention is directed to a glycome, the formula indicates mixture of several or typically more than ten or even higher number of different structures according to the Formulas describing the glycomes according to the invention.
The possible carbohydrate substituents R1 comprise at least one mannose (Man) residue, and optionally one or several GlcNAc, Gal, Fuc, SA and/GalNAc residues, with possible sulphate and or phosphate modifications.
When the glycome is released by N-glycosidase the free N-glycome saccharides comprise in a preferred embodiment reducing end hydroxyl with anomeric linkage A having structure α and/or β, preferably both α and β. In another embodiment the glycome is derivatized by a molecular structure which can be reacted with the free reducing end of a released glycome, such as amine, aminooxy or hydrazine or thiol structures. The derivatizing groups comprise typically 3 to 30 atoms in aliphatic or aromatic structures or can form terminal group spacers and link the glycomes to carriers such as solid phases or microparticles, polymeric carries such as oligosaccharides and/or polysaccharide, peptides, dendrimer, proteins, organic polymers such as plastics, polyethyleneglycol and derivatives, polyamines such as polylysines.
When the glycome comprises asparagine N-glycosides, A is preferably beta and R is linked asparagine or asparagine peptide. The peptide part may comprise multiple different aminoacid residues and typically multiple forms of peptide with different sequences derived from natural proteins carrying the N-glycans in cell materials according to the invention. It is realized that for example proteolytic release of glycans may produce mixture of glycopeptides. Preferably the peptide parts of the glycopeptides comprises mainly a low number of amino acid residues, preferably two to ten residues, more preferably two to seven amino acid residues and even more preferably two to five aminoacid residues and most preferably two to four amino acid residues when “mainly” indicates preferably at least 60% of the peptide part, more preferably at least 75% and most preferably at least 90% of the peptide part comprising the peptide of desired low number of aminoacid residues.
The preferred GN2-N-Glycan Core Structure(s)
The preferred GN2-N-glycan core structure(s) and/or N-glycomes from cells according to the invention comprise structure(s) according to the formula NC2:
R1Mβ4GNβ4(Fucα6)nGNyR2,
wherein n is 0 or 1 and
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein.
The preferred compositions thus include one or several of the following structures
Mα3{Mα6}Mβ4GNβ4{Fucα6}n1GNyR2 NC2a
Mα6Mβ4GNβ4{Fucα6}n1GNyR2 NC2b
Mα3Mβ4GNβ4{Fucα6}n1GNyR2 NC2c
More preferably compositions comprise at least 3 of the structures or most preferably both structures according to the formula NC2a and at least both fucosylated and non-fucosylated with core structure(s) NC2b and/or NC2c.
The preferred GN1-N-Glycan Core Structure(s)
The preferred GN1-N-glycan core structure(s) and/or N-glycomes from cells according to the invention comprise structure(s) according to the formula NC3:
R1Mβ4GNyR2,
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein.
The invention is specifically directed glycans and/or glycomes derived from preferred cells according to the present invention when the natural glycome or glycan comprises Multi-mannose GN1-N-glycan core structure(s) structure(s) according to the formula NC4:
[R1Mα3]n3{R3Mα6}n2Mβ4GNXyR2,
R1 and R3 indicate nothing or one or two, natural type carbohydrate substituents linked to the core structures, when the substituents are α-linked mannose monosaccharide and/or oligosaccharides and the other variables are as described above.
Furthermore common elongated GN2-N-glycan core structures are preferred types of glycomes according to the invention
The preferred N-glycan core structures further include differently elongated GN2-N-glycan core structures according to the formula NC5:
[R1Mα3]n3{R3Mα6}n2Mβ4GNβ4{Fucα6}n1GNyR2,
wherein n1, n2 and n3 are either 0 or 1 and
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 and R3 indicate nothing or 1-4, preferably 1-3, most preferably one or two, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein,
GN is GlcNAc, M is mannosyl-, [ ] indicate groups either present or absent in a linear sequence.
{ } indicates branching which may be also present or absent.
with the provision that at least n2 or n3 is 1. Preferably the invention is directed to compositions comprising with all possible values of n2 and n3 and all saccharide types when R1 and/or are R3 are oligosaccharide sequences or nothing.
The present invention is preferably directed to N-glycan glycomes comprising one or several of the preferred N-glycan core types according to the invention. The present invention is specifically directed to specific N-glycan core types when the compositions comprise N-glycan or N-glycans from one or several of the groups Low mannose glycans, High mannose glycans, Hybrid glycans, and Complex glycans, in a preferred embodiment the glycome comprise substantial amounts of glycans from at least three groups, more preferably from all four groups.
The invention revealed certain structural groups present in N-linked glycomes. The grouping is based on structural features of glycan groups obtained by classification based on the monosaccharide compositions and structural analysis of the structural groups. The glycans were analysed by NMR, specific binding reagents including lectins and antibodies and specific glycosidases releasing monosaccharide residues from glycans. The glycomes are preferably analysed as neutral and acidic glycomes
The neutral glycomes mean glycomes comprising no acidic monosaccharide residues such as sialic acids (especially NeuNAc and NeuGc), HexA (especially GlcA, glucuronic acid) and acid modification groups such as phosphate and/or sulphate esters. There are four major types of neutral N-linked glycomes which all share the common N-glycan core structure: High-mannose N-glycans, low-mannose N-glycans, hydrid type and complex type N-glycans. These have characteristic monosaccharide compositions and specific substructures. The complex and hybrid type glycans may include certain glycans comprising monoantennary glycans.
The groups of complex and hybrid type glycans can be further analysed with regard to the presence of one or more fucose residues. Glycans containing at least one fucose units are classified as fucosylated. Glycans containing at least two fucose residues are considered as glycans with complex fucosylation indicating that other fucose linkages, in addition to the α1,6-linkage in the N-glycan core, are present in the structure. Such linkages include α1,2-, α1,3-, and α1,4-linkage.
Furthermore the complex type N-glycans may be classified based on the relations of HexNAc (typically GlcNAc or GalNAc) and Hex residues (typically Man, Gal). Terminal HexNAc glycans comprise at least three HexNAc units and at least two Hexose units so that the number of Hex Nac residues is at least larger or equal to the number of hexose units, with the provision that for non branched, monoantennary glycans the number of HexNAcs is larger than number of hexoses.
This consideration is based on presence of two GlcNAc units in the core of N-glycan and need of at least two Mannose units to for a single complex type N-glycan branch and three mannose to form a trimannosyl core structure for most complex type structures. A specific group of HexNAc N-Glycans contains the same number of HexNAcs and Hex units, when the number is at least 5.
The invention is further directed to glycans comprising terminal Mannose such as Mα6-residue or both Manα6- and Manα3-residues, respectively, can additionally substitute other Mα2/3/6 units to form a Mannose-type structures including hybrid, low-Man and High-Man structures according to the invention.
Preferred high- and low mannose type structures with GN2-core structure are according to the Formula M2:
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6)]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNβ4[{Fucα6}]m GNyR2
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0;
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein;
[ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
{ } indicates a branch in the structure.
Preferred yR2-structures include [β-N-Asn]p, wherein p is either 0 or 1.
As described above a preferred variant of N-glycomes comprising only single GlcNAc-residue in the core. Such structures are especially preferred as glycomes produced by endo-N-acetylglucosaminidase enzymes and Soluble glycomes. Preferred Mannose type glycomes include structures according to the Formula M2
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6)]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNβ4GNyR2
Fucosylated high-mannose N-glycans according to the invention have molecular compositions Man5-9GlcNAc2Fuc1. For the fucosylated high-mannose glycans according to the formula, the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is an integer from 4 to 8 and m is 0.
The low-mannose structures have molecular compositions Man1-4GlcNAc2Fuc0-1. They consist of two subgroups based on the number of Fuc residues: 1) nonfucosylated low-mannose structures have molecular compositions Man1-4GlcNAc2 and 2) fucosylated low-mannose structures have molecular compositions Man1-4GlcNAc2Fuc1. For the low mannose glycans the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); and preferably n1, n3, n6, and n7 are 0 when m is 0.
The invention revealed a very unusual group glycans in N-glycomes of invention defined here as low mannose N-glycans. These are not clearly linked to regular biosynthesis of N-glycans, but may represent unusual biosynthetic midproducts or degradation products. The low mannose glycans are especially characteristics changing during the changes of cell status, the differentiation and other changes according to the invention, for examples changes associated with differentiation status of cells and their differentiated products and control cell materials.
The invention is especially directed to recognizing low amounts of low-mannose type glycans in cell types, such as with low degree of differentiation.
The invention revealed large differences between the low mannose glycan expression in the cell and tissue glycomes and material from tissue secretions such as human serum.
The invention is especially directed to the use of specific low mannose glycan comprising glycomes for analysis of tissues and cells, preferably cultivated cells.
The invention further revealed specific mannose directed recognition methods useful for recognizing the preferred glycomes according to the invention. The invention is especially directed to combination of glycome analysis and recognition by specific binding agents, most preferred binding agent include enzymes and thesis derivatives. The invention further revealed that specific low mannose glycans of the low mannose part of the glycomes can be recognized by degradation by specific {tilde over (α)}-mannosidase (Man2-4GlcNAc2Fuc0-1) or α-mannosidase (Man1GlcNAc2Fuc0-1) enzymes and optionally further recognition of small low mannose structures, even more preferably low mannose structures comprising terminal Man 4-structures according to the invention.
The low mannose N-glycans, and preferred subgroups and individual structures thereof, are especially preferred as markers of the novel glycome compositions of the cells according to the invention useful for characterization of the cell types.
The low-mannose type glycans includes a specific group of α3- and/or α6-linked mannose type structures according to the invention including a preferred terminal and core structure types according to the invention.
The inventions further revealed that low mannose N-glycans comprise a unique individual structural markers useful for characterization of the cells according to the invention by specific binding agents according to the invention or by combinations of specific binding agents according to the invention.
Neutral low-mannose type N-glycans comprise one to four or five terminal Man-residues, preferentially Manα structures; for example Manα0-3Manβ4GlcNAcβ4GlcNAc(β-N-Asn) or Manα0-4Manβ4GlcNAcβ4(Fucα6)GlcNAc(β-N-Asn).
Low-mannose N-glycans are smaller and more rare than the common high-mannose N-glycans (Man5-9GlcNAc2). The low-mannose N-glycans detected in cell samples fall into two subgroups: 1) non-fucosylated, with composition MannGlcNAc2, where 1≦n≦4, and 2) core-fucosylated, with composition MannGlcNAc2Fuc1, where 1≦n≦5. The largest of the detected low-mannose structure structures is Man5GlcNAc2Fuc1 (m/z 1403 for the sodium adduct ion), which due to biosynthetic reasons most likely includes the structure below (in the figure the glycan is free oligosaccharide and β-anomer; in glycoproteins in tissues the glycan is N-glycan and β-anomer):
According to the present invention, low-mannose structures are preferentially identified by mass spectrometry, preferentially based on characteristic Hex1-4HexNAc2dHex0-1 monosaccharide composition. The low-mannose structures are further preferentially identified by sensitivity to exoglycosidase digestion, preferentially α-mannosidase (Hex2-4HexNAc2dHexc0-1) or β-mannosidase (Hex1HexNAc2dHex0-1) enzymes, and/or to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins, Endoglycosidase H detachment from glycoproteins (only Hex1-4HexNAc2 liberated as Hex1-4HexNAc1), and/or Endoglycosidase F2 digestion (only Hex1-4HexNAc2dHex1 digested to Hex1-4HexNAc1). The low-mannose structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manβ4GlcNAcβ4GlcNAc N-glycan core structure and Manα residues attached to the Manβ4 residue. Several preferred low Man glycans described above can be presented in a single Formula:
[Mα3]n2{[Mα6)]n4}[Mα6]n5{[Mα3]n8}Mβ4GNβ4[{Fucα6}]mGNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0;
when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0; the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); [ ] indicates determinant either being present or absent depending on the value of n2, n4, n5, n8, and m; and
{ } indicates a branch in the structure;
y and R2 are as indicated above.
Preferred non-fucosylated low-mannose glycans are according to the formula:
[Mα3]n2([Mα6)]n4)[Mα6]n5{[Mα3]n8}Mβ4GNβ4GNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1,
with the provision that when n5 is 0, also n2 and n4 are 0, and preferably either n2 or n4 is 0,
[ ] indicates determinant either being present or absent depending on the value of, n2, n4, n5, n8,
{ } and 0 indicates a branch in the structure,
y and R2 are as indicated above.
Small non-fucosylated low-mannose structures are especially unusual among known N-linked glycans and characteristic glycans group useful for separation of cells according to the present invention. These include:
The invention further revealed large non-fucosylated low-mannose structures that are unusual among known N-linked glycans and have special characteristic expression features among the preferred cells according to the invention. The preferred large structures include
The hexasaccharide epitopes are preferred in a specific embodiment as rare and characteristic structures in preferred cell types and as structures with preferred terminal epitopes. The heptasaccharide is also preferred as structure comprising a preferred unusual terminal epitope Mα3(Mα6)Mα useful for analysis of cells according to the invention.
Preferred fucosylated low-mannose glycans are derived according to the formula:
[Mα3]n2{[Mα6]n4}[Mα6]n5{[Mα3]n8}Mβ4GNβ4(Fucα6)GNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1, with the provision that when n5 is 0, also n2 and n4 are 0, [ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, ( ) indicates a branch in the structure;
and wherein n1, n2, n3, n4 and m are either independently 0 or 1,
with the provision that when n3 is 0, also n1 and n2 are 0,
[ ] indicates determinant either being present or absent
depending on the value of n1, n2, n3, n4 and m,
{ } and ( ) indicate a branch in the structure.
Small fucosylated low-mannose structures are especially unusual among known N-linked glycans and form a characteristic glycan group useful for separation of cells according to the present invention. These include:
The invention further revealed large fucosylated low-mannose structures are unusual among known N-linked glycans and have special characteristic expression features among the preferred cells according to the invention. The preferred large structure includes
The heptasaccharide epitopes are preferred in a specific embodiment as rare and characteristic structures in preferred cell types and as structures with preferred terminal epitopes. The octasaccharide is also preferred as structure comprising a preferred unusual terminal epitope Mα3(Mα6)Mα useful for analysis of cells according to the invention.
The inventors revealed that mannose-structures can be labeled and/or otherwise specifically recognized on cell surfaces or cell derived fractions/materials of specific cell types. The present invention is directed to the recognition of specific mannose epitopes on cell surfaces by reagents binding to specific mannose structures from cell surfaces.
The preferred reagents for recognition of any structures according to the invention include specific antibodies and other carbohydrate recognizing binding molecules. It is known that antibodies can be produced for the specific structures by various immunization and/or library technologies such as phage display methods representing variable domains of antibodies. Similarly with antibody library technologies, including aptamer technologies and including phage display for peptides, exist for synthesis of library molecules such as polyamide molecules including peptides, especially cyclic peptides, or nucleotide type molecules such as aptamer molecules.
The invention is specifically directed to specific recognition high-mannose and low-mannose structures according to the invention. The invention is specifically directed to recognition of non-reducing end terminal Manα-epitopes, preferably at least disaccharide epitopes, according to the formula:
[Mα2]m1[Mαx]m2[Mα6]m3{{[Mα2]m9[Mα2]m8[Mα3]m7}m10(Mβ4[GN]m4)m5}m6yR2
wherein m1, m 2, m3, m4, m5, m6, m7, m8, m9 and m10 are independently either 0 or 1; with the proviso that when m3 is 0, then m1 is 0 and, when m7 is 0 then either m1-5 are 0 and m8 and m9 are 1 forming Mα2Mα2-disaccharide or both m8 and m9 are 0
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative
and x is linkage position 3 or 6 or both 3 and 6 forming branched structure,
{ } indicates a branch in the structure.
The invention is further directed to terminal Mα2-containing glycans containing at least one Mα2-group and preferably Mα2-group on each, branch so that m1 and at least one of m8 or m9 is 1. The invention is further directed to terminal Mα3 and/or Mα6-epitopes without terminal Mα2-groups, when all m1, m8 and m9 are 1.
The invention is further directed in a preferred embodiment to the terminal epitopes linked to a Mβ-residue and for application directed to larger epitopes. The invention is especially directed to Mβ4GN-comprising reducing end terminal epitopes.
The preferred terminal epitopes comprise typically 2-5 monosaccharide residues in a linear chain. According to the invention short epitopes comprising at least 2 monosaccharide residues can be recognized under suitable background conditions and the invention is specifically directed to epitopes comprising 2 to 4 monosaccharide units and more preferably 2-3 monosaccharide units, even more preferred epitopes include linear disaccharide units and/or branched trisaccharide non-reducing residue with natural anomeric linkage structures at reducing end. The shorter epitopes may be preferred for specific applications due to practical reasons including effective production of control molecules for potential binding reagents aimed for recognition of the structures.
The shorter epitopes such as Mα2M—may is often more abundant on target cell surface as it is present on multiple arms of several common structures according to the invention.
Manα2Man, Manα3Man, Manα6Man, and more preferred anomeric forms Manα2Manα, Manα3Manβ, Manα6Manβ, Manα3Manα and Manα6Manα.
Preferred branched trisaccharides includes Manα3(Manα6)Man, Manα3(Manα6)Manβ, and Manα3(Manα6)Manα.
The invention is specifically directed to the specific recognition of non-reducing terminal Manα2-structures especially in context of high-mannose structures.
The invention is specifically directed to following linear terminal mannose epitopes:
a) preferred terminal Manα2-epitopes including following oligosaccharide sequences:
The invention is further directed to recognition of and methods directed to non-reducing end terminal Manα3- and/or Manα6-comprising target structures, which are characteristic features of specifically important low-mannose glycans according to the invention. The preferred structural groups includes linear epitopes according to b) and branched epitopes according to the c3) especially depending on the status of the target material.
b) preferred terminal Manα3- and/or Manα6-epitopes including following oligosaccharide sequences:
The present invention is further directed to increase of selectivity and sensitivity in recognition of
Target glycans by combining recognition methods for terminal Manα2 and Manα3 and/or Manα6-comprising structures. Such methods would be especially useful in context of cell material according to the invention comprising both high-mannose and low-mannose glycans.
According to the present invention, complex-type structures are preferentially identified by mass spectrometry, preferentially based on characteristic monosaccharide compositions, wherein HexNAc≧4 and Hex≧3. In a more preferred embodiment of the present invention, 4≦HexNAc≦20 and 3≦Hex≦21, and in an even more preferred embodiment of the present invention, 4≦HexNAc≦10 and 3≦Hex≦11. The complex-type structures are further preferentially identified by sensitivity to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The complex-type structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc N-glycan core structure and GlcNAc residues attached to the Manα3 and/or Manα6 residues.
Beside Mannose-type glycans the preferred N-linked glycomes include GlcNAcβ2-type glycans including Complex type glycans comprising only GlcNAcβ2-branches and Hydride type glycan comprising both Mannose-type branch and GlcNAcβ2-branch.
The invention revealed GlcNAcβ2Man structures in the glycomes according to the invention. Preferably GlcNAcβ2Man-structures comprise one or several of GlcNAcβ2Manα-structures, more preferably GlcNAcβ2Manα3 or GlcNAcβ2Manα6-structure.
The Complex type glycans of the invention comprise preferably two GlcNAcβ2Manα structures, which are preferably GlcNAcβ2Manα3 and GlcNAcβ2Manα6-. The Hybrid type glycans comprise preferably GlcNAcβ2Manα3-structure.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula GNβ2
[R1GNβ2]n1[Mα3]n2{[R3]n3[GNβ2]n4Mα6}n5Mβ4GNXyR2,
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein n1, n2, n3, n4, n5 and nx, are either 0 or 1, independently,
with the proviso that when n2 is 0 then n1 is 0 and when n3 is 1 or/and n4 is 1 then n5 is also 1, and at least n1 or n4 is 1, or n3 is 1,
when n4 is 0 and n3 is 1 then R3 is a mannose type substituent or nothing and wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1, Rx and R3 indicate independently one, two or three, natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The substituents R1, Rx and R3 may form elongated structures. In the elongated structures R1, and Rx, represent substituents of GlcNAc (GN) and R3 is either substituent of GlcNAc or when n4 is 0 and n3 is 1 then R3 is a mannose type substituent linked to mannose a 6-branch forming a Hybrid type structure. The substituents of GN are monosaccharide Gal, GalNAc, or Fuc or and acidic residue such as sialic acid or sulfate or phosphate ester.
GlcNAc or GN may be elongated to N-acetyllactosaminyl also marked as GalβGN or di-N-acetyllactosdiaminyl GalNAcβGlcNAc preferably GalNAcβ4GlcNAc. LNβ2M can be further elongated and/or branched with one or several other monosaccharide residues such as by galactose, fucose, SA or LN-unit(s) which may be further substituted by SAα-structures, and/or Mα6 residue and/or Mα3 residues can be further substituted one or two β6-, and/or β4-linked additional branches according to the formula,
and/or either of Mα6 residue or Mα3 residue may be absent
and/or Mα6-residue can be additionally substitutes other Manα units to form a hybrid type structures
and/or Manβ4 can be further substituted by GNβ4,
and/or SA may include natural substituents of sialic acid and/or it may be substituted by other SA-residues preferably by α8- or α9-linkages.
The SAα-groups are linked to either 3- or 6-position of neighboring Gal residue or on 6-position of GlcNAc, preferably 3- or 6-position of neighboring Gal residue. In separately preferred embodiments the invention is directed structures comprising solely 3-linked SA or 6-linked SA, or mixtures thereof.
According to the present invention, hybrid-type or monoantennary structures are preferentially identified by mass spectrometry, preferentially based on characteristic monosaccharide compositions, wherein HexNAc=3 and Hex≧2. In a more preferred embodiment of the present invention 2≦Hex≦11, and in an even more preferred embodiment of the present invention 2≦Hex≦9. The hybrid-type structures are further preferentially identified by sensitivity to exoglycosidase digestion, preferentially α-mannosidase digestion when the structures contain non-reducing terminal α-mannose residues and Hex≧3, or even more preferably when Hex≧4, and to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The hybrid-type structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc N-glycan core structure, a GlcNAcβresidue attached to a Manα residue in the N-glycan core, and the presence of characteristic resonances of non-reducing terminal α-mannose residue or residues.
The monoantennary structures are further preferentially identified by insensitivity to α-mannosidase digestion and by sensitivity to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The monoantennary structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3Manβ4GlcNAcβ4GlcNAc N-glycan core structure, a GlcNAc residue attached to a Manα residue in the N-glycan core, and the absence of characteristic resonances of further non-reducing terminal α-mannose residues apart from those arising from a terminal α-mannose residue present in a ManαManβ sequence of the N-glycan core.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula HY1
R1GNβ2Mα3{[R3]n3Mα6}Mβ4GNXyR2,
wherein n3, is either 0 or 1, independently, and
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1 indicate nothing or substituent or substituents linked to GlcNAc, R3 indicates nothing or Mannose-substituent(s) linked to mannose residue, so that each of R1, and R3 may correspond to one, two or three, more preferably one or two, and most preferably at least one natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The preferred hydrid type structures include one or two additional mannose residues on the preferred core structure.
R1GNβ2Mα3{[Mα3]m1([Mα6])m2Mα6}Mβ4GNXyR2, Formula HY2
wherein n3, is either 0 or 1, and m1 and m2 are either 0 or 1, independently,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula HY1.
Furthermore the invention is directed to structures comprising additional lactosamine type structures on GNβ2-branch. The preferred lactosamine type elongation structures includes N-acetyllactosamines and derivatives, galactose, GalNAc, GlcNAc, sialic acid and fucose.
Preferred structures according to the formula HY2 include:
Structures containing non-reducing end terminal GlcNAc
As a specific preferred group of glycans
[R1Gal[NAc]o2βz]o1GNβ2Mα3{[Mα3]m1[(Mα6)]m2Mα6}n5Mβ4GNXyR2, Formula HY3
wherein n1, n2, n3, n5, m1, m2, 01 and o2 are either 0 or 1, independently,
z is linkage position to GN being 3 or 4, ? in a preferred embodiment 4,
R1 indicates on or two a N-acetyllactosamine type elongation groups or nothing,
{ } and ( ) indicates branching which may be also present or absent, other variables are as described in Formula HY1.
Preferred structures according to the formula HY3 include especially
structures containing non-reducing end terminal Galβ, preferably Galβ3/4 forming a terminal N-acetyllactosamine structure. These are preferred as a special group of Hybrid type structures, preferred as a group of specific value in characterization of balance of Complex N-glycan glycome and High mannose glycome:
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula CO1
[R1GNβ2]n1[Mα3]n2{[R3GNβ2]n4Mα6}n5Mβ4GNXyR2
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein n1, n2, n4, n5 and nx, are either 0 or 1, independently,
with the proviso that when n2 is 0 then n1 is 0 and when n4 is 1 then n5 is also 1, and at least n1 is 1 or n4 is 1, and at least either of n1 and n4 is 1
and
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1, Rx and R3 indicate independently one, two or three, natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
Incomplete monoantennary N-Glycans
The present invention revealed incomplete Complex monoantennary N-glycans, which are unusual and useful for characterization of glycomes according to the invention. The most of the in complete monoantennary structures indicate potential degradation of biantennary N-glycan structures and are thus preferred as indicators of cellular status. The incomplete Complex type monoantennary glycans comprise only one GNβ2-structure.
The invention is specifically directed to structures are according to the Formula CO1 above when only n1 is 1 or n4 is one and mixtures of such structures.
The preferred mixtures comprise at least one monoantennary complex type glycans
A) with single branches from a likely degradative biosynthetic process:
The structure B2 is preferred with A structures as product of degradative biosynthesis, it is especially preferred in context of lower degradation of Manβ3-structures. The structure B1 is useful for indication of either degradative biosynthesis or delay of biosynthetic process
The inventor revealed a major group of biantennary and multiantennary N-glycans from cells according to the invention, the preferred biantennary and multiantennary structures comprise two GNβ2 structures.
These are preferred as an additional characteristics group of glycomes according to the invention and are represented according to the Formula CO2:
R1GNβ2Mα3{R3GNβ2Mα6}Mβ4GNXyR2
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein nx is either 0 or 1,
and other variables are according to the Formula CO1.
A biantennary structure comprising two terminal GNP-epitopes is preferred as a potential indicator of degradative biosynthesis and/or delay of biosynthetic process. The more preferred structures are according to the Formula CO2 when R1 and R3 are nothing.
The invention revealed specific elongated complex type glycans comprising Gal and/or GalNAc-structures and elongated variants thereof. Such structures are especially preferred as informative structures because the terminal epitopes include multiple informative modifications of lactosamine type, which characterize cell types according to the invention. The present invention is directed to at least one of natural oligosaccharide sequence structure or group of structures and corresponding structure(s) truncated from the reducing end of the N-glycan according to the Formula CO3
[R1Gal[NAc]o2βz2]o1GNβ2Mα3{[R1Gal[NAc]o4βz2]o3GNβ2Mα6}Mβ4GNXyR2,
with optionally one or two or three additional branches according to formula [RxGNβz1]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein nx, o1, o2, o3, and o4 are either 0 or 1, independently,
with the provision that at least o1 or o3 is 1, in a preferred embodiment both are 1
z2 is linkage position to GN being 3 or 4, in a preferred embodiment 4,
z1 is linkage position of the additional branches.
R1, Rx and R3 indicate one or two a N-acetyllactosamine type elongation groups or nothing,
{ } and ( ) indicates branching which may be also present or absent, other variables are as described in Formula CO1.
The inventors characterized especially directed to digalactosylated structure
Preferred elongated materials include structures wherein R1 is a sialic acid, more preferably NeuNAc or NeuGc.
The present invention revealed for the first time LacdiNAc, GalNacbGlcNAc structures from the cell according to the invention. Preferred N-glycan lacdiNAc structures are included in structures according to the Formula CO1, when at least one the variable o2 and o4 is 1.
The acidic glycomes mean glycomes comprising at least one acidic monosaccharide residue such as sialic acids (especially NeuNAc and NeuGc) forming sialylated glycome, HexA (especially GlcA, glucuronic acid) and/or acid modification groups such as phosphate and/or sulphate esters.
According to the present invention, presence of phosphate and/or sulphate ester (SP) groups in acidic glycan structures is preferentially indicated by characteristic monosaccharide compositions containing one or more SP groups. The preferred compositions containing SP groups include those formed by adding one or more SP groups into non-SP group containing glycan compositions, while the most preferential compositions containing SP groups according to the present invention are selected from the compositions described in the acidic N-glycan fraction glycan group tables. The presence of phosphate and/or sulphate ester groups in acidic glycan structures is preferentially further indicated by the characteristic fragments observed in fragmentation mass spectrometry corresponding to loss of one or more SP groups, the insensitivity of the glycans carrying SP groups to sialidase digestion. The presence of phosphate and/or sulphate ester groups in acidic glycan structures is preferentially also indicated in positive ion mode mass spectrometry by the tendency of such glycans to form salts such as sodium salts as described in the Examples of the present invention. Sulphate and phosphate ester groups are further preferentially identified based on their sensitivity to specific sulphatase and phosphatase enzyme treatments, respectively, and/or specific complexes they form with cationic probes in analytical techniques such as mass spectrometry.
Complex N-glycan Glycomes, sialylated
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula
[{SAα3/6}s1LNβ2]r1Mα3{({SAα3/6}s2LNβ2)r2Mα6}r8{M[β4GN[β4{Fucα6}r3GN]r4]r5}r6 (I)
with optionally one or two or three additional branches according to formula
{SAα3/6}s3LNβ, (IIb)
wherein r1, r2, r3, r4, r5, r6, r7 and r8 are either 0 or 1, independently,
wherein s1, s2 and s3 are either 0 or 1, independently,
with the proviso that at least r1 is 1 or r2 is 1, and at least one of s1, s2 or s3 is 1.
LN is N-acetyllactosaminyl also marked as GalβGN or di-N-acetyllactosdiaminyl
GalNAcβGlcNAc preferably GalNAcβ4GlcNAc, GN is GlcNAc, M is mannosyl-,
with the proviso LNβ2M or GNβ2M can be further elongated and/or branched with one or several other monosaccharide residues such as by galactose, fucose, SA or LN-unit(s) which may be further substituted by SAα-structures,
and/or one LNβ can be truncated to GNP
and/or Mα6 residue and/or Mα3 residues can be further substituted one or two β6-, and/or β4-linked additional branches according to the formula,
and/or either of Mα6 residue or Mα3 residue may be absent
and/or Mα6-residue can be additionally substitutes other Manαunits to form a hybrid type structures
and/or Manβ4 can be further substituted by GNβ4,
and/or SA may include natural substituents of sialic acid and/or it may be substituted by other SA-residues preferably by α8- or α9-linkages.
( ), { }, [ ] and [ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The SAα-groups are linked to either 3- or 6-position of neighboring Gal residue or on 6-position of GlcNAc, preferably 3- or 6-position of neighboring Gal residue. In separately preferred embodiments the invention is directed structures comprising solely 3-linked SA or 6-linked SA, or mixtures thereof. In a preferred embodiment the invention is directed to glycans wherein r6 is 1 and r5 is 0, corresponding to N-glycans lacking the reducing end GlcNAc structure.
The LN unit with its various substituents can in a preferred general embodiment represented by the formula:
[Gal(NAc)n1α3]n2{Fucα2}n3Gal(NAc)n4β3/4{Fucα4/3}n5GlcNAcβ
wherein n1, n2, n3, n4, and n5 are independently either 1 or 0,
with the provision that
the substituents defined by n2 and n3 are alternative to presence of SA at the non-reducing end terminal
the reducing end GlcNAc-unit can be further β3- and/or β6-linked to another similar LN-structure forming a poly-N-acetyllactosamine structure
with the provision that for this LN-unit n2, n3 and n4 are 0,
the Gal(NAc)p and GlcNAcβunits can be ester linked a sulphate ester group,
( ), and [ ] indicate groups either present or absent in a linear sequence; { } indicates branching which may be also present or absent.
LN unit is preferably Galβ4GN and/or Galβ3GN. The inventors revealed that early human cells can express both types of N-acetyllactosamine, the invention is especially directed to mixtures of both structures. Furthermore the invention is directed to special relatively rear type 1 N-acetyllactosamines, Galβ3GN, without any non-reducing end/site modification, also called lewis c-structures, and substituted derivatives thereof, as novel markers of early human cells.
In the present invention, glycan signals with preferential monosaccharide compositions can be grouped into structure groups based on classification rules described in the present invention. The present invention includes parallel and overlapping classification systems that are used for the classification of the glycan structure groups.
Glycan signals isolated from the N-glycan fractions from the tissue material types studied in the present invention are grouped into glycan structure groups based on their preferential monosaccharide compositions according to the invention, in Table 6 for neutral glycan fractions and Table 7 for acidic glycan fractions. Taken together, the analyses revealed that all the structure groups according to the invention are present in the studied tissue material types. In another aspect of the present invention, the glycan structure grouping is used to compare different tissue materials and characterize their specific glycosylation features. According to the present invention the discovered and analyzed differences between the glycan signals within the glycan signal groups between different tissue material samples are used for comparison and characterization.
The quantitative glycan profiling combined with glycan structural classification is used according to the present invention to characterize and identify glycosylation features occurring in tissue materials, glycosylation features specific for certain tissue materials as well as differences between different tissue materials. According to the present invention, the classification is used to characterize and compare glycosylation features of different tissues, of normal and diseased tissues, preferentially cancerous tissues, and solid tissues such as lung tissue and fluid tissues such as blood and/or serum. In another aspect of the present invention, the glycan structure grouping is used to compare different tissue materials and characterize their specific glycosylation features. According to the present invention differences between relative proportions of glycan signal structure groups are used to compare different tissue material samples.
In a further aspect of the present invention, analysis of the glycan structure groups, preferentially including terminal HexNAc and/or low-mannose and optionally other groups separately or in combination, is used to differentiate between different tissue materials or different stages of tissue materials, preferentially to identify human disease and more preferentially human cancer. In a further preferred form the present method is used to differentiate between benign and malignant tumors. According to the present invention analysis of human serum glycan groups or combinations thereof according to the present invention can be used to identify the presence of other tissue materials in blood or serum samples, more preferably to identify disease and preferably malignant cancer.
The invention is directed to analysis of present cell materials based on single or several glycans (glycome profile) of cell materials according to the invention. The analysis of multiple glycans is preferably performed by physical analysis methods such as mass spectrometry and/or NMR.
The invention is specifically directed to integrated analysis process for glycomes, such as total glycomes and cell surface glycomes. The integrated process represent various novel aspects in each part of the process. The methods are especially directed to analysis of low amounts of cells. The integrated analysis process includes
A) preferred preparation of substrate cell materials for analysis, including one or several of the methods: use of a chemical buffer solution, use of detergents, chemical reagents and/or enzymes.
B) release of glycome(s), including various subglycome type based on glycan core, charge and other structural features, use of controlled reagents in the process
C) purification of glycomes and various subglycomes from complex mixtures
D) preferred glycome analysis, including profiling methods such as mass spectrometry and/or NMR spectroscopy
E) data processing and analysis, especially comparative methods between different sample types and quantitative analysis of the glycome data.
Cell substrate material and its preparation for total and cell surface glycome analysis. The integrated glycome analysis includes preferably a cell preparation step to increase the availability of cell surface glycans. The cell preparation step preferably degrades either total cell materials or cell surface to yield a glycome for more effective glycan release. The degradation step preferably includes methods of physical degradation and/or chemical degradation. In a preferred embodiment at least one physical and one chemical degradation methods are combined, more preferably at least one physical method is combined with at least two chemical methods, even more preferably with at least three chemical methods.
The physical degration include degration by energy including thermal and/or mechanical energy directed to the cells to degrade cell structures such as heating, freezing, sonication, and pressure. The chemical degradation include use of chemicals and specific concentrations of chemicals for disruption of cells preferably detergents including ionic and neutral detergents, chaotropic salts, denaturing chemicals such as urea, and non-physiological salt concentrations for disruption of the cells.
The glycome analysis according to the invention is divided to two methods including Total cell glycomes, and Cell surface glycomes. The production of Total cell glycomes involves degradation of cells by physical and/or chemical degradation methods, preferably at least by chemical methods, more preferably by physical and chemical methods. The Cell surface glycomes is preferably released from cell surface preserving cell membranes intact or as intact as possible, such methods involve preferably at least one chemical method, preferably enzymatic method. The cell surface glycomes may be alternatively released from isolated cell membranes, this method involves typically chemical and/or physical methods similarity as production of total cell glycomes, preferably at least use of detergents.
a. Total Cell Glycomes
The present invention revealed special methods for effective purification of released glycans from total cell derived materials so that free oligosaccharides can be obtained. In a preferred embodiment a total glycome is produced from a cell sample, which is degraded to form more available for release of glycans. A preferred degraded form of cells is detergent lysed cells optionally involving physical disruption of cell materials.
Preferred detergents and reaction conditions include,
a1) ionic detergents, preferably SDS type anionic detergent comprising an anionic group such as sulfate and an alkyl chain of 8-16 carbon atoms, more preferably the anionic detergent comprise 10-14 carbon atoms and it is most preferably sodium dodecyl sulfate (SDS), and/or
a2) non-ionic detergents such as alkylglycosides comprising a hexose and 4-12 carbon alkyl chain more preferably the alkyl chain comprises a hexoses being galactose, glucose, and/or mannose, more preferably glucose and/or mannose and the alkyl comprises 6-10 carbon atoms, preferably the non-ionic detergent is octylglucoside
It is realized that various detergent combinations may be produced and optimized. The combined use of an ionic, preferably anionic, and non-ionic detergents according to the invention is especially preferred.
The preferred methods of cell degration for Total cell glycomes include physical degration including at least heat treatment heat and chemical degration by a detergent method or by a non-detergent method preferably enzymatic degradation, preferably heat treatment. Preferably two physical degradation methods are included.
A non-detergent method is preferred for avoiding detergent in later purification. The preferred non-detergent method involves physical degradation of cells preferably pressure and or by heat and a chemical degradation by protease. A preferred non-detergent method includes:
i) cell degradation by physical methods, for example by pressure methods such as by French press.
The treatment is preferably performed quickly in cold temperatures, preferably at 0-2 degrees of Celsius, and more preferably at about 0 or 1 degree of celcius and/or in the presence of glycosidase inhibitors.
ii) The degraded cells are further treated with chemical degradation, preferably by effective general protease, more preferably trypsin is used for the treatment. Preferred trypsin preparation according to the invention does not cause glycan contamination to the sample/does not contain glycans releasable under the reaction conditions.
iii) optionally the physical degradation and chemical degradation are repeated.
iv) At the end of protease treatment the sample is boiled for further denaturing the sample and the protease. The boiling is performed at temperature denaturing/degrading further the sample and the protease activity (conditions thus depend on the protease used) preferably about 100 degrees Celsius for time enough for denaturing protease activity preferably about 10-20 minutes for trypsin, more preferably about 15 minutes.
The invention is in another preferred embodiment directed to detergent based method for lysing cells. The invention includes effective methods for removal of detergents in later purification steps. The detergent methods are especially preferred for denaturing proteins, which may bind or degrade glycans, and for degrading cell membranes to increase the accessibility of intracellular glycans.
For the detergent method the cell sample is preferably a cell pellet produced at cold temperature by centrifuging cells but avoiding disruption of the cells, optionally stored frozen and melted on ice. Optionally glycosidase inhibitors are used during the process.
The method includes following steps:
i) production of cell pellet preferably by centrifugation,
ii) lysis by detergent on ice, the detergent is preferably an anionic detergent according to the invention, more preferably SDS. The concentration of the detergent is preferably between about 0.1% and 5%, more preferably between 0.5%-3%, even more preferably between 0.5-1.5% and most preferably about 1% and the detergent is SDS (or between 0.9-1.1%). the solution is preferably produced in ultrapure water,
iii) mixing by effective degradation of cells, preferably mixing by a Vortex-mixer as physical degradation step,
iv) boiling on water bath, preferably for 3-10 min, most preferably about 5 min (4-6 min) as second physical degradation step, it is realized that even longer boiling may be performed for example up to 30 min or 15 min, but it is not optimal because of evaporation sample
v) adding one volume of non-ionic detergent, preferably alkyl-glycoside detergent according to the invention, most preferably n-octyl-β-D-glucoside, the preferred amount of the detergent is about 5-15% as water solution, preferably about 10% of octyl-glucoside. The non-ionic detergent is especially preferred in case an enzyme sensitive to SDS, such as a N-glycosidase, is to be used in the next reaction step.
and
vi) incubation at room temperature for about 5 min to about 1-4 hours, more preferably less than half an hour, and most preferably about 15 min.
Preferred Amount of Detergents in the Detergent Method
Preferably the anionic detergent and cationic detergent solutions are used in equal volumes. Preferably the solutions are about 1% SDS and about 10% octyl-glucoside. The preferred amounts of the solutions are preferably from 0.1 μl to about 2 μl, more preferably 0.15 μl to about 1.5 μl per and most preferably from 0.16 μl to 1 μl per 100 000 cells of each solution. Lower amounts of the detergents are preferred if possible for reduction of the amount of detergent in later purification, highest amounts in relation to the cell amounts are used for practical reasons with lowest volumes. It is further realized that corresponding weight amounts of the detergents may be used in volumes of about 10% to about 1000%, or from about 20% to about 500% and even more effectively in volumes from 30% to about 300% and most preferably in volumes of range from 50% to about 150% of that described. It is realized that critical micellar concentration based effects may reduce the effect of detergents at lowest concentrations.
In a preferred embodiment a practical methods using tip columns as described in the invention uses about 1-3 μl of each detergent solution, more preferably 1.5-2.5 μl, and most preferably about 2 μl of the preferred detergent solutions or corresponding detergent amounts are used for about 200 000 or less cells (preferably between 2000 and about 250 000 cells, more preferably from 50 000 to about 250 000 cells and most preferably from 100 000 to about 200 000 cells). Another practical method uses about 2-10 μl of each detergent solution, more preferably 4-8 μl, and most preferably about 5 μl (preferably between 4 and 6 μl and more preferably between 4.5 and 5.5 μl) of detergent solutions or corresponding amount of the detergents for lysis of cell of a cell amount from about 200 000-3 million cells (preferred more exact ranges include 200 000-3.5 million, 200 000 to 3 million and 200 000 to 2.5 million cells), preferably a fixed amount (specific amount of microliters preferably with the accuracy of at least 0.1 microliter) in a preferred range such as of 5.0 μl is used for the wider range of cells 200 000-3 million. It was invented that is possible to handle similarity wider range of materials. It is further realized that the method can be optimized so that exact amount of detergent, preferably within the ranges described, is used for exact amount of cells, such method is preferably an automized when there is possible variation in amounts of sample cells.
b. Cell Surface Glycomes
In another preferred embodiment the invention is directed to release of glycans from intact cells and analysis of released cell surface glycomes. The present invention is directed to specific buffer and enzymatic cell pre-modification conditions that would allow the efficient use of enzymes for release and optionally modification and release of glycans.
The invention is directed to various enzymatic and chemical methods to release glycomes. The release step is not needed for soluble glycomes according to the invention. The invention further revealed soluble glycome components which can be isolated from the cells using methods according to the invention.
C. Purification of glycans from cell derived materials The purification of glycome materials form cell derived molecules is a difficult task. It is especially difficult to purify glycomes to obtain picomol or low nanomol samples for glycome profiling by mass spectrometry or NMR-spectrometry. The invention is especially directed to production of material allowing quantitative analysis over a wide mass range. The invention is specifically directed to the purification of non-derivatized or reducing end derivatized glycomes according to the invention and glycomes containing specific structural characteristics according to the invention. The structural characteristics were evaluated by the preferred methods according to the invention to produce reproducible and quantitative purified glycomes.
The glycan purification method according to the present invention consists of at least one of purification options, preferably in specific combinations described below, including one or several of following the following purification process steps in varying order:
8) Hydrophobic interaction;
9) Hydrophilic interaction; and
10) Affinity to carbon materials especially graphitized carbon.
In general the purification steps may be divided to two major categories:
Prepurification steps to remove major contaminations and purification steps usually directed to specific binding and optionally fractionation of glycomes
The need for prepurification depends on the type and amounts of the samples and the amounts of impurities present. Certain samples it is possible to omit all or part of the prepurification steps. The prepurification steps are aimed for removal of major non-carbohydrate impurities by separating the impurity and the glycome fraction(s) to be purified to different phases by precipitation/extraction or binding to chromatography matrix and the separating the impurities from the glycome fraction(s).
The prepurification steps include one, two or three of following major steps:
The precipitation and/or extraction is based on the high hydrophilic nature of glycome compositions and components, which is useful for separation from different cellular components and chemicals. The prepurification ion exchange chromatography is directed to removal of classes molecules with different charge than the preferred glycome or glycome fraction to be studied. This includes removal of salt ions and aminoacids, and peptides etc. The glycome may comprise only negative charges or in more rare case also only positive charges and the same charge is selected for the chromatography matrix for removal of the impurities for the same charge without binding the glycome at prepurification.
In a preferred embodiment the invention is directed to removal of cationic impurities from glycomes containing neutral and/or negatively charged glycans. The invention is further directed to use both anion and cation exchange for removal of charged impurities from non-charged glycomes. The preferred ion exchange and cation exchange materials includes polystyrene resins such as Dowex resins.
The hydrophilic chromatography is preferably aimed for removal of hydrophobic materials such as lipids detergents and hydrophobic protein materials. The preferred hydrophobic chromatography materials includes.
It is realized that different combinations of the prepurification are useful depending on the cell preparation and sample type. Preferred combinations of the prepurification steps include: Precipitation-extraction and Ion-exchange; Precipitation-extraction and Hydrophobic interaction; and Ion-exchange and Hydrophobic interaction. The two prepurification steps are preferably performed in the given order.
The purification steps utilize two major concepts for binding to carbohydrates and combinations thereof: a) Hydrophilic interactions and b) Ion exchange
The present invention is specifically directed to use of matrices with repeating polar groups with affinity for carbohydrates for purification of glycome materials according to the invention in processes according to the invention. The hydrophilic interaction material may include additional ion exchange properties.
The preferred hydrophilic interaction materials includes carbohydrate materials such as carbohydrate polymers in presence of non-polar organic solvents. A especially preferred hydrophilic interaction chromatography matrix is cellulose.
A specific hydrophilic interaction material includes graphitized carbon. The graphitized carbon separates non-charged carbohydrate materials based mainly on the size on the glycan. There is also possible ion exchange effects. In a preferred embodiment the invention is directed to graphitized carbon chromatography of prepurified samples after desalting and removal of detergents.
The invention is specifically directed to purification of non-derivatized glycomes and neutral glycomes by cellulose chromatography. The invention is further directed to purification of non-derivatized glycomes and neutral glycomes by graphitized carbon chromatography. In a preferred embodiment the purification according to the invention includes both cellulose and graphitized carbon chromatography.
The glycome may comprise only negative charges or in more rare case also only positive charges. At purification stage the ion exchange material is selected to contain opposite charge than the glycome or glycome fraction for binding the glycome. The invention is especially directed to the use of anion exchange materials for binding of negatively charged Preferred ion exchange materials includes ion exchange and especially anion exchange materials includes polystyrene resins such as Dowex-resins, preferably quaternary amine resins anion exchange or sulfonic acid cation exchange resins
It was further revealed that even graphitized carbon can be used for binding of negatively charged glycomes and the materials can be eluted from the carbon separately from the neutral glycomes or glycome fractions according to the invention.
The invention is specifically directed to purification of anionic glycomes by anion exchange chromatography.
The invention is specifically directed to purification of anionic glycomes by anion exchange chromatography.
The invention is further directed to purification of anionic glycomes by cellulose chromatography. The preferred anionic glycomes comprise sialic acid and/or sulfo/fosfo esters, more preferably both sialic acid and sulfo/fosfo esters. A preferred class of sulfo/fosfo ester glycomes are complex type N-glycans comprising sulfate esters.
1) Precipitation-extraction may include precipitation of glycans or precipitation of contaminants away from the glycans. Preferred precipitation methods include:
2) Ion-exchange may include ion-exchange purification or enrichment of glycans or removal of contaminants away from the glycans. Preferred ion-exchange methods include:
3) Hydrophilic interaction may include purification or enrichment of glycans due to their hydrophilicity or specific adsorption to hydrophilic materials, or removal of contaminants such as salts away from the glycans. Preferred hydrophilic interaction methods include:
3. Affinity to carbon may include purification or enrichment of glycans due to their affinity or specific adsorption to specific carbon materials preferably graphitized carbon, or removal of contaminants away from the glycans. Preferred graphitized carbon affinity methods includes porous graphitized carbon chromatography.
Preferred purification methods according to the invention include combinations of one or more prepurification and/or purification steps. The preferred method include preferably at least two and more preferably at least three prepurification steps according to the invention. The preferred method include preferably at least one and more preferably at least two purification steps according to the invention. It is further realized that one prepurification step may be performed after a purification step or one purification step may be performed after a prepurification step. The method is preferably adjusted based on the amount of sample and impurities present in samples. Examples of the preferred combinations include the following combinations:
A. 1. precipitation and/or extraction 2. cation exchange of contaminants, 3. hydrophobic adsorption of contaminants, and 4. hydrophilic purification, preferably by carbon, preferably graphitized carbon, and/or carbohydrate affinity purification of glycans.
B. 1. precipitation and/or extraction, 2. hydrophobic adsorption of contaminants 3. cation exchange of contaminants, 4. hydrophilic purification by carbon, preferably graphitized carbon, and/or carbohydrate affinity purification of glycans
The preferred method variants further includes preferred variants when
Further preferred method variants include the preceding methods A. or B. when:
2) For sialylated/acidic underivatized glycan purification: The same methods are preferred but preferably both carbon and carbohydrate chromatography is performed in step 4. The carbohydrate affinity chromatography is especially preferred for acidic and/sialylated glycans. In a preferred embodiment for additional purification one or two last chromatography methods are repeated.
In a further preferred method for sialylated underivatized glycan purification, glycan sample containing sialylated glycans is derivatized at the sialic acid carboxylic groups to produce neutral sialylated glycans as described in the present invention. Thereafter the neutral glycan sample containing neutral sialylated glycans is efficiently purified like neutral glycans described above.
The present invention is specifically directed to detection various component in glycomes by specific methods for recognition of such components. The methods includes binding of the glycome components by specific binding agents according to the invention such as antibodies and/or enzymes, these methods preferably include labeling or immobilization of the glycomes. For effective analysis of the glycome a large panel of the binding agents are needed.
The invention is specifically directed to physicochemical profiling methods for exact analysis of glycomes. The preferred methods includes mass spectrometry and NMR-spectroscopy, which give simultaneously information of numerous components of the glycomes. In a preferred embodiment the mass spectrometry and NMR-spectroscopy methods are used in a combination.
The invention revealed methods to create reproducible and quantitative profiles of glycomes over large mass ranges and degrees of polymerization of glycans. The invention further reveals novel methods for quantitative analysis of the glycomics data produced by mass spectrometry. The invention is specifically directed to the analysis of non-derivatized or reducing end derivatized glycomes according to the invention and the glycomes containing specific structural characteristics according to the invention.
The invention revealed effective means of comparison of glycome profiles from different cell types or cells with difference in cell status or cell types. The invention is especially directed to the quantitative comparison of relative amount of individual glycan signal or groups of glycan signals described by the invention.
The invention is especially directed to
i) calculating average value and variance values of signal or signals, which have obtained from several experiments/samples and which correspond to an individual glycan or glycan group according to the invention for a first cell sample and for a second cell sample
ii) comparing these with values derived for the corresponding signal(s)
iii) optionally calculating statistic value for testing the probability of similarity of difference of the values obtained for the cell types or estimating the similarity or difference based on the difference of the average and optionally also based on the variance values.
iv) preferably repeating the comparison one or more signals or signal groups, and further preferably performing combined statistical analysis to estimate the similarity and/or differences between the data set or estimating the difference or similarity
v) preferably use of the data for estimating the differences between the first and second cell samples indicating difference in cell status and/or cell type
The invention is further directed to combining information of several quantitative comparisons of between corresponding signals by method of
i) calculating differences between quantitative values of corresponding most different glycan signals or glycan group signals, changing negative values to corresponding positive values, optionally multiplying selected signals by selected factors to adjust the weight of the signals in the calculation
ii) adding the positive difference values to a sum value
iii) comparing the sum values as indicators of cell status or type.
It was further revealed that there is charatrics signals that are present in certain cell types according to the invention but absent or practically absent in other cell types. The invention is therefore directed to the qualitative comparison of relative amount of individual glycan signal or groups of glycan signals described by the invention and observing signals present or absent/practically absent in a cell type. The invention is further directed to selection of a cut off value used for selecting absent or practically absent signals from a mass spectrometric data, for example the preferred cut off value may be selected in range of 0-3% of relative amount, preferably the cut off value is less than 2%, or less than 1% or less than 0.5%. In a preferred embodiment the cut off value is adjusted or defined based on quality of the mass spectrum obtained, preferably based on the signal intensities and/or based on the number of signals observable.
The invention is further directed to automized qualitative and/or quantitative comparisons of data from corresponding signals from different samples by computer and computer programs processing glycome data produced according to the invention. The invention is further directed to raw data based analysis and neural network based learning system analysis as methods for revealing differences between the glycome data according to the invention.
The present invention is specifically directed to analyzing glycan datasets and glycan profiles for comparison and characterization of different tissue materials. In one embodiment of the invention, glycan signals or signal groups associated with given tissue material are selected from the whole glycan datasets or profiles and indifferent glycan signals are removed. The resulting selected signal groups have reduced background and less observation points, but the glycan signals most important to the resolving power are included in the selection. Such selected signal groups and their patterns in different sample types serve as a signature for the identification of the cell type and/or glycan types or biosynthetic groups that are typical to it. By evaluating multiple samples from the same tissue material, glycan signals that have individual i.e. cell line specific variation can be excluded from the selection. Moreover, glycan signals can be identified that do not differ between tissue materials, including major glycans that can be considered as housekeeping glycans.
To systematically analyze the data and to find the major glycan signals associated with given tissue material according to the invention, difference-indicating variables can be calculated for the comparison of glycan signals in the glycan datasets. Preferential variables between two samples include variables for absolute and relative difference of given glycan signal between the datasets from two tissue materials. Most preferential variables according to the invention are:
1. absolute difference A=(S2−S1), and
2. relative difference R=A/S1,
wherein S1 and S2 are relative abundances of a given glycan signal in cell types 1 and 2, respectively.
It is realized that other mathematical solutions exist to express the idea of absolute and relative difference between glycan datasets, and the above equations do not limit the scope of the present invention. According to the present invention, after A and R are calculated for the glycan profile datasets of the two tissue materials, the glycan signals are thereafter sorted according to the values of A and R to identify the most significant differing glycan signals. High value of A or R indicates association with tissue material 2, and vice versa. In the list of glycan data sorted independently by R and A, the tissue material specific glycans occur at the top and the bottom of the lists. More preferentially, if a given signal has high values of both A and R, it is more significant.
Preferred Representation of the Dataset when Comparing Two Tissue Materials
The present invention is specifically directed to the comparative presentation of the quantitative glycome dataset as multidimensional graphs comparing the paraller data or as other three dimensional presentations or for example as two dimensional matrix showing the quantities with a quantitative code, preferably by a quantitative color code.
The present invention is specifically directed to methods for analysis of low amounts of samples.
The invention further revealed that it is possible to use the methods according to the invention for analysis of low sample amounts. It is realized that the cell materials are scarce and difficult to obtain from natural sources. The effective analysis methods would spare important cell materials. Under certain circumstances such as in context of cell culture the materials may be available from large scale. The required sample scale depends on the relative abundancy of the characteristic components of glycome in comparison to total amount of carbohydrates. It is further realized that the amount of glycans to be measured depend on the size and glycan content of the cell type to be measured and analysis including multiple enzymatic digestions of the samples would likely require more material. The present invention revealed especially effective methods for free released glycans.
The picoscale samples comprise preferably at least about 1000 cells, more preferably at least about 50 000 cells, even more preferably at least 100 000 cells, and most preferably at least about 500 000 cells. The invention is further directed to analysis of about 1 000 000 cells. The preferred picoscale samples contain from at least about 1000 cells to about 10 000 000 cells according to the invention. The useful range of amounts of cells is between 50 000 and 5 000 000, even more preferred range of cells is between 100 000 and 3 000 000 cells. A preferred practical range for free oligosaccharide glycomes is between about 500 000 and about 2 000 000 cells. It is realized that cell counting may have variation of less than 20%, more preferably 10% and most preferably 5%, depending on cell counting methods and cell sample, these variations may be used instead of term about. It is further understood that the methods according to the present invention can be upscaled to much larger amounts of material and the pico/nanoscale analysis is a specific application of the technology.
The invention is specifically directed to use of microcolumn technologies according to the invention for the analysis of the preferred picoscale and low amount samples according to the invention,
The invention is specifically directed to purification to level, which would allow production of high quality mass spectrum covering the broad size range of glycans of glycome compositions according to the invention.
The present invention is especially directed to use microfluidistic methods involving low sample volumes in handling of the glycomes in low volume cell preparation, low volume glycan release and various chromatographic steps. The invention is further directed to integrated cell preparation, glycan release, and purification and analysis steps to reduce loss of material and material based contaminations. It is further realized that special cleaning of materials is required for optimal results.
The invention is directed to reactions of volume of 1-100 microliters, preferably about 2-50 microliters and even more preferably 3-20 microliters, most preferably 4-10 microliter. The most preferred reaction volumes includes 5-8 microliters+/−1 microliters. The minimum volumes are preferred to get optimally concentrated sample for purification. The amount of material depend on number of experiment in analysis and larger amounts may be produced preferably when multiple structural analysis experiments are needed.
It is realized that numerous low volume chromatographic technologies may be applied, such low volume column and for example disc based microfluidistic systems.
The inventors found that the most effective methods are microcolumns. Small column can be produced with desired volume. Preferred volumes of microcolumns are from about 2 Microliters to about 500 microliters, more preferably for routine sample sizes from about 5 microliter to about 100 microliters depending on the matrix and size of the sample. Preferred microcolumn volumes for graphitised carbon, cellulose chromatography and other tip-columns are from 2 to 20 μl, more preferably from 3 to 15 μl, even more preferably from 4 to 10 μl, For the microcolumn technologies in general the samples are from about 10 000 to about million cells. The methods are useful for production of picomol amounts of total glycome mixtures from cells according to the invention.
In a preferred embodiment microcolumns are produced in regular disposable usually plastic pipette tips used for example in regular “Finnpipette”-type air-piston pipettes. The pipette-tip microcolumn contain the preferred chromatographic matrix. In a preferred embodiment the microcolumn contains two chromatographic matrixes such as an anion and cation exchange matrix or a hydrophilic and hydrophobic chromatography matrix.
The pipette tips may be chosen to be a commercial tip contain a filter. In a preferred embodiment the microcolumn is produced by narrowing a thin tip from lower half so that the preferred matrix is retained in the tip. The narrowed tip is useful as the volume of filter can be omitted from washing steps
The invention is especially directed to plastic pipette tips containing the cellulose matrix, and in an other embodiment to the pipette tip microcolumns when the matrix is graphitised carbon matrix. The invention is further directed to the preferred tip columns when the columns are narrowed tips, more preferably with column volumes of 1 microliter to 100 microliters.
The invention is further directed to the use of the tip columns containing any of the preferred chromatographic matrixes according to the invention for the purification of glycomes according to the invention, more preferably matrixes for ion exchange, especially polystyrene anion exchangers and cation exchangers according to the invention; hydrophilic chromatographic matrixes according to the invention, especially carbohydrate matrixes and most cellulose matrixes.
The present invention is especially directed to method for glycan purification. The purification consists of the steps of 1) contamination removal including hydrophobic affinity absorption of contaminants and 2) glycan isolation glycans including hydrophilic affinity chromatography.
Preferably step 1) includes also cation-exchange step. More preferably, cation-exchange and hydrophobic chromatography media are used sequentially, and even more preferably they are packed together in a single column.
Preferably the hydrophilic affinity step in step 2) is carbon affinity, more preferably graphitized carbon affinity.
The inventors realized that improved purification of small sample amounts and/or purification of glycans from complex biological matrices are especially improved using both miniaturized liquid and miniaturized chromatography media volumes as well as connecting the purification steps directly into series. This is further exemplified in the Examples of the invention, including Examples 20 and 21.
The present invention is especially directed to analysis of small glycan amounts corresponding to glycans of 1000 cells-1 million cells, with steps 1) and 2) preferentially consisting of a total of 0.1 μl-1 ml bed volume chromatography media each; total liquid volume in sample loading into step 1) of 0.2 μl-0.5 ml; and total liquid volume in sample eluting from step 2) of 0.2 μl-2.5 ml.
In a more preferred embodiment, the present invention is directed to analysis of small glycan amounts corresponding to glycans of 1000-200 000 cells, with steps 1) and 2) preferentially consisting of a total of 0.1-5 μl bed volume chromatography media each; total liquid volume in sample loading into step 1) of 0.2-5 μl; and total liquid volume in sample eluting from step 2) of 0.2 μl-20 μl.
In a further preferred embodiment, the present invention is directed to on-line analytical method of small glycan amounts corresponding to glycans of 1000-10 000 cells, with steps 1) and 2) preferentially consisting of a total of 0.1-0.5 μl bed volume chromatography media each; total liquid volume in sample loading into step 1) of 0.2-1 μl; and total liquid volume in sample eluting from step 2) of 0.2 μl-1 μl; with the sample directly eluted to the analytical method.
Preferably, the analysis method is mass spectrometry, more preferably MALDI-TOF mass spectrometry.
In a further embodiment of the invention, the steps 1) and 2) are connected by eluting the sample directly from step 1) into step 2).
In a further embodiment of the invention, acidic glycans are further purified after steps 1) and 2) by another hydrophilic chromatography step as described in the present invention, preferably using cellulose adsorption.
The inventors found that analysis sensitivity of glycan profiles and signal detection can be improved by data analysis, preferentially using quantitative analysis of glycan signal profile datasets. In another embodiment of the present invention, glycan analysis is combined with quantitative data analysis according to the present invention, preferably with correction and normalization of data according to the invention.
Glycan Purification Device and/or Apparatus
The inventors were able to demonstrate efficient purification method for glycans, essentially forming a device for glycan purification from samples with varying volumes and matrices. Further, the inventors were able to standardize such purification method to essentially form a programmed method of using such device. The present invention is specifically directed to a device for glycan purification, the device consisting of:
a) contamination removing cartridge
b) glycan isolation cartridge
c) sample inlet going through a) and b)
d) washing and elution inlet going through b)
e) outlet leading from b) to either waste, sample collection, or analysis;
and optionally the device further consists of one or more of the following:
f) switch for changing inlet between c) and d)
g) switch for changing outlet between waste, sample collection, or analysis
h) device for generating liquid flow to operate the abovementioned device
i) switch for changing inlet between sample, washing, and elution liquids.
Optionally, a) and b) are operated independently and c) only transiently connects a) and b).
Optionally, a) is omitted; then it follows that c) is a sample, washing and elution inlet going only through b); and both d) and f) are also omitted.
The switch is preferably a flowpath regulator, more preferably a valve.
The cartridge is preferably a column, vessel, or container, containing separation media according to the present invention.
Optionally, the glycan purification device is partly or fully automated and operated by a programmed liquid handling device and/or a computer.
The device is operated by liquid flow through the device, optionally using h), changing the composition of the liquid flowing through the device, optionally using i), and changing the inlet and outlet destinations, optionally using f) and/or g), respectively. The operation is done in the following order:
1) liquid containing glycan sample goes to c) and outlet e) goes to waste
2) washing liquid goes to d) and outlet e) goes to waste
3) elution liquid goes to d) and outlet e) goes to either sample collection or analysis.
Optionally, when a) is omitted then the operation is done in the following order:
1) liquid containing glycan sample goes to c) and outlet e) goes to waste
2) washing liquid goes to c) and outlet e) goes to waste
3) elution liquid goes to c) and outlet e) goes to either sample collection or analysis.
Preferentially, a) incorporates one or more chromatography media useful for contamination removal according to the present invention, and b) incorporates one or more chromatography media useful for glycan absorption or adsorption according to the present invention; they are used according to the glycan purification methods described in the present invention. More preferentially, media in a) are selected from cation exchange resin and/or hydrophobic affinity resin, and media in b) are selected from hydrophilic affinity chromatography media according to the invention, more preferentially from graphitized carbon, hydrophilic affinity resin, and/or cellulose. In a further preferred embodiment of the present invention, a) contains cation exchange and hydrophobic affinity resin and b) contains graphitized carbon. Preferentially the liquid, inlet, outlet, sample, washing, elution, chromatography media, cartridge, and sample collection volumes are minimized as described in the present invention, more preferentially using microfluidistics according to the present invention. More preferably, this minimization is done relative to the amount of cell or biological material analyzed as described in the present invention.
Examples of configurations of glycan purification devices described here are presented in
The Binding Methods for Recognition of Structures from Cell Surfaces
Recognition of Structures from Glycome Materials and on Cell Surfaces by Binding Methods
The present invention revealed that beside the physicochemical analysis by NMR and/or mass spectrometry several methods are useful for the analysis of the structures. The invention is especially directed to two methods:
The peptides and proteins are preferably recombinant proteins or corresponding carbohydrate recognition domains derived thereof, when the proteins are selected from the group monoclonal antibody, glycosidase, glycosyl transferring enzyme, plant lectin, animal lectin or a peptide mimetic thereof, and wherein the binder includes a detectable label structure.
The present invention revealed various types of binder molecules useful for characterization of cells according to the invention and more specifically the preferred cell groups and cell types according to the invention. The preferred binder molecules are classified based on the binding specificity with regard to specific structures or structural features on carbohydrates of cell surface. The preferred binders recognize specifically more than single monosaccharide residue.
It is realized that most of the current binder molecules such as all or most of the plant lectins are not optimal in their specificity and usually recognize roughly one or several monosaccharides with various linkages. Furthermore the specificities of the lectins are usually not well characterized with several glycans of human types.
The preferred high specificity binders recognize
The preferred binders includes natural human and or animal, or other proteins developed for specific recognition of glycans. The preferred high specificity binder proteins are specific antibodies preferably monoclonal antibodies; lectins, preferably mammalian or animal lectins; or specific glycosyltransferring enzymes more preferably glycosidase type enzymes, glycosyltransferases or transglycosylating enzymes.
Combination of Terminal Structures in Combination with Specific Glycan Core Structures
It is realized that part of the structural elements are specifically associated with specific glycan core structure. The recognition of terminal structures linked to specific core structures are especially preferred, such high specificity reagents have capacity of recognition almost complete individual glycans to the level of physicochemical characterization according to the invention. For example many specific mannose structures according to the invention are in general quite characteristic for N-glycan glycomes according to the invention. The present invention is especially directed to recognition terminal epitopes.
The present invention revealed that there are certain common structural features on several glycan types and that it is possible to recognize certain common epitopes on different glycan structures by specific reagents when specificity of the reagent is limited to the terminal without specificity for the core structure. The invention especially revealed characteristic terminal features for specific cell types according to the invention. The invention realized that the common epitopes increase the effect of the recognition. The common terminal structures are especially useful for recognition in the context with possible other cell types or material, which do not contain the common terminal structure in substantial amount.
The present invention is directed to recognition of oligosaccharide sequences comprising specific terminal monosaccharide types, optionally further including a specific core structure. The preferred oligosaccharide sequences classified based on the terminal monosaccharide structures.
1. Structures with Terminal Mannose Monosaccharide
Preferred mannose-type target structures have been specifically classified by the invention. These include various types of high and low-mannose structures and hybrid type structures according to the invention.
preferred for recognition of terminal mannose structures includes mannose-monosaccharide binding plant lectins.
include
i) Specific mannose residue releasing enzymes such as linkage specific mannosidases, more preferably an α-mannosidase or β-mannosidase.
Preferred α-mannosidases includes linkage specific α-mannosidases such as α-Mannosidases cleaving preferably non-reducing end terminal
α2-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα2-structures; or
α6-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα6-structures;
Preferred β-mannosidases includes β-mannosidases capable of cleaving β-4-linked mannose from non-reducing end terminal of N-glycan core Manβ4GlcNAc-structure without cleaving other α-linked monosaccharides in the glycomes.
ii) Specific binding proteins recognizing preferred mannose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins. The invention is directed to antibodies recognizing MS2B1 and more preferably MS3B2-structures
2. Structures with Terminal Gal-Monosaccharide
Preferred galactose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Preferred for recognition of terminal galactose structures includes plant lectins such as ricin lectin (ricinus communis agglutinin RCA), and peanut lectin(/agglutinin PNA).
i) Specific galactose residue releasing enzymes such as linkage specific galactosidases, more preferably α-galactosidase or β-galactosidase.
Preferred α-galactosidases include linkage galactosidases capable of cleaving Galα3Gal-structures revealed from specific cell preparations
Preferred β-galactosidases includes β-galactosidases capable of cleaving
β-4-linked galactose from non-reducing end terminal Galβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes and
β3-linked galactose from non-reducing end terminal Galβ3GlcNAc-structure without cleaving other α-linked monosaccharides in the glycomes
ii) Specific binding proteins recognizing preferred galactose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as galectins.
3. Structures with Terminal GalNAc-Monosaccharide
Preferred GalNAc-type target structures have been specifically revealed by the invention. These include especially LacdiNAc, GalNAcβGlcNAc-type structures according to the invention.
Several plant lectins has been reported for recognition of terminal GalNAc. It is realized that some GalNAc-recognizing lectins may be selected for low specificity recognition of the preferred LacdiNAc-structures.
i) The invention revealed that β-linked GalNAc can be recognized by specific β-N-acetylhexosaminidase enzyme in combination with β-N-acetylhexosaminidase enzyme. This combination indicates the terminal monosaccharide and at least part of the linkage structure.
Preferred β-N-acetylehexosaminidase, includes enzyme capable of cleaving β-linked GalNAc from non-reducing end terminal GalNAcβ4/3-structures without cleaving α-linked HexNAc in the glycomes; preferred N-acetylglucosaminidase include enzyme capable of cleaving β-linked GlcNAc but not GalNAc.
ii) Specific binding proteins recognizing preferred GalNAcβ4, more preferably GalNAcβ4GlcNAc, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins, and a special plant lectin WFA (Wisteria floribunda agglutinin).
4. Structures with Terminal GlcNAc-Monosaccharide
Preferred GlcNAc-type target structures have been specifically revealed by the invention. These include especially GlcNAcβ-type structures according to the invention.
Several plant lectins has been reported for recognition of terminal GlcNAc. It is realized that some GlcNAc-recognizing lectins may be selected for low specificity recognition of the preferred GlcNAc-structures.
Preferred β-N-acetylglucosaminidase includes enzyme capable of cleaving β-linked GlcNAc from non-reducing end terminal GlcNAcβ2/3/6-structures without cleaving β-linked GalNAc or α-linked HexNAc in the glycomes;
ii) Specific binding proteins recognizing preferred GlcNAcβ2/3/6, more preferably GlcNAcβ2Manα, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins.
5. Structures with Terminal Fucose-Monosaccharide
Preferred fucose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Preferred for recognition of terminal fucose structures includes fucose monosaccharide binding plant lectins. Lectins of Ulex europeaus and Lotus tetragonolobus has been reported to recognize for example terminal Fucoses with some specificity binding for α2-linked structures, and branching α3-fucose, respectively.
i) Specific fucose residue releasing enzymes such as linkage fucosidases, more preferably α-fucosidase.
Preferred α-fucosidases include linkage fucosidases capable of cleaving Fucα2Gal-, and Galβ4/3(Fucα3/4)GlcNAc-structures revealed from specific cell preparations.
ii) Specific binding proteins recognizing preferred fucose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as Lewis x, Galβ4(Fucα3)GlcNAc, and sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc.
The preferred antibodies includes antibodies recognizing specifically Lewis type structures such as Lewis x, and sialyl-Lewis x. More preferably the Lewis x-antibody is not classic SSEA-1 antibody, but the antibody recognizes specific protein linked Lewis x structures such as Galβ4(Fucα3)GlcNAcβ2Manα-linked to N-glycan core.
6. Structures with Terminal Sialic Acid-Monosaccharide
Preferred sialic acid-type target structures have been specifically classified by the invention.
Preferred for recognition of terminal sialic acid structures includes sialic acid monosaccharide binding plant lectins.
i) Specific sialic acid residue releasing enzymes such as linkage sialidases, more preferably α-sialidases.
Preferred α-sialidases include linkage sialidases capable of cleaving SAα3Gal- and SAα6Gal-structures revealed from specific cell preparations by the invention.
Preferred lectins, with linkage specificity include the lectins, that are specific for SAα3Gal-structures, preferably being Maackia amurensis lectin and/or lectins specific for SAα6Gal-structures, preferably being Sambucus nigra agglutinin.
ii) Specific binding proteins recognizing preferred sialic acid oligosaccharide sequence structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc or sialic acid recognizing Siglec-proteins.
The preferred antibodies includes antibodies recognizing specifically sialyl-N-acetyllactosamines, and sialyl-Lewis x.
Preferred antibodies for NeuGc-structures includes antibodies recognizes a structure NeuGcα3Galβ4Glc(NAc)0 or 1 and/or GalNAcβ4-[NeuGcα3]Galβ4Glc(NAc)0 or 1, wherein [ ] indicates branch in the structure and ( )0 or 1 a structure being either present or absent. In a preferred embodiment the invention is directed recognition of the N-glycolyl-Neuraminic acid structures by antibody, preferably by a monoclonal antibody or human/humanized monoclonal antibody. A preferred antibody contains the variable domains of P3-antibody.
The present invention is specifically directed to the binding of the structures according to the present invention, when the binder is conjugated with “a label structure”. The label structure means a molecule observable in a assay such as for example a fluorescent molecule, a radioactive molecule, a detectable enzyme such as horse radish peroxidase or biotin/streptavidin/avidin. When the labelled binding molecule is contacted with the cells according to the invention, the cells can be monitored, observed and/or sorted based on the presence of the label on the cell surface. Monitoring and observation may occur by regular methods for observing labels such as fluorescence measuring devices, microscopes, scintillation counters and other devices for measuring radioactivity.
The invention is specifically directed to use of the binders and their labelled conjugates for sorting or selecting cells from biological materials or samples including cell materials comprising other cell types. The preferred cell types includes cultivated cells and associated cells such as feeder cells. The labels can be used for sorting cell types according to invention from other similar cells. In another embodiment the cells are sorted from different cell types such as blood cells or in context of cultured cells preferably feeder cells, for example in context of complex cell cultures corresponding feeder cells such as human or mouse feeder cells. A preferred cell sorting method is FACS sorting. Another sorting methods utilized immobilized binder structures and removal of unbound cells for separation of bound and unbound cells.
In a preferred embodiment the binder structure is conjugated to a solid phase. The cells are contacted with the solid phase, and part of the material is bound to surface. This method may be used to separation of cells and analysis of cell surface structures, or study cell biological changes of cells due to immobilization. In the analytics involving method the cells are preferably tagged with or labelled with a reagent for the detection of the cells bound to the solid phase through a binder structure on the solid phase. The methods preferably further include one or more steps of washing to remove unbound cells.
Preferred solid phases include cell suitable plastic materials used in contacting cells such as cell cultivation bottles, petri dishes and microtiter wells; fermentor surface materials
The invention is further directed to methods of recognizing different tissue materials, preferably human tissues and more preferably human excretions or serum. It is further realized, that the present reagents can be used for purification of tissue materials by any fractionation method using the specific binding reagents.
Preferred fractionation methods includes fluorescence activated cell sorting (FACS), affinity chromatography methods, and bead methods such as magnetic bead methods.
The invention is further directed to positive selection methods including specific binding to the tissue material but not to contaminating tissue materials. The invention is further directed to target selection methods including specific binding to the contaminating tissue material but not to the target tissue materials. In yet another embodiment of recognition of tissue materials the tissue material is recognized together with a homogenous reference sample, preferably when separation of other materials is needed. It is realized that a reagent for positive selection can be selected so that it binds tissue materials as in the present invention and not to the contaminating tissue materials and a reagent for negative selection by selecting opposite specificity. In case of tissue material type according to the invention is to be selected amongst novel tissue materials not studied in the present invention, the binding molecules according to the invention maybe used when verified to have suitable specificity with regard to the novel tissue material (binding or not binding). The invention is specifically directed to analysis of such binding specificity for development of a new binding or selection method according to the invention.
The preferred specificities according to the invention include recognition of:
The present invention is directed to specific cell populations comprising in vitro enzymatically altered glycosylations according to the present invention. It is realized that special structures revealed on cell surfaces have specific targeting, and immune recognition properties with regard to cells carrying the structures. It is realized that sialylated and fucosylated terminal structures such as sialyl-lewis x structures target cells to selectins involved in bone marrow homing of cells and invention is directed to methods to produce such structures on cells surfaces. It is further realized that mannose and galactose terminal structures revealed by the invention target cells to liver and/or to immune recognition, which in most cases are harmful for effective cell therapy, unless liver is not targeted by the cells. NeuGc is target for immune recognition and has harmful effects for survival of cells expressing the glycans.
The invention revealed glycosidase methods for removal of the structures from cell surface while keeping the cells intact. The invention is especially directed to sialyltransferase methods for modification of terminal galactoses. The invention further revealed novel method to remove mannose residues from intact cells by alpha-mannosidase.
The invention is further directed to metabolic regulation of glycosylation to alter the glycosylation for reduction of potentially harmful structures.
The present invention is directed to specific cell populations comprising in vitro enzymatically altered sialylation according to the present invention. The preferred cell population includes cells with decreased amount of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in a preferred embodiment decreased amounts of α3-linked sialic acids. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention.
Cell Populations with Altered Sialylated Structures
The invention is further directed to novel cell populations produced from the preferred cell populations according to the invention when the cell population comprises altered sialylation as described by the invention. The invention is specifically directed to cell populations comprising decreased sialylation as described by the invention. The invention is specifically directed to cell populations comprising increased sialylation of specific glycan structures as described by the invention. Furthermore invention is specifically directed to cell populations of specifically altered α3- and or α6-sialylation as described by the invention These cells are useful for studies of biological functions of the cell populations and role of sialylated, linkage specifically sialylated and non-sialylated structures in the biological activity of the cells.
Preferred Cell Populations with Decreased Sialylation
The preferred cell population includes cells with decreased amount of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in a preferred embodiment decreased amounts of α3-linked sialic or α6-linked sialic acid. In a preferred embodiment the cell populations comprise practically only α3-sialic acid, and in another embodiment only α6-linked sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably cultivated human or animal cell populations according to the invention.
Preferred Cell Populations with Increased Sialylation
The preferred cell population includes cells with increased amount of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in preferred embodiments increased amounts of α3-linked sialic or α6-linked sialic acid. In a preferred embodiment the cell populations comprise practically only α3-sialic acid, and in another embodiment only α6-linked sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably cultivated cells or tissue derived cell populations according to the invention.
Preferred Cell Populations with Altered Sialylation
The preferred cell population includes cells with altered linkage structures of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in a preferred embodiments altered amount of α3-linked sialic and/or α6-linked sialic acid. The invention is specifically directed to cell populations having a sialylation level similar to the original cells but the linkages of structures are altered to α3-linkages and in another embodiment the linkages of structures are altered to α6-structures. In a preferred embodiment the cell populations comprise practically only α3-sialic acid, and in another embodiment only α6-linked sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably cultivated cells or tissue derived cell populations according to the invention.
Cell Populations Comprising Preferred Cell Populations with Preferred Sialic Acid Types
The preferred cell population includes cells with altered types of sialic acids on the cell surfaces, preferably on the preferred structures according to the present invention. The altered cell population contains in a preferred embodiment altered amounts of NeuAc and/or NeuGc sialic acid. The invention is specifically directed to cell populations having sialylation levels similar to original cells but the sialic acid structures altered to NeuAc and in another embodiment the sialic acid type structures altered to NeuGc. In a preferred embodiment the cell populations comprise practically only NeuAc, and in another embodiment only NeuGc sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably cultivated or tissue derived cell populations according to the invention.
The present invention is further directed to degradative removal of specific harmful glycan structures from cell, preferably from desired cell populations according to the invention.
The removal of the glycans or parts thereof occur preferably by enzymes such as glycosidase enzymes.
In some cases the removal of carbohydrate structure may reveal another harmful structure. In another preferred embodiment the present invention is directed to replacement of the removed structure by less harmful or better tolerated structure more optimal for the desired use.
Preferred Special Target Cell Type Effective and specific desialylation methods for the specific cell populations were developed.
The invention is specifically directed to desialylation methods for modification of human tissue and cell culture cells. The present invention is further directed to desialylation modifications of any human cell or tissue cell subpopulation according to the invention. Sialylation modifications of cultivated cells have not been studied previously in detail.
The present invention is specifically directed to methods for desialylation of the preferred structures according to the present invention from the surfaces of preferred cells. The present invention is further directed to preferred methods for the quantitative verification of the desialylation by the preferred analysis methods according to the present invention. The present invention is further directed to linkage specific desialylation and analysis of the linkage specific sialylation on the preferred carbohydrate structures using analytical methods according to the present invention.
The invention is preferably directed to linkage specific α3-desialylation of the preferred structures according to the invention without interfering with the other sialylated structures according to the present invention. The invention is further directed to simultaneous desialylation α3- and α6-sialylated structures according to the present invention.
Furthermore the present invention is directed to desialylation when both NeuAc and NeuGc are quantitatively removed from cell surface, preferably from the preferred structures according to the present invention. The present invention is specifically directed to the removal of NeuGc from preferred cell populations, most preferably cultivated or tissue derived populations and from the preferred structures according to the present invention. The invention is further directed to preferred methods according to the present invention for verification of removal of NeuGc, preferably quantitative verification and more preferably verification performed by mass spectrometry.
The inventors revealed that it is possible to produce controlled cell surface glycosylation modifications on the preferred cells according to the invention. The present invention is specifically directed to glycosyltransferase catalysed modifications of N-linked glycans on the surfaces of cells, preferably blood cells, more preferably leukocytes or cultivated cells or more preferably the preferred cell or tissue materials according to the present invention.
The present invention is directed to cell modifications by sialyltransferases and fucosyltransferases. Two most preferred transfer reactions according to the invention are α3-modification reactions such as α3-sialylation and α3-fucosylations. When combined these reactions can be used to produce important cell adhesion structures which are sialylated and fucosylated N-acetyllactosamines such as sialyl-Lewis x (sLex).
Possible α6-sialylation has been implied in bone marrow cells and in peripheral blood CD34+ cells released from bone marrow to circulation by growth factor administration, cultivated or other cell types have not been investigated. Furthermore, the previous study utilized an artificial sialic acid modification method, which may affect the specificity of the sialyltransferase enzyme and, in addition, the actual result of the enzyme reaction is not known as the reaction products were not analysed by the investigators. The reactions are likely to have been very much limited by the specificity of the α6-sialyltransferase used and cannot be considered prior art in respect to the present invention.
The inventors of the present invention further revealed effective modification of the preferred cells according to the present inventions by sialylation, in a preferred embodiment by α3-sialylation.
The prior art data cited above does not indicate the specific modifications according to the present invention to cultivated cells, preferably cultivated or tissue derived cells. The present invention is specifically directed to sialyltransferase reactions towards these cell types. The invention is directed to sialyltransferase catalyzed transfer of a natural sialic acid, preferably NeuAc, NeuGc or Neu-O-Ac, from CMP-sialic acid to target cells. Sialyltransferase catalyzed reaction according to Formula:
CMP-SA+target cell→SA-target cell+CMP,
Wherein SA is a sialic acid, preferably a natural sialic acid,
preferably NeuAc, NeuGc or Neu-O-Ac and
the reaction is catalysed by a sialyltransferase enzyme preferably by an
α3-sialyltransferase
and
the target cell is a cultured cell or tissue derived cell.
Preferably the sialic acid is transferred to at least one N-glycan structure on the cell surface, preferably to form a preferred sialylated structure according to the invention
In the prior art fucosyltransferase reactions towards unspecified cell surface structures has been studied
The prior art indicates that human cord blood cell populations may be α3-fucosylated by human fucosyltransferase VI and such modified cell populations may be directed to bone marrow due to interactions with selectins.
The present invention describes reactions effectively modifying blood cells or cultivated cells in vitro by fucosyltransferases, especially in order to produce sialylated and fucosylated N-acetyllactosamines on cell surfaces, preferably sLex and related structures. The present invention is further directed to the use of the increased sialylated and/or fucosylated structures on the cell surfaces for targeting the cells, in a preferred embodiment for selectin directed targeting of the cells.
The invention is further directed to α3- and/or α4-fucosylation of cultured cells, tissue cells.
In a specific embodiment the present invention is directed to α3-fucosylation of the total mononuclear cell populations from human peripheral blood. Preferably the modification is directed to at least to one protein linked glycan, more preferably to a N-linked glycan. The prior art reactions reported about cord blood did not describe reactions in such cell populations and the effect of possible reaction cannot be known. The invention is further directed to combined increased α3-sialylation and fucosylation, preferably α3-sialylation of human peripheral blood leukocytes. It is realized that the structures on the peripheral blood leukocytes can be used for targeting the peripheral blood leukocytes, preferably to selecting expressing sites such as selectin expressing malignant tissues.
The invention is specifically directed to selection of a cell population from the preferred cell population according to the present invention, when the cell population demonstrate increased amount of α3-sialylation when compared with the baseline cell populations.
The inventors revealed that specific subpopulations of native cells express increased amounts of α3-linked sialic acid Furthermore it was found that cultured cells according to the invention have a high tendency to express α3-sialic acid instead to α6-linked sialic acids. The present invention is preferably directed to cultured cell lines, tissue cells expressing increased amounts of α3-linked sialic acid
The present invention is preferably directed to fucosylation after α3-sialylation of cells, preferably the preferred cells according to the invention. The invention describes for the first time combined reaction by two glycosyltransferases for the production of specific terminal epitopes comprising two different monosaccharide types on cell surfaces.
The present invention is preferably directed to fucosylation after desialylation and α3-sialylation of cells, preferably the preferred cells according to the invention. The invention describes for the first time combined reaction by two glycosyltransferases and a glycosidase for the production of specific terminal epitopes comprised of two different monosaccharide types on cell surfaces.
Effective specific sialylation methods for the specific cell populations were developed. The invention is specifically directed to sialylation methods for modification of human cultivated cells and subpopulations thereof as cell lines and human tissues.
Present invention is specifically directed to methods for sialylation to produce preferred structures according to the present invention from the surfaces of preferred cells. The present invention is specifically directed to production preferred NeuGc- and NeuAc-structures. The invention is directed to production of potentially in vivo harmful structures on cells surfaces, e.g. for control materials with regard to cell labelling. The invention is further directed to production of specific preferred terminal structure types, preferably α3- and α6-sialylated structures, and specifically NeuAc- and NeuGc-structures for studies of biological activities of the cells.
The present invention is further directed to preferred methods for the quantitative verification of the sialylation by the preferred analysis methods according to the present invention. The present invention is further directed to linkage specific sialylation and analysis of the linkage specific sialylation on the preferred carbohydrate structures using analytical methods according to the present invention.
The invention is preferably directed to linkage specific α3-sialylation of the preferred structures according to the invention without interfering with the other sialylated structures according to the present invention. The invention is preferably directed to linkage specific α6-sialylation of the preferred structures according to the invention without interfering with the other sialylated structures according to the present invention.
The invention is further directed to simultaneous sialylation α3- and α6-sialylated structures according to the present invention. The present invention is further directed for the production of preferred relation of α3- and α6-sialylated structures, preferably in single reaction with two sialyl-transferases.
Furthermore the present invention is directed to sialylation when either NeuAc or NeuGc are quantitatively synthesized to the cell surface, preferably on the preferred structures according to the present invention. Furthermore the invention is directed to sialylation when both NeuAc and NeuGc are, preferably quantitatively, transferred to acceptor sites on the cell surface.
The present invention is specifically directed to the removal of NeuGc from preferred cell populations, most preferably cultivated cell populations and from the preferred structures according to the present invention, and resialylation with NeuAc.
The invention is further directed to preferred methods according to the present invention for verification of removal of NeuGc, and resialylation with NeuAc, preferably quantitative verification and more preferably verification performed by mass spectrometry with regard to the preferred structures.
The present invention is further directed to cell modification according to the invention, preferably desialylation or sialylation of the cells according to the invention, when the sialidase reagent is a controlled reagent with regard of presence of carbohydrate material.
Purification of Cells with Regard to Modification Enzyme
The preferred processes according to the invention comprise of the step of removal of the enzymes from the cell preparations, preferably the sialyl modification enzymes according to the invention. Most preferably the enzymes are removed from a cell population aimed for therapeutic use. The enzyme proteins are usually antigenic, especially when these are from non-mammalian origin. If the material is not of human origin its glycosylation likely increases the antigenicity of the material. This is particularity the case when the glycosylation has major differences with human glycosylation, preferred examples of largely different glycosylations includes: procaryotic glycosylation, plant type glycosylation, yeast or fungal glycosylation, mammalian/animal glycosylation with Galα3Galβ4GlcNAc-structures, animal glycosylations with NeuGc structures. The glycosylation of a recombinant enzyme depends on the glycosylation in the production cell line, these produce partially non-physiological glycan structures. The enzymes are preferably removed from any cell populations aimed for culture or storage or therapeutic use. The presence of enzymes which have affinity with regard to cell surface may otherwise alter the cells as detectable by carbohydrate binding reagents or mass spectrometric or other analysis according to the invention and cause adverse immunological responses.
Under separate embodiment the cell population is cultured or stored in the presence of the modification enzyme to maintain the change in the cell surface structure, when the cell surface structures are recovering from storage especially at temperatures closer physiological or culture temperatures of the cells. Preferably the cells are then purified from trace amounts of the modification enzyme before use.
The invention is furthermore directed to methods of removal of the modification reagents from cell preparations, preferably the modification reagents are desialylation or resialylation reagents. It is realized that soluble enzymes can be washed from the modified cell populations. Preferably the cell material to be washed is immobilized on a matrix or centrifuged to remove the enzyme, more preferably immobilized on a magnetic bead matrix.
However, extraneous washing causes at least partial destruction of cells and their decreased viability. Furthermore, the enzymes have affinity with regard to the cell surface. Therefore the invention is specifically directed to methods for affinity removal of the enzymes. The preferred method includes a step of contacting the modified cells with an affinity matrix binding the enzyme after modification of the cells.
Under specific embodiment the invention is directed to methods of tagging the enzyme to be removed from the cell population. The tagging step is performed before contacting the enzyme with the cells. The tagging group is designed to bind preferably covalently to the enzyme surface, without reduction or without major reduction of the enzyme activity. The invention is further directed to the removal of the tagged enzyme by binding the tag to a matrix, which can be separated from the cells. Preferably the matrix comprises at least one matrix material selected from the group: polymers, beads, magnetic beads, or solid phase surface
Under specific embodiment the invention is directed to the use for modification of the cells according to the invention, or in a separate embodiment reagents for processes according to the invention, of a human acceptable enzyme, preferably a glycosidase according to the invention or in preferred embodiment sialidase or sialyltransferase, which is acceptable at least in certain amounts to human beings without causing harmful allergic or immune reactions. It is realized that the human acceptable enzymes may not be needed to be removed from reaction mixtures or less washing steps are needed for desirable level of the removal. The human acceptable enzyme is in preferred embodiment a human glycosyltransferase or glycosidase. The present invention is separately directed to human acceptable enzyme which is a sialyltransferase. The present invention is separately directed to human acceptable enzyme which is a sialidase, the invention is more preferably directed to human sialidase which can remove specific type of sialic acid from cells.
In a preferred embodiment the human acceptable enzyme is purified from human material, preferably from human serum, urine or milk. In another preferred embodiment the enzyme is recombinant enzyme corresponding to natural human enzyme. More preferably the enzyme corresponds to human natural enzyme corresponds to natural cell surface or a secreted from of the enzyme, more preferably serum or urine or human milk form of the enzyme. Even more preferably the present invention is directed to human acceptable enzyme which corresponds to a secreted form of a human sialyltransferase or sialidase, more preferably secreted serum/blood form of the human enzyme. In a preferred embodiment the human acceptable enzyme, more preferably recombinant human acceptable enzyme, is a controlled reagent with regard to potential harmful glycan structures, preferably NeuGc-structures according to the invention. The recombinant proteins may contain harmful glycosylation structures and inventors revealed that these kinds of structures are also present on recombinant glycosyltransferases, even on secreted (truncated) recombinant glycosyltransferases.
mRNA Corresponding to Glycosylation Enzymes
The present invention is further directed to correlation of specific messenger mRNA molecules with the preferred glycan structures according to the present invention. It is realized that glycosylation can be controlled in multiple levels and one of them is transcription. The presence of glycosylated structures may in some case correlate with mRNAs involved in the synthesis of the structures.
The present invention is especially directed to analysis of mRNA-species having correlation with expressed fucosylated glycan structures and “terminal HexNAc” containing structures preferred according to the present invention. The preferred mRNA-species includes mRNA corresponding to fucosyltransferases and N-acetylglucosaminyltransferases.
Observation of Glycan Binding Structures, Lectins, Corresponding mRNA-Markers
The invention further revealed changes in mRNA-expression of glycosylation recognizing lectins such as galectins. The cells were further revealed to contain lactosamine receptors for lectins. The invention is especially directed to analysis of expression levels of human lectins and lactosamine galectins receptors, preferably analysis of galectins and selectins more preferably galectins for analysis of status of cells according to the present invention.
The invention specifically revealed novel NeuGc(N-glycolylneuraminic acid)-binding lectin activity from human cells. The lectin recognizes polyvalent NeuGc but does not bind effectively to polyvalent NeuNAc. The present invention is especially directed to labelling cells according to the invention by carbohydrate conjugates binding cells according to the invention, preferably labelled conjugates of NeuGc. The invention is further directed to sorting and selecting cells, and cell derived materials and purifying proteins from cells, using labelled carbohydrate conjugates, preferably, conjugates of NeuGc.
The present invention is directed to analysis of released glycomes by spectrometric method useful for characterization of the glycomes from tissue specimens or cells. The invention is directed to NMR spectroscopic analysis of the mixtures of released glycans.
The invention is especially directed to methods of producing NMR from specific subglycomes, preferably N-linked glycome, O-linked glycome, glycosaminoglycan glycome and/or glycolipid glycome. The NMR-profiling according to the invention is further directed to the analysis of the novel and rare structure groups revealed from cell glycomes according to the invention. The general information about complex cell glycome material directed NMR-methods are limited.
Preferably the NMR-analysis is performed from an isolated subglycome. The preferred isolated subglycomes include acidic glycomes and neutral glycomes.
NMR-Glycome Analysis of Larger Tissue Specimens or Larger Amounts of Cells
It is realized that numerous methods have been described for purification of oligosaccharide mixtures useful for NMR from various materials, including usually purified individual proteins. It is realized that present methods are useful for NMR-profiling even for larger tissue specimens or higher amounts of cells according to the invention, especially in combination with NMR-profiling according to the invention and/or when directed to the analysis specific and preferred structure groups according to the invention. The preferred purification methods are effective and form an optimised process for purification of glycomes from even larger amounts of cells and tissues than described for nanoscale methods below. The methods are preferred also for any larger amount of cells.
Moreover, when purification methods for larger amounts of carbohydrate materials exists, but very low and complex carbohydrate materials with very complex impurities such as cell-derived materials have been less studied as low amounts, especially when purity useful for NMR-analysis is needed.
The invention is directed to analysis of NMR-samples that can be produced from very low amounts of cells according to the invention. Preferred sample amounts of cells or corresponding amount of tissue material for a one-dimensional proton-NMR profiling are from about 2 million to 100 million cells, more preferably 10-50 million cells. It is further realized that good quality NMR data can be obtained from samples containing at least about 10-20 million cells.
The preferred analysis methods is directed to high resolution NMR observing oligosaccharide/saccharide conjugate mixture from an amount of at least 4 nmol, more preferably at least 1 nmol and the cell amount yielding the preferred amount of saccharide mixture. For nanoscale analysis according to the invention cell material is selected so that it would yield at least about 50 nmol of oligosaccharide mixture, more preferably at least about 5 nmol and most preferably at least about 1 nmol of oligosaccharide mixture. Preferred amounts of major components in glycomes to be observed effectively by the methods according to the invention include yield at least about 10 nmol of oligosaccharide component, more preferably at least about 1 nmol and most preferably at least about 0.2 nmol of oligosaccharide component.
The preferred cell amount for analysis of a subglycome from a cell type is preferably optimised by measuring the amounts of glycans produced from the cell amounts of preferred ranges.
It is realized that depending on the cell and subglycome type the required yield of glycans and the heterogeneity of the materials vary yielding different amounts of major components
For the production of sample for nanoscale NMR, the methods described for preparation of cell samples and release of oligosaccharides for mass spectrometric profiling according to the invention may be applied.
For the purification of sample for nanoscale NMR the methods described for purification mass spectrometry profiling samples according to the invention may be applied.
The preferred purification method for nanoscale NMR-profiling according to the invention include following general purification process steps:
2) Hydrophobic interaction;
3) Affinity to carbon material, especially graphitized carbon.
4) Gel filtration chromatography
The more preferred purification process includes precipitation/extraction aimed for removal of major non-carbohydrate impurities by separating the impurity and the glycome fraction(s) to be purified to different phases. Hydrophobic interaction step aims to purify the glycome components from more hydrophobic impurities as these are bound to hydrophobic chromatography matrix and the glycome components are not retained. Chromatography on graphitized carbon may include purification or enrichment of glycans due to their affinity or specific adsorption to graphitized carbon, or removal of contaminants from the glycans. The glycome components obtained by the aforementioned steps are then subjected to gel filtration chromatography, separating molecules according to their hydrodynamic volume, i.e. size in solution. The gel filtration chromatography step allows detection and quantitation of glycome components by absorption at low wavelengths (205-214 nm).
The most preferred purification process includes precipitation/extraction and hydrophobic interaction steps aimed for removal of major non-carbohydrate impurities and more hydrophobic impurities. Chromatography on graphitized carbon is used for removal of contaminants from the glycans, and to divide the glycome components to fractions of neutral glycome components and acidic glycome components. The neutral and acidic glycome component fractions are then subjected to gel filtration chromatography, which separates molecules according to their size. Preferably, a high-performance liquid chromatography (HPLC) type gel filtration column is used. The neutral glycome component fraction is preferably chromatographed in water and the acidic glycome component fraction is chromatographed in 50-200 mM aqueous ammonium bicarbonate solution. Fractions are collected and evaporated prior to further analyses. The gel filtration chromatography step allows detection and quantitation of glycome components by absorption at low wavelengths (205-214 nm). Quantitation is performed against external standards. The standards are preferably N-acetylglucosamine, N-acetylneuraminic acid, or oligosaccharides containing the same. Fractions showing absorbance are subjected to MALDI-TOF mass spectrometry. Preferably, the neutral glycome components are analyzed in the positive-ion mode and the acidic glycome components in the negative-ion mode in a delayed-extraction MALDI-TOF mass spectrometer.
The fractionation can be used to enrich components of low abundance. It is realized that enrichment would enhance the detection of rare components. The fractionation methods may be used for larger amounts of cell material. In a preferred embodiment the glycome is fractionated based on the molecular weight, charge or binding to carbohydrate binding agents such as lectins and/or other binding agents according to the invention.
These methods have been found useful for specific analysis of specific subglycomes and enrichment more rare components. The present invention is in a preferred embodiment directed to charge based separation of neutral and acidic glycans. This method gives for analysis method, preferably mass spectroscopy material of reduced complexity and it is useful for analysis as neutral molecules in positive mode mass spectrometry and negative mode mass spectrometry for acidic glycans.
It is realized that preferred amounts of enriched glycome oligosaccharide mixtures and major component comprising fractions can be produced from larger glycome preparations.
In a preferred embodiment the invention is directed to size based fractionation methods for effective analysis of preferred classes of glycans in glycomes. The invention is especially directed to analysis of lower abundance components with lower and higher molecular weight than the glycomes according to the invention. The preferred method for size based fractionation is gel filtration. The invention is especially directed to analysis of enriched group glycans of N-linked glycomes preferably including lower molecular weight fraction including low-mannose glycans, and one or several preferred low mannose glycan groups according to the invention.
In a preferred embodiment the NMR-analysis of the glycome is one-dimensional proton-NMR analysis showing structural reporter groups of the major components in the glycome. The invention is further directed to specific two- and multidimensional NMR-experiments of the glycomes when enough sample is available. It is realized that two-dimensional NMR-experiments require about a ten-fold increase in sample amount compared to proton-NMR analyses.
The present invention is further directed to combination of the mass spectrometric and NMR glycome analyses. The preferred method include production of any mass spectrometric profile from any glycome according to the invention from a cell sample according to the invention, optionally characterizing the glycome by other methods like glycosidase digestion, fragmentation mass spectrometry, specific binding agents, and production of NMR-profile from the same sample glycome or glycomes to compare these profiles.
The N-glycan analysis of total profiles of released N-glycans revealed beside the glycans above, which were verified to comprise
1) complex biantennary N-glycans, such as Galβ4GlcNAcβ2Manα3(Galβ4GlcNAcβ2Man α6)Manβ4GlcNAcβ4(Fucα6)0-1GlcNAcβ-,
wherein the reminal N-acetyllactosamines can be elongated from Gal with NeuNAcα3 and/or NeuNAcα6 and
2) terminal mannose containing N-glycans such as High-mannose glycans with formula Hex5-9HexNAc2 and degradation products thereof comprising low number of mannose residues (Low mannose glycans) Hex1-4HexNAc2.
The glycan share common core structure according to the Formula:
[Manα3]n1(Manα6)n2Manβ4GlcNAcβ4(Fucα6)0-1GlcNAcβAsn,
wherein n1 and n2 are integers 0 or 1, independently indicating the presence or absence of the terminal Man-residue, and
wherein the non-reducing end terminal Manα3/Manα6-residues can be elongated to the complex type, especially biantennary structures or to mannose type (high-Man and/or low Man) or to hybrid type structures as described in examples.
It was further analyzed that the N-glycan compositions contained only very minor amounts of glycans with additional HexNAx in comparison to monosaccharide compositions of the complex type glycan above, which could indicate presence of no or very low amounts of the N-glycan core linked GlcNAc-residues described by Stanley P M and Raju T S (JBC—(1998) 273 (23) 14090-8; JBC (1996) 271 (13) 7484-93) and/or bisecting GlcNAc. is realized that part of the terminal HexNAc-type structures appear to represent bisecting GlcNAc-type type glycans, and quite low or non-existent amounts of the GlcNAcα6-branching and also low amounts of GlcNAcβ2-branch of Manβ4 described by Stanley and colleagues. Here, essentially devoid of indicates less than 10% of all the protein linked N-glycans, more preferably the additional HexNAc units are present in less than 8% of the tissue material N-glycans by mass spectrometric analysis.
The invention thus describes the major core structure of N-glycans in human tissue materials verified by NMR-spectroscopy and by specific glycosidase digestions and was further quantitated to comprise a characteristic smaller structural group glycans comprising specific terminal HexNAc group and/or bisecting GlcNAc-type structures, which additionally modify part of the core structure. The invention further reveals that the core structure is a useful target structure for analysis of tissue materials.
The characteristic monosaccharide composition of the core structure will allow recognition of the major types of N-glycan structure groups present as additional modification of the core structure. Furthermore composition of the core structure is characteristic in mass spectrometric analysis of N-glycan and allow immediate recognition for example from HexxHexNAc1-type (preferentially ManxGlcNAc1) glycans also present in total glycome composition.
The invention describes novel low-molecular weight acidic glycan components within the acidic N-glycan and/or soluble glycan fractions with characteristic monosaccharide compositions SAxHex1-2HexNAc1-2, wherein x indicates that the corresponding glycans are preferentially sialylated with one or more sialic acid residues. The inventors realized that such glycans are novel and unusual with respect to N-glycan biosynthesis and described mammalian cell glycan components, as reveal also by the fact that they are classified as “other (N-)glycan types” in N-glycan classification scheme of the present invention. The invention is directed to analyzing, isolating, modifying, and/or binding to these novel glycan components according to the methods and uses of the present invention, and further to other uses of specific marker glycans as described here. As demonstrated in the Examples of the present invention, such glycan components were specific parts of total glycomes of certain tissue materials and preferentially to certain tissue material types, making their analysis and use beneficial with regard to tissue materials. The invention is further directed to tissue material glycomes and subglycomes containing these glycan components.
The present invention is specifically directed to tissue material glycomes, which are essentially pure glycan mixtures comprising various glycans as described in the invention preferably in proportions shown by the invention. The essentially pure glycan mixtures comprise the key glycan components in proportions which are characteristics to tissue material glycomes. The preferred glycomes are obtained from human tissue materials according to the invention.
The invention is further directed to glycomes as products of purification process and variations thereof according to the invention. The products purified from tissue materials by the simple, quantitative and effective methods according to the invention are essentially pure. The essentially pure means that the mixtures are essentially devoid of contaminations disturbing analysis by MALDI mass spectrometry, preferably by MALDI-TOF mass spectrometry. The mass spectra produced by the present methods from the essentially pure glycomes reveal that there is essentially no non-carbohydrate impurities with weight larger than trisaccharide and very low amount of lower molecular weight impurities so that crystallization of MALDI metric is possible and the glycan signals can be observed for broad glycomes with large variations of monosaccharide compositions and ranges of molecular weight as described by the invention. It is realized that the purification of the materials from low amounts of tissue materials comprising very broad range of cellular materials is very challenging task and the present invention has accomplished this.
Combination Compositions of the Preferred Glycome Mixtures with Matrix for Analysis
The invention further revealed that it is possible to combine the glycomes with matrix useful for a mass spectrometric analysis and to obtain combination mixture useful for spectrometric analysis. The preferred mass spectrometric matrix is matrix for MALDI (matrix assisted laser desorption ionization mass spectrometry) with mass spectrometric analysis (abbreviated as MALDI matrix), MALDI is preferably performed with TOF (time of flight) detection.
Preferred MALDI matrices include aromatic preferably benzene ring structure comprising molecules with following characteristic. The benzene ring structure molecules preferably comprises 1-4 substituents such as hydroxyl, carboxylic acid or ketone groups. Known MALDI matrixes have been reviewed in Harvey, Mass. Spec. Rev. 18, 349 (1999). The present invention is especially and separately directed to specific matrixes for analysis in negative ion mode of MIALDI mass spectrometry, preferred for analysis of negatively charged (acidic, such as sialylated and/or sulfated and/or phosphorylated) subglycome, and in positive ion mode of MALDI mass spectrometry (preferred for analysis of neutral glycomes). It is realized that the matrices can be optimized for negative ion mode and positive ion mode.
The present invention is especially directed to glycome matrix composition optimized for the use in positive ion mode, and to the use of the MALDI-TOF matrix and matrix glycome composition, that is optimized for the use in the analysis in positive ion mode, for the analysis of glycome, preferably neutral glycome. The preferred matrices for positive ion mode are aromatic matrices, e.g. 2,5-dihydroxybenzoic acid, 2,5-dihydroxybenzoic acid/2-hydroxy-5-methoxybenzoic acid, 2,4,6-trihydroxyacetophenone or 6-aza-2-thiothymine, more preferably 2,5-dihydroxybenzoic acid. The present invention is especially directed to glycome matrix composition optimized for the use in negative ion mode, and to the use of the MALDI-TOF matrix and the matrix glycome compositions, that is optimized for the negative ion mode, for the analysis of glycome, preferably acidic glycome. The preferred matrices for negative ion mode are aromatic matrices, e.g. 2,4,6-trihydroxyacetophenone, 3-hydroxypicolinic acid, 2,5-dihydroxybenzoic acid, 2,5-dihydroxybenzoic acid/2-hydroxy-5-methoxybenzoic acid, or 6-aza-2-thiothymine, more preferably 2,4,6-trihydroxyacetophenone. The invention is further directed to analysis method and glycome-matrix composition for the analysis of glycome compositions, wherein the glycome composition comprises both negative and neutral glycome components. Preferred matrices for analysis of negative and neutral glycome components comprising glycome are aromatic matrices, e.g. 2,4,6-trihydroxyacetophenone, 3-hydroxypicolinic acid, 2,5-dihydroxybenzoic acid, 2,5-dihydroxybenzoic acid/2-hydroxy-5-methoxybenzoic acid, or 6-aza-2-thiothymine, more preferably 2,4,6-trihydroxyacetophenone.
The MALDI-matrix is a molecule capable of
1) Specifically and effectively co-crystallizing with glycome composition with the matrix, crystallizing meaning here forming a solid mixture composition allowing analysis of glycome involving two steps below
2) absorbing UV-light typically provided by a laser in MALDI-TOF instrument, preferred wavelength of the light is 337 nm as defined by the manuals of MALDI-TOF methods
3) transferring energy to the glycome composition so that these will ionize and be analyzable by the MALDI-TOF mass spectrometry. The present invention is especially directed to compositions of glycomes in complex with MALDI mass spectrometry matrix.
The present invention is specifically directed to methods of searching novel MALDI-matrixes with the above characteristic, preferably useful for analysis by the method below. The method for searching novel MALDI-matrixes using the method in the next paragraph.
The present invention is specifically directed to methods of analysis of glycomes by MALDI-TOF including the steps:
1) Specifically and effectively co-crystallizing the glycome composition with the MALDI-TOF-matrix, crystallizing meaning here forming a solid mixture composition allowing analysis of glycome involving two steps below
2) Providing UV light to crystalline sample by a laser in MALDI-TOF instrument allowing the ionization of sample
3) Analysis of the ions produced by the MALDI mass spectrometer, preferably by TOF analysis. The invention is further directed to the combination of glycome purification methods and/or quantitative and qualitative data analysis methods according to the invention.
The present invention is further directed to essentially pure glycome compositions in solid co-crystalline form with MALDI matrix. The invention is preferably a neutral and/or acidic glycome as complex with a matrix optimized for analysis of the specific glycome type, preferably analysis in negative ion mode with a matrix such as 2,4,6-trihydroxyacetophenone.
The invention is preferably a neutral (or non-acidic) glycome as complex with a matrix optimized for analysis in positive ion mode such as 2,5-dihydroxybenzoic acid.
The invention revealed that it is possible to analyze glycomes using very low amount of sample. The preferred crystalline glycome composition comprises between 0.1-100 pmol, more preferably 0.5-10 pmol, more preferably 0.5-5 pmol and more preferably about 0.5-3 pmol, more preferably about 0.5-2 pmol of sample co-crystallized with optimized amount of matrix preferably about 10-200 nmol, more preferably 30-150 nmol, and more preferably about 50-120 nmol and most preferably between 60-90 nmols of the matrix, preferably when the matrix is 2,5-dihydroxybenzoic acid. The matrix and analyte amounts are optimized for a round analysis spot with radius of about 1 mm and area of about 0.8 mm2. It is realized that the amount of materials can be changed in proportion of the area of the spot, smaller amount for smaller spot. Examples of preferred amounts per area of spot are 0.1-100 pmol/0.8 mm2 and 10-200 pmol/3 mm2. Preferred molar excess of matrix is about 5000-1000000 fold, more preferably about 10000-500000 fold and more preferably about 15000 to 200 000 fold and most preferably about 20000 to 100000 fold excess when the matrix is 2,5-dihydroxybenzoic acid.
It is realized that the amount and relative amount of new matrix is optimized based on forming suitable crystals and depend on chemical structure of the matrix. The formation of crystals is observed by microscope and further tested by performing test analysis by MALDI mass spectrometry.
The invention is further directed to specific methods for crystallizing MALDI-matrix with glycome. Preferred method for crystallization in positive ion mode includes steps: (1) optionally, elimination of impurities, like salts and detergents, which interfere with the crystallization, (2) providing solution of glycome in H2O or other suitable solvent in the preferred concentration, (3) mixing the glycome with the matrix in solution or depositing the glycome in solution on a precrystallized matrix layer and (4) drying the solution preferably by a gentle stream of air.
Preferred method for crystallization in negative ion mode includes steps: (1) optionally, elimination of impurities, like salts and detergents, which interfere with the crystallization, (2) providing solution of glycome in H2O or other suitable solvent in the preferred concentration, (3) mixing the glycome with the matrix in solution or depositing the glycome in solution on a precrystallized matrix layer and (4) drying the solution preferably by vacuum. Other preferred glycome analysis compostions
The invention is further directed to compositions of essentially pure glycome composition with specific glycan binding molecules such as lectins, glycosidases or glycosyltransferases and other glycosyl modifying enzymes such as sulfateses and/or phosphatases and antibodies. It is realized that these composition are especially useful for analysis of glycomes.
The present invention revealed that the complex glycome compositions can be effectively and even quantitatively modified by glycosidases even in very low amounts. It was revealed that the numerous glycan structures similar to target structures of the enzymes do not prevent the degradation by competitive inhibition, especially by the enzymes used. The invention is specifically directed to preferred amounts directed to MALDI analysis for use in composition with a glycosyl modifying enzyme, preferably present in low amounts. Preferred enzymes suitable for analysis include enzymes according to the Examples.
The invention is further directed to binding of specific component of glycome in solution with a specific binder. The preferred method further includes affinity chromatography step for purification of the bound component or analysis of the non-bound fraction and comparing it to the glycome solution without the binding substance. Preferred binders include lectins engineered to be lectins by removal of catalytic amino acids (methods published by Roger Laine, Anomeric, Inc., USA, and Prof. Jukka Finne, Turku, Finland), lectins and antibodies or antibody fragments or minimal binding domains of the proteins.
The present invention is especially directed to the use of glycome data for production of mathematical formulas, or algorithms, for specific recognition or identification of specific tissue materials. Data analysis methods are presented e.g. in Example 17.
The invention is especially directed to selecting specific “structural features” such as mass spectrometric signals (such as individual mass spectrometric signal corresponding to one or several monosaccharide compositions and/or glycan structures), or signal groups or subglycomes or signals corresponding to specific glycan classes, which are preferably according to the invention, preferably the signal groups (preferably defined as specific structure group by the invention), from quantitative glycome data, preferably from quantitative glycome data according to the invention, for the analysis of status of tissue materials. The invention is furthermore directed to the methods of analysis of the tissue materials by the methods involving the use of the specific signals or signal groups and a mathematical algorithm for analysis of the status of a tissue material.
Preferred algorithm includes use of proportion (such as %-proportion) of the specific signals from total signals as specific values (structural features) and creating a “glycan score”, which is algorithm showing characteristics/status of a tissue material based on the specific proportional signal intensities (or quantitative presence of glycan structures measured by any quantitation method such as specific binding proteins or quantitative chromatographic or electrophoresis analysis such as HPLC analysis). Preferably signals which are, preferably most specifically, upregulated in specific tissue materials and signals which are, preferably most specifically, downregulated in the tissue material in comparison to control tissue materials are selected to for the glycan score. In a preferred embodiment value(s) of downregulated signals are subtracted from upregulated signals when glycan score is calculated. The method yields largest score values for a specific tissue material type or types selected to be differentiated from other tissue materials.
The invention is specifically directed to methods for searching characteristic structural features (values) from glycome profiling data, preferably quantitative or qualitative glycome profiling data. The preferred methods include methods for comparing the glycome data sets obtained from different samples, or from average data sets obtained from a group of similar samples such as paraller samples from same or similar tissue material preparations. Methods for searching characteristic features are briefly described in the section: identification and classification of differences in glycan datasets. The comparison of datasets of the glycome data according to the invention preferably includes calculation of relative and/or absolute differences of signals, preferably each signal between two data sets, and in another preferred embodiment between three or more datasets. The method preferably further includes step of selecting the differing signals, or part thereof, for calculating glycan score.
It is further realized that the analyzed glycome data has other uses preferred by the invention such as use of the selected characteristic signals and corresponding glycan material:
1) for targets for structural analysis of glycans (preferably chemically by glycosidases, fragmentation mass spectrometry and/or NMR spectroscopy as shown by the present invention and/or structural analysis based on the presence of other signals and knowledge of biosynthesis of glycans). The preferred use for targets includes estimation of chemical characteristics of potential corresponding glycans for complete or partial purification/separation of the specific glycan(s). The preferred chemical characteristics to be analysed preferably include one or several of following properties: a) acidity (e.g. by presence of acidic residues such as sialic acid and/or sulfate and/or phosphate) for charge based separation, b) molecular weight or hydrodynamic volume affecting chromatographic separation, e.g. estimation of the elution volume in gel filtration methods (the effect of acidic residue can be estimated from effects of similar structures and the “size” of HexNAc (GalNAc/GlcNAc) is in general twice the size of Hex (such as Gal, Man or Glc), c) estimation (e.g. based on composition and biosynthetic knowledge of glycans) of presence of epitopes for specific binding reagents for labelling identification in a mixture or for affinity purification, d) estimation of presence of target epitopes for specific glycosylmodifying enzymes including glycosidases and/or glycosyltransferases (types of binding reagents) or for specific chemical modification reagents (such as periodate for specific oxidation or acid for specific acid hydrolysis), for modification of glycans and recognition of the modification by potential chemical change such as incorporation of radioactive label or by change of mass spectrometric signal of the glycan for labelling identification in a mixture.
2) use of the signals or partially or fully analysed glycan structures corresponding to the signals for searching specific binding reagents for recognition of tissue materials which are preferably selected as described by the present invention (especially as described above) and in the methods for identification and classification of differences in glycan datasets and/or signals selected and/or tested by glycan score methods, are preferably selected for targets for structural analysis of glycans (preferably by glycosidases, fragmentation mass spectrometry and/or NMR spectroscopy as shown by the present invention) and/or for use of the signals or partially or fully analysed glycan structures corresponding to the signals for searching specific binding reagents for recognition of tissue materials.
The preferred method includes the step of comparing the values, and preferably presenting the score values in graphs such as ones shown in
It is realized that to differentiate a tissue materials type from other(s) different characteristic signals may be selected than for another tissue material type. The invention however revealed that for tissue materials and especially for human cancer patients preferred characteristic signals include ones selected in the Examples as described above. It is realized that a glycan score can be also created with less characteristic signals or with only part of signals and still relevant results can be obtained. The invention is further directed to methods for optimisation of glycan score algorithms and methods for selecting signals for glycan scores.
In case the specific proportion (value) of a characteristic signal is low in comparison to other values a specific factor can be selected for increase the relative “weight” of the value in the glycan scores to be calculated for the cell populations.
The preferred statuses of tissue materials, to be analysed by mathematical methods such as algorithms using quantitative glycome profiling data according to the invention include differentiation status, individual characteristics and mutation, cell culture or storage conditions related status, effects of chemicals or biochemicals on cells, and other statuses described by the invention.
The present invention is especially directed to following O-glycan marker structures of tissue materials:
Core 1 type O-glycan structures following the marker composition NeuAc2Hex1HexNAc1, preferably including structures SAα3Galβ3GalNAc and/or SAα3Galβ3 (Saα6)GalNAc; and Core 2 type O-glycan structures following the marker composition NeuAc0-2Hex2HexNAc2dHex0-1, more preferentially further including the glycan series NeuAc0-2Hex2+nHexNAc2+ndHex0-1, wherein n is either 1, 2, or 3 and more preferentially n is 1 or 2, and even more preferentially n is 1;
more specifically preferably including R1Galβ4(R3)GlcNAcβ6(R2Galβ3)GalNAc,
wherein R1 and R2 are independently either nothing or sialic acid residue, preferably β2,3-linked sialic acid residue, or an elongation with HexnHexNAcn, wherein n is independently an integer at least 1, preferably between 1-3, most preferably between 1-2, and most preferably 1, and the elongation may terminate in sialic acid residue, preferably α2,3-linked sialic acid residue; and
R3 is independently either nothing or fucose residue, preferably α1,3-linked fucose residue. It is realized that these structures correlate with expression of β6GlcNAc-transferases synthesizing core 2 structures.
The present invention is especially directed to glycan compositions (structures) and analysis of high-mannose type and glycosylated N-glycans according to the formula:
Hexn3HexNAcn4,
wherein n3 is 5, 6, 7, 8, 9, 10, 11, or 12, and n4=2.
According to the present invention, within total N-glycomes of tissue materials the major high-mannose type and glucosylated N-glycan signals preferentially include the compositions with 5≦n3≦10:Hex5HexNAc2 (1257), Hex6HexNAc2 (1419), Hex7HexNAc2 (1581), Hex8HexNAc2 (1743), Hex9HexNAc2 (1905), and Hex10HexNAc2 (2067); and more preferably with 5≦n3≦9: Hex5HexNAc2 (1257), Hex6HexNAc2 (1419), Hex7HexNAc2 (1581), Hex8HexNAc2 (1743), and Hex9HexNAc2 (1905).
The present invention is especially directed to glycan compositions (structures) and analysis of low-mannose type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is 1, 2, 3, or 4, n4=2, and n5 is 0 or 1.
According to the present invention, within total N-glycomes of tissue materials the major low-mannose type N-glycan signals preferably include the compositions with 2≦n3≦4: Hex2HexNAc2 (771), Hex3HexNAc2 (933), Hex4HexNAc2 (1095), Hex2HexNAc2dHex (917), Hex3HexNAc2dHex (1079), and Hex4HexNAc2dHex (1241); and more preferably when n5 is 0: Hex2HexNAc2 (771), Hex3HexNAc2 (933), and Hex4HexNAc2 (1095).
As demonstrated in the present invention by glycan structure analysis of tissue materials, preferably this glycan group in tissue materials includes the molecular structures: (Manα)1-3Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc within the glycan signals 771, 917, 933, 1079, 1095, and 1095, and the preferred low-Man structures includes structures common all tissue material types, tri-Man and tetra-Man structures according to the Examples,
(Man0-1Manα3)Manβ4GlcNAcβ4(Fucα6)GlcNAc, more preferably the tri-Man structures:
even more preferably the abundant molecular structure:
Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc within the glycan signal 933.
Beside the qualitative variations the low-Man glycans have specific value in quantitative analysis of tissue materials. The present invention revealed that the low-Man glycans are especially useful for the analysis of status of the cells. For example the analysis in the Examples revealed that the amounts of the glycans vary between total tissue profiles and specific organelles, preferably lysosomes.
The group of low-Man glycans form a characteristic group among glycome compositions. The relative total amount of neutral glycans is notable in average human tissues. The glycan group was revealed also to be characteristic in cancerous tissues and tumors a with total relative amount of neutral glycomes increased. The difference is more pronounced within lysosomal organelle-specific glycome, wherein low-Man structures accounted nearly 50% of the neutral protein-linked glycome. Glycome analysis of tissue materials is especially useful for methods for development of binder reagents for separation of different tissue materials.
The invention is directed to analysis of relative amounts of low-Man glycans, and to the specific quantitative glycome compositions, especially neutral glycan compositions, comprising about 0 to 50% of low-Man glycans, more preferably between about 1 to 50% of solid tissue glycomes, for the analysis of tissue materials according to the invention, and use of the composition for the analysis of tissue materials.
The present invention is especially directed to glycan compositions (structures) and analysis of fucosylated high-mannose type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is 5, 6, 7, 8, or 9, n4=2, and n5=1.
According to the present invention, within total N-glycomes of tissue materials the major fucosylated high-mannose type N-glycan signal preferentially is the composition Hex5HexNAc2dHex (1403).
The present invention is especially directed to glycan compositions (structures) and analysis of neutral soluble N-glycan type glycans according to the formula:
Hexn3HexNAcn4,
wherein n3 is 1, 2, 3, 4, 5, 6, 7, 8, or 9, and n4=1.
Within total N-glycomes of tissue materials the major soluble N-glycan signals include the compositions with 4≦n3≦8, more preferably 4≦n3≦7: Hex4HexNAc (892), Hex5HexNAc (1054), Hex6HexNAc (1216), Hex7HexNAc (1378). In the most preferred embodiment of the present invention, the major glycan signal in this group within total neutral glycomes of tissue materials is Hex5HexNAc (1054).
The present invention is especially directed to glycan compositions (structures) and analysis of neutral monoantennary or hybrid-type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is an integer greater or equal to 2, n4=3, and n5 is an integer greater or equal to 0.
According to the present invention, within total N-glycomes of tissue materials the major neutral monoantennary or hybrid-type N-glycan signals preferentially include the compositions with 2≦n3≦8 and 0≦n5≦2, more preferentially compositions with 3≦n3≦6 and 0≦n5≦1, with the proviso that when n3=6 also n5=0: preferentially Hex4HexNAc3 (1298), Hex4HexNAc3dHex (1444), Hex5HexNAc3 (1460), and Hex6HexNAc3 (1622).
The present invention is especially directed to glycan compositions (structures) and analysis of neutral complex-type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is an integer greater or equal to 3, n4 is an integer greater or equal to 4, and n5 is an integer greater or equal to 0.
Within the total N-glycomes of tissue materials the major neutral complex-type N-glycan signals preferentially include the compositions with 3≦n3≦8, 4≦n4≦7, and 0≦n5≦4, more preferentially the compositions with 3≦n3≦5, n4=4, and 0≦n5≦1, with the proviso that when n3 is 3 or 4, then n5=1: Hex3HexNAc4dHex (1485), Hex4HexNAc4dHex (1647), Hex5HexNAc4 (1663), Hex5HexNAc4dHex (1809); and even more preferentially also including the composition Hex3HexNAc5dHex (1688).
In another embodiment of the present invention, the N-glycan signal Hex3HexNAc4dHex (1485) contains non-reducing terminal GlcNAcβ, and more preferentially the total N-glycome includes the structure:
In yet another embodiment of the present invention, within the total N-glycome of tissue materials, the N-glycan signal Hex5HexNAc4dHex (1809), more preferentially also Hex5HexNAc4 (1663), contain non-reducing terminal β1,4-Gal. Even more preferentially the total N-glycome includes the structure:
Galβ4GlcNAcβ2Manα3 (Galβ4GlcNAcβ2Manα6)Manβ4GlcNAcβ4GlcNAc (1663); and in a further preferred embodiment the total N-glycome includes the structure:
The present invention is especially directed to glycan compositions (structures) and analysis of neutral fucosylated N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n5 is an integer greater than or equal to 1.
Within the total N-glycomes of tissue materials the major neutral fucosylated N-glycan signals preferentially include glycan compositions wherein 1≦n5≦4, more preferentially 1≦n5≦3, even more preferentially 1≦n5≦2, and further more preferentially compositions Hex3HexNAc2dHex (1079), more preferentially also Hex2HexNAc2dHex (917), and even more preferentially also Hex5HexNAc4dHex (1809).
The inventors further found that within the total N-glycomes of tissue materials a major fucosylation form is N-glycan core α1,6-fucosylation. In a preferred embodiment of the present invention, major fucosylated N-glycan signals contain GlcNAcβ4(Fucα6)GlcNAc reducing end sequence.
Neutral N-Glycans with Non-Reducing Terminal HexNAc
The present invention is especially directed to glycan compositions (structures) and analysis of neutral N-glycans with non-reducing terminal HexNAc according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n4≧n3.
Preferably these glycan signals include Hex3HexNAc4dHex (1485) in all tissue materials.
The present invention is especially directed to glycan compositions (structures) and analysis of acidic hybrid-type or monoantennary N-glycans according to the formula:
NeuAcn1NeuGcn2Hexn3HexNAcn4dHexn5SPn6,
wherein n1 and n2 are either independently 1, 2, or 3; n3 is an integer between 3-9; n4 is 3; n5 is an integer between 0-3; and n6 is an integer between 0-2; with the proviso that the sum n1+n2+n6 is at least 1.
Within the total N-glycomes of tissue materials the major acidic hybrid-type or monoantennary N-glycan signals preferentially include glycan compositions wherein 3≦n3≦6, more preferentially 3≦n5≦5, and further more preferentially composition NeuAcHex4HexNAc3dHex (1711).
The present invention is especially directed to glycan compositions (structures) and analysis of acidic complex-type N-glycans according to the formula:
NeuAcn1NeuGcn2Hexn3HexNAcn4dHexn5SPn6,
wherein n1 and n2 are either independently 1, 2, 3, or 4; n3 is an integer between 3-10; n4 is an integer between 4-9; n5 is an integer between 0-5; and n6 is an integer between 0-2; with the proviso that the sum n1+n2+n6 is at least 1.
Within the total N-glycomes of tissue materials the major acidic complex-type N-glycan signals preferentially include glycan compositions wherein 4≦n4≦8, more preferentially 4≦n4≦6, more preferentially 4≦n4≦5, and further more preferentially compositions NeuAcHex5HexNAc4 (1930), NeuAcHex5HexNAc4dHex (2076), NeuAc2Hex5HexNAc4 (2221), NeuAcHex5HexNAc4dHex2 (2222), and NeuAc2Hex5HexNAc4dHex (2367).
The inventors found that tissue material total N-glycomes; and soluble+N-glycomes further contain characteristic modified glycan signals, including sialylated fucosylated N-glycans, multifucosylated glycans, sialylated N-glycans with terminal HexNAc (the N>H and N═H subclasses), and sulphated or phosphorylated N-glycans, which are subclasses of the abovementioned glycan classes. According to the present invention, their quantitative proportions in different tissue materials have characteristic values as described in Tables 8 and 13.
Specifically, major phosphorylated glycans typical to tissue materials, more preferentially to lysosomal organelle glycomes, include Hex5HexNAc2(HPO3) (1313), Hex6HexNAc2(HPO3) (1475), and Hex7HexNAc2(HPO3) (1637).
The preferred complete glycomes of tissue materials include low-mannose type, hybrid-type or monoantennary, hybrid, and complex-type N-glycans,
which more preferentially contain fucosylated glycans, even more preferentially also sialylated glycans, and further more preferentially also sulphated and/or phosphorylated glycans;
and most preferentially also including soluble glycans as described in the present invention.
In a preferred embodiment of the present invention the tissue material total N-glycome contains the three glycan types: i) high-mannose type, 2) hydrid-type or monoantennary, and 3) complex-type N-glycans; and more preferably, in the case of solid tissues or cells also 4) low-mannose type N-glycans; and further more preferably, in the case of solid tissues or cells additionally 5) soluble glycans.
In a preferred embodiment of the preferred glycan type combinations within the tissue material complete glycomes, their relative abundances are as described in Tables 8 and 13.
The invention revealed that the present method is useful for the characterization of O-glycans and glycolipids. It is realized that the key glycolipid structures easily analyzable are based on common lactosylceramide backbone structure cleavable by ceramide glycanase. The key structural families present varyingly on cells are
Exoglycosidase digestions and mass spectrometric fragmentation analyses suggested the presence of various glycolipid types including these glycolipid classes: lacto-, neolacto-, ganglio-, globo-, and isoglobo-type structures. By use of relative quantitation methods, quantitative data of these glycan classes could be compared within cell types and between cell types, as demonstrated in the Examples of the present invention. Cell types were found to differ both in qualitative and in quantitative expression of these glycan classes.
The Table “Examples of glycosphingolipid glycan classification” reveals quantitative differences in various classes of glycolipids. The glycolipid classes are in the example produced based on monosaccharide compositions. The Hex2-group is corresponds to lactosylceramide, the trisaccharide was associated mainly to Lactotriasylceramide GlcNAcβ3LacCer (especially preferred as characteristic signal-marker). The Gb-group was characterised to contain the Galβ3isogloboside, Galβ3GalNAcβGalα3/4Galβ4GlcβCer characteristic for the cell type, glycolipids (see Example about fragmentation mass spectrometry of glycolipids). The L1 glycan groups with characteristic monosaccharide compositions were analysed to contain lacto-, and/or neolacto-glycolipids such as Galβ3GlcNAcβ3Galβ4GlcβCer and Galβ4GlcNAcβ3Galβ4GlcβCer respectively in all analyzed cell types and for cells further comprise Galβ3GalNAcβ4(SAα3)Galβ4GlcβCer (see Example about fragmentation mass spectrometry of glycolipids), the groups. Fractions L2-L3 comprise elongated variants of the previous structures. The invention is further directed to quantitation of subspecies within the groups and providing quantitative comparisons of the different glycolipid family structures for the cells
The glycosidase analysis performed reveal presence of terminal Galβ4GlcNAc and Galβ3GlcNAc structures in the cells types indicating Neolacto- and Lacto-glycolipid series according to the invention. These appear to be characteristic to cell types.
The invention is directed to the use of the present glycomics analysis methods for oligosaccharides released from glycolipids of a cell type according to the invention. The invention is further directed to quantitative mass spectrometric analysis by the present methods, when the signal profile is quantitated according to the invention and combination of the glycolipid glycome analysis with a protein glycome analysis, preferably with analysis of O-glycans and/or N-glycans. The quantitative data is compared between cell or tissue types
The preferred classification for the quantitative analysis by mass spectrometry is represented in the Table “Examples of glycosphingolipid glycan classification”. The invention is preferably directed to glycolipid glycomes of the cells comprising the signals in the %-amounts in the ranges defined by smallest and largest numbers (preferably 5% is added to largest and 5% is subtracted from the smallest if possible to allow experimental variation) according to the Table.
The invention is directed to glycolipid oligosaccharide compostions isolated from cells according to invention and compositions thereof in complex with analysis matrix.
The Table “Examples of O-linked glycan classification” reveals quantitative differences in various classes of glycolipids. The O-glycan classes are in the example produced based on monosaccharide compositions. The O1-group correspond to characteristic core 1 structures and O2 group core 2 glycans, the fucosylation and sialylation produce characteristic changes in the quantitative O-glycan glycomes.
A preferred classification for the quantitative analysis by mass spectrometry is represented in the Table “Examples of O-linked glycan classification”. The invention is preferably directed to glycolipid glycomes of the cells comprising the signals in the %-amounts in the ranges defined by smallest and largest numbers (preferably 5% is added to largest and 5% is subtracted from the smallest if possible to allow experimental variation) according to the Table. The invention is directed to O-glycome oligosaccharide compostions isolated from cells according to invention and compositions thereof in complex with analysis matrix.
The high relative amounts neutral fucosylated O-glycans are preferably characteristic for a cell type and low amounts are preferred markers for other cells. The invention is further directed to quantitation of sulphated or fosforylated glycolipids.
The invention is directed to the use of the present glycomics analysis methods for oligosaccharides released from O-glycans of a cell type according to the invention. The invention is further directed to quantitative mass spectrometric analysis by the present methods, when the signal profile is quantitated according to the invention and combination of the O-glycan glycome analysis with a with analysis of glycolipid oligosaccharide and/or N-glycan glycomes. The quantitative data is compared between cell or tissue types
Comparison of qualitative and quantitative glycan data from N- and/or O-linked glycans revealed that cell types express specific modulation of the terminal epitopes for example by fucosylation and/or sialylation. The inventors further found that these different glycan types were differently modified by terminal epitopes both within cell type, such as fucosylation and type 1 and type 2 chain expression that were found to differ between O-glycans and glycolipids of different cell types. This characterized more completely useful glycan epitope selections in each cell type for their identification and manipulation according to the methods of the present invention.
The present invention is specifically directed to representing glycan structures in cells as combined relatively quantitative representation of glycan type, glycan class, and/or glycan epitope expression. In another embodiment of the present invention, glycans are selected based on their cell type specific qualitative or quantitative expression, such as cell type specificity, abundancy, or functionality. The present invention is further directed to using such selected glycan structures in cells for identification and/or manipulation of cells according to the methods of the present invention.
Comparison of at Least Two Glycomes from Specific Cell Preparations
The invention is directed to characterization to the method wherein at least two glycomes, selected from the group of conjugated glycomes: N-glycome, O-glycome, and lipid linked glycome; are determined from at least two cell populations according to the invention and the data from both or all glycomes are compared between the cells quantitatively. In a preferred embodiment all three glycomes are analyzed.
In a preferred embodiment the two glycomes comprise at least one protein linked glycome, preferably N-linked glycome, which is convenient to analyze. In another preferred embodiment a glycolipid glycome is compared together with comparison a protein linked glycome. In another preferred embodiment two protein linked glycomes, N-glycome and O-glycome, are determined according to the invention. In a preferred embodiment the glycomes to be compared includes both acidic and neutral glycans, in another embodiment neutral glycomes are compared or acidic glycomes are compared or both acid and neutral glycomes are compared only for part of conjugated glycomes and acidic or neutral glycomes are compared for the rest. It is realized that neutral glycomes are often easier to analyze and compare a wealth of data.
The invention is further directed to use of the glycome analysis together with specific binder analysis especially analysis of terminal epitopes.
Analysis and Utilization of Poly-N-Acetyllactosamine Sequences and Non-Reducing Terminal Epitopes Associated with Different Glycan Types
The present invention is directed to poly-N-acetyllactosamine sequences (poly-LacNAc) associated with tissue materials according to the present invention. The inventors found that different types of poly-LacNAc are characteristic to different tissue materials, as described in the Examples of the present invention. In particular, human leukocytes are characterized by linear type 2 poly-LacNAc; other human cell types were found to be characterized by branched type 2 poly-LacNAc or type 1 terminating poly-LacNAc. The present invention is especially directed to the analysis and utilization of these glycan characteristics according to the present invention. The present invention is further directed to the analysis and utilization of the specific tissue materials associated glycan sequences revealed in the present Examples according to the present invention.
The present invention is directed to non-reducing terminal epitopes in different glycan classes including N- and O-glycans, glycosphingolipid glycans, and poly-LacNAc. The inventors found that especially the relative amounts of β1,4-linked Gal, β1,3-linked Gal, β1,2-linked Fuc, α1,3/4-linked Fuc, α-linked sialic acid, and α2,3-linked sialic acid are characteristically different between the studied tissue materials; and the invention is especially directed to the analysis and utilization of these glycan characteristics according to the present invention.
The present invention is further directed to analyzing fucosylation degree in O-glycans by comparing indicative glycan signals such as neutral O-glycan signals at m/z 771 and 917 as described in the Examples. The tissue materials analyzed in the present invention had characteristic fucosylation degree determined based on these signals.
Especially, the present invention is directed to analyzing terminal epitopes associated with poly-LacNAc in tissue materials, more preferably when these epitopes are presented in the context of a poly-LacNAc chain, most preferably in O-glycans or glycosphingolipids. The present invention is further directed to analyzing such characteristic poly-LacNAc, terminal epitope, and fucosylation profiles according to the methods of the present invention, in glycan structural characterization and specific glycosylation type identification, and other uses of the present invention; especially when this analysis is done based on endo-β-galactosidase digestion, by studying the non-reducing terminal fragments and their profile, and/or by studying the reducing terminal fragments and their profile, as described in the Examples of the present invention. The inventors found that tissue materials specific glycosylation features are efficiently reflected in the endo-α-galactosidase reaction products and their profiles. The present invention is further directed to such reaction product profiles and their analysis according to the present invention. The present invention is further especially directed to compositions derived from the endo-α-galactosidase reaction products derived from tissue materials.
The inventors further found that all three most thoroughly analyzed cellular glycan classes, N-glycans, O-glycans, and glycosphingolipid glycans, were differently regulated compared to each other, especially with regard to non-reducing terminal glycan epitopes and poly-LacNAc sequences as described in the Examples and Tables of the present invention. Therefore, combining quantitative glycan profile analysis data from more than one glycan class will yield significantly more information. The present invention is especially directed to combining glycan data obtained by the methods of the present invention, from more than one glycan class selected from the group of N-glycans, O-glycans, and glycosphingolipid glycans; more preferably, all three classes are analyzed; and use of this information according to the present invention. In a preferred embodiment, N-glycan data is combined with O-glycan data; and in a further preferred embodiment, N-glycan data is combined with glycosphingolipid glycan data.
The present invention is further especially directed to combining glycan compositions obtained by the methods of the present invention, from more than one glycan class selected from the group of N-glycans, O-glycans, and glycosphingolipid glycans; more preferably, all three classes are combined. The invention is further directed to use of such compositions according to the present invention. In a preferred embodiment, N-glycan compositions are combined with O-glycan compositions; and in a further preferred embodiment, N-glycan compositions are combined with glycosphingolipid glycan compositions.
The present invention reveled novel methods and characteristic monosaccharide composition to evaluating status of a cell or tissue preparation by evaluating presence of a glycan structure comprising galactose based glycan structures:
i) oligolactosamine comprising epitope GlcNAcβ3Galβ4Glc(NAc)0 or 1, or
ii) a glycolipid or O-glycan structure comprising Galβ3GalNAc,
wherein the characteristic oligosaccharide composition for the released glycan oligosaccharides comprising the structures are released from the tissues are analysed as free unmodified oligosaccharides, and the amounts of oligosaccharides are characterized quantitatively by MALDI-TOF mass spectrometry.
The invention is further directed to the method, wherein lactosamine comprising epitope R1-GlcNAcβ3Galβ4Glc(NAc)0or1βR2,
wherein R1 and R2 are non-reducing end and reducing end glycan structures, is detected by cleaving the Galβ4-linkage by endo-β-galactosidase enzyme and composition of the oligosaccharide mixture is compared to oligosaccharide composition before the cleavage. The preferred compositions comprise at least Hex2HexNAc1, more preferably also Hex1HexNAc1, even more preferably also Hex2HexNAc1dHex1, further even more preferably also Hex3HexNAc2dHex(0-2) and/or Hex4HexNAc3dHex(0-3).
The invention is further directed to recognition of the preferred structures from various glycomes such as disease associated glycomes or individual specific glycomes, wherein the marker glycan structures are recognized based on alteration of signals corresponding to specific monosaccharide compositions, when the alteration is measured in comparison to control material such as normal tissue. The method preferably includes verification of the process by analyzing the specific structures of the sample from or a large group of similar samples with by methods such as glycosidase digestion, fragmentation mass spectrometry, or NMR. In a preferred profiling method both O-glycans and N-glycans are measures. The invention is specifically directed to various glycan composition obtained according to the invention, when the compositions comprise the indicated glycan types with similar % amounts of total glycans, the similar amount is preferably on average within 50% (in a preferred embodiment % units), more preferably 30%, more preferably 25% and even more preferably 15% most preferably within about 5% of the indicated amounts.
The invention is further directed to specific compostions of N-glycans, O-glycans and glycolipid oligosaccharides as indicated above and as indicated in the Tables e.g. to
A Human N-glycan glycome compositions comprising the defined glycan classes in relative amount according to Table 8-10 or 13 or within 30% units of the defined amounts, preferably the glycome being derived from the specific cell type.
The invention further directed to N-glycomes with compositions according to the table 13 and N-glycomes as defined in Table 8 comprising high-Mannose N-glycans 10-65% more preferably 12-58%; low-mannose 0-35%, preferably for tissues 1-40%, more preferably 3-33% and for a secretion, preferably serum 0-5% more preferably 0.1-2%; glucosylated high-Mannose glycans 0-5%, more preferably 0.1-4% for cell lines; fucosylated low-mannose glycan 0-5%, more preferably O-3%; hybrid/monoantennary glycans 1-15%; more preferably 3-12%; complex type glycans 5-75%, preferably for neutral tissue glycomes 1-35%, more preferably 4-20% and for total samples of Table 8 or for serum 40-75%, more preferably 40-65% for total samples and 50-75% for serum; soluble (N-) glycans 0-20% preferably 0.01-3% for tissues or serum and 1-20% for cell lines; fucosylated glycan 5-60% preferably 10-50% for tissues and 35-65% for serum; at least dificosylated 0-5% more preferably 0.01-3%; and terminal HexNAx group (nHexNAc>nHex) glycans from 0-30%, more preferably 1-25% for tissue a and about 15-30% more preferably 17-27% for serum and for terminal HexNAx group (nHexNAc=nhex) glycans from 1-40%, more preferably 1-15% for tissues and about 20-45% more preferably 25-37% for serum. Other preferred ranges includes 3, and 5% and 7% unit narrower ranges from upper end or both lower and upper end of the ranges, but when lower range starts from 0% then narrower range is only 1, 2, or 3% unit narrower from the lower end, respectively. The invention is further directed to method of comparing the content of glycan groups selected from the group low Mannose, high-Mannose, complex type, fucosylated, difucosylated on or both of the terminal HexNAc group, most preferably low mannose or/and terminal HexNAc groups are analyzed.
The invention is specifically directed to animal serum, preferably glycomes comprising the glycan compositions in amount described and more preferably analysis of pathogenesis associated changes in the compositions, preferably in context of cancer an/or liver diseases.
The invention is further directed to the analysis of low-mannose glycans, soluble glycans and/or difucosylated glycans from serum according to the invention. It is realized that these glycans changes in context of pathogenesis but are low in normal serum major proteins.
B. Human glycolipid glycome compositions comprising the defined glycan classes in relative amount according to Table 14 or 15 or within 30% units of the defined amounts, preferably the glycome being derived from the specific cell type.
The invention is directed to glycolipid glycome compositions as in Table 14 comprising neutral or sialylated Lac glycans 0-10%; Ltri Glycans 5-35%; L1 glycans 15-65%, preferably 15-50% neutral L1 glycans and 45-65% sialylated L1 glycans; L2 glycans 0-25% more preferably 5-25% neutral, and 0-5% sialylated L2 glycans as described; L3+glycans 0-25%; 0-35% fucosylated glycans, preferably 15-35% neutral fucosylated glycans; 0-45% terminal HexNAc glycans, preferably 10-45%. Other preferred ranges includes 3, and 5% and 7% unit narrower ranges from upper end or both lower and upper end of the ranges, but when lower range starts from 0% then narrower range is only 1, 2, or 3% unit narrower from the lower end, respectively.
C. Human O-glycan glycome compositions comprising the defined glycan classes in relative amount according to Table 14 or within 30%-units of the defined amounts, preferably the glycome being derived from the specific cell type.
The invention is directed to O-glycan glycome compositions as in Table 14 comprising neutral or sialylated O1 glycans 0-85%, preferably neutral O1 glycans within 0-10% and sialylated 35-85%; O2Glycans 15-65%; preferably neutral 35-65 and sialylated 15-45%; O3+glycans 0-25; 0-35% fucosylated glycans, preferably 15-35% neutral fucosylated glycans and 0-25% sialylated; 0-25% terminal HexNAc glycans, preferably 3-20% of neutral terminal HexNAc glycans; and 0-25% sulfated or phosphorylated glycan, more preferably sulfated in range of 0-20%. Other preferred ranges includes 3, 5, and 7% unit narrower ranges preferably from upper end or both lower and upper end of the ranges, but when lower range starts from 0% then narrower range is 1, 2, or 3% unit narrower from the lower end.
The invention is further directed to the glycome compositions in essentially pure form and comprising preferred markers structures and/or a MALDI matrix.
In a preferred profiling process at least one internal standard molecule preferably a glycan molecule is added to glycans to be analyzed. The glycan molecule is in a preferred embodiment a chemical derivative of a normal N-glycan or O-glycan or human milk oligosaccharide, the preferred derivatization occurs at limited amount of positions leaving substantial similarity to native glycans, preferred modification position is the reducing end of the glycans, and preferred example of modification is reductive amination with a hydroxyl-alkyl-amine molecule, when the length of the alkyl is selected to avoid overlap with regular glycans in the profiles. In a preferred embodiment the modification is modification by stable isotopes such as C13 or H2(deuterium) as well known in the art. Preferably the internal standard has molecular weight different from the molecular weights of the cancer glycans. More preferably the internal standard comprises specific amount of standard molecule, which is in range of the individual signals to be detected from the sample. More preferably at least two internal standards of different molecular weights are used, even more preferably the at least two internal standards comprise different amounts of the marker molecule allowing detection of efficacy of the measurement, the two standards with different amounts are preferably have similar structures (e.g. different reducing end derivatization) molecular weights close to each other, preferably within 200 Da, more preferably within 100 Da and most preferably within about 80 Da or especially for neutral glycans about 60 Da, but difference larger than the isotope patterns/ion forms do not over lap, preferably over 20 Da for neutral glycans and for acidic glycans over about 40 Da more preferably over 50 Da. The invention is further directed to use of internal standard molecules, or pairs of molecules with differing amounts, so that at least one molecule or molecule pair is at lower mass range, and one molecule or pair at higher mass range, or even more preferably additionally one molecule or pair of molecules in the middle of the mass range.
The invention revealed novel binding reagents are in a preferred embodiment used for isolation of cellular components from cells/tissues comprising the novel target/marker structures. The isolated cellular are preferably free glycans or glycans conjugated to proteins or lipids or fragment thereof. The binders are glycan binding proteins such as lectins, antibodies or enzymes. The invention is directed to analysis of cellular fractions by the methods according to the invention wherein the fractions are prepared by binding of specific binder to glycan structures.
The invention is especially directed to isolation of the cellular components comprising the structures when the structures comprises one or several types glycan materials and/or
The isolation of cellular components according to the invention means production of a molecular faction comprising increased amount of the glycans comprising the target structures according to the invention in method comprising the step of binding of the binder molecule according to the invention to the target structures.
The preferred method to isolate cellular component includes following steps
1) Providing a tissue or cell sample.
2) Contacting the binder molecule according to the invention to the target structures.
3) Isolating the complex of the binder and target structure at least part of cellular materials.
It is realized that the components are in general enriched in specific fractions of cellular structures such as cellular membrane fractions including plasma membrane and organelle fractions and soluble glycan comprising fractions such as soluble protein, lipid or free glycans fractions. It is realized that the binder can be used to total cellular fractions. In a preferred embodiment the target structures are enriched within a fraction of cellular proteins such as cell surface proteins releasable by protease or detergent soluble membrane proteins.
More preferably the invention is directed to purification of the target structure fraction in the isolation step. The purification is in a preferred mode of invention is at least partial purification. Preferably the target glycan containing material is purified at least two fold, preferably among the components of cell fraction wherein it is expressed. More preferred purification levels includes 5-fold and 10 fold purification, more preferably 100, and even more preferably 1000-fold purification. Preferably the purified fraction comprises at least 10% of the target glycan comprising molecules, even more preferably at least 30%, even more preferably at least 50%, even more preferably at least 70% pure and most preferably at least 90% pure. Preferably the % value is mole percent in comparison to other non-target glycan comprising glycoconjugate molecules, more preferably the material is essentially devoid of other major organic contaminating molecules.
The invention is also directed to isolated or purified target glycan-binder complexes and isolated target glycan molecule compositions.
Preferably the purified target glycan-binder complex compositions comprises at least 10% of the target glycan comprising molecules in complex with binder, even more preferably at least 30%, even more preferably at least 50%, even more preferably at least 70% pure and most preferably at least 90% pure target glycan comprising molecules in complex with binder.
Preferably the purified target glycan composition comprises at least 10% of the target glycan comprising molecules, even more preferably at least 30%, even more preferably at least 50%, even more preferably at least 70% pure and most preferably at least 90% pure target glycan comprising molecules.
Preferred N-Glycans According to Structural Subgroups with Terminal HexNAc
The inventors found that there are novel animal tissue derivatives, wherein the animal is preferably not CHO-cell line or ovarian tissue of chinese hamster or Chinese hamster, and especially human glycome composition such as, cancer-specific differences with regard to terminal HexNAc containing N-glycans characterized by either the formulae: nHexNAc=nHex≧5 and ndHex≧1 (group 1), or the formulae: nHexNAc=nHex≧5 and ndHex=0 (group II). The present data demonstrated that group I glycans were 1) detected in various protein-linked glycan samples isolated from malignant tumor samples and metastases, 2) overexpressed when compared to corresponding normal tissues, 3) expressed independently of group II that was not expressed in specific studied examples of malignant tumors. Therefore, the N-glycan structure group determined by the formula nHeXNAc=nHex>5 is divided into two independently expressed subgroups I and II as described above. It is realized that both of these groups preferably correspond to bisecting GlcNAc type N-glycans and other terminal HexNAc containing N-glycans.
Based on the known specificities of the biosynthetic enzymes synthesizing N-glycan core α1,6-linked fucose and β1,4-linked bisecting GlcNAc, group II preferably corresponds to bisecting GlcNAc type N-glycans while group I preferentially corresponds to other terminal HexNAc containing N-glycans, preferentially with a branching HexNAc in the N-glycan core structure, more preferentially including structures with a branching GlcNAc in the N-glycan core structure. In a specific embodiment the glycan structures of this group includes core fucosylated bisecting GlcNAc comprising N-glycan, wherein the additional GlcNAc is GlcNAcβ4 linked to Manβ4GlcNAc epitope forming epitope structure GlcNAcβ4Manβ4GlcNAc preferably between the complex type N-glycan branches.
In a preferred embodiment of the present invention, such structures include GlcNAc linked to the 2-position of the β1,4-linked mannose. In a further preferred embodiment of the present invention, such structures include GlcNAc linked to the 2-position of the β3,4-linked mannose as described for LEC14 structure (Raju and Stanley J. Biol Chem (1996) 271, 7484-93). In a further preferred embodiment of the present invention, such structures include GlcNAc linked to the 6-position of the β1,4-linked GlcNAc of the N-glycan core as described for LEC18 structure (Raju, Ray and Stanley J. Biol Chem (1995) 270, 30294-302).
The invention is specifically directed to further analysis of the subtypes of the group one glycans comprising structures according to the group I. The invention is further directed to production of specific binding reagents against the N-glycan core marker structures and use of these for analysis of the preferred cancer marker structures. The invention is further directed to the analysis of LEc14 and/or 18 structures by negative recognition by lectins PSA (pisum sativum) or Intil (Lens culinaris) lectin or core Fuc specific monoclonal antibodies, which binding is prevented by the GlcNAcs.
Invention is specifically directed to N-glycan core marker structure, wherein the disaccharide epitope is Manβ4GlcNAc structure in the core structure of N-linked glycan according to the Formula CGN:
[Manα3]n1(Manα6)n2Manβ4GlcNAcβ4(Fucα6)n3GlcNAcxR,
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising structures of Formula CGN, wherein Manα3/Manα6-residues are elongated to the complex type, especially biantennary structures and n3 is 1 and wherein the Manβ4GlcNAc-epitope comprises the GlcNAc substitutions.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising structures of Formula CGN, wherein Manα3/Manα6-residues are elongated to the complex type, especially biantennary structures and n3 is 1 and wherein the Manβ4GlcNAc-epitope comprises between 1-8% of the GlcNAc substitutions.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising structures of Formula CGN, wherein the structure is selected from the group
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising of Formula CGN, wherein the Manβ4GlcNAc-epitope comprises and the GlcNAc residue is β2-linked to Manβ4 forming epitope GlcNAcβ2Manβ4.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising of Formula CGN, wherein the Manβ4GlcNAc-epitope comprises and the GlcNAc residue is 6-linked to GlcNAc of the epitope forming epitope Manβ4(GlcNAc6)GlcNAc.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising of Formula CGN, wherein the Manβ4GlcNAc-epitope comprises and the GlcNAc residue is 4-linked to GlcNAc of the epitope forming epitope GlcNAcβ4Manβ4GlcNAc.
The invention is further directed to recognition of the preferred structures from various glycomes such as disease associated glycomes or individual specific glycomes, wherein the marker glycan structures are recognized based on alteration of signals corresponding to specific monosaccharide compositions, when the alteration is measured in comparison to control material such as normal tissue. The method preferably includes verification of the process by analyzing the specific structures of the sample from or a large group of similar samples with by methods such as glycosidase digestion, fragmentation mass spectrometry, or NMR. In a preferred profiling method both O-glycans and N-glycans are measures. The invention is specifically directed to various glycan composition obtained according to the invention, when the compositions comprise the indicated glycan types with similar % amounts of total glycans, the similar amount is preferably on verge within 50%, more preferably 25% and most preferably within 15% of the indicated amounts.
In a preferred profiling process at least one internal standard molecule preferably a glycan molecule is added to glycans to be analyzed. The glycan molecule is in a preferred embodiment a chemical derivative of a normal N-glycan or O-glycan or human milk oligosaccharide, the preferred derivatization occurs at limited amount of positions leaving substantial similarity to native glycans, preferred modification position is the reducing end of the glycans, and a preferred example of modification is reductive amination with a hydroxyl-alkyl-amine molecule, when the length of the alkyl is selected to avoid overlap with regular glycans in the profiles. In a preferred embodiment the modification is modification by stable isotopes such as C13 or H2(deuterium) as well known in the art. Preferably the internal standard has molecular weight different from the molecular weights of the cancer glycans. More preferably the internal standard comprises specific amount of standard molecule, which is in range of the individual signals to be detected from the sample. More preferably at least two internal standards of different molecular weights are used, even more preferably the at least two internal standards comprise different amounts of the marker molecule allowing detection of efficacy of the measurement, the two standards with different amounts are preferably have similar structures (e.g. different reducing end derivatization) molecular weights close to each other, preferably within 200 Da, more preferably within 100 Da and most preferably within about 80 Da or especially for neutral glycans about 60 Da, but difference larger than the isotope patterns/ion forms do not over lap, preferably over 20 Da for neutral glycans and for acidic glycans over about 40 Da more preferably over 50 Da. The invention is further directed to use of internal standard molecules, or pairs of molecules with differing amounts, so that at least one molecule or molecule pair is at lower mass range, and one molecule or pair at higher mass range, or even more preferably additionally one molecule or pair of molecules in the middle of the mass range.
The invention revealed novel binding reagents are in a preferred embodiment used for isolation of cellular components from cells/tissues comprising the novel target/marker structures. The isolated cellular are preferably free glycans or glycans conjugated to proteins or lipids or fragment thereof. The binders are glycan binding proteins such as lectins, antibodies or enzymes.
The invention is especially directed to isolation of the cellular components comprising the structures when the structures comprises one or several types glycan materials sele
The isolation of cellular components according to the invention means production of a molecular faction comprising increased amount of the glycans comprising the target structures according to the invention in method comprising the step of binding of the binder molecule according to the invention to the target structures.
The preferred method to isolate cellular component includes following steps
1) Providing a tissue or cell sample.
2) Contacting the binder molecule according to the invention to the target structures.
3) Isolating the complex of the binder and target structure at least part of cellular materials.
It is realized that the components are in general enriched in specific fractions of cellular structures such as cellular membrane fractions including plasma membrane and organelle fractions and soluble glycan comprising fractions such as soluble protein, lipid or free glycans fractions. It is realized that the binder can be used to total cellular fractions.
In a preferred embodiment the target structures are enriched within a fraction of cellular proteins such as cell surface proteins releasable by protease or detergent soluble membrane proteins.
More preferably the invention is directed to purification of the target structure fraction in the isolation step. The purification is in a preferred mode of invention is at least partial purification. Preferably the target glycan containing material is purified at least two fold, preferably among the components of cell fraction wherein it is expressed. More preferred purification levels includes 5-fold and 10 fold purification, more preferably 100, and even more preferably 1000-fold purification. Preferably the purified fraction comprises at least 10% of the target glycan comprising molecules, even more preferably at least 30%, even more preferably at least 50%, even more preferably at least 70% pure and most preferably at least 90% pure. Preferably the % value is mole percent in comparison to other non-target glycan comprising glycoconjugate molecules, more preferably the material is essentially devoid of other major organic contaminating molecules.
The invention is also directed to isolated or purified target glycan-binder complexes and isolated target glycan molecule compositions.
Preferably the purified target glycan-binder complex compositions comprises at least 10% of the target glycan comprising molecules in complex with binder, even more preferably at least 30%, even more preferably at least 50%, even more preferably at least 70% pure and most preferably at least 90% pure target glycan comprising molecules in complex with binder.
Preferably the purified target glycan composition comprises at least 10% of the target glycan comprising molecules, even more preferably at least 30%, even more preferably at least 50%, even more preferably at least 70% pure and most preferably at least 90% pure target glycan comprising molecules.
Preferred N-Glycans According to Structural Subgroups with Terminal HexNAc
The inventors found that there are novel human glycome composition such as, cancer-specific differences with regard to terminal HexNAc containing N-glycans characterized by either the formulae: nHexNAc=nHex≧5 and ndHex≧1 (group I), or the formulae: nHexNAc=nHex≧5 and ndHex=0 (group II). The present data demonstrated that group I glycans were 1) detected in various protein-linked glycan samples isolated from malignant tumor samples and metastases, 2) overexpressed when compared to corresponding normal tissues, 3) expressed independently of group II that was not expressed in specific studied examples of malignant tumors. Therefore, the N-glycan structure group determined by the formula nHexNAc=nHex≧5 is divided into two independently expressed subgroups I and II as described above. It is realized that both of these groups preferably correspond to bisecting GlcNAc type N-glycans and other terminal HexNAc containing N-glycans.
Based on the known specificities of the biosynthetic enzymes synthesizing N-glycan core α1,6-linked fucose and β1,4-linked bisecting GlcNAc, group II preferably corresponds to bisecting GlcNAc type N-glycans while group I preferentially corresponds to other terminal HexNAc containing N-glycans, preferentially with a branching HexNAc in the N-glycan core structure, more preferentially including structures with a branching GlcNAc in the N-glycan core structure. In a specific embodiment the glycan structures of this group includes core fucosylated bisecting GlcNAc comprising N-glycan, wherein the additional GlcNAc is GlcNAcβ4 linked to Manβ4GlcNAc epitope forming epitope structure GlcNAcβ4Manβ4GlcNAc preferably between the complex type N-glycan branches.
In a preferred embodiment of the present invention, such structures include GlcNAc linked to the 2-position of the β1,4-linked mannose. In a further preferred embodiment of the present invention, such structures include GlcNAc linked to the 2-position of the β1,4-linked mannose as described for LEC14 structure (Raju and Stanley J. Biol Chem (1996) 271, 7484-93). In a further preferred embodiment of the present invention, such structures include GlcNAc linked to the 6-position of the β1,4-linked GlcNAc of the N-glycan core as described for LEC14 structure (Raju, Ray and Stanley J. Biol Chem (1995) 270, 30294-302).
The invention is specifically directed to further analysis of the subtypes of the group one glycans comprising structures according to the group I. The invention is further directed to production of specific binding reagents against the N-glycan core marker structures and use of these for analysis of the preferred cancer marker structures. The invention is further directed to the analysis of LEc14 and/or 18 structures by negative recognition by lectins PSA (pisum sativum) or Intil (Lens culinaris) lectin or core Fuc specific monoclonal antibodies, which binding is prevented by the GlcNAcs.
Invention is specifically directed to N-glycan core marker structure, wherein the disaccharide epitope is Manβ4GlcNAc structure in the core structure of N-linked glycan according to the Formula CGN:
[Manα3]n1(Manα6)n2Manβ4GlcNAcβ4(Fucα6)n3GlcNAcxR,
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising structures of Formula CGN, wherein Manα3/Manα6-residues are elongated to the complex type, especially biantennary structures and n3 is I and wherein the Manβ4GlcNAc-epitope comprises the GlcNAc substitutions.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising structures of Formula CGN, wherein Manα3/Manα6-residues are elongated to the complex type, especially biantennary structures and n3 is 1 and wherein the Manβ4GlcNAc-epitope comprises between 1-8% of the GlcNAc substitutions.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising structures of Formula CGN, wherein the structure is selected from the group
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising of Formula CGN, wherein the Manβ4GlcNAc-epitope comprises and the GlcNAc residue is β2-linked to Manβ4 forming epitope GlcNAcβ2Manβ4.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising of Formula CGN, wherein the Manβ4GlcNAc-epitope comprises and the GlcNAc residue is 2-linked to GlcNAc of the epitope forming epitope Manβ4(GlcNAc6)GlcNAc.
The invention is further directed to the N-glycan core marker structure and marker glycan compositions comprising of Formula CGN, wherein the Manβ4GlcNAc-epitope comprises and the GlcNAc residue is 4-linked to GlcNAc of the epitope forming epitope GlcNAcβ4Manβ4GlcNAc.
Subglycome Derived from High Molecular Weight Proteins of Complex Biological Fluids Such as Body/Tissue Fluid Such as Serum, Saliva or Milk
A subglycome can also be produced by fractionation of the starting material prior to glycan isolation step. Such process leads to production of a Tissue fraction subglycome. A preferred subglycome is a glycome where a range of proteinaceous material is fractionated by size exclusion. The fractionation can be used to enrich components that display changes in glycome as a function of disease state. A preferred tissue fraction subglycome is produced by treating serum with a processing step that separates the serum components by size. Such device can be a filter with a specific MW cutoff, a gel permeation chromatography step or any other device capable of separation of biomolecules by size. It is realised that serum contains a vast amount of immunoglobulins, transferrin, and alpha-1 acid glycoprotein amongst others. These produce an excessive glycoprotein background that lowers the detection limit for changes that occur in high molecular weight glycoproteins such as mucins. It is realised that cancer tissues secrete mucins with cancer specific glycans, which can be analysed with higher sensitivity and specificity when analysing high molecular weight protein glycome of human subjects.
In another embodiment high Mw proteins from serum are produced by specific precipitation reactions such as acetone precipitation as described by the invention or acid precipitation. The invention is also directed all types of filtration methods to obtain effectively high Mw proteins such as mucin protein over Mw 100 kDA, even more preferably 200 kDA or even more preferably effective gel filtration size from regular protein standards over 500 kDA. In a preferred embodiment the filtration method is an ultrafiltration method using a membrane filter with a specific cut-off level as defined above.
A prepurification method to obtain larger Mw glycoproteins from body fluids or other complex biological fluids is, in EXAMPLE 25.
The inventors realized that improved through-put for analysis can be obtained by using array format purification of small sample amounts and/or purification of glycans from complex biological matrices. A suitable array may consist of any of the above described solid-phase extraction/purification steps. Array format solid-phase extraction (SPE) plates can be stacked for processing of multiple samples at a single run. Several array formats are in use in the field, 12*8 (96 well format) being the most widely used, but several others such as 384 well, 768 well or higher are in use. It is further realised that in analogy to a device described above, a stack of SPE array plates may be used as a means to perform the device operations.
In a plate array format, liquid flow through the SPE can be facilitated by methods known in the art for microwell plates, for example by vacuum. A switch analogous to such described above may be facilitated by adding SPE plates to a stack or removing them, or changing the sample collection in the stack. A preferred sample collection is a microtiter plate placed in the SPE stack.
For instance, a stack of cation exchange array plate, reversed phase array plate and a graphite chromatography array plate can be used to simultaneously remove the contaminants from the sample (by the cathion exchange and reversed phase material) and to trap the oligosaccharide material to the grapphige material. Purification mode is then switched by removing the two uppermost plates and subjecting the graphite plate to elution. This is further exemplified in the Examples of the invention, especially EXAMPLE 26.
The present invention is specifically directed to standard molecules and their use in mass spectrometry calibration. Specifically, use of two or more standard glycans with known structure and amount is used for accurate mass calibration, method validation, analyte quantity determination, and/or determination of relative detection response in different areas of the m/z range. In a specific embodiment, the mixture is used to monitor relative quantitation in mass spectra over the analysis m/z range. The inventors showed that equimolar amounts of N-glycans shows equimolar signal intensities in MALDI-TOF mass spectrometry. N-glycans for mixtures were obtained from Glycoseparations Inc. and their quantity was determined by NMR using known amount of internal standard (pNp-GlcA).
The inventors showed that defined amounts of glycans in a standard molecule mixture produced mass spectrometric signals accurately reflecting the relative amounts of the analytes, and further that decreasing the amounts of certain analytes was quantitatively reflected in the recorded relative signal intensities. Further, the inventors showed that this correlation was accurate in a wide m/z range. It is realized that analysis performance can be affected by sample amount and quality, sample matrix, equipment performance, and other factors known in the art. The inventors have devised methods to calibrate mass spectrometric analyses for improvement in the quality of analysis results, especially relative quantitation in mass spectrometric profiling.
The present invention is especially directed to validating, calibrating, and quality control of the analysis results and the analysis method. In a preferred embodiment of the invention, the effective dynamic range resulting from the present inventions over the m/z range in mass spectrometric analysis, more preferably MALDI-TOF mass spectrometry, is at least 102, more preferably at least 103, and even more preferably at least 104.
The present invention is specifically directed to using standard molecule mixtures to calibrate relative quantitation efficiency over the recorded m/z range in mass spectrometric analysis, more preferably MALDI-TOF mass spectrometry. In a preferred embodiment, mixtures of defined amounts of standard molecules are used to calibrate signal-to-amount coefficients over the m/z range, more preferably essentially covering the m/z range to be analyzed, even more preferably with equimolar amounts of each molecule. In a further embodiment, mixtures with different amounts of glycans are used to analyze detection performance, more preferably including dynamic range and signal-to-noise ratios. In a further embodiment, the standard mixture is at the same time additionally used for mass calibration, analyte quantitation, and optionally other uses of the invention.
In a further preferred embodiment, the standard mixture is used to validation, efficacy monitoring, and quality control of analysis-to-analysis variation over time. More preferably, the analysis performance is followed by analyzing a standard mixture together with a sample series, and comparing the results from the standard mixture to previous results. In a further preferred embodiment, the invention is specifically directed to calibrating the sample results based on correction formulae derived from standard mixtures. Calibrated variables preferably include both m/z and relative signal intensity.
The standards can be used as external or internal standards according to the present invention. The invention is specifically directed to standard mixtures selected based on knowledge presented in the Tables of the present invention, so that they do not have overlapping m/z values with the analytes to be analyzed, taking into account also adduct formation and isotope patterns as described in the invention.
This is further exemplified in the Examples especially Example 27.
In a preferred embodiment external calibration standard comprising glycans to be determined is used. The external standard is in a preferred embodiment a mixture of N-glycans, and/or O-glycans and/or oligosaccharides derived from glycolipids with specific quantity and in a preferred embodiment separate standards are produced for acidic and neutral glycans. The preferred standards for acidic glycans includes sialylated glycans and/sulfated and/or fosforylated glycans The oligosaccharides may be produced synthetically as known in the art or isolated from natural sources such as natural glycoproteins. The standard is used for producing calibration for quantity of the components. The invention reveals that glycans with mass difference of one or few monosaccharide residues behave reasonably quantitatively, when the amount of saccharides are changed in suitable mass/molar range. The external standard can be used for correction of mass bias caused by differences between signals of large and small oligosaccharides. In general there is tendency smaller signals for larger oligosaccharides. The external standard is useful for correcting the change of signal response between smaller (closer to the range 400-700 Da) and larger signals (above Mw 1500 or 2000 Da). The external standard is also useful for correction of differences caused by properties of individual glycans
The suitable range means amount enough for separation from the background noise and below the amount which would saturate the matrix.
In a preferred embodiment the invention is directed to adding at least one internal standard molecule, which structure is similar to the glycans to be analysed and effective signal from similar amount of sample is similar to the one produced from glycans to be analyzed. The effective ionization and the signal level corresponding to amount of glycans is preferably calibrated with standard oligosaccharides such as N-glycan, O-glycans or glycolipid oligosaccharides as described for external standard. The molecular size to of the external standard molecule is preferably similar to the glycans to be analyzed but it preferably does not overlap with any glycan signals in the spectrum. It is realized that different standards can be developed for different glycan classes.
A preferred group of internal standard molecules for positive ion mode includes glycans HexNAc3-HexNAc10, more preferably HexNAc4-HexNAc7 and Hex1HexNAc4-Hex1HexNAc7 and de-Na-acetylated variants thereof which are alkylated with an alkyl forming residues not known from natural glycans such as propionic acid or glycolic acid forming HexNPropyl, or HexNGc residues with characteristic similar to natural oligosaccharides especially NGc-groups and variants reduced or reductively aminated from the reducing end. The HexNAc in the structures is preferably GlcNAc and the structures are chitin derived oligosaccharides (oligomer of GlcNAcβ4GlcNAc) from chitotriose to chitodecaose GlcNAc4-GlcNAc10, smaller chitin oligosaccharides to chito octa- or heptasaccharides or more preferably to hexa saccharides are preferred because of the better solubility. The hexose can be added to the standard oligosaccharide by galactosyltransferase of transgalactosylation reaction well-known in the art, e.g by β4-galactosyltransferase (bovine milk, sigma) with UDP-Gal as nucleotide sugar donor. Another preferred HexNAc oligomer is NAGOS, N-acetylgalactosmine oligosaccharides, which comprises oligomer of GalNAcα4GalNAc available from known bacterial polysaccharides. Preferred acidic standard oligosaccharides includes chitin oligosaccharides wherein reducing end GlcNAc is ozidized to GlcNAc-onic acid e.g by a halogen catalyzed oxidation, further preferred acidic oligosaccharides includes SA1Hex1HexNAc4-SA1Hex1HexNAc7 or dimeric or oligomeric variants thereof conjugated from the reducing end or reducing end oxidized variants (e.g. by divalent aldehyde reactive amino-oxy conjugates as demonstrated for anti-influenza oligosaccharide in a copending application). The sialylated variant can be effectively synthesized by sialyltransferase (e.g. α3-sialyltransferase from Calbiochem and CMP-Neu5Ac as donor) reaction from the galactosylated chitin oligosaccharides such as Galβ4GlcNAc4-Galβ4GlcNAc7 to Neu5Aα3Galβ4GlcNAc4-Neu5Acα3Galβ4GlcNAc7
It is realized that the detection and suitable minimal amounts of sample are in a preferred embodiment optimized by one or several methods preferably including: concentrating the sample-matrix composition to more limited are on maldi-plate, technologies for this including use of hydrophobic materials for concentrating of liquid droplets on MALDI analysis plate are known, producing the sample in miniatyrized chromatography device such as chip based chromatography device with very small. Current chip base device technologies are exemplified by minityriazed devices developed by Complex Carbohydrate Research Center in Georgia Athens (M. Novotny and colleagues), another examples of similar technologies includes miniatyrized gel electrophoresis devices used for protein separation. It is realized that by the simple optimization the detection limit of current method can be reduced to detection of less than 1000 cells, such as less than 500 cells, or even more preferably 100 or less than 100 cells or even to less than 50 cells. With lower amount of cells that detection can be optimized by conjugating the sample from the reducing end as described in the invention. The invention is further directed to optimized detection of individual glycans signals below 50 femtomols, more preferably below 10 femtomols, even more preferably 5 femtomols, even more preferably below 1 femtomol, 0.5 femtomol and most preferably below 0.1 femtomols or even 0.05 femtomols.
The invention provides effective detection of low mass range signals including Hex1HexNAc2 and Hex2HexNAc2, or even HexNAc2 under optimized conditions. In a preferred embodiment the Hex1HexNAc2, and HexNAc2 are N-glycan core structure Manβ4GlcNAcβ4GlcNAc or GlcNAcβ4GlcNAc.
The invention further is directed to detection small unusual acidic saccharides such as preferably SA1Hex1HexNAc2, SA1Hex2HexNAc1 but even SA1Hex1HexNAc1, SA1HexNAc1, and SA1Hex1, from glycoproteins, preferably SA1Hex1HexNAc2, SA1Hex1HexNAc1, or SA1HexNAc1 more preferably SA1Hex1HexNAc2, SA1Hex1HexNAc1 (wherein SA is sialic acid preferably Neu5Ac or Neu5Gc) or detection of the signals from glycoprotein comprising cellular materials. In a preferred embodiment the invention is directed to MALDI-TOF mass spectrometric detection of SAα3Galβ3GalNAc and/or Galβ3(SAα6)GalNAc, SAα3Galβ3(SAα6)GalNAc and/or SAα6GalNAc released from O-glycans.
The invention is further directed to the detection low molecular weight sulfated and or fosforylated glycans including preferably SP1Hex1HexNAc2, SP1Hex2HexNAc1 but even SP1Hex1HexNAc1, SP1HexNAc1. In a preferred embodiment the glycans are detected as free non-modified or from the reducing end reduced alditol molecules. The glycans to be detected have preferably O-glycan structures SP1Galβ3(GlcNAcβ6)GalNAc SP1Galβ3GalNAc, SP1GalNAc or glycolipid structure SP1Galβ4Glc, in a preferred embodiment the glycans are sulfated in the structures, preferably to 3- or 6-position to Gal and/or 6-position of GlcNAc. It is realized that other methods such as permethylation methods are not suitable for detection of sulfated glycans and the present invention provides effective purification for sulfated oligosaccharides including the small sulfated glycans. It is generally believed that MALDI-TOF mass spectrometry would not be suitable for quantitative analysis of acidic oligosaccharides due to release of sialic acids and or sulfate groups. It is further realized specific mass spectrometry technology is preferred for the detection acidic glycans including the small acidic glycans, the preferred technology is equivalent of reflector negative analysis mode provided by e.g. Bruker Biflex mass spectometer. The laser energy is in the preferred method adjusted to level so low that sialic acid or SP (sulfate/fosfate) groups are not released. In a preferred embodiment the analysis of acidic glycans is performed from THAP matrix.
It is realized that specific MALDI technology would be preferably used for detection of lower range glycan masses such as linear and/or reflector mode analysis technologies, especially as provided by Bruker (Biflex mass spectrometer and newer versions).
Analysis of Glycans from Purified Proteins Such as Antibodies
It is realized that the present invention provides also effective method for analysis of glycans from isolated or purified proteins in very low quantities. The mass spectrometric, especially by MALDI-TOF detection of low Mw and/or acidic glycans, especially small acidic glycans, preferably by micro/nanocolumn technologies, after the optimized purification is useful method for protein analytic from various challenging protein samples.
Use of the purification and quantitative MALDI mass spectrometry for analysis of proteins from various type preparations
In a preferred embodiment the invention is directed to the detection from proteins e.g. from electrophoresis or affinity chromatography. It is realized that samples from electrophoresis comprise residues of detergents such as SDS and the present purification methods allow production of useful MALDI sample from such material. Furthermore affinity chromatography eluted material may comprise acid, base and/or salt, which are effectively removed from the glycan sample material. The present invention further provides effective removal of residues produced by protein N-glycanase or protein endo-N-glycanase reactions or from reductive or residual reagents from non-reductive beta-elimination reactions.
The invention is further directed to methods for analysis of N-glycans and/or O-glycans from recombinant antibodies and growth factors and other biotechnical proteins. The purification methods are especially useful for purifying the sample when the sample comprise biological fluid such as serum, urine, milk or like described according to the invention. It is realized that cell culture media, especially in context of mammalian cell culture, are also complex biological fluid comprising often serum, crude protein such as albumin or transferring preparations and lipids and other biological molecules which purification to level allowing detection by MALDI-TOF is not trivial. The invention provides also method for purification of the oligosaccharides when the protein sample comprises residue of the biological fluid or cell culture mixture after single chromatography step such as a binding protein (protein A, protein L, protein M or equivalent) purification of antibodies from cell culture.
Glycan isolation. N-linked glycans are preferentially detached from cellular glycoproteins by F. meningosepticum N-glycosidase F digestion (Calbiochem, USA) essentially as described previously (Nyman et al., 1998), after which the released glycans are preferentially purified for analysis by solid-phase extraction methods, including ion exchange separation, and divided into sialylated and non-sialylated fractions. For O-glycan analysis, glycoproteins are preferentially subjected to reducing alkaline β-elimination essentially as described previously (Nyman et al., 1998), after which sialylated and neutral glycan alditol fractions are isolated as described above. Free glycans are preferentially isolated by extracting them from the sample with water.
Example of a glycan purification method. Isolated oligosaccharides can be purified from complex biological matrices as follows, for example for MALDI-TOF mass spectrometric analysis. Optionally, contaminations are removed by precipitating glycans with 80-90% (v/v) aqueous acetone at −20° C., after which the glycans are extracted from the precipitate with 60% (v/v) ice-cold methanol. After glycan isolation, the glycan preparate is passed in water through a strong cation-exchange resin, and then through C18 silica resin. The glycan preparate can be further purified by subjecting it to chromatography on graphitized carbon material, such as porous graphitized carbon (Davies, 1992). To increase purification efficiency, the column can be washed with aqueous solutions. Neutral glycans can be washed from the column and separated from sialylated glycans by elution with aqueous organic solvent, such as 25% (v/v) acetonitrile. Sialylated glycans can be eluted from the column by elution with aqueous organic solvent with added acid, such as 0.05% (v/v) trifluoroacetic acid in 25% (v/v) acetonitrile, which elutes both neutral and sialylated glycans. A glycan preparation containing sialylated glycans can be further purified by subjecting it to chromatography on microcrystalline cellulose in n-butanol:ethanol:water (10:1:2, v/v) and eluted by aqueous solvent, preferentially 50% ethanol:water (v/v). Preferentially, glycans isolated from small sample amounts are purified on miniaturized chromatography columns and small elution and handling volumes. An efficient purification method comprises most of the abovementioned purification steps. In an efficient purification sequence, neutral glycan fractions from small samples are purified with methods including carbon chromatography and separate elution of the neutral glycan fraction, and glycan fractions containing sialylated glycans are purified with methods including both carbon chromatography and cellulose chromatography.
MALDI-TOF mass spectrometry. MALDI-TOF mass spectrometry is performed with a Voyager-DE STR BioSpectrometry Workstation or a Bruker Ultraflex TOF/TOF instrument, essentially as described previously (Saarinen et al., 1999; Harvey et al., 1993). Relative molar abundancies of both neutral (Naven & Harvey, 1996) and sialylated (Papac et al., 1996) glycan components are assigned based on their relative signal intensities. The mass spectrometric fragmentation analysis is done with the Bruker Ultraflex TOF/TOF instrument according to manufacturer's instructions.
Examples of analysis sensitivity. Protein-linked and free glycans, including N- and O-glycans, are typically isolated from as little as about 5×104 cells in their natural biological matrix and analyzed by MALDI-TOF mass spectrometry.
Examples of analysis reproducibility and accuracy. The present glycan analysis methods have been validated for example by subjecting a single biological sample, containing human cells in their natural biological matrix, to analysis by five different laboratory personnel. The results were highly comparable, especially by the terms of detection of individual glycan signals and their relative signal intensities, indicating that the reliability of the present methods in accurately describing glycan profiles of biological samples including cells is excellent. Each glycan isolation and purification phase has been controlled by its reproducibility and found to be very reproducible. The mass spectrometric analysis method has been validated by synthetic oligosaccharide mixtures to reproduce their molar proportions in a manner suitable for analysis of complex glycan mixtures and especially for accurate comparison of glycan profiles from two or more samples. The analysis method has also been successfully transferred from one mass spectrometer to another and found to reproduce the analysis results from complex glycan profiles accurately by means of calibration of the analysis.
Examples of biological samples and matrices for successful glycan analysis. The method has been successfully implied on analysis of e.g. blood cells, cell membranes, aldehyde-fixated cells, glycans isolated from glycolipids and glycoproteins, free cellular glycans, and free glycans present in biological matrices such as blood. The experience indicates that the method is especially useful for analysis of oligosaccharide and similar molecule mixtures and their optional and optimal purification into suitable form for analysis.
Generation of glycan profiles from mass spectrometric data.
Glycoprotein-linked glycans in human serum. Protein-linked glycans, corresponding to both N- and O-glycans, were isolated as described above from glycoproteins precipitated from serum of one donor, and analyzed by MALDI-TOF mass spectrometry. Major glycans that were detected included 1) in the neutral glycan fraction, biantennary neutral N-glycans (IgG type), and Hex5-9HexNAc2 N-glycans (high-mannose type), and 2) in the sialylated N-glycan fraction, biantennary and larger sialylated N-glycans (orosomucoid type). The obtained serum protein-linked glycomes serve as a control database against which changes in serum glycoprotein glycans can be detected. It was noted that the neutral N-glycan fraction isolated according to the present invention could be detected by the present method from significantly smaller sample amounts than total N-glycan glycomes or sialylated N-glycan glycomes. It is suggested that the neutral glycan fraction isolated from human serum glycoproteins allows very sensitive detection of changes in serum protein-linked glycan profiles.
Isolation and analysis of protein-linked glycans from human blood cells. Mononuclear cells and red blood cells are isolated from human blood for example by gradient centrifugation in solution containing sodium citrate (Vacutainer CPT, BD) and washed with phosphate buffered saline. Total cellular glycoproteins are precipitated and washed with organic solvents, such as aqueous solutions of acetone and ethanol. N-glycans and O-glycans are isolated from the precipitated glycoproteins, divided into sialylated and neutral glycan fractions, and analyzed by MALDI-TOF mass spectrometry as described above.
White blood cell N-glycan profiles. The isolated neutral N-glycans included glycan signals corresponding to glycan groups according to the present invention: high-mannose type, low-mannose type, hybrid-type/monoantennary, and complex N-glycans, as well as monosaccharide compositions Hex1-9HexNAc1, the latter possibly being free glycans and not protein-linked glycans. The isolated sialylated N-glycans included glycan signals corresponding to glycan groups according to the present invention: hybrid-type, monoantennary, and complex-type N-glycans. The resulting profiles differed from both serum glycoprotein glycan profiles and human tissue protein-linked glycans.
A lymphocyte subpopulation was isolated from human blood using Ficoll isolation (Amersham Pharmacia Biotech) and differentiation marker affinity purification. Cells were cultured for one week in a synthetic cell culture medium supplemented with 1% human serum.
N- and O-glycan profiling analysis. Glycans are isolated from cells, purified, and analyzed by MALDI-TOF mass spectrometry as described above. Typically, at least 25 and more preferentially 50 sialylated N-linked glycan signals, at least 10 and more preferentially over 15 neutral N-linked glycan signals, at least 5 sialylated O-glycan signals, and at least 2 neutral O-glycan signals can be detected and their relative abundancies determined according to the present method. Changes in glycan profiles and specific glycan structures upon in vitro cell culture can be detected by comparing the glycosylation data before and after in vitro culture. Examples of such analyses are presented below.
Glycan profile analysis. Protein-linked glycans were isolated from a human lymphocyte subpopulation grown in cell culture conditions for one week. In glycan profiling analysis it was observed that the relative amounts of five major neutral N-glycans at m/z 1257, 1419, 1581, 1743, and 1905, corresponding to [M+Na]+ ions of high-mannose type N-glycans Hex5-9HexNAc2, were significantly more abundant than the major sialylated N-glycans. This is in contrast to native human cells.
Sialic acid linkage analysis. The isolated sialylated N- and O-glycans were treated with recombinant S. pneumoniae α2,3-sialidase essentially as described previously (Saarinen et al., 1999). The vast majority of the sialylated N-glycans were susceptible to hydrolysis by the enzyme, indicating that nearly all sialic acids in the sialylated N-glycans were α2,3-linked. This is in contrast to native human cells. α2,3-linkage was also predominant in the O-glycans.
Fucosylation analysis. By sequential digestions with specific glycosidases performed essentially as described previously (Hemmerich et al., 1995), except that analysis of digestion results was performed by MALDI-TOF mass spectrometry according to the present invention, it was shown that fucose residues in O-glycans occurred in the sialyl-Lewis x epitope, Neu5Acα2-3Galβ1-4(Fucα1-3)GlcNAcβ. In contrast, fucose residues were shown by similar sequential digestions to reside mainly in the N-glycan core sequence. The latter result is in accordance with the initial glycan grouping according to the present invention, because sialylated N-glycans in the original sample did not include significant amounts of glycans with proposed monosaccharide compositions with more than one deoxyhexose residue, which was not indicative of other fucose linkages apart from α1,6-Fuc of the N-glycan core sequence.
The present results demonstrate that cell glycosylation profiles can change significantly upon in vitro cell culture even in a relatively short time. In particular, the present results indicate that cell cultivation can change the relative amounts of high-mannose N-glycans compared to other glycan types, N-glycan sialic acid linkages, and the overall glycosylation profile. It was also demonstrated that these glycosylation changes can be detected and characterized according to the present method. It was also shown that N- and O-glycan specific fucosylation can be characterized by the present method.
Cell culture. SW 480 cells (human colon adenocarcinoma cell line) were cultured in 5% CO2 atmosphere in RPMI medium containing 10% fetal calf serum (FCS). Cells were split twice a week. The bigger the passage was, more rapidly the cells grew. For starvation cells were washed twice with RPMI containing 0.2% FCS and incubated in the same medium for 14 hours. Normally cells were approximately 70% confluent, except for sample “confluent” which was 100% confluent.
Membrane protein isolation. Membrane protein isolation was performed at +0-+4° C. Cells were washed with phosphate buffered saline and collected by centrifugation. The cells were incubated in hypotonic buffer (25 mM Tris-HCl pH 8.5), broken by homogenisation, and brought back to isotonic buffer by the addition of NaCl to 150 mM. The homogenate was centrifuged at 40,000 g in order to recover the cell membranes. The crude membrane pellet was homogenized in detergent buffer containing 25 mM Tris-HCl pH 7.5, 150 mM NaCl, and 1% (w/v) β-octylglucoside. After incubation, the preparate was centrifuged at 100,000 g and the supernatant that contained the detergent extracted proteins was collected. Buffer salts and the detergent were removed by cold acetone precipitation as described previously (Verostek et al., 2000).
Glycan profiling of cultured cells in different cell culture phases.
General change in cultured human cell lines. Similar phenomena were observed in also other human cell lines studied, including over 10 cancer cell lines as well as cell lines derived from normal human tissues. The major change occurring in the cells when they are cultured in vitro, is that high-mannose type N-glycans are significantly more abundant when compared to e.g. low-mannose type N-glycans or complex-type N-glycans including sialylated N-glycans.
Analysis of antibody glycosylation. Monoclonal antibodies (mAbs) are typically produced as recombinant proteins in mammalian cell lines or in plants, and polyclonal antibodies can be purified from e.g. serum by methods known in the art. IgG is a glycoprotein with two N-glycosylation sites per molecule (one per heavy chain), but occasionally also two (or 2n) extra N-glycosylation sites per molecule (one per variable region, or n). The structures of the conserved N-glycans are highly conserved (REF 1), but they can change due to e.g. disturbances in the recombinant protein production. The present method as described above allows for detection of these abnormalities, as demonstrated by model mAbs analyzed by the present method, as described below. The glycan profiles were analyzed from glycan pools isolated by N-glycosidase F, or N-glycosidase A (from almonds; Calbiochem, USA) in the case of plant glycoproteins, and analyzed as described in the preceding Examples.
In an example where antibody molecules contain abnormal N-glycans in the conserved site, glycan signals arising from the abnormal glycans are observed in increased amounts compared to the normal glycan signals that correspond to the monosaccharide compositions Hex3-5HexNAc4dHex1. Easily detected abnormal structures include mannose type N-glycans corresponding to the monosaccharide compositions Hex2-9HexNAc2.
In an example where antibody molecules contain normal N-glycans in the conserved site, but in abnormal relative amounts compared to each other, the abnormality can be observed from the glycan profile according to the present invention. Normal glycan profiles of IgG molecules are described in the literature (e.g. Raju et al., 2000). An example of an abnormal profile of normal glycans is such that the relative amounts of the Hex3HexNAc4dHex1 glycoforms are significantly increased compared to the Hex4-5HexNAc4dHex1 glycoforms, which can give rise to side effects in the potential use of the antibody molecule, such as affinity towards receptors of the innate immunity and/or serum clearance systems. Observed examples (1. and 2.) of differential proportions of the glycoforms Hex3HexNAc4dHex1: Hex3HexNAc4dHex1:Hex3HexNAc4dHex1 are presented below together with examples from the literature (3. and 4.). The present results suggest that the method of glycan analysis according to the present invention is useful in the characterization of recombinant antibodies.
In an example where antibody molecules contain normal complex-type N-glycans in the conserved sites and abnormal glycan types in an extra N-glycosylation site, the abnormality can be observed in the glycan profile according to the present invention: in this case the abnormal variable region N-glycans and the normal conserved N-glycans occur in approximate molar proportions of 1:1. For example, high-mannose type N-glycans observed in an extra variable region N-glycosylation site can give rise to side effects in the potential use of the antibody molecule, such as affinity towards receptors of the innate immunity and/or serum clearance systems.
In an example where antibody molecules are produced in plants, the glycan profiles can differ significantly from animal-type glycan profiles. For evaluating suitability of glycosylation for antibodies, the relationship of Hex3HexNAc4dHex1, less preferentially Hex3HexNAc4, even less preferentially Hex3HexNAc2dHex0-1, monosaccharide compositions, to other glycan signals is calculated, with the proviso that the deoxyhexose-containing N-glycans can be liberated by N-glycosidase F as well as N-glycosidase A enzymes, indicating that they do not contain α1,3-linked fucose in the N-glycan core. Generally, glycan profiles generated by N-glycosidase F and N-glycosidase A enzymes should not differ at all, if the antibody is suited for human use. As another means of evaluating the suitability of glycosylation for antibodies, non-animal type glycans should not appear in the glycan profile. These include all glycan signals corresponding to monosaccharide compositions containing pentose or more than one deoxyhexose. For reasons listed above, also high-mannose or low-mannose type N-glycans are not preferred in recombinant antibodies. In conclusion, the present method allows for effective and rapid evaluation of recombinant protein glycosylation, when they are produced in plants or other non-animal systems.
In vitro modification of native preparations—galactosylation. It was observed that an antibody preparation had the relative amounts of the Hex3HexNAc4dHex0-1 glycoforms significantly increased compared to the Hex4-5HexNAc4dHex0-1 glycoforms, possibly causing side effects in the use of the antibody. A normal glycoform profile of the preparation was restored by in vitro incubation with β1,4-galactosyltransferase, uridine diphospho-galactose, divalent cations, and suitable buffer and temperature as known in the art, without denaturing the antibody.
In vitro modification of native preparations—demannosylation. It was observed that an antibody preparation had abnormal variable region N-glycans, since its N-glycan profile showed the normal conserved N-glycans and high-mannose type N-glycans occurring in approximate molar proportions of 1:1 in the profile. High-mannose glycoforms were removed from the glycan profile by in vitro incubation with either α-mannosidase from Jack beans (C. ensiformis; Sigma, USA) or Endoglycosidase H (Calbiochem, USA) in suitable reaction buffers and temperature as known in the art, without denaturing the antibody. In these reactions, the characterized reaction products were antibody preparations containing Hex1-5HexNAc2 (Manα1-4Manβ1-4GlcNAcβ1-4GlcNAc) and GlcNAc variable region extra N-glycans, respectively.
Isolation and fractionation of human milk oligosaccharides. Samples of human milk were defatted by centrifugation and deproteinized by precipitation with 68% (v/v) aqueous ethanol. The supernatant was dried and the residue was extracted with water. The water-soluble oligosaccharides were subjected to gel filtration chromatography in water. High-molecular weight neutral oligosaccharides were separated from sialylated oligosaccharides at the void volume by porous graphitized carbon chromatography as described above. The gel filtration chromatography phase resulted in enrichment of high-molecular weight neutral and sialylated oligosaccharides, as described below. Furthermore, from a sample of Lea-b-milk, neutral oligosaccharide fraction corresponding approximately to lacto-N-octaoses was further fractionated by normal-phase high-performance liquid chromatography (HPLC) on amino-bonded silica column and porous graphitized carbon HPLC (Hypercarb), with absorbance detection at 214 nm and either descending (normal-phase) or ascending (Hypercarb) acetonitrile gradient in aqueous mild ammonia solution. The enhanced separation capabilities resulted in isolation of a heptasaccharide not previously described in human milk, as described below.
Novel high-molecular weight oligosaccharides in human milk. By MALDI-TOF mass spectrometry, novel neutral and acidic oligosaccharides were detected in high-molecular weight fractions of the gel filtration chromatography step. The detected neutral and sialylated oligosaccharides had molecular masses up to nearly 3700 Da and nearly 2600 Da, respectively. Overall, the detected neutral oligosaccharides had apparent monosaccharide compositions Hexm+2HexNAcmdHexn, corresponding to (GalmGlcNAcmFucn)Galβ1-4Glc, where (n≦m) and (1≦m≦8), or even (m≦9) when (n=0). Overall, the detected sialylated oligosaccharides had apparent monosaccharide compositions NeuAcoHexm+2HexNAcmdHexn, corresponding to (GalmGlcNAcmFucnNeuAco)Galβ1-4Glc, where (n≦m) and (o≦m) and (1≦m≦4), or even (m≦5) when (n=0) and (o=1).
Novel fucosylated heptasaccharide in human milk. From the Lea-b-milk sample, the following isomeric neutral heptasaccharide eluting into the same fraction in normal-phase HPLC, were isolated and characterized: (I) Galβ1-4(Fucα1-3)GlcNAcβ1-6(Galβ1-4GlcNAcβ1-3)Galβ4Glc, (II) Galβ1-4(Fucα1-3)GlcNAcβ1-6(Galβ1-3GlcNAcβ1-3)Galβ1-4Glc, (III) Galβ1-4(Fucα1-3)GlcNAc, 1-3 (Galβ1-4GlcNAcβ1-6)Gal 1-4Glc, (IV) Fucα1-2Galβ1-3GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc, (V) Galβ1-4GlcNAcβ1-3Galβ1-4(Fucα1-3)GlcNAcβ1-3Galβ1-4Glc, and (VI) Gal 1-3GlcNAcβ1-3 Galβ1-4(Fucα1-3)GlcNAcβ1-3 Gal 1-4Glc. The structures were verified by sequential exoglycosidase digestions, mild acid hydrolysis, 1D 1H-NMR against known standard molecules, chromatographic coelution with known standard molecules, and MALDI-TOF mass spectrometry. The oligosaccharide III has not been previously described in human milk.
Analysis of bovine milk glycoprotein fractions. Neutral N-glycans were analyzed from samples of delipidated bovine milk and milk powder as described in the preceding Examples. Lactoferrin was also isolated from the same samples by cation exchange chromatography and the purified lactoferrin was analyzed similarly as the total milk sample.
Analysis of human digestive tract tissue samples.
Tissue-specific glycosylation information and it use. It is seen that each tissue depicted in the Figures differs from another by 1) differences in overall glycan profiles, 2) differences in individual glycan signals, and/or 3) relative abundancies of individual glycan signals or glycan signal groups according to the present invention. The tissues can be recognized based on the glycan profiles and individual glycan signals that correspond to tissue-specific expression of glycans. The sialylated glycan fractions also contain similar specific information that can be combinated with the neutral glycan fraction information to gain more specificity and resolving power.
Individual differences in tissue glycosylation.
Disease-specific differences in glycosylation.
Total N-glycans were isolated and analyzed as described above from formalin-fixed rat liver. The major difference between the glycan profiles of rat and humans was in the sialylated glycan fraction, namely the occurrence of acetylated sialylated glycans at m/z 1972, 2263, and 2305, corresponding to Ac1NeuAc1Hex5HexNAc4, Ac1NeuAc2Hex5HexNAc4, and Ac2NeuAc2Hex5HexNAc4, respectively, as major sialylated N-glycans in rat liver. It is concluded that the present method is well suited to finding species-specific differences in glycosylation.
Glycolipid and glycan isolation. Glycolipids were isolated from peripheral blood mononuclear cells essentially as described in (Karlsson et al., 2000). Sphingoglycolipids were detached by digestion with endoglycoceramidase from Macrobdella decora (Calbiochem, USA). After the reaction, liberated glycans were purified, fractionated into sialylated and neutral glycan fractions, and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples.
Glycolipid glycan profiles. Table 3 describes the detected glycan signals and their proposed monosaccharide compositions. The monosaccharide compositions correlate with known glycolipid core structures, such as gangliosides, lacto- and neolactoglycolipids, and globosides, and extensions of the core structures, such as poly-N-acetyllactosamine chains. Several glycans show fucosylation and/or sialylation of the core and extended structures.
Cells, Mononuclear cells were isolated from human peripheral blood by Ficoll-Hypaque density gradient (Amersham Biosciences, Piscataway, USA) essentially as described. The surface glycoprotein glycans were liberated by mild trypsin treatment (80 micrograms/ml in PBS) at +37 degrees Celsius for 2 hours. The intact cells were harvested by centrifugation, and the supernatant containing the liberated glycans (at this stage as cell surface glycoprotein glycopeptides) was taken for further analyses. The harvested cells and the supernatant were subjected to Glycan profiling by protein N-glycosidase as described in the preceding examples. The N-glycan profiles of the supernatant containing the cell surface glycoprotein glycopeptides, were compared against N-glycan profiles of the cells harvested from the trypsin treatment.
N-Glycan analyses of HMC cell surface glycopeptide glycomes. HMC were isolated from peripheral blood, treated with trypsin to release the surface glycoprotein glycopeptides, followed by release of glycopeptide glycans, and subjected to glycol profiling as described under Experimental procedures. In MALDI-TOF mass spectrometry of the sialylated N-glycan fractions, several glycon signals were detected in these samples. When the resulting glycome profile was compared to a corresponding glycome isolated from the trypsin treated cells, it could be observed that many sialylated components were enriched in the surface glycoprotein glycopeptide fraction, whereas some structures appeared to have more intracellular localization. Examples or the former structures are (monosaccharide compositions in parenthesis): m/z [M-H]-1930 (SaHex5HexNAc4), 2221 (Sa2Hex5HexNAc4), 2222 (SaHex5HexNAc4dHex2), 2367 (Sa2Hex5HexNAc4dHex), 2368(SaHex5HexNAc4dHex3), 2587 (SaHex6HexNAc5dHex2), and 3024 (Sa3Hex6HexNAc5dHex). Examples of the latter are m/z 1873(SaHex5HexNAc3dHex), and 2035(SaHexHexNAc3dHex).
Protein-linked glycans were isolated by non-reductive alkaline elimination essentially as described by Huang et al. (2000), or by N-glycosidase digestion to specifically retrieve N-glycans as described in the preceding Examples.
Tissue-specific glycosylation analyses and comparison of glycan profiles between tissues. Human tissue protein-linked glycan profiles were analyzed from lung, breast, kidney, stomach, pancreas, lymph nodes, liver, colon, larynx, ovaries, and blood cells and serum. In addition, cultured human cells were analyzed similarly. Tables 6 and 7 show neutral and acidic protein-linked glycan signals, respectively, observed in these human tissues and cells together with their classification into glycan structure groups. However, the individual glycan signals in each structure group varied from sample type to sample type, reflecting tissue material and cell type specific glycosylation. Importantly, in analyses of multiple samples, such as 10 samples from an individual human tissue type, glycan group feature proportions remain relatively constant with respect to variation in the occurrence of individual glycan signals.
Furthermore, it was observed that each tissue demonstrated a specific glycan profile that could be distinguished from the other tissues, cells, or blood or serum samples by comparison of glycan profiles according to the methods described in the present invention. It was also found out that glycan profile difference could be quantitated by comparing the difference between two glycan profiles, for example according to the Equation (resulting in difference expressed in %):
wherein p is the relative abundance (%) of glycan signal i in profile a or b, and n is the total number of glycan signals. For example, the Equation reveals that human lung and ovary tissue protein-linked glycan profiles differ from each other significantly more than human lung and kidney tissue protein-linked glycan profiles differ from each other. Each tissue or cell type could be compared in this manner.
Comparison of glycosylation features between human tissue materials. Table 8 shows how glycan signal structural classification according to the present invention was applied to the comparison of quantitative differences in glycan structural features in glycan profiles between human tissue materials. The results show that each sample type was different from each other with respect to the quantitative glycan grouping and classification. Specifically, normal human lung and lung cancer tissues were different from each other both in the neutral glycan and sialylated glycan fractions with respect to the quantitative glycan structure grouping. In particular, lung cancer showed increased amounts of glycan signals classified into terminal HexNAc containing glycans. In analysis of individual glycan signals by β-glucosaminidase digestion, it was found that lung cancer associated glycan signals, such as Hex3HexNAc4dHex1, contained terminal β-linked GlcNAc residues, correlating with the classification of these glycan signals into the terminal HexNAc (N>H and/or N═H) glycan groups. Furthermore, the human serum protein-linked glycan profile showed significantly lower amounts of high-mannose and especially low-mannose type N-glycan signals. It is concluded that the glycan grouping profile of human serum is significantly different from the corresponding profiles of solid tissues, and the present methods are suitable for identification of normal and diseased human tissue materials and blood or serum typical glycan profiles from each other.
Disease- and tissue-specific differences in glycan structure groups.
The analysis of protein-linked glycan profiles of human tissues revealed also that tissues with abundant epithelial structures, such as stomach, colon, and pancreas, contain increased amounts of small glycan structures, preferentially mucin-type glycans, and fucosylated glycan structures compared to the other glycan structure groups in structure classification. Similarly as epithelial tissues, mucinous carcinomas were differentiated from other carcinoma types based on analysis of their protein-linked glycan profiles and structure groups according to the methods of the present invention.
Glycan material is liberated from biological material by enzymatic or chemical means. To obtain a less complex sample, glycans are fractionated into neutral and acidic glycan fractions by chromatography on a graphitized carbon as described above. A useful purification step prior to NMR analysis is gel filtration high-performance liquid chromatography (HPLC). For glycans of glycoprotein or glycolipid origin, a Superdex Peptide HR10/300 column (Amersham Pharmacia) may be used. For larger glycans, chromatography on a Superdex 75 HR10/300 column may yield superior results. Superdex columns are eluted at a flow rate of 1 ml per minute with water or with 50-200 mM ammonium bicarbonate for the neutral and acidic glycan fractions, respectively, and absorbance at 205-214 nm is recorded. Fractions are collected (typically 0.5-1 ml) and dried. Repeated dissolving in water and evaporation may be necessary to remove residual ammonium bicarbonate salts in the fractions. The fractions are subjected to MALDI-TOF mass spectrometry and all fractions containing glycans are pooled.
Prior to NMR analysis, the pooled fractions are dissolved in deuterium oxide and evaporated. With glycan preparations containing about 100 nmol or more material, the sample is finally dissolved in 600 microliters of high-quality deuterium oxide (99.9-99.996%) and transferred to a NMR analysis tube. A roughly equimolar amount of an internal standard, e.g. acetone, is commonly added to the solution. With glycan preparations derived from small tissue specimens or from a small number of cells (5-25 million cells), the sample is preferably evaporated from very high quality deuterium oxide (99.996%) twice or more to eliminate H2O as efficiently as possible, and then finally dissolved in 99.996% deuterium oxide. These low-material samples are preferably analyzed by more sensitive NMR techniques. For example, NMR analysis tubes of smaller volumes can be used to obtain higher concentration of glycans. This kind of tubes include e.g. nanotubes (Varian) in which sample is typically dissolved in a volume of 37 microliters. Alteinatively, higher sensitivity is achieved by analyzing the sample in a cryo-NMR instrument, which increases the analysis sensitivity through low electronic noise. The latter techniques allow gathering of good quality proton-NMR data from glycan samples containing about 1-5 nmol of glycan material.
It is realized that numerous studies have shown that proton-NMR data has the ability to indicate the presence of several structural features in the glycan sample. In addition, by careful integration of the spectra, the relative abundancies of these structural features in the glycan sample can be obtained.
For example, the proton bound to monosaccharide carbon-1, i.e. H-1, yields a distinctive signal at the lower field, well separated from the other protons of sugar residues. Most monosaccharide residues e.g. in N-glycans are identified by their H-1 signals (see Tables 4 and 5 for representative examples). In addition, the H-2 signals of mannose residues are indicative of their linkages.
Sialic acids do not possess a H-1, but their H-3 signals (H-3 axial and H-3 equatorial) reside well separated from other protons of sugar residues. Moreover, differently bound sialic acids may be identified by their H-3 signals. For example, the Neu5Ac H-3 signals of Neu5Acα2-3Gal structure are found at 1.797 ppm (axial) and 2.756 ppm (equatorial). On the other hand, the Neu5Ac H-3 signals of Neu5Acα2-6Gal structure are found at 1.719 ppm (axial) and 2.668 ppm (equatorial). By comparing the integrated areas of these signals, the molar ratio of these structural features is obtained.
Other structural reporter signals are commonly known and those familiar with the art use the extensive literature for reference in glycan NMR assignments.
Lysosomal protein sample including human myeloperoxidase was chosen to represent lysosomal organelle glycoproteins. The sample was digested with N-glycosidase F to isolate N-glycans, and they were purified for MALDI-TOF mass spectrometric analysis as described in the preceding Examples.
Alkaline phosphatase digestion was performed essentially according to manufacturer's instructions. After the digestion glycans were purified for MALDI-TOF mass spectrometric analysis as above.
Neutral N-glycan profiles. The neutral N-glycan profile is presented in
Acidic N-glycan profiles. The acidic N-glycan profile is presented in
Phosphorylated N-glycans. Major glycan signals with phosphate or sulphate ester (SP) in their monosaccharide compositions were Hex5HexNAc2SP (1313), Hex6HexNAc2SP (1475), and Hex7HexNAc2SP (1637). When the acidic glycan fraction was subjected to alkaline phosphatase digestion, these major signals were specifically digested and disappeared from the acidic glycan spectrum as detected by MALDI-TOF mass spectrometry (data not shown). In contrast, the major glycan signals with sialic acids in their monosaccharide compositions were not digested, including NeuAc1Hex3HexNAc3dHex1 (1549). This indicates that the three original glycan signals corresponded to phosphorylated N-glycans (PO3H)Hex5HexNAc2, (PO3H)Hex6HexNAc2, and (PO3H)Hex7HexNAc2, respectively, wherein PO3H denotes phosphate ester.
The data further indicated that the present organelle-specific N-glycan profile included phosphorylated low-mannose type and high-mannose type N-glycans (PO3H)Hex3HexNAc2 (989), (PO3H)Hex4HexNAc2 (1151), (PO3H)Hex5HexNAc2 (1313), (PO3H)Hex6HexNAc2 (1475), (PO3H)Hex7HexNAc2 (1637), and (PO3H)Hex8HexNAc2 (1799). In this glycan profile the phosphorylated glycan residues are preferentially mannose residues, more preferentially α-mannose residues, and most preferentially 6-phospho-α-mannose residues i.e. (PO3H-6Manα).
Normal lung (Sample I) and malignant lung tumor samples (Sample II) were archival formalin-fixed and paraffin-embedded tissue sections from cancer patients with small cell lung cancer. Protein-linked glycans were isolated from the representative samples by non-reductive β-elimination, purified, and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples. In the present analysis, the total desilylated protein-linked glycomes from each sample were used.
To analyze the data and to find the major glycan signals associated with either the normal state or the disease, two variables were calculated for the comparison of glycan signals between the two samples:
1. absolute difference A=(SII−SI), and
2. relative difference R=A/SI,
wherein SI and SII are relative abundances of a given glycan signal in Sample I (normal human lung tissue) and Sample II (small cell lung cancer), respectively.
The glycan signals were further classified into structure classes by a one letter code:
abcd,
wherein a is either N (neutral) or S (sialylated); b is either L (low-mannose type), M (high-mannose type), H (hybrid-type or monoantennary), C (complex-type), S (soluble), or O (other); c is either—(nothing), F (fucosylated), or E (multifucosylated); and d is either—(nothing), T (terminal HexNAc, N>H), or B (terminal HexNAc, N═H); as described in the present invention.
To identify protein-linked glycan signals correlating with malignant tumors in total tissue glycomes from cancer patient, major signals specific to either normal lung tissue or malignant small cell lung cancer tumors were selected based on their relative abundances. When A and R were calculated for the glycan profile datasets of the two samples, and the glycan signals thereafter sorted according to the values of A and R, the most significant differing glycan signals between the two samples could be identified (Table 10). Among the most abundant protein-linked glycan signals in the data, the following three signals had emerged in II (new in Table x): 1955, 2685, and 2905, corresponding to fucosylated complex-type N-glycans. The absolute differences of these signals were among the ten most large in the data, indicating that they were significant. The signals that experienced the highest relative increase in R were: 771 (R=2.4, corresponding to 3.4-fold increase), 1905 (R=2.2, corresponding to 3.2 fold increase), and 1485 (R=1.3, corresponding to 2.3 fold increase). The latter signal corresponded to complex-type N-glycans with terminal HexNAc. Significantly, its +2Hex counterpart 1809 was the most drastically reduced glycan signal in II with A=−8.9 and R=−0.4 (corresponding to 40% decrease in II), indicating a large change in terminal HexNAc expression. Moreover, the data easily shows that the glycan signals 1704, 1866, 1136, and 755 were not present in II.
Further, the obtained results, especially the identified major glycan signals indicative of either Sample II (high A and R) or Sample I (low A and R) were used to compile two alternative algorithms to produce glycan score with which lung cancer sample could be identified from normal lung sample based on the glycan signal values of the quantitative glycome data:
1. glycan score=I(1485)−I(1809),
wherein I(1485) is the relative abundance of glycan signal 1485 and I(1809) is the relative abundance of glycan signal 1809;
and alternatively:
2. glycan score=I(1485)/I(1809)
These glycan score algorithms yield high numerical value when applied to lung cancer sample and low numerical value when applied to normal lung sample.
The present identification analysis produced selected glycan signal groups, from where indifferent glycan signals have been removed and that have reduced noise or background and less observation points, but have the resolving power of the initially obtained glycan profiles. Such selected signal groups and their patterns in different sample types can serve as a signature for the identification of for example 1) normal human glycosylation, 2) tissue-specific glycosylation, 3) disease states affecting tissue glycosylation, 4) malignant cancer, 5) malignancy in comparison to benign tumors, and grade of malignancy, or 6) glycan signals that have individual variation. Moreover, glycan signals can be identified that do not change between samples, including major glycans that can be considered as invariant or housekeeping glycans.
The present data analysis identified potential glycan marker signals for future identification of either the normal lung of the lung tumor glycome profiles. Further, glycan classes that are associated with e.g. disease state in humans can be identified. Specifically, the analysis revealed that within the total complex-type N-glycan structure class in the tissue glycomes, terminal HexNAc (N>H) were typical to small cell lung cancer.
The method also allows identification of major glycans or major changes within glycan structure classes. For example, the proportion of multifucosylated glycans within the total tissue glycome profile was increased in II(1.1%) compared to I (0.3%). The data analysis identified this change predominantly to the appearance of glycan signals 1955 and 2685 in II.
Samples from human leukocytes were analyzed. Neutral and acidic glycosphingolipid fractions were isolated from the cells essentially as described (Miller-Podraza et al., 2000; Karlsson et al., 2000). Glycans were detached by Macrobdella decora endoglycoceramidase digestion (Calbiochem, USA) essentially according to manuacturer's instructions, yielding the total glycan oligosaccharide fractions from the samples. The oligosaccharides were purified and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples for the protein-linked oligosaccharide fractions. Proposed compositions for the oligosaccharides and signal nomenclature are presented in Tables 11 and 12 for the neutral and acidic glycan fractions, respectively.
Leukocyte neutral lipid glycans. The analyzed mass spectrometric profile of the glycosphingolipid neutral glycan fraction is shown in
Structural analysis of the major neutral lipid glycans. The four major glycan signals, together comprising more than 75% of the total glycan signal intensity, corresponded to monosaccharide compositions Hex3HexNAc1 (730), Hex2HexNAc1 (568), Hex4HexNAc2 (1095), and Hex3HexNAc1dHex1, (876).
Acidic lipid glycans. The analyzed mass spectrometric profile of the glycosphingolipid sialylated glycan fraction is shown in
Terminal glycan epitopes that were demonstrated in the present experiments in leukocyte glycosphingolipid glycans include, as demonstrated by β1,4-galactosidase, α1,3/4-fucosidase, α1,2-fucosidase, and α2,3-sialidase digestions:
Galβ4GlcNAc (LacNAc type 2)
Non-reducing terminal HexNAc
Human leukocytes from peripheral blood and cultured human cell types were produced essentially as described in the preceding Examples. Glycosphingolipid glycans were isolated from glycolipid fractions isolated from these cells by endoglycoceramidase digestion; O-glycans were isolated by non-reductive alkaline β-elimination with concentrated ammonia in saturated ammonium carbonate; all glycan fractions were isolated with miniaturized solid-phase extraction steps; glycans were analyzed by MALDI-TOF mass spectrometry; terminal glycan epitopes were analyzed by specific exoglycosidase enzymatic digestions combined with analysis by mass spectrometry; the analysis steps were performed as described in the present invention.
Mass spectrometric profiles providing relative quantitative information about glycan signals and specific exoglycosidase digestions, together with antibody, lectin, and biochemical characterization of the cell types as described above, was used to further characterize tissue cell types and compare them with different cell types, here cultured cells. Table 14 describe examples of combinatorial characterization of glycan types associated with each cell type. Average values from cultured human cell types were used in the present comparisons. Analysis of glycolipid and/or O-glycan structures and classes in addition to N-glycan structures and classes yielded a more complete characterization of the cell types, revealed further differences between cell types, and provided more glycan epitopes and classes associated with each cell type. In conclusion, combination of analysis of different glycan types and epitopes was useful in analysis and identification of cell types.
Cell sample is preferably a cell pellet produced at cold temperature by centrifuging cells but avoiding disruption of the cells, optionally stored frozen and melted on ice. The cells are lysed on ice by the detergent 1% SDS, mixed by Vortex and optionally by pipette tip as physical degradation step, boiled on water bath for 5 min and one volume of 10% n-octyl-β-D-glucoside is added after which the sample is incubated at room temperature for 15 min. Detergents are used in amounts of 5 μl for 200 000-3 million cells and 2 μl for 200 000 cells.
Na-phosphate buffer pH 7.3 is added to octyl-glycoside prepared from detergent method so that the added volume is 8 times the volume of the added SDS (or octylglycoside solution), for example 16 μl for reaction containing of 2 μl of each detergent, the final concentration of the phosphate buffer is 20 mM. 2.5 U of NGF (U=1 nmol/min, Calbiochem) is added and the and the reaction is incubated overnight at 37 degrees of Celsius.
Preferred prepurification step in context of N-glycosidase reaction is acetone precipitation. In the prepurification the reaction vessel is cooled on ice. 9 volumes of ice cold acetone is added and reaction is mixed carefully and incubated for 15 min. at −20 degrees of Celsius. The sample is centrifuged at 13 000 by desk eppendorf tube centrifuge. The supernatant is removed and the sample is mixed in 60% (aq, vol/vol) ice cold methanol, 200 μl 60% ice cold methanol is added, the solution is mixed by Vortex and centrifuged as above. The supernatant is collected and the pellet is washed again with methanol as above and supernatant is collected. Methanol is evaporated from the sample and glycan fraction is subjected to further purification.
Commercial “Bond Elut” C18-column of 100 mg of silica matrix can be used for cell up to a few million cells (up to 2-4 million, more preferably less than 2 million cells). 500 mg cartridge is preferred for cell amounts about 10 million or more. The sample volumes of 100-500 μl can be used. The elution volume is about 1 ml and eluent/washing solution is ultrapure water.
Commercial “Alltech Carbograph” graphite column can be used, 150 mg graphite is useful for cell ranges of 200 000-to about 10 million cells, for amounts over 10 million cells (up to at least about 20 million cells) corresponding graphite column containing 300 mg graphite is preferred. The carbon may be also used in tip column format for amounts of 1000-200 000 cells or less the tip column is preferred. The columns separate neutral and acidic glycans. The invention is in specific embodiment directed a carbon column method involving washing the column with ammonium carbonate, this method specifically aimed for effective removal ionic organic type impurities remaining in the column.
The method is useful after the hydrophobic chromatography or in other method for concentration of larger sample volumes. Preferred volumes to be used in the process for 150 mg column include elution of neutral glycans by 2.5 ml of 25% aqueous ACN (acetonitrile) and acidic glycans by 25% ACN-0.05% TFA, for 300 mg column the volumes are 5 ml and for tip-column 40 μl.
A practical range tip column for about 1000-200 000 cells or less is build in a Eppendorf GELoader tip by narrowing the tip close to its narrower end (tip of the tip). The narrowing may be produced by pressing the tip from two sides of the tip. In a preferred embodiment the narrowing is produced during the production of the disposable pipette tip and more preferably the column materials according to the invention for the uses according to the invention are prepacked in the column by the manufacturer. The narrowing is so thin that the carbon particles (preferably graphitized carbon 120/400 mesh) remain in the column. 10 μl of carbon suspension in 50% ACN is poured into the tip column so that a bed 5 μl bed of carbon is formed above the narrowing. Preferred practical tip sizes includes carbon volumes of about 0.1 μl to about 100 μl, more preferably from about 1 μl to about 50 μl. The preferred sample volumes for practical type columns of about 5 to 20 μl varies from about 10 to 100 μl. For elution of neutral oligosaccharides from 5 μl column 40 μl of 25% aqueous ANC is used as above and for acidic glycans corresponding amount of TFA-solution as above.
Millipore ZiptipμC18 tip is packed with 10 μl washed H+ resin (preferably for example BioRad AG50W-X8) in ethanol forming about 5 μl bed of the resin. The column is especially useful combined method for both cationic and hydrophobic impurities. Practical sample volume for 5 μl column is 10 μl and elution volume 20 μl. Alternative resins includes NH4+ counterion resins and combination of cation and anion exchange resins preferably H+/Ac−-resins, preferably so that H+ resin is placed above the Ac resin. Preferred resin volumes include about 0.1 μl to about 100 μl, more preferably from about 1 μl to about 20 μl and most preferably from about 2 μλ τo αβoντ 10 μl, for large volumes larger tip may be used.
10 μl suspension containing cellulose (5 μl) mixed with 50% ethanol in poured in StarLab TipOne filter tip (tip size 0.5-10 μl). Practical sample volume for 5 μl column is produced as follows: sample is soluted 10 or 20 μl, of water, and then mixed with 55 or 100 μl of solvent A (BuOH:EtOH in mixture of 10:1), column is washed by 5×60 μl of solvent B (BuOH: EtOH:H2O in mixture of 10:1:2 vol/vol/vol) and column is eluted by 100 μl of 50% aq EtOH. Preferred cellulose volumes include about 0.1 μl to about 100 μl, more preferably from about 1 μl to about 50 μl and most preferably from about 2 μl to about 301 μl for large volumes larger tip may be used.
Glycan samples: Detached N-glycan samples were separately prepared by N-glycosidase digestion of human cell material prepurified by precipitation by cold acetone, extraction by cold methanol-water solution, and drying as described in the preceding Examples; and N-glycosidase digestion of glycoprotein.
Use of purification device. Thereafter, sample was applied in water to a purification device formed from interconnected miniaturized solid-phase extraction columns as described in the following. The sample was applied in water to a combined bed (1:1) of strong H+ form cation-exchange resin (Bio-Rad) and C18-bonded silica (ZipTip), and the flowthrough was eluted with water directly to graphitized carbon column (Carbograph), wherein the glycans were concentrated. The carbon column was separated from the first column before washing and elution. The carbon column was washed with water, neutral N-glycans were eluted with 25% acetonitrile, and acidic N-glycans were eluted with 25% acetonitrile in 0.05% trifluoroacetic acid.
Analysis of purification efficiency: The eluate was dried and applied onto a MALDI-TOF mass spectrometry plate with MALDI matrix and mass spectrometry and data analysis were performed as described in the preceding Examples. The mass spectrum demonstrated a profile of efficiently purified N-glycans, demonstrating that direct coupling of purification columns produced efficient device for glycan purification.
The glycosphingolipid glycan and reducing O-glycan samples were isolated from human leukocytes and other human cells, analyzed by mass spectrometry, and further analyzed by expoglycosidase digestions combined with mass spectrometry as described in the present invention and the preceding Examples. Non-reducing terminal epitopes were analyzed by digestion of the glycan samples with S. pneumoniae, β1,4-galactosidase (Calbiochem), bovine testes β-galactosidase (Sigma), A. ureafaciens sialidase (Calbiochem), S. pneumoniae α2,3-sialidase (Calbiochem), S. pneumoniae β-N-acetylglucosaminidase (Calbiochem), X. manihotis α1,3/4-fucosidase (Calbiochem), and α1,2-fucosidase (Calbiochem). The results were analyzed by quantitative mass spectrometric profiling data analysis as described in the present invention. The results with glycosphingolipid glycans are summarized in Table 15 including also core structure classification determined based on proposed monosaccharide compositions as described in the footnotes of the Table. Analysis of neutral O-glycan fractions revealed quantitative differences in terminal epitope glycosylation as follows: fucosylation degree of type 2 LacNAc containing O-glycan signals at m/z 771 (Hex2HexNAc2) and 917 (Hex2HexNAc2dHex1) was different between the two sample types.
In conclusion, these results from O-glycans and glycosphingolipid glycans demonstrated significant tissue materials specific differences and also were significantly different from N-glycan terminal epitopes within tissue material type analyzed in the present invention.
The substrate glycans were dried in 0.5 ml reaction tubes. The endo-β-galactosidase (E. freundii, Seikagaku Corporation, cat no 100455, 2.5 mU/reaction) reactions were carried out in 50 mM Na-acetate buffer, pH 5.5 at 37° C. for 20 hours. After the incubation the reactions mixtures were boiled for 3 minutes to stop the reactions. The substrate glycans were purified using chromatographic methods according to the present invention, and analyzed with MALDI-TOF mass spectrometry as described in the preceding Examples.
In similar reaction conditions with 2 nmol of each defined oligosaccharide control, the reaction produced signal at m/z 568 (Hex2HexNAc1) as the major reaction product from lacto-N-neotetraose and para-lacto-N-neohexaose, but not from lacto-N-neohexaose or para-lacto-N-neohexaose monofucosylated at the 3-position of the inner GlcNAc residue; and sialylated signal corresponding to NeuAc1Hex2HexNAc1 from α3′-sialyl-lacto-N-neotetraose. These results confirmed the reported specificities for the enzyme in the employed reaction conditions.
Results with Cellular Glycan Types
Human leukocyte glycosphingolipid glycans. The major digestion product in leukocyte neutral glycosphingolipid glycans was the signal at m/z 568 (Hex2HexNAc1), indicating the presence of non-fucosylated poly-LacNAc sequences. Further, signals at 714 (Hex2HexNAc1dHex1), 1079 (Hex3HexNAc2dHex1), 1225 (Hex3HexNAc2dHex2), and 1736 (Hex4HexNAc3dHex3) indicated the presence of abundant fucosylated poly-LacNAc sequences.
Major sensitive signals included 1095 (Hex4HexNAc2), 1241 (Hex4HexNAc2dHex1), 1460 (Hex5HexNAc3), 1606 (Hex5HexNAc3dHex1), 1752 (Hex5HexNAc3dHex1), 1971 (Hex6HexNAc4dHex1), 2117 (Hex6HexNAc4dHex2), 2263 (Hex6HexNAc4dHex3), 876 (Hex3HexNAc1dHex1), and 933 (Hex3HexNAc2), indicating presence of both linear non-fucosylated and multifucosylated poly-LacNAc. Residual signals left in the sensitive signals after digestion indicated presence of lesser but substantial amounts of also branched poly-LacNAc sequences.
O-glycans from human cells. The major digestion product in BM MSC neutral glycosphingolipid glycans was the signal at m/z 568 (Hex2HexNAc1), indicating the presence of non-fucosylated poly-LacNAc sequences. Further, signals at 714 (Hex2HexNAc1dHex1) and 1225 (Hex3HexNAc2dHex2) indicated the presence of fucosylated poly-LacNAc sequences.
Major sensitive signals included 1095 (Hex4HexNAc2), 1241 (Hex4HexNAc2dHex1), 876 (Hex3HexNAc1dHex1), 1606 (Hex5HexNAc3dHex1), 1460 (Hex5HexNAc3), and 933 (Hex3HexNAc2), indicating presence of both linear non-fucosylated and multifucosylated poly-LacNAc. Residual signals left in the sensitive signals after digestion indicated presence of lesser amounts of also branched poly-LacNAc sequences.
In conclusion, the profiles of endo-β-galactosidase reaction products efficiently reflected cell type specific glycosylation features as described in the preceding Examples and they represent an alternative and complementary method for analysis of cellular glycan types. Further, the present results demonstrated the presence of linear, branched, and fucosylated poly-LacNAc in all studied cell types and in different glycan types including N- and O-glycans and glycosphingolipid glycans; and further quantitative and cell-type specific proportions of these in each cell type, which are characteristic to each cell type.
Methods. To analyze the presence of terminal HexNAc containing N-glycans characterized by the formulae: nHexNAc=nHex≧5 and ndHex≧1 (group I), and to compare their occurrence to terminal HexNAc containing N-glycans characterized by the formulae: nHexNAc=nHex≧5 and ndHex=0 (group II), protein-linked glycans were isolated, purified and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples. They were assigned monosaccharide compositions and their relative proportions within the obtained glycan profiles were determined by quantitative profile analysis as described above. The following glycan signals were used as indicators of the specific glycan groups (monoisotopic masses):
Ia, Hex5HexNAc5dHex1: m/z for [M+Na]+ ion 2012.7
Ib, NeuAc1Hex5HexNAc5dHex1: m/z for [M-H]− ion 2279.8
Ic, NeuAc2Hex5HexNAc5dHex1: m/z for [M-H]− ion 2570.9
Results. As an indicator of group I glycans, Ia was detected in various protein-linked glycan samples isolated from malignant tumor samples, including ductale and lobulare type breast carcinoma, and carcinomas of the pancreas, stomach, lung, kidney, colon, ovaries, and lymph nodes (lymphoma), as well as lymph node metastases from primary stomach carcinoma and lobulare and ductale type breast carcinomas. Specifically, Ia was overexpressed in the malignant tumors of ductale type breast carcinoma when compared to corresponding normal tissues. When the two breast cancer types were more specifically studied with regard to the relationship of group I and II glycan expression, it was found that they expressed Ia, Ib, and Ic, while IIa was not expressed in similar amounts in same tissue samples as group I glycans. More specifically, IIa was not detected in lobulare type breast cancer samples, where Ib was expressed and further overexpressed when compared to the corresponding normal tissue from same individual. Further, of these indicator signals only group I but not group II was expressed in lymph node metastases from these two breast cancer types.
Production of high molecular weight protein subclycome: 500 microliters of human serum is passed through a 10×100 mm gel filtration column such as a Superdex gel filtration column (e.g. Superdex peptide, Superdex 75 or Superdex 200, GE-Amersham(Pharmacia) in a preferred embodiment the void volume of the column corresponds to proteins of about 200 kD, or 300 kD or even 500 kD) in 150 mM ammonium bicarbonate buffer, pH 8.0, and eluted with the before mentioned buffer. Void volume fraction containing high molecular weight proteins, is collected in 1 ml volume. Proteins and other material (e.g peptides and salts) with MW below the cut off of the column are retained in the column, and can be discarded. The eluted material is subjected to N-glycome, and O-glycome analyses as described in respective examples above. In N-glycome analysis, a HMW protein derived N-glycan glycome can be observed. In O-glycome analysis, a HMW protein derived O-glycan glycome can be observed.
Example of a glycan purification method. Isolated oligosaccharides can be purified from complex biological matrices in array format as follows, for example for MALDI-TOF mass spectrometric analysis. Optionally, contaminations are removed by precipitating glycans with 80-90% (v/v) aqueous acetone at −20° C., after which the glycans are extracted from the precipitate with 60% (v/v) ice-cold methanol. After glycan isolation, the glycan preparate is passed in water through a stack of SPE array plates including strong cation-exchange resin (e.g. Thermo HyperSep-96 SCX Plate, 100 mg/1 mL cat no 60300-588), and C18 silica resin (e.g. Thermo HyperSep-96 C18 Plate, 100 mg/1 m cat no L60300-428) and porous graphitized carbon resin (e.g. Thermo HyperSep-96 Hypercarb Plate, 10 mg/mL cat no 60302-606). In these conditions, the contaminants are retained in the cation exchange and reversed phase resins, and the oligosaccharides are trapped into the graphitized carbon resin. To increase purification efficiency, the column stack can be washed with aqueous solutions. After this step the column stack can be dissociated, and the upper two plates can be discarded or subjected to extensive cleaning and reuse.
Optionally Neutral glycans can be washed from the column and separated from sialylated glycans by elution with aqueous organic solvent, such as 25% (v/v) acetonitrile. Sialylated (or optionally both sialylated and neutral, if the first elution step with 25% acetonitrile is omitted) glycans can be eluted from the column by elution with aqueous organic solvent with added acid, such as 0.05% (v/v) trifluoroacetic acid in 25% (v/v) acetonitrile, which elutes both neutral and sialylated glycans. A glycan preparation containing sialylated glycans can be further purified by subjecting it to chromatography on microcrystalline cellulose in n-butanol:ethanol:water (10:1:2, v/v) and eluted by aqueous solvent, preferentially 50% ethanol:water (v/v). Preferentially, glycans isolated from small sample amounts are purified on miniaturized chromatography columns and small elution and handling volumes. An efficient purification method comprises most of the abovementioned purification steps. In an efficient purification sequence, neutral glycan fractions from small samples are purified with methods including carbon chromatography and separate elution of the neutral glycan fraction, and glycan fractions containing sialylated glycans are purified with methods including both carbon chromatography and cellulose chromatography.
Two mixtures of standard N-glycan were prepared from neutral glycans with m/z (as Na ions, see Tables of the invention): 933, 1257, 1485, and 1663. In the first mixture, the glycans were present in equimolar amounts and in the second mixture, the amounts of the glycans 1257 (down to 50%), 1485 (down to 10%), and 1663 were reduced. The mixtures were analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples and the results are shown in
The m/z masses and composition of used N-glycans: 933 N-glycan Man-3 (a) Hex3HexNAc2; 1257 N-glycan Man-5, Hex5HexNAc2; 1663 N-glycan NA2, Hex5HexNAc4; 1485 N-glycan NGA2F, Hex3dHexHexNAc4; and 2028 N-glycan, NA3 Hex6HexNAc5 (Glycoseparations, Inc.).
Hex2HexNAc
568.18
568.09
Hex3HexNAc
730.24
730.18
Hex3HexNAcdHex
876.30
876.27
Hex4HexNAc2
1095.37
1095.36
Hex4HexNAc2dHex
1241.43
1241.42
NeuAcHex3HexNAc
997.34
997.52
NeuAcHex4HexNAc2
1362.47
1362.80
NeuAcHex4HexNAc2dHex
1508.53
1508.89
NeuAcHex5HexNAc3
1727.60
1728.01
NeuAcHex5HexNAc3dHex
1873.66
1874.07
1)See FIG. 17 for structures.
1)See FIG. 18 for structures
2)n.a., not assigned.
3)—, not present.
#Abbreviations:
§Figures indicate percentage of total detected glycan signals.
This Nonprovisional application claims priority under 35 U.S.C. § 119(e) on U.S. Provisional Application No(s). 61/022,200 filed on Jan. 18, 2008, the entire contents of which is hereby incorporated by reference into the present application.
Number | Date | Country | |
---|---|---|---|
61022200 | Jan 2008 | US |