The present invention relates to glycans, which are specifically expressed by certain cancer cells, tumours and other malignant tissues. The present invention describes methods to detect cancer specific glycans as well as methods for the production of reagents binding to said glycans. The invention is also directed to the use of said glycans and reagents binding to them for the diagnostics of cancer and malignancies. Furthermore, the invention is directed to the use of said glycans and reagents binding to them for the treatment of cancer and malignancies.
It is generally acknowledged that cancerous transformation of human tissues is associated with changes in the complex carbohydrate structures, glycans, which are elementary components of the glycoproteins, glycolipids, and proteoglycans that cover all human cell surfaces. Several individual glycan molecular structures have been identified as cancer-associated glycans (Dube & Bertozzi, 2005). These glycan structures have also been pursued as molecular drug targets for treatment of malignant breast cancer (Holmberg & Sandmeier, 2001) and melanoma (Fernandez, 2003). However, the whole spectrum of cancer-associated glycan changes has remained unknown because of lack of suitable analysis technology. The present invention is directed to such novel analytical methods and the application of the methods to analyses of tissue samples from human patients. The methods are applicable to clinical cancer diagnostics. The novel cancer-associated molecules discovered by the inventors are targets for cancer therapy and diagnostics.
The present invention reveals novel methods for producing and analyzing novel carbohydrate compositions, glycomes, from tissues. A preferred use of the present invention is analysis of cancer-associated glycan structures and glycan profiles. The invention further represents methods for analysis of the glycomes, especially mass spectrometric methods, and diagnostic methods thereof to detect cancer. As demonstrated in the Examples of the present invention, the inventors found novel methods to efficiently discriminate between cancerous and healthy tissue samples as well as malignant and benign tumors by mass spectrometric glycome analysis of tissue materials extracted from human patients.
The present invention reveals novel glycosylation features and glycan molecular groups generally associated with major human cancer types. The present invention is especially directed to novel glycosylation features of lung cancer, especially non-small cell lung adenocarcinoma, breast cancer, especially ductale and lobulare breast adenocarcinoma, colon carcinoma, especially colon carcinoma adenomatosum, kidney cancer, especially kidney carcinoma and hypernephroma, ovarian cancer, especially ovarian cystadenocarcinoma, gastric cancer, liver cancer, pancreas cancer, and larynx cancer. The present invention is further directed to novel glycosylation features of benign and malignant human tumors and the discrimination between benign and malignant growth, especially ovarian cystadenoma and ovarian cystadenocarcinoma, and colon adenoma and colon carcinoma adenomatosum. In another embodiment, the present invention is further directed to novel cancer type specific glycosylation features and their use for detection of specific cancer types from tissue materials.
The tissue substrate materials can be total tissue samples and fractionated tissue parts, such as serums, secretions and isolated differentiated cells from the tissues, or artificial models of tissues such as cultivated cell lines. In a preferred embodiment the invention is directed to special methods for the analysis of the surfaces of tissues. The invention is further directed to the compositions and compositions produced by the methods according to the invention, and cancer treatment methods derived thereof.
The invention represents effective methods for purification of glycan fractions from tissues, preferably from animal tissues, and more preferably from human and mammalian tissues, especially in very low scale. The prior art has shown analysis of separate glycome components from tissues, but not total glycomes. It is further realized that the methods according to the invention are useful for analysis of glycans from isolated proteins or peptides. The invention represents effective methods for the practical analysis of glycans from isolated proteins especially from very small amounts of samples.
The invention is further directed to novel quantitative analysis methods for glycomes. Typical glycomes comprise of subgroups of glycans, including N-glycans, O-glycans, glycolipid glycans, and neutral and acidic subglycomes. The glycome analysis produces large amounts of data. The invention reveals methods for the analysis of such data quantitatively and comparision of the data between different samples. The invention is especially directed to quantitative two-dimensional representation of the data and generation of reference data from different clinical states of tissue materials to detect disease-associated changes in glycosylation. The invention is further directed to simultaneous analysis of multiple cancer-associated glycan changes to detect cancer or clinical state of cancer.
The preferred analysis method includes:
The invention is further directed to structural analysis of glycan mixtures present in tissue samples and using the cancer-associated glycan structures in cancer therapy. Preferred forms of cancer therapy according to the present invention include glycan specific antibodies and cancer vaccines for passive or active immunotherapy against the cancer-associated glycan molecules, respectively.
A. deuteroacetylated m/z 899 [M+Na]+ glycan fragment at m/z 944, and
B. deuteroacetylated m/z 608 [M+Na]+ glycan fragment at m/z 653. The m/z figures in the x-axis correspond to the approximate m/z values of either [M+Na]+ or [M+H]+ adduct ions. The figures in the y-axis correspond to relative glycan signal intensities. The nomenclature of the fragment ions is after Domon and Costello (1988).
The present invention revealed that quantitative analysis of human glycomes is useful for analysis of human cancers. The glycome analysis revealed profiles of glycan expression, which can be used for the analysis of glycomes even without knowledge of exact structures of glycans. The present invention is directed in a preferred embodiment quantitative mass spectromettic profiling of human cancers according to the invention and analysis of alterations in cancer in comparision with normal corresponding normal tissues. The analysis can be performed based on signals corresponding to glycan structures, these signals were translated to likely monosaccharide compositions and further analysed to reveal structures and correlations between the signals. The invention is especially directed to analysis of N-glycan and/or O-glycan glycomes derived from cancer proteins. The glycans are analysed as neutral and/or acidic signals and glycan mixtures, multiple analysis methods are preferred to obtain maximal amount of data. The invention is also directed to methods for analysis of mixture of N-glycans and O-glycans released together.
Preferred N-Glycan and/or O-Glycan Glycomes and Characteristic Glycan Groups Therein
The invention revealed glycan groups, which are characteristically changed in cancer in protein derived N-glycans and O-glycans. Among N-glycans preferred N-glycans are glycans with non-reducing end terminal Man-residues (terminal Man-glycans), and complex type N-glycans with specific structural characteristics such as specific HexNAc-structures or fucosylstructures.
Preferred O-glycans characteristic for cancer are referred as LacNAc O-glycans, which comprise at least one N-acetyllactosamine unit with possible sialic acid and or fucosyl-modifications, and which is linked a specific O-glycan core type Galβ3GalNAc forming characteristic monosaccharide compositions in O-glycan mass spectra.
The invention is especially directed to analysis of glycomes as groups of related structures such as low-Man, high-Man, hybrid type, fucosylated, complex N-glycans and subgroups thereof etc. as these biosynthetically related groups characterize cancer tissues and cells. The invention is especially directed to methods of calculation %-part of a specific glycan groups and comparing the % values, e.g. in form of Table. It is notable that the glycan groups may occasionally comprise unusual/uncharacteristics glycans, which do not exactly correspond the title of group, but presence of such material would likely increase the characteristics of the analysis in comparisions between tissue materials. The glycan score analysis according to the invention revealed especial usefulness of the analysis by glycan groups.
Preferred terminal Man-glycans includes low-Man and high-Man glycans and acidic derivatives thereof, especially phosphorylated derivative. The invention revealed that there is alterations in high-Man type glycans, but that there is especially characteristic alterations in low-Man and acidic glycans. The invention revealed that the low-Man glycans share much similarity with glycans produced by lysosomal enzyme for lysosomal proteins,
It is further realized that key alterations in glycomes can be also analysed by other methods such as specific binding reagents after altering structure has been determined. The invention is directed to analysis of altering structures when the amount of the structure increases or decreases in a specific cancer. The invention is most preferably directed to use of a binding reagent with regard to a structure, which increases in cancer.
The analysis by binding method molecule may be preferred as a fast test, though the current mass spectrometric screening method is also quite fast and cost effective, only draw being requirement of mass spectrometer, which includes some capital investment for the method. The analysis by the specific binder may be also performed directly from the tissue and better information of tissue and/or cellular localization of the materials can be obtained. It also realized that combinations of at least two binding molecules recognizing different structures would be especially useful for analysis of cancer and multiple selcted specific binding molecules would approach the effectivity of the mass spectrometric screening methods.
The invention reveled characteristic structures among the N- and O-glycan glycomes. The invention is especially directed to these structures as targets for recognition by specific binding molecules. The invention is furthermore directed to the screening methods. The preferred structures to be recognized among the glycan groups according to the invention includes preferred terminal groups preferably recognized on the preferred glycan core structures according to the invention.
Preferred terminal Man-glycans includes low-Man and high-Man glycans and acidic derivatives thereof, especially phosphorylated derivative. The preferred terminal Man glycans comprise non-reducing end Man-residue(s), which Manα- or Manβ-residue, preferably being either one or more Manα-residues or a single Manβ-residue. Preferably the glycan comprise the Man residue(s) and optionally an additional Fuc-residue (preferably Fucα, more preferably Fucα6-branching residue at the reducing end GlcNAc residue) as the only non-reducing end terminal monosaccharide types.
When the terminal Man residue is Manβ-residue, the structure is the minimal low mannose glycan
Manβ4GlcNAcβ4(Fucα6)0oriGlcNAcβAsn. The preferred minimal epitopes to be recognized includes Manβ and terminal disaccharides and tri/tetrasaccharides either including reducing end anomeric structure or not and/or reducing end amino acid residue and/or part of the peptide chain of the potential carrier protein (marked in following by Asn) Manβ, Manβ4GlcNAc, Manβ4GlcNAcβ, Manβ4GlcNAcβ4GlcNAc, Manβ4GlcNAcβ4(Fucα6)GlcNAc, Manβ4GlcNAcβ4GlcNAcβ, Manβ4GlcNAcβ4(Fucα6)0oriGlcNAcβ, Manβ4GlcNAcβ4GlcNAcβAsn, and Manβ4GlcNAcβ4(Fucα6)GlcNAcβAsn.
The specificity of the binding reagent such as binding protein should such that the binding side covers the terminal Manβ-so that the binding molecule does not cross-react with elongated N-glycans, or less preferably cross reacts only with other preferred low-Man structures according to the invention such as the core structures elongated only by single Manα3/6-residue. The invention is directed to beta-mannosidase enzymes and corresponding engineered lectins used for sequencing N-glycans as examples specific binding reagents. The preferred minimum epitopes includes Manβ, Manβ4GlcNAc, Manβ4GlcNAcβ in a preferred embodiment so that the branching fucosyl residue does not affect the binding, or when fucose specific recognition is needed, the binding molecule is selected so that the residue affects binding in desired manner. It is realized that smaller binding epitopes are generally enough for specific recognition as similar structures are rare/non-existing in animal/human materials.
The preferred Manα-residue comprising target low-Man glycans includes both isomers of dimannosyl structures, preferably ones comprising Manα6Man, which was analysed to have higher predominance at least in part of cancers, the trimannosyl core structure low-Man glycans, tetra-Mannosylisomers and
the pentamannosyl Low-Man glycans according to the invention. The dimannosyl, trimannosyl- and pentamannosylstructures are preferred structures, and the branched trimannosyl and pentamannosylstructures are especially preferred due to prevalence observed and larger homogeneity (less isomers in target). The pentamannosyl structure is especially preferred as a major control step glycan toward low-Man glycans, and the trimannosyl-glycans is separately a preferred control step glycan with different branched accessibility for mannosidases and for synthesis of lower size low mannoses.
The terminal epitopes of antibodies against low-Man structures should recognize linear and/or branched Manα3/6Man-epitopes. The minimal epitopes includes non-reducing end terminal disaccharide and trisaccharide epitopes of the low-Man N-glycans.
Preferred minimal disaccharide epitopes for recognition of Low-Man glycan includes the disaccharides without next anomeric linkage Manα3Man, Manα6Man, and more specific epitopes disaccharides with next anomeric linkage depending on the level of the Manα3Manβ, Manα6Manβ, Manα3Manα and Manα6Manα.
Preferred branched trisaccharides includes Manα3(Manα6)Man, Manα3(Manα6)Manβ, and Manα3(Manα6)Manα. It is furthermore realized that the binding specificity may included further structures in the core of the N-glycans, but the core structures and especially possible branching Fucα-residue does not interfere with the binding. In a specific embodiment the terminal epitope is designed to be recognized by Fuc-sensitive manner.
The specificity of the binding reagent such as binding protein should such that the binding side covers the terminal Manα3/6-structures so that the binding molecule does not cross-react with elongated complex type N-glycans (GlcNAc-modifications on Manα3/6). The invention is directed to such Manα3/6-specific N-glycan core specific antibodies and lectins. The invention is further directed to development of Manα3/6-specific mannosidases to corresponding engineered lectins as examples specific binding reagents.
Examples of useful antibody types for the recognition of low-Man glycans in cancer include:
The invention is directed to recognition of specific complex type N-glycans with terminal GlcNAcβ2Manα3/6-structures by antibodies binding to the terminal structures, preferably by antibodies similar to and produced by similar method as described for non-reducing terminal GlcNAcβ2Man specific antibody OMB4 in Ozawa H. et al. (1997) Arch. Biochem. Biophys. 342, 48-57. The present inventors has been revealed terminal GlcNAc-oligosaccharide sequence recognizing natural human antibodies (PCT/FI2003/000615). Such antibodies can be selected from by page display technologies and produced by recombinant antibody technologies. The invention is further directed to the use of such antibodies in context of cancer glycomes preferably with one or more other antibodies according to the invention. The present inventors has been revealed terminal [NeuNAcα]0oriGalNAcβ4GlcNAc-oligosaccharide sequence recognizing natural human antibodies (e.g. U.S. application Ser. No. 10/486,714). Such antibodies can be selected from by page display technologies and produced by recombinant antibody technologies. The invention is further directed to the use of such antibodies in context of cancer glycomes preferably with one or more other antibodies according to the invention.
Preferred O-glycans characteristic for cancer are referred as LacNAc O-glycans, which includes preferred Groups 2 and 3, and which comprise at least one N-acetyllactosamine unit with possible sialic acid (group 3 or c in discussion) and or fucosyl-modifications, and which is linked a specific O-glycan core type Galβ3GalNAc forming characteristic monosaccharide compositions in O-glycan mass spectra.
The preferred structures among the LacNAc O-glycans includes core II structure-type O-glycan structures comprising core oligosaccharide sequence LacNAcβ6GalNAc, more preferably the branched structure LacNAcβ6(Galβ3)GalNAc, even more preferably LacNAcβ6(Galβ3)0 or 1GalNAcα, which is in a preferred embodiment recognizable in a protein linked form from a cancer sample. The preferred LacNAc unit is type Galβ4GlcNAc, though it is realized that the types of LacNAcs may vary between cancer types.
Preferred terminal structures to be recognize from the preferred core II O-glycans includes at least the terminal epitope LacNAcβ6GalNAc, with possible modifications, thus including structures according to the Formula:
{NeuNAcαX}n2Galβ4[(Fucα3)]n1GlcNAcβ6[(Galβ3)]n3GalNAc[α]n4[Ser/Thr]n5
wherein X is linkage position 3 or 6,
( ) indicates branch in the structure and
n1, n2, n3, n4 and n5 are 0 or 1, independently.
The invention is in a preferred embodiment directed to the recognition of minimal epitopes (when n4 and n5 are 0) with (n3 is 1) or without the branch (n3 is 0), which is the very minimal epitope, but could be also more easily achieved by an antibody or another binding molecule. The invention is especially directed to the binders of the minimal structures when the structure is further complicated by fucose or sialic acid substitutions. It realized that on tissue proteins the non-galactosylated form is a potential degradation form of the actual core 2 and these are can occur together. The actual core 2 structures are preferred as actual major target structures, when the Galβ3- is not recognized by the binding molecule, the specificity of the binder such as an antibody is preferably such as the Gal-structure is not preventing the binding, i.d. the antibody has dual specificity for Gal and non-Gal-structures.
The invention is especially directed to recognition of minimal structures by reagents which do not include additional specificity directing the reagents to glycolipids as known for certain antibodies or preferably the antibody favours the recognition of the epitopes on a protein or part of the peptide sequence such as Ser/Thr-residue is included in the antibody specificity. Preferred terminal sialylated (group 3) structures minimal structures to be recognized includes NeuNAcα3/6Galβ4(Fucα3)GlcNAcβ6(Galβ3)GalNAc, NeuNAcα3/6Galβ4GlcNAcβ6(Galβ3)GalNAc, NeuNAcα3/6Galβ4(Fucα3)GlcNAcβ6GalNAc, and NeuNAcα3/6Galβ4GlcNAcβ6GalNAc and
the preferred terminal non-sialyated minimal (group 2) structures include Galβ4(Fucα3)GlcNAcβ6(Galβ3)GalNAc, Galβ4GlcNAcβ6(Galβ3)GalNAc, Galβ4(Fucα3)GlcNAcβ6GalNAc, and Galβ4GlcNAcβ6GalNAc.
Preferred terminal sialylated (group 3) peptide epitope including structures to be recognized includes NeuNAcα3/6Galβ4(Fucα3)GlcNAcβ6(Galβ3)GalNAcα Ser/Thr, NeuNAcα3/6Galβ4GlcNAcβ6(Galβ3)GalNAcα Ser/Thr, NeuNAcα3/6Galβ4(Fucα3)GlcNAcβ6GalNAcα Ser/Thr, and NeuNAcα3/6Galβ4GlcNAcβ6GalNAcα Ser/Thr.
the preferred terminal peptide epitope including structures non-sialyated (group 2) structures include Galβ4(Fucα3)GlcNAcβ6(Galβ3)GalNAcα Ser/Thr, Galβ4GlcNAcβ6(Galβ3)GalNAcα Ser/Thr, Galβ4(Fucα3)GlcNAcβ6GalNAcα Ser/Thr, and Galβ4GlcNAcβ6GalNAcα Ser/Thr. 3/6 indicates either of the linkages of the sialic acid and Ser/Thr either of the linkage amino acid residues.
The neutral non-fucosylated structures Galβ4GlcNAcβ6GalNAc and Galβ4GlcNAcβ6(Galβ3)GalNAc are especially preferred in α-anomeric forms and to be specifically recognized in alfa-anomeric form, preferably linked to an Ser/Thr residue and/or being recognizable on protein and/or preferably being recognized on cancer protein. Background describes such structure on a lipid and it is known that similar branched structures do occur on galactosylglobosides (at least in mice) and GalNAcα-substitutable by branching β6-GlcNAc transferases is not known from human glycolipids. As the target structures are by different chemical linkage on different carrier both factors effective affecting immunrecognition, there is clear difference to the very limited and unique background in single cancer type.
Examples of useful antibody types fro the recognition of neutral core II O-glycans in cancer include antibodies reported to bind Galβ4GlcNAcβ6(Galβ3)GalNAc (Hep27-Mouse monoclonal antibody, Sandee D. et al. (2002) J. Biosci. Bioengin. (1993) 266-273, possible related application JP10084963, TOSOH corp; different antibody JP6046880 Ryuichi Horie. et al), and Galβ4GlcNAcβ6GalNAc (Chung Y-S et al. EP0601859, TOSOH corp). The Sandee publication reports reactivity of the antibody with a human cultivated hepatocellular carcinoma cell line (HCC-S102, not actual cancer), fetal liver, possible undisclosed cancers (relevance to human cancers or to human cultivated cell lines cannot be known) and not to adult liver, the article discusses heaptocellular carcinoma HCC based on the cell line result. Galβ4GlcNAcβ6GalNAcα Cer (lipid) binding antibody (FI alpha-75) was suggested for cancers of digestive organ, specifically stomach (Chung Y-S et al.), results show 79% reaction with stomach cancers, when faint labelling is counted, 38% reactivity with colon cancers and, 57% for colon cancers in immunohistochemistry. The antibodies were reported to be specific for glycolipids but the present invention reveals that this type of antibodies are also useful for analysis of cancer glycoproteins. The invention furthermore reveals new indication for the antibodies, especially one specific for Galβ4GlcNAcβ6(Galβ3)GalNAc. Though the prior art implied certain indications usefulness for other indications cannot be known and based on the interest of the company producing the antibody it would appear likely that they would have tested all possible cancers but unfortunately failed with revealing the novel present indications, to which phe present invention is especially directed to.
Examples of useful antibody types fro the recognition of acidic core II O-glycans in cancer include antibody CHO-131 (Walcheck B. et al. Blood (2002) 99, 4063-69) reported to bind specifically core II sLex glycan NeuNAcα3Galβ4(Fucα3)GlcNAcβ6(Galβ3)GalNAcα Ser/Thr-peptide. The antibodies were reported to be specific for certain immune cells but the present invention reveals that this type of antibodies are also useful for analysis of cancer glycoproteins according to the invention.
The present invention furthermore reveals novel indications for core 2-sLex structures in human cancer.
The invention is further directed to methods of analysing any of the glycans according to the invention from cancer derived proteins, preferably integral (cell bound/transmembrane) cancer tissue or cell released proteins and assigning the glycan structures with specific carrier protein, preferably by specific purification of the protein, e.g. by affinity methods such as immunoprecipitation or by sequencing, preferably by mass spectrometric sequencing, glycopeptides including sequencing and recognizing peptides and thus proteins linked to the proteins.
The present invention is directed to analysis of un-normally transformed tissues, when the transformation is benign and/or malignant cancer type transformation referred as cancer (or tumor). It is realized that benign transformation may be a step towards malignant transformation, and thus the benign cancers are also useful to be analysed and differentiated from normal tissue, which may have also non cancerous or non-transformation related alterations such as swelling or trauma related to physical or e.g. infectious trauma, and it is useful and preferred to differentiate with benign and malignant cancers
Preferably the tissue is human tissue or tissue part such as liquid tissue, cell and/or solid polycellular tumors, and in another embodiment preferably a solid human tissue. The solid tissues are preferred for the analysis and/or targeting specific glycan marker structures from the tissues, including intracellulalarily and extracellullarily, preferably cell surface associated, localized markers. In a preferred embodiment the invention is specifically directed to the recognition of cell surface localized and/or mostly cell surface localized marker structures from solid tumor tissues or parts thereof. It is realized that the contacts between cells and this glycomes madiateing these are are affected by presence of cells as solid tumor or as more individual cells. The preferred individual cell type cancers or tumors include preferably blood derived tumors such as leukemias and lymphomas, while solid tumors are preferably includes solid tumors derived from solid tissues suchs gastrointestinal tract tissues, other internal organs such as liver, kidneys, spleen, lungs, gonads and associated organs including preferably ovary, testicle, and prostate. The invention further reveals markers from individually or multicellularily presented cancer cells in contrast to solid tumors. The preferred cancer cells to be analyzed includes metastatic cells released from tumors/cancer and blood cell derived cancers, such as leukemias and/or lymphomas. Metastasis from solid tissue tumors forms a separately preferred class of cancer samples with specific characteristics.
The invention is furthermore directed to the anlysis of secretions from tumors such as ascites and/or cyst fluids, and cancer secreted materials present in general body fluids such as blood (or its derivatives such as serum nor plasma), urine, mucous secretions, amnion fluid, lymphatic fluid or spinal fluid, preferably blood, urine or mucous secretions, most preferably blood.
The cancer tissue materials to be analyzed according to the invention are in the invention also referred as tissue materials or simply as cells, because all tissues comprise cells, however the invention is preferably directed to unicellularily and/or multicellularily expressed cancer cells and/or solid tumors as separate preferred characterisitics. The invention further reveals normal tissue materials to be compared with the cancer materials. The invention is specifically directed to methods according to the invention for revealing status of transformed tissue or suspected cancer sample when expression of specific structure of a signal correlated with it is compared to a expression level estimated to correspond to expression in normal tissue or compared with the expression level in an standard sample from the same tissue, preferably a tissue sample from healthy part of the same tissue from the same patient.
The invention is in a preferred embodiment directed to analysis of the marker structures and/or glycome profiles from both cancer tissue and corresponding normal tissue of the same patient because part of the glycosylations includes individual changes for example related to rare glycosylation related diseases such as congenital disorders in glycosylation (of glycoproteins/carbohydrates) and/or glycan storage diseases. The invention is furthermore directed to method of verifying analysing importance and/or change of a specific structure/structure group or glycan group in glycome in specific cancer and/or a subtype of a cancer optionally with a specific status (e.g. primary cancer, metastase, benign transformation related to a cancer) by methods according to the present invention.
The present invention is directed to a set of glycan structures which are expressed by human cancers. The presence or increased amount of one or more of these glycans in a patient sample indicates cancerous status of the sample as described below in detail. The glycan structures can be divided to three basic groups:
1) neutral low-mannose type N-glycans having terminal Man, preferentially Manα structure; for example Manα0-3Manβ4GlcNAcβ4GlcNAc(β-N-Asn) or Manα0-4Manβ4GlcNAcβ4(Fucα6)GlcNAc(β-N-Asn);
2) neutral O-glycans having terminal Galβ4 structure; for example Galβ4[(Fucα3)]nGlcNAcβ6(Galβ3)GalNAc[αSer/Thr]p wherein p and n are either independently 0 or 1; and
3) sialylated Core 2 type O-glycans having terminal SAα3 structure, for example Neu5Acα3(Galβ4 [±Fucα3]GlcNAcβ3)0-2Galβ4 [±Fucα3]GlcNAcβ(6→GalNAc) and Neu5Acα3(Galβ3 [±Fucα4]GlcNAcβ3)0-2Galβ4 [±Fucα3]GlcNAcβ(6→GalNAc).
Groups 2 and 3 can be considered together as terminal LacNAc O-glycan group. The group 1 may be further included in a larger group of terminal man-glycans including high-Man glycans, which also represent cancer specific changes, but in a more modest scale.
The invention further revealed structures and structural groups, which presence or alterations of amounts of which are characteristic for cancercers such as an additional group of HexNAc comprising complex N-glycans, which can be further divided to two groups and level of which would give more specific information about the cancer. The HexNAc comprising N-glycans can be divided to two major groups:
Group 4) glycans with terminal (NeuAcα)0-1HexNAcβHexNAcβ sequences, preferably terminal LacdiNAc sequences
Group 5) terminal β-linked GlcNAc glycans
Besides the major groups above the present invention reveal additional groups such as
Group 6) a group of complex N-glycans including fucosylated multiple N-acetyllactosmine N-glycans revealed as extensively altered structures e.g. in analysis of Table 18 and
Group 7) including multiple fucosylated (or deoxyhexose, dHex, comprising) structures, e.g. present in samples of pancreatic cancer (Tables 11-13 and corresponding examples) and
Group 8) including phosphorylated and/or sulphated glycans, which are also preferred as separate groups and modifications of terminal Man-glycans.
The invention revealed furthermore useful additional glycan groups such as: control groups and/or glycan groups with usually modest changes, such as regular complex type glycans or structural groups otherwise characteristic such as blood groups structure glycans, which have carry over in the metastasis samples.
The invention revealed that mass spectrometric profiling of glycan group is very effective method for analysing cancer samples as multiple characteristic groups can be analyzed simultaneously.
Low-mannose N-glycans are smaller and more rare than the common high-mannose N-glycans (Man5-9GlcNAc2). The low-mannose N-glycans detected in tumor tissues fall into two subgroups: 1) non-fucosylated, with composition MannGlcNAc2, where 1≦n≦4, and 2) core-fucosylated, with composition MannGlcNAc2Fuc1, where 1≦n≦5. The largest of the detected low-mannose structure structures is Man5GlcNAc2Fuc1 (m/z 1403 for the sodium adduct ion), which due to biosynthetic reasons most likely includes the structure below (in the figure the glycan is free oligosaccharide and β-anomer; in glycoproteins in tissues the glycan is N-glycan and β-anomer):
The low Man glycans described above can be presented in a single Formula:
[Manα2]n1[Manα3]n2([Manα2]n3[Manα6)]n4)[Manα6]n5([Manα2]n6[Manα2]n7[Manα3]n8)Manβ4GlcNAcβ4[(Fucα6)]mGlcNAc[β-N-Asn]p
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0; the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); and preferably n1, n3, n6, and n7 are 0 when m is 0; [ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
( ) indicates a branch in the structure.
Preferred non-fucosylated low-mannose glycans are according to the formula:
[Manα3]n1[(Manα6)]n2[Manα6]n3[(Manα3)]n4Manβ4GlcNAcβ4GlcNAc[βAsn]p
wherein p, n1, n2, n3, n4 are either independently 0 or 1,
with the proviso that when n3 is 0, also n1 and n2 are 0, and preferably either n1 or n2 is 0,
[ ] indicates determinant either being present or absent
depending on the value of n1, n2, n3, n4,
( ) indicates a branch in the structure.
Preferred fucosylated low-mannose glycans are according to the formula:
[Manα3]n1[(Manα6)]n2[Manα6]n3[(Manα3)]n4Manβ4GlcNAcβ4(Fucα6)GlcNAc[βAsn]p
wherein p, n1, n2, n3, n4 are either independently 0 or 1, with the provisio that when n3 is 0,
also n1 and n2 are 0, [ ] indicates determinant either being present or absent
depending on the value of n1, n2, n3, n4, ( ) indicates a branch in the structure;
and wherein n1, n2, n3, n4 and m are either independently 0 or 1,
with the provisio that when n3 is 0, also n1 and n2 are 0,
[ ] indicates determinant either being present or absent
depending on the value of n1, n2, n3, n4 and m,
( ) indicates a branch in the structure.
The group 2 represents neutral O-glycans. The major structures represent O-glycans with LacNAc epitope (Galβ4GlcNAc), and fucosylated so-called Lewis x-structure (Galβ4[Fucα3]GlcNAc). The preferred structures include the Core 2 type O-glycans of the figures below. Alternative variants include Galβ4(Fucα3)0-1GlcNAcβ3Galβ3GalNAc Core 1 type O-glycan structures (in the figures the glycans are free oligosaccharides and β-anomers; in glycoproteins in tissues the glycans are O-glycans and α-anomers):
Based on the enzymatic digestion data and the release by β-elimination the neutral O-glycans include the following preferred structures
Galβ4[(Fucα3)]nGlcNAcβX[(]mGalβ3[)]mGalNAc[αSer/Thr]p
wherein p, n and m are either independently 0 or 1, [ ] indicates determinant either being present or absent depending on the value of m and n, ( ) indicates a branch in the structure. X is 3, when m is 0; and X is 6 when m is 1.
The most preferred structures are accoding to the formula (when m is 1)
Galβ4[(Fucα3)]nGlcNAcβ6(Galβ3)GalNAc[αSer/Thr]p
wherein
p and n are either independently 0 or 1.
In another embodiment the the neutral O-glycans are according to the formula
Galβ4[(Fucα3)]nGlcNAcβ3Galβ3GalNAc[αSer/Thr]p
wherein
p and n are either independently 0 or 1.
The Core 2 O-glycan structures are likely produced in Golgi apparatus through Core 2 structure GlcNAcβ6(Galβ3)GalNAcα, the inventors have also analysed larger Core 2 glycans from tissues, as described in the Examples.
The group 3 represents sialylated O-glycans. The major structure is a Core 2/4 type O-glycan with sialyl-LacNAc, NeuNAcα3Galβ4GlcNAcβ6(R-3)GalNAcα Ser/Thr. The R substituent at 3-position of GalNAc is preferentially β1,3-linked Gal (Core 2) or β1,3-linked GlcNAc (Core 4).
The present structures are different from sialyl-Tn, T and sialyl-T O-glycan structures indicated previously for cancer. Previously sialylation of Core 1 has been considered to prevent Core 2 synthesis in certain cancer models, leading to increased expression of small O-glycan antigens, such as sialyl-Tn. The present invention shows opposite result, the increase of Core 2 structures in tumours, especially in malignant vs. benign tumours.
The group 3 structures are a large group of sialylated O-glycans, which have in common the O-glycan core structure GlcNAcβ6(R-3)GalNAc(α-Ser/Thr), where R is a possibly variable structure. In the conditions used for glycan isolation, typical fragment structures are produced from these glycans. The most typical such fragment (at m/z 899 for the sodium adduct ion) is depicted in
The most preferred structures of group 3 are according to the formula (when m is 1):
NeuNAcαX1{Galβ4[(Fucα3)]n1GlcNAcβX2}mGalβ4[(Fucα3)]n2GlcNAcβ6([NeuNAcαX3]n3Galβ3)GalNAc[αSer/Thr]p
wherein n1, n2, n3 and p are either independently 0 or 1, m is 0, 1 or 2, { } and [ ] indicate determinant either being present or absent depending on the value of m, n1 and n2 ( ) indicates a branch in the structure. When m is 2, either or both of the Galβ4[(Fucα3)]n1GlcNAc-units may be fucosylated
X1, X2, and X3 are 3 or 6, more preferably 3
When m is 2 one X2 may be 3 and the other one 6 in a branched structure on the next Gal residue.
The present invention is directed to a method of evaluating the malignancy of a patient sample comprising the step of detecting the presence of cancer related oligosaccharide sequences in the sample, said oligosaccharide sequences comprising structures with terminal monosaccharides Manα, Galβ4, and SAα3.
Furthermore, the present invention is directed to a method of evaluating the malignancy of a patient sample comprising the step of detecting the presence or amount of a cancer related oligosaccharide sequence in the sample, said oligosaccharide sequence comprising any one of the structures from the following groups 1-3, more preferably from Groups 1-5 and/or optionally any one of the structures in Groups 5-8 and/or additional groups according to the invention.
The invention is further directed to method including analysis of at least low-mannose (abbreviated low-Man) one structure according of group 1. The invention is further directed to method including analysis of at least one LacNAc O-glycans one structure of groups 2 or 3.
To increase the effectivity of analysis a mixture of low-Man and LacNAc O-glycans or at least one low-Man and one LacNAc O-glycans is analyzed. In a preferred embodiment neutral glycans of groups 1 and 2 are analyzed.
To increase the specificity of the anlysis one or two structures of Groups 1-3, preferably as preferred above is analyzed with at least one structure (preferably 1, 2 or 3 structures) selected from the groups 4-5, or more preferably selected from the groups 4-8, preferably so that when when at least two structures are selected, these are selected from different groups.
Group 1) Low-Mannose N-Glycans with Formula
[Manα2]n1[Manα3]n2([Manα2]n3[Manα6)]n4)[Manα6]n5([Manα2]n6[Manα2]n7[Manα3]n8)Manβ4GlcNAcβ4[(Fucα6)]mGlcNAc[β-N-Asn]p
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0; the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); and preferably n1, n3, n6, and n7 are 0 when m is 0; [ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
( ) indicates a branch in the structure.
Preferred structures according to the present invention are described in the Examples, in which association of the structures with cancer was found in major human cancer types.
Group 2) Neutral O-Glycans with Formula
Galβ4[(Fucα3)]nGlcNAcβX[(]mGalβ3[)]mGalNAc[α-O-Ser/Thr]p
wherein p, n and m are either independently 0 or 1, [ ] indicates determinant either being present or absent depending on the value of m and n, ( ) indicates a branch in the structure. X is 3, when m is 0; and X is 6 when m is 1; and
Preferred structures according to the present invention are described in the Examples, in which association of the structures with cancer was found in major human cancer types.
Group 3) Sialylated O-Glycans with Formula
SAaX1{Galβ4[(Fucα3)]n1GlcNAcβX2}mGalβ4[(Fucα3)]n2GlcNAcβ6([SAaX3]n3Galβ3)GalNAc[α-O-Ser/Thr]p
wherein n1, n2, n3 and p are either independently 0 or 1; m is 0, 1 or 2; X1 and X3 are independently 3 or 6, however in the most preferred embodiment both X1 and X3 are 3; X2 is either 3 or 6, however in the most preferred embodiment X2 is 3; SA is a sialic acid residue, preferentially Neu5Ac or Neu5Gc, however in the most preferred embodiment SA is Neu5Ac; { } and [ ] indicate determinant either being present or absent depending on the value of m, n1 and n2; ( ) indicates a branch in the structure.
Preferred structures according to the present invention are described in the Examples, in which association of the structures with cancer was found in major human cancer types.
The inventors found that an increased amount of any one of the abovementioned cancer-related oligosaccharide sequences in said patient sample indicates the cancerous nature of the sample. Furthermore, simultaneous increase of more than one of the abovementioned cancer-related oligosaccharide sequences is highly indicative of cancer, especially when more than one of the abovementioned oligosaccharide sequences, even more preferentially selected from more than one of the abovementioned oligosaccharide sequence groups, is simultaneously expressed in elevated amounts compared to healthy human tissues. The abovementioned glycan structure groups were found to be cancer-associated in all cancer types studied.
It was found that the increased amount of the abovementioned oligosaccharide sequences was indicative of the malignancy of human tumors, though no such increase was found in benign tumors of the colon and the ovaries. The inventors found that malignant and benign growth in human patients could be distinguished by analyzing protein-linked glycan structures for the expression of the abovementioned oligosaccharide sequences according to the present invention. Furthermore, the analysis specificity was increased by combination with analysis of cancer type specific glycan features, as described below. The present invention is especially directed to cancer diagnostics or analysis of clinical state of cancer by analysing several glycan structures simultaneously according to the invention.
The present findings are considered medically very interesting. The novel methods to detect and diagnose cancer described in the present invention can be used in the clinical setting e.g. to give tools and data for decisions how to treat human patients. It is realized that ability to detect malignant cancer and to differentiate between benign and malignant tumors is of utmost impostance in terms of efficient clinical decision-making and selection of the correct therapy.
The inventors further discovered that individual differences occur in normal tissue glycosylation and tumor-associated glycosylation changes, and in some patients the cancer-associated glycan changes are more prominent than in others. The present invention is further directed to using the methods for selecting patients for most effective therapy options according to their individual glycosylation profiles and the glycosylation profiles expressed in the disease, preferentially in the malignant tumor.
The inventors also found that changes in the expression of two additional glycan structure groups were indicative of malignant cancer in tumors originating from specific tissues, in addition to the abovementioned oligosaccharide sequences. These indicative glycan structures can be used to detect cancer or to distinguish malignant and benign growth in human patients, either in combination with the abovementioned three glycan groups, or separately.
Both of these structure groups are characterized by a common feature, the presence of a non-reducing terminal β-linked N-acetylhexosamine residue (HexNAcβ). In glycan profiling analyses where monosaccharide compositions can be assigned to analysed glycans, these glycans are indicated among the resulting glycan signals by the formula:
n(HexNAc)>n(Hex)≧2,
wherein n(component) in the amount of the monosaccharide component in a glycan molecular formula. Oligosaccharide sequences that fulfil the formula can be used to distinguish between normal and cancerous tissue materials, and/or benign and malignant tumors according to the present invention. Preferentially, the presence or amount of these oligosaccharide sequences is determined, and optionally compared with the presence or amount of other types of oligosaccharide sequences in the sample and/or specifically chosen oligosaccharide sequences groups according to the present invention. However, these HexNAcβ structures were found to be different in specific human tissues and tumors originating from them, as described below in more detail and described in the Examples. The present findings and uses thereof as described in the present invention are considered novel and medically significant.
Group 4) Glycans with Terminal (NeuAcα)0-1HexNAcβHexNAcβ Sequences
The inventors analyzed protein-linked glycans from patients with ovarian cancer or benign ovarian tumors. The patient samples were from multiple patients with benign ovarian cystadenoma or malignant ovarian cystadenocarcinoma, and a sample from normal ovary. It was found that terminal HexNAcβ structures were present in all the samples in slightly elevated amounts compared to other human tissues, but specifically in benign ovarian tumors these structures were highly increased and were among the major glycan components, as described in the Examples of the present invention. However, in corresponding malignant tumors, the amounts of terminal HexNAcβ structures were even decreased compared to the normal ovary. These changes were found to be consistent in all the studied samples, indicating that the phenomenon is common in the human ovary and its tumors. Detection of the ovary-specific terminal HexNAcβ oligosaccharide sequence was found to effectively distinguish between benign and malignant tumors of the ovary, and contribute to the separation of the normal and malignant ovary samples.
The inventors characterized the ovary-specific HexNAcβ structures in further detail, as described in the Examples of the present invention. The results suggested that the observed glycan structures associated with benign ovarian tumors structures include GalNAcβGlcNAcβ and Neu5AcαGalNAcβGlcNAc non-reducing terminal sequences, or non-sialylated and sialylated di-N-acetyllactosediamine (LacdiNAc), occurring mainly in N-glycans. The inventors have previously characterized LacdiNAc structures in human tumors and described novel methods and reagents for the detection and modification of LacdiNAc structures as well as harnessing immune responses against LacdiNAc structures. However, the differences between normal, benign, and malignant tissue materials and especially ovarian tissue materials of the present invention are novel. It is realized that the present indication represents further uses also for previously described methods and reagents, and the present invention is specifically directed to using LacdiNAc-specific reagents and methods for the detection of cancer, preferentially ovarian cancer and especially to distinguish between glycans or glycoconjugates originating from benign and malignant ovarian tumors or normal ovarian tissue.
In a typical embodiment of the present invention, ovary-associated terminal HexNAcβ sequences are detected and their presence indicates normal ovary tissue or benign growth of the ovary. However, in a more preferred embodiment the ovary-associated terminal HexNAcβ sequences are quantitated and their increased amount, compared to other human tissues or normal ovary, indicates presence of normal ovary tissue or benign growth of the ovary. In contrast, malignant tumors of the ovary do not show similar increased amounts of HexNAcβ sequences. In an even more preferred embodiment of the present invention, ovary-associated oligosaccharide sequences are profiled according to the present invention, and the relative amounts of terminal HexNAcβ sequences are compared to the other oligosaccharide sequences present in the sample. Guidelines for recognition of terminal HexNAcβ sequences and oligosaccharide sequences for comparison are described below. In another embodiment of the present invention, experimental analysis signals corresponding to terminal HexNAcβ oligosaccharide sequences as such, or other cancer-associated oligosaccharide sequences recognized in the present invention, for example mass spectrometric signals, are used for evaluation of the cancerous status of a sample.
According to the present invention, ovary tissue and tumor samples contain HexNAcβ oligosaccharide sequences as defined by the formula above, more specifically terminal HexNAcβHexNAcβ structures. Typically, HexNAcβHexNAcβ structures include oligosaccharide sequences containing the motifs HexmHexNAcm+1, HexmHexNAcm+3, or HexmHexNAcm+5 in their monosaccharide compositions. Another useful group definition of HexNAcβ oligosaccharide sequences according to the present invention includes glycans that are susceptible to the action of β-hexosaminidase, but not to β-glucosaminidase, as described in the Examples. Typical mass spectrometric signals, monosaccharide compositions, and corresponding oligosaccharide sequences indicative of the cancerous status of a patient sample are further described in the Examples, and the present invention is specifically directed to using these signals, monosaccharide compositions, and the corresponding oligosaccharide sequences for the evaluation of the cancerous status of a sample. For practical reasons, the amounts of the HexNAcβHexNAcβ oligosaccharide sequences can be approximated and/or extrapolated from the monosaccharide compositions and experimental evidence from previous analyses of similar tissues, and also these approximations are suitable for effective diagnostic results, as shown in the Examples.
In approximate order of increasing specificity, oligosaccharide sequences useful for comparison include total glycans present in the sample, (1) sialylated or (2) neutral glycans, total N-glycans, total complex-type glycans, (1) sialylated or (2) neutral complex-type glycans, total complex-type N-glycans, (1) sialylated or (2) neutral complex-type N-glycans, and normal glycans corresponding to HexNAcβ glycans. In the present list, (1) and (2) indicate glycan groups useful for comparison of (1) sialylated and (2) neutral HexNAcβ oligosaccharide sequences, respectively. The normal glycans in the latter definition have Hex substituted for HexNAc in their monosaccharide compositions, and may be defined for example as oligosaccharide sequences containing the motifs Hexm+1HexNAcm, Hexm+2HexNAcm+1, and Hexm+3HexNAcm+2 in their monosaccharide compositions, when present in the same sample as HexNAcβ oligosaccharide sequences containing the motifs HexmHexNAcm+1, HexmHexNAcm+3, and HexmHexNAcm+5 in their monosaccharide compositions, respectively. For example, the normal monosaccharide composition motif Hex5HexNAc4 corresponds to the HexNAcβ composition Hex3HexNAc6. Another useful group of oligosaccharide sequences for comparison include those that are susceptible to the action of β-glucosaminidase, as described in the Examples of the present invention.
Practical procedures for comparison of samples and analysis results with regard to HexNAcβ structures and ovarian tissue and tumor patient samples as well as methods for detection and quantitation of oligosaccharide sequences are described in the present invention.
In glycan profiling analyses where monosaccharide compositions can be assigned to analysed glycans, these glycans are indicated among the resulting glycan signals by the formula:
n(HexNAc)>n(Hex)≧2,
wherein n(component) in the amount of the monosaccharide component in a glycan molecular formula. Oligosaccharide sequences that fulfil the formula can be used to distinguish between normal and cancerous tissue materials according to the present invention. Preferentially, the presence or amount of these oligosaccharide sequences is determined, and optionally compared with the presence or amount of other types of oligosaccharide sequences in the sample and/or specifically chosen oligosaccharide sequences groups according to the present invention. In general, terminal GlcNAcβ oligosaccharide sequences are susceptible to β-glucosaminidase as well as β-hexosaminidase and other enzymes such as specific β1,4-galactosyltransferase, which together with glycan profiling according to the present invention can be used to distinguish between cancerous and healthy tissue samples. However, as described in the present invention, glycosylation is tissue specific and cancer type specific, and information about GlcNAcβ structures can be extrapolated from the reference information of glycosylation described in the present invention, and additional glycosylation information produced by the methods of the present invention.
The inventors have previously characterized GlcNAcβ structures in human tumors and described novel methods and reagents for the detection and modification of GlcNAcβ structures as well as harnessing immune responses against GlcNAcβ structures. However, the detection method of GlcNAcβ structures by approximation through the formula above in glycan profiling of tissue materials, methods for specific comparison with other oligosaccharide sequence groups present in the sample, and especially the combination of data about HexNAcβ oligosaccharide sequences with the other cancer-associated oligosaccharide sequences described in the present invention to increase resolution power of the method are novel. It is realized that the present invention represents further uses also for the previously described methods and reagents, and the present invention is specifically directed to using GlcNAcβ-specific reagents and methods for the detection of cancer, especially in conjunction with the other oligosaccharide sequence groups described above.
The cancer related oligosaccharide sequences described herein can be a part of a glycolipid, a part of a glycoprotein, and/or a part of a N-acetyllactosamine chain. The cancer specific oligosaccharide sequences can also be a part of glycolipids, a part of N-linked glycans or O-linked glycans of glycoproteins, free oligosaccharides, or glycans such as glycopeptides. Defects or changes in biosynthetic and/or biodegradative pathways of tumors lead to the synthesis of the cancer related oligosaccharide sequences both on glycolipids and glycoproteins.
The term “oligosaccharide sequence” indicates that the monosaccharide residue/residues in the sequence are part of a larger glycoconjugate, which contains other monosaccharide residues in a chain, which may be branched, or may have natural substituted modifications of oligosaccharide chains. The oligosaccharide chain is normally conjugated to a lipid anchor or to a protein. In a preferred embodiment the oligosaccharide sequences according to the present invention are non-reducing terminal oligosaccharide sequences, which means here that the oligosaccharide sequences are not linked to other monosaccharide or oligosaccharide structures except optionally from the reducing end of the oligosaccharide sequence. The oligosaccharide sequence when present as conjugate is preferably conjugated from the reducing end of the oligosaccharide sequence, though other linkage positions which are tolerated by the antibody/binding substance binding can also be used. In a more specific embodiment the oligosaccharide sequence according to the present invention means the corresponding oligosaccharide residue which is not linked by natural glycosidic linkages to other monosaccharide or oligosaccharide structures. The oligosaccharide residue is preferably a free oligosaccharide or a conjugate or derivative from the reducing end of the oligosaccharide residue.
In one embodiment of the invention the cancer specific oligosaccharides are detected for the diagnostics of cancer or tumor.
Preferably the tumor specific oligosaccharide sequence is detected by a specific binding substance which can be an aptamer, lectin, peptide, or protein, such as an antibody, a fragment thereof or genetically engineered variants thereof. More preferably the specific binding substance is divalent, oligovalent or polyvalent. Most preferably the binding substance is a lectin or an antibody.
Specific binding combinatorial chemistry libraries can be used to search for the binding molecules. Saccharide binding proteins, antibodies or lectins can be engineered, for example, by phage display methods to produce specific binders for the structures of the invention. Labelled bacteria or cells or other polymeric surfaces containing molecules recognizing the structures can be used for the detection. Oligosaccharide sequences can also be released from cancer or tumor cells by endoglycosidase enzymes. Alternatively oligosaccharides can be released by protease enzymes, resulting in glycopeptides. Chemical methods to release oligosaccharides or derivatives thereof include, e.g., ozonolysis of glycolipids and beta-elimination or hydrazinolysis methods to release oligosaccharides from glycoproteins. Alternatively the glycolipid fraction can be isolated. A substance specifically binding to the cancer specific oligosaccharide sequences can also be used for the analysis of the same sequences on cell surfaces. Said sequences can be detected e.g. as glycoconjugates or as released and/or isolated oligosaccharide fractions. The possible methods for the analysis of said sequences in various forms also include NMR spectroscopy, mass spectrometry and glycosidase degradation methods. Preferably at least two analysis methods are used, especially when methods of limited specificity are used.
Analysis of Multiple Cancer Specific Structures Simultaneously from Mass Spectrometric Profiles
The present invention is especially directed to the analysis and/or comparision of several analytical signals, preferably mass spectrometry signals produced from a sample comprising total fraction of oligosaccharides released from a cancer or a tumor sample. A single mass spectrum of an oligosaccharide fraction comprise a profile of glycosylation and multiple peaks indicating the potential presence of the oligosaccharide sequences and potential presence of cancer specific oligosaccharide sequences and altered levels thereof in comparison to normal tissue sample or a benign tumour sample. The profiles are determined preferably by MALDI-TOF mass spectrometry as described in the Examples. The total oligosaccharide fraction corresponds preferably to the total fraction of protein oligosaccharides, preferably comprising at least one cancer or tumor specific oligosaccharide sequence according to the invention. In another preferred embodiment the total oligosaccharide fraction comprises at least one cancer or tumor specific O-glycosidic and one N-glycosidic oligosaccharide according to the invention. The present invention is further directed to analysis of the multiple mass spectrometric signals after the total oligosaccharide fraction is released from a cancer or tumor sample is subjected to an enzymatic or a chemical digestion step. The enzymatic digestion is preferably performed by a glycosidase enzyme, preferably selected from the group: galactosidase, sialidase, N-acetylhexosaminidase, N-acetylglucosaminidase, fucosidase, or mannosidase.
The present invention is also directed to the use of the tumor specific oligosaccharide sequences or analogs or derivatives thereof to produce polyclonal or monoclonal antibodies recognizing said structures using following process: 1) producing synthetically or biosynthetically a polyvalent conjugate of an oligosaccharide sequence of the invention or analogue or derivative thereof, the polyvalent conjugate being, for instance, according to the following structure: position C1 of the reducing end terminal of an oligosaccharide sequence (OS) comprising the cancer specific sequence described in the present invention is linked (-L-) to an oligovalent or a polyvalent carrier (Z), via a spacer group (Y) and optionally via a monosaccharide or oligosaccharide residue (X), forming the following structure
[OS—(X)n-L-Y]m—Z
wherein integer m has values m>1 and n is independently 0 or 1; L can be oxygen, nitrogen, sulfur, or a carbon atom; X is preferably lactosyl-, galactosyl-, poly-N-acetyl-lactosaminyl, or part of an O-glycan or an N-glycan oligosaccharide sequence, Y is a spacer group or a terminal conjugate such as a ceramide lipid moiety or a linkage to Z; 2) immunizing an animal or human with polyvalent conjugate together with an immune response activating substance. Preferably the oligosaccharide sequence is polyvalently conjugated to an immune response activating substance and the conjugate is used for immunization alone or together with an additional immune response activating substance. In a preferred embodiment the oligosaccharide conjugate is injected or administered mucosally to an antibody-producing organism with an adjuvant molecule or adjuvant molecules. For antibody production the oligosaccharide or analogs or derivatives thereof can be polyvalently conjugated to a protein such as bovine serum albumin, keyhole limpet hemocyanin, a lipopeptide, a peptide, a bacterial toxin, a part of peptidoglycan or immunoactive polysaccharide or to another antibody production activating molecule. The polyvalent conjugates can be injected to an animal with adjuvant molecules to induce antibodies by routine antibody production methods known in the art.
Antibody production or vaccination can also be achieved by analogs or derivatives of the cancer specific oligosaccharide sequences. Simple analogs of the N-acetyl-group containing oligosaccharide sequences include compounds with modified N-acetyl groups, for example, N-alkyls, such as N-propanyl.
According to the invention it is possible to use the tumor specific oligosaccharide sequences for the purification of antibodies from serum, preferably from human serum. The cancer specific oligosaccharides or derivatives or analogs, such as a close isomer, can also be immobilized for the purification of antibodies from serum, preferably from human serum. The present invention is directed to natural human antibodies that bind strongly to the cancer specific oligosaccharide sequences described in the present invention.
The cancer specific oligosaccharide sequences can also be used for detection and/or quantitation of the human antibodies binding to the cancer specific oligosaccharide sequences, for example, in enzyme-linked immunosorbent assay (ELISA) or affinity chromatography type assay formats. The detection of human antibodies binding to the cancer specific oligosaccharide sequences is preferably aimed for diagnostics of cancer, development of cancer therapies, especially cancer vaccines against the oligosaccharide sequences described in the present invention, and search for blood donors which have high amounts of the antibodies or one type of the antibody.
Furthermore, it is possible to use human antibodies or humanized antibodies against the cancer specific oligosaccharide sequences to reduce the growth of or to destroy a tumor or cancer. Human antibodies can also be tolerated analogs of natural human antibodies against the cancer specific oligosaccharide sequences; the analogs can be produced by recombinant gene technologies and/or by biotechnology and they may be fragments or optimized derivatives of human antibodies. Purified natural anti-tumor antibodies can be administered to a human patient without any expected side effect as such antibodies are transferred during regular blood transfusions. This is true under conditions that the cancer specific structures are not present on normal tissues or cells and do not vary between individuals as blood group antigens do. In another embodiment of the invention species specific animal antibodies are used against a tumor or cancer of the specific animal. The production of specific humanized antibodies by gene engineering and biotechnology is also possible: the production of humanized antibodies has been described in U.S. Pat. Nos. 5,874,060 and 6,025,481, for example. The humanized antibodies are designed to mimic the sequences of human antibodies and therefore they are not rejected by immune system as animal antibodies are, if administered to a human patient. It is realized that the method to reduce the growth of or to destroy cancer applies both to solid tumors and to cancer cells in general. It is also realized that the purified natural human antibodies recognizing any human cancer specific antigen, preferably an oligosaccharide antigen, can be used to reduce the growth of or to destroy a tumor or cancer. In another embodiment species specific animal antibodies are used against a tumor or cancer of the specific animal.
According to the invention human antibodies or humanized antibodies against the cancer specific oligosaccharides, or other tolerated substances binding the tumor specific oligosaccharides, are useful to target toxic agents to tumor or to cancer cells. The toxic agent could be, for example, a cell killing chemotherapeutics medicine, such as doxorubicin (Arap et al., 1998), a toxin protein, or a radiochemistry reagent useful for tumor destruction. Such therapies have been demonstrated in the art. The toxic agent may also cause apoptosis or regulate differentiation or potentiate defense reactions against the cancer cells or tumor. In another embodiment of the invention species specific animal antibodies are used against a tumor or cancer of the specific animal. The cancer or tumor binding antibodies according to the present invention can be also used for targeting prodrugs active against tumor or enzymes or other substances converting prodrugs to active toxic agents which can destroy or inhibit tumor or cancer, for example in so called ADEPT-approaches.
The therapeutic antibodies described above can be used in pharmaceutical compositions for the treatment or prevention of cancer or tumor. The method of treatment of the invention can also be used when patient is under immunosuppressive medication or he/she is suffering from immunodeficiency.
It is realized that numerous other agents besides antibodies, antibody fragments, humanized antibodies and the like can be used for therapeutic targeting of cancer or tumors similarly with the diagnostic substances. It is specifically preferred to use non-immunogenic and tolerable substances to target cancer or tumor. The targeting substances binding to the cancer or tumor comprise also specific toxic or cytolytic or cell regulating agents which leads to destruction or inhibition of cancer or tumor. Preferably the non-antibody molecules used for cancer or tumor targeting therapies comprise molecules specifically binding to the cancer or tumor specific oligosaccharide sequences according to the present invention are aptamers, lectins, genetically engineered lectins, glycosidases and glycosyltransferase and genetically engineered variants thereof. Labelled bacteria, viruses or cells or other polymeric surfaces containing molecules recognizing the structures can be used for the cancer or tumor targeting therapies. The cancer or tumor binding non-antibody substances according to the present invention can also be used for targeting prodrugs active against cancer or tumor or for targeting enzymes or other substances converting prodrugs to active toxic agents that can destroy or inhibit cancer or tumor.
Furthermore the present invention is directed to methods for the detection of the pathogenic entities or activities by the invention. The specific transfer of modified monosaccharides to the pathogenic entities allows the detection of the pathogenic entities. For this purpose the modification of the monosaccharide need not to be toxic. The monosaccharide is modified by a label substance like a tag substance including for example an antigen detectable by an antibody, biotin, digotoxigenin, digitoxin or a directly detectable substance with examples of fluorescent substance like rhodamine or fluorescein or substance with chemiluminesence activity or phosphorence substance or a specific molecular mass marker detectable by mass spectrometry.
In a preferred embodiment the modified monosaccharide is labeled with two label compounds, which are more preferentially a tag substance and a directly detectable substance and most preferentially a tag substance like biotin and a mass spectrometry label. The label substance is preferentially linked through a spacer to the modified monosaccharide. The invention is also directed to the use of said carbohydrate for diagnostics of the pathogenic entities and diseases related to them. The invention is specifically directed to the use of said carbohydrate/carbohydrates for diagnostics of infections, cancer and malignancies. The invention is especially directed to the use of immunologically active or toxic carbohydrate for the treatment diseases like infections, cancers and malignancies. Preferentially the cell surface carbohydrates are labeled by the modified monosaccharide. Modified monosaccharides aimed for detection are useful for detection of certain congenital disorders of glycosylation and under-sialylated LDL. Especially useful are labeled nucleotide sugars.
Furthermore, according to the invention the cancer specific oligosaccharide sequences or analogs or derivatives thereof can be used as cancer or tumor vaccines in man to stimulate immune response to inhibit or eliminate cancer or tumor cells. The treatment may not necessarily cure cancer or tumor but it can reduce tumor burden or stabilize a cancer condition and lower the metastatic potential of cancers. For the use as vaccines the oligosaccharides or analogs or derivatives thereof can be conjugated, for example, to proteins such as bovine serum albumin or keyhole limpet hemocyanin, lipids or lipopeptides, bacterial toxins such as cholera toxin or heat labile toxin, peptidoglycans, immunoreactive polysaccharides, or to other molecules activating immune reactions against a vaccine molecule. A cancer or tumor vaccine may also comprise a pharmaceutically acceptable carrier and optionally an adjuvant. Suitable carriers or adjuvants are, e.g., lipids known to stimulate the immune response. The saccharides or derivatives or analogs thereof, preferably conjugates of the saccharides, can be injected or administered mucosally, such as orally or nasally, to a cancer patient with tolerated adjuvant molecule or adjuvant molecules. The cancer or tumor vaccine can be used as a medicine in a method of treatment against cancer or tumor. Preferably the method is used for the treatment of a human patient. Preferably the method of treatment is used for the treatment of cancer or tumor of a patient, who is under immunosuppressive medication or the patient is suffering from immunodeficiency.
Furthermore it is possible to produce a pharmaceutical composition comprising the cancer specific oligosaccharide sequences or analogs or derivatives thereof for the treatment of cancer or tumor. Preferably the pharmaceutical composition is used for the treatment of a human patient. Preferably the pharmaceutical composition is used for the treatment of cancer or tumor, when patient is under immunosuppressive medication or he/she is suffering from immunodeficiency. The methods of treatment or the pharmaceutical compositions described above are especially preferred for the treatment of cancer or tumor diagnosed to express the cancer specific oligosaccharide sequences of the invention. The methods of treatment or the pharmaceutical compositions can be used together with other methods of treatment or pharmaceutical compositions for the treatment of cancer or tumor. Preferably the other methods or pharmaceutical compositions comprise cytostatics, anti-angiogenic pharmaceuticals, anti-cancer proteins, such as interferons or interleukins, or use of radioactivity.
Use of antibodies for the diagnostics of cancer or tumor and for the targetting of drugs to cancer has been described with other antigens and oligosaccharide structures (U.S. Pat. No. 4,851,511; U.S. Pat. No. 4,904,596; U.S. Pat. No. 5,874,060; U.S. Pat. No. 6,025,481; U.S. Pat. No. 5,795,961; U.S. Pat. No. 4,725,557; U.S. Pat. No. 5,059,520; U.S. Pat. No. 5,171,667; U.S. Pat. No. 5,173,292; U.S. Pat. No. 6,090,789; U.S. Pat. No. 5,708,163; U.S. Pat. No. 5,902,725 and U.S. Pat. No. 6,203,999). Use of cancer specific oligosaccharides as cancer vaccines has also been demonstrated with other oligosaccharide sequences (U.S. Pat. No. 5,102,663; U.S. Pat. No. 5,660,834; U.S. Pat. No. 5,747,048; U.S. Pat. No. 5,229,289 and U.S. Pat. No. 6,083,929).
The present invention is specifically directed to analysis of abnormal and normal glycosylation structures from human tumors and cancers and use of the analytical information for the production of therapeutic antibodies or cancer vaccines according to the invention. To achieve effective therapeutic response, it is preferred that the specific cancer type in the patients to be treated expresses cancer-associated glycans according to the present invention. The present invention is specifically directed to individually targeted treatment of cancer including following steps:
The data in the Examples shows the usefulness of the combination of analysis of the cancer specific structures according to the invention, because there are individual variations in glycosylation of tumors and normal tissues. The normal tissue close to tumor may also be partially contaminated by materials secreted by tumor that may be taken to consideration when analyzing the normal tissue data.
The substance according to the invention can be attached to a carrier. Methods for the linking of oligosaccharide sequences to a monovalent or multivalent carrier are known in the art. Preferably the conjugation is performed by linking the cancer specific oligosaccharide sequences or analogs or derivatives thereof from the reducing end to a carrier molecule. When using a carrier molecule, a number of molecules of a substance according to the invention can be attached to one carrier increasing the stimulation of immune response and the efficiency of the antibody binding. To achieve an optimal antibody production, conjugates larger than 10 kDa carrying typically more than 10 oligosaccharide sequences are preferably used.
The oligosaccharide sequences according to the invention can be synthesized, for example, enzymatically by glycosyltransferases, or by transglycosylation catalyzed by a glycosidase enzyme or a transglycosidase enzyme, for review see Ernst et al. (2000). Specificities of the enzymes and their use of co-factors such as nucleotide sugar donors, can be engineered. Specific modified enzymes can be used to obtain more effective synthesis, for example, glycosynthase is modified to achieve transglycosylation but not glycosidase reactions. Organic synthesis of the saccharides and conjugates of the invention or compounds similar to these are known (Ernst et al., 2000). Carbohydrate materials can be isolated from natural sources and be modified chemically or enzymatically into compounds according to the invention. Natural oligosaccharides can be isolated from milks of various ruminants and other animals. Transgenic organisms, such as cows or microbes, expressing glycosylating enzymes can be used for the production of saccharides.
It is possible to incorporate an oligosaccharide sequence according to the invention, optionally with a carrier, in a pharmaceutical composition, which is suitable for the treatment of cancer or tumor in a patient. Examples of conditions treatable according to the invention are cancers in which the tumor expresses one or more of the tumor specific oligosaccharides described in the invention. The treatable cancer cases can be discovered by detecting the presence of the tumor specific oligosaccharide sequences in a biological sample taken from a patient. Said sample can be a biopsy or a blood sample.
The pharmaceutical composition according to the invention may also comprise other substances, such as an inert vehicle, or pharmaceutically acceptable carriers, preservatives etc., which are well known to persons skilled in the art.
The substance or pharmaceutical composition according to the invention may be administered in any suitable way. Methods for the administration of therapeutic antibodies or vaccines are well known in the art.
The term “treatment” used herein relates to both treatment in order to cure or alleviate a disease or a condition, and to treatment in order to prevent the development of a disease or a condition. The treatment may be either performed in an acute or in a chronic way.
The term “patient”, as used herein, relates to any mammal in need of treatment according to the invention.
When a cancer specific oligosaccharide or compound specifically recognizing cancer specific oligosaccharides of the invention is used for diagnosis or typing, it may be included e.g. in a probe or a test stick, optionally in a test kit. When this probe or test stick is brought into contact with a sample containing antibodies from a cancer patient or cancer cells or tissue of a patient, components of a cancer positive sample will bind the probe or test stick and can be thus removed from the sample and further analyzed.
In the present invention the term “tumor” means solid multicellular tumor tissues. Furthermore the term “tumor” means herein premalignant tissue, which is developing to a solid tumor and has tumor specific characteristics. The present invention is preferably directed to primary human cancer samples. It is well known that glycosylations in cultivated cancer cells vary and are not in general relevant with regard to cancer. It is also known that transfections, cell culture media and dividing solid tumor to single cells may have daramatic effects for glycosylations. When referring to therapies tumor specific oligosaccharides or oligosaccharide sequences (possibly occasionally referred as cancer specific oligosaccharides/oligosaccharide sequences) are targeted for treatment of all kinds of cancers and tumors. The term cancer includes tumors.
The present invention is specifically directed to the treatment of all types of cancer or tumors expressing the tumor specific oligosaccharide sequences according to the present invention. Examples of preferred cancer types includes cancers of larynx, colon cancer, stomach cancer, breast cancer, lung cancer, kidney cancer, pancreas cancer, and ovarian cancer.
Glycolipid and carbohydrate nomenclature is according to recommendations by the IUPAC-IUB Commission on Biochemical Nomenclature (Carbohydr. Res. 1998, 322:167; Carbohydr. Res. 1997, 297:1; Eur. J. Biochem. 1998, 257:29).
It is assumed that Gal, Glc, Man, GlcNAc, GalNAc, and NeuNAc are of the D-configuration, Fuc of the L-configuration, and that all monosaccharide units are in the pyranose form. Glucosamine is referred as GlcN and galactosamine as GalN. Glycosidic linkages are shown partly in shorter and partly in longer nomenclature, the linkages α3 and α6 of the NeuNAc-residues mean the same as α2-3 and α2-6, respectively, and β1-3, β1-4, and β1-6 can be shortened as β3, β4, and β6, respectively. Lactosamine or N-acetyllactosamine or Galβ3/4GlcNAc means either type one structure residue Galβ3GlcNAc or type two structure residue Galβ1-4GlcNAc, and SA is sialic acid, NeuAc or NeuGc, preferentially NeuSAc, Lac refers to lactose and Cer is ceramide. Hex is any hexose, preferably Man, Gal, or Glc; HexNAc is any N-acetylhexosamine, preferably GlcNAc or GalNAc; and dHex is preferably Fuc.
Glycomes—Novel Glycan Mixtures from Tissue Samples
The present invention reveals novel methods for producing novel carbohydrate compositions, glycomes from animal tissues, preferably from vertebrates, more preferably human and mammalian tissues. The tissue substrate materials can be total tissue samples and fractionated tissue parts, such as serums, secretions and isolated differentiated cells from the tissues, or artificial models of tissues such as cultivated cell lines.
The invention revealed that the glycan structures on cell surfaces vary between the various tissues and same tissues under changing conditions, especially cancer.
The glycan structures on cell surfaces in general have been known to have numerous biological roles. Thus the knowledge about exact glycan mixtures from cell or tissue surfaces is important for knowledge about the status of cells. The invention revealed that multiple conditions affect the cells and cause changes in their glycomes.
The inventors were able to release or isolate various glycan fractions from tissue materials, which are useful for the characterization of the cancer cellular material. The glycans or major part thereof are released preferably from glycoproteins or glycolipids of tissue samples. The invention is specifically directed to such glycan fractions. The glycan fractions of tissue samples comprise typically multiple, at least about 10 “glycan mass components” typically corresponding at least ten glycans and in most cases clearly more than 10 glycan structures.
The glycan mass components correspond to certain molecular weights observable by mass spectrometry and further correspond to specific monosaccharide composition or monosaccharide compositions. Each monosaccharide component is normally present in a glycan as glycosidically linked monosaccharide residue in the nonreducing end part of glycan and the reducing end monosaccharide may be in free alditol form or modified for example by reduction or conjugated to an reducing end modifying reagent well known in the art or to one, two or several amino acids in case of glycopeptides. Monosaccharide composition can be obtained from molecular mass in a mass spectrum (glycan mass component) after correcting potential effect of the ion forms observable by the specific mass spectrometry technology such as protonation/deprotonation, Na+, K+, Li+, or other adduct combinations, or isotope pattern derived effects. The monosaccharide compositions are calculated by fitting mixtures of individual monosaccharide (residue) masses and modification groups to corrected molecular mass of glycan mass component. Typically the molecular mass of fitting composition and the experimental mass correspond to each other very closely with similar first and even second decimals with optimal calibration.
The fitting may be further checked by measuring the experimental mass difference from the smaller and/or larger glycan mass component next in the putative biosynthetic series of a glycan type and comparing the difference with the exact molecular mass of corresponding monosaccharide unit (residue), typically the mass differences of fitting components in a good quality mass spectrum and with correct marking of peaks in decimals, preferaby in second or third decimal of the mass number depending on the resolution of the specific mass spectrometric method. For optimal mass accuracy, an internal calibration may be used, where two or more known component's mass peaks are used to re-calculate masses for each component in the spectrum. Such calibration components are preferably selected among the most abundant glycan signals present in the glycan profiles, in the case of human or other animal cell derived glycan profiles most preferably selected among the most abundant glycan signals present e.g. in Figures described in the present invention.
The monosaccharide composition includes monosaccharide component names and number, typically as subscript, indicating how many of the individual mass components are present in the monosaccharide composition; and names of assigned modifying groups and numbers indicating their abundance.
It is further realized that the masses of glycan mass component may be obtained as exact monoisotopic mass of usually smallest isotope of the glycan mass component or as an average mass of the isotope distribution of the glycan mass component. Exact mass is calculated form exact masses of individual mass components and average from masses average masses of individual mass components. Person skilled in art can recognize from the peak shapes (i.e. by the resolution obtained) in the mass spectrum whether to use monoisotopic or average masses to interpret the spectra. It is further realized that average and exact masses can be converted to each other when isotope abundances of molecules are known, typically natural abundance without enrichment of isotopes can be assumed, unless the material is deliberately labelled with radioactive or stable isotopes.
It is further realized that specific rounded mass numbers can be used as names for glycan mass components. The present invention uses preferably mass numbers rounded down from the exact mass of the monosaccharide composition (and usually observable or observed mass) to closest integer as names of glycan mass components.
The masses of glycan mass components are obtained by calculating molecular mass of individual monosaccharide components (Hex, HexNAc, dHex, NeuAc) from the known atom compositions (for example hexose corresponds to C6H12O6) and subtracting for water in case of monosaccharide residue, followed by calculating the sum of the monosaccharide components (and possible modifications such as SO3 or PO3H). It is further realized that molecular masses of glycans may be calculated from atomic compositions or any other suitable mass units corresponding molecular masses of these. The molecular masses and calculation thereof are known in the art and masses of monosaccharide components/residues are available in tables with multiple decimals from various sources.
It is further realized that many of the individual monosaccharide compositions described in the present invention further correspond to several isomeric individual glycans. In addition, there exist also monosaccharide compositions that have nearly equal masses, for example dHex2 and NeuAc monosaccharide residues that have nearly equal masses, and other examples can be presented by a person skilled in the art. It is realized that the ability to differentiate compositions with nearly equal masses depends on instrumentation, and the present method is especially directed to a possibility to select also such compositions in place of proposed compositions.
The preferred glycans in glycomes comprise at least two of following monosaccharide component residues selected from group: Hexoses (Hex) which are Gal, Glc and Man; N-acetylhexosamines (HexNAc) which are GlcNAc and GalNAc; pentose, which is Xyl; Hexuronic acids which are GlcA and IdoA; deoxyhexoses (dHex), which is fucose and sialic acids which are NeuAc and/or NeuGc; and further modification groups such as acetate (Ac), sulphate and phosphate forming esters with the glycans. The monosaccharide residues are further grouped as major backbone monosaccharides including GlcNAc, HexA, Man and Gal; and specific terminal modifying monosaccharide units Glc, GalNAc, Xyl and sialic acids.
The present invention is directed to analyzing glycan components from biological samples, preferably as mass spectrometric signals. Specific glycan modifications can be detected among the detected signals by determined indicative signals as exemplified below. Modifications can also be detected by more specific methods such as chemical or physical methods, for example mass spectrometric fragmentation or glycosidase detection as disclosed in the present invention. In a preferred form of the present method, glycan signals are assigned to monosaccharide compositions based on the detected m/z ratios of the glycan signals, and the specific glycan modifications can be detected among the detected monosaccharide compositions.
In a further aspect of the present invention, relative molar abundances of glycan components are assigned based on their relative signal intensities detected in mass spectrometry as described in the Examples, which allows for quantification of glycan components with specific modifications in relation to other glycan components. The present method is also directed to detecting changes in relative amounts of specific modifications in cells at different time points to detect changes in cell glycan compositions.
The invention is specifically directed to glycan compositions, which further comprise at least one monosaccharide component in free form, preferably a preferred monosaccharide component described above. The monosaccharide comprising compositions are in a preferred embodiment derived from a cell material or released glycomes, which has been in contact with monosaccharide releasing chemicals or enzymes, preferably with exoglycosidase enzymes or chemicals such as oxidating reagents and/or acid or base, more preferably with a glycosidase enzyme. The invention is further directed to compositions comprising a specific preferred monosaccharide according to the invention, an exoglycosidase enzyme capable releasing all or part of the specific monosaccharide and an glycan composition according to the invention from which at least part of the terminal specific monosaccharide has been released.
It is further realized that by increasing the sensitivity of detection the number of glycan mass components in a given sample can be increased. The analysis according to the invention can in most cases be performed from major or significant components in the glycome mixture. The present invention is preferably directed to detection of glycan mass components from a high quality glycan preparation with optimised experimental condition, when the glycan mass components have abundance at least higher than 0.01% of total amount of glycan mass components, more preferably of glycan mass components of abundance at least higher than 0.05%, and most preferably at least higher than 0.10% are detected. The invention is further directed to practical quality glycome compositions and analytic process directed to it, when glycan mass components of at least about 0.5%, of total amount of glycan mass components, more preferably of glycan mass components of abundance at least higher than 1.0%, even more preferably at least higher than 2.0%, most preferably at least higher than 4.0% (presenting lower range practical quality glycome), are detected. The invention is further directed to glycomes comprising preferred number of glycan mass components of at least the abundance of observable in high quality glycomes, and in another embodiment glycomes comprising preferred number of glycan mass components of at least the abundance of observable in practical quality glycomes.
It is further realized that fractionation or differential specific release methods of glycans from glycoconjugates can be applied to produce subglycomes containing part of glycome.
The subglycomes produced by fractionation of glycomes are called “fractionated subglycomes”.
The glycomes produced by specific release methods are “linkage-subglycomes”. The invention is further directed to combinations of linkage-subglycomes and fractionated subglycomes to produce “fractionated linkage-subglycomes”, for example preferred fractionated linkage-subglycomes include neutral O-glycans, neutral N-glycans, acidic O-glycans, and acidic N-glycans, which were found very practical in characterising target materials according to the invention.
The fractionation can be used to enrich components of low abundance. It is realized that enrichment would enhance the detection of rare components. The fractionation methods may be used for larger amounts of cell material. In a preferred embodiment the glycome is fractionated based on the molecular weight, charge or binding to carbohydrate binding agents.
These methods have been found useful for specific analysis of specific subglycomes and enrichment of more rare components. The present invention is in a preferred embodiment directed to charge based separation of neutral and acidic glycans. This method gives rise for an analysis method, preferably mass spectroscopy material of reduced complexity and it is useful for analysis as neutral molecules in positive mode mass spectrometry and negative mode mass spectrometry for acidic glycans.
Differential release methods may be applied to get separately linkage specific subglycomes such as O-glycan, N-glycan, glycolipid or proteoglycan comprising fractions or combinations thereof. Chemical and enzymatic methods are known for release of specific fractions, furthermore there are methods for simultaneous release of O-glycans and N-glycans.
It is realized that at least part of the glycomes have novelty as novel compositions of very large amount of components. The glycomes comprising very broad range substances are referred as complete glycomes.
Preferably the composition is a complete composition comprising essentially all degrees of polymerisation in general from at least about disaccharides, more preferably from trisaccharides to at least about 25-mers in a high resolution case and at least to about 20-mers or at least about 15-mer in case of medium and practical quality preparations. It is realized that especially the lower limit, but also upper limit of a subglycome depend on the type of subglycome and/or method used for its production. Different complete ranges may be produced in scope of general glycomes by fractionation, especially based on size of the molecules.
Novel Compositions with New Combinations of Subglycomes and Preferred Glycan Groups
It is realized that several glycan types are present as novel glycome compositions produced from the tissue samples. The invention is specifically directed to novel mixture compositions comprising different subglycomes and preferred glycan groups.
It is realised that the glycome compositions as described in the Examples represent quantitatively new data about glycomes from the preferred tissue sample types. The proportions of various components cannot be derived from background data and are very useful for the analysis methods according to the invention. The invention is specifically directed to glycome compositions according to the Examples when the glycan mass components are present in essentially similar relative amounts.
The present invention is specifically directed to glycomes of tissue samples according to the invention comprising glycan material with monosaccharide composition for each of glycan mass components according to the Formula I:
NeuAcmNeuGcnHexoHexNAcpdHexqHexArPensActModXx,
wherein m, n, o, p, q, r, s, t, and x are independent integers with values ≧0 and less than about 100,
with the proviso that
for each glycan mass components at least two of the backbone monosaccharide variables o, p, or r is greater than 0, and
ModX represents a modification (or N different modifications Mod1, Mod2, . . . , ModN), present in the composition in an amount of x (or in independent amounts of x1, x2, . . . , xN),
Preferably examples of such modifications (Mod) including for example SO3 or PO3H indicating esters of sulfate and phosphate, respectively
and the glycan composition is preferably derived from isolated human tissue samples or preferred subpopulations thereof according to the invention.
It is realized that usually glycomes contain glycan material for which the variables are less much less than 100, but large figures may be obtained for polymeric material comprising glycomes with repeating polymer structures, for example ones comprising glycosaminoglycan type materials. It is further realized that abundance of the glycan mass components with variables more than 10 or 15 is in general very low and observation of the glycome components may require purification and enrichment of larger glycome components from large amounts of samples.
In a preferred embodiment the invention is directed to broad mass range glycomes comprising polymeric materials and rare individual components as indicated above. Observation of large molecular weight components may require enrichment of large molecular weight molecules comprising fraction. The broad general compositions according to the Formula I are as described above,
with the proviso that
m, n, o, p, q, r, s, t, and x are independent integers with preferable values between 0 and 50, with the proviso that for each glycan mass components at least two of o, p, or r is at least 1, and the sum of the monosaccharide variables; m, n, o, p, q, r, and s, indicating the degree of polymerization or oligomerization, for each glycan mass component is less than about 100 and the glycome comprises at least about 20 different glycans of at least disaccharides.
In a preferred embodiment the invention is directed to practical mass range and high quality glycomes comprising lower molecular weight ranges of polymeric material. The lower molecular weight materials at least in part and for preferred uses are observable by mass spectrometry without enrichment.
In a more preferred general composition according to the Formula I as described above,
m, n, o, p, q, r, s, t, and x are independent integers with preferable values between 0 and about 20, more preferably between 0 and about 15, even more preferably between 0 and about 10, with the proviso that at least two of o, p, or r is at least 1,
and the sum of the monosaccharide variables; m, n, o, p, q, r, and s, indicating the degree of polymerization or oligomerization, for each glycan mass component is less than about 50 and more preferably less than about 30,
and the glycome comprises at least about 50 different glycans of at least trisaccharides.
In a preferred embodiment the invention is directed to practical mass range high quality glycomes which may comprise some lower molecular weight ranges of polymeric material. The lower molecular weight materials at least in part and for preferred uses are observable by mass spectrometry without enrichment.
In a more preferred general composition according to the Formula I as described above, m, n, o, p, q, r, s, t, and x are independent integers with preferable values between 0 and about 10, more preferably between 0 and about 9, even more preferably, between 0 and about 8, with the proviso that at least two of o, p, or r is at least 1,
and the sum of the monosaccharide variables; m, n, o, p, q, r, and s, indicating the degree of polymerization or oligomerization, for each glycan mass component is less than about 30 and more preferably less than about 25,
and the glycome comprises at least about 50 different glycans of at least trisaccharides.
The practical mass range glycomes may typically comprise tens of components, for example in positive ion mode MALDI-TOF mass spectrometry for neutral subglycomes it is usually possible to observe even more than 50 molecular mass components, even more than 100 mass components corresponding to much larger number of potentially isomeric glycans. The number of components detected depends on sample size and detection method.
The present invention is specifically directed to subglycomes of tissue sample glycomes according to the invention comprising glycan material with monosaccharide compositions for each of glycan mass components according to the Formula I and as defined for broad and practical mass range glycomes. Each subglycome has additional characteristics based on glycan core structures of linkage-glycomes or fractionation method used for the fractionated glycomes. The preferred linkage glycomes include N-glycans, O-glycans, glycolipid glycans, and neutral and acidic subglycomes.
Protein N-glycosidase releases N-glycans comprising typically two N-acetylglycosamine units in the core, optionally a core linked fucose unit and typically then 2-3 hexoses (core mannoses), after which the structures may further comprise hexoses being mannose or in complex-type N-glycans further N-acetylglycosamines and optionally hexoses and sialic acids.
N-glycan subglycomes relased by protein N-glycosidase comprise N-glycans containing N-glycan core structure and are releasable by protein N-glycosidase from cells. The N-glycan core structure is Manβ4GlcNAcβ4(Fucα6)nGlcNAc, wherein n is 0 or 1 and the N-glycan structures can be elongated from the Manβ4 with additional mannosyl residues. The protein N-glycosidase cleaves the reducing end GlcNAc from Asn in proteins. N-glycan subglycomes released by endo-type N-glycosidases cleaving between GlcNAc units contain Manβ4GlcNAc-core, and the N-glycan structures can be elongated from the Manβ4 with additional mannosylresidues.
In case the Subglycome and analysis representing it as Glycan profile is formed from N-glycans liberated by N-glycosidase enzyme, the preferred additional constraints for Formula I are:
p>0, more preferably 1≦p≦100, typically p is between 2 and about 20, but polymeric structures containing glycomes may comprise larger amounts of HexNAc and it is realized that in typical core of N-glycans indicating presence of at least partially complex type structure, when p≧3 it follows that o≧1.
In case the Subglycome and analysis representing it as Glycan profile is formed from lipid-linked glycans liberated by endoglycoceramidase enzyme, the preferred additional constraints for Formula I are:
o>0, more preferably 1≦o≦100, and
when p≧1 it follows that o≧2.
Typically glycolipids comprise two hexoses (a lactosylresidue) at the core. The degree of oligomerization in a usual practical glycome from glycolipids is under about 20 and more preferably under 10. Very large structures comprising glycolipids, polyglycosylceramides, may need enrichment for effective detection.
Most preferred fractionated Subglycomes includes 1) subglycome of neutral glycans and 2) subglycome of acidic glycans. The major acidic monosaccharide unit is in most cases a sialic acid, the acidic fraction may further comprise natural negatively charged structure/structures such as sulphate(s) and/or phosphate(s).
In case the Subglycome and analysis representing it as Glycan profile is formed from sialylated glycans, the preferred additional constraints for Formula I are:
(m+n)>0, more preferably 1≦(m+n)≦100.
Large amounts of sialic acid in a glycan mass component would indicate presence of polysialic acid type structures. Practical and high resolutions acidic glycomes usually have m+n values for individual major glycan mass components with preferred abundance between 1 and 10, more preferably between 1 and 5 and most preferably between 1 and 4 for usual glycomes according to the invention. For neutral glycans, (m+n)=0, and they do not contain negatively charged groups as above. However, glycans with negatively charged groups can be eluted together with the neutral glycans into the neutral glycan fraction, and they may optionally be analyzed in the neutral glycan fraction according to the present invention.
The present invention is specifically directed to the glycomes of tissue samples according to the invention comprising as major components at least one of structure groups selected from the groups described below.
According to the present invention, the glycan signals are optionally organized into glycan groups and glycan group profiles based on analysis and classification of the assigned monosaccharide and modification compositions and the relative amounts of monosaccharide and modification units in the compositions, according to the following classification rules:
1° The glycan structures are described by the formulae:
HexmHexNAcndHexoNeuAcpNeuGcqPenrMod1smod1Mod2smod2 . . . ModXsModx,
2° Glycan structures in general are classified as follows:
3° N-glycan glycan structures, generated e.g. by the action of peptide-N-glycosidases, are classified as follows:
4° Mucin-type O-glycan structures, generated e.g. by alkaline β-elimination, are classified as follows:
Lipid-linked can also be classified into structural groups based on their monosaccharide compositions, as adopted from the classifications above according to the invention.
Hex5HexNAc4dHex2NeuAc1Ac1,
The present invention revealed novel unexpected components among in the glycomes studied. The present invention is especially directed to glycomes comprising such unusual materials.
It is further realized that the glycans may be derivatized chemically during the process of release and isolation. Preferred modifications include modifications of the reducing end and/or modifications directed especially to the hydroxyls- and/or N-atoms of the molecules. The reducing end modifications include modifications of reducing end of glycans involving known derivatization reactions, preferably reduction, glycosylamine, glycosylamide, oxime (aminooxy-) and reductive amination modifications. Most preferred modifications include modification of the reducing end. The derivatization of hydroxyl- and/or amine groups, such as produced by methylation or acetylation methods including permethylation and peracetylation has been found especially detrimental to the quantitative relation between natural glycome and the released glycome.
In a preferred embodiment the invention is directed to non-derivatized released glycomes. The benefit of the non-derivatized glycomes is that less processing is needed for the production. The non-derivatized released glycomes correspond more exactly to the natural glycomes from which these are released. The present invention is further directed to quantitative purification according to the invention for the non-derivatized released glycomes and analysis thereof.
The present invention is especially directed to released glycomes when the released glycome is not a permodified glycome such as permethylated glycome or peracetyated glycome. The released glycome is more preferably reducing end derivatized glycome or a non-derivatized glycome, most preferably non-derivatized glycome.
The present invention is further directed to novel total compositions of glycans or oligosaccharides referred as glycomes and in a more specific embodiment as released glycomes observed from or produced from the target material according to the invention. The released glycome indicates the total released glycans or total specific glycan subfractions released from the target material according to the invention. The present invention is specifically directed to released glycomes meaning glycans released from the target material according to the invention and to the methods according to the invention directed to the glycomes.
The present invention is preferably directed to the glycomes released as truncated and/or non-truncated glycans and/or derivatized according to the invention.
The invention is especially directed to N-linked and/or O-linked and/or lipid-linked released glycomes from the target material according to the invention. The invention is more preferably directed to released glycomes comprising glycan structures according to the invention, preferably glycan structures as defined in Formula I. The invention is more preferably directed to N-linked released glycomes comprising glycan structures according to the invention, preferably glycan structures as defined in Formula I.
In a preferred embodiment the invention is directed to non-derivatized released cell surface glycomes. The non-derivatized released cell surface glycomes correspond more exactly to the fractions of glycomes that are localized on the cell surfaces, and thus available for biological interactions. These cell surface localized glycans are of especial importance due to their availability for biological interactions as well as targets for reagents (e.g. antibodies, lectins, etc.) targeting the cells or tissues of interest. The invention is further directed to release of the cell surface glycomes, preferably from intact cells by hydrolytic enzymes such as proteolytic enzymes, including proteinases and proteases, and/or glycan releasing enzymes, including endo-glycosidases or protein N-glycosidases. Preferably the surface glycoproteins are cleaved by proteinase such as trypsin and then glycans are analysed as glycopeptides or preferably released further by glycan releasing enzyme.
The present invention is especially directed to analysis of glycan mixtures present in tissue samples by chemical, biochemical, or physical means, preferably by mass spectrometry, as described below.
The invention is directed to novel methods for qualitative analysis of glycome data. The inventors noticed that there are specific components in glycomes according to the invention, the presence or absence of which are connected or associated with specific cell type or cell status. It is realized that qualitative comparison about the presence of absence of such signals are useful for glycome analysis. It is further realized that signals either present or absent that are derived from a general glycome analysis may be selected to more directed assay measuring only the qualitatively changing component or components optionally with a more common component or components useful for verification of data about the presence or absence of the qualitative signal.
The present invention is further specifically directed to quantitative analysis of glycan data from tissue samples. The inventors noted that quantitative comparisons of the relative abundances of the glycome components reveal substantial differences about the glycomes useful for the analysis according to the invention.
The process contains essential key steps which should be included in every process according to the present invention.
The essential key steps of the analysis are:
In most cases it is useful to compare the data with control sample data. The control sample may be for example from a healthy tissue or cell type and the sample from same tissue altered by cancer or another disease. It is preferable to compare samples from same individual organism, preferably from the same human individual.
The steps of a comparative analysis are:
It may be useful to analyse the glycan structural motifs present in the sample, as well as their relative abundances. The ability to elucidate structural motifs results from the quantitative nature of the present analysis procedure, comparison of the data to data from previously analyzed samples, and knowledge of glycan biosynthesis.
The glycome analysis may include characterization of structural motives of released glycans. The structural motif analysis may be performed in combination with structural analysis.
Preferred methods to reveal specific structural motifs include
The direct analyses are preferred as they are in general more effective and usually more quantitative methods, which can be combined to glycome analysis.
In a preferred embodiment the invention is directed to combination of analysis of structural motifs and glycome analysis.
The steps of a structural motif analysis are:
The steps 3 and 4 may be combined or performed in order first 4 and then 3.
More detailed preferred analysis method include following analysis steps:
The present methods further allow the possibility to use part of the non-modified material or material modified in step 3 or 5 for additional modification step or step and optionally purified after modification step or steps, optionally combining modified samples, and analysis of additionally modified samples, and comparing results from differentially modified samples.
As mentioned above, it is realized that many of the individual monosaccharide compositions in a given glycome further corresponds to several isomeric individual glycans. The present methods allow for generation of modified glycomes. This is of particular use when modifications are used to reveal such information about glycomes of interest that is not directly available from a glycan profile alone (or glycome profiles to compare). Modifications can include selective removal of particular monosaccharides bound to the glycome by a defined glycosidic bond, by degradation by specific exoglycosidases or selective chemical degradation steps such as e.g. periodic acid oxidation. Modifications can also be introduced by using selective glycosyltransferase reactions to label the free acceptor structures in glycomes and thereby introduction of a specific mass label to such structures that can act as acceptors for the given enzyme. In preferred embodiment several of such modifications steps are combined and used to glycomes to be compared to gain further insights of glycomes and to facilitate their comparison.
Such modifications may also include non-covalent modification, such as ion-pairing of charged groups. Sulphate esters may be ion-paired with cationic moiety, which enhances the ionization of sulphated glycans in positive-ion mode mass spectrometry. Such cationic moieties include e.g. lysine or arginine tripeptide (KKK or RRR), as described previously for glycopeptides. The present invention is specifically directed to using ion-pairing of free oligosaccharides to enhance detection of glycans with charged groups such as sulphate or phosphate. According to the present invention, glycans containing charged groups may also be identified by analysis as differential adduct and/or ion-pairing ions. For example, comparison of spectra obtained from the same sample in the presence of sodium or lithium ions gives information about the presence of charged groups without covalent modification.
The present invention is specifically directed to quantitative presentation of glycome data.
The quantitative presentation means presenting quantitative signals of components of the glycome, preferably all major components of the glycome, as a two-dimensional presentation including preferably a single quantitative indicator presented together with component identifier. A preferred purpose of this operation is to allow reliable comparison between data obtained from different samples or to identify glycan structures present in the analyzed sample. Any given glycan can exist as multiple ions in mass spectrometry or multiple modified forms in mass spectrometry and chromatography. For example, a glycan can exist as adduct ions with e.g. sodium or potassium. This may divide the signal to e.g. three or four components in the case of sialylated glycans, and these signals have to be summed up to get the correct signal intensity value for the glycan component. Another example is related to comparison of samples with different sample quality, and normalization is required to be able to reliably compare these samples. The prior art describes comparison between glycan samples without proper correction and normalization of the data.
The preferred two-dimensional presentations includes tables and graphs presenting the two dimensional data. The preferred tables list quantitative indicators in connection with, preferably beside or under or above the component identifiers, most preferably beside the identifier because in this format the data comprising usually large number of component identifier-quantitation indicator pairs.
The quantitation indicator is a value indicating the relative abundance of the single glycome component with regard to other components of total glycome or subglycome. The quantitation indicator can be directly derived from quatitative experimental data, or experimental data corrected to be quantitative.
The quantitation indicator is preferably a normalized quantitation indicator. The normalized quantitation indicator is defined as the experimental value of a single experimental quantitation indicator divided by total sum of quantitation indicators multiplied by a constant quantitation factor.
Preferred quantitation factors includes integer numbers from 1-1000 0000 000, more preferably integer numbers 1, 10 or 100, and more preferably 1 or 100, most preferably 100. The quantitation number one is preferred as commonly understandable portion from 1 concept and the most preferred quantitation factor 100 corresponds to common concept of per cent values.
The quantitation indicators in tables are preferably rounded to correspond to practical accuracy of the measurements from which the values are derived from. Preferred rounding includes 2-5 meaningful accuracy numbers, more preferably 2-4 numbers and most preferably 2-3 numbers.
The preferred component indicators may be experimentally derived component indicators. Preferred components indicators in the context of mass spectrometric analysis includes mass numbers of the glycome components, monosaccharide or other chemical compositions of the components and abbreviation corresponding to thereof, names of the molecules preferably selected from the group: descriptive names and abbreviations; chemical names, abbreviations and codes; and molecular formulas including graphic representations of the formulas.
It is further realized that molecular mass based component indicators may include multiple isomeric structures. The invention is in a preferred embodiment directed to practical analysis using molecular mass based component indicators. In more specific embodiment the invention is further directed to chemical or enzymatic modification methods or indirect methods according to the invention in order to resolve all or part of the isomeric components corresponding to a molecular mass based component indicators.
The present invention is directed to a method of accurately defining the molecular masses of glycans present in a sample, and assigning monosaccharide compositions to the detected glycan signals.
The glycan signals according to the present invention are glycan components characterized by:
1° mass-to-charge ratio (m/z) of the detected glycan ion,
2° molecular mass of the detected glycan component, and/or
3° monosaccharide composition proposed for the glycan component.
The present invention is further directed to a method of describing mass spectrometric raw data of glycan signals as two-dimensional tables of:
1° monosaccharide composition, and
2° relative abundance,
which form the glycan profiles according to the invention. Monosaccharide compositions are as described above. For obtaining relative abundance values for each glycan signal, the raw data is recorded in such manner that the relative signal intensities of the glycan signals represent their relative molar proportions in the sample. Methods for relative quantitation in MALDI-TOF mass spectrometry of glycans are known in the art (Naven & Harvey, 1996; Papac et al., 1996) and are described in the present invention. However, the relative signal intensities of each glycan signal are preferably corrected by taking into account the potential artefacts caused by e.g. isotopic overlapping, alkali metal adduct overlapping, and other disturbances in the raw data, as described below.
By forming these glycan profiles and using them instead of the raw data, analysis of the biological data carried by the glycan profiles is improved, including for example the following operations:
1° identification of glycan signals present in the glycan profile,
2° comparison of glycan profiles obtained from different samples,
3° comparison of relative intensities of glycan signals within the glycan profile, and
4° organizing the glycan signals present in the glycan profile into subgroups or subprofiles.
Glycan signals and their associated signals may have overlapping isotope patterns. Overlapping of isotope patterns is corrected by calculating the experimental isotope patterns and subtracting overlapping isotope signals from the processed data.
Glycan signals may be associated with signals arising from multiple adduct ions in positive ion mode, e.g. different alkali metal adduct ions. Different glycan signals may give rise to adduct ions with similar m/z ratios: as an example, the adduct ions [Hex+Na]+ and [dHex+K]+ have m/z ratios of 203.05 and 203.03, respectively. Overlapping of adduct ions is corrected by calculating the experimental alkali metal adduct ion ratios in the sample and using them to correct the relative intensities of those glycan signals that have overlapping adduct ions in the experimental data. Preferably, the major adduct ion type is used for comparison of relative signal intensities of the glycan signals, and the minor adduct ion types are removed from the processed data. The calculated proportions of minor adduct ion types are subtracted from the processed data.
Also in negative ion mode mass spectrometry, glycan signals may be associated with signals arising from multiple adduct ions. Typically, this occurs with glycan signals that correspond to multiple acidic group containing glycan structures. As an example, the adduct ions [NeuAc2-H+Na]− at m/z 621.2 and [NeuAc2—H+K]− at m/z 637.1, are associated with the glycan signal [NeuAc2-H]− at m/z 599.2. These adduct ion signals are added to the glycan signal and thereafter removed from the processed data. In cases where different glycan signals and adduct ion signals overlap, this is corrected by calculating the experimental alkali metal adduct ion ratios in the sample and using them to correct the relative intensities of those glycan signals that have overlapping adduct ions in the experimental data.
Glycan signals may be associated with signals, e.g. elimination of water (loss of H2O), or lack of methyl ether or ester groups (effective loss of CH2), resulting in experimental m/z values 18 or 14 mass units smaller than the glycan signal, respectively. These signals are not treated as individual glycan signals, but are instead treated as associated signals and removed from the processed data.
Classification of Glycan Signals into Glycan Groups
According to the present invention, the glycan signals are optionally organized into glycan groups and glycan group profiles based on analysis and classification of the assigned monosaccharide and modification compositions and the relative amounts of monosaccharide and modification units in the compositions, according to the classification rules described above.
To generate glycan group profiles, the proportions of individual glycan signals belonging to each glycan group are summed. The proportion of each glycan group of the total glycan signals equals its prevalence in the glycan profile. The glycan group profiles of two or more samples can be compared. The glycan group profiles can be further analyzed by arranging glycan groups into subprofiles, and analyzing the relative proportions of different glycan groups in the subprofiles. Similarly formed subprofiles of two or more samples can be compared.
The present invention is especially useful when low sample amounts are available. Practical cellular or tissue material may be available for example for diagnostic only in very small amounts.
The inventors found surprisingly that glycan fraction could be produced and analysed effectively from samples containing low amount of material, for example 100 000-1 000 000 cells or a cubic millimetre (microliter) of the cells.
The combination of very challenging biological samples and very low amounts of samples forms another challenge for the present analytic method. The yield of the purification process must be very high. The estimated yields of the glycan fractions of the analytical processes according to the present invention varies between about 50% and 99%. Combined with effective removal of the contaminating various biological materials even more effectively over the wide preferred mass ranges according to the present invention show the ultimate performance of the method according to the present invention.
The present invention is directed to a method of preparing an essentially unmodified glycan sample for analysis from the glycans present in a given sample.
A preferred glycan preparation process consists of the following steps:
1° isolating a glycan-containing fraction from the sample,
2° optionally purification the fraction to useful purity for glycome analysis
The preferred isolation method is chosen according to the desired glycan fraction to be analyzed. The isolation method may be either one or a combination of the following methods, or other fractionation methods that yield fractions of the original sample:
1° extraction with water or other hydrophilic solvent, yielding water-soluble glycans or glycoconjugates such as free oligosaccharides or glycopeptides,
2° extraction with hydrophobic solvent, yielding hydrophilic glycoconjugates such as glycolipids,
3° N-glycosidase treatment, especially F. meningosepticum N-glycosidase F treatment, yielding N-glycans,
4° alkaline treatment, such as mild (e.g. 0.1 M) sodium hydroxide or concentrated ammonia treatment, either with or without a reductive agent such as borohydride, in the former case in the presence of a protecting agent such as carbonate, yielding β-elimination products such as O-glycans and/or other elimination products such as N-glycans,
5° endoglycosidase treatment, such as endo-β-galactosidase treatment, especially Escherichia freundii endo-β-galactosidase treatment, yielding fragments from poly-N-acetyllactosamine glycan chains, or similar products according to the enzyme specificity, and/or
6° protease treatment, such as broad-range or specific protease treatment, especially trypsin treatment, yielding proteolytic fragments such as glycopeptides.
The released glycans are optionally divided into sialylated and non-sialylated subfractions and analyzed separately. According to the present invention, this is preferred for improved detection of neutral glycan components, especially when they are rare in the sample to be analyzed, and/or the amount or quality of the sample is low. Preferably, this glycan fractionation is accomplished by graphite chromatography and/or ion-exchange chromatography.
According to the present invention, sialylated glycans are optionally modified in such manner that they are isolated together with the non-sialylated glycan fraction in the non-sialylated glycan specific isolation procedure described above, resulting in improved detection simultaneously to both non-sialylated and sialylated glycan components. Preferably, the modification is done before the non-sialylated glycan specific isolation procedure. Preferred modification processes include neuraminidase treatment and derivatization of the sialic acid carboxyl group, while preferred derivatization processes include amidation and esterification of the carboxyl group.
The preferred glycan release methods include, but are not limited to, the following methods:
Free glycans—extraction of free glycans with for example water or suitable water-solvent mixtures.
Protein-linked glycans including O- and N-linked glycans—alkaline elimination of protein-linked glycans, optionally with subsequent reduction of the liberated glycans.
Mucin-type and other Ser/Thr O-linked glycans—alkaline β-elimination of glycans, optionally with subsequent reduction of the liberated glycans.
N-glycans—enzymatic liberation, optionally with N-glycosidase enzymes including for example N-glycosidase F from C. meningosepticum, Endoglycosidase H from Streptomyces, or N-glycosidase A from almonds.
Lipid-linked glycans including glycosphingolipids—enzymatic liberation with endoglycoceramidase enzyme; chemical liberation; ozonolytic liberation.
Glycosaminoglycans—treatment with endo-glycosidase cleaving glycosaminoglycans such as chondroinases, chondroitin lyases, hyalurondases, heparanases, heparatinases, or keratanases/endo-beta-galactosidases; or use of O-glycan release methods for O-glycosidic Glycosaminoglycans; or N-glycan release methods for N-glycosidic glycosaminoglycans or use of enzymes cleaving specific glycosaminoglycan core structures; or specific chemical nitrous acid cleavage methods especially for amine/N-sulphate comprising glycosaminoglycans
Glycan fragments—specific exo- or endoglycosidase enzymes including for example keratanase, endo-β-galactosidase, hyaluronidase, sialidase, or other exo- and endoglycosidase enzyme; chemical cleavage methods; physical methods
The invention describes special purification methods for glycan mixtures from tissue samples. Previous glycan sample purification methods have required large amounts of material and involved often numerous chromatographic steps and even purification of specific proteins. It is known that protein glycosylation varies protein specifically and single protein specific data can thus not indicate the total tissue level glycosylation. Purification of single protein is a totally different task than purifying the glycan fraction according to the present invention.
When the purification starts from a tissue or cells, the old processes of prior art involve often laborious homogenisation steps affecting the quality of the material produced. The present purification directly from a biological sample such as cell or tissue material, involves only a few steps and allows quick purification directly from the biological material to analysis preferably by mass spectrometry.
Purification from Cellular Materials of Cells and/or Tissues
The cellular material contains various membranes, small metabolites, various ionic materials, lipids, peptides, proteins etc. All of the materials can prevent glycan analysis by mass spectrometry if these cannot be separated from the glycan fraction. Moreover, for example peptide or lipid materials may give rise to mass spectrometric signals within the preferred mass range within which glycans are analysed. Many mass spectrometric methods, including preferred MALDI-mass spectrometry for free glycan fractions, are more sensitive for peptides than glycans. With the MALDI method peptides in the sample may be analysed with approximately 1000-fold higher sensitivity in comparision to methods for glycans. Therefore the method according to the present invention should be able to remove for example potential peptide contaminations from free glycan fractions most effectively. The method should remove essential peptide contaminations from the whole preferred mass range to be analysed.
The inventors discovered that the simple purification methods would separate released glycans from all possible cell materials so that
1) The sample is technically suitable for mass spectrometric analysis.
When using MALDI-technologies, the sample does not dry or crystallize properly if the sample contains harmful impurity material in a significant amount.
2) The purity allows production of mass spectrum of suitable quality.
Preferably the present invention is directed to analysis of unusually small sample amounts. This provides a clear benefit over prior art, when there is small amount of sample available from a small region of diseased tissue or diagnostic sample such as tissue slice produced for microscopy or biopsy sample. Methods to achieve such purity (purity being a requirement for the sensitivity needed for such small sample amounts) from tissue or cell samples (or any other complex biological matices e.g. serum, saliva) has not been described in the prior art.
In a preferred embodiment the method includes use of non-derived glycans and avoiding general derived glycans. There are methods of producing glycan profiles including modification of all hydroxyl groups in the sample such as permethylation. Such processes require large sample amounts and produces chemical artefacts such as undermethylated molecules lowering the effectivity of the method. These artefact peaks cover all minor signals in the spectra, and they can be misinterpreted as glycan structures. It is of importance to note that in glycome analyses the important profile-to profile differences often reside in the minor signals. In a specific embodiment the present invention is directed to site specific modification of the glycans with effective chemical or enzyme reaction, preferably a quantitative reaction.
The present invention is specifically directed to quantitative mass spectrometric methods for the analysis of glycomes. Most preferred mass spectrometric methods are MALDI-TOF mass spectrometry methods.
The inventors were able to optimise MALDI-TOF mass spectrometry for glycome analysis.
The preferred mass spectrometric analysis process is MALDI-TOF mass spectrometry, where the relative signal intensities of the unmodified glycan signals represent their relative molar proportions in the sample, allowing relative quantification of both neutral (Naven & Harvey, 1996) and sialylated (Papac et al., 1996) glycan signals. Preferred experimental conditions according to the present invention are described under Experimental procedures of Examples listed below.
For MALDI-TOF mass spectrometry of unmodified glycans in positive ion mode, optimal mass spectrometric data recording range according to the present invention is over m/z 200, more preferentially between m/z 200-10000, or even more preferably between m/z 200-4000 for improved data quality. In the most preferred form according to the present invention, the data is recorded between m/z 700-4000 for accurate relative quantification of glycan signals.
For MALDI-TOF mass spectrometry of unmodified glycans in negative ion mode, optimal mass spectrometric data recording range according to the present invention is over m/z 300, more preferentially between m/z 300-10000, or even more preferably between m/z 300-4000 for improved data quality. In the most preferred forms according to the present invention, the data is recorded between m/z 700-4000 or most preferably between m/z 800-4000 for accurate relative quantification of glycan signals.
Practical m/z-Ranges
The practical ranges comprising most of the important signals, as observed by the present invention may be more limited than these. Preferred practical ranges includes lower limit of about m/z 400, more preferably about m/z 500, and even more preferably about m/z 600, and most preferably m/z about 700 and upper limits of about m/z 4000, more preferably m/z about 3500 (especially for negative ion mode), even more preferably m/z about 3000 (especially for negative ion mode), and in particular at least about 2500 (negative or positive ion mode) and for positive ion mode to about m/z 2000 (for positive ion mode analysis). The preferred range depends on the sizes of the sample glycans, samples with high branching or polysaccharide content or high sialylation levels are preferably analysed in ranges containing higher upper limits as described for negative ion mode. The limits are preferably combined to form ranges of maximum and minimum sizes or lowest lower limit with lowest higher limit, and the other limits analogously in order of increasing size.
The inventors were able to show effective quantitative analysis in both negative and positive mode mass spectrometry.
The inventors developed optimised sample handling process for preparation of the samples for MALDI-TOF mass spectrometry.
The glycan purification method according to the present invention consists of at least one of purification options, preferably in specific combinations described below, including the following purification options:
1) Precipitation-extraction;
2) Ion-exchange;
3) Hydrophobic interaction;
4) Hydrophilic interaction; and
5) Affinity to graphitized carbon.
1) Precipitation-extraction may include precipitation of glycans or precipitation of contaminants away from the glycans. Preferred precipitation methods include:
2) Ion-exchange may include ion-exchange purification or enrichment of glycans or removal of contaminants away from the glycans. Preferred ion-exchange methods include:
3) Hydrophilic interaction may include purification or enrichment of glycans due to their hydrophilicity or specific adsorption to hydrophilic materials, or removal of contaminants such as salts away from the glycans. Preferred hydrophilic interaction methods include:
4) Affinity to graphitized carbon may include purification or enrichment of glycans due to their affinity or specific adsorption to graphitized carbon, or removal of contaminants away from the glycans. Preferred graphitized carbon affinity methods include porous graphitized carbon chromatography.
Preferred purification methods according to the invention include combinations of one or more purification options. Examples of the most preferred combinations include the following combinations:
1) For neutral underivatized glycan purification: 1. cation exchange of contaminants, 2.
hydrophobic adsorption of contaminants, and 3. graphitized carbon affinity purification of glycans.
1) For sialylated underivatized glycan purification: 1. cation exchange of contaminants, 2. hydrophobic adsorption of contaminants, 3. optionally adsorption of glycans to cellulose, and 4. graphitized carbon affinity purification of glycans.
The present invention is directed to analysis of released glycomes by spectrometric method useful for characterization of the glycomes. The invention is directed to NMR spectroscopic analysis of the mixtures of released glycans. The inventors showed that it is possible to produce a released glycome from tissue samples in large scale enough and useful purity for NMR-analysis of the glycome. In a preferred embodiment the NMR-analysis of the tissue glycome is one dimensional proton NMR-analysis showing structural reporter groups of the major components in the glycome. The present invention is further directed to combination of the mass spectrometric and NMR analysis of small scale tissue samples.
The inventors further realized major glycome differences between samples from the same species. The invention is specifically directed to analysis of individual differences between animals. The invention is further directed to the use of the information in breeding of animals, especially production animals, preferentially in context of increased susceptibility to cancer, especially genetic susceptibility to cancer.
The inventors further realized major glycome differences between samples from animals related to the cancer status of the animal. The invention is especially directed to the analysis of biological status related changes of animal.
The inventors further noticed major species specific differences in the total released glycomes analysed. It is realized that species specific glycome differences are useful for analysis of effects of glycosylations in animal materials from different species in context of cancer.
The invention revealed that glycome oligosaccharide mixtures can be produced effectively from eukaryotic species especially animal tissues.
The invention is in a preferred embodiment directed to analysis of human type primates such as monkeys especially apes (examples include chimpanzee, pygmy chimpanzee, gorilla, orangutan) and human, the preference is based on close similarity of primates and human on genetic and cell biological level, providing similarity for samples to be analysed and scientifically important evolution based glycosylation changes between similar species. The invention is further directed to analysis of animals useful for development of pharmaceutical and therapeutic materials in context of cancer. The preferred animals include rodents (such as mouse, hamster, rat) and human type primates.
The present invention refers as “tissue materials” all preferred target tissue related material including for example tissues, secretions and cultivated differentiated cells
The present invention is preferably directed to specific tissue types for the analysis according to the invention. The tissue type are found to be very suitable and feasible for the analysis according to the invention. The analysis is especially directed to analysis of
1) tissues of gastrointestinal track, preferably mouth, larynx, stomach, large and small intestine
2) internal organs such as ovarian tissue, liver, lungs, or kidney
3) tissues of circulatory system, especially blood
4) cultivated cell line models of the differentiated tissues
The present invention is preferably directed to specific parts of tissue for the analysis according to the invention. The inventors realized that it is possible perform glycomics analysis of specific parts of tissues and reveal differences useful for studies of diseases and disease induced changes and other changes or presence of receptor structures on specific subtissues. Preferred subtissues includes
1) tissues surfaces, especially epithelia of gastrointestinal tract and cell surfaces and
2) components of circulatory system, preferably serum/plasma, and blood cells, especially red cells and white blood cells
The invention is further directed to material produced by tissues.
Preferably the invention is directed to the analysis of secretions of tissues, preferably liquid secretions of tissues, preferably milk, saliva or urine. It is realized that liquid secretions form a specific group of tissue derived materials found especially useful for the glycome analysis methods according to the invention. Milk is especially preferred as a food material consumed by animals and human and analysis with regard to each of individual specific, animal status specific and species specific differences.
The invention is under separate preferred embodiment directed to the analysis of specific conjugated glycomes such as protein or lipid derived glycomes, from the secretions and in another preferred embodiment free soluble glycomes of the secretions.
Soluble Glycome Materials: Tissue and/or Secretion Materials, Especially with High Protein Content
The invention is in a preferred embodiment directed to specific methods developed for the analysis of soluble glycome material from tissues and secretions. This group includes background for purification different from solid tissue and cell derived materials. The group includes tissue solutions such as blood serum/plasma and liquid secretions such as milk, saliva and urine.
Subcomponents of Glycomes, Especially from Secreted Proteins
The invention is further directed to methods for selecting specific components of glycomes and searching enriched fractions such as specific protein fraction comprising the specific glycome components. Examples of such preferred methods include search of “cancer specific”? oligosaccharide structures according to the invention from serum, saliva or urine. The “cancer specific” oligosaccharide can exist as free secreted oligosaccharides or as conjugates to other biomolecules such as proteins or lipids.
In a preferred embodiment the invention is directed to special methods for the analysis of the surfaces of tissues.
The preferred tissue surfaces includes
1) epithelia or endothelia of the preferred cancer tissues
and
2) surfaces of cells according to cells on surface of tissues or separable homogeneously from tissue, such as blood cells and
3) surfaces of cultivated cells which may be used as models for differentiated tissues.
In a preferred embodiment the invention is directed to non-derivatized released cell surface glycomes. The non-derivatized released cell surface glycomes correspond more exactly to the fractions of glycomes that are localized on the cell surfaces, and thus available for biological interactions. These cell surface localized glycans are of especial importance due to their availability for biological interactions as well as targets for reagents (e.g. antibodies, lectins, etc.) targeting the cells or tissues of interest. The invention is further directed to release of the cell surface glycomes, preferably from intact cells by hydrolytic enzymes such as proteolytic enzymes, including proteinases and proteases, and/or glycan releasing enzymes, including endo-glycosidases or protein N-glycosidases. Preferably the surface glycoproteins are cleaveed by proteinase such as trypsin and then glycans are analysed as glycopeptides or preferably relased further by glycan relasing enzyme.
The invention is further directed to cultured cells corresponding to cancer cells. Such cells may be used as models for cancer. The cancer cells include cell models of cancer.
The invention is further directed to the compositions and compositions produced by the methods according to the invention. The invention further represent preferred methods for analysis of the glycomes, especially mass spectrometric methods.
The invention is specifically directed to released glycomes derived from conjugated glycans from preferred tissue materials and cell models of differentiated tissues.
The invention represents effective methods for purification of oligosaccharide fractions from tissues, especially in very low scale. The prior art has shown analysis of separate glycome components from tissues, but not total glycomes. It is further realized that the methods according to the invention are useful for analysis of glycans from isolated proteins or peptides.
The invention is further directed to novel quantitative analysis methods for glycomes. The glycome analysis produces large amounts of data. The invention reveals methods for the analysis of such data quantitatively and comparision of the data between different samples. The invention is especially directed to quantitative two-dimensional representation of the data.
The invention is further directed to integrated glycomics or glycome analysis process including
The first step is optional as the method is further directed to analysis of known and novel secretion derivable soluble glycomes.
The invention represents effective methods for the practical analysis of glycans from isolated proteins especially from very small amounts of samples. The invention is especially directed to the application of the methods for the analysis of proteins using the purification method, analysis methods and/or integrated glycome analysis. In a specific embodiment the invention is especially directed in analysis of separated cancer associated proteins for their glycome analysis.
The present invention is specifically directed to the glycan fraction produced according to the present invention from the pico scale tissue material sample according to the present invention. The preferred glycan fraction is essentially devoid of signals of contaminating molecules within the preferred mass range when analysed by MALDI mass spectrometry according to the present invention.
Preferred Uses of Glycomes and Analysis Thereof with Regard to Status of Cells
In the present invention the word cell refer to cells of tissue material according to the invention, especially cancer cells.
The present invention is specifically directed to the glycan fraction produced according to the present invention from the pico scale tissue material sample according to the present invention. The preferred glycan fraction is essentially devoid of signals of contaminating molecules within the preferred mass range when analysed by MALDI mass spectrometry according to the present invention.
The glycome products from tissue samples according to present invention are produced preferably directly from complete tissue material cells or membrane fractions thereof, more preferably directly from intact cells as effectively shown in examples. In another preferred embodiment the glycome fractions are cell surface glycomes and produced directly from surfaces of complete tissue material cells, preferably intact or essentially intact cells of tissue materials or surfaces of intact tissues according to the invention. In another embodiment the glycome products according to the invention are produced directly from membrane fraction
Preferred Uses of Glycomes and Analysis Thereof with Regard to Status of Cells
It is further realized that the analysis of glycome is useful for search of most effectively altering glycan structures in the tissue materials for analysis by other methods.
The glycome component identified by glycome analysis according to the invention can be further analysed/verified by known methods such as chemical and/or glycosidase enzymatic degradation(s) and further mass spectrometric analysis and by fragmentation mass spectrometry, the glycan component can be produced in larger scale by known chromatographic methods and structure can be verified by NMR spectroscopy.
The other methods would preferably include binding assay using specific labelled carbohydrate binding agents including especially carbohydrate binding proteins (lectins, antibodies, enzymes and engineered proteins with carbohydrate binding activity) and other chemicals such as peptides or aptamers aimed for carbohydrate binding. It is realized that the novel marker structure can be used for analysis of cells, cell status and possible effects of contaminats to cell with similar indicative value as specific signals of the glycan mass components in glycome analysis by mass spectrometry according to the invention.
The invention is especially directed to search of novel carbohydrate marker structures from cell/tissue surfaces, preferably by using cell surface profiling methods. The cell surface carbohydrate marker structures would be further preferred for the analysis and/or sorting of cells.
The present invention is specifically directed to analyzing glycan datasets and glycan profiles for comparison and characterization of different tissue materials. In one embodiment of the invention, glycan signals or signal groups associated with given tissue material are selected from the whole glycan datasets or profiles and indifferent glycan signals are removed. The resulting selected signal groups have reduced background and less observation points, but the glycan signals most important to the resolving power are included in the selection. Such selected signal groups and their patterns in different sample types serve as a signature for the identification of the cell type and/or glycan types or biosynthetic groups that are typical to it. By evaluating multiple samples from the same tissue material, glycan signals that have individual i.e. cell line specific variation can be excluded from the selection. Moreover, glycan signals can be identified that do not differ between tissue materials, including major glycans that can be considered as housekeeping glycans.
To systematically analyze the data and to find the major glycan signals associated with given tissue material according to the invention, difference-indicating variables can be calculated for the comparison of glycan signals in the glycan datasets. Preferential variables between two samples include variables for absolute and relative difference of given glycan signal between the datasets from two tissue materials. Most preferential variables according to the invention are:
1. absolute difference A=(S2−S1), and
2. relative difference R=A/S1,
wherein S1 and S2 are relative abundances of a given glycan signal in cell types 1 and 2, respectively.
It is realized that other mathematical solutions exist to express the idea of absolute and relative difference between glycan datasets, and the above equations do not limit the scope of the present invention. According to the present invention, after A and R are calculated for the glycan profile datasets of the two tissue materials, the glycan signals are thereafter sorted according to the values of A and R to identify the most significant differing glycan signals. High value of A or R indicates association with tissue material 2, and vice versa. In the list of glycan data sorted independently by R and A, the tissue material specific glycans occur at the top and the bottom of the lists. More preferentially, if a given signal has high values of both A and R, it is more significant.
Preferred Representation of the Dataset when Comparing Two Tissue Materials
The present invention is specifically directed to the comparative presentation of the quantitative glycome dataset as multidimensional graphs comparing the paraller data or as other three dimensional presentations or for example as two dimensional matrix showing the quantities with a quantitative code, preferably by a quantitative color code.
The invention is further directed to methods of recognizing different tissue materials, preferably human tissues and more preferably human excretions or serum. It is further realized, that the present reagents can be used for purification of tissue materials by any fractionation method using the specific binding reagents.
Preferred fractionation methods includes fluorecense activated cell sorting (FACS), affinity chromatography methods, and bead methods such as magnetic bead methods.
The invention is further directed to positive selection methods including specific binding to the tissue material but not to contaminating tissue materials. The invention is further directed to target selection methods including specific binding to the contaminating tissue material but not to the target tissue materials. In yet another embodiment of recognition of tissue materials the tissue material is recognized together with a homogenous reference sample, preferably when separation of other materials is needed. It is realized that a reagent for positive selection can be selected so that it binds tissue materials as in the present invention and not to the contaminating tissue materials and a reagent for negative selection by selecting opposite specificity. In case of tissue material type according to the invention is to be selected amongst novel tissue materials not studied in the present invention, the binding molecules according to the invention maybe used when verified to have suitable specificity with regard to the novel tissue material (binding or not binding). The invention is specifically directed to analysis of such binding specificity for development of a new binding or selection method according to the invention.
The preferred specificities according to the invention include recognition of:
The N-glycan analysis of total profiles of released N-glycans revealed beside the glycans above, which were verified to comprise
1) complex biantennary N-glycans, such as Galβ4GlcNAcβ2Manα3(Galβ4GlcNAcβ2Manα6)Manβ4GlcNAcβ4(Fucα6)0-1GlcNAcβ-, wherein the reminal N-acetyllactosamines can be elongated from Gal with NeuNAcα3 aand/or NeuNAcα6 and
2) terminal mannose containing N-glycans such as High-mannose glycans with formula Hex5-9HexNAc2 and degradation products thereof comprising low number of mannose residues (Low mannose glycans) Hex1-4HexNAc2.
The glycan share common core structure according to the Formula:
[Manα3]n1(Manα6)n2Manβ4GlcNAcβ4(Fucα6)0-1GlcNAcβAsn,
wherein n1 and n2 are integers 0 or 1, independently indicating the presence or absence of the terminal Man-residue, and
wherein the non-reducing end terminal Manα3/Manα6-residues can be elongated to the complex type, especially biantennary structures or to mannose type (high-Man and/or low Man) or to hybrid type structures as described in examples.
It was further analyzed that the N-glycan compositios contained only very minor amounts of glycans with additional HexNAx in comparison to monosaccharide compositions of the complex type glycan above, which could indicate presence of no or very low amounts of the N-glycan core linked GlcNAc-residues described by Stanley P M and Raju T S (JBC-(1998) 273 (23) 14090-8; JBC (1996) 271 (13) 7484-93) and/or bisecting GlcNAc. is realized that part of the terminal HexNAc-type structures appear to represent bisecting GlcNAc-type type glycans, and quite low or non-existent amounts of the GlcNAcα6-branching and also low amounts of GlcNAcβ2-branch of Manβ4 described by Stanley and colleagues. Here, essentially devoid of indicates less than 10% of all the protein linked N-glycans, more preferably the additional HexNAc units are present in less than 8% of the tissue material N-glycans by mass spectrometric analysis.
The invention thus describes the major core structure of N-glycans in human tissue materials verified by NMR-spectroscopy and by specific glycosidase digestions and was further quantitated to comprise a characteristic smaller structural group glycans comprising specific terminal HexNAc group and/or bisecting GlcNAc-type structures, which additionally modify part of the core structure. The invention further reveals that the core structure is a useful target structure for analysis of tissue materials.
The characteristic monosaccharide composition of the core structure will allow recognition of the major types of N-glycan structure groups present as additional modification of the core structure. Furthermore composition of the core structure is characteristic in mass spectrometric analysis of N-glycan and allow immediate recognition for example from HexxHexNAc1-type (preferentially ManxGlcNAc1) glycans also present in total glycome composition.
The invention describes novel low-molecular weight acidic glycan components within the acidic N-glycan and/or soluble glycan fractions with characteristic monosaccharide compositions SAxHex1-2HexNAc1-2, wherein x indicates that the corresponding glycans are preferentially sialylated with one or more sialic acid residues. The inventors realized that such glycans are novel and unusual with respect to N-glycan biosynthesis and described mammalian cell glycan components, as reveal also by the fact that they are classified as “other (N-)glycan types” in N-glycan classification scheme of the present invention. The invention is directed to analyzing, isolating, modifying, and/or binding to these novel glycan components according to the methods and uses of the present invention, and further to other uses of specific marker glycans as described here. As demonstrated in the Examples of the present invention, such glycan components were specific parts of total glycomes of certain tissue materials and preferentially to certain tissue material types, making their analysis and use beneficial with regard to tissue materials. The invention is further directed to tissue material glycomes and subglycomes containing these glycan components.
The present invention is specifically directed to tissue material glycomes, which are essentially pure glycan mixtures comprising various glycans as described in the invention preferably in proportions shown by the invention. The essentially pure glycan mixtures comprise the key glycan components in proportions which are characteristics to tissue material glycomes. The preferred glycomes are obtained from human tissue materials according to the invention.
The invention is further directed to glycomes as products of purification process and variations thereof according to the invention. The products purified from tissue materials by the simple, quantitative and effective methods according to the invention are essentially pure. The essentially pure means that the mixtures are essentially devoid of contaminations disturbing analysis by MALDI mass spectrometry, preferably by MALDI-TOF mass spectrometry. The mass spectra produced by the present methods from the essentially pure glycomes reveal that there is essentially no non-carbohydrate impurities with weight larger than trisaccharide and very low amount of lower molecular weight impurities so that crystallization of MALDI matric is possible and the glycan signals can be observed for broad glycomes with large variations of monosaccharide compositions and ranges of molecular weight as described by the invention. It is realized that the purification of the materials from low amounts of tissue materials comprising very broad range of cellular materials is very challenging task and the present invention has accomplished this.
Combination Compositions of the Preferred Glycome Mixtures with Matrix for Analysis
The invention further revealed that it is possible to combine the glycomes with matrix useful for a mass spectrometric analysis and to obtain combination mixture useful for spectrometric analysis. The preferred mass spectrometric matrix is matrix for MALDI (matrix assisted laser desorption ionization mass spectrometry) with mass spectrometric analysis (abbreviated as MALDI matrix), MALDI is preferably performed with TOF (time of flight) detection.
Preferred MALDI matrices include aromatic preferably benzene ring structure comprising molecules with following characteristic. The benzene ring structure molecules preferably comprises 1-4 substituents such as hydroxyl, carboxylic acid or ketone groups. Known MALDI matrixes have been reviewed in Harvey, Mass. Spec. Rev. 18, 349 (1999). The present invention is especially and separately directed to specific matrixes for analysis in negative ion mode of MALDI mass spectrometry, preferred for analysis of negatively charged (acidic, such as sialylated and/or sulfated and/or phosphorylated) subglycome, and in positive ion mode of MALDI mass spectrometry (preferred for analysis of neutral glycomes). It is realized that the matrices can be optimized for negative ion mode and positive ion mode.
The present invention is especially directed to glycome matrix composition optimized for the use in positive ion mode, and to the use of the MALDI-TOF matrix and matrix glycome composition, that is optimized for the use in the analysis in positive ion mode, for the analysis of glycome, preferably neutral glycome. The preferred matrices for positive ion mode are aromatic matrices, e.g. 2,5-dihydroxybenzoic acid, 2,5-dihydroxybenzoic acid/2-hydroxy-5-methoxybenzoic acid, 2,4,6-trihydroxyacetophenone or 6-aza-2-thiothymine, more preferably 2,5-dihydroxybenzoic acid. The present invention is especially directed to glycome matrix composition optimized for the use in negative ion mode, and to the use of the MALDI-TOF matrix and the matrix glycome compositions, that is optimized for the negative ion mode, for the analysis of glycome, preferably acidic glycome. The preferred matrices for negative ion mode are aromatic matrices, e.g. 2,4,6-trihydroxyacetophenone, 3-hydroxypicolinic acid, 2,5-dihydroxybenzoic acid, 2,5-dihydroxybenzoic acid/2-hydroxy-5-methoxybenzoic acid, or 6-aza-2-thiothymine, more preferably 2,4,6-trihydroxyacetophenone. The invention is further directed to analysis method and glycome-matrix composition for the analysis of glycome compositions, wherein the glycome composition comprises both negative and neutral glycome components. Preferred matrices for analysis of negative and neutral glycome components comprising glycome are aromatic matrices, e.g. 2,4,6-trihydroxyacetophenone, 3-hydroxypicolinic acid, 2,5-dihydroxybenzoic acid, 2,5-dihydroxybenzoic acid/2-hydroxy-5-methoxybenzoic acid, or 6-aza-2-thiothymine, more preferably 2,4,6-trihydroxyacetophenone.
1) Specifically and effectively co-crystallizing with glycome composition with the matrix, crystallizing meaning here forming a solid mixture composition allowing analysis of glycome involving two steps below
2) absorbing UV-light typically provided by a laser in MALDI-TOF instrument, preferred wavelength of the light is 337 nm as defined by the manuals of MALDI-TOF methods
3) transferring energy to the glycome composition so that these will ionize and be analyzable by the MALDI-TOF mass spectrometry. The present invention is especially directed to compositions of glycomes in complex with MALDI mass spectrometry matrix.
The present invention is specifically directed to methods of searching novel MALDI-matrixes with the above characteristic, preferably useful for analysis by the method below. The method for searching novel MALDI-matrixes using the method in the next paragraph.
The present invention is specifically directed to methods of analysis of glycomes by MALDI-TOF including the steps:
1) Specifically and effectively co-crystallizing the glycome composition with the MALDI-TOF-matrix, crystallizing meaning here forming a solid mixture composition allowing analysis of glycome involving two steps below
2) Providing UV linght to crystalline sample by a laser in MALDI-TOF instrument allowing the ionization of sample
3) Analysis of the ions produced by the MALDI mass spectrometer, preferably by TOF analysis. The invention is further directed to the combination of glycome purification methods and/or quantitative and qualitative data analysis methods according to the invention.
The present invention is further directed to essentially pure glycome compositions in solid co-crystalline form with MALDI matrix. The invention is preferably a neutral and/or acidic glycome as complex with a matrix optimized for analysis of the specific glycome type, preferably analysis in negative ion mode with a matrix such as 2,4,6-trihydroxyacetophenone. The invention is preferably a neutral (or non-acidic) glycome as complex with a matrix optimized for analysis in positive ion mode such as 2,5-dihydroxybenzoic acid.
The invention revealed that it is possible to analyze glycomes using very low amount of sample. The preferred crystalline glycome composition comprises between 0.1-100 pmol, more preferably 0.5-10 pmol, more preferably 0.5-5 pmol and more preferably about 0.5-3 pmol, more preferably about 0.5-2 pmol of sample co-crystallized with optimized amount of matrix preferably about 10-200 nmol, more preferably 30-150 nmol, and more preferably about 50-120 nmol and most preferably between 60-90 nmols of the matrix, preferably when the matrix is 2,5-dihydroxybenzoic acid. The matrix and analyte amounts are optimized for a round analysis spot with radius of about 1 mm and area of about 0.8 mm2. It is realized that the amount of materials can be changed in proportion of the area of the spot, smaller amount for smaller spot. Examples of preferred amounts per area of spot are 0.1-100 pmol/0.8 mm2 and 10-200 pmol/3 mm2. Preferred molar excess of matrix is about 5000-1000000 fold, more preferably about 10000-500000 fold and more preferably about 15000 to 200 000 fold and most preferably about 20000 to 100000 fold excess when the matrix is 2,5-dihydroxybenzoic acid.
It is realized that the amount and relative amount of new matrix is optimized based on forming suitable crystals and depend on chemical structure of the matrix. The formation of crystals is observed by microscope and further tested by performing test analysis by MALDI mass spectrometry.
The invention is further directed to specific methods for crystallizing MALDI-matrix with glycome. Preferred method for crystallization in positive ion mode includes steps: (1) optionally, elimination of impurities, like salts and detergents, which interfere with the crystallization, (2) providing solution of glycome in H2O or other suitable solvent in the preferred concentration, (3) mixing the glycome with the matrix in solution or depositing the glycome in solution on a precrystallized matrix layer and (4) drying the solution preferably by a gentle stream of air.
Preferred method for crystallization in negative ion mode includes steps: (1) optionally, elimination of impurities, like salts and detergents, which interfere with the crystallization, (2) providing solution of glycome in H2O or other suitable solvent in the preferred concentration, (3) mixing the glycome with the matrix in solution or depositing the glycome in solution on a precrystallized matrix layer and (4) drying the solution preferably by vacuum.
The invention is further directed to compositions of essentially pure glycome composition with specific glycan binding molecules such as lectins, glycosidases or glycosyltransferases and other glycosyl modifying enzymes such as sulfateses and/or phosphatases and antibodies. It is realized that these composition are especially useful for analysis of glycomes.
The present invention revealed that the complex glycome compositions can be effectively and even quantitatively modified by glycosidases even in very low amounts. It was revealed that the numerous glycan structures similar to target structures of the enzymes do not prevent the degradation by competitive inhibition, especially by the enzymes used. The invention is specifically directed to preferred amounts directed to MALDI analysis for use in composition with a glycosyl modifying enzyme, preferably present in low amounts. Preferred enzymes suitable for analysis include enzymes according to the Examples.
The invention is further directed to binding of specific component of glycome in solution with a specific binder. The preferred method further includes affinity chromatography step for purification of the bound component or analysis of the non-bound fraction and comparing it to the glycome solution without the binding substance. Preferred binders include lectins engineered to be lectins by removal of catalytic amino acids (methods published by Roger Laine, Anomeric, Inc., USA, and Prof Jukka Finne, Turku, Finland), lectins and antibodies or antibody fragments or minimal binding domains of the proteins.
The present invention is especially directed to the use of glycome data for production of mathematical formulas, or algorithms, for specific recognition or identification of specific tissue materials. Data analysis methods are presented in Examples.
The invention is especially directed to selecting specific “structural features” such as mass spectrometric signals (such as individual mass spectrometric signal corresponding to one or several monosaccharide compositions and/or glycan structures), or signal groups or subglycomes or signals corresponding to specific glycan classes, which are preferably according to the invention, preferably the signal groups (preferably defined as specific structure group by the invention), from quantitative glycome data, preferably from quantitative glycome data according to the invention, for the analysis of status of tissue materials. The invention is furthermore directed to the methods of analysis of the tissue materials by the methods involving the use of the specific signals or signal groups and a mathematical algorithm for analysis of the status of a tissue material.
Preferred algorithm includes use of proportion (such as %-proportion) of the specific signals from total signals as specific values (structural features) and creating a “glycan score”, which is algorithm showing characteristics/status of a tissue material based on the specific proportional signal intensities (or quantitative presence of glycan structures measured by any quantitation method such as specific binding proteins or quantitative chromatographic or electrophoresis analysis such as HPLC analysis). Preferably signals which are, preferably most specifically, upregulated in specific tissue materials and signals which are, preferably most specifically, downregulated in the tissue material in comparison to control tissue materials are selected to for the glycan score. In a preferred embodiment value(s) of downregulated signals are subtracted from upregulated signals when glycan score is calculated. The method yields largest score values for a specific tissue material type or types selected to be differentiated from other tissue materials.
The invention is specifically directed to methods for searching characteristic structural features (values) from glycome profiling data, preferably quantitative or qualitative glycome profiling data. The preferred methods include methods for comparing the glycome data sets obtained from different samples, or from average data sets obtained from a group of similar samples such as paraller samples from same or similar tissue material preparations. Methods for searching characteristic features are briefly described in the section: identification and classification of differences in glycan datasets. The comparison of datasets of the glycome data according to the invention preferably includes calculation of relative and/or absolute differences of signals, preferably each signal between two data sets, and in another preferred embodiment between three or more datasets. The method preferably further includes step of selecting the differing signals, or part thereof, for calculating glycan score.
It is further realized that the analyzed glycome data has other uses preferred by the invention such as use of the selected characteristic signals and corresponding glycan material:
1) for targets for structural analysis of glycans (preferably chemically by glycosidases, fragmentation mass spectrometry and/or NMR spectroscopy as shown by the present invention and/or structural analysis based on the presence of other signals and knowledge of biosynthesis of glycans). The preferred use for targets includes estimation of chemical characteristics of potential corresponding glycans for complete or partial purification/separation of the specific glycan(s). The preferred chemical characteristics to be analysed preferably include one or several of following properties: a) acidity (e.g. by presence of acidic residues such as sialic acid and/or sulfate and/or phosphate) for charge based separation, b) molecular weight or hydrodymanamic volume affecting chromatographic separation, e.g. estimation of the elution volume in gel filtration methods (the effect of acidic residue can be estimated from effects of similar structures and the “size” of HexNAc (GalNAc/GlcNAc) is in general twice the size of Hex (such as Gal, Man or Glc), c) estimation (e.g. based on composition and biosynthetic knowledge of glycans) of presence of epitopes for specific binding reagents for labelling identification in a mixture or for affinity purification, d) estimation of presence of target epitopes for specific glycosylmodifying enzymes including glycosidases and/or glycosyltransferases (types of binding reagents) or for specific chemical modification reagents (such as periodate for specific oxidation or acid for specific acid hydrolysis), for modification of glycans and recognition of the modification by potential chemical change such as incorporation of radioactive label or by change of mass spectrometric signal of the glycan for labelling identification in a mixture.
2) use of the signals or partially or fully analysed glycan structures corresponding to the signals for searching specific binding reagents for recognition of tissue materials which are preferably selected as described by the present invention (especially as described above) and in the methods for identification and classification of differences in glycan datasets and/or signals selected and/or tested by glycan score methods, are preferably selected for targets for structural analysis of glycans (preferably by glycosidases, fragmentation mass spectrometry and/or NMR spectroscopy as shown by the present invention) and/or for use of the signals or partially or fully analysed glycan structures corresponding to the signals for searching specific binding reagents for recognition of tissue materials.
The preferred method includes the step of comparing the values, and preferably presenting the score values in graphs such as ones shown in
It is realized that to differentiate a tissue materials type from other(s) different characteristic signals may be selected than for another tissue material type. The invention however revealed that for tissue materials and especially for human cancer patients preferred characteristic signals include ones selected in the Examples as described above. It is realized that a glycan score can be also created with less characteristic signals or with only part of signals and still relevant results can be obtained. The invention is further directed to methods for optimisation of glycan score algorithms and methods for selecting signals for glycan scores.
In case the specific proportion (value) of a characteristic signal is low in comparision to other values a specific factor can be selected for increase the relative “weight” of the value in the glycan scores to be calculated for the cell populations.
The preferred statuses of tissue materials, to be analysed by mathematical methods such as algorithms using quantitative glycome profiling data according to the invention include differentiation status, individual characteristics and mutation, cell culture or storage conditions related status, effects of chemicals or biochemicals on cells, and other statuses described by the invention.
The present invention is especially directed to following O-glycan marker structures of tissue materials:
Core 1 type O-glycan structures following the marker composition NeuAc2Hex1HexNAc1, preferably including structures SAα3Galβ3GalNAc and/or SAα3Galβ3(Saα6)GalNAc; and Core 2 type O-glycan structures following the marker composition NeuAc0-2Hex2+nHexNAc2dHex0-1, more preferentially further including the glycan series NeuAc0-2Hex2+nHexNAc2+ndHex0-1, wherein n is either 1, 2, or 3 and more preferentially n is 1 or 2, and even more preferentially n is 1;
more specifically preferably including R1Galβ4(R3)GlcNAcβ6(R2Galβ3)GalNAc,
wherein R1 and R2 are independently either nothing or sialic acid residue, preferably α2,3-linked sialic acid residue, or an elongation with HexnHexNAcn, wherein n is independently an integer at least 1, preferably between 1-3, most preferably between 1-2, and most preferably 1, and the elongation may terminate in sialic acid residue, preferably α2,3-linked sialic acid residue; and
R3 is independently either nothing or fucose residue, preferably α1,3-linked fucose residue. It is realized that these structures correlate with expression of β6GlcNAc-transferases synthesizing core 2 structures.
The present invention is especially directed to glycan compositions (structures) and analysis of high-mannose type and glucosylated N-glycans according to the formula:
Hexn3HexNAcn4,
wherein n3 is 5, 6, 7, 8, 9, 10, 11, or 12, and n4=2.
According to the present invention, within total N-glycomes of tissue materials the major high-mannose type and glucosylated N-glycan signals preferentially include the compositions with 5≦n3≦10: Hex5HexNAc2 (1257), Hex6HexNAc2 (1419), Hex7HexNAc2 (1581), Hex8HexNAc2 (1743), Hex9HexNAc2 (1905), and Hex10HexNAc2 (2067); and more preferably with 5≦n3≦9: Hex5HexNAc2 (1257), Hex6HexNAc2 (1419), Hex7HexNAc2 (1581), Hex8HexNAc2 (1743), and Hex9HexNAc2 (1905).
The present invention is especially directed to glycan compositions (structures) and analysis of low-mannose type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is 1, 2, 3, or 4, n4=2, and n5 is 0 or 1.
According to the present invention, within total N-glycomes of tissue materials the major low-mannose type N-glycan signals preferably include the compositions with 2≦n3≦4:
Hex2HexNAc2 (771), Hex3HexNAc2 (933), Hex4HexNAc2 (1095), Hex2HexNAc2dHex (917), Hex3HexNAc2dHex (1079), and Hex4HexNAc2dHex (1241); and more preferably when n5 is 0: Hex2HexNAc2 (771), Hex3HexNAc2 (933), and Hex4HexNAc2 (1095).
As demonstrated in the present invention by glycan structure analysis of tissue materials, preferably this glycan group in tissue materials includes the molecular structures:
(Manα)1-3Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc within the glycan signals 771, 917, 933, 1079, 1095, and 1095, and
the preferred low-Man structures includes structures common all tissue material types, tri-Man and tetra-Man structures according to the Examples,
(Manα)0-1Manα6(Manα3)Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc, more preferably the tri-Man structures:
even more preferably the abundant molecular structure:
Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc within the glycan signal 933.
Beside the qualitative variations the low-Man glycans have specific value in quantitative analysis of tissue materials. The present invention revealed that the low-Man glycans are especially useful for the analysis of status of the cells. For example the analysis in the Examples revealed that the amounts of the glycans vary between total tissue profiles and specific organelles, preferably lysosomes.
The group of low-Man glycans form a characteristic group among glycome compositions. The relative total amount of neutral glycans is notable in average human tissues. The glycan group was revealed also to be characteristic in cancerous tissues and tumorsa with total relative amount of neutral glycomes increased. The difference is more pronounced within lysosomal organelle-specific glycome, wherein low-Man structures accounted nearly 50% of the neutral protein-linked glycome. Glycome analysis of tissue materials is especially useful for methods for development of binder reagents for separation of different tissue materials.
The invention is directed to analysis of relative amounts of low-Man glycans, and to the specific quantitative glycome compositions, especially neutral glycan compositions, comprising about 0 to 50% of low-Man glycans, more preferably between about 1 to 50% of solid tissue glycomes, for the analysis of tissue materials according to the invention, and use of the composition for the analysis of tissue materials.
The present invention is especially directed to glycan compositions (structures) and analysis of fucosylated high-mannose type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is 5, 6, 7, 8, or 9, n4=2, and n5=1.
According to the present invention, within total N-glycomes of tissue materials the major fucosylated high-mannose type N-glycan signal preferentially is the composition Hex5HexNAc2dHex (1403).
The present invention is especially directed to glycan compositions (structures) and analysis of neutral soluble N-glycan type glycans according to the formula:
Hexn3HexNAcn4,
wherein n3 is 1, 2, 3, 4, 5, 6, 7, 8, or 9, and n4=1.
Within total N-glycomes of tissue materials the major soluble N-glycan signals include the compositions with 4≦n3≦8, more preferably 4≦n3≦7: Hex4HexNAc (892), Hex5HexNAc (1054), Hex6HexNAc (1216), Hex7HexNAc (1378). In the most preferred embodiment of the present invention, the major glycan signal in this group within total neutral glycomes of tissue materials is Hex5HexNAc (1054).
The present invention is especially directed to glycan compositions (structures) and analysis of neutral monoantennary or hybrid-type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is an integer greater or equal to 2, n4=3, and n5 is an integer greater or equal to 0.
According to the present invention, within total N-glycomes of tissue materials the major neutral monoantennary or hybrid-type N-glycan signals preferentially include the compositions with 2≦n3≦8 and 0≦n5≦2, more preferentially compositions with 3≦n3≦6 and 0≦n5≦1, with the proviso that when n3=6 also n5=0: preferentially Hex4HexNAc3 (1298), Hex4HexNAc3dHex (1444), Hex5HexNAc3 (1460), and Hex6HexNAc3 (1622).
The present invention is especially directed to glycan compositions (structures) and analysis of neutral complex-type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is an integer greater or equal to 3, n4 is an integer greater or equal to 4, and n5 is an integer greater or equal to 0.
Within the total N-glycomes of tissue materials the major neutral complex-type N-glycan signals preferentially include the compositions with 3≦n3≦8, 4≦n4≦7, and 0≦n5≦4, more preferentially the compositions with 3≦n3≦5, n4=4, and 0≦n5≦1, with the proviso that when n3 is 3 or 4, then n5=1: Hex3HexNAc4dHex (1485), Hex4HexNAc4dHex (1647), Hex5HexNAc4 (1663), Hex5HexNAc4dHex (1809); and even more preferentially also including the composition Hex3HexNAc5dHex (1688).
In another embodiment of the present invention, the N-glycan signal Hex3HexNAc4dHex (1485) contains non-reducing terminal GlcNAcβ, and more preferentially the total N-glycome includes the structure:
In yet another embodiment of the present invention, within the total N-glycome of tissue materials, the N-glycan signal Hex5HexNAc4dHex (1809), more preferentially also Hex5HexNAc4 (1663), contain non-reducing terminal β1,4-Gal. Even more preferentially the total N-glycome includes the structure:
Galβ4GlcNAcβ2Manα3(Galβ4GlcNAcβ2Manα6)Manβ4GlcNAcβ4GlcNAc (1663); and in a further preferred embodiment the total N-glycome includes the structure:
The present invention is especially directed to glycan compositions (structures) and analysis of neutral fucosylated N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n5 is an integer greater than or equal to 1.
Within the total N-glycomes of tissue materials the major neutral fucosylated N-glycan signals preferentially include glycan compositions wherein 1≦n5≦4, more preferentially 1≦n5≦3, even more preferentially 1≦n5≦2, and further more preferentially compositions Hex3HexNAc2dHex (1079), more preferentially also Hex2HexNAc2dHex (917), and even more preferentially also Hex5HexNAc4dHex (1809).
The inventors further found that within the total N-glycomes of tissue materials a major fucosylation form is N-glycan core a1,6-fucosylation. In a preferred embodiment of the present invention, major fucosylated N-glycan signals contain GlcNAcβ4(Fucα6)GlcNAc reducing end sequence.
Neutral N-Glycans with Non-Reducing Terminal HexNAc
The present invention is especially directed to glycan compositions (structures) and analysis of neutral N-glycans with non-reducing terminal HexNAc according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n4≧n3.
Preferably these glycan signals include Hex3HexNAc4dHex (1485) in all tissue materials.
The present invention is especially directed to glycan compositions (structures) and analysis of acidic hybrid-type or monoantennary N-glycans according to the formula:
NeuAcn1NeuGcn2Hexn3HexNAcn4dHexn5SPn6,
wherein n1 and n2 are either independently 1, 2, or 3; n3 is an integer between 3-9; n4 is 3; n5 is an integer between 0-3; and n6 is an integer between 0-2; with the proviso that the sum n1+n2+n6 is at least 1.
Within the total N-glycomes of tissue materials the major acidic hybrid-type or monoantennary N-glycan signals preferentially include glycan compositions wherein 3≦n3≦6, more preferentially 3≦n5≦5, and further more preferentially composition NeuAcHex4HexNAc3dHex (1711).
The present invention is especially directed to glycan compositions (structures) and analysis of acidic complex-type N-glycans according to the formula:
NeuAcn1NeuGcn2Hexn3HexNAcn4dHexn5SPn6,
wherein n1 and n2 are either independently 1, 2, 3, or 4; n3 is an integer between 3-10; n4 is an integer between 4-9; n5 is an integer between 0-5; and n6 is an integer between 0-2; with the proviso that the sum n1+n2+n6 is at least 1.
Within the total N-glycomes of tissue materials the major acidic complex-type N-glycan signals preferentially include glycan compositions wherein 4≦n4≦8, more preferentially 4≦n4≦6, more preferentially 4≦n4≦5, and further more preferentially compositions NeuAcHex5HexNAc4 (1930), NeuAcHex5HexNAc4dHex (2076), NeuAc2Hex5HexNAc4 (2221), NeuAcHex5HexNAc4dHex2 (2222), and NeuAc2Hex5HexNAc4dHex (2367).
The inventors found that tissue material total N-glycomes; and soluble+N-glycomes further contain characteristic modified glycan signals, including sialylated fucosylated N-glycans, multifucosylated glycans, sialylated N-glycans with terminal HexNAc (the N>H and N═H subclasses), and sulphated or phosphorylated N-glycans, which are subclasses of the abovementioned glycan classes. According to the present invention, their quantitative proportions in different tissue materials have characteristic values as described in Tables 8 and 13.
Specifically, major phosphorylated glycans typical to tissue materials, more preferentially to lysosomal organelle glycomes, include Hex5HexNAc2(HPO3) (1313), Hex6HexNAc2(HPO3) (1475), and Hex7HexNAc2(HPO3) (1637).
The preferred complete glycomes of tissue materials include low-mannose type, hybrid-type or monoantennary, hybrid, and complex-type N-glycans,
which more preferentially contain fucosylated glycans, even more preferentially also sialylated glycans, and further more preferentially also sulphated and/or phosphorylated glycans;
and most preferentially also including soluble glycans as described in the present invention.
In a preferred embodiment of the present invention the tissue material total N-glycome contains the three glycan types: 1) high-mannose type, 2) hybrid-type or monoantennary, and 3) complex-type N-glycans; and more preferably, in the case of solid tissues or cells also 4) low-mannose type N-glycans; and further more preferably, in the case of solid tissues or cells additionally 5) soluble glycans.
In a preferred embodiment of the preferred glycan type combinations within the tissue material complete glycomes, their relative abundances are as described in Tables of Examples.
The inventors revealed that the N-glycans released by specific N-glycan release methods from the cells according to the invention, and preferred cells according to the invention, comprise mostly a specific type of N-glycan core structure.
The preferred N-glycan structure of each cell type is characterised and recognized by treating cells with a N-glycan releasing enzyme releasing practically all N-glycans with core type according to the invention. The N-glycan relasing enzyme is preferably protein N-glycosidase enzyme, preferably by protein N-glycosidase releasing effectively the N-glycomes according to the invention, more preferably protein N-glycosidase with similar specificity as protein N-glycosidase F, and in a specifically preferred embodiment the enzyme is protein N-glycosidase F from F. meningosepticum. Alternative chemical N-glycan release method was used for controlling the effective release of the N-glycomes by the N-glycan relasing enzyme.
The inventors used the NMR glycome analysis according to the invention for further characterization of released N-glycomes from small cell samples available. NMR spectroscopy revealed the N-glycan core signals of the preferred N-glycan core type of the cells according to the invention.
The present invention is directed to glycomes derived from cells and comprising a common N-glycosidic core structures. The invention is specifically directed to minimum formulas covering both GN1-glycomes and GN2-glycomes with difference in reducing end structures.
The minimum core structure includes glycans from which reducing end GlcNAc or Fucα6GlcNAc has been released. These are referred as GN1-glycomes and the components thereof as GN1-glycans. The present invention is specifically directed to natural N-glycomes from cells comprising GN1-glycans. In a preferred embodiment the invention is directed to purified or isolated practically pure natural GN1-glycome from human cells. The release of the reducing end GlcNAc-unit completely or partially may be included in the production of the N-glycome or N-glycans from cells for analysis. The invention is specifically directed to soluble high/low mannose glycome of GN1-type.
The glycomes including the reducing end GlcNAc or Fucα6GlcNAc are referred as GN2-glycomes and the components thereof as GN2-glycans. The present invention is also specifically directed to natural N-glycomes from cells and tissues comprising GN2-glycans. In a preferred embodiment the invention is directed to purified or isolated practically pure natural GN2-glycome from cells.
The preferred N-glycan core structure(s) and/or N-glycomes from cells according to the invention comprise structure(s) according to
the formula NC1:
R1Mβ4GNXyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
It is realized that when the invention is directed to a glycome, the formula indicates mixture of several or typically more than ten or even higher number of different structures according to the Formulas describing the glycomes according to the invention.
The possible carbohydrate substituents R1 comprise at least one mannose (Man) residue, and optionally one or several GlcNAc, Gal, Fuc, SA and/GalNAc residues, with possible sulphate and or phosphate modifications.
When the glycome is released by N-glycosidase the free N-glycome saccharides comprise in a preferred embodiment reducing end hydroxyl with anomeric linkage A having structure a and/or β, preferably both α and β. In another embodiment the glycome is derivatized by a molecular structure which can be reacted with the free reducing end of a released glycome, such as amine, aminooxy or hydrazine or thiol structures. The derivatizing groups comprise typically 3 to 30 atoms in aliphatic or aromatic structures or can form terminal group spacers and link the glycomes to carriers such as solid phases or microparticels, polymeric carries such as oligosaccharides and/or polysaccharide, peptides, dendrimer, proteins, organic polymers such as plastics, polyethyleneglycol and derivatives, polyamines such as polylysines.
When the glycome comprises asparagine N-glycosides, A is preferably beta and R is linked asparagine or asparagine peptide. The peptide part may comprise multiple different aminoacid residues and typically multiple forms of peptide with different sequences derived from natural proteins carrying the N-glycans in cell materials according to the invention. It is realized that for example proteolytic release of glycans may produce mixture of glycopeptides. Preferably the peptide parts of the glycopeptides comprises mainly a low number of amino acid residues, preferably two to ten residues, more preferably two to seven amino acid residues and even more preferably two to five aminoacid residues and most preferably two to four amino acid residues when “mainly” indicates preferably at least 60% of the peptide part, more preferably at least 75% and most preferably at least 90% of the peptide part comprising the peptide of desired low number of aminoacid residues.
The preferred GN2— N-glycan core structure(s) and/or N-glycomes from cells according to the invention comprise structure(s) according to
the formula NC2:
R1Mβ4GNβ4(Fucα6)nGNyR2,
wherein n is 0 or 1 and
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein.
The preferred compositions thus include one or several of the following structures
NC2a: Mα3{Mα6}Mβ4GNβ4{Fucα6}n1GNyR2
NC2b: Mα6Mβ4GNβ4{Fucα6}n1GNyR2
NC2c: Mα3Mβ4GNβ4{Fucα6}n1GNyR2
More preferably compositions comprise at least 3 of the structures or most preferably both structures according to the formula NC2a and at least both fucosylated and non-fucosylated with core structure(s) NC2b and/or NC2c.
The preferred GN1— N-glycan core structure(s) and/or N-glycomes from cells according to the invention comprise structure(s) according to
the formula NC3:
R1Mβ4GNyR2,
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein.
The invention is specifically directed glycans and/or glycomes derived from preferred cells according to the present invention when the natural glycome or glycan comprises Multi-mannose GN1— N-glycan core structure(s) structure(s) according to
the formula NC4:
[R1Mα3]n3{R3Mα6}n2Mβ4GNXyR2,
R1 and R3 indicate nothing or one or two, natural type carbohydrate substituents linked to the core structures, when the substituents are α-linked mannose monosaccharide and/or oligosaccharides and the other variables are as described above.
Furthermore common elongated GN2—N-glycan core structures are preferred types of glycomes according to the invention
The preferred N-glycan core structures further include differently elongated GN2—N-glycan core structures according to the
formula NC5:
[R1Mα3]n3{R3Mα6}n2Mβ4GNβ4{Fucα6}n1GNyR2,
wherein n1, n2 and n3 are either 0 or 1 and
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 and R3 indicate nothing or 1-4, preferably 1-3, most preferably one or two, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein,
GN is GlcNAc, M is mannosyl-, [ ] indicate groups either present or absent in a linear sequence.
{ } indicates branching which may be also present or absent.
with the provision that at least n2 or n3 is 1. Preferably the invention is directed to compositions comprising with all possible values of n2 and n3 and all saccharide types when R1 and/or are R3 are oligosaccharide sequences or nothing.
The present invention is preferably directed to N-glycan glycomes comprising one or several of the preferred N-glycan core types according to the invention. The present invention is specifically directed to specific N-glycan core types when the compositions comprise N-glycan or N-glycans from one or several of the groups Low mannose glycans, High mannose glycans, Hybrid glycans, and Complex glycans, in a preferred embodiment the glycome comrise substantial amounts of glycans from at least three groups, more preferably from all four groups.
The invention revealed certain structural groups present in N-linked glycomes. The grouping is based on structural features of glycan groups obtained by classification based on the monosaccharide compositions and structural analysis of the structurel groups. The glycans were analysed by NMR, specific binding reagents including lectins and antibodies and specific glycosidases releasing monosaccharide residues from glycans. The glycomes are preferably analysed as neutral and acidic glycomes
The neutral glycomes mean glycomes comprising no acidic monosaccharide residues such as sialic acids (especially NeuNAc and NeuGc), HexA (especially GlcA, glucuronic acid) and acid modification groups such as phosphate and/or sulphate esters. There are four major types of neutral N-linked glycomes which all share the common N-glycan core structure: High-mannose N-glycans, low-mannose N-glycans, hydrid type and complex type N-glycans. These have characteristic monosaccharide compositions and specific substructures. The complex and hybrid type glycans may include certain glycans comprising monoantennary glycans.
The groups of complex and hybrid type glycans can be further analysed with regard to the presence of one or more fucose residues. Glycans containing at least one fucose units are classified as fucosylated. Glycans containing at least two fucose residues are considered as glycans with complex fucosylation indicating that other fucose linkages, in addition to the α1,6-linkage in the N-glycan core, are present in the structure. Such linkages include α1,2-, α1,3-, and α1,4-linkage.
Furthermore the complex type N-glycans may be classified based on the relations of HexNAc (typically GlcNAc or GalNAc) and Hex residues (typically Man, Gal). Terminal HexNAc glycans comprise at least three HexNAc units and at least two Hexose units so that the number of Hex Nac residues is at least larger or equal to the number of hexose units, with the provisiont that for non branched, monoantennary glycans the number of HexNAcs is larger than number of hexoses.
This consideration is based on presence of two GlcNAc units in the core of N-glycan and need of at least two Mannose units to for a single complex type N-glycan branch and three mannose to form a trimannosyl core structure for most complex type structures. A specific group of HexNAc N-Glycans contains the same number of HexNAcs and Hex units, when the number is at least 5.
The invention is further directed to glycans comprising terminal Mannose such as Mα6-residue or both Manα6- and Manα3-residues, respectively, can additionally substitute other Mα2/3/6 units to form a Mannose-type structures including hydrid, low-Man and High-Man structures according to the invention.
Preferred high- and low mannose type structures with GN2-core structure are according to the Formula M2:
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNβ4[{Fucα6}]mGNyR2
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0;
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein;
[ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
{ } indicates a branch in the structure.
Preferred yR2-structures include [β-N-Asn]p, wherein p is either 0 or 1.
As described above a preferred variant of N-glycomes comprising only single GlcNAc-residue in the core. Such structures are especially preferred as glycomes produced by endo-N-acetylglucosaminidase enzymes and Soluble glycomes. Preferred Mannose type glycomesnclude structures according to the
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNyR2 Formula M2
Fucosylated high-mannose N-glycans according to the invention have molecular compositions Man5-9GlcNAc2Fuc1. For the fucosylated high-mannose glycans according to the formula, the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is an integer from 4 to 8 and m is 0.
The low-mannose structures have molecular compositions Man14GlcNAc2Fuc0-1. They consist of two subgroups based on the number of Fuc residues: 1) nonfucosylated low-mannose structures have molecular compositions Man1-4GlcNAc2 and 2) fucosylated low-mannose structures have molecular compositions Man1-4GlcNAc2Fuc1. For the low mannose glycans the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); and preferably n1, n3, n6, and n7 are 0 when m is 0.
The invention revealed a very unusual group glycans in N-glycomes of invention defined here as low mannose N-glycans. These are not clearly linked to regular biosynthesis of N-glycans, but may represent unusual biosynthetic midproducts or degradation products. The low mannose glycans are especially characteristics changing during the changes of cell status, the differentiation and other changes according to the invention, for examples changes associated with differentiation status of cells and their differentiated products and control cell materials.
The invention is especially directed to recognizing low amounts of low-mannose type glycans in cell types, such as with low degree of differentiation.
The invention revealed large differences between the low mannose glycan expression in the cell and tissue glycomes and material from tissue secretions such as human serum.
The invention is especially directed to the use of specific low mannose glycan comprising glycomes for analysis of tissues and cells, preferably cultivated cells.
The invention further revealed specific mannose directed recognition methods useful for recognizing the preferred glycomes according to the invention. The invention is especially directed to combination of glycome analysis and recognition by specific binding agents, most preferred binding agent include enzymes and these derivatives. The invention further revealed that specific low mannose glycans of the low mannose part of the glycomes can be recognized by degradation by specific α-mannosidase (Man2-4GlcNAc2Fuc0-1) or β-mannosidase (Man1GlcNAc2Fuc0-1) enzymes and optionally further recognition of small low mannose structures, even more preferably low mannose structures comprising terminal Manβ4-structures according to the invention.
The low mannose N-glycans, and preferred subgroups and individual structures thereof, are especially preferred as markers of the novel glycome compositions of the cells according to the invention useful for characterization of the cell types.
The low-mannose type glycans includes a specific group of α3- and/or α6-linked mannose type structures according to the invention including a preferred terminal and core structure types according to the invention.
The inventions further revealed that low mannose N-glycans comprise a unique individual structural markers useful for characterization of the cells according to the invention by specific binding agents according to the invention or by combinations of specific binding agents according to the invention.
Neutral low-mannose type N-glycans comprise one to four or five terminal Man-residues, preferentially Manα structures; for example Manα0-3Manβ4GlcNAcβ4GlcNAc(β-N-Asn) or Manαo-4Manβ4GlcNAcβ4(Fucα6)GlcNAc(β-N-Asn).
Low-mannose N-glycans are smaller and more rare than the common high-mannose N-glycans (Man5-9GlcNAc2). The low-mannose N-glycans detected in cell samples fall into two subgroups: 1) non-fucosylated, with composition MannGlcNAc2, where 1≦n≦4, and 2) core-fucosylated, with composition MannGlcNAc2Fuc1, where 1≦n≦5. The largest of the detected low-mannose structure structures is Man5GlcNAc2Fuc1 (m/z 1403 for the sodium adduct ion), which due to biosynthetic reasons most likely includes the structure below (in the figure the glycan is free oligosaccharide and β-anomer; in glycoproteins in tissues the glycan is N-glycan and β-anomer):
According to the present invention, low-mannose structures are preferentially identified by mass spectrometry, preferentially based on characteristic Hex1-4HexNAc2dHex0-1 monosaccharide composition. The low-mannose structures are further preferentially identified by sensitivity to exoglycosidase digestion, preferentially α-mannosidase (Hex2-4HexNAc2dHexc0-1) or 3-mannosidase (Hex1HexNAc2dHex0-1) enzymes, and/or to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins, Endoglycosidase H detachment from glycoproteins (only Hex1-4HexNAc2 liberated as Hex1-4HexNAc1), and/or Endoglycosidase F2 digestion (only Hex1-4HexNAc2dHex1 digested to Hex1-4HexNAc1). The low-mannose structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manβ4GlcNAcβ4GlcNAc N-glycan core structure and Manα residues attached to the Manβ4 residue.
Several preferred low Man glycans described above can be presented in a single Formula:
[Mα3]n2{[Mα6)]n4}[Mα6]n5{[Mα3]n8}Mβ4GNβ4[{Fucα6}]mGNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0; the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); [ ] indicates determinant either being present or absent depending on the value of n2, n4, n5, n8, and m; and
{ } indicates a branch in the structure;
y and R2 are as indicated above.
Preferred non-fucosylated low-mannose glycans are according to the formula:
[Mα3]n2([Mα6)]n4)[Mα6]n5{[Mα3]n8}Mβ4GNβ4GNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1,
with the provisio that when n5 is 0, also n2 and n4 are 0, and preferably either n2 or n4 is 0,
[ ] indicates determinant either being present or absent
depending on the value of, n2, n4, n5, n8,
{ } and ( ) indicates a branch in the structure,
y and R2 are as indicated above.
Small non-fucosylated low-mannose structures are especially unsual among known N-linked glycans and characteristic glycans group useful for separation of cells according to the present invention. These include:
Mβ4GNβ4GNyR2 trisaccharide epitope is a preferred common structure alone and together with its mono-mannose derivatives Mα6Mβ4GNβ4GNyR2 and/or Mα3Mβ4GNβ4GNyR2, because these are characteristic structures commonly present in glycomes according to the invention. The invention is specifically directed to the glycomes comprising one or several of the small non-fucosylated low-mannose structures. The tetrasaccharides are in a specific embodiment preferred for specific recognition directed to α-linked, preferably α3/6-linked Mannoses as preferred terminal recognition element.
The invention further revealed large non-fucosylated low-mannose structures that are unsual among known N-linked glycans and have special characteristic expression features among the preferred cells according to the invention. The preferred large structures include
[Mα3]n2([Mα6]n4)Mα6{Mα3}Mβ4GNβ4GNyR2
more specifically
The hexasaccharide epitopes are preferred in a specific embodiment as rare and characteristic structures in preferred cell types and as structures with preferred terminal epitopes. The heptasaccharide is also preferred as structure comprising a preferred unusual terminal epitope Mα3(Mα6)Mα useful for analysis of cells according to the invention.
Preferred fucosylated low-mannose glycans are derived according to the formula:
[Mα3]n2{[Mα6]n4}[Mα6]n5{[Mα3]n8}Mβ4GNβ4(Fucα6)GNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1, with the provisio that when n5 is 0, also n2 and n4 are 0, [ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, ( ) indicates a branch in the structure;
and wherein n1, n2, n3, n4 and m are either independently 0 or 1, with the provisio that when n3 is 0, also n1 and n2 are 0,
[ ] indicates determinant either being present or absent
depending on the value of n1, n2, n3, n4 and m,
{ } and ( ) indicate a branch in the structure.
Small fucosylated low-mannose structures are especially unusual among known N-linked glycans and form a characteristic glycan group useful for separation of cells according to the present invention. These include:
Mβ4GNβ4(Fucα6)GNyR2 tetrasaccharide epitope is a preferred common structure alone and together with its mono-mannose derivatives Mα6Mβ4GNβ4(Fucα6)GNyR2 and/or Mα3Mβ4GNβ4(Fucα6)GNyR2, because these are commonly present characteristics structures in glycomes according to the invention. The invention is specifically directed to the glycomes comprising one or several of the small non-fucosylated low-mannose structures. The tetrasaccharides are in a specific embodiment preferred for specific recognition directed to α-linked, preferably α3/6-linked Mannoses as preferred terminal recognition element.
The invention further revealed large fucosylated low-mannose structures are unusual among known N-linked glycans and have special characteristic expression features among the preferred cells according to the invention. The preferred large structure includes
[Mα3]n2([Mα6]n4)Mα6{Mα3}Mβ4GNβ4(Fucα6)GNyR2
more specifically
The heptasaccharide epitopes are preferred in a specific embodiment as rare and characteristic structures in preferred cell types and as structures with preferred terminal epitopes. The octasaccharide is also preferred as structure comprising a preferred unusual terminal epitope Mα3(Mα6)Mα useful for analysis of cells according to the invention.
The inventors revealed that mannose-structures can be labeled and/or otherwise specifically recognized on cell surfaces or cell derived fractions/matrials of specific cell types. The present invention is directed to the recognition of specific mannose epitopes on cell surfaces by reagents binding to specific mannose structures from cell surfaces.
The preferred reagents for recognition of any structures according to the invention include specific antibodies and other carbohydrate recognizing binding molecules. It is known that antibodies can be produced for the specific structures by various immunization and/or library technologies such as phage display methods representing variable domains of antibodies. Similarily with antibody library technologies, including aptamer technologies and including phage display for peptides, exist for synthesis of library molecules such as polyamide molecules including peptides, especially cyclic peptides, or nucleotide type molecules such as aptamer molecules.
The invention is specifically directed to specific recognition high-mannose and low-mannose structures according to the invention. The invention is specifically directed to recognition of non-reducing end terminal Manα-epitopes, preferably at least disaccharide epitopes, according to the formula:
[Mα2]m1 [Mαx]m2[Mα6]m3 {{[Mα2]m9[Mα2]m8[Mα3]m7}m10(Mβ4[GN]m4)m5}m6yR2
wherein m1, m2, m3, m4, m5, m6, m7, m8, m9 and m10 are independently either 0 or 1; with the proviso that when m3 is 0, then m1 is 0 and, when m7 is 0 then either m1-5 are 0 and m8 and m9 are 1 forming Mα2Mα2-disaccharide or both m8 and m9 are 0
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative
and x is linkage position 3 or 6 or both 3 and 6 forming branched structure,
{ } indicates a branch in the structure.
The invention is further directed to terminal Mα2-containing glycans contain at least one Mα2-group and preferably Mα2-group on each, branch so that m1 and at least one of m8 or m9 is 1. The invention is further directed to terminal Mα3 and/or Mα6-epitopes without terminal Mα2-groups, when all m1, m8 and m9 are 1.
The invention is further directed in a preferred embodiment to the terminal epitopes linked to a Mβ-residue and for application directed to larger epitopes. The invention is especially directed to Mβ4GN-comprising reducing end terminal epitopes.
The preferred terminal epitopes comprise typically 2-5 monosaccharide residues in a linear chain. According to the invention short epitopes comprising at least 2 monosaccharide residues can be recognized under suitable background conditions and the invention is specifically directed to epitopes comprising 2 to 4 monosaccharide units and more preferably 2-3 monosaccharide units, even more preferred epitopes include linear disaccharide units and/or branched trisaccharide non-reducing residue with natural anomeric linkage structures at reducing end. The shorter epitopes may be preferred for specific applications due to practical reasons including effective production of control molecules for potential binding reagents aimed for recognition of the structures.
The shorter epitopes such as Mα2M-may is often more abundant on target cell surface as it is present on multiple arms of several common structures according to the invention.
Manα2Man, Manα3Man, Manα6Man, and more preferred anomeric forms Manα2Manα, Manα3Manβ, Manα6Manβ, Manα3Manα and Manα6Manα.
Preferred branched trisaccharides includes Manα3(Manα6)Man, Manα3(Manα6)Manβ, and Manα3(Manα6)Manα.
The invention is specifically directed to the specific recognition of non-reducing terminal Manα2-structures especially in context of high-mannose structures.
The invention is specifically directed to following linear terminal mannose epitopes: a) preferred terminal Manα2-epitopes including following oligosaccharide sequences:
Manα2Man,
Manα2Manα,
Manα2Manα2Man, Manα2Manα3Man, Manα2Manα6Man,
Manα2Manα2Manα, Manα2Manα3Manβ, Manα2Manα6Manα,
Manα2Manα2Manα3Man, Manα2Manα3Manα6Man, Manα2Manα6Manα6Man
Manα2Manα2Manα3Manβ, Manα2Manα3Manα6Man(3, Manα2Manα6Manα6Manβ;
The invention is further directed to recognition of and methods directed to non-reducing end terminal Manα3- and/or Manα6-comprising target structures, which are characteristic features of specifically important low-mannose glycans according to the invention. The preferred structural groups includes linear epitopes according to b) and branched epitopes according to the c3) especially depending on the status of the target material.
b) preferred terminal Manα3- and/or Manα6-epitopes including following oligosaccharide sequences:
Manα3Man, Manα6Man, Manα3Manβ, Manα6Manβ, Manα3Manα, Manα6Manα, Manα3Manα6Man, Manα6Manα6Man, Manα3Manα6Manβ, Manα6Manα6Manβ
and to following
c) branched terminal mannose epitopes, are preferred as characteristic structures of especially high mannose structures (c1 and c2) and low-mannose structures (c3), The preferred branched epitopes include:
c1) branched terminal Manα2-epitopes
Manα2Manα3(Manα2Manα6)Man, Manα2Manα3(Manα2Manα6)Manα,
Manα2Manα3(Manα2Manα6)Manα6Man, Manα2Manα3(Manα2Manα6)Manα6Manβ,
Manα2Manα3(Manα2Manα6)Manα6(Manα2Manα3)Man,
Manα2Manα3(Manα2Manα6)Manα6(Manα2Manα2Manα3)Man,
Manα2Manα3(Manα2Manα6)Manα6(Manα2Manα3)Manβ
Manα2Manα3(Manα2Manα6)Manα6(ManaManα2Manα3)Manβ
c2) branched terminal Manα2- and Manα3 or Manα6-epitopes according to formula when m1 and/or m8 and/m9 is 1 and the molecule comprise at least one nonreducing end terminal Manα3 or Manα6-epitope
c3) branched terminal Manα3 or Manα6-epitopes
Manα3(Manα6)Man, Manα3(Manα6)Manβ, Manα3(Manα6)Manα,
Manα3(Manα6)Manα6Man, Manα3(Manα6)Manα6Manβ,
Manα3(Manα6)Manα6(Manα3)Man, Manα3(Manα6)Manα6(Manα3)Manβ
The present invention is further directed to increase of selectivity and sensitivity in recognition of
Target glycans by combining recognition methods for terminal Manα2 and Manα3 and/or Manα6-comprising structures. Such methods would be especially useful in context of cell material according to the invention comprising both high-mannose and low-mannose glycans.
According to the present invention, complex-type structures are preferentially identified by mass spectrometry, preferentially based on characteristic monosaccharide compositions, wherein HexNAc≧4 and Hex≧3. In a more preferred embodiment of the present invention, 4≦HexNAc≦20 and 3≦Hex≦21, and in an even more preferred embodiment of the present invention, 4≦HexNAc≦10 and 3≦Hex≦11. The complex-type structures are further preferentially identified by sensitivity to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The complex-type structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc N-glycan core structure and GlcNAc residues attached to the Manα3 and/or Manα6 residues.
Beside Mannose-type glycans the preferred N-linked glycomes include GlcNAcβ2-type glycans including Complex type glycans comprising only GlcNAcβ2-branches and Hydrid type glycan comprising both Mannose-type branch and GlcNAcβ2-branch.
The invention revealed GlcNAcβ2Man structures in the glycomes according to the invention. Preferably GlcNAcβ2Man-structures comprise one or several of GlcNAcβ2Manα-structures, more preferably GlcNAcβ2Manα3 or GlcNAcβ2Manα6-structure.
The Complex type glycans of the invention comprise preferably two GlcNAcβ2Manα structures, which are preferably GlcNAcβ2Manα3 and GlcNAcβ2Manα6-. The Hybrid type glycans comprise preferably GlcNAcβ2Manα3-structure.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the
[R1GNβ2]n1[Mα3]n2{[R3]n3 [GNβ2]n4Mα6}n5Mβ4GNXyR2, Formula GNβ2
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein n1, n2, n3, n4, n5 and nx, are either 0 or 1, independently,
with the proviso that when n2 is 0 then n1 is 0 and when n3 is 1 or/and n4 is 1 then n5 is also 1, and at least n1 or n4 is 1, or n3 is 1,
when n4 is 0 and n3 is 1 then R3 is a mannose type substituent or nothing and
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1, Rx and R3 indicate independently one, two or three, natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The substituents R1, Rx and R3 may form elongated structures. In the elongated structures R1, and Rx represent substituents of GlcNAc (GN) and R3 is either substituent of GlcNAc or when n4 is 0 and n3 is 1 then R3 is a mannose type substituent linked to mannosea6-branch forming a Hybrid type structure. The substituents of GN are monosaccharide Gal, GalNAc, or Fuc or and acidic residue such as sialic acid or sulfate or fosfate ester.
GlcNAc or GN may be elongated to N-acetyllactosaminyl also marked as GalβGN or di-N-acetyllactosdiaminyl GalNAcβGlcNAc preferably GalNAcβ4GlcNAc. LNβ2M can be further elongated and/or branched with one or several other monosaccharide residues such as by galactose, fucose, SA or LN-unit(s) which may be further substituted by SAα-strutures, and/or Mα6 residue and/or Mα3 residues can be further substituted one or two β6-, and/or β4-linked additional branches according to the formula,
and/or either of Mα6 residue or Mα3 residue may be absent
and/or Mα6-residue can be additionally substitutes other Manα units to form a hybrid type structures
and/or Manβ4 can be further substituted by GNβ4,
and/or SA may include natural substituents of sialic acid and/or it may be substituted by other SA-residues preferably by α8- or α9-linkages.
The SAα-groups are linked to either 3- or 6-position of neighboring Gal residue or on 6-position of GlcNAc, preferably 3- or 6-position of neighboring Gal residue. In separately preferred embodiments the invention is directed structures comprising solely 3-linked SA or 6-linked SA, or mixtures thereof.
According to the present invention, hybrid-type or monoantennary structures are preferentially identified by mass spectrometry, preferentially based on characteristic monosaccharide compositions, wherein HexNAc=3 and Hex≧2. In a more preferred embodiment of the present invention 2≦Hex≦11, and in an even more preferred embodiment of the present invention 2≦Hex≦9. The hybrid-type structures are further preferentially identified by sensitivity to exoglycosidase digestion, preferentially α-mannosidase digestion when the structures contain non-reducing terminal α-mannose residues and Hex≧3, or even more preferably when Hex≧4, and to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The hybrid-type structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc N-glycan core structure, a GlcNAcβ residue attached to a Manα residue in the N-glycan core, and the presence of characteristic resonances of non-reducing terminal α-mannose residue or residues.
The monoantennary structures are further preferentially identified by insensitivity to α-mannosidase digestion and by sensitivity to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The monoantennary structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3Manβ4GlcNAcβ4GlcNAc N-glycan core structure, a GlcNAcβ residue attached to a Manα residue in the N-glycan core, and the absence of characteristic resonances of further non-reducing terminal α-mannose residues apart from those arising from a terminal α-mannose residue present in a ManαManβ sequence of the N-glycan core.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the
R1GNβ2Mα3{[R3]n3Mα6}Mβ4GNXyR2, Formula HY1
wherein n3, is either 0 or 1, independently,
and
wherein X is glycosidically linked disaccharide epitope 134(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or 13 or linkage from derivatized anomeric carbon, and
R1 indicate nothing or substituent or substituents linked to GlcNAc, R3 indicates nothing or Mannose-substituent(s) linked to mannose residue, so that each of R1, and R3 may correspond to one, two or three, more preferably one or two, and most preferably at least one natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The preferred hydrid type structures include one or two additional mannose residues on the preferred core stucture.
R1GNβ2Mα3{[Mα3]m1([Mα6]m2Mα6}Mβ4GNXyR2, Formula HY2
wherein n3, is either 0 or 1, and m1 and m2 are either 0 or 1, independently,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula HY1.
Furthermore the invention is directed to structures comprising additional lactosamine type structures on GNβ2-branch. The preferred lactosamine type elongation structures includes N-acetyllactosamines and derivatives, galactose, GalNAc, GlcNAc, sialic acid and fucose.
Preferred structures according to the formula HY2 include:
Structures containing non-reducing end terminal GlcNAc
As a specific preferred group of glycans
GNβ2Mα3{Mα3Mα6}Mβ4GNXyR2,
GNβ2Mα3{Mα6Mα6}Mβ4GNXyR2,
GNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2,
and/or elongated variants thereof
R1GNβ2Mα3{Mα3Mα6}Mβ4GNXyR2,
R1GNβ2Mα3{Mα6Mα6}Mβ4GNXyR2,
R1GNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2,
[R1Gal[NAc]o2βz]o1GNβ2Mα3{[Mα3]m1[(Mα6)]m2Mα6}n5Mβ4GNXyR2, Formula HY3
wherein n1, n2, n3, n5, m1, m2, o1 and o2 are either 0 or 1, independently,
z is linkage position to GN being 3 or 4, ? in a preferred embodiment 4,
R1 indicates on or two a N-acetyllactosamine type elongation groups or nothing,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula HY1.
Preferred structures according to the formula HY3 include especially structures containing non-reducing end terminal Galβ, preferably Galβ3/4 forming a terminal N-acetyllactosamine structure. These are preferred as a special group of Hybrid type structures, preferred as a group of specific value in characterization of balance of Complex N-glycan glycome and High mannose glycome:
GalβzGNβ2Mα3{Mα3Mα6}Mβ4GNXyR2, GalβzGNβ2Mα3{Mα 6Mα6}Mβ4GNXyR2, GalβzGNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2,
and/or elongated variants thereof preferred for carrying additional characteristic terminal structures useful for characterization of glycan materials
R1GalβzGNβ2Mα3{Mα3Mα6}Mβ4GNXyR2,
R1GalβzGNβ2Mα3{Mα 6Mα6}Mβ4GNXyR2,
R1GalβzGNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2. Preferred elongated materials include structures wherein R1 is a sialic acid, more preferably NeuNAc or NeuGc.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the
[R1GNβ2]n1[Mα3]n2{[R3GNβ2]n4Mα6}n5Mβ4GNXyR2 Formula CO1
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein n1, n2, n4, n5 and nx, are either 0 or 1, independently, with the proviso that when n2 is 0 then n1 is 0 and when n4 is 1 then n5 is also 1, and at least n1 is 1 or n4 is 1, and at least either of n1 and n4 is 1
and
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1, Rx and R3 indicate independently one, two or three, natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The present invention revealed incomplete Complex monoantennary N-glycans, which are unusual and useful for characterization of glycomes according to the invention. The most of the in complete monoantennary structures indicate potential degradation of biantennary N-glycan structures and are thus preferred as indicators of cellular status. The incomplete Complex type monoantennary glycans comprise only one GNβ2-structure.
The invention is specifically directed to structures are according to the Formula CO1 above when only n1 is 1 or n4 is one and mixtures of such structures.
The preferred mixtures comprise at least one monoantennary complex type glycans
A) with single branches from a likely degradative biosynthetic process:
R1GNβ2Mα3β4GNXyR2
R3GNβ2Mα6Mβ4GNXyR2 and
B) with two branches comprising mannose branches
B1) R1GNβ2Mα3{Mα6}n5Mβ4GNXyR2
B2) Mα3{R3GNβ2Mα6}n5Mβ4GNXyR2
The structure B2 is preferred with A structures as product of degradative biosynthesis, it is especially preferred in context of lower degradation of Manα3-structures. The structure B1 is useful for indication of either degradative biosynthesis or delay of biosynthetic process
The inventor revealed a major group of biantennary and multiantennary N-glycans from cells according to the invention, the preferred biantennary and multiantennary structures comprise two GNβ2 structures.
These are preferred as an additional characteristics group of glycomes according to the invention and are represented according to the Formula CO2:
R1GNβ2Mα3{R3GNβ2Mα6}Mβ4GNXyR2
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein nx is either 0 or 1,
and other variables are according to the Formula CO1.
A biantennary structure comprising two terminal GNβ-epitopes is preferred as a potential indicator of degradative biosynthesis and/or delay of biosynthetic process. The more preferred structures are according to the Formula CO2 when R1 and R3 are nothing.
The invention revealed specific elongated complex type glycans comprising Gal and/or GalNAc-structures and elongated variants thereof. Such structures are especially preferred as informative structures because the terminal epitopes include multiple informative modifications of lactosamine type, which characterize cell types according to the invention. The present invention is directed to at least one of natural oligosaccharide sequence structure or group of structures and corresponding structure(s) truncated from the reducing end of the N-glycan according to the
[R1Gal[NAc]o2βz2]o1GNβ2Mα3{[R1Gal[NAc]o4βz2]o3GNβ2Mα6}Mβ4GNXyR2, Formula CO3
with optionally one or two or three additional branches according to formula [RxGNβz1]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein nx, o1, o2, o3, and o4 are either 0 or 1, independently,
with the provisio that at least o1 or o3 is 1, in a preferred embodiement both are 1
z2 is linkage position to GN being 3 or 4, ? in a preferred embodiment 4,
z1 is linkage position of the additional branches.
R1, Rx and R3 indicate on or two a N-acetyllactosamine type elongation groups or nothing,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula CO1.
The inventors characterized especially directed to digalactosylated structure
GalβzGNβ2Mα3{GalβzGNβ2Mα6}Mβ4GNXyR2,
and monogalactosylated structures:
GalβzGNβ2Mα3{GNβ2Mα6}Mβ4GNXyR2,
GNβ2Mα3{GalβzGNβ2Mα 6}Mβ4GNXyR2,
and/or elongated variants thereof preferred for carrying additional characteristic terminal structures useful for characterization of glycan materials
R1GalβzGNβ2Mα3{R3GalβzGNβ2Mα6}Mβ4GNXyR2
R1GalβzGNβ2Mα3{GNβ2Mα6}Mβ4GNXyR2, and
GNβ2Mα3{R3GalβzGNβ2Mα 6}Mβ4GNXyR2.
Preferred elongated materials include structures wherein R1 is a sialic acid, more preferably NeuNAc or NeuGc.
The present invention revealed for the first time LacdiNAc, GalNacbGlcNAc structures from the cell according to the invention. Preferred N-glycan lacdiNAc structures are included in structures according to the Formula CO1, when at least one the variable o2 and o4 is 1.
The acidic glycomes mean glycomes comprising at least one acidic monosaccharide residue such as sialic acids (especially NeuNAc and NeuGc) forming sialylated glycome, HexA (especially GlcA, glucuronic acid) and/or acid modification groups such as phosphate and/or sulphate esters.
According to the present invention, presence of phosphate and/or sulphate ester (SP) groups in acidic glycan structures is preferentially indicated by characteristic monosaccharide compositions containing one or more SP groups. The preferred compositions containing SP groups include those formed by adding one or more SP groups into non-SP group containing glycan compositions, while the most preferential compositions containing SP groups according to the present invention are selected from the compositions described in the acidic N-glycan fraction glycan group tables. The presence of phosphate and/or sulphate ester groups in acidic glycan structures is preferentially further indicated by the characteristic fragments observed in fragmentation mass spectrometry corresponding to loss of one or more SP groups, the insensitivity of the glycans carrying SP groups to sialidase digestion. The presence of phosphate and/or sulphate ester groups in acidic glycan structures is preferentially also indicated in positive ion mode mass spectrometry by the tendency of such glycans to form salts such as sodium salts as described in the Examples of the present invention. Sulphate and phosphate ester groups are further preferentially identified based on their sensitivity to specific sulphatase and phosphatase enzyme treatments, respectively, and/or specific complexes they form with cationic probes in analytical techniques such as mass spectrometry.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to
[{SAα3/6}s1LNβ2]r1Mα3{({SAα3/6}s2LNβ2)r2Mα6}r8{M[β4GN[β4{Fucα6}r3GN]r4]r5}r6 (I)
with optionally one or two or three additional branches according to formula
{SAα3/6}s3LNβ, (IIb)
wherein r1, r2, r3, r4, r5, r6, r7 and r8 are either 0 or 1, independently,
wherein s1, s2 and s3 are either 0 or 1, independently,
with the proviso that at least r1 is 1 or r2 is 1, and at least one of s1, s2 or s3 is 1.
LN is N-acetyllactosaminyl also marked as GalβGN or di-N-acetyllactosdiaminyl GalNAcβGlcNAc preferably GalNAcβ4GlcNAc, GN is GlcNAc, M is mannosyl-,
with the proviso LNβ2M or GNβ2M can be further elongated and/or branched with one or several other monosaccharide residues such as by galactose, fucose, SA or LN-unit(s) which may be further substituted by SAα-structures,
and/or one LNβ can be truncated to GNP
and/or Mα6 residue and/or Mα3 residues can be further substituted one or two β6-, and/or β4-linked additional branches according to the formula,
and/or either of Mα6 residue or Mα3 residue may be absent and/or Mα6-residue can be additionally substitutes other Manα units to form a hybrid type structures
and/or Manβ4 can be further substituted by GNβ4,
and/or SA may include natural substituents of sialic acid and/or it may be substituted by other SA-residues preferably by α8- or α9-linkages.
( ) { }, H and [ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
The SAα-groups are linked to either 3- or 6-position of neighboring Gal residue or on 6-position of GlcNAc, preferably 3- or 6-position of neighboring Gal residue. In separately preferred embodiments the invention is directed structures comprising solely 3-linked SA or 6-linked SA, or mixtures thereof. In a preferred embodiment the invention is directed to glycans wherein r6 is 1 and r5 is 0, corresponding to N-glycans lacking the reducing end GlcNAc structure.
The LN unit with its various substituents can in a preferred general embodiment represented by the formula:
[Gal(NAc)n1α3]n2{Fucα2}n3Gal(NAc)n4β3/4{Fucα4/3}n5GlcNAcβ
wherein n1, n2, n3, n4, and n5 are independently either 1 or 0,
with the provisio that
the substituents defined by n2 and n3 are alternative to presence of SA at the non-reducing end terminal
the reducing end GlcNAc-unit can be further β3- and/or β6-linked to another similar LN-structure forming a poly-N-acetyllactosamine structure
with the provision that for this LN-unit n2, n3 and n4 are 0,
the Gal(NAc)β and GlcNAcβ units can be ester linked a sulphate ester group,
( ) and [ ] indicate groups either present or absent in a linear sequence; { } indicates branching which may be also present or absent.
LN unit is preferably Galβ4GN and/or Galβ3GN.
In the present invention, glycan signals with preferential monosaccharide compositions can be grouped into structure groups based on classification rules described in the present invention. The present invention includes parallel and overlapping classification systems that are used for the classification of the glycan structure groups.
Glycan signals isolated from the N-glycan fractions from the tissue material types studied in the present invention are grouped into glycan structure groups based on their preferential monosaccharide compositions according to the invention, in glycan group Tables of Examples for neutral glycan fractions and for acidic glycan fractions. Taken together, the analyses revealed that all the structure groups according to the invention are present in the studied tissue material types. In another aspect of the present invention, the glycan structure grouping is used to compare different tissue materials and characterize their specific glycosylation features. According to the present invention the discovered and analyzed differencies between the glycan signals within the glycan signal groups between different tissue material samples are used for comparison and characterization.
The quantitative glycan profiling combined with glycan structural classification is used according to the present invention to characterize and identify glycosylation features occurring in tissue materials, glycosylation features specific for certain tissue materials as well as differencies between different tissue materials. According to the present invention, the classification is used to characterize and compare glycosylation features of different tissues, of normal and diseased tissues, preferentially cancerous tissues, and solid tissues such as lung tissue and fluid tissues such as blood and/or serum. In another aspect of the present invention, the glycan structure grouping is used to compare different tissue materials and characterize their specific glycosylation features. According to the present invention differencies between relative proportions of glycan signal structure groups are used to compare different tissue material samples.
In a further aspect of the present invention, analysis of the glycan structure groups, preferentially including terminal HexNAc and/or low-mannose and optionally other groups separately or in combination, is used to differentiate between different tissue materials or different stages of tissue materials, preferentially to identify human disease and more preferentially human cancer. In a futher preferred form the present method is used to differentiate between benign and malignant tumors. According to the present invention analysis of human serum glycan groups or combinations thereof according to the present invention can be used to identify the presence of other tissue materials in blood or serum samples, more preferably to identify disease and preferably malignant cancer.
The invention is directed to analysis of present cell materials based on single or several glycans (glycome profile) of cell materials according to the invention. The analysis of multiple glycans is preferably performed by physical analysis methods such as mass spectrometry and/or NMR.
The invention is specifically directed to integrated analysis process for glycomes, such as total glycomes and cell surface glycomes. The integrated process represent various novel aspects in each part of the process. The methods are especially directed to analysis of low amounts of cells. The integrated analysis process includes
A) preferred preparation of substrate cell materials for analysis, including one or several of the methods: use of a chemical buffer solution, use of detergents, chemical reagents and/or enzymes.
B) release of glycome(s), including various subglycome type based on glycan core, charge and other structural features, use of controlled reagents in the process
C) purification of glycomes and various subglycomes from complex mixtures
D) preferred glycome analysis, including profiling methods such as mass spectrometry and/or NMR spectroscopy
E) data processing and analysis, especially comparative methods between different sample types and quantitative analysis of the glycome data.
Cell substrate material and its preparation for total and cell surface glycome analysis. The integrated glycome analysis includes preferably a cell preparation step to increase the availability of cell surface glycans. The cell preparation step preferably degrades either total cell materials or cell surface to yield a glycome for more effective glycan release. The degradation step preferably includes methods of physical degradation and/or chemical degradation. In a preferred embodiment at least one physical and one chemical degradation methods are combined, more preferably at least one physical method is combined with at least two chemical methods, even more preferably with at least three chemical methods.
The physical degration include degration by energy including thermal and/or mechanical energy directed to the cells to degrade cell structures such as heating, freezing, sonication, and pressure. The chemical degradation include use of chemicals and specific concentrations of chemicals for distruption distruption of cells preferably detergents including ionic and neutral detergents, chaotropic salts, denaturing chemicals such as urea, and non-physiological salt concentrations for distruption of the cells.
The glycome analysis according to the invention is divided to two methods including Total cell glycomes, and Cell surface glycomes. The production of Total cell glycomes involves degradation of cells by physical and/or chemical degradation methods, preferably at least by chemical methods, more preferably by physical and chemical methods. The Cell surface glycomes is preferably released from cell surface preserving cell membranes intact or as intact as possible, such methods involve preferably at least one chemical method, preferably enzymatic method. The cell surface glycomes may be alternatively released from isolated cell membranes, this method involves typically chemical and/or physical methods similarily as production of total cell glycomes, preferably at least use of detergents.
a. Total Cell Glycomes
The present invention revealed special methods for effective purification of released glycans from total cell derived materials so that free oligosaccharides can be obtained. In a preferred embodiment a total glycome is produced from a cell sample, which is degraded to form more available for release of glycans. A preferred degraded form of cells is detergent lysed cells optionally involving physical distruption of cell materials.
Preferred detergents and reaction conditions include,
a1) ionic detergents, preferably SDS type anionic detergent comprising an anionic group such as sulfate and an alkyl chain of 8-16 carbon atoms, more preferably the anionic detergent comprise 10-14 carbon atoms and it is most preferably sodium dodecyl sulfate (SDS), and/or
a2) non-ionic detergents such as alkylglycosides comprising a hexose and 4-12 carbon alkyl chain more preferably the alkyl chain comprises a hexoses being galactose, glucose, and/or mannose, more preferably glucose and/or mannose and the alkyl comprises 6-10 carbon atoms, preferably the non-ionic detergent is octylglucoside.
It is realized that various detergent combinations may be produced and optimized. The combined use of an ionic, preferably anionic, and non-ionic detergents according to the invention is especially preferred.
The preferred methods of cell degration for Total cell glycomes include physical degration including at least heat treatment heat and chemical degration by a detergent method or by a non-detergent method preferably enzymatic degradation, preferably heat treatment. Preferably two physical degradation methods are included.
A non-detergent method is preferred for avoiding detergent in later purification. The preferred non-detergent method involves physical degradation of cells preferably pressure and or by heat and a chemical degradation by protease. A preferred non-detergent method includes:
i) cell degradation by physical methods, for example by pressure methods such as by French press.
The treatment is preferably performed quickly in cold temperatures, preferably at 0-2 degrees of Celsius, and more preferably at about 0 or 1 degree of celsius and/or in the presence of glycosidase inhibitors.
ii) The degraded cells are further treated with chemical degradation, preferably by effective general protease, more preferably trypsin is used for the treatment. Preferred trypsin preparation according to the invention does not cause glycan contamination to the sample/does not contain glycans releasable under the reaction conditions.
iii) optionally the physical degradation and chemical degradation are repeated.
iv) At the end of protease treatment the sample is boiled for further denaturing the sample and the protease. The boiling is performed at temperature denaturing/degrading further the sample and the protease activity (conditions thus depend on the protease used) preferably about 100 degrees Celsius for time enough for denaturing protease activity preferably about 10-20 minutes for trypsin, more preferably about 15 minutes.
The invention is in another preferred embodiment directed to detergent based method for lysing cells. The invention includes effective methods for removal of detergents in later purification steps. The detergent methods are especially preferred for denaturing proteins, which may bind or degrade glycans, and for degrading cell membranes to increase the accessibility of intracellular glycans.
For the detergent method the cell sample is preferably a cell pellet produced at cold temperature by centrifuging cells but avoiding distruption of the cells, optionally stored frozed and melted on ice. Optionally glycosidase inhibitors are used during the process.
The method includes following steps:
i) production of cell pellet preferably by centrifugation,
ii) lysis by detergent on ice, the detergent is preferably an anionic detergent according to the invention, more preferably SDS. The concentration of the detergent is preferably between about 0.1% and 5%, more preferably between 0.5%-3%, even more preferably between 0.5-1.5% and most preferably about 1% and the detergent is SDS (or between 0.9-1.1%).
the solution is preferably produced in ultrapure water,
iii) mixing by effective degradation of cells, preferably mixing by a Vortex-mixer as physical degradation step,
iv) boiling on water bath, preferebly for 3-10 min, most preferably about 5 min (4-6 min) as second physical degradation step, it is realized that even longer boiling may be performed for example up to 30 min or 15 min, but it is not optimal because of evaporation sample
v) adding one volume of non-ionic detergent, preferably alkyl-glycoside detergent according to the invention, most preferably n-octyl-β-D-glucoside, the preferred amount of the detergent is about 5-15% as water solution, preferably about 10% of octyl-glucoside. The non-ionic detergent is especially preferred in case an enzyme sensitive to SDS, such as a N-glycosidase, is to be used in the next reaction step.
and
vi) incubation at room temperature for about 5 min to about 1-4 hours, more preferably less than half an hour, and most preferably about 15 min.
Preferably the anionic detergent and cationic detergent solutions are used in equal volumes. Preferably the solutions are about 1% SDS and about 10% octyl-glucoside. The preferred amounts of the solutions are preferably from 0.1 μl to about 2 μl, more preferably 0.15 μl to about 1.5 μl per and most preferably from 0.16 μl to 1 μl per 100 000 cells of each solution. Lower amounts of the detergents are preferred if possible for reduction of the amount of detergent in later purification, highest amounts in relation to the cell amounts are used for practical reasons with lowest volumes. It is further realized that corresponding weight amounts of the detergents may be used in volumes of about 10% to about 1000%, or from about 20% to about 500% and even more effectively in volumes from 30% to about 300% and most preferably in volumes of range from 50% to about 150% of that described. It is realized that critical micellar concentration based effects may reduce the effect of detergents at lowest concentrations.
In a preferred embodiment a practical methods using tip columns as described in the invention uses about 1-3 μl of each detergent solution, more preferably 1.5-2.5 μl, and most preferably about 2 μl of the preferred detergent solutions or corresponding detergent amounts are used for about 200 000 or less cells (preferably between 2000 and about 250 000 cells, more preferably from 50 000 to about 250 000 cells and most preferably from 100 000 to about 200 000 cells). Another practical method uses uses about 2-10 μl of each detergent solution, more preferably 4-8 μl, and most preferably about 5 μl (preferably between 4 and 6 μl and more preferably between 4.5 and 5.5 μl) of detergent solutions or corresponding amount of the detergents for lysis of cell of a cell amount from about 200 000-3 million cells (preferred more exact ranges include 200 000-3.5 million, 200 000 to 3 million and 200 000 to 2.5 million cells), preferably a fixed amount (specific amount of microliters preferably with the accuracy of at least 0.1 microliter) in a preferred range such as of 5.0 μl is used for the wider range of cells 200 000-3 million. It was invented that is possible to handle similarily wider range of materials. It is further realized that the method can be optimized so that exact amount of detergent, preferably within the ranges described, is used for exact amount of cells, such method is preferably an automized when there is possible variation in amounts of sample cells.
b. Cell Surface Glycomes
In another preferred embodiment the invention is directed to release of glycans from intact cells and analysis of released cell surface glycomes. The present invention is directed to specific buffer and enzymatic cell pre-modification conditions that would allow the efficient use of enzymes for release and optionally modification and release of glycans.
The invention is directed to various enzymatic and chemical methods to release glycomes. The release step is not needed for soluble glycomes according to the invention. The invention further revealed soluble glycome components which can be isolated from the cells using methods according to the invention.
C. Purification of glycans from cell derived materials The purification of glycome materials form cell derived molecules is a difficult task. It is especially difficult to purify glycomes to obtain picomol or low nanomol samples for glycome profiling by mass spectrometry or NMR-spectrometry. The invention is especially directed to production of material allowing quantitative analysis over a wide mass range. The invention is specifically directed to the purification of non-derivatized or reducing end derivatized glycomes according to the invention and glycomes containing specific structural characteristics according to the invention. The structural characteristics were evaluated by the preferred methods according to the invention to produce reproducible and quantitative purified glycomes.
The glycan purification method according to the present invention consists of at least one of purification options, preferably in specific combinations described below, including one or several of following the following purification process steps in varying order:
6) Precipitation-extraction;
7) Ion-exchange;
8) Hydrophobic interaction;
9) Hydrophilic interaction; and
10) Affinity to carbon materials especially graphitized carbon.
In general the purification steps may be divided to two major categories: Prepurification steps to remove major contaminations and purification steps usually directed to specific binding and optionally fractionation og glycomes
The need for prepurification depends on the type and amounts of the samples and the amounts of impurities present. Certain samples it is possible to omit all or part of the prepurification steps. The prepurification steps are aimed for removal of major non-carbohydrate impurities by separating the impurity and the glycome fraction(s) to be purified to different phases by precipitation/extraction or binding to chromatography matrix and the separating the impurities from the glycome fraction(s).
The prepurification steps include one, two or three of following major steps: Precipitation-extraction, Ion-exchange, Hydrophobic interaction.
The precipitation and/or extraction is based on the high hydrophilic nature of glycome compositions and components, which is useful for separation from different cellular components and chemicals. The prepurification ion exchange chromatography is directed to removal of classes molecules with different charge than the preferred glycome or glycome fraction to be studied. This includes removal of salt ions and aminoacids, and peptides etc. The glycome may comprise only negative charges or in more rare case also only positive charges and the same charge is selected for the chromatography matrix for removal of the impurities for the same charge without binding the glycome at prepurification.
In a preferred embodiment the invention is directed to removal of cationic impurities from glycomes glycomes containing neutral and/or negatively charged glycans. The invention is further directed to use both anion and cation exchange for removal of charged impurities from non-charged glycomes. The preferred ion exchange and cation exchange materials includes polystyrene resins such as Dowex resins.
The hydrophilic chromatography is preferably aimed for removal of hydrophobic materials such as lipids detergents and hydrophobic protein materials. The preferred hydrophobic chromatography materials includes.
It is realized that different combinations of the prepurification are usuful depending on the cell preparation and sample type. Preferred combinations of the prepurification steps include: Precipitation-extraction and Ion-exchange; Precipitation-extraction and Hydrophobic interaction; and Ion-exchange and Hydrophobic interaction. The two prepurification steps are preferably performed in the given order.
The purification steps utilize two major concepts for binding to carbohydrates and combinations thereof: a) Hydrophilic interactions and b) Ion exhange
The present invention is specifically directed to use of matrices with repeating polar groups with affinity for carbohydrates for purification of glycome materials according to the invention in processes according of the invention. The hydrophilic interaction material may include additional ion exchange properties.
The preferred hydrophilic interaction materials includes carbohydrate materials such as carbohydrate polymers in presence of non-polar organic solvents. A especially preferred hydrophilic interaction chromatography matrix is cellulose.
A specific hydrophilic interaction material includes graphitized carbon. The graphitized carbon separates non-charged carbohydrate materials based mainly on the size on the glycan. There is also possible ion exchange effects. In a preferred embodiment the invention is directed to graphitized carbon chromatography of prepurified samples after desalting and removal of detergents.
The invention is specifically directed to purification of non-derivatized glycomes and neutral glycomes by cellulose chromatography. The invention is further directed to purification of non-derivatized glycomes and neutral glycomes by graphitized carbon chromatography. In a preferred embodiment the purification according to the invention includes both cellulose and graphitized carbon chromatography.
The glycome may comprise only negative charges or in more rare case also only positive charges. At purification stage the ion exchange material is selected to contain opposite charge than the glycome or glycome fraction for binding the glycome. The invention is especially directed to the use of anion exchange materials for binding of negatively charged Preferred ion exchange materials includes ion exchange and especially anion exhange materials includes polystyrene resins such as Dowex-resins, preferably quaternary amine resins anion exchange or sulfonic acid cation exchange resins
It was further revealed that even graphitized carbon can be used for binding of negatively charged glycomes and the materials can be eluted from the carbon separately from the neutral glycomes or glycome fractions according to the invention.
The invention is specifically directed to purification of anionic glycomes by anion exchange chromatography.
The invention is specifically directed to purification of anionic glycomes by anion exchange chromatography.
The invention is further directed to purification of anionic glycomes by cellulose chromatography. The preferred anionic glycomes comprise sialic acid and/or sulfo/fosfo esters, more preferably both sialic acid and sulfo/fosfo esters. A preferred class of sulfo/fosfoester glycomes are complex type N-glycans comprising sulfate esters.
1) Precipitation-extraction may include precipitation of glycans or precipitation of contaminants away from the glycans. Preferred precipitation methods include:
2) Ion-exchange may include ion-exchange purification or enrichment of glycans or removal of contaminants away from the glycans. Preferred ion-exchange methods include:
3) Hydrophilic interaction may include purification or enrichment of glycans due to their hydrophilicity or specific adsorption to hydrophilic materials, or removal of contaminants such as salts away from the glycans. Preferred hydrophilic interaction methods include:
3. Affinity to carbon may include purification or enrichment of glycans due to their affinity or specific adsorption to specific carbon materials preferably graphitized carbon, or removal of contaminants away from the glycans. Preferred graphitized carbon affinity methods includes porous graphitized carbon chromatography.
Preferred purification methods according to the invention include combinations of one or more prepurification and/or purification steps. The preferred method include preferably at least two and more preferably at least three prepurification steps according to the invention. The preferred method include preferably at least one and more preferably at least two purification steps according to the invention. It is further realized that one prepurification step may be performed after a purification step or one purification step may be performed after a prepurification step. The method is preferably adjusted based on the amount of sample and impurities present in samples. Examples of the preferred combinations include the following combinations:
For neutral underivatized glycan purification:
A. 1. precipitation and/or extraction 2. cation exchange of contaminants, 3. hydrophobic adsorption of contaminants, and 4. hydrophilic purification, preferably by carbon, preferably graphitized carbon, and/or carbohydrate affinity purification of glycans.
B. 1. precipitation and/or extraction, 2. hydrophobic adsorption of contaminants 3. cation exchange of contaminants, 4. hydrophilic purification by carbon, preferably graphitized carbon, and/or carbohydrate affinity purification of glycans
The preferred method variants further includes preferred variants when
2) For sialylated/acidic underivatized glycan purification: The same methods are preferred but preferably both carbon and carbohydrate chromatography is performed in step 4. The carbohydrate affinity chromatography is especially preferred for acidic and/sialylated glycans. In a preferred embodiment for additional purification one or two last chromatograpy methods are repeated.
The present invention is specifically directed to detection various component in glycomes by specific methods for recognition of such components. The methods includes binding of the glycome components by specific binding agents according to the invention such as antibodies and/or enzymes, these methods peferebly include labeling or immobilization of the glycomes. For effective analysis of the glycome a large panel of the binding agents are needed. The invention is specifically directed to physicochemical profiling methods for exact analysis of glycomes. The preferred methods includes mass spectrometry and NMR-spectroscopy, which give simultaneously information of numerous components of the glycomes. In a preferred embodiment the mass spectrometry and NMR-spectroscopy methods are used in a combination.
The invention revealed methods to create reproducible and quantitative profiles of glycomes over large mass ranges and degrees of polymerization of glycans. The invention further reveals novel methods for quantitative analysis of the glycomics data produced by mass spectrometry. The invention is specifically directed to the analysis of non-derivatized or reducing end derivatized glycomes according to the invention and the glycomes containing specific structureal characteristics according of the invention.
The invention revealed effective means of comparision of glycome profiles from different cell types or tissue materials with difference in cell status or cell types. The invention is especially directed to the quantitative comparision of relative amount of individual glycan signal or groups of glycan signals described by the invention.
The invention is especially directed to
i) calculating average value and variance values of signal or signals, which have obtained from several experiments/samples and which correspond to an individual glycan or glycan group according to the invention for a first cell sample and for a second cell sample
ii) comparing these with values derived for the corresponding signal(s)
iii) optionally calculating statistic value for testing the probability of similarity of difference of the values obtained for the cell types or
estimating the similarity or difference based on the difference of the average and optionally also based on the variance values.
iv) preferably repeating the comparison one or more signals or signal groups, and further preferably performing combined statistical analysis to estimate the similarity and/or differences between the data set or estimating the difference or similarity
v) preferably use of the data for estimating the differences between the first and second cell samples indicating difference in cell status and/or cell type
The invention is further directed to combining information of several quantitative comparisons of between corresponding signals by method of
i) calculating differences between quantitative values of corresponding most different glycan signals or glycan group signals, changing negative values to corresponding positive values, optionally multiplying selected signals by selected factors to adjust the weight of the signals in the calculation
ii) adding the positive difference values to a sum value
iii) comparing the sum values as indicators of cell status or type.
It was further revealed that there is characteristic signals that are present in certain cell types according to the invention but absent or practically absent in other cell types. The invention is therefore directed to the qualitative comparison of relative amount of individual glycan signal or groups of glycan signals described by the invention and observing signals present or absent/practically absent in a cell type. The invention is further directed to selection of a cut off value used for selecting absent or practically absent signals from a mass spectrometric data, for example the preferred cut off value may be selected in range of 0-3% of relative amount, preferably the cut off value is less than 2%, or less than 1% or less than 0.5%. In a preferred embodiment the cut off value is adjusted or defined based on quality of the mass spectrum obtained, preferably based on the signal intensities and/or based on the number of signals observable.
The invention is further directed to automized qualitative and/or quantitative comparisons of data from corresponding signals from different samples by computer and computer programs prosessing glycome data produced according to the invention. The invention is further directed to raw data based analysis and neural network based learning system analysis as methods for revealing differences between the glycome data according to the invention.
The present invention is specifically directed to methods for analysis of low amounts of samples.
The invention further revealed that it is possible to use the methods according to the invention for analysis of low sample amounts. It is realized that the cell materials are scarce and difficult to obtain from natural sources. The effective analysis methods would spare important cell materials. Under certain circumstances such as in context of cell culture the materials may be available from large scale. The required sample scale depends on the relative abundancy of the characteristic components of glycome in comparision to total amount of carbohydrates. It is further realized that the amount of glycans to be measured depend on the size and glycan content of the cell type to be measured and analysis including multiple enzymatic digestions of the samples would likely require more material. The present invention revealed especially effective methods for free released glycans.
The picoscale samples comprise preferably at least about 1000 cells, more preferably at least about 50 000 cells, even more more preferably at least 100 000 cells, and most preferably at least about 500 000 cells. The invention is further directed to analysis of about 1 000 000 cells. The preferred picoscale samples contain from at least about 1000 cells to about 10 000 000 cells according to the invention. The useful range of amounts of cells is between 50 000 and 5 000 000, even more preferred range of of cells is between 100 000 and 3 000 000 cells. A preferred practical range for free oligosaccharide glycoomes is between about 500 000 and about 2 000 000 cells. It is realized that cell counting may have variation of less than 20%, more preferably 10% and most preferably 5%, depending on cell counting methods and cell sample, these variations may be used instead of term about. It is further understood that the methods according to the present invention can be upscaled to much larger amounts of material and the pico/nanoscale analysis is a specific application of the technology. The invention is specifically directed to use of microcolumn technologies according to the invention for the analysis of the preferred picoscale and low amount samples according to the invention,
The invention is specifically directed to purification to level, which would allow production of high quality mass spectrum covering the broad size range of glycans of glycome compositions according to the invention.
The present invention is especially directed to use microfluidistic methods involving low sample volumes in handling of the glycomes in low volume cell preparation, low volume glycan release and various chromatographic steps. The invention is further directed to integrated cell preparation, glycan release, and purification and analysis steps to reduce loss of material and material based contaminations. It is further realized that special cleaning of materials is required for optimal results.
The invention is directed to reactions of volume of 1-100 microliters, preferably about 2-50 microliters and even more preferably 3-20 microliters, most preferably 4-10 microliter. The most preferred reaction volumes includes 5-8 microliters+/−1 microliters. The minimum volumes are preferred to get optimally concentrated sample for purification. The amount of material depend on number of experiment in analysis and larger amounts may be produced preferably when multiple structural analysis experiments are needed.
It is realized that numerous low volume chromatographic technologies may be applied, such low volume column and for example disc based microfluidistic systems.
The inventors found that the most effective methods are microcolumns. Small colomn can be produced with desired volume. Preferred volumes of microcolumns are from about 2
Microliters to about 500 microliters, more preferably for rutine sample sizes from about 5 microliter to about 100 microliters depending on the matrix and size of the sample. Preferred microcolumn volumes for graphitised carbon, cellulose chromatography and other tip-columns are from 2 to 20 μl, more preferably from 3 to 15 μl, even more preferably from 4 to 10 μl, For the microcolumn technologies in general the samples are from about 10 000 to about million cells. The methods are useful for production of picomol amounts of total glycome mixtures from tissue materials according to the invention.
In a preferred embodiment microcolumns are produced in regular disposable usually plastic pipette tips used for example in regular “Finnpipette”-type air-piston pipettes. The pipette-tip microcolumn contain the preferred chromatographic matrix. In a preferred embodiment the microcolumn contains two chromatographic matrixes such as an anion and cation exchange matrix or a hydrophilic and hydrophobic chromatography matrix.
The pipette tips may be chosen to be a commercial tip contain a filter. In a preferred embodiment the microcolumn is produced by narrowing a thin tip from lower half so that the preferred matrix is retained in the tip. The narrowed tip is useful as the volume of filter can be omitted from washing steps
The invention is especially directed to plastic pipette tips containing the cellulose matrix, and in an other embodiment to the pipette tip microclumns when the matrix is graphitised carbon matrix. The invention is further directed to the preferred tip columns when the columns are narorrowed tips, more preferably with column volumes of 1 microliter to 100 microliters.
The invention is further directed to the use of the tip columns containing any of the preferred chromatographic matrixes according to the invention for the purification of glycomes according to the invention, more preferably matrixes for ion exchange, especially polystyrene anion exchangers and cation exchangers according to the invention; hydrophilic chromatographic matrixes according to the invention, especially carbohydrate matrixes and most cellulose matrixes.
The Binding Methods for Recognition of Structures from Cell Surfaces
Recognition of Structures from Glycome Materials and on Cell Surfaces by Binding Methods
The present invention revealed that beside the physicochemical analysis by NMR and/or mass spectrometry several methods are useful for the analysis of the structures. The invention is especially directed to two methods:
The peptides and proteins are preferably recombinant proteins or corresponding carbohydrate recognition domains derived thereof, when the proteins are selected from the group monoclonal antibody, glycosidase, glycosyl transferring enzyme, plant lectin, animal lectin or a peptide mimetic thereof, and wherein the binder includes a detectable label structure.
The present invention revealed various types of binder molecules useful for characterization of tissue materials according to the invention and more specifically the preferred cell groups and cell types according to the invention. The preferred binder molecules are classified based on the binding specificity with regard to specific structures or structural features on carbohydrates of cell surface. The preferred binders recognize specifically more than single monosaccharide residue.
It is realized that most of the current binder molecules such as all or most of the plant lectins are not optimal in their specificity and usually recognize roughly one or several monosaccharides with various linkages. Furthermore the specificities of the lectins are usually not well characterized with several glycans of human types.
The preferred high specificity binders recognize
The preferred binders includes natural human and or animal, or other proteins developed for specific recognition of glycans. The preferred high specificity binder proteins are specific antibodies preferably monoclonal antibodies; lectins, preferably mammalian or animal lectins; or specific glycosyltransferring enzymes more preferably glycosidase type enzymes, glycosyltransferases or transglycosylating enzymes.
Combination of Terminal Structures in Combination with Specific Glycan Core Structures
It is realized that part of the structural elements are specifically associated with specific glycan core structure. The recognition of terminal structures linked to specific core structures are especially preferred, such high specificity reagents have capacity of recognition almost complete individual glycans to the level of physicochemical characterization according to the invention. For example many specific mannose structures according to the invention are in general quite characteristic for N-glycan glycomes according to the invention. The present invention is especially directed to recognition terminal epitopes.
The present invention revealed that there are certain common structural features on several glycan types and that it is possible to recognize certain common epitopes on different glycan structures by specific reagents when specificity of the reagent is limited to the terminal without specificity for the core structure. The invention especially revealed characteristic terminal features for specific cell types according to the invention. The invention realized that the common epitopes increase the effect of the recognition. The common terminal structures are especially useful for recognition in the context with possible other cell types or material, which do not contain the common terminal structure in substantial amount.
The present invention is directed to recognition of oligosaccharide sequences comprising specific terminal monosaccharide types, optionally further including a specific core structure. The preferred oligosaccharide sequences classified based on the terminal monosaccharide structures.
1. Structures with Terminal Mannose Monosaccharide
Preferred mannose-type target structures have been specifically classified by the invention. These include various types of high and low-mannose structures and hybrid type structures according to the invention.
preferred for recognition of terminal mannose structures includes mannose-monosaccharide binding plant lectins.
include
i) Specific mannose residue releasing enzymes such as linkage specific mannosidases, more preferably an α-mannosidase or β-mannosidase.
Preferred α-mannosidases includes linkage specific α-mannosidases such as α-Mannosidases cleaving preferably non-reducing end terminal
α2-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα2-structures; or
α6-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα6-structures;
Preferred β-mannosidases includes β-mannosidases capable of cleaving β4-linked mannose from non-reducing end terminal of N-glycan core Manβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes.
ii) Specific binding proteins recognizing preferred mannose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins. The invention is directed to antibodies recognizing MS2B1 and more preferably MS3B2-structures
2. Structures with Terminal Gal-Monosaccharide
Preferred galactose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Preferred for recognition of terminal galactose structures includes plant lectins such as ricin lectin (ricinus communis agglutinin RCA), and peanut lectin(/agglutinin PNA).
i) Specific galactose residue releasing enzymes such as linkage specific galactosidases, more preferably α-galactosidase or β-galactosidase.
Preferred α-galactosidases include linkage galactosidases capable of cleaving GalαβGal-structures revealed from specific cell preparations
Preferred β-galactosidases includes β-galactosidases capable of cleaving
β4-linked galactose from non-reducing end terminal Galβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes and
β3-linked galactose from non-reducing end terminal Galβ3GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes
ii) Specific binding proteins recognizing preferred galactose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as galectins.
3. Structures with Terminal GalNAc-Monosaccharide
Preferred GalNAc-type target structures have been specifically revealed by the invention. These include especially LacdiNAc, GalNAcPGlcNAc-type structures according to the invention.
Several plant lectins has been reported for recognition of terminal GalNAc. It is realized that some GalNAc-recognizing lectins may be selected for low specificity reconition of the preferred LacdiNAc-structures.
i) The invention revealed that β-linked GalNAc can be recognized by specific β-N-acetylhexosaminidase enzyme in combination with β-N-acetylhexosaminidase enzyme. This combination indicates the terminal monosaccharide and at least part of the linkage structure.
Preferred β-N-acetylehexosaminidase, includes enzyme capable of cleaving β-linked GalNAc from non-reducing end terminal GalNAcβ4/3-structures without cleaving α-linked HexNAc in the glycomes; preferred N-acetylglucosaminidases include enzyme capable of cleaving β-linked GlcNAc but not GalNAc.
ii) Specific binding proteins recognizing preferred GalNAcβ4, more preferably GalNAcβ4GlcNAc, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins, and a special plant lectin WFA (Wisteria floribunda agglutinin).
4. Structures with Terminal GlcNAc-Monosaccharide
Preferred GlcNAc-type target structures have been specifically revealed by the invention. These include especially GlcNAcβ-type structures according to the invention.
Several plant lectins has been reported for recognition of terminal GlcNAc. It is realized that some GlcNAc-recognizing lectins may be selected for low specificity reconition of the preferred GlcNAc-structures.
Preferred β-N-acetylglucosaminidase includes enzyme capable of cleaving β-linked GlcNAc from non-reducing end terminal GlcNAcβ2/3/6-structures without cleaving β-linked GalNAc or α-linked HexNAc in the glycomes;
Preferred fucose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Preferred for recognition of terminal fucose structures includes fucose monosaccharide binding plant lectins. Lectins of Ulex europeaus and Lotus tetragonolobus has been reported to recognize for example terminal Fucoses with some specificity binding for α2-linked structures, and branching α3-fucose, respectively.
i) Specific fucose residue releasing enzymes such as linkage fucosidases, more preferably α-fucosidase.
Preferred α-fucosidases include linkage fucosidases capable of cleaving Fucα2Gal-, and Galβ4/3(Fucα3/4)GlcNAc-structures revealed from specific cell preparations.
ii) Specific binding proteins recognizing preferred fucose structures according to the invention.
The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as Lewis x, Galβ4(Fucα3)GlcNAc, and sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc.
The preferred antibodies includes antibodies recognizing specifically Lewis type structures such as Lewis x, and sialyl-Lewis x. More preferably the Lewis x-antibody is not classic SSEA-1 antibody, but the antibody recognizes specific protein linked Lewis x structures such as Galβ4(Fucα3)GlcNAcβ2Manα-linked to N-glycan core.
6. Structures with Terminal Sialic Acid-Monosaccharide
Preferred sialic acid-type target structures have been specifically classified by the invention.
Preferred for recognition of terminal sialic acid structures includes sialic acid monosaccharide binding plant lectins.
i) Specific sialic acid residue releasing enzymes such as linkage sialidases, more preferably α-sialidases.
Preferred α-sialidases include linkage sialidases capable of cleaving SAα3Gal- and SAα6Gal structures revealed from specific cell preparations by the invention.
Preferred lectins, with linkage specificity include the lectins, that are specific for SAα3Gal-structures, preferably being Maackia amurensis lectin and/or lectins specific for SAα6Gal-structures, preferably being Sambucus nigra agglutinin.
ii) Specific binding proteins recognizing preferred sialic acid oligosaccharide sequence structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc or sialic acid recognizing Siglec-proteins.
The preferred antibodies includes antibodies recognizing specifically sialyl-N-acetyllactosamines, and sialyl-Lewis x.
Preferred antibodies for NeuGc-structures includes antibodies recognizes a structure NeuGca3Galβ4Glc(NAc)0 or 1 and/or GalNAcβ4[NeuGcα3]Galβ4Glc(NAc)0 or 1, wherein [ ] indicates branch in the structure and ( )0 or 1 a structure being either present or absent. In a preferred embodiment the invention is directed recognition of the N-glycolyl-Neuraminic acid structures by antibody, preferably by a monoclonal antibody or human/humanized monoclonal antibody. A preferred antibody contains the variable domains of P3-antibody.
The present invention is specifically directed to the binding of the structures according to the present invention, when the binder is conjugated with “a label structure”. The label structure means a molecule observable in a assay such as for example a fluorescent molecule, a radioactive molecule, a detectable enzyme such as horse radish peroxidase or biotin/streptavidin/avidin. When the labelled binding molecule is contacted with the cells or tissue materials according to the invention, the cells can be monitored, observed and/or sorted based on the presence of the label on the cell surface. Monitoring and observation may occur by regular methods for observing labels such as fluorescence measuring devices, microscopes, scintillation counters and other devices for measuring radioactivity.
The invention is specifically directed to use of the binders and their labelled cojugates for sorting or selecting cells from biological materials or samples including cell materials comprising other cell types. The preferred cell types includes cultivated cells and associated cells such as feeder cells. The labels can be used for sorting cell types according to invention from other similar cells. In another embodiment the cells are sorted from different cell types such as blood cells or in context of cultured cells preferably feeder cells, for example in context of complex cell cultures corresponding feeder cells such as human or mouse feeder cells. A preferred cell sorting method is FACS sorting. Another sorting methods utilized immobilized binder structures and removal of unbound cells for separation of bound and unbound cells.
In a preferred embodiment the binder structure is conjugated to a solid phase. The cells are contacted with the solid phase, and part of the material is bound to surface. This method may be used to separation of cells and analysis of cell surface structures, or study cell biological changes of cells due to immobilization. In the analytics involving method the cells are preferably tagged with or labelled with a reagent for the detection of the cells bound to the solid phase through a binder structure on the solid phase. The methods preferably further include one or more steps of washing to remove unbound cells.
Preferred solid phases include cell suitable plastic materials used in contacting cells such as cell cultivation bottles, petri dishes and microtiter wells; fermentor surface materials
mRNA Corresponding to Glycosylation Enzymes
The present invention is further directed to correlation of specific messenger mRNA molecules with the preferred glycan structures according to the present invention. It is realized that glycosylation can be controlled in multiple levels and one of them is transcription. The presence of glycosylated structures may in some case correlate with mRNAs involved in the synthesis of the structures.
The present invention is especially directed to analysis of mRNA-species having correlation with expressed fucosylated glycan structures and “terminal HexNAc” containing structures preferred according to the present invention. The preferred mRNA-species includes mRNA corresponding to fucosyltransferases and N-acetylglucosaminyltransferases.
The present invention is directed to analysis of released glycomes by spectrometric method useful for characterization of the glycomes from tissue specimens or cells. The invention is directed to NMR spectroscopic analysis of the mixtures of released glycans.
The invention is especially directed to methods of producing NMR from specific subglycomes, preferably N-linked glycome, O-linked glycome, glycosaminoglycan glycome and/or glycolipid glycome. The NMR-profiling according to the invention is further directed to the analysis of the novel and rare structure groups revealed from cell glycomes according to the invention. The general information about complex cell glycome material directed NMR-methods are limited.
Preferably the NMR-analysis is performed from an isolated subglycome. The preferred isolated subglycomes include acidic glycomes and neutral glycomes.
It is realized that numerous methods have been desribed for purification of oligosaccharide mixtures useful for NMR from various materials, including usually purified individual proteins. It is realized that present methods are useful for NMR-profiling even for larger tissue specimens or higher amounts of cells according to the invention, especially in combination with NMR-profiling according to the invention and/or when directed to the analysis specific and preferred structure groups according to the invention. The preferred purification methods are effective and form an optimised process for purification of glycomes from even larger amounts of cells and tissues than described for nanoscale methods below. The methods are preferred also for any larger amount of cells.
Moreover, when purification methods for larger amounts of carbohydrate materials exists, but very low and complex carbohydrate materials with very complex impurities such as cell-derived materials have been less studied as low amounts, especially when purity useful for NMR-analysis is needed.
The invention is directed to analysis of NMR-samples that can be produced from very low amounts of cells according to the invention. Preferred sample amounts of cells or corresponding amount of tissue material for a one-dimensional proton-NMR profiling are from about 2 million to 100 million cells, more preferably 10-50 million cells. It is further realized that good quality NMR data can be obtained from samples containing at least about 10-20 million cells.
The preferred analysis methods is directed to high resolution NMR observing oligosaccharide/saccharide conjugate mixture from an amount of at least 4 nmol, more preferably at least 1 nmol and the cell amount yielding the preferred amount of saccharide mixture. For nanoscale analysis according to the invention cell material is selected so that it would yield at least about 50 nmol of oligosaccharide mixture, more preferably at least about 5 nmol and most preferably at least about 1 nmol of oligosaccharide mixture. Preferred amounts of major components in glycomes to be observed effectively by the methods according to the invention include yield at least about 10 nmol of oligosaccharide component, more preferably at least about 1 nmol and most preferably at least about 0.2 nmol of oligosaccharide component.
The preferred cell amount for analysis of a subglycome from a cell type is preferably optimised by measuring the amounts of glycans produced from the cell amounts of preferred ranges.
It is realized that depending on the cell and subglycome type the required yield of glycans and the heterogeneity of the materials vary yielding different amounts of major components.
For the production of sample for nanoscale NMR, the methods described for preparation of cell samples and release of oligosaccharides for mass spectrometric profiling according to the invention may be applied.
For the purification of sample for nanoscale NMR the methods described for purification mass spectrometry profiling samples according to the invention may be applied.
The preferred purification method for nanoscale NMR-profiling according to the invention include following general purification process steps:
The more preferred purification process includes precipitation/extraction aimed for removal of major non-carbohydrate impurities by separating the impurity and the glycome fraction(s) to be purified to different phases. Hydrophobic interaction step aims to purify the glycome components from more hydrophobic impurities as these are bound to hydrophobic chromatography matrix and the glycome components are not retained. Chromatography on graphitized carbon may include purification or enrichment of glycans due to their affinity or specific adsorption to graphitized carbon, or removal of contaminants from the glycans. The glycome components obtained by the aforementioned steps are then subjected to gel filtration chromatography, separating molecules according to their hydrodynamic volume, i.e. size in solution. The gel filtration chromatography step allows detection and quantitation of glycome components by absorption at low wavelengths (205-214 nm).
The most preferred purification process includes precipitation/extraction and hydrophobic interaction steps aimed for removal of major non-carbohydrate impurities and more hydrophobic impurities. Chromatography on graphitized carbon is used for removal of contaminants from the glycans, and to divide the glycome components to fractions of neutral glycome components and acidic glycome components. The neutral and acidic glycome component fractions are then subjected to gel filtration chromatography, which separates molecules according to their size. Preferably, a high-performance liquid chromatography (HPLC) type gel filtration column is used. The neutral glycome component fraction is preferebly chromatographed in water and the acidic glycome component fraction is chromatographed in 50-200 mM aqueous ammonium bicarbonate solution. Fractions are collected and evaporated prior to further analyses. The gel filtration chromatography step allows detection and quantitation of glycome components by absorption at low wavelengths (205-214 nm). Quantitation is performed against external standards. The standards are preferably N-acetylglucosamine, N-acetylneuraminic acid, or oligosaccharides containing the same. Fractions showing absorbance are subjected to MALDI-TOF mass spectrometry. Preferably, the neutral glycome components are analyzed in the positive-ion mode and the acidic glycome components in the negative-ion mode in a delayed-extraction MALDI-TOF mass spectrometer.
The fractionation can be used to enrich components of low abundance. It is realized that enrichment would enhance the detection of rare components. The fractionation methods may be used for larger amounts of cell material. In a preferred embodiment the glycome is fractionated based on the molecular weight, charge or binding to carbohydrate binding agents such as lectins and/or other binding agents according to the invention.
These methods have been found useful for specific analysis of specific subglycomes and enrichment more rare components. The present invention is in a preferred embodiment directed to charge based separation of neutral and acidic glycans. This method gives for analysis method, preferably mass spectroscopy material of reduced complexity and it is useful for analysis as neutral molecules in positive mode mass spectrometry and negative mode mass spectrometry for acidic glycans.
It is realized that preferred amounts of enriched glycome oligosacccharide mixtures and major component comprising fractions can be produced from larger glycome preparations.
In a preferred embodiment the invention is directed to size based fractionation methods for effective analysis of preferred classes of glycans in glycomes. The invention is especially directed to analysis of lower abundance components with lower and higher molecular weight than the glycomes according to the invention. The preferred method for size based fractionation is gel filtration. The invention is especially directed to analysis of enriched group glycans of N-linked glycomes preferably including lower molecular weight fraction including low-mannose glycans, and one or several preferred low mannose glycan groups according to the invention.
In a preferred embodiment the NMR-analysis of the glycome is one-dimensional proton-NMR analysis showing structural reporter groups of the major components in the glycome. The invention is further directed to specific two- and multidimensional NMR-experiments of the glycomes when enough sample is available. It is realized that two-dimensional NMR-experiments require about a ten-fold increase in sample amount compared to proton-NMR analyses.
The present invention is further directed to combination of the mass spectrometric and NMR glycome analyses. The preferred method include production of any mass spectrometric profile from any glycome according to the invention from a cell sample according to the invention, optionally characterizing the glycome by other methods like glycosidase digestion, fragmentation mass spectrometry, specific binding agents, and production of NMR-profile from the same sample glycome or glycomes to compare these profiles.
As described in the invention the over expressed glycan structures in cancers are useful therapeutic targets, especially when the structures are not present in normal tissues. The invention is directed to production of therapeutic cancer binding antibodies and anti-cancer immune response inducing cancer vaccines. It is realized that the glycaconjugates of invention for cancer vaccines and/or antigens can be produced by multiple synthetic and or biosynthetic methods known in the art.
The invention is especially directed to the biotechnical production of cancer vaccine/antigen comprising target cancer glycans. Preferred biotechnical methods for the production of the vaccine by purification from a natural source. Preferred sources of cancer vaccines includes natural glycoproteins obtainable from pharamaceutically acceptable source, preferably the glycoprotein is an eucaryotic glycoprotein.
A preferred example of a natural glycoprotein is KLH (keyhole limpet hemocyanin). In a preferred embodiment the invention is directed to alteration of the KLH to more cancer vaccine like structure by removing non-human type β(6)-Gal-structures from KLH. The KLH thus obtained contains increased amount of low mannose antigen/vaccine structures according to the invention. It is further realized that the preferred antigenicity is more effective in the low-mannose enriched KLH-materials as the non-human structure would otherwise direct immune system to non-useful reactions. In another preferred embodiment the amount of N-glycand core fucose is specifically reduced (preferably by fucosidase or acid hydrolysis) or increased by enzymatic fucosylation. It is well-known protein for immunization. Suitable amounts of KLH for use as an immunoadjuvant in cancers are well-known in the art and the invention is especially directed to the use of similar amounts of glycan antigen enriched molecules for cancer treatment.
Recombinant Proteins from Lower Eukayots
It is further realized that multiple recombinant protein host cell systems such as yeast and fungal cells can be engineered for production specific low-Mannnose and/or high-Mannose structres (described e.g. in glycoprotein production patents of Kirin Bier Japan, Glycofi US and prof Roland Contreras Belgium) and other structures preferably core II type O-glycans, for O-glycans mammalian expression systems are currently more feasible.
The present invention is directed to the methods of clinical evaluation of effect of immunization by glycoproteins/peptides enriched with the preferred cancer antigens according to the invention for patients with one or several types of cancers containing the structures.
In a specific embodiment the invention is directed to analysis of secretory proteins at least partially derived from cancer such as serum proteins from cancer. As an example the inventors analyzed glycosyaltion of a commercial CA15-3 cancer antigen sample (Calbiochem, USA, prod number 209915, cancer associated breast tumor antigen from tumor fluids). The invention revealed the preferred neutral and sialylated core II glycans, especially fucosylated core II glycans, with substantial expression sialyl-Lewis x core II structure and even polylactosame elongated sialylated and fucosylated structures according to the invention.
The data indicates that the common and useful cancer marker CA15-3 for breast cancer is actually a core II O-glycan structure, more preferably sialylated and/or fucosylated, preferably sialylated and at least partially fucosylated core two O-glycan epitope, when the CA15-3 antigen is derived from fluids of human breast tumors. The current antigen preparations are heterogenous human derived materials which are very very difficult to standardize due to heterogeneity of the material further yielding heterogeneity in the characterization of heterogenous natural antibodies. The present invention is directed to novel synthetic CA15-3 standard, which comprise all or part of the preferred core II CA15-3 glycans, preferably linked to a polypeptide corresponding a part of CA15-3 protein (which has been considered as MUC1 mucin). The present invention is directed standardization methods, assaying the synthetic structures with regard to binding of known CA15-3 antibodies and analysis of the binding specificties of the antibodies with regard to the glycan structures. Based on the analysis one or several glycan structures according to the invention, preferably in a peptide linked form, is/are selected as synthetic CA15-3 antigen(s) or antigen types to be used for more exact cancer analysis, especially breast cancer analysis. The invention is further directed to production of recombinant CA15.3 antibody or antibody mixtures with specificities characterized with the standard glycan structures, a preferred such antibody composition would comprise. The invention is preferably directed to a novel recombinant CA15-3 antibody compositions comprising a core II slex-binding antibody, such as preferred antibody according to the invention. It is realized that the synthetic antigen can be adjusted with regard to individual cancer type specific variations of the glycan antigen structures.
The invention is further directed to analysis of the preferred glycan structures in context of cancer sample according to the invention, preferably a core II glycan on serum mucin, when the carrier protein is first purified or bound by an antibody specific for the carrier protein and then the complex is analysed by the specific glycan expression levels by other antibody or antibodies recognizing the glycan structures. The analysis may be performed by well-known sandwich ELISA type assays in which the first antibody is preferably immobilized and the second antibody is linked with a detectable label such as a fluorescent label or radiolabel; or for example by a fluorescence energy transfer analysis (FRET) when both antibodies (or other corresponding binders) are labeled with suitable fluorescent labels.
The present invention revealed a novel quantitative synthesis and analysis method for quantitative periodate(/vicinal hydroxyl) oxidation and reduction of a protein linked glycan. Under the preferred low temperature conditions and elongated reaction time as shown in Example illustrated by
As described in the Examples, especially in Example 20, the inventors detected the presence and/or altered expression levels of the glycans widely in human cancerous tissues, e.g. in following cancer types: lung cancer, both small cell lung adenocarcinoma and non-small cell lung adenocarcinoma, and lung carcinoma liver metastases; breast cancer; ductale type breast adenocarcinoma and lymph node metastases thereof; lobulare type breast adenocarcinoma and lymph node metastases thereof; ovarian cystadenocarcinoma; colon cancer/carcinoma, carcinoma adenomatosum, and liver metastases thereof; kidney cancer/carcinoma, and kidney hypernephroma; gastric cancer/carcinoma, and lymph node metastases thereof, liver cancer/carcinoma; larynx cancer/carcinoma; pancreas cancer/carcinoma; melanoma and liver metastases thereof; gall bladder cancer/carcinoma, and liver metastases thereof; salivary gland cancer/carcinoma, and skin metastases thereof; and lymph node cancer/carcinoma (lymphoma). The present invention is especially directed to uses of identified glycan structures according to the present invention in these preferred cancer types.
The present invention is specifically directed to methods studying metastasis-associated, metastasis-specific, metastasis-enriched, and/or metastasis-inducing glycans according to the invention. As described in Examples 20-23, the inventors identified metastasis-associated and metastasis-enriched glycans in various human cancer types, including mannosylated glycans, preferentially low-mannose type glycans, O-glycans, non-reducing terminal HexNAc glycans, preferentially non-reducing terminal GlcNAc glycans, blood group antigen related glycans, and glycans with +80 Da units in their monosaccharide compositions, preferentially sulphated and/or phosphorylated glycans; in different specific combinations according to both primary cancer type and site of metastasis.
The present invention is especially directed to methods for identifying metastasis-associated glycans according to the invention. The present invention is further directed to methods for identifying primary cancer type and/or site of metastasis based on identifying the glycans according to the invention. In a further embodiment, the present invention is directed to methods for studying metastasis growth and/or initiation based on identifying the glycans according to the invention.
The present invention is also directed to methods for studying the biological function(s) of the metastasis-associated glycans according to the invention, preferentially in context of metastasis formation and cancer malignancy, more preferentially mechanisms of cancer spreading and migration. It is realized that metastasis-associated glycans are potentially immunologically active, and the present invention is further directed to methods for studying immunological properties of cancer, more preferentially metastasing cancer, based on the glycans according to the invention. The invention is also directed to methods for using biological models to study cancer, more preferentially metastasing cancer, in context of using metastasis-associated glycans according to the present invention, more preferentially including modulation of the glycans or using inhibiting glycans or their analogs.
It is further realized that cancer metastasis is directly related to cancer malignancy. The present invention is specifically directed to methods for analyzing cancer malignancy based on use of the metastasis-associated glycans according to the present invention.
The present invention is further illustrated in Examples, which in no way are intended to limit the scope of the invention.
Isolation of Glycans from Formalin-Fixed and Paraffin-Embedded Tissue Samples.
Prior to glycan isolation from formalin-fixed and paraffin-embedded samples, the samples were deparaffinised. Glycans were detached from sample glycoproteins by non-reductive β-elimination essentially as described previously (Huang et al., 2001) and purified and analyzed essentially as described in Examples 11 and 12.
MALDI-TOF MS.
MALDI-TOF mass spectrometry was performed with a Voyager-DE STR BioSpectrometry Workstation, essentially as described previously (Saarinen et al., 1999; Harvey et al., 1993).
Exoglycosidase Digestions.
All exoglycosidase reactions were performed essentially as described previously (Nyman et al., 1998; Saarinen et al., 1999) and analysed by MALDI-TOF MS. The enzymes and their specific control reactions with characterised oligosaccharides were (R denotes reducing end oligosaccharide sequences in the following examples): β1,4-galactosidase (Streptococcus pneumoniae, recombinant, E. coli; Calbiochem, USA) digested Galβ1-4GlcNAc-R but not Galβ1-3GlcNAc-R; β-N-acetylglucosaminidase (Streptococcus pneumoniae, recombinant, E. coli; Calbiochem, USA) digested GlcNAcβ1-6Gal-R in β1,4-galactosidase treated lacto-N-hexaose but not GalNAcβ1-4GlcNAcβ1-3/6Gal-R in a synthetic oligosaccharide; α-mannosidase (Jack bean; Glyko, UK) transformed a mixture of high-mannose N-glycans to the Man1GlcNAc2 N-glycan core trisaccharide; β-mannosidase (Helix pomatia; Calbiochem, USA) digested the β1,4-linked mannose residue from the N-glycan core trisaccharide Manβ4GlcNAcβ4GlcNAc, without affecting the α-linked mannose residues of high-mannose N-glycans. Control digestions were performed in parallel and analysed similarly to the analytical exoglycosidase reactions. Endoglycosidase digestions were performed essentially as described previously (Plummer & Tarentino, 1991), and the reaction products were analyzed by MALDI-TOF MS after purification. The specific N-glycosidase F1 control oligosaccharides and the control reactions were as follows: the enzyme transformed all Hex5-9HexNAc2 oligosaccharides in a sample of high-mannose N-glycans into Hex5-9HexNAc1 oligosaccharides; in contrast, the enzyme was not able to digest a core-fucosylated N-glycan, namely the hexasaccharide Manα6(Manα3)Manβ4GlcNAcβ4(Fucα6)GlcNAc.
Glycan Isolation and Analysis.
Detached and purified glycans from paraffin-embedded formalin-fixed tissue samples from cancer patients were analysed by MALDI-TOF mass spectrometry after isolation of the neutral glycan fraction. Relative quantification of the glycans were done by comparing relative MALDI-TOF MS signal intensities, which is accurate for the obtained mixtures of purified glycans (Saarinen et al., 1999; Harvey et al., 1993).
Specific Mannosidase Digestion Analyses.
The proportions of the non-reducing terminal α-mannose containing glycans were determined by their sensitivity towards hydrolysis with α-mannosidase from Jack beans. After the specific exoglycosidase digestion, the presence of high-mannose and low-mannose type glycans in the original sample can be deduced by their disappearance from the recorded mass spectra, and the simultaneous increase in signal intensities of the expected reaction products at m/z 609 and 755, which correspond to the sodium adduct ions [Hex1HexNAc2+Na]+ (calc. m/z 609.21) and [Hex1HexNAc2dHex1+Na]+ (calc. m/z 755.27), respectively. An example of the reaction scheme is presented in
The Hex1HexNAc2dHex0-1 components were studied with specific β-mannosidase digestion both before and after α-mannosidase digestion. The results are summarized in Table 1. The results indicate that 1) the original components contain variable amounts of non-reducing terminal β-mannose containing oligosaccharides with the compositions Hex1HexNAc2dHex0-1, and 2) the major α-mannosidase digestion products with the compositions Hex1HexNAc2dHex0-1 have non-reducing terminal β-mannose residues, as they are susceptible towards digestion with β-mannosidase enzyme.
N-Glycosidase Digestion Analyses.
The assignment of the majority of the Hex1-9HexNAc2 and Hex1-5HexNAc2dHex1 glycan components as low-mannose and high-mannose type N-glycans was confirmed by their isolation and digestion analysis by specific endoglycosidase enzymes, namely N-glycosidase F and N-glycosidase F1 from Chryseobacterium meningosepticum. The first series of experiments was done with glycan samples isolated from a lung tumor of a patient with non-small cell lung adenocarcinoma. In addition to chemical detachment, the glycans in question could also be isolated by N-glycosidase F digestion, indicating that they are N-glycans. However, all Hex1-9HexNAc2 components, but not any of the Hex1-5HexNAc2dHex1 components, could be digested with N-glycosidase F1, resulting in transformation of the first glycan group into peaks with masses of one less HexNAc residue with monosaccharide compositions Hex1-9HexNAc1. In combination with the α- and β-mannosidase digestion results, these experiments indicate that all the components are N-glycans that have the chitobiose disaccharide sequence in their reducing end, and that the latter components have a dHex residue linked to the reducing terminal GlcNAc. Furthermore, as the latter components are susceptible to digestion with N-glycosidase F, they must have a reducing terminal sequence dHex-6(GlcNAcβ1-4)GlcNAc.
Chemical Analyses.
In individual samples, the Hex1-9HexNAc2 and Hex1-5HexNAc2dHex1 components were also studied with periodate oxidation, subsequent reduction with alkaline sodium borohydride, and MALDI-TOF mass spectrometry. Also post-source decay (PSD) MALDI-TOF mass spectrometric analyses were performed to specific components of the structure group. The results from these analyses support the structural features deduced from the experiments described above. Periodate reaction cleaves structures with vicinal hydroxyl groups including non-reducing terminal monosaccharides, and isomeric structures differ from each other in this reaction. The data is in accordance with low-mannose and high-mannose type N-glycans produced by regular biosynthesis.
More specifically, cancer cell N-glycans with the compositions Hex2HexNAc2 and Hex2HexNAc2dHex1, were studied with periodate oxidation, subsequent borohydride reduction, and MALDI-TOF MS. From the Hex2HexNAc2 component at m/z 771.59 (calc. m/z 771.26 for the ion [Hex2HexNAc2+Na]+), two product ions were observed after the reaction at m/z 717.34 (calc. m/z 717.29 for the ion [Hex2HexNAc2-C2O2+H2+Na]+) and 745.34 (calc. m/z 745.29 for the ion [Hex2HexNAc2-C1O1+H2+Na]+), which can result from Manα6Manβ4GlcNAcβ4GlcNAc and Manα3Manβ4GlcNAcβ4GlcNAc N-glycan oligosaccharide isomers, respectively. These products occurred in respective relative amounts of approximately 60% and 40%, indicating that the original sample contained nearly equal amounts of the both α-mannose linkage isomers. From the Hex2HexNAc2dHex1 component at m/z 917.63 (calc. m/z 917.32 for the ion [Hex2HexNAc2dHex1+Na]+), two product ions were observed after the reaction at m/z 835.41 (calc. m/z 835.35 for the ion [Hex2HexNAc2dHex1-C3O3+H2+Na]+) and 863.42 (calc. m/z 863.35 for the ion [Hex2HexNAc2dHex1-C2O2+H2+Na]+), which can result from Manα6Manβ4GlcNAcβ4(Fucα6)GlcNAc and Manα3Manβ4GlcNAcβ4(Fucα6)GlcNAc N-glycan oligosaccharide isomers, respectively. These components occurred in relative amounts of approximately 80% and 20%, respectively, indicating that the original sample contained significantly more of the isomer containing α6-linked mannose, but that the both isomers were present in the sample.
Taken together, all the experiments suggest that the terminal mannose-containing oligosaccharides, which have monosaccharide compositions Hex1-9HexNAc2 and Hex1-5HexNAc2dHex1, include the structures presented in
Exoglycosidase Digestions.
All exoglycosidase reactions were performed essentially as described previously (Nyman et al., 1998; Saarinen et al., 1999) and analysed by MALDI-TOF MS. The enzymes and their specific control reactions with characterised oligosaccharides were (R denotes reducing end oligosaccharide sequences in the following examples): β1,4-galactosidase (Streptococcus pneumoniae, recombinant, E. coli; Calbiochem, USA) digested Galβ1-4GlcNAc-R but not Galβ1-3GlcNAc-R; α-mannosidase (Jack bean; Glyko, UK) transformed a mixture of high-mannose N-glycans to the Man1GlcNAc2 N-glycan core trisaccharide; recombinant β1,3-galactosidase (Calbiochem, USA) digested Galβ3GlcNAc-R but not Galβ4GlcNAc-R; α3/4-fucosidase (Xanthomonas sp.; Calbiochem, USA) digested Galβ4(Fucα3)GlcNAcβ3Galβ4Glc but not Fucα2Galβ3GlcNAcβ3(Galβ4GlcNAcβ6)Galβ4Glc. Control digestions were performed in parallel and analysed similarly to the analytical exoglycosidase reactions.
Glycan Isolation and Analysis.
Detached and purified glycans from paraffin-embedded formalin-fixed tissue samples from cancer patients were analysed by MALDI-TOF mass spectrometry after isolation of the neutral glycan fraction. Relative quantification of the glycans were done by comparing relative MALDI-TOF MS signal intensities, which is accurate for the obtained mixtures of purified glycans (Saarinen et al., 1999; Harvey et al., 1993).
Specific Mannosidase Digestion Analyses.
The proportions of the non-reducing terminal α-mannose containing glycans in the samples were determined by their sensitivity towards hydrolysis with α-mannosidase from Jack beans. After the specific exoglycosidase digestion, the presence of terminal α-mannose residues containing glycans in the original samples could be deduced by their disappearance from the recorded mass spectra, and the simultaneous increase in signal intensities of the expected reaction products at m/z 609 and 755, which correspond to the sodium adduct ions [Hex1HexNAc2+Na]+ (calc. m/z 609.21) and [Hex1HexNAc2dHex1+Na]+ (calc. m/z 755.27), respectively. The glycans that resisted digestion with mannosidases, were studied further.
Specific Galactosidase and Fucosidase Digestion Analyses.
The structural features of the mannosidase-resistant glycans in the samples were investigated by digestion with S. pneumoniae β1,4-galactosidase, recombinant β1,3-galactosidase, and α3/4-fucosidase. An example of the reaction scheme is presented in
N-Glycosidase Digestion Analyses.
The assignment of the majority of the Hex1-9HexNAc2 and Hex1-5HexNAc2dHex1 glycan components as low-mannose and high-mannose type N-glycans was confirmed by their isolation and digestion analysis by a specific endoglycosidase enzyme, namely N-glycosidase F from Chryseobacterium meningosepticum. In combination with the α- and β-mannosidase digestion results, these experiments indicate that the mannosidase-resistant components at m/z 771 and 917, contain glycan species that are not N-glycans.
The presence of a glycan fragment at m/z 608 in the tissue samples corresponds to a structure, in which a Hex1HexNAc1 unit is linked to the 6-position of an O-glycan core GalNAc fragment. The presence of the m/z 608 peaks in the tissue samples indicates that part of the O-glycan structures may contain the Core 2 O-glycan structure. However, Core 1 O-glycan structures may also be present in the samples. Specific β1,3-galactosidase experiments can be used to reveal the relative proportions of these structures in the samples. Taken together, all the experiments suggest that the mannosidase-resistant oligosaccharides that have monosaccharide compositions Hex2HexNAc2 and Hex2HexNAc2dHex1, include the structures presented in
Exoglycosidase Digestions.
All exoglycosidase reactions were performed essentially as described previously (Saarinen et al., 1999) and analysed by MALDI-TOF MS. The enzymes and their specific control reactions with characterised oligosaccharides were (R denotes reducing end oligosaccharide sequences in the following examples): β1,4-galactosidase (Streptococcus pneumoniae, recombinant, E. coli; Calbiochem, USA) digested Galβ4GlcNAc-R but not Galβ3GlcNAc-R; Arthrobacter ureafaciens neuraminidase (Calbiochem, USA) digested both Neu5Acα3Galβ4GlcNAc-R and Neu5Acα6Galβ4GlcNAc-R; Streptococcus pneumoniae α2,3-sialidase (Calbiochem, USA) digested Neu5Acα3Galβ4GlcNAc-R but not Neu5Acα6Galβ4GlcNAc-R. Control digestions were performed in parallel and analysed similarly to the analytical exoglycosidase reactions.
Chemical Modification Reactions.
Mild acid hydrolysis of sialic acid residues was performed with 50 mM trifluoroacetic acid in water at 60° C. for 5 hours. After the reaction, the acid was eliminated by evaporation. Mild periodate oxidation, alkaline reduction with borohydride, mild alkaline hydrolysis for cleavage of carboxylic acid esters, and permethylation for fragmentation analyses were performed essentially as described previously (Ylonen et al., 2001). Methylation with iodomethane was performed essentially as described previously (Powell & Harvey, 1996).
Glycan Isolation and Analysis.
Detached and purified glycans from paraffin-embedded formalin-fixed tissue samples from cancer patients were analysed by MALDI-TOF mass spectrometry after isolation of the neutral glycan fraction. Relative quantification of the glycans were done by comparing relative MALDI-TOF MS signal intensities, which is accurate for the obtained mixtures of purified glycans (Papac et al., 1996; Harvey, 1993; Saarinen et al., 1999).
Indicative Mass Spectrometric Signals of the Structure Group.
The indicative mass spectrometric signals of the structure group, in both positive and negative ion mode MALDI-TOF MS, are presented in Table 5. Examples of the glycan antigen signals present in lung cancer tumor samples from a patient with non-small cell lung adenocarcinoma, are presented in
General Features of the Structure Group.
The indicative glycan signals of the structure group in the non-sialylated glycan fraction, include O-glycan fragments that share in common the presence of an unusual reducing end terminal monosaccharide (2-acetamido-3-amino-2,3-dideoxyhexose, or deoxyamino-HexNAc, in which hexose is either D-galactose, D-gulose, D-allose, or D-glucose), which results from the strong alkaline conditions in the glycan isolation method, as discussed below. The major structure present at m/z 899 in various glycan samples from cancer patients is presented in
Nature of the Acidic Group.
The acidic group of the m/z 899 peak was recognized as N-acetylneuraminic acid, based on the following experiments. Mild acid hydrolysis destroyed the m/z 899 peak without affecting the other peaks in the profile. Simultaneously, this resulted in increase of the signal at m/z 608, which has 291 mass units smaller m/z value, corresponding to a mass difference of an acetylneuraminic acid residue. Mild periodic acid oxidation and subsequent borohydride reduction, resulted in the destroying of the m/z 899 peak. However, the m/z 608 peak together with the neutral glycan peaks in the glycan profile, were not affected by the periodic acid treatment. This corresponds to a cleavage between the C7 and C8 carbons of the glycerol tail, namely removal of 60 mass units (C2H4O2) from a neuraminic acid residue, and addition of 2 mass units (due to reduction of the reducing end of the oligosaccharide). The cleavage site was further shown to reside in the acid-labile sialic acid residue, because the neutral fragment at m/z 608 was not affected by mild periodate, but was instead reduced and transformed into m/z 610 during the reaction. Furthermore, both Arthrobacter ureafaciens neuraminidase and recombinant Streptococcus pneumoniae α2,3-sialidase hydrolyzed the m/z 899 peak from the spectrum, and transformed it into the peak at m/z 608. This suggests that the sialic acid linkage is α2→3, and not either α2→6, α2→8, or a2→9, to the next monosaccharide residue in the sequence. In addition, cleavage of a 291 mass unit fragment was shown to be the major cleavage route of the m/z 899 glycan peak, in post-source decay (PSD) MALDI-TOF mass spectrometric fragmentation experiments (
Oligosaccharide Sequence of the m/z 899 Glycan.
After removal of the sialic acid residue, the oligosaccharide sequence of the remaining glycan at m/z 608 was studied by specific exoglycosidases. Streptococcus pneumoniae β1,4-galactosidase, but not a recombinant β1,3-galactosidase, transformed the m/z 608 peak into a peak at m/z 446, corresponding to the removal of one hexose residue. Together with the known specificity of the enzymes and the general biosynthetic routes of human O-glycan structures (Brockhausen, 1999), this suggests that the major component at m/z 899 in the glycan profiles contains the non-reducing terminal oligosaccharide sequence NeuNAcα2-3Galβ1-4GlcNAcβ-R, where R is the reducing end component of 243 mass units that corresponds to the sodium adduct ion of acetamido-amino-dideoxyhexose [C8H16N2O5+Na]+.
Nature of the Reducing Terminal Monosaccharide.
The m/z 899 glycan had an intact reducing terminal, as evidenced by the transformation of the peak into a peak at m/z 901 upon reduction with alkaline sodium borohydride. The reducing terminal monosaccharide also contained a free primary amino group, as evidenced by the following experiments. Upon N-acetylation by acetic anhydride, the glycan peak was transformed into a peak at m/z 941, corresponding to an addition of 42 mass units, typical for acetylation. The +42 Da modification was resistant to mild alkaline hydrolysis, which is in accordance with the suggested acetamido linkage. Furthermore, the glycan peaks at m/z 899 and m/z 608 are usually accompanied by peaks with 22 mass units lesser mass in the mass spectra of the present experiments, namely at m/z 877 and m/z 586 respectively, corresponding to proton adduct ions of the same molecule ([M+H]+). In contrast, neither the normal neutral oligosaccharides present in the same glycan profile, nor the acetylated counterparts of the same glycan peaks have the accompanying proton adduct peaks. This suggests that the non-acetylated glycans at m/z 899 and m/z 608 have an unusual basic functional group, which is in accordance with the presence of the suggested free primary amine group. The reducing terminal monosaccharide could also be efficiently methylated by iodomethane in alkaline dimethylsulfoxide. In this reaction, the parent glycan peak at m/z 899 was transformed into a peak at m/z 933 for [M+C4H9]+, corresponding to the formation of a quarternary amine group and a molecular ion, and with the sialic acid residue transformed into a methyl ester. A carboxylic acid methyl ester is alkali-labile, and accordingly, upon mild alkali hydrolysis, this group was transformed into a free carboxylic acid. The resulting ion that corresponds to the formation of a quaternary amine strongly suggests for the presence of a primary amine in the original molecule.
The formation of the m/z 899 fragment was further studied by using bovine fetuin as a model glycoprotein. Fetuin contains both Core 1 and Core 2 branched O-glycans with structures NeuNAcα2-3Galβ1-3GalNAc(α-O-Ser/Thr), NeuNAcα2-3Galβ1-3(NeuNAcα2-6)GalNAc(α-O-Ser/Thr), and NeuNAcα2-3Galβ1-4GlcNAcβ1-6([±NeuNAcα2-3]Galβ1-3)GalNAc(α-O-Ser/Thr), respectively. The non-reductive β-elimation glycan isolation procedure that was used in glycan isolation from cancer patient tissues, produced abundant glycans at m/z 899. This glycan fragment was similar in its biochemical properties to its counterpart in human tissues. The only parent molecules available in the fetuin glycoprotein for the formation of the m/z 899 fragment, are the O-glycans that contain the substructure NeuNAcα2-3Galβ1-4GlcNAcβ1-6(R-3)GalNAc(α-O-Ser/Thr), where R are [±NeuNAcα2-3]Gal(β).
Based on known susceptibility of the 3-position substituent of GalNAc to β-elimination in alkaline conditions, the structure of the m/z 899 glycan peak could be assigned as arising from elimination of the 3-substituent from the O-glycan, and subsequent addition of ammonia into the unsaturated glycan ring, which forms a primary amine functional group into the reducing end monosaccharide that arises from the GalNAc residue. Furthermore, as the fragmentation starts with elimination from the 3-position of GalNAc, the amine modification will reside in the 3-position of the monosaccharide. In conclusion, the evidence suggests that the reducing terminal monosaccharide is 2-acetamido-3-amino-2,3-dideoxyhexose, to which the rest of the glycan sequence is attached at 6-position. The sequence of the major oligosaccharide present at m/z 899 is therefore NeuNAcα2-3Galβ1-4GlcNAcβ1-6(2-acetamido-3-amino-2,3-dideoxy)hexose. As the fragment formation starts from N-acetylgalactosamine, the most likely hexose isomers in the fragment are D-galactose, D-gulose, D-allose, and D-glucose.
Mass Spectrometric Fragmentation Analyses.
The fragments obtained in post-source decay MALDI-TOF mass spectrometry, from native glycan peaks at m/z 899 and m/z 608, and their acetylated as well as deuteroacetylated forms, showed the presence and sequence of the acetylneuraminic acid, hexose, and N-acetylhexosamine residues in the m/z 899 glycan, thus confirming the structural features described above (
Analysis of Sialylated Glycans.
In negative ion mode MALDI-TOF MS of the isolated sialylated glycan fraction of lung tumor and healthy control tissues, more specifically patients with non-small cell lung adenocarcinoma and ovarian cystadenocarcinoma, when tumor samples were compared to the corresponding healthy lung and ovary samples, respectively, sialylated glycan peaks were elevated at m/z 1038, corresponding to NeuNAc1Hex2HexNAc2 (calc. m/z 1038.36 for the ion [M−H]−), at m/z 1329, corresponding to NeuNAc2Hex2HexNAc2 (calc. m/z 1329.46 for the ion [M−H]−), and at m/z 1475, corresponding to NeuNAc2Hex2HexNAc2dHex1 (calc. m/z 1475.52 for the ion [M−H]−). This indicates that these glycan components are major parent glycans from which originates the m/z 899 glycan peak present in the positive ion mode mass spectra.
In contrast, sialylated glycan peaks were decreased at m/z 673, corresponding to NeuNAc1Hex1HexNAc1 (calc. m/z 673.23 for the ion [M−H]−), and at m/z 964, corresponding to NeuNAc2Hex1HexNAc1 (calc. m/z 964.33 for the ion [M−H]−), when compared to the larger glycan peaks mentioned above. This indicates that the increase in the amounts of the larger glycans happens in conjunction with the decrease in the amounts of the smaller glycans at m/z 673 and m/z 964. Furthermore, this suggests a change from Core 1 type O-glycans to Core 2 type O-glycans associated with malignant tumor samples.
Furthermore, in the healthy control samples from the lung and the ovary, no detectable peaks were present at m/z at m/z 1081, corresponding to NeuNAc1Hex1HexNAc3 (calc. m/z 1081 for the ion [M−H]−), at m/z 1370, corresponding to NeuNAc2Hex1HexNAc3 (calc. m/z 1329.46 for the ion [M−H]−), or at m/z 1516, corresponding to NeuNAc2Hex1HexNAc3dHex1 (calc. m/z 1475.52 for the ion [M−H]−). This indicates that the major 3-position substituents of the m/z 899 component present in the positive ion mode mass spectra, may be either a hexose monosaccharide, or a neuraminic acid-hexose disaccharide, in the original sample.
Samples from benign tumors of the ovary, namely benign ovarian cystadenoma, were similar to the healthy ovary sample in respect of their specific glycan structures, indicating that the described changes in the relative amounts of the glycan peaks, reflect a change associated with malignant transformation of cancer, or at least ovarian adenocarcinoma.
Statistical Calculations.
Statistical analyses were performed with the SAS Software (SAS System, version 8.2, SAS Institute Inc., Cary, N.C., USA), using SAS/STAT and SAS/BASE modules. All tests were performed as two-sided. The distributions of the experimental data were evaluated as 1) normal and symmetric, 2) only symmetric, or 3) non-symmetric and not normal, and the statistical test used was accordingly chosen as 1) Student's t Test, 2) Wilcoxon Signed Rank Test, or 3) Sign Test. A p value of less than 0.05 was considered statistically significant.
Neutral Low-Mannose Type N-Glycans are More Abundant in Tumor Tissue Samples than in Healthy Control Tissue Samples from Cancer Patients.
Formalin-fixed samples, from tumor and surrounding healthy tissue, were obtained from patients with various types of cancer. The studied cancer types included non-small cell lung adenocarcinoma, ductale breast carcinoma, lobulare breast carcinoma, stomach cancer, colon cancer, kidney cancer, ovarian carcinoma, pancreatic cancer, and cancers of the lymph nodes and the larynx. There were significant differences between the neutral low-mannose type N-glycans isolated from tumor samples and healthy tissue samples (Table 1), more specifically the Hex2-4HexNAc2 and Hex2-5HexNAc2dHex1 neutral glycans, as described above. Neutral low-mannose type N-glycans were shown to be expressed in statistically significant manner in lung cancer and in two types of breast cancer (Table 2). In the examples below, it must be taken into account that at least m/z 609, 755, 771, and 917, glycan peaks may contain multiple oligosaccharide structures.
The Neutral O-Glycans Hex2HexNAc2dHex0-1 are More Abundant in Tumor Tissue Samples than in Healthy Control Tissue Samples from Cancer Patients.
Formalin-fixed samples, from tumor and surrounding healthy tissue, were obtained from patients with various types of cancer. The studied cancer types included non-small cell lung adenocarcinoma, ductale breast carcinoma, lobulare breast carcinoma, stomach cancer, colon cancer, kidney cancer, ovarian carcinoma, pancreatic cancer, and cancers of the lymph nodes and the larynx. There were significant differences between the neutral O-glycans isolated from tumor samples and healthy tissue samples (Table 3), more specifically the Hex2HexNAc2 and Hex2HexNAc2dHex1 neutral glycans, as described above. O-glycans were shown to be expressed in statistically significant manner in lung cancer and in two types of breast cancer (Table 4). In the examples below, it must be taken into account that the m/z 771 and 917 glycan peaks may contain multiple oligosaccharide structures. However, while the m/z 917 glycan peak contains significant amounts of mannosidase-sensitive glycans, the vast majority of the glycans in the glycan peak at m/z 771, corresponding to Hex2HexNAc2, are mannosidase-resistant.
The 899 Series Glycans are More Abundant in Tumor Tissue Samples than in Healthy Control Tissue Samples from Cancer Patients.
Formalin-fixed samples, from tumor and surrounding healthy tissue, were obtained from patients with various types of cancer. The studied cancer types included non-small cell lung adenocarcinoma, ductale breast carcinoma, lobulare breast carcinoma, stomach cancer, colon cancer, kidney cancer, ovarian carcinoma, pancreatic cancer, and cancers of the lymph nodes and the larynx. There were significant differences between the m/z 899 series glycans isolated from tumor samples and healthy tissue samples (
In a group of lung cancer patients, more specifically non-small cell lung adenocarcinoma, the glycan peaks at m/z 609, 755, 771, 917, 1079, 1095, 1241, and/or 1403 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 1), when compared to healthy control tissues from the same patients. These glycan peaks correspond to Hex1HexNAc2, Hex1HexNAc2dHex1, Hex2HexNAc2, Hex2HexNAc2dHex1, Hex3HexNAc2dHex1, Hex4HexNAc2, Hex4HexNAc2dHex1, and Hex5HexNAc2dHex1 glycan epitopes, respectively. An example pair of mass spectra from a lung cancer patient is presented in
In a group of lung cancer patients, more specifically non-small cell lung adenocarcinoma, the glycan peaks at m/z 771 and 917 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 3), when compared to healthy control tissues from the same patients. These glycan peaks correspond to Hex2HexNAc2 and Hex2HexNAc2dHex1 glycan epitopes, respectively. This difference was shown to be statistically significant (Table 4). As stated above, the glycan peak at m/z 771 is practically mannosidase-resistant, while the peak m/z 917 consists of multiple structures. An example pair of mass spectra from a lung cancer patient is presented in
In a group of lung cancer patients, more specifically non-small cell lung adenocarcinoma, the glycan peaks at m/z 899 and m/z 1045 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 6 and
In a group of breast cancer patients, more specifically ductale breast carcinoma, the glycan peaks at m/z 609, 771, 917, 933, 1079, 1095, 1241, and/or 1403 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 1), when compared to healthy control tissues from the same patients. These glycan peaks correspond to Hex1HexNAc2, Hex2HexNAc2, Hex2HexNAc2dHex1, Hex3HexNAc2, Hex3HexNAc2dHex1, Hex4HexNAc2, Hex4HexNAc2dHex1, and Hex5HexNAc2dHex1 glycan epitopes, respectively. This difference was shown to be statistically significant (Table 2). An example pair of mass spectra from a ductale breast cancer patient is presented in
In a group of breast cancer patients, more specifically ductale breast carcinoma, the glycan peaks at m/z 771 and 917 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 3), when compared to healthy control tissues from the same patients. These glycan peaks correspond to Hex2HexNAc2 and Hex2HexNAc2dHex1 glycan epitopes, respectively. This difference was shown to be statistically significant (Table 4). As stated above, the glycan peak at m/z 771 is practically mannosidase-resistant, while the peak m/z 917 consists of multiple structures. An example pair of mass spectra from a ductale breast cancer patient is presented in
In a group of breast cancer patients, more specifically ductale breast carcinoma, the glycan peak at m/z 899 was expressed in significantly elevated amounts in the tissue samples from the tumor (Table 6), when compared to healthy control tissues from the same patients. This glycan peak corresponds to NeuNAc1Hex1HexNAc1 glycan epitope linked to the 6-position of the O-glycan core GalNAc residue, when the GalNAc had been originally substituted to the 3-position in the intact tissue. This difference was shown to be statistically significant (Table 7). An example pair of mass spectra from a ductale breast cancer patient is presented in
In a group of breast cancer patients, more specifically lobulare breast carcinoma, the glycan peaks at m/z 609, 755, 771, 917, 933, 1079, 1095, 1241, and/or and 1403 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 1), when compared to healthy control tissues from the same patients. These glycan peaks correspond to Hex1HexNAc2, Hex1HexNAc2dHex1, Hex2HexNAc2, Hex2HexNAc2dHex1, Hex3HexNAc2, Hex3HexNAc2dHex1, Hex4HexNAc2, Hex4HexNAc2dHex1, and Hex5HexNAc2dHex1 glycan epitopes, respectively. An example pair of mass spectra from a lobulare breast cancer patient is presented in
In a group of breast cancer patients, more specifically lobulare breast carcinoma, the glycan peaks at m/z 771 and 917 were expressed in significantly elevated amounts in the tissue samples from the tumor (Table 3), when compared to healthy control tissues from the same patients. These glycan peaks correspond to Hex2HexNAc2 and Hex2HexNAc2dHex1 glycan epitopes, respectively. As stated above, the glycan peak at m/z 771 is practically mannosidase-resistant, while the peak m/z 917 consists of multiple structures. An example pair of mass spectra from a lobulare breast cancer patient is presented in
In a group of breast cancer patients, more specifically lobulare breast carcinoma, the glycan peak at m/z 899 was expressed in significantly elevated amounts in the tissue samples from the tumor (Table 6), when compared to healthy control tissues from the same patients. This glycan peak corresponds to NeuNAc1Hex1HexNAc1 glycan epitope linked to the 6-position of the O-glycan core GalNAc residue, when the GalNAc had been originally substituted to the 3-position in the intact tissue. This difference was shown to be statistically significant (Table 7). An example pair of mass spectra from a lobulare breast cancer patient is presented in
In a group of patients with ovarian tumors, more specifically malign ovarian cystadenocarcinoma or benign ovarian cystadenoma, the glycan peaks at m/z 899 and m/z 1045 were expressed in significantly elevated amounts in the tissue samples from the malignant tumors (Table 6 and
In addition to the statistically studied larger patient populations in lung and breast cancers, neutral low-mannose type N-glycans were expressed in many cancer types, when compared to healthy control tissues from the same patients. These results are summarized in Table 1.
The neutral O-glycans at m/z 771 and 917 were expressed in many cancer types, when compared to healthy control tissues from the same patients. These results are summarized in Table 3.
The m/z 899 series glycans were expressed in many cancer types, when compared to healthy control tissues from the same patients. These results are summarized in Table 6.
Tissue Samples and Glycan Isolation.
Archival paraffin-embedded and formalin-fixed tissue samples were from patients with non-small cell adenocarcinoma. After deparaffinisation, protein-linked glycans were detached from tissue sections with non-reductive alkaline β-elimination in concentrated ammonia-ammonium carbonate essentially as described previously (Huang et al., 2001). The isolated glycans were purified and divided into sialylated and non-sialylated glycan fractions as described in the other Examples of the present invention.
MALDI-TOF Mass Spectrometry.
MALDI-TOF mass spectrometry was performed with a Voyager-DE STR BioSpectrometry Workstation, essentially as described previously (Saarinen et al., 1999; Harvey et al., 1993). Relative molar abundancies of both neutral (Naven & Harvey, 1996) and sialylated (Papac et al., 1996) glycan components were assigned based on their relative signal intensities.
Occurrence of Multiple Cancer-Associated Protein-Linked Glycans in Tissue Glycan Profiles.
Tissue Samples and Glycan Isolation.
Archival paraffin-embedded and formalin-fixed tissue samples were from patients with malignant ovarian cystadenocarcinoma or benign ovarian cystadenoma. After deparaffinisation, protein-linked glycans were detached from tissue sections with non-reductive alkaline β-elimination in concentrated ammonia-ammonium carbonate essentially as described previously (Huang et al., 2001). The isolated glycans were purified and divided into sialylated and non-sialylated glycan fractions as described in the other Examples of the present invention.
MALDI-TOF Mass Spectrometry.
MALDI-TOF mass spectrometry was performed with a Voyager-DE STR BioSpectrometry Workstation, essentially as described previously (Saarinen et al., 1999; Harvey et al., 1993). Relative molar abundancies of both neutral (Naven & Harvey, 1996) and sialylated (Papac et al., 1996) glycan components were assigned based on their relative signal intensities. The mass spectrometric fragmentation analysis was done with the Voyager-DE STR BioSpectrometry Workstation according to manufacturer's instructions. For the fragmentation analysis, sialylated glycans were further purified by gel filtration HPLC and permethylated essentially as described previously (Nyman et al., 1998).
Exoglycosidase Digestions.
Digestions with A. ureafaciens neuraminidase (Glyko, UK), S. pneumoniae β-glucosaminidase (Calbiochem, USA), and Jack bean β-hexosaminidase (C. ensiformis; Calbiochem, USA) were performed essentially as described previously (Saarinen et al., 1999). The specificity of the two latter enzymes was controlled with synthetic oligosaccharides with terminal 1) Galβ1-4GlcNAcβ, 2) GlcNAcβ, and 3) GalNAcβ1-4GlcNAcβ epitopes: β-hexosaminidase digested the HexNAc residues in 2) and 3), but not 1), and β-glucosaminidase digested 2) but not 1) or 3).
Occurrence of HexNAcβHexNAcβ Sequences in Sialylated and Neutral Protein-Linked Glycans from Ovarian Tissue Samples.
Structures of the Glycans.
The sialylated glycans were indicated to contain sialic acid residues upon neuraminidase digestion and subsequent analysis by MALDI-TOF mass spectrometry (data not shown). Exoglycosidase digestions of both neutral and sialylated fractions of benign ovarian tumor glycan samples showed that e.g. sialylated glycans 26 and 27, as well as neutral glycans at m/z 1850 and 1891 were resistant to the action of β-glucosaminidase, but upon β-hexosaminidase digestion they lost either two or four HexNAc units. This indicates that the terminal residue in these glycans is not GlcNAcβ but that it may be GalNAcβ which is terminal to another HexNAcβ unit. The possible such structures include LacdiNAc (GalNAcβ4GlcNAcβ) that has been indicated previously in an ovarian glycoprotein.
Cancer-Associated Glycan Signals and Glycosylation Changes.
The present Example shows that multiple glycans and the glycan profiles are different between normal and malignant ovarian tissue, and that benign and malignant tumors differ from each other in multiple glycosylation features (see
Relative abundancy profiles were obtained for the neutral protein-linked glycan fractions of cancer patient tissue samples with ductale and lobulare type breast adenocarcinoma, non-small cell lung adenocarcinoma, colon carcinoma or benign colon tumor, and ovarian cystadenocarcinoma (malignant tumor) or cystadenoma (benign tumor) as described in the other Examples.
Generation of a Discrimination Formula.
A principal component and discrimination analysis was done to the results of a randomly picked group of four cancer and four healthy tissue samples from patients with ductale type breast carcinoma. It was found that three glycan signals could resolve the samples into cancer and healthy groups (
5.36×I(m/z 771)+20.0×I(m/z 899)+8.13×I(m/z 917)−60.6,
where ‘I (glycan signal)’ refers to the relative abundance of the glycan signal marked in parenthesis, the three glycan signals correspond to sodium adduct ions of Hex2HexNAc2, NeuAc1Hex1HexNAc1(deoxyamino)HexNAc, and Hex2HexNAc2dHex1, respectively, and the resulting ‘scores’ for each sample are plotted on the y-axis in
Testing the Discrimination Formula.
The obtained experimental discrimination formula was first applied to neutral protein-linked glycan analysis results of a group of ductale type breast carcinoma patients. As seen in Figure xA (‘test group’), the formula could correctly discriminate 10 out of 10 samples (100%). Similarly, the formula was applied to samples from lobulare type breast carcinoma and lung cancer patients, and it discriminated correctly 12 out of 14 (86%) and 15 out of 17 (88%) samples from these patients, respectively. When applied to samples of ovarian tumors and healthy ovarian tissue (
The present results show that a simple experimental discrimination formula derived from a small group of breast cancer patients, applied on the results of neutral protein-linked glycan profiling, could effectively discriminate between cancerous and healthy samples of multiple cancer types. In particular, the same formula could correctly differentiate between malignant and benign tumors in both colon and ovary. In addition, it was shown that benign tumors resemble normal tissues and that malignant tumors differ significantly from both normal tissues and benign tumors with respect to the relative abundancies of glycan signals including neutral O-glycans, sialylated Core 2 type O-glycans, and low-mannose N-glycans. In conclusion, similar discrimination procedures can be used in diagnostics of human tumors and cancer. The present results indicate that there are individual differences in the expression of the cancer-associated glycan structures in both normal tissues between persons and in cancerous tissues between individual tumors. However, this did not effect the detection results of the present method.
Due to the presence of tissue-specific cancer-associated glycosylation revealed in the present invention, such as HexNAcβHexNAcβ glycans in tumors of the ovary (described in the other Examples), it is indicated that tissue-specific discrimination formulas based on the results of the present invention can differentiate between cancer and healthy samples and/or benign tumors even more efficiently than the exemplary formula of the present Example.
Glycan Isolation.
N-linked glycans are preferentially detached from cellular glycoproteins by F. meningosepticum N-glycosidase F digestion (Calbiochem, USA) essentially as described previously (Nyman et al., 1998), after which the released glycans are preferentially purified for analysis by solid-phase extraction methods, including ion exchange separation, and divided into sialylated and non-sialylated fractions. For O-glycan analysis, glycoproteins are preferentially subjected to reducing alkaline β-elimination essentially as described previously (Nyman et al., 1998), after which sialylated and neutral glycan alditol fractions are isolated as described above. Free glycans are preferentially isolated by extracting them from the sample with water.
Example of a Glycan Purification Method.
Isolated oligosaccharides can be purified from complex biological matrices as follows, for example for MALDI-TOF mass spectrometric analysis. Optionally, contaminations are removed by precipitating glycans with 80-90% (v/v) aqueous acetone at −20° C., after which the glycans are extracted from the precipitate with 60% (v/v) ice-cold methanol. After glycan isolation, the glycan preparate is passed in water through a strong cation-exchange resin, and then through C18 silica resin. The glycan preparate can be further purified by subjecting it to chromatography on graphitized carbon material, such as porous graphitized carbon (Davies, 1992). To increase purification efficiency, the column can be washed with aqueous solutions. Neutral glycans can be washed from the column and separated from sialylated glycans by elution with aqueous organic solvent, such as 25% (v/v) acetonitrile. Sialylated glycans can be eluted from the column by elution with aqueous organic solvent with added acid, such as 0.05% (v/v) trifluoroacetic acid in 25% (v/v) acetonitrile, which elutes both neutral and sialylated glycans. A glycan preparation containing sialylated glycans can be further purified by subjecting it to chromatography on microcrystalline cellulose in n-butanol:ethanol:water (10:1:2, v/v) and eluted by aqueous solvent, preferentially 50% ethanol:water (v/v). Preferentially, glycans isolated from small sample amounts are purified on miniaturized chromatography columns and small elution and handling volumes. An efficient purification method comprises most of the abovementioned purification steps. In an efficient purification sequence, neutral glycan fractions from small samples are purified with methods including carbon chromatography and separate elution of the neutral glycan fraction, and glycan fractions containing sialylated glycans are purified with methods including both carbon chromatography and cellulose chromatography.
MALDI-TOF Mass Spectrometry.
MALDI-TOF mass spectrometry is performed with a Voyager-DE STR BioSpectrometry Workstation or a Bruker Ultraflex TOF/TOF instrument, essentially as described previously (Saarinen et al., 1999; Harvey et al., 1993). Relative molar abundancies of both neutral (Naven & Harvey, 1996) and sialylated (Papac et al., 1996) glycan components are assigned based on their relative signal intensities. The mass spectrometric fragmentation analysis is done with the Bruker Ultraflex TOF/TOF instrument according to manufacturer's instructions.
Examples of Analysis Sensitivity.
Protein-linked and free glycans, including N- and O-glycans, are typically isolated from as little as about 5×104 cells in their natual biological matrix and analyzed by MALDI-TOF mass spectrometry.
Examples of Analysis Reproducibility and Accuracy.
The present glycan analysis methods have been validated for example by subjecting a single biological sample, containing human cells in their natural biological matrix, to analysis by five different laboratory personnel. The results were highly comparable, especially by the terms of detection of individual glycan signals and their relative signal intensities, indicating that the reliability of the present methods in accurately describing glycan profiles of biological samples including cells is excellent. Each glycan isolation and purification phase has been controlled by its reproducibility and found to be very reproducible. The mass spectrometric analysis method has been validated by synthetic oligosaccharide mixtures to reproduce their molar proportions in a manner suitable for analysis of complex glycan mixtures and especially for accurate comparison of glycan profiles from two or more samples. The analysis method has also been successfully transferred from one mass spectrometer to another and found to reproduce the analysis results from complex glycan profiles accurately by means of calibration of the analysis.
Examples of Biological Samples and Matrices for Successful Glycan Analysis.
The method has been successfully implied on analysis of e.g. blood cells, cell membranes, aldehyde-fixated cells, glycans isolated from glycolipids and glycoproteins, free cellular glycans, and free glycans present in biological matrices such as blood. The experience indicates that the method is especially useful for analysis of oligosaccharide and similar molecule mixtures and their optional and optimal purification into suitable form for analysis.
Generation of Glycan Profiles from Mass Spectrometric Data.
Analysis of KLH Glycosylation.
Keyhole limpet hemocyanin was from Sigma (Megathura crenulata; USA). N-linked and O-linked glycans were released from KLH by N-glycosidase enzyme and alkaline β-elimination, respectively, and the released glycans were purified and analyzed essentially as described in the previous Examples. The detected N-glycan masses, as resolved by MALDI-TOF mass spectrometry, were essentially similar to those described by Kurokawa et al. (2001). However, novel glycan components could be engineered and/or identified on KLH by exoglycosidase digestions with α-mannosidase (Jack beans, C. ensiformis, Sigma) and combined β-galactosidase digestion (β1,4-galactosidase from S. pneumoniae and recombinant β1,3/6-galactosidase, Calbiochem, USA). More specifically, based on the susceptibility of the Hex3HexNAc2dHex1 glycan to β-galactosidase (one hexose removed) and N-glycosidase F (detachment from the glycoprotein) it was found that KLH contains Manα3(Galβ6)Manβ4GlcNAcβ4(Fucα6)GlcNAc that can be converted to Manα3Manβ4GlcNAcβ4(Fucα6)GlcNAc. The latter structure is a novel component in KLH. High-mannose and low-mannose N-glycans were similarly identified based on susceptibility to α-mannosidase. It was concluded that KLH contains at least the following human-type glycans in significant amounts: (Manα)3Manβ4GlcNAcβ4GlcNAc, (Manα)4Manβ4GlcNAcβ4GlcNAc, (Manα)5Manβ4GlcNAcβ4GlcNAc, (Manα)6Manβ4GlcNAcβ4GlcNAc, (Manα)1Manβ4GlcNAcβ4(Fucα6)GlcNAc,
(Manα)2Manβ4GlcNAcβ4(Fucα6)GlcNAc, (Manα)3Manβ4GlcNAcβ4(Fucα6)GlcNAc, (Manα)4Manβ4GlcNAcβ4(Fucα6)GlcNAc, and (Manα)5Manβ4GlcNAcβ4(Fucα6)GlcNAc. About ⅕ of the non-fucosylated and ⅓ of the fucosylated glycans with similar masses as the glycans listed above were assigned as containing β1,6-linked galactose residues based on their resistance to the action of α-mannosidase (Kurokawa et al., 2001). The latter are not human-type glycans.
Production of KLH with Human-Type Low-Mannose N-Glycans.
β-galactosidase was used to removal of, and the reaction result was characterized by MALDI-TOF mass spectrometry of released glycans, verifying that significant amounts of β-galactose residues, occurring in KLH glycans and capping the human-type low-mannose N-glycans, were removed. Based on the structural knowledge described previously (Kurokawa et al., 2001), it was concluded that a modified spectrum of KLH glycans was produced, containing more low-mannose N-glycans than original KLH, and especially the novel human-type low-mannose glycan Manα3Manβ4GlcNAcβ4(Fucα6)GlcNAc described above. As described in the preceding Examples, the increased glycans present in KLH are associated with malignant tumors in major human cancer types.
Protein-linked glycans were isolated by non-reductive alkaline elimination essentially as described by Huang et al. (2000), or by N-glycosidase digestion to specifically retrieve N-glycans as described in the preceding Examples.
Tissue-Specific Glycosylation Analyses and Comparison of Glycan Profiles Between Tissues.
Human tissue protein-linked glycan profiles were analyzed from lung, breast, kidney, stomach, pancreas, lymph nodes, liver, colon, larynx, ovaries, and blood cells and serum. In addition, cultured human cells were analyzed similarly. Tables 14 and 15 show neutral and acidic protein-linked glycan signals, respectively, observed in these human tissues and cells together with their classification into glycan structure groups. However, the individual glycan signals in each structure group varied from sample type to sample type, reflecting tissue material and cell type specific glycosylation. Importantly, in analyses of multiple samples, such as 10 samples from an individual human tissue type, glycan group feature proportions remain relatively constant with respect to variation in the occurrence of individual glycan signals.
Furthermore, it was observed that each tissue demonstrated a specific glycan profile that could be distinguished from the other tissues, cells, or blood or serum samples by comparison of glycan profiles according to the methods described in the present invention. It was also found out that glycan profile difference could be quantitated by comparing the difference between two glycan profiles, for example according to the Equation (resulting in difference expressed in %):
wherein p is the relative abundance (%) of glycan signal i in profile a or b, and n is the total number of glycan signals. For example, the Equation reveals that human lung and ovary tissue protein-linked glycan profiles differ from each other significantly more than human lung and kidney tissue protein-linked glycan profiles differ from each other. Each tissue or cell type could be compared in this manner.
Comparison of Glycosylation Features Between Human Tissue Materials.
Table 16 shows how glycan signal structural classification according to the present invention was applied to the comparison of quantitative differences in glycan structural features in glycan profiles between human tissue materials. The results show that each sample type was different from each other with respect to the quantitative glycan grouping and classification. Specifically, normal human lung and lung cancer tissues were different from each other both in the neutral glycan and sialylated glycan fractions with respect to the quantitative glycan structure grouping. In particular, lung cancer showed increased amounts of glycan signals classified into terminal HexNAc containing glycans. In analysis of individual glycan signals by β-glucosaminidase digestion, it was found that lung cancer associated glycan signals, such as Hex3HexNAc4dHex1, contained terminal 3-linked GlcNAc residues, correlating with the classification of these glycan signals into the terminal HexNAc (N>H and/or N═H) glycan groups. Furthermore, the human serum protein-linked glycan profile showed significantly lower amounts of high-mannose and especially low-mannose type N-glycan signals. It is concluded that the glycan grouping profile of human serum is significantly different from the corresponding profiles of solid tissues, and the present methods are suitable for identification of normal and diseased human tissue materials and blood or serum typical glycan profiles from each other.
Disease- and Tissue-Specific Differences in Glycan Structure Groups.
The analysis of protein-linked glycan profiles of human tissues revealed also that tissues with abundant epithelial structures, such as stomach, colon, and pancreas, contain increased amounts of small glycan structures, preferentially mucin-type glycans, and fucosylated glycan structures compared to the other glycan structure groups in structure classification. Similarly as epithelial tissues, mucinous carcinomas were differentiated from other carcinoma types based on analysis of their protein-linked glycan profiles and structure groups according to the methods of the present invention.
Glycan material is liberated from biological material by enzymatic or chemical means. To obtain a less complex sample, glycans are fractionated into neutral and acidic glycan fractions by chromatography on a graphitized carbon as described above. A useful purification step prior to NMR analysis is gel filtration high-performance liquid chromatography (HPLC). For glycans of glycoprotein or glycolipid origin, a Superdex Peptide HR10/300 column (Amersham Pharmacia) may be used. For larger glycans, chromatography on a Superdex 75 HR10/300 column may yield superior results. Superdex columns are eluted at a flow rate of 1 ml per minute with water or with 50-200 mM ammonium bicarbonate for the neutral and acidic glycan fractions, respectively, and absorbance at 205-214 nm is recorded. Fractions are collected (typically 0.5-1 ml) and dried. Repeated dissolving in water and evaporation may be necessary to remove residual ammonium bicarbonate salts in the fractions. The fractions are subjected to MALDI-TOF mass spectrometry and all fractions containing glycans are pooled.
Prior to NMR analysis, the pooled fractions are dissolved in deuterium oxide and evaporated. With glycan preparations containing about 100 nmol or more material, the sample is finally dissolved in 600 microliters of high-quality deuterium oxide (99.9-99.996%) and transferred to a NMR analysis tube. A roughly equimolar amount of an internal standard, e.g. acetone, is commonly added to the solution. With glycan preparations derived from small tissue specimens or from a small number of cells (5-25 million cells), the sample is preferably evaporated from very high quality deuterium oxide (99.996%) twice or more to eliminate H2O as efficiently as possible, and then finally dissolved in 99.996% deuterium oxide. These low-material samples are preferebly analyzed by more sensitive NMR techniques. For example, NMR analysis tubes of smaller volumes can be used to obtain higher concentration of glycans. This kind of tubes include e.g. nanotubes (Varian) in which sample is typically dissolved in a volume of 37 microliters. Alternatively, higher sensitivity is achieved by analyzing the sample in a cryo-NMR instrument, which increases the analysis sensitivity through low electronic noise. The latter techniques allow gathering of good quality proton-NMR data from glycan samples containing about 1-5 nmol of glycan material.
It is realized that numerous studies have shown that proton-NMR data has the ability to indicate the presence of several structural features in the glycan sample. In addition, by careful integration of the spectra, the relative abundancies of these structural features in the glycan sample can be obtained.
For example, the proton bound to monosaccharide carbon-1, i.e. H-1, yields a distinctive signal at the lower field, well separated from the other protons of sugar residues. Most monosaccharide residues e.g. in N-glycans are identified by their H-1 signals (see Tables 4 and 5 for representative examples). In addition, the H-2 signals of mannose residues are indicative of their linkages.
Sialic acids do not possess a H-1, but their H-3 signals (H-3 axial and H-3 equatorial) reside well separated from other protons of sugar residues. Moreover, differently bound sialic acids may be identified by their H-3 signals. For example, the NeuSAc H-3 signals of Neu5Acα2-3Gal structure are found at 1.797 ppm (axial) and 2.756 ppm (equatorial). On the other hand, the NeuSAc H-3 signals of Neu5Acα2-6Gal structure are found at 1.719 ppm (axial) and 2.668 ppm (equatorial). By comparing the integrated areas of these signals, the molar ratio of these structural features is obtained.
Other structural reporter signals are commonly known and those familiar with the art use the extensive literature for reference in glycan NMR assignments.
Samples:
N-glycan fractions were liberated from pancreas carcinoma samples, purified as described in the preceding Examples, ultimately by gel filtration HPLC, and fractionated into 1) large neutral, 2) small neutral, and 3) acidic N-glycan fractions. These samples were analyzed by cryo-probe 1H-NMR as described above.
Large Neutral N-Glycan Fraction:
This fraction contained high-mannose N-glycans corresponding to the structural elements of reference structures in
Small Neutral N-Glycan Fraction:
This fraction contained low-mannose N-glycans as well as MannGlcNAc1 glycans corresponding to the structural elements of reference structures in
Acidic Neutral N-Glycan Fraction:
This fraction contained complex-type N-glycans corresponding to the structural elements of biantennary N-glycan reference structures in
Lysosomal protein sample including human myeloperoxidase was chosen to represent lysosomal organelle glycoproteins. The sample was digested with N-glycosidase F to isolate N-glycans, and they were purified for MALDI-TOF mass spectrometric analysis as described in the preceding Examples.
Alkaline phosphatase digestion was performed essentially according to manufacturer's instructions. After the digestion glycans were purified for MALDI-TOF mass spectrometric analysis as above.
Neutral N-Glycan Profiles.
The neutral N-glycan profile is presented in
Acidic N-Glycan Profiles.
The acidic N-glycan profile is presented in
Phosphorylated N-Glycans.
Major glycan signals with phosphate or sulphate ester (SP) in their monosaccharide compositions were Hex5HexNAc2SP (1313), Hex6HexNAc2SP (1475), and Hex7HexNAc2SP (1637). When the acidic glycan fraction was subjected to alkaline phosphatase digestion, these major signals were specifically digested and disappeared from the acidic glycan spectrum as detected by MALDI-TOF mass spectrometry (data not shown). In contrast, the major glycan signals with sialic acids in their monosaccharide compositions were not digested, including NeuAc1Hex3HexNAc3dHex1 (1549). This indicates that the three original glycan signals corresponded to phosphorylated N-glycans (PO3H)Hex5HexNAc2, (PO3H)Hex6HexNAc2, and (PO3H)Hex7HexNAc2, respectively, wherein PO3H denotes phosphate ester.
The data further indicated that the present organelle-specific N-glycan profile included phosphorylated low-mannose type and high-mannose type N-glycans (PO3H)Hex3HexNAc2 (989), (PO3H)Hex4HexNAc2 (1151), (PO3H)Hex5HexNAc2 (1313), (PO3H)Hex6HexNAc2 (1475), (PO3H)Hex7HexNAc2 (1637), and (PO3H)Hex8HexNAc2 (1799). In this glycan profile the phosphorylated glycan residues are preferentially mannose residues, more preferentially α-mannose residues, and most preferentially 6-phospho-α-mannose residues i.e. (PO3H-6Manα).
Normal lung (Sample I) and malignant lung tumor samples (Sample II) were archival formalin-fixed and paraffin-embedded tissue sections from cancer patients with small cell lung cancer. Protein-linked glycans were isolated from the representative samples by non-reductive β-elimination, purified, and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples. In the present analysis, the total desialylated protein-linked glycomes from each sample were used.
To analyze the data and to find the major glycan signals associated with either the normal state or the disease, two variables were calculated for the comparison of glycan signals between the two samples:
1. absolute difference A=(SII−SI), and
2. relative difference R=A/SI,
wherein SI and SII are relative abundances of a given glycan signal in Sample I (normal human lung tissue) and Sample II (small cell lung cancer), respectively.
The glycan signals were further classified into structure classes by a one letter code:
a b c d,
wherein a is either N (neutral) or S (sialylated); b is either L (low-mannose type), M (high-mannose type), H (hybrid-type or monoantennary), C (complex-type), S (soluble), or O (other); c is either - (nothing), F (fucosylated), or E (multifucosylated); and d is either - (nothing), T (terminal HexNAc, N>H), or B (terminal HexNAc, N═H); as described in the present invention.
To identify protein-linked glycan signals correlating with malignant tumors in total tissue glycomes from cancer patient, major signals specific to either normal lung tissue or malignant small cell lung cancer tumors were selected based on their relative abundances. When A and R were calculated for the glycan profile datasets of the two samples, and the glycan signals thereafter sorted according to the values of A and R, the most significant differing glycan signals between the two samples could be identified (Table 18). Among the most abundant protein-linked glycan signals in the data, the following three signals had emerged in II (new in Table 18): 1955, 2685, and 2905, corresponding to fucosylated complex-type N-glycans. The absolute differences of these signals were among the ten most large in the data, indicating that they were significant. The signals that experienced the highest relative increase in R were: 771 (R=2.4, corresponding to 3.4-fold increase), 1905 (R=2.2, corresponding to 3.2 fold increase), and 1485 (R=1.3, corresponding to 2.3 fold increase). The latter signal corresponded to complex-type N-glycans with terminal HexNAc. Significantly, its +2Hex counterpart 1809 was the most drastically reduced glycan signal in II with A=−8.9 and R=−0.4 (corresponding to 40% decrease in II), indicating a large change in terminal HexNAc expression. Moreover, the data easily shows that the glycan signals 1704, 1866, 1136, and 755 were not present in II.
Further, the obtained results, especially the identified major glycan signals indicative of either Sample II (high A and R) or Sample I (low A and R) were used to compile two alternative algorithms to produce glycan score with which lung cancer sample could be identified from normal lung sample based on the glycan signal values of the quantitative glycome data:
1. glycan score=I(1485)−I(1809),
wherein I(1485) is the relative abundance of glycan signal 1485 and I(1809) is the relative abundance of glycan signal 1809;
and alternatively:
2. glycan score=I(1485)/I(1809)
These glycan score algorithms yield high numerical value when applied to lung cancer sample and low numerical value when applied to normal lung sample.
The present identification analysis produced selected glycan signal groups, from where indifferent glycan signals have been removed and that have reduced noise or background and less observation points, but have the resolving power of the initially obtained glycan profiles. Such selected signal groups and their patterns in different sample types can serve as a signature for the identification of for example 1) normal human glycosylation, 2) tissue-specific glycosylation, 3) disease states affecting tissue glycosylation, 4) malignant cancer, 5) malignancy in comparison to benign tumors, and grade of malignancy, or 6) glycan signals that have individual variation. Moreover, glycan signals can be identified that do not change between samples, including major glycans that can be considered as invariant or housekeeping glycans.
The present data analysis identified potential glycan marker signals for future identification of either the normal lung of the lung tumor glycome profiles. Further, glycan classes that are associated with e.g. disease state in humans can be identified. Specifically, the analysis revealed that within the total complex-type N-glycan structure class in the tissue glycomes, terminal HexNAc (N>H) were typical to small cell lung cancer.
The method also allows identification of major glycans or major changes within glycan structure classes. For example, the proportion of multifucosylated glycans within the total tissue glycome profile was increased in II (1.1%) compared to I (0.3%). The data analysis identified this change predominantly to the appearance of glycan signals 1955 and 2685 in II.
Cancer cell derived N-glycans were obtained by N-glycosidase F digestion and purified as described in the preceding Examples. The glycan sample was dissolved in 10 μl of 8 mM sodium metaperiodate prepared in 0.1 M sodium acetate buffer, pH 5.5. The oxidation reaction was allowed to proceed at +4° C. in the dark for two days. The excess of periodate was then destroyed by adding 10 μl of 80 mM ethylene glycol and the mixture was allowed to stand for 6 hours. The reaction mixture was then neutralized by adding 10 μl of 0.1 M aqueous ammonia. The aldehyde groups which were generated in the reaction mixture were then reduced by adding 10 μl of 1.6 M sodium borohydride and allowed to stand overnight at +4° C.
The carbohydrate material in the reaction mixture was isolated by solid phase extraction on a small column of graphitized carbon and subjected to MALDI-TOF mass spectrometry in 2,5-dihydroxybenzoic acid matrix.
Some of the major signals observed in the spectrum (
The signal m/z 1831 represents an oxidized-reduced form of (Man)9(GlcNAc)2 species ([M+Na]+, m/z 1905), containing three terminal Man units. Because periodate oxidizes vicinal hydroxyl groups, the terminal units are oxidized so that they lose the C-3 as formaldehyde, each contributing to a loss of −28 Da in mass. In addition, 2-substituted Man units may get oxidized between C-3 and C-4, yielding two primary alcohol groups (leading to +2 Da mass change each). Typical mammalian (Man)9(GlcNAc)2 species carry four Manα1,2 residues, and here four 2-substituted Man units were oxidized this way and contribute to a mass increment of +8 Da. The reducing end of the glycan is also reduced, yielding+2 Da increment. No unoxidized (Man)9(GlcNAc)2 species can be observed in the spectrum.
The signal m/z 1831 is accompanied by a signal at m/z 1861. This signal is assigned as an oxidized-reduced structure where one of the terminal Man units is oxidized only between C2-C3 or C3-C4, but not liberating formaldehyde, thus yielding a structure 30 Da larger than m/z 1831 material.
The signal m/z 1667 represents an oxidized-reduced form of (Man)8(GlcNAc)2 species ([M+Na]+, m/z 1743), containing three terminal Man units. These terminal units are oxidized so, that they lose the C-3 as formaldehyde, each contributing to a loss of −28 Da. In addition, all three 2-substituted Man units got oxidized between C-3 and C-4, yielding two primary alcohol groups (+2 Da). The m/z 1697 signal is assigned as an oxidized-reduced structure where one of the terminal Man units is oxidized only between C2-C3 or C3-C4, but not liberating formaldehyde, thus yielding a structure 30 Da larger than m/z 1667 material.
Other major signals are derived similarly from the following structures:
m/z 1503 and 1533 from (Man)7(GlcNAc)2 species (intact m/z 1581);
m/z 1339 and 1369 from (Man)6(GlcNAc)2 species (intact m/z 1419);
m/z 1175 and 1205 from (Man)5(GlcNAc)2 species (intact m/z 1257);
m/z 1041 and 1071 from (Man)4(GlcNAc)2 species (intact m/z 1095).
A part of the mass spectrum shown in
The m/z 879 signal is derived from an oxidized-reduced form of m/z 933, a (Man)3(GlcNAc)2 species. Here, two terminal unsubstituted Man units are oxidized to release formaldehyde, other monosaccharide units are stable to periodate oxidation. Preferred structure corresponding to the original abundant signal is Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc.
Affinity reagent for analyzing antibody molecules recognizing type II N-acetyllactosamine on Core 2 O-glycans was prepared by coupling Galβ4GlcNAcβ6(Galβ3)GalNAcα-O—(CH2)2-(p-amino)benzyl (IsoSep, Sweden) to N-hydroxysuccinimide activated Sepharose (Amersham Pharmacia, Sweden).
Serum samples were isolated from a person that had had malignant ovarian cancer and recovered from it, as well as control individuals.
Ig antibodies were analyzed by affinity isolating them from human sera, washing, eluting with acid, and detecting Ig subunits by protein detection after SDS-PAGE according to standard procedures.
Ovarian cancer patient samples showed substantial amounts of human Ig antibodies with affinity to the type II N-acetyllactosamine Core 2 O-glycan affinity material, and proteins corresponding to Ig subunits could be detected in SDS-PAGE.
Protein-linked glycans were isolated from formalin-fixed and paraffin-embededed tissue sections and analyzed as describe in the preceding Examples. Following cancer types were analyzed: lung cancer, both small cell lung adenocarcinoma and non-small cell lung adenocarcinoma, and lung carcinoma liver metastases; breast cancer; ductale type breast adenocarcinoma and lymph node metastases thereof; lobulare type breast adenocarcinoma and lymph node metastases thereof; ovarian cystadenocarcinoma; colon cancer/carcinoma, carcinoma adenomatosum, and liver metastases thereof; kidney cancer/carcinoma, and kidney hypernephroma; gastric cancer/carcinoma, and lymph node metastases thereof, liver cancer/carcinoma; larynx cancer/carcinoma; pancreas cancer/carcinoma; melanoma and liver metastases thereof; gall bladder cancer/carcinoma, and liver metastases thereof; salivary gland cancer/carcinoma, and skin metastases thereof; and lymph node cancer/carcinoma (lymphoma).
Low-mannose type N-glycans were identified based on their indicative mass spectrometric glycan signals for [M+Na]+ ions of Hex1-4HexNAc2dHex0-1, as described in to the present invention. In the present analyses the preferred indicative signals of the glycan structure group were m/z 771, 917, 933, 1079, and 1095 due to their abundance in mass spectrometric glycan profiles; 771, 933, and 1095 especially in case of non-fucosylated low-mannose type N-glycans; 917 and 1079 especially in case of detecting fucosylated low-mannose type N-glycans; 933, 1079, and 1095 especially in case of detecting low-mannose type N-glycans when N- and O-glycans were analyzed simultaneously; 1079 and 1095 as preferred sensitive indicators of the group; and/or 1079 as a preferred sensitive single indicator of the group.
Using these criteria, low-mannose type N-glycans were detected to be cancer-associated and expressed in malignant tumors in following cancer types: lung cancer, both small cell lung adenocarcinoma and non-small cell lung adenocarcinoma, and lung carcinoma liver metastases; breast cancer; ductale type breast adenocarcinoma and lymph node metastases thereof; lobulare type breast adenocarcinoma and lymph node metastases thereof; ovarian cystadenocarcinoma; colon cancer/carcinoma, carcinoma adenomatosum, and liver metastases thereof; kidney cancer/carcinoma, and kidney hypernephroma; gastric cancer/carcinoma, and lymph node metastases thereof, liver cancer/carcinoma; larynx cancer/carcinoma; pancreas cancer/carcinoma; melanoma and liver metastases thereof; gall bladder cancer/carcinoma, and liver metastases thereof; and lymph node cancer/carcinoma (lymphoma). Further the expression of the glycans was detected in abundant amounts in salivary gland cancer/carcinoma, and skin metastases thereof
In all these cancer types, both fucosylated and non-fucosylated low-mannose type N-glycans were detected and expression levels were found to be higher in cancer in comparison to normal human tissue samples. In addition, in benign tumors of the ovary and colon, low-mannose type glycan signals were significantly lower than in the corresponding malignant tumors, indicating that low-mannose type glycans are specifically associated with malignancy in human cancers.
O-glycans were identified based on their major indicative mass spectrometric glycan signals 771, 917, 899, 1038, and 1329 corresponding to SA0-2Hex2HexNAc2dHex0-1 glycan compositions, as described in to the present invention. In the present analyses the preferred indicative signals of the glycan structure group were m/z 771, 917, and 899 due to their abundance in mass spectrometric glycan profiles; 771 especially in case of non-fucosylated neutral glycans; 917 especially in case of detecting fucosylated neutral glycans; and 899 especially in case of detecting sialylated glycans; 771 and 899 as preferred sensitive indicators of the group; and/or 771 as a preferred sensitive single indicator of the group.
Using these criteria, O-glycans were detected to be cancer-associated and expressed in malignant tumors in following cancer types: lung cancer/carcinoma, both small cell lung adenocarcinoma and non-small cell lung adenocarcinoma, and lung carcinoma liver metastases; breast cancer; ductale type breast adenocarcinoma and lymph node metastases thereof; lobulare type breast adenocarcinoma and lymph node metastases thereof; ovarian cystadenocarcinoma; colon cancer/carcinoma, and carcinoma adenomatosum; kidney cancer/carcinoma, and kidney hypernephroma; gastric cancer/carcinoma; larynx cancer/carcinoma; and pancreas cancer/carcinoma; melanoma and liver metastases thereof; gall bladder cancer/carcinoma, and liver metastases thereof. Further the expression of the glycans was detected in abundant amounts in salivary gland cancer/carcinoma, and skin metastases thereof.
In all these cancer types, neutral O-glycans were the most abundant cancer-associated structures, but both fucosylated and sialylated O-glycans were in all analyses detected to be expressed simultaneously; in most cases also their levels were found to be higher in cancer in comparison to normal human tissue samples. In addition, in benign tumors of the ovary and colon, O-glycan signals were significantly lower than in the corresponding malignant tumors, indicating that O-glycans are specifically associated with malignancy in human cancers.
Non-reducing terminal HexNAc glycan expression was detected along with one or both of the above mentioned glycan groups in the following analyzed cancer types: lung cancer, both small cell lung adenocarcinoma and non-small cell lung adenocarcinoma, and lung carcinoma liver metastases; breast cancer; ductale type breast adenocarcinoma and lymph node metastases thereof; lobulare type breast adenocarcinoma and lymph node metastases thereof; ovarian cystadenocarcinoma; colon cancer/carcinoma, carcinoma adenomatosum, and liver metastases thereof; kidney cancer/carcinoma, and kidney hypernephroma; gastric cancer/carcinoma, and lymph node metastases thereof, liver cancer/carcinoma; larynx cancer/carcinoma; pancreas cancer/carcinoma; melanoma and liver metastases thereof; gall bladder cancer/carcinoma, and liver metastases thereof; and lymph node cancer/carcinoma (lymphoma). Further the expression of the glycans was detected in abundant amounts in salivary gland cancer/carcinoma, and skin metastases thereof. The detection was based on their major indicative mass spectrometric glycan signals 1485 and/or 1850 corresponding to Hex3HexNAc4dHex1 and Hex4HexNAc5dHex1 glycan compositions, respectively, as described in the present invention.
Protein-linked glycans were isolated from formalin-fixed and paraffin-embededed tissue sections of ductale-type breast cancer and lymph node metastases derived therefrom, and analyzed as described in the preceding Examples.
The results are described as protein-linked glycan profiles of the primary breast cancer tumors and normal brease tissues, as well as metastases and normal lymph node tissues from the same patients in
Further, the acidic glycan profiles were quantitatively analyzed by composition classification into glycan structural features, as described in Table 20. Importantly, the classification analysis revealed that especially sulphation and/or phosphorylation and complex fucosylation of acidic glycans were associated with the present tumors as well as the corresponding lymph node metastases in comparison with either of the primary and the secondary normal tissues. Major glycan signals expressing these features were NeuAc1Hex2HexNAc2SP1 and NeuAc1Hex3HexNAc3SP2 wherein SP is either sulphate or phosphate ester, and NeuAc1Hex5HexNAc4dHex2 and NeuAc1Hex5HexNAc4dHex3, respectively.
A series of malignant primary tumors of the lung, colon, skin, and gall bladder, and corresponding liver metastases were analyzed in order to identify glycans associated with malignancy and metastasis formation, especially liver metastases. Protein-linked glycans were isolated from formalin-fixed and paraffin-embedded tissue sections or protein fractions of tumor tissues and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples.
By comparing the quantitative expression of indicative glycan signals of 1) liver metastases and normal liver tissue, and 2) malignant primary tumor and corresponding normal tissue, malignancy and metastasis-associated glycan signals and glycan structure groups were identified as described in the preceding Examples.
Glycan groups that were associated with liver metastases were 1) low-mannose type glycans, 2) non-reducing terminal HexNAc glycans, especially non-reducing terminal GlcNAc glycans, and 3) neutral and acidic O-glycans, especially neutral and sialylated O-glycans. Preferential glycan signals associated with malignant liver metastases included 933, 1079, and 1095 (1); 1485 (2); and 771, 917, 899, 1038, and 1329 (3), especially 771 and 899 (3); wherein the numbering (1-3) refers to the identified metastasis-associated glycan groups. The glycans were present in the primary tumor in elevated amounts in comparison to corresponding normal tissue, and significantly, further elevated in comparison to normal liver tissue i.e. enriched in the liver metastases. The results indicate that the present glycans identified in metastases are associated with metastasis formation, more specifically liver metastasis formation. Specifically, the present results indicate that low-mannose type, non-reducing terminal GlcNAc, and O-glycans are associated with malignant metastasis formation, more specifically liver metastasis formation; most specifically, low-mannose type and terminal GlcNAc glycans are associated with liver metastases.
A series of malignant primary tumors of the stomach and breast, and corresponding lymph node metastases were analyzed in order to identify glycans associated with malignancy and metastasis formation, especially lymph node metastases. Protein-linked glycans were isolated from formalin-fixed and paraffin-embedded tissue sections or protein fractions of tumor tissues and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples.
By comparing the quantitative expression of indicative glycan signals of 1) lymph node metastases and normal lymph node tissue, and 2) malignant primary tumor and corresponding normal tissue, malignancy and metastasis-associated glycan signals and glycan structure groups were identified as described in the preceding Examples.
Glycan groups that were associated with lymph node metastases were 1) low-mannose type glycans and 2) neutral and acidic O-glycans, especially neutral and sialylated O-glycans. Preferential glycan signals associated with malignant liver metastases included 933, 1079, and 1095 (1); and 771, 917, 899, 1038, and 1329 (2), especially 771 and 899 (2); wherein the numbering (1-2) refers to the identified metastasis-associated glycan groups. The glycans were present in the primary tumor in elevated amounts in comparison to corresponding normal tissue, and significantly, further elevated in comparison to normal liver tissue i.e. enriched in the lymph node metastases. The results indicate that the present glycans identified in metastases are associated with metastasis formation, more specifically lymph node metastasis formation. Specifically, the present results indicate that low-mannose type and O-glycans are associated with malignant metastasis formation, more specifically lymph node metastasis formation; most specifically, low-mannose type and sialylated O-glycans are associated with lymph node metastases.
A sample array from different cancer patients with gastric cancer revealed an interesting phenomenon: lymph node metastases of primary gastric cancers transformed the lymph node glycan profiles with glycan signals originating from primary tissue specific glycosylation. For example, in lymph node metastases small O-glycan type signals with either blood group O, A, or B characteristics were detected. These signals in the primary tissue location included in both O, A, and B patients: 1063 (Hex2HexNAc2dHex2) i.e. increased amounts of dHex in glycans with 3 or less HexNAc residues and more clearly in glycans with 2 or less HexNAc residues; specifically in A patients: 1120 (Hex2HexNAc3dHex1) and 1266 (Hex2HexNAc3dHex2) i.e. increased amounts of HexNAc and dHex in glycans with 3 or less Hex residues and more clearly in glycans with 2 or less Hex residues; and specifically in B patients: 714 (Hex2HexNAcldHexl) and/or 1225 (Hex3HexNAc2dHex2) i.e. increased amounts of Hex and dHex in glycans with 3 or less HexNAc residues and more clearly in glycans with 2 or less HexNAc residues. The present results demonstrated that the origin of the primary tumor is reflected in the glycan profile of the metastasis, and that the metastatic cancer cells carry with them abnormal glycan antigens to the site of the metastasis.
Number | Date | Country | Kind |
---|---|---|---|
20055417 | Jul 2005 | FI | national |
Number | Date | Country | |
---|---|---|---|
Parent | 11989031 | Jun 2008 | US |
Child | 14523154 | US |