The invention describes reagents and methods for specific binders to glycan structures of stem cells. Furthermore the invention is directed to screening of additional binding reagents against specific glycan epitopes on the surfaces of the stem cells. The preferred binders of the glycans structures includes proteins such as enzymes, lectins and antibodies.
Stem Cells
Stem cells are undifferentiated cells which can give rise to a succession of mature functional cells. For example, a hematopoictic stem cell may give rise to any of the different types of terminally differentiated blood cells. Embryonic stem (ES) cells are derived from the embryo and are pluripotent, thus possessing the capability of developing into any organ or tissue type or, at least potentially, into a complete embryo.
The first evidence for the existence of stem cells came from studies of embryonic carcinoma (EC) cells, the undifferentiated stem cells of teratocarcinomas, which are tumors derived from germ cells. These cells were found to be pluripotent and immortal, but possess limited developmental potential and abnormal karyotypes (Rossant and Papaioannou, Cell Differ 15,155-161, 1984). ES cells, on the other hand, are thought to retain greater developmental potential because they are derived from normal embryonic cells, without the selective pressures of the teratocarcinoma environment.
Pluripotent embryonic stem cells have traditionally been derived principally from two embryonic sources. One type can be isolated in culture from cells of the inner cell mass of a pre-implantation embryo and are termed embryonic stem (ES) cells (Evans and Kaufman, Nature 292,154-156, 1981; U.S. Pat. No. 6,200,806). A second type of pluripotent stem cell can be isolated from primordial germ cells (PGCS) in the mesenteric or genital ridges of embryos and has been termed embryonic germ cell (EG) (U.S. Pat. No. 5,453,357, U.S. Pat. No. 6,245,566). Both human ES and EG cells are pluripotent. This has been shown by differentiating cells in vitro and by injecting human cells into immunocompromised (SCUM) mice and analyzing resulting teratomas (U.S. Pat. No.6,200,806). The term “stem cell” as used herein means stem cells including embryonic stem cells or embryonic type stem cells and stem cells diffentiated thereof to more tissue specific stem cells, adults stem cells including mesenchymal stem cells and blood stem cells such as stem cells obtained from bone marrow or cord blood.
The present invention provides novel markers and target structures and binders to these for especially embryonic and adult stem cells, when these cells are not heamtopoietic stem cells. From hematopoietic CD34+ cells certain terminal structures such as terminal sialylated type two N-acetyllactosamines such as NeuNAcα3Galβ4GlcNAc (Magnani J. U.S. Pat. No. 6,362,010 ) has been suggested and there is indications for low expression of Slex type structures NeuNAcα3Galβ4(Fucα3)GlcNAc (Xia L et al Blood (2004) 104 (10) 3091-6). The invention is also directed to the NeuNAcα3Galβ4GlcNAc non-polylactosamine variants separately from specific characteristic O-glycans and N-glycans. The invention further provides novel markers for CD133+ cells and novel hematopoietic stem cell markers according to the invention, especially when the structures does not include NeuNAcα3Galβ4(Fucα3)0-1GlcNAc. Preferably the hematopoictic stem cell structures are non-sialylated, fucosylated structures Galβ1-3-structures according to the invention and even more preferably type 1 N-acetyllactosamine structures Galβ3GlcNAc or separately preferred Galβ3GalNAc based structures.
Human ES, EG and EC cells, as well as primate ES cells, express alkaline phosphatase, the stage-specific embryonic antigens SSEA-3 and SSEA-4, and surface proteoglycans that are recognized by the TRA-1-60; and TRA-1-81 antibodies. All these markers typically stain these cells, but are not entirely specific to stem cells, and thus cannot be used to isolate stem cells from organs or peripheral blood.
The SSEA-3 and SSEA4 structures are known as galactosylgloboside and sialylgalactosylgloboside, which are among the few suggested structures on embryonal stem cells, though the nature of the structures in not ambigious. An antibody called K21 has been suggested to bind a sulfated polysaccharide on embryonal carcinoma cells (Badcock G et al Cancer Res (1999) 4715-19. Due to cell type, species, tissue and other specificity aspects of glycosylation (Furukawa, K., and Kobata, A. (1992) Curr. Opin. Struct. Biol. 3, 554-559, Gagncux, and Varki, A. (1999) Glycobiology 9, 747-755;Gawlitzek, M. et al. (1995), J. Biotechnol. 42, 117-131; Goelz, S., Kumar, R., Potvin, B., Sundaram, S., Brickelmaier, M., and Stanley, P. (1994) J. Biol. Chem. 269,1033-1040; Kobata, A (1992) Eur. J. Biochem. 209 (2) 483-501.) This result does not indicate the presence of the structure on native embryonal stem cells. The present invention is directed to human stem cells.
Some low specificity plant lectin reagents have been reported in binding of embryonal stem cell like materials. Venable et al 2005, (Dev. Biol. 5:15) measured lectins the binding of SSEA-4 antibod positive subpopulation of embryonal stem cells. This approach suffers obvious problems. It does not tell the expression of the structures in native non-selected embryonal strem cells. The SSEA-4 was chosen select especially pluripotent stem cells. The scientists of the same Bresagen company have further revealed that actual role of SSEA-4 with the specific stem cell lines is not relevant for the pluripotency.
The work does not reveal: 1) The actual amount of molecules binding to the lectins or 2) presence of any molecules due to defects caused by the cell sorting and experimental problems such as tyypsination of the cells. It is really alerting that the cells were trypsinized, which removes protein and then enriched by possible glycolipid binding SSEA-4 antibody and secondary antimouse antibody, fixed with paraformaldehyde without removing the antibodies, and labelled by simultaneous with lectin and the same antibody and then the observed glycan profile is the similar as revealed by lectin analysis by same scientist for antibody glycosylation (M. Pierce US2005 ) or 3) the actual structures, which are bound by the lectins. To reveal the possible residual binding to the cells would require analysis of of the glycosylations of the antibodies used (sources and lots not revealed).
The purity of the SSEA-4 positive cells was reported to be 98-99%, which is unusually high. The quantitation of the binding is not clear as
It appears that skilled artisan would consider the results of Venable et al such convenient colocalization of SSEA-4 and the lectin binding by binding of the lectins to the anti-SSEA-4 antibody. It appears that the more rare binding would reflect lower proportion of the terminal epitope per antibody molecule leading to lower density of the labellable antibodies. It is also realized that the non-controlled cell culture process with animal derived material would lead to contamination of the cells by N-glycolyl-neuraminic acid, which may be recognized by anti-mouse antibodies used as secondary antibody (not defined what kind of anti-mouse) used in purification and analysis of purity, which could lead to convieniently high cell purity. The work is directed only to the “pluripotent” embryonal stem cells associated with SSEA-4 labelling and not to differentiated variants thereof as the present invention. The results indicated possible binding (likely on the antibodies) to certain potential monosaccharide epitopes (6th page, Table 1, and column 2 ) such Gal and Galactosamine for RCA (ricin, inhitable by Gal or lactose), GlcNAc for TL (tomato lcctin), Man or Glc for ConA, Sialic acid/Sialic acid α6GalNAc for SNA, Manα for HHL; lectins with partial binding not correlating with SSEA4: GalNAc/GalNAcβ4Gal(in text) WFA, Gal for PNA, and Sialic acid/Sialic acid α6GalNAc for SNA; and lectins associated by part of SSEA-4 cells were indicated to bind Gal by PHA-L and PHA-E, GalNAc by VVA and Fuc by UEA, and Gal by MAA (inhibited by lactose). UEA binding was discussed with reference as endothelial marker and O-linked fucose which is directly bound to Ser (Thr) on protein. The background has indicated a H type 2 specificity for the endothelial UEA receptor. The specificities of the lectins are somewhat unusual, but the product codes or isolectin numbers/names of the lectins were not indicated (except for PHA-E and PHA-L) and it is known that plants contain numerous isolectins with varying specificities.
The present invention revealed specific structures by mass spectrometric profiling, NMR spectrometry and binding reagents including glycan modifying enzymes. The lectins are in general low specificity molecules. The present invention revealed binding epitopes larger than the previously described monosaccharide epitopes. The larger epitopes allowed us to design more specific binding substances with typical binding specificities of at least disaccharides. The invention also revealed lectin reagents with specified with useful specificities for analysis of native embryonal stem cells without selection against an uncontrolled marker and/or coating with an antibody or two from different species. Clearly the binding to native embryonal stem cells is different as the binding with MAA was clear to most of cells, there was differences between cell line so that RCA, LTA and UEA was clearly binding a hESC cell line but not another.
Methods for separation and use of stem cells are known in the art.
Characterizations and isolation of hematopoietic stem cells are reported in U.S. Pat. No. 5,061,620. The hematopoietic CD34 marker is the most common marker known to identify specifically blood stem cells, and CD34 antibodies are used to isolate stem cells from blood for transplantation purposes. However, CD34+ cells can differentiate only or mainly to blood cells and differ from embryonic stem cells which have the capability of developing into different body cells. Moreover, expansion of CD34+ cells is limited as compared to embryonic stem cells which are immortal. U.S. Pat. No. 5,677,136 discloses a method for obtaining human hematopoietic stem cells by enrichment for stem cells using an antibody which is specific for the CD59 stem cell marker. The CD59 epitope is highly accessible on stem cells and less accessible or absent on mature cells. U.S. Pat. No. 6,127,135 provides an antibody specific for a unique cell marker (EM10) that is expressed on stem cells, and methods of determining hematopoictic stem cell content in a sample of hematopoietic cells. These disclosures are specific for hematopoietic cells and the markers used for selection are not absolutely absent on more mature cells.
There have been great efforts toward isolating pluripotent or multipotent stem cells, in earlier differentiation stages than hematopoictic stem cells, in substantially pure or pure form for diagnosis, replacement treatment and gene therapy purposes. Stem cells are important targets for gene therapy, where the inserted genes are intended to promote the health of the individual into whom the stem cells are transplanted. In addition, the ability to isolate stem cells may serve in the treatment of lymphomas and leukemias, as well as other neoplastic conditions where the stem cells are purified from tumor cells in the bone marrow or peripheral blood, and reinfused into a patient after myelosuppressive or mycloablative chemotherapy.
Multiple adult stem cell populations have been discovered from various adult tissues. In addition to hematopoictic stem cells, neural stem cells were identified in adult mammalian central nervous system (Ourednik et al. Clin. Genet. 56, 267, 1999). Adult stem cells have also been identified from epithelial and adipose tissues (Zuk et al. Tissue Engineering 7, 211, 2001). Mesenchymal stem cells (MSCs) have been cultured from many sources, including liver and pancreas (Hu et al. J. Lab Clin Med. 141, 342-349, 2003). Recent studies have demonstrated that certain somatic stem cells appear to have the ability to differentiate into cells of a completely different lineage (Pfendler K C and Kawase E, Obstet Gynecol Surv 58, 197-208, 2003). Monocyte derived (Zhao et al. Proc. Natl. Acad. Sci. USA 100, 2426-2431, 2003) and mesodermal derived (Schwartz et al. J. Clin. Invest 109, 1291-1301, 2002) cells that possess some multipotent characteristics were identified. The presence of multipotent “embryonic-like” progenitor cells in blood was suggested also by in-vivo experiments following bone marrow transplantations (Zhao et al. Brain Res Protoc 11, 3845, 2003). However, such multipotent “embryonic-like” stem cells cannot be identified and isolated using the known markers.
The possibility of recovering fetal cells from the maternal circulation has generated interest as a possible means, non-invasive to the fetus, of diagnosing fetal anomalies (Simpson and Elias, J. Am. Med. Assoc. 270, 2357-2361, 1993). Prenatal diagnosis is carried out widely in hospitals throughout the world. Existing procedures such as fetal, hepatic or chorionic biopsy for diagnosis of chromosomal disorders including Down's syndrome, as well as single gene defects including cystic fibrosis are very invasive and carry a considerable risk to the fetus. Amniocentesis, for example, involves a needle being inserted into the womb to collect cells from the embryonic tissue or amniotic fluid. The test, which can detect Down's syndrome and other chromosomal abnormalities, carries a miscarriage risk estimated at 1%. Fetal therapy is in its very early stages and the possibility of early tests for a wide range of disorders would undoubtedly greatly increase the pace of research in this area. Thus, relatively non-invasive methods of prenatal diagnosis are an attractive alternative to the very invasive existing procedures. A method based on maternal blood should make earlier and easier diagnosis more widely available in the first trimester, increasing options to parents and obstetricians and allowing for the eventual development of specific fetal therapy.
The present invention provides methods of identifying, characterizing and separating stem cells having characteristics of embryonic stem (ES) cells for diagnostic, therapy and tissue engineering. In particular, the present invention provides methods of identifying, selecting and separating embryonic stem cells or fetal cells from maternal blood and to reagents for use in prenatal diagnosis and tissue engineering methods. The present invention provides for the first time a specific marker/binder/binding agent that can be used for identification, separation and characterization of valuable stem cells from tissues and organs, overcoming the ethical and logistical difficulties in the currently available methods for obtaining embryonic stem cells.
The present invention overcomes the limitations of known binders/markers for identification and separation of embryonic or fetal stem cells by disclosing a very specific type of marker/binder, which does not react with differentiated somatic maternal cell types. In other aspect of the invention, a specific binder/marker/binding agent is provided which does not react, i.e. is not expressed on feeder cells, thus enabling positive selection of feeder cells and negative selection of stem cells.
By way of exemplification, the binder to Formula (I) are now disclosed as useful for identifying, selecting and isolating pluripotent or multipotent stem cells including embryonic stem cells, which have the capability of differentiating into varied cell lineages.
According to one aspect of the present invention a novel method for identifying pluripotent or multipotent stem cells in peripheral blood and other organs is disclosed. According to this aspect an embryonic stem cell binder/marker is selected based on its selective expression in stem cells and/or germ stem cells and its absence in differentiated somatic cells and/or feeder cells. Thus, glycan structures expressed in stem cells are used according to the present invention as selective binders/markers for isolation of pluripotent or multipotent stem cells from blood, tissue and organs. Preferably the blood cells and tissue samples are of mammalian origin, more preferably human origin.
According to a specific embodiment the present invention provides a method for identifying a selective embryonic stem cell binder/marker comprising the steps of:
A method for identifying a selective stem cell binder to a glycan structure of Formula (I) which comprises:
i. selecting a glycan structure exhibiting specific expression in/on stem cells and absence of expression in/on feeder cells and/or differentiated somatic cells; ii. and confirming the binding of binder to the glycan structure in/on stem cells.
By way of a non-limiting example, adult, mesenchymal, embryonal type, or hematopoictic stem cells selected using the binder may be used in regenerating the hematopoictic or ther tissue system of a host deficient in any class of stem cells. A host that is diseased can be treated by removal of bone marrow, isolation of stem cells and treatment with drugs or irradiation prior to re-engraftment of stem cells. The novel markers of the present invention may be used for identifying and isolating various stem cells; detecting and evaluating growth factors relevant to stem cell self-regeneration; the development of stem cell lineages; and assaying for factors associated with stem cell development.
The present invention is directed to analysis of broad glycan mixtures from stem cell samples by specific binder (binding) molecules.
The present invention is specifically directed to glycomes of stem cells according to the invention comprising glycan material with monosaccharide composition for each of glycan mass components according to the Formula I:
R1Hexβz{R3}n1HexNAcXyR2 (I),
wherein X is nothing or a glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein
n is 0 or 1;
Hex is Gal or Man or GlcA;
HexNAc is GlcNAc or GalNAc;
y is anomeric linkage structure α and/or β or a linkage from a derivatized anomeric carbon,
z is linkage position 3 or 4, with the provision that when z is 4, then HexNAc is GlcNAc and
Hex is Man or Hex is Gal or Hex is GlcA, and
when z is 3, then Hex is Glcα or Gal and HexNAc is GlcNAc or GalNAc;
R1 indicates 1-4 natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, a chemical reducing end derivative or a natural asparagine linked N-glycoside derivative including asparagines, N-glycoside aminoacids and/or peptides derived from proteins, or a natural serine or threonine linked O-glycoside derivative including asparagines, N-glycoside aminoacids and/or peptides derived from proteins;
R3 is nothing or a branching structure representing GlcNAcβ6 or an oligosaccharide with GlcNAcβ6 at its reducing end linked to GalNAc, when HexNAc is GalNAc, or R3 is nothing or Fucα4, when Hex is Gal, HexNAc is GlcNAc, and z is 3, or R3 is nothing or Fucα3, when
z is 4.
Typical glycomes comprise of subgroups of glycans, including N-glycans, O-glycans, glycolipid glycans, and neutral and acidic subglycomes.
The invention is directed to diagnosis of clinical state of stem cell samples, based on analysis of glycans present in the samples. The invention is especially directed to diagnosing cancer and the clinical state of cancer, preferentially to differentiation between stem cells and cancerous cells and detection of cancerous changes in stem cell lines and preparations.
The invention is further directed to structural analysis of glycan mixtures present in stem cell samples.
Glycomes—Novel Glycan Mixtures From Stem Cells
The present invention revealed novel glycans of different sizes from stem cells. The stem cells contain glycans ranging from small oligosaccharides to large complex structures. The analysis reveals compositions with substantial amounts of numerous components and structural types. Previously the total glycomes from these rare materials has not been available and nature of the releasable glycan mixtures, the glycomes, of stem cells has been unknown.
The invention revealed that the glycan structures on cell surfaces vary between the various populations of the early human cells, the preferred target cell populations according to the invention. It was revealed that the cell populations contained specifically increased “reporter structures”.
The glycan structures on cell surfaces in general have been known to have numerous biological roles. Thus the knowledge about exact glycan mixtures from cell surfaces is important for knowledge about the status of cells. The invention revealed that multiple conditions affect the cells and cause changes in their glycomes. The present invention revealed novel glycome components and structures from human stem cells. The invention revealed especially specific terminal Glycan epitopes, which can be analyzed by specific binder molecules.
Recognition of Structures From Glycome Materials and On Cell Surfaces By Binding Methods
The present invention revealed that beside the physicochemical analysis by NMR and/or mass spectrometry several methods are useful for the analysis of the structures. The invention is especially directed to a method:
The peptides and proteins are preferably recombinant proteins or corresponding carbohydrate recognition domains derived thereof, when the proteins are selected from the group of monoclonal antibody, glycosidase, glycosyl transferring enzyme, plant lectin, animal lectin or a peptide mimetic thereof, and wherein the binder may include a detectable label structure.
The genus of enzymes in carbohydrate recognition is continuous to the genus of lectins (carbohydrate binding proteins without enzymatic activity).
a) Native glycosyltransferases (Rauvala et al. (1983) PNAS (USA) 3991-3995) and glycosidases (Rauvala and Hakomori (1981) J. Cell Biol. 88, 149-159) have lectin activities.
b) The carbohydrate binding enzymes can be modified to lectins by mutating the catalytic amino acid residues (see WO9842864; Aalto J. et al. Glycoconjugate J. (2001, 18(10); 751-8; Mega and Hase (1994) BBA 1200 (3) 331-3).
c) Natural lectins, which are structurally homologous to glycosidases are also known indicating the continuity of the genus enzymes and lectins (Sun, Y-J. et al. J. Biol. Chem. (2001) 276 (20) 17507-14).
The genus of the antibodies as carbohydrate binding proteins without enzymatic activity is also very close to the concept of lectins, but antibodies are usually not classified as lectins.
Obviousness of the Peptide Concept and Continuity With the Carbohydrate Binding Protein Concept
It is further realized that proteins consist of peptide chains and thus the recognition of carbohydrates by peptides is obvious. E.g. it is known in the art that peptides derived from active sites of carbohydrate binding proteins can recognize carbohydrates (e.g. Geng J-G. et al (1992) J. Biol. Chem. 19846-53).
As described above antibody fragment are included in description and genetically engineered variants of the binding proteins. The obvious genetically engineered variants would included truncated or fragment peptides of the enzymes, antibodies and lectins.
Revealing Cell or Differentiation and Individual Specific Terminal Variants of Structures
The invention is directed use the glycomics profiling methods for the revealing structural features with on-off changes as markers of specific differentiation stage or quantitative difference based on quantitative comparison of glycomes. The individual specific variants are based on genetic variations of glycosyltransferases and/or other components of the glycosylation machinery preventing or causing synthesis of individual specific structure.
Terminal Structural Epitopes
We have previously revealed glycome compositions of human glycomes, here we provide structural terminal epitopes useful for the characterization of stem cell glycomes, especially by specific binders.
The examples of characteristic altering terminal structures includes expression of competing terminal epitopes created as modification of key homologous core Galβ-epitopes, with either the same monosaccharides with difference in linkage position Galβ3GlcNAc, and analogue with either the same monosaccharides with difference in linkage position Galβ4GlcNAc; or the with the same linkage but 4-position epimeric backbone Galβ3GalNAc. These can be presented by specific core structures modifying the biological recognition and function of the structures. Another common feature is that the similar Galβ-structures are expressed both as protein linked (O- and N-glycan) and lipid linked (glycolipid structures). As an alternative for α2-fucosylation the terminal Gal may comprise NAc group on the same 2 position as the fucose. This leads to homologous epitopes GalNAcβ4GlcNAc and yet related GalNAcβ3Gal-structure on characteristic special glycolipid according to the invention.
The invention is directed to novel terminal disaccharide and derivative epitopes from human stem cells, preferably from human embryonal stem cells or adult stem cells, when these are not hematopoictic stem cells, which are preferably mesenchymal stem cells. It should realized that glycosylations are species, cell and tissue specific and results from cancer cells usually differ dramatically from normal cells, thus the vast and varying glycosylation data obtained from human embryonal carcinomas are not actually relevant or obvious to human embryonal stem cells (unless accidentally appeared similar). Additionally the exact differentiation level of teratocarcinomas cannot be known, so comparison of terminal epitope under specific modification machinery cannot be known. The terminal structures by specific binding molecules including glycosidases and antibodies and chemical analysis of the structures.
The present invention reveals group of terminal Gal(NAc)β1-3/4Hex(NAc) structures, which carry similar modifications by specific fucosylation/NAc-modification, and sialylation on corresponding positions of the terminal disaccharide epitopes. It is realized that the terminal structures are regulated by genetically controlled homologous family of fucosyltransferases and sialyltransferases. The regulation creates a characteristic structural patterns for communication between cells and recognition by other specific binder to be used for analysis of the cells. The key epitopes are presented in the TABLE 37. The data reveals characteristic patterns of the terminal epitopes for each types of cells, such as for example expression on hESC-cells generally much Fucα-structures such as Fucα2-structures on type 1 lactosamine (Galβ3GlcNAc), similarily β3-linked core I Galβ3GlcNAcα, and type 4 structure which is present on specific type of glycolipids and expression of α3-fucosylated structures, while α6-sialic on type II N-acetylalactosamine appear on N-glycans of embryoid bodies and st3 embryonal stem cells. E.g. terminal type lactosamine and poly-lactosamines differentiate mesenchymal stem cells from other types. The terminal Galb-information is preferably combined with information about
The invention is directed especially to high specificity binding molecules such as monoclonal antibodies for the recognition of the structures.
The structures can be presented by Formula T1. the formula describes first monosaccharide residue on left, which is a β-D-galactopyranosyl structure linked to either 3 or 4-position of the α- or β-D-(2-deoxy-2-acetamido)galactopyranosyl structure, when R5 is OH, or β-D-(2-deoxy-2-acetamido)glucopyranosyl, when R4 comprises O—. The unspecified stereochemistry of the reducing end in formulas T1 and T2 is indicated additionally (in claims) with curved line. The sialic acid residues can be linked to 3 or 6-position of Gal or 6-position of GlcNAc and fucose residues to position 2 of Gal or 3- or 4-position of GlcNAc or position 3 of Glc.
The invention is directed to Galactosyl-globoside type structures comprising terminal Fucα2-revealed as novel terminal epitope Fucα2Galβ3GalNAcβ or Galβ3GalNAcβGalα3-comprising isoglobotructures revealed from the embryonal type cells.
wherein
X is linkage position
R1, R2, and R6 are OH or glycosidically linked monosaccharide residue Sialic acid, preferably Neu5Acα2 or Neu5Gc α2, most preferably Neu5Acα2 or
R3, is OH or glycosidically linked monosaccharide residue Fucα1 (L-fucose) or N-acetyl (N-acetamido, NCOCH3);
R4, is H, OH or glycosidically linked monosaccharide residue Fucα1 (L-fucose),
R5 is OH, when R4 is H, and R5 is H, when R4 is not H;
R7 is N-acetyl or OH
X is natural oligosaccharide backbone structure from the cells, preferably N-glycan, O-glycan or glycolipid structure; or X is nothing, when n is 0,
Y is linker group preferably oxygen for O-glycans and O-linked terminal oligosaccharides and glycolipids and N for N-glycans or nothing when n is 0;
Z is the carrier structure, preferably natural carrier produced by the cells, such as protein or lipid, which is preferably a ceramide or branched glycan core structure on the carrier or H;
The arch indicates that the linkage from the galactopyranosyl is either to position 3 or to position 4 of the residue on the left and that the R4 structure is in the other position 4 or 3;
n is an integer 0 or 1, and m is an integer from 1 to 1000, preferably 1 to 100, and most preferably 1 to 10 (the number of the glycans on the carrier),
With the provisions that one of R2 and R3 is OH or R3 is N-acetyl,
R6 is OH, when the first residue on left is linked to position 4 of the residue on right:
X is not Galα4Galβ4Glc, (the core structure of SSEA-3 or 4) or R3 is Fucosyl
R7 is preferably N-acetyl, when the first residue on left is linked to position 3 of the residue on right:
Preferred terminal β3-linked subgroup is represented
by Formula T2 indicating the situation, when the first residue on the left is linked to the 3 position with backbone structures Gal(NAc)β3Gal/GlcNAc.
Wherein the variables including R1 to R7
are as described for T1
Preferred terminal β4-linked subgroup is represented by the Formula 3
Wherein the variables including R1 to R4 and R7
are as described for T1 with the provision that
R4, is OH or glycosidically linked monosaccharide residue Fucα1 (L-fucose),
Alternatively the epitope of the terminal structure can be represented by Formulas T4 and T5 Core Galepitopes formula T4:
Galβ1-xHex(NAc)p,
x is linkage position 3 or 4,
and Hex is Gal or Glc
with provision
p is 0 or 1
when x is linkage position 3, p is 1 and HexNAc is GlcNAc or GalNAc,
and when x is linkage position 4, Hex is Glc.
The core Galβ1-3/4 epitope is optionally substituted to hydroxyl
by one or two structures SAα or Fucα, preferably selected from the group
Gal linked SAα3 or SAα6 or Fucα2, and
Glc linked Fucα3 or GlcNAc linked Fucα3/4.
[Mα]mGalβ1-x[Nα]nHex(NAc)p, Formula T5
wherein m, n and p are integers 0, or 1, independently
Hex is Gal or Glc,
X is linkage position
M and N are monosaccharide residues being either
SA which is Sialic acid linked to 3-position of Gal or/and 6-position of HexNAc and/or
Fuc (L-fucose) residue linked to 2-position of Gal
and/or 3 or 4 position of HexNAc, when Gal is linked to the other position (4 or 3),
and HexNAc is GlcNAc, or 3-position of Glc when Gal is linked to the other position (3),
with the provision that sum of m and n is 2
preferably m and n are 0 or 1, independently,
The exact structural details are essential for optimal recognition by specific binding molecules designed for the analysis and/or manipulation of the cells.
The terminal key Galβ-epitopes are modified by the same modification monosaccharides NeuX (X is 5 position modification Ac or Gc of sialic acid) or Fuc, with the same linkage type alfa( modifying the same hydroxyl-positions in both structures.
NeuXα3, Fucα2 on the terminal Galβ of all the epitopes and
NeuXα6 modifying the terminal Galβ of Galβ4GlcNAc, or HexNAc, when linkage is 6 competing
or Fucα modifying the free axial primary hydroxyl left in GlcNAc (there is no free axial hydroxyl in GalNAc-residue).
The preferred structures can be divided to preferred Galβ1-3 structures analogously to T2,
[Mα]mGalβ1-3[Nα]nHexNAc, Formula T6:
Wherein the variables are as described for T5.
The preferred structures can be divided to preferred Galβ1-4 structures analogously to T4,
[Mα]mGalβ1-4[Nα]nGlc(NAc)p, Formula T7:
Wherein the variables are as described for T5.
Fucosylated and Non-Modified Structures
The invention is further directed to the core disaccharide epitope structures when the structures are not modified by sialic acid (none of the R-groups according to the Formulas T1-T3 or M or N in formulas T4-T7 is not sialic acid.
The invention is in a preferred embodiment directed to structures, which comprise at least one fucose residue according to the invention. These structures are novel specific fucosylated terminal epitopes, useful for the analysis of stem cells according to the invention. Preferably native stem cells are analyzed.
The preferred fucosylated structures include novel α3/4fucosylated markers of human stem cells such as (SAα3)0or1Galβ3/4(Fucα4/3)GlcNAc including Lewis x and and sialylated variants thereof.
Among the structures comprising terminal Fucα1-2 the invention revealed especially useful novel marker structures comprising Fucα2Galβ3GalNAcα/β and
Fucα2Galβ3(Fucα4)0or1GlcNAcβ, these were found useful studying embryonal stem cells. A especially preferred antibody/binder group among this group is antibodies specific for Fucα2Galβ3GlcNAcβ, preferred for high stem cell specificity. Another preferred structural group includes Fucα2Gal comprising glycolipids revealed to form specific structural group, especially interesting structure is globo-H-type structure and glycolipids with terminal Fucα2Galβ3GalNAcβ, preferred with interesting biosynthetic context to earlier speculated stem cell markers.
Among the antibodies recognizing Fucα2Galβ4GlcNAcβ substantial variation in binding was revealed likely based on the carrier structures, the invention is especially directed to antibodies recognizing this type of structures, when the specificity of the antibody is similar to the ones binding to the embryonal stem cells as shown in Example 18 with fucose recognizing antibodies. The invention is preferably directed to antibodies recognizing Fucα2Galβ4GlcNAcβ on N-glycans, revealed as common structural type in terminal epitope Table 37. In a separate embodiment the antibody of the non-binding clone is directed to the recognition of the feeder cells.
The preferred non-modified structures includes Galβ4Glc, Galβ3GlcNAc, Galβ3GalNAc, Galβ4GlcNAc, Galβ3GlcNAcβ, Galβ3GalNAcβ/α, and Galβ4GlcNAcβ. These are preferred novel core markers characteristics for the various stem cells. The structure Galβ3GlcNAc is especially preferred as novel marker observable in hESC cells. Preferably the structure is carried by a glycolipid core structure according to the invention or it is present on an O-glycan. The non-modified markers are preferred for the use in combination with at least one fucosylated or/and sialylated structure for analysis of cell status.
Additional preferred non-modified structures includes GalNAcβ-structures includes terminal LacdiNAc, GalNAcβ4GlcNAc, preferred on N-glycans and GalNAcβ3Gal GalNAcβ3Gal present in globoseries glycolipids as terminal of globotetraose structures. Among these characteristic subgroup of Gal(NAc)β3-comprising Galβ3GlcNAc, Galβ3GalNAc, Galβ3GlcNAcβ, Galβ3GalNAcβ/α, and GalNAcβ3Gal GalNAcβ3Gal and the characteristic subgroup of Gal(NAc)β4-comprising Galβ4Glc, Galβ4GlcNAc, and Galβ4GlcNAc are separately preferred.
Preferred sialylated structures
The preferred sialylated structures includes characteristic SAα3Gal-structures SAα3Galβ4Glc, SAα3Galβ3GlcNAc, SAα3Galβ3GalNAc, SAα3Galβ4GlcNAc, SAα3Galβ3GlcNAcβ, SAα3Galβ3GalNAcβ/α, and SAα3Galβ4GlcNAcβ; and biosynthetically partially competing SAα6Galβ-structures SAα6Galβ4Glc, SAα6Galβ4Glcβ; SAα6Galβ4GlcNAc and SAα6Galβ4GlcNAcβ; and disialo structures SAα3Galβ3(SAα6)GalNAcβ/α,
The invention is preferably directed to specific subgroup of Gal(NAc)β3-comprising SAα3Galβ3GlcNAc, SAα3Galβ3GalNAc, SAα3Galβ4GlcNAc, SAα3Galβ3GlcNAcβ, SAα3Galβ3GalNAcβ/α and SAα3Galβ3(SAα6)GalNAcβ/α, and Gal(NAc)β34-comprising sialylated structures. SAα3Galβ4Glc, and SAα3Galβ4GlcNAcβ; and SAα6Galβ4Glc, SAα6Galβ4Glcβ; SAα6Galβ4GlcNAc and SAα6Galβ4GlcNAcβ These are preferred novel regulated markers characteristics for the various stem cells.
Use Together With a Terminal ManαMan-Structure
The terminal non-modified or modified epitopes are in preferred embodiment used together with at least one ManαMan-structure. This is preferred because the structure is in different N-glycan or glycan subgroup than the other epitopes.
Preferred Structural Groups for Hematopoietic Stem Cells.
The present invention provides novel markers and target structures and binders to these for especially embryonic and adult stem cells, when these cells are not heamtopoietic stem cells. From hematopoietic CD34+ cells certain terminal structures such as terminal sialylated type two N-acetyllactosamines such as NeuNAcα3Galβ4GlcNAc (Magnani J. U.S. Pat. No. 6,362,010 ) has been suggested and there is indications for low expression of Slex type structures NeuNAcα3Galβ4(Fucα3)GlcNAc (Xia L et al Blood (2004) 104 (10) 3091-6). The invention is also directed to the NeuNAcα3Galβ4GlcNAc non-polylactosamine variants separately from specific characteristic O-glycans and N-glycans. The invention further provides novel markers for CD133+ cells and novel hematopoietic stem cell markers according to the invention, especially when the structures does not include NeuNAcα3Galβ4(Fucα3)0-1GlcNAc. Preferably the hematopoictic stem cell structures are non-sialylated, fucosylated structures Galβ1-3-structures according to the invention and even more preferably type 1 N-acetyllactosamine structures Galβ3GlcNAc or separately preferred Galβ3GalNAc based structures.
Preferred Binder Molecules
The present invention revealed various types of binder molecules useful for characterization of cells according to the invention and more specifically the preferred cell groups and cell types according to the invention. The preferred binder molecules are classified based on the binding specificity with regard to specific structures or structural features on carbohydrates of cell surface. The preferred binders recognize specifically more than single monosaccharide residue.
It is realized that most of the current binder molecules such as all or most of the plant lectins are not optimal in their specificity and usually recognize roughly one or several monosaccharides with various linkages. Furthermore the specificities of the lectins are usually not well characterized with several glycans of human types.
The preferred high specificity binders recognize
The preferred binders includes natural human and or animal, or other proteins developed for specific recognition of glycans. The preferred high specificity binder proteins are specific antibodies preferably monoclonal antibodies; lectins, preferably mammalian or animal lectins; or specific glycosyltransferring enzymes more preferably glycosidase type enzymes, glycosyltransferases or transglycosylating enzymes.
Modulation of Cells By the Binders
The invention revealed that the specific binders directed to a cell type can be used to modulate cells. In a preferred embodiment the (stem) cells are modulated with regard to carbohydrate mediated interactions. The invention revealed specific binders, which change the glycan structures and thus the receptor structure and function for the glycan, these are especially glycosidases and glycosyltransferring enzymes such as glycosyltransferases and/or transglycosylating enzymes. It is further realized that the binding of a non-enzymatic binder as such select and/or manipulate the cells. The manipulation typically depend on clustering of glycan receptors or affect of the interactions of the glycan receptors with counter receptors such as lectins present in a biological system or model in context of the cells. The invention further reveled that the modulation by the binder in context of cell culture has effect about the growth velocity of the cells.
Preferred Combinations of the Binders
The invention revealed useful combination of specific terminal structures for the analysis of status of a cells. In a preferred embodiment the invention is directed to measuring the level of two different terminal structures according to the invention, preferably by specific binding molecules, preferably at least by two different binders. In a preferred embodiment the binder molecules are directed to structures indicating modification of a terminal receptor glycan structures, preferably the structures represent sequential (substrate structure and modification thereof, such as terminal Gal-structure and corresponding sialylated structure) or competing biosynthetic steps (such as fucosylation and sialylation of terminal Galβ or terminal Galβ3GlcNAc and Galβ4GlcNAc). In another embodiment the binders are directed to three different structures representing sequential and competing steps such as such as terminal Gal-structure and corresponding sialylated structure and corresponding sialylated structure.
The invention is further directed to recognition of at least two different structures according to the invention selected from the groups of non-modified (non-sialylated or non-fucosylated) Gal(NAc)β3/4-core structures according to the invention, preferred fucosylated structures and preferred sialylated structures according to the invention. It is realized that it is useful to recognize even 3, and more preferably 4 and even more preferably five different structures, preferably within a preferred structure group.
Target Structures for Specific Binders and Examples of the Binding Molecules
Combination of Terminal Structures with Specific Glycan Core Structures
It is realized that part of the structural elements are specifically associated with specific glycan core structure. The recognition of terminal structures linked to specific core structures are especially preferred, such high specificity reagents have capacity of recognition almost complete individual glycans to the level of physicochemical characterization according to the invention. For example many specific mannose structures according to the invention are in general quite characteristic for N-glycan glycomes according to the invention. The present invention is especially directed to recognition terminal epitopes.
Common Terminal Structures on Several Glycan Core Structures
The present invention revealed that there are certain common structural features on several glycan types and that it is possible to recognize certain common epitopes on different glycan structures by specific reagents when specificity of the reagent is limited to the terminal without specificity for the core structure. The invention especially revealed characteristic terminal features for specific cell types according to the invention. The invention realized that the common epitopes increase the effect of the recognition. The common terminal structures are especially useful for recognition in the context with possible other cell types or material, which do not contain the common terminal structure in substantial amount.
The invention revealed the presence of the terminal structures on specific core structures such as N-glycan, O-glycan and/or glycolipids. The invention is preferably directed to the selection of specific binders for the structures including recognition of specific glycan core types.
The invention is further directed to glycome compositions of protein linked glycomes such as N-glycans and O-glycans and glycolipids each composition comprising specific amounts of glycan subgroups. The invention is further directed to the compositions when these comprise specific amount of Defined terminal structures.
Specific Preferred Structural Groups
The present invention is directed to recognition of oligosaccharide sequences comprising specific terminal monosaccharide types, optionally further including a specific core structure. The preferred oligosaccharide sequences are in a preferred embodiment classified based on the terminal monosaccharide structures.
The invention further revealed a family of terminal (non-reducing end terminal) disaccharide epitopes based on β-linked galactopyranosylstructures, which may be further modified by fucose and/or sialic acid residues or by N-acetylgroup, changing the terminal Gal residue to GalNAc. Such structures are present in N-glycan, O-glycan and glycolipid subglycomes.
Furthermore the invention is directed to terminal disaccharide epitopes of N-glycans comprising terminal ManαMan.
The structures were derived by mass spectrometric and optionally NMR analysis and by high specificity binders according to the invention, for the analysis of glycolipid structures permethylation and fragmentation mass spectrometry was used. Biosynthetic analysis including known biosynthetic routes to N-glycans, O-glycans and glycolipids was additionally used for the analysis of the glycan compositions and additional support, though not direct evidence due to various regulation levels after mRNA, for it was obtained from gene expression profiling data of Example 24 and Skotttman, H. et al. (2005) Stem cells and similar data obtained from the mRNA profiling for cord blood cells and used to support the biosynthetic analysis using the data of Jaatinen T et al. Stem Cells (2006) 24 (3) 631-41.
1. Structures With Terminal Mannose Monosaccharide
Preferred mannose-type target structures have been specifically classified by the invention. These include various types of high and low-mannose structures and hybrid type structures according to the invention.
The Preferred Terminal Manα-Target Structure Epitopes
The invention revealed the presence of Manα on low mannose N-glycans and high mannose N-glycans. Based on the biosynthetic knowledge and supporting this view by analysis of mRNAs of biosynthetic enzymes and by NMR-analysis the structures and terminal epitopes could be revealed:
Manα2Man, Manα3Man, Manα6Man and Manα3(Manα6)Man, wherein the reducing end Man is preferably either α- or β-linked glycoside and α-linked glycoside in case of Manα2Man:
The general structure of terminal Manα-structures is
Manαx(Manαy)zManα/β
Wherein x is linkage position 2, 3 or 6, and y is linkage position 3 or 6,
z is integer 0 or 1, indicating the presence or the absence of the branch,
with the provision that x and y are not the same position and
when x is 2, the z is 0 and reducing end Man is preferably α-linked;
The low_mannose structures includes preferably non-reducing end terminal epitopes with structures with α3- and/or α6- mannose linked to another mannose residue
Manαx(Manαy)zManα/β
wherein x and y are linkage positions being either 3 or 6,
z is integer 0 or 1, indicating the presence or the absence of the branch,
The high mannose structure includes terminal α2-linked Mannose:
Manα2Man(α) and optionally on or several of the terminal α3- and/or α6-mannose-structures as above.
The presence of terminal Manα-structures is regulated in stem cells and the proportion of the high-Man-structures with terminal Manα2-structures in relation to the low Man structures with Manα3/6- and/or to complex type N-glycans with Gal-backbone epitopes varies cell type specifically.
The data indicated that binder revealing specific terminal Manα2Man and/or Manα3/6Man is very useful in characterization of stem cells. The prior science has not characterized the epitopes as specific signals of cell types or status.
The invention is especially directed to the measuring the levels of both low-Man and high-Man structures, preferably by quantifying two structure type the Manα2Man-structures and the Manα3/6Man-structures from the same sample.
The invention is especially directed to high specificity binders such as enzymes or monoclonal antibodies for the recognition of the terminal Manα-structures from the preferred stem cells according to the invention, more preferably from differentiated embryonal type cells, more preferably differentiated beyond embryoid bodies such as stage 3 differentiated cells, most preferably the structures are recognized from stage 3 differentiated cells. The invention is especially preferably directed to detection of the structures from adult stem cells more preferably mesenchymal stem cells, especially from the surface of mesenchymal stem cells and in separate embodiment from blood derived stem cells, with separately preferred groups of cord blood and bone marrow stem cells. In a preferred embodiment the cord blood and/or peripheral blood stem cell is not hematopoietic stem cell.
Low or Uncharacterised Specificity Binders
preferred for recognition of terminal mannose structures includes mannose-monosaccharide binding plant lectins. The invention is in preferred embodiment directed to the recognition of stem cells such as embryonal type stem cells by a Manα-recognizing lectin such as lectin PSA. In a preferred embodiment the recognition is directed to the intracellular glycans in permebilized cells. In another embodiment the Manα-binding lectin is used for intact non-permeabilized cells to recognize terminal Manα-from contaminating cell population such as fibroblast type cells or feeder cells as shown in corresponding Example 6.
Preferred High Specific High Specificity Binders
include
i) Specific mannose residue releasing enzymes such as linkage specific mannosidases, more preferably an α-mannosidase or β-mannosidase.
Preferred α-mannosidases includes linkage specific α-mannosidases such as α-Mannosidases cleaving preferably non-reducing end terminal, an example of preferred mannosidascs is jack bean α-mannosidase (Canavalia ensiformis; Sigma, USA) and homologous α-mannosidases
α2-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα2-structures; or
α3-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα3-structures; or
α6-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα6-structures;
Preferred β-mannosidases includes β-mannosidases capable of cleaving β4-linked mannose from non-reducing end terminal of N-glycan core Manβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes.
ii) Specific binding proteins recognizing preferred mannose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins. The invention is directed to antibodies recognizing MS2B1 and more preferably MS3B2-structures.
Mannosidase analyses of neutral N-glycans Examples of detection of mannosylated by α-mannosidase binder and mass spectrometric profiling of the glycans cord blood and peripheral blood mesenchymal cells in Example 1; for cord blood cells in example 19, hESC EB and stage 3 cells in Example 10, in Example 22 and 2 for embryonal stem cells and differentiated cells;, and, and indicates presence of all types of Manβ4, Manα3/6 terminal structures of Man1-4GlcNAcβ4(Fucα6)0-1GlcNAc-comprising low Mannose glycans as described by the invention.
Lectin Binding
α-linked mannose was demonstrated in Example 7 for human mesenchymal cell by lectins Hippeastrum hybrid (HUA) and Pisum sativum (PSA) lectins suggests that they express mannose, more specifically α-linked mannose residues on their surface glycoconjugates such as N-glycans. Possible α-mannose linkages include α1→2, α1→3, and α1→6. The lower binding of Galanthus nivalis (GNA) lectin suggests that some α-mannose linkages on the cell surface are more prevalent than others. The combination of the terminal Manα-recognizing low affinity reagents appears to be useful and correspond to results obtained by mannosidase screening; NMR and mass spectrometric results. Lectin binding of cord blood cells is in example 8.
Mannose-binding lectin labeling. Labelling of the mesenchymal cells in Example 7 was also detected with human serum mannose-binding lectin (MBL) coupled to fluorescein label. This indicate that ligands for this innate immunity system component may be expressed on in vitro cultured BM MSC cell surface.
The present invention is especially directed to analysis of terminal Manα-on cell surfaces as the structure is ligand for MBL and other lectins of innate immunity. It is further realized that terminal Manα-structures would direct cells in blood circulation to mannose receptor comprising tissues such as Kupfer cells of liver. The invention is especially directed to control of the amount of the structure by binding with a binder recognizing terminal Manα-structure.
In a preferred embodiment the present invention is directed to the testing of presence of ligands of lectins present in human, such as lectins of innate immunity and/or lectins of tissues or leukocytes, on stem cells by testing of the binding of the lectin (purified or preferably a recombinant form of the lectin, preferably in labeled form) to the stem cells. It is realized that such lectins includes especially lectins binding Manα and Galβ/GalNAcβ-structures (terminal non-reducing end or even α6-sialylated forms according to the invention.
Mannose Binding Antibodies
A high-mannose binding antibody has been described for example in Wang LX et al (2004) 11 (1) 127-34. Specific antibodies for short mannosylated structures such as the trimannosyl core structure have been also published.
2. Structures With Terminal Gal-Monosaccharide
Preferred galactose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal Gal
Preferred for recognition of terminal galactose structures includes plant lectins such as ricin lectin (ricinus communis agglutinin RCA), and peanut lectin(/agglutinin PNA). The low resolution binders have different and broad specificities.
Preferred High Specific High Specificity Binders Include
i) Specific galactose residue releasing enzymes such as linkage specific galactosidases, more preferably α-galactosidase or β-galactosidase.
Preferred α-galactosidases include linkage galactosidases capable of cleaving Galα3Gal-structures revealed from specific cell preparations
Preferred β-galactosidases includes β-galactosidases capable of cleaving
β4-linked galactose from non-reducing end terminal Galβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes and
β3-linked galactose from non-reducing end terminal Galβ3GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes
ii) Specific binding proteins recognizing preferred galactose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as galectins.
Specific Binder Experiments and Examples for Galβ-Structures
Specific exoglycosidase and glycosyltransferase analysis for the structures are included in Example 22 and 2 for embryonal stem cells and differentiated cells; Example 1 mesenchymal cells, for cord blood cells in example 19 and in example 20 on cell surface and including glycosyltransferases, for glycolipids in Example 15. Sialylation level analysis related to terminal Galβ and Sialic acid expression is in Example 9.
Preferred enzyme binders for the binding of the Galβ-epitopes according to the invention includes β1,4-galactosidase e.g from S. pneumoniae (rec. in E. coli, Calbiochem, USA), β1,3-galactosidase (e.g rec. in E. coli, Calbiochem ); glycosyltransferases: α2,3-(N)-sialyltransferase (rat, recombinant in S. frugiperda, Calbiochem), α1,3-fucosyltransferase VI (human, recombinant in S. frugiperda, Calbiochem), which are known to recognize specific N-acetyllactosamine epitopes, Fuc-TVI especially Galβ4GlcNAc.
Plant low specificity lectin, such as RCA, PNA, ECA, STA, and
PWA, data is in Example 6 for hESC, Example 7 for MSCs, Example 8 for cord blood, effects of the lectin binders for the cell proliferation is in Example 14, cord blood cell selection is in Example 16.
Human lectin analysis by various galectin expression is Example 17 from cord blood and embryonal cells,
In example 18 there is antibody labeling of especially fucosylated and galactosylated structures.
Poly-N-acetyllactosamine sequences. Labelling of the cells by pokeweed (PWA) and less intense labelling by Solanum tuberosum (STA) lectins suggests that the cells express poly-N-acetyllactosamine sequences on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. The results further suggest that cell surface poly-N-acetyllactosamine chains contain both linear and branched sequences.
3. Structures With Terminal GalNAc-Monosaccharide
Preferred GalNAc-type target structures have been specifically revealed by the invention. These include especially LacdiNAc, GalNAcβGlcNAc-type structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal GalNAc
Several plant lectins has been reported for recognition of terminal GalNAc. It is realized that some GalNAc-recognizing lectins may be selected for low specificity recognition of the preferred LacdiNAc-structures.
β-linked N-acetylgalactosamine. Abundant labelling of hESC by Wisteria floribunda lectin (WFA) suggests that hESC express β-linked non-reducing terminal N-acetylgalactosamine residues on their surface glycoconjugates such as N- and/or O-glycans. The absence of specific binding of WFA to mEF suggests that the lectin ligand epitopes are less abundant in mEF.
The low specificity binder plant lectins such as Wisteria floribunda agglutinin and Lotus tetragonolobus agglutinin bind to oligosaccharide sequences Srivatsan J. et al. Glycobiology (1992) 2 (5) 445-52: Do, K Y et al. Glycobiology (1997) 7 (2) 183-94; Yan, L., et al (1997) Glycoconjugate J. 14 (1) 45-55. The article also shows that the lectins are useful for recognition of the structures, when the cells are verified not to contain other structures recognized by the lectins.
In a preferred embodiment a low specificity lectin reagent is used in combination with another reagent verifying the binding.
Preferred High Specific High Specificity Binders Include
i) The invention revealed that β-linked GalNAc can be recognized by specific β-N-acetylhexosaminidase enzyme in combination with β-N-acetylhexosaminidase enzyme. This combination indicates the terminal monosaccharide and at least part of the linkage structure.
Preferred β-N-acetylehexosaminidase, includes enzyme capable of cleaving β-linked GalNAc from non-reducing end terminal GalNAcβ4/3-structures without cleaving α-linked HexNAc in the glycomes; preferred N-acetylglucosaminidases include enzyme capable of cleaving β-linked GlcNAc but not GalNAc.
Specific binding proteins recognizing preferred GalNAcβ4, more preferably GalNAcβ4GlcNAc, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins.
Examples antibodies recognizing LacdiNAc-structures includes publications of Nyame A. K. et al. (1999) Glycobiology 9 (10) 1029-35; van Remoortere A. et al (2000) Glycobiology 10 (6) 601-609; and van Remoortere A. et al (2001) Infect Immun. 69 (4) 2396-2401. The antibodies were characterized in context of parasite (Schistosoma) infection of mice and humans, but according to the present invention these antibodies can also be used in screening stem cells. The present invention is especially directed to selection of specific clones of LacdiNac recognizing antibodies specific for the subglycomes and glycan structures present in N-glycomes of the invention.
The articles disclose antibody binding specificities similar to the invention and methods for producing such antibodies, therefore the antibody binders are obvious for person skilled in the art. The immunogenicity of certain LacdiNAc-structures are demonstrated in human and mice.
The use of glycosidase in recognition of the structures in known in the prior art similarity as in the present invention for example in Srivatsan J. et al. (1992) 2 (5) 445-52.
4. Structures With Terminal GlcNAc-Monosaccharide
Preferred GlcNAc-type target structures have been specifically revealed by the invention. These include especially GlcNAcβ-type structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal GlcNAc
Several plant lectins has been reported for recognition of terminal GlcNAc. It is realized that some GlcNAc-recognizing lectins may be selected for low specificity recognition of the preferred GlcNAc-structures.
Preferred High Specific High Specificity Binders Include
Preferred β-N-acetylglucosaminidase includes enzyme capable of cleaving β-linked GlcNAc from non-reducing end terminal GlcNAcβ2/3/6-structures without cleaving β-linked GalNAc or α-linked HexNAc in the glycomes;
ii) Specific binding proteins recognizing preferred GlcNAcβ2/3/6, more preferably GlcNAcβ32Manα, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins.
Specific Binder Experiments and Examples for Terminal HexNAc(GalNAc/GlcNAc and GlcNAc Structures
Specific exoglycosidase analysis for the structures are included in Example 22 and 2 for embryonal stem cells and differentiated cells; Example 1 for mesenchymal cells, for cord blood cells in example 19 and for glycolipids in Example 15.
Plant low specificity lectin, such as WFA and GNAII, and data is in Example 6 for hESC, Example 7 for MSCs, Example 8 for cord blood, effects of the lectin binders for the cell proliferation is in Example 14, cord blood cell selection is in Example 16.
Preferred enzymes for the recognition of the structures includes general hexosaminidase β-hexosaminidase from Jack beans (C. ensiformis, Sigma, USA) and and specific N-acetylglucosaminidases or N-acetylgalactosaminidases such as β-glucosaminidase from S. pneumoniae (rec. in E. coli, Calbiochem, USA). Combination of these allows determination of LacdiNAc on Verification of the target structures includes NMR analysis as exemplified in Example 21
The invention is further directed to analysis of the structures by specific monoclonal antibodies recognizing terminal GlcNAcα-structures such as described in Holmes and Greene (1991) 288 (1) 87-96, with specificity for several terminal GlcNAc structures.
The invention is specifically directed to the use of the terminal structures according to the invention for selection and production of antibodies for the structures.
Verification of the target structures includes mass spectrometry and permethylation/fragmentation analysis for glycolipid structures
5. Structures With Terminal Fucose-Monosaccharide
Preferred fucose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal Fuc
Preferred for recognition of terminal fucose structures includes fucose monosaccharide binding plant lectins. Lectins of Ulex europeaus and Lotus tetragonolobus has been reported to recognize for example terminal Fucoses with some specificity binding for α2-linked structures, and branching α3-fucose, respectively. Data is in Example 6 for hESC, Example 7 for MSCs, Example 8 for cord blood, effects of the lectin binders for the cell proliferation is in Example 14, cord blood cell selection is in Example 16.
Preferred High Specific High Specificity Binders Include
i) Specific fucose residue releasing enzymes such as linkage fucosidases, more preferably α-fucosidase.
Preferred α-fucosidases include linkage fucosidases capable of cleaving Fucα2Gal-, and Galβ4/3(Fucα3/4)GlcNAc-structures revealed from specific cell preparations.
Specific exoglycosidase and for the structures are included in Example 22 and 2 for embryonal stem cells and differentiated cells; Example 1 for mesenchymal cells, for cord blood cells in example 19 and in example 20 on cell surface for glycolipids in Example 15. Preferred fucosidases includes α1,3/4-fucosidase e.g. α1,3/4-fucosidase from Xanthomonas sp. (Calbiochem, USA), and α1,2-fucosidase e.g α1,2-fucosidase from X. manihotis (Glyko),
ii)Specific binding proteins recognizing preferred fucose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as Lewis x, Galβ4(Fucα3)GlcNAc, and sialyl-Lewis x, SAcα3Galβ4(Fucα3)GlcNAc.
The preferred antibodies includes antibodies recognizing specifically Lewis type structures such as Lewis x, and sialyl-Lewis x. More preferably the Lewis x-antibody is not classic SSEA-1 antibody, but the antibody recognizes specific protein linked Lewis x structures such as Galβ4(Fucα3)GlcNAcβ2Manα-linked to N-glycan core.
Specific Binder Experiments and Examples for Terminal HexNAc Structures
Specific exoglycosidase analysis for the structures are included in Example 22 and 2 for embryonal stem cells and differentiated cells; Example 1 for mesenchymal cells, for cord blood cells in example 19 and for glycolipids in Example 15.
Plant low specificity lectin, such as WFA and GNAII, and data is in Example 6 for hESC, Example 7 for MSCs, Example 8 for cord blood, effects of the lectin binders for the cell proliferation is in Example 14, cord blood cell selection is in Example 16.
Preferred enzymes for the recognition of the structures includes general hexosaminidase β-hexosaminidase from Jack beans (C. ensiformis, Sigma, USA) and and specific N-acetylglucosaminidases or N-acetylgalactosaminidases such as β-glucosaminidase from S. pneumoniae (rec. in E. coli, Calbiochem, USA). Combination of these allows determination of LacdiNAc on Verification of the target structures includes NMR analysis as exemplified in Example 21
Verification of the target structures includes mass spectrometry and permethylation/fragmentation analysis for glycolipid structures
6. Structures With Terminal Sialic Acid-Monosaccharide
Preferred sialic acid-type target structures have been specifically classified by the invention.
Low or Uncharacterised Specificity Binders for Terminal Sialic Acid
Preferred for recognition of terminal sialic acid structures includes sialic acid monosaccharide binding plant lectins.
Preferred High Specific High Specificity Binders Include
i) Specific sialic acid residue releasing enzymes such as linkage sialidases, more preferably α-sialidases.
Preferred α-sialidases include linkage sialidases capable of cleaving SAα3Gal- and SAα6Gal-structures revealed from specific cell preparations by the invention.
Preferred low specificity lectins, with linkage specificity include the lectins, that are specific for SAα3Gal-structures, preferably being Maackia amurensis lectin and/or lectins specific for SAα6Gal-structures, preferably being Sambucus nigra agglutinin.
ii) Specific binding proteins recognizing preferred sialic acid oligosaccharide sequence structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc or sialic acid recognizing Siglec-proteins. The preferred antibodies includes antibodies recognizing specifically sialyl-N-acetyllactosamines, and sialyl-Lewis x.
Preferred antibodies for NeuGc-structures includes antibodies recognizes a structure NeuGcα3Galβ4Glc(NAc)0 or 1 and/or GalNAcβ4[NeuGcα3]Galβ4Glc(NAc)0 or 1, wherein [ ] indicates branch in the structure and ( )0 or 1 a structure being either present or absent. In a preferred embodiment the invention is directed recognition of the N-glycolyl-Neuraminic acid structures by antibody, preferably by a monoclonal antibody or human/humanized monoclonal antibody. A preferred antibody contains the variable domains of P3-antibody.
Specific Binder Experiments and Examples for α3/6 Sialylated Structures
Specific exoglycosidase analysis for the structures are included in Example 22 and 2 for embryonal stem cells and differentiated cells; Example 1 for mesenchymal cells, for cord blood cells in example 19 and in example 20 on cell surface and including glycosyltransferases, for glycolipids in Example 15. Sialylation level analysis related to terminal Galβ and Sialic acid expression is in Example 9.
Preferred enzyme binders for the binding of the Sialic acid epitopes according to the invention includes: sialidases such as general sialidase α2,3/6/8/9-sialidase from A. ureafaciens (Glyko), and α2,3-Sialidases such as: α2,3-sialidase from S. pneumoniae (Calbiochem, USA). Other useful sialidases are known from E. coli, and Vibrio cholerae.
α1,3-facosyltransferase VI (human, recombinant in S. frugiperda, Calbiochem), which are known to recognize specific N-acetyllactosamine epitopes, Fuc-TVI especially including SAα3Galβ4GlcNAc.
Plant low specificity lectin, such as MAA and SNA, and data is in Example 6 for hESC, Example 7 for MSCs, Example 8 for cord blood, effects of the lectin binders for the cell proliferation is in Example 14, cord blood cell selection is in Example 16.
In example 18 there is antibody labeling of sialylstructures.
Preferred Uses For Stem Cell Type Specific Galectins and/or Galectin Ligands
As described in the Examples, the inventors also found that different stem cells have distinct galectin expression profiles and also distinct galectin (glycan) ligand expression profiles. The present invention is further directed to using galactose-binding reagents, preferentially galactose-binding lectins, more preferentially specific galectins; in a stem cell type specific fashion to modulate or bind to certain stem cells as described in the present invention to the uses described. In a further preferred embodiment, the present invention is directed to using galectin ligand structures, derivatives thereof, or ligand-mimicking reagents to uses described in the present invention in stem cell type specific fashion. The preferred galectins are listed in Example 17—
The invention is in a preferred embodiment directed to the recognition of terminal N-acetyllactosamines from cells by galectins as described above for recognition of Galβ4GlNAc and Galβ3GlcNAc structures: The results indicate that both CB CD34+/CD133+ stem cell populations and hESC have an interesting and distinct galectin expression profiles, leading to different galectin ligand affinity profiles (Hirabayashi et al., 2002). The results further correlate with the glycan analysis results showing abundant galectin ligand expression in these stem cells, especially non-reducing terminal β-Gal and type II LacNAc, poly-LacNAc, β1,6-branched poly-LacNAc, and complex-type N-glycan expression.
Specific Technical Aspects of Stem Cell Glycome Analysis
Isolation of Glycans and Glycan Fractions
Glycans of the present invention can be isolated by the methods known in the art. A preferred glycan preparation process consists of the following steps:
1° isolating a glycan-containing fraction from the sample,
2° . . . Optionally purification the fraction to useful purity for glycome analysis
The preferred isolation method is chosen according to the desired glycan fraction to be analyzed. The isolation method may be either one or a combination of the following methods, or other fractionation methods that yield fractions of the original sample:
1° extraction with water or other hydrophilic solvent, yielding water-soluble glycans or glycoconjugates such as free oligosaccharides or glycopeptides,
2° extraction with hydrophobic solvent, yielding hydrophilic glycoconjugates such as glycolipids,
3° N-glycosidase treatment, especially Flavobacterium meningosepticum N-glycosidase F treatment, yielding N-glycans,
4° alkaline treatment, such as mild (e.g. 0.1 M) sodium hydroxide or concentrated ammonia treatment, either with or without a reductive agent such as borohydride, in the former case in the presence of a protecting agent such as carbonate, yielding β-elimination products such as O-glycans and/or other elimination products such as N-glycans,
5° endoglycosidase treatment, such as endo-β-galactosidase treatment, especially Escherichia freundii endo-β-galactosidase treatment, yielding fragments from poly-N-acetyllactosamine glycan chains, or similar products according to the enzyme specificity, and/or
6° protease treatment, such as broad-range or specific protease treatment, especially trypsin treatment, yielding proteolytic fragments such as glycopeptides.
The released glycans are optionally divided into sialylated and non-sialylated subfractions and analyzed separately. According to the present invention, this is preferred for improved detection of neutral glycan components, especially when they are rare in the sample to be analyzed, and/or the amount or quality of the sample is low. Preferably, this glycan fractionation is accomplished by graphite chromatography.
According to the present invention, sialylated glycans are optionally modified in such manner that they are isolated together with the non-sialylated glycan fraction in the non-sialylated glycan specific isolation procedure described above, resulting in improved detection simultaneously to both non-sialylated and sialylated glycan components. Preferably, the modification is done before the non-sialylated glycan specific isolation procedure. Preferred modification processes include neuraminidase treatment and derivatization of the sialic acid carboxyl group, while preferred derivatization processes include amidation and esterification of the carboxyl group.
Glycan Release Methods
The preferred glycan release methods include, but are not limited to, the following methods:
Free glycans—extraction of free glycans with for example water or suitable water-solvent mixtures.
Protein-linked glycans including O- and N-linked glycans—alkaline elimination of protein-linked glycans, optionally with subsequent reduction of the liberated glycans.
Mucin-type and other Ser/Thr O-linked glycans—alkaline β-elimination of glycans, optionally with subsequent reduction of the liberated glycans.
N-glycans—enzymatic liberation, optionally with N-glycosidase enzymes including for example N-glycosidase F from C. meningosepticum, Endoglycosidase H from Streptomyces, or N-glycosidase A from almonds.
Lipid-linked glycans including glycosphingolipids—enzymatic liberation with endoglycoceramidase enzyme; chemical liberation; ozonolytic liberation.
Glycosaminoglycans—treatment with endo-glycosidase cleaving glycosaminoglycans such as chondroinases, chondroitin lyases, hyalurondases, heparanases, beparatinases, or keratanases/endo-beta-galactosidases; or use of O-glycan release methods for O-glycosidic Glycosaminoglycans; or N-glycan release methods for N-glycosidic glycosaminoglycans or use of enzymes cleaving specific glycosaminoglycan core structures; or specific chemical nitrous acid cleavage methods especially for amine/N-sulphate comprising glycosaminoglycans
Glycan fragments—specific exo- or endoglycosidase enzymes including for example keratanase, endo-β-galactosidase, hyaluronidase, sialidase, or other exo- and endoglycosidase enzyme; chemical cleavage methods; physical methods
Preferred Target Cell Populations and Types for Analysis According to the Invention
Early Human Cell Populations
Human Stem Cells and Multipotent Cells
Under broadest embodiment the present invention is directed to all types of human stem cells, meaning fresh and cultured human stem cells. The stem cells according to the invention do not include traditional cancer cell lines, which may differentiate to resemble natural cells, but represent non-natural development, which is typically due to chromosomal alteration or viral transfection. Stem cells include all types of non-malignant multipotent cells capable of differentiating to other cell types. The stem cells have special capacity stay as stem cells after cell division, the self-renewal capacity.
Under the broadest embodiment for the human stem cells, the present invention describes novel special glycan profiles and novel analytics, reagents and other methods directed to the glycan profiles. The invention shows special differences in cell populations with regard to the novel glycan profiles of human stem cells.
The present invention is further directed to the novel structures and related inventions with regard to the preferred cell populations according to the invention. The present invention is further directed to specific glycan structures, especially terminal epitopes, with regard to specific preferred cell population for which the structures are new.
Preferred Types of Early Human Cells
The invention is directed to specific types of early human cells based on the tissue origin of the cells and/or their differentiation status.
The present invention is specifically directed to early human cell populations meaning multipotent cells and cell populations derived thereof based on origins of the cells including the age of donor individual and tissue type from which the cells are derived, including preferred cord blood as well as bone marrow from older individuals or adults. Preferred differentiation status based classification includes preferably “solid tissue progenitor” cells, more preferably “mesenchymal-stem cells”, or cells differentiating to solid tissues or capable of differentiating to cells of either ectodermal, mesodermal, or endodermal, more preferentially to mesenchymal stem cells.
The invention is further directed to classification of the early human cells based on the status with regard to cell culture and to two major types of cell material. The present invention is preferably directed to two major cell material types of early human cells including fresh, frozen and cultured cells.
Cord Blood Cells, Embryonal-Type Cells and Bone Marrow Cells
The present invention is specifically directed to early human cell populations meaning multipotent cells and cell populations derived thereof based on the origin of the cells including the age of donor individual and tissue type from which the cells are derived.
Cells Differentiating to Solid Tissues, Preferably to Mesenchymal Stem Cells
The invention is specifically under a preferred embodiment directed to cells, which are capable of differentiating to non-hematopoietic tissues, referred as “solid tissue progenitors”, meaning to cells differentiating to cells other than blood cells. More preferably the cell population produced for differentiation to solid tissue are “mesenchymal-type cells”, which are multipotent cells capable of effectively differentiating to cells of mesodermal origin, more preferably mesenchymal stem cells.
Most of the prior art is directed to hematopoietic cells with characteristics quite different from the mesenchymal-type cells and mesenchymal stem cells according to the invention.
Preferred solid tissue progenitors according to the invention includes selected multipotent cell populations of cord blood, mesenchymal stem cells cultured from cord blood, mesenchymal stem cells cultured/obtained from bone marrow and embryonal-type cells. In a more specific embodiment the preferred solid tissue progenitor cells are mesenchymal stem cells, more preferably “blood related mesenchymal cells”, even more preferably mesenchymal stem cells derived from bone marrow or cord blood.
Under a specific embodiment CD34+ cells as a more hematopoietic stem cell type of cord blood or CD34+ cells in general are excluded from the solid tissue progenitor cells.
Early Blood Cell Populations and Corresponding Mesenchymal Stem Cells
Cord Blood
The early blood cell populations include blood cell materials enriched with multipotent cells. The preferred early blood cell populations include peripheral blood cells enriched with regard to multipotent cells, bone marrow blood cells, and cord blood cells. In a preferred embodiment the present invention is directed to mesenchymal stem cells derived from early blood or early blood derived cell populations, preferably to the analysis of the cell populations.
Bone Marrow
Another separately preferred group of early blood cells is bone marrow blood cells. These cell do also comprise multipotent cells. In a preferred embodiment the present invention is directed to directed to mesenchymal stem cells derived from bone marrow cell populations, preferably to the analysis of the cell populations.
Preferred Subpopulations of Early Human Blood Cells
The present invention is specifically directed to subpopulations of early human cells. In a preferred embodiment the subpopulations are produced by selection by an antibody and in another embodiment by cell culture favouring a specific cell type. In a preferred embodiment the cells are produced by an antibody selection method preferably from early blood cells. Preferably the early human blood cells are cord blood cells.
The CD34 positive cell population is relatively large and heterogenous. It is not optimal for several applications aiming to produce specific cell products. The present invention is preferably directed to specifically selected non-CD34 populations meaning cells not selected for binding to the CD34-marker, called homogenous cell populations. The homogenous cell populations may be of smaller size mononuclear cell populations for example with size corresponding to CD133+ cell populations and being smaller than specifically selected CD34+ cell populations. It is further realized that preferred homogenous subpopulations of early human cells may be larger than CD34+ cell populations.
The homogenous cell population may a subpopulation of CD34+ cell population, in preferred embodiment it is specifically a CD133+ cell population or CD133-type cell population. The “CD133-type cell populations” according to the invention are similar to the CD133+ cell populations, but preferably selected with regard to another marker than CD133. The marker is preferably a CD133-coexpressed marker. In a preferred embodiment the invention is directed to CD133+ cell population or CD133+ subpopulation as CD133-type cell populations. It is realized that the preferred homogeneous cell populations further includes other cell populations than which can be defined as special CD133-type cells.
Preferably the homogenous cell populations are selected by binding a specific binder to a cell surface marker of the cell population. In a preferred embodiment the homogenous cells are selected by a cell surface marker having lower correlation with CD34-marker and higher correlation with CD133 on cell surfaces. Preferred cell surface markers include α3-sialylated structures according to the present invention enriched in CD133-type cells. Pure, preferably complete, CD133+ cell population are preferred for the analysis according to the present invention.
The present invention is directed to essential mRNA-expression markers, which would allow analysis or recognition of the cell populations from pure cord blood derived material. The present invention is specifically directed to markers specifically expressed on early human cord blood cells.
The present invention is in a preferred embodiment directed to native cells, meaning non-genetically modified cells. Genetic modifications are known to alter cells and background from modified cells. The present invention further directed in a preferred embodiment to fresh non-cultivated cells.
The invention is directed to use of the markers for analysis of cells of special differentiation capacity, the cells being preferably human blood cells or more preferably human cord blood cells.
Preferred Purity of Reproducibly Highly Purified Mononuclear Complete Cell Populations from Human Cord Blood
The present invention is specifically directed to production of purified cell populations from human cord blood. As described above, production of highly purified complete cell preparations from human cord blood has been a problem in the field. In the broadest embodiment the invention is directed to biological equivalents of human cord blood according to the invention, when these would comprise similar markers and which would yield similar cell populations when separated similarly as the CD133+ cell population and equivalents according to the invention or when cells equivalent to the cord blood is contained in a sample further comprising other cell types. It is realized that characteristics similar to the cord blood can be at least partially present before the birth of a human. The inventors found out that it is possible to produce highly purified cell populations from early human cells with purity useful for exact analysis of sialylated glycans and related markers.
Preferred Bone Marrow Cells
The present invention is directed to multipotent cell populations or early human blood cells from human bone marrow. Most preferred arc bone marrow derived mesenchymal stem cells. In a preferred embodiment the invention is directed to mesenchymal stem cells differentiating to cells of structural support function such as bone and/or cartilage.
Embryonal-Type Cell Populations
The present invention is specifically directed to methods directed to embryonal-type cell populations, preferably when the use does not involve commercial or industrial use of human embryos nor involve destruction of human embryos. The invention is under a specific embodiment directed to use of embryonal cells and embryo derived materials such as embryonal stem cells, whenever or wherever it is legally acceptable. It is realized that the legislation varies between countries and regions.
The present invention is further directed to use of embryonal-related, discarded or spontaneously damaged material, which would not be viable as human embryo and cannot be considered as a human embryo. In yet another embodiment the present invention is directed to use of accidentally damaged embryonal material, which would not be viable as human embryo and cannot be considered as human embryo.
It is further realized that early human blood derived from human cord or placenta after birth and removal of the cord during normal delivery process is ethically uncontroversial discarded material, forming no part of human being.
The invention is further directed to cell materials equivalent to the cell materials according to the invention. It is further realized that functionally and even biologically similar cells may be obtained by artificial methods including cloning technologies.
Mesenchymal Multipotent Cells
The present invention is further directed to mesenchymal stem cells or multipotent cells as preferred cell population according to the invention. The preferred mesencymal stem cells include cells derived from early human cells, preferably human cord blood or from human bone marrow. In a preferred embodiment the invention is directed to mesenchymal stem cells differentiating to cells of structural support function such as bone and/or cartilage, or to cells forming soft tissues such as adipose tissue.
Control of Cell Status and Potential Contaminations by Glycosylation Analysis
Control of Cell Status
Control of Raw Material Cell Population
The present invention is directed to control of glycosylation of cell populations to be used in therapy.
The present invention is specifically directed to control of glycosylation of cell materials, preferably when
Time Dependent Changes During Cultivation of Cells
Furthermore during long term cultivation of cells spontaneous mutations may be caused in cultivated cell materials. It is noted that mutations in cultivated cell lines often cause harmful defects on glycosylation level.
It is further noticed that cultivation of cells may cause changes in glycosylation. It is realized that minor changes in any parameter of cell cultivation including quality and concentrations of various biological, organic and inorganic molecules, any physical condition such as temperature, cell density, or level of mixing may cause difference in cell materials and glycosylation. The present invention is directed to monitoring glycosylation changes according to the present invention in order to observe change of cell status caused by any cell culture parameter affecting the cells.
The present invention is in a preferred embodiment directed to analysis of glycosylation changes when the density of cells is altered. The inventors noticed that this has a major impact of the glycosylation during cell culture.
It is further realized that if there is limitations in genetic or differentiation stability of cells, these would increase probability for changes in glycan structures. Cell populations in early stage of differentiation have potential to produce different cell populations. The present inventors were able to discover glycosylation changes in early human cell populations.
Differentiation of Cell Lines
The present invention is specifically directed to observe glycosylation changes according to the present invention when differentiation of a cell line is observed. In a preferred embodiment the invention is directed to methods for observation of differentiation from early human cell or another preferred cell type according to the present invention to mesodermal types of stem cell
In case there is heterogeneity in cell material this may cause observable changes or harmful effects in glycosylation.
Furthermore, the changes in carbohydrate structures, even non-harmful or functionally unknown, can be used to obtain information about the exact genetic status of the cells.
The present invention is specifically directed to the analysis of changes of glycosylation, preferably changes in glycan profiles, individual glycan signals, and/or relative abundancies of individual glycans or glycan groups according to the present invention in order to observe changes of cell status during cell cultivation.
Analysis of Supporting/Feeder Cell Lines
The present invention is specifically directed to observe glycosylation differences according to the present invention, on supporting/feeder cells used in cultivation of stem cells and early human cells or other preferred cell type. It is known in the art that some cells have superior activities to act as a support/feeder cells than other cells. In a preferred embodiment the invention is directed to methods for observation of differences on glycosylation on these supporting/feeder cells. This information can be used in design of novel reagents to support the growth of the stem cells and early human cells or other preferred cell type.
Contaminations or Alterations in Cells Due to Process Conditions
Conditions and Reagents Inducing Harmful Glycosylation or Harmful Glycosylation Related Effects to Cells During Cell Handling
The inventors further revealed conditions and reagents inducing harmful glycans to be expressed by cells with same associated problems as the contaminating glycans. The inventors found out that several reagents used in a regular cell purification processes caused changes in early human cell materials.
It is realized, that the materials during cell handling may affect the glycosylation of cell materials. This may be based on the adhesion, adsorption, or metabolic accumulation of the structure in cells under processing.
In a preferred embodiment the cell handling reagents are tested with regard to the presence glycan component being antigenic or harmful structure such as cell surface NcuGc, Ncu-O-Ac or mannose structure. The testing is especially preferred for human early cell populations and preferred subpopulations thereof.
The inventors note effects of various effector molecules in cell culture on the glycans expressed by the cells if absortion or metabolic transfer of the carbohydrate structures have not been performed. The effectors typically mediate a signal to cell for example through binding a cell surface receptor.
The effector molecules include various cytokines, growth factors, and their signalling molecules and co-receptors. The effector molecules may be also carbohydrates or carbohydrate binding proteins such as lectins.
Controlled Cell Isolation/Purification and Culture Conditions to Avoid Contaminations with Harmful Glycans or Other Alteration in Glycome Level
Stress Caused by Cell Handling
It is realized that cell handling including isolation/purification, and handling in context of cell storage and cell culture processes are not natural conditions for cells and cause physical and chemical stress for cells. The present invention allows control of potential changes caused by the stress. The control may be combined by regular methods may be combined with regular checking of cell viability or the intactness of cell structures by other means.
Examples of Physical and/or Chemical Stress in Cell Handling Step
Washing and centrifuging cells cause physical stress which may break or harm cell membrane structures. Cell purifications and separations or analysis under non-physiological flow conditions also expose cells to certain non-physiological stress. Cell storage processes and cell preservation and handling at lower temperatures affects the membrane structure. All handling steps involving change of composition of media or other solution, especially washing solutions around the cells affect the cells for example by altered water and salt balance or by altering concentrations of other molecules effecting biochemical and physiological control of cells.
Observation and Control of Glycome Changes by Stress in Cell Handling Processes
The inventors revealed that the method according to the invention is useful for observing changes in cell membranes which usually effectively alter at least part of the glycome observed according to the invention. It is realized that this related to exact organization and intact structures cell membranes and specific glycan structures being part of the organization.
The present invention is specifically directed to observation of total glycome and/or cell surface glycomes, these methods are further aimed for the use in the analysis of intactness of cells especially in context of stressful condition for the cells, especially when the cells are exposed to physical and/or chemical stress. It is realized that each new cell handling step and/or new condition for a cell handling step is useful to be controlled by the methods according to the invention. It is further realized that the analysis of glycome is useful for search of most effectively altering glycan structures for analysis by other methods such as binding by specific carbohydrate binding agents including especially carbohydrate binding proteins (lectins, antibodies, enzymes and engineered proteins with carbohydrate binding activity).
Controlled Cell Preparation (Isolation or Purification) with Regard to Reagents
The inventors analysed process steps of common cell preparation methods. Multiple sources of potential contamination by animal materials were discovered.
The present invention is specifically directed to carbohydrate analysis methods to control of cell preparation processes. The present invention is specifically directed to the process of controlling the potential contaminations with animal type glycans, preferably N-glycolylneuraminic acid at various steps of the process.
The invention is further directed to specific glycan controlled reagents to be used in cell isolation
The glycan-controlled reagents may be controlled on three levels:
The control levels 2 and 3 are useful especially when cell status is controlled by glycan analysis and/or profiling methods. In case reagents in cell preparation would contain the indicated glycan structures this would make the control more difficult or prevent it. It is further noticed that glycan structures may represent biological activity modifying the cell status.
Cell Preparation Methods Including Glycan-Controlled Reagents
The present invention is further directed to specific cell purification methods including glycan-controlled reagents.
Preferred Controlled Cell Purification Process
When the binders are used for cell purification or other process after which cells are used in method where the glycans of the binder may have biological effect the binders are preferably glycan controlled or glycan neutralized proteins.
The present invention is especially directed to controlled production of human early cells containing one or several following steps. It was realized that on each step using regular reagents in following process there is risk of contamination by extragenous glycan material. The process is directed to the use of controlled reagents and materials according to the invention in the steps of the process.
Preferred purification of cells includes at least one of the steps including the use of controlled reagent, more preferably at least two steps are included, more preferably at least 3 steps and most preferably at least steps 1, 2, 3, 4, and 6.
In a preferred embodiment the preferred process is a method using immunomagnetic beads for purification of early human cells, preferably purification of cord blood cells.
The present invention is further directed to cell purification kit, preferably an immunomagnetic cell purification kit comprising at least one controlled reagent, more preferably at least two controlled reagents, even more preferably three controlled reagents, even preferably four reagents and most preferably the preferred controlled reagents are selected from the group: albumin, gelatin, antibody for cell purification and Fc-receptor blocking reagent, which may be an antibody.
Contaminations with Harmful Glycans Such as Antigenic Animal Type Glycans
Several glycans structures contaminating cell products may weaken the biological activity of the product.
The harmful glycans can affect the viability during handling of cells, or viability and/or desired bioactivity and/or safety in therapeutic use of cells.
The harmful glycan structures may reduce the in vitro or in vivo viability of the cells by causing or increasing binding of destructive lectins or antibodies to the cells. Such protein material may be included e.g. in protein preparations used in cell handling materials. Carbohydrate targeting lectins are also present on human tissues and cells, especially in blood and endothelial surfaces. Carbohydrate binding antibodies in human blood can activate complement and cause other immune responses in vivo. Furthermore immune defence lectins in blood or leukocytes may direct immune defence against unusual glycan structures.
Additionally harmful glycans may cause harmful aggregation of cells in vivo or in vitro. The glycans may cause unwanted changes in developmental status of cells by aggregation and/or changes in cell surface lectin mediated biological regulation.
Additional problems include allergenic nature of harmful glycans and misdirected targeting of cells by endothelial/cellular carbohydrate receptors in vivo.
Common Structural Features of all Glycomes and Preferred Common Subfeatures
The present invention reveals useful glycan markers for stem cells and combinations thereof and glycome compositions comprising specific amounts of key glycan structures. The invention is furthermore directed to specific terminal and core structures and to the combinations thereof.
The preferred glycome glycan structure(s) and/or glycomes from cells according to the invention comprise structure(s) according to the formula C0:
R1Hexβz{R3}n1Hex(NAc)n2XyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
Hex is Gal or Man or GlcA,
HexNAc is GlcNAc or GalNAc,
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon,
z is linkage position 3 or 4, with the provision that when z is 4 then HexNAc is GlcNAc and then Hex is Man or Hex is Gal or Hex is GlcA, and
when z is 3 then Hex is GlcA or Gal and HexNAc is GlcNAc or GalNAc;
n1 is 0 or 1 indicating presence or absence of R3;
n2 is 0 or 1, indicating the presence or absence of NAc, with the proviso that n2 can be 0 only when Hexβz is Galβ4, and n2 is preferably 0, n2 structures are preferably derived from glycolipids;
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures or nothing;
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein, or natural serine or threonine linked O-glycoside derivative such as serine or threonine linked O-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein, or when n2 is 1 R2 is nothing or a ceramide structure or a derivative of a ceramide structure, such as lysolipid and amide derivatives thereof;
R3 is nothing or a branching structure representing a GlcNAcβ6 or an oligosaccharide with GlcNAcβ6 at its reducing end linked to GalNAc (when HexNAc is GalNAc); or when Hex is Gal and HexNAc is GlcNAc, and when z is 3 then R3 is Fucα4 or nothing, and when z is 4 R3 is Fucα3 or nothing.
The preferred disaccharide epitopes in the glycan structures and glycomes according to the invention include structures Galβ4GlcNAc, Manβ4GlcNAc, GlcAβ4GlcNAc, Galβ3GlcNAc, Galβ3GalNAc, GlcAβ3GlcNAc, GlcAβ3GalNAc, and Galβ4Glc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues and is in a separate embodiment branched from the reducing end residue. Preferred branched epitopes include Galβ4(Fucα3)GlcNAc, Galβ3(Fucα4)GlcNAc, and Galβ3(GlcNAcβ6)GalNAc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues.
Preferred Epitopes for Methods According to the Invention
N-Acetyllactosamine Galβ3/4GlcNAc Terminal Epitopes
The two N-acetyllactosamine epitopes Galβ4GlcNAc and/or Galβ3GlcNAc represent preferred terminal epitopes present on stem cells or backbone structures of the preferred terminal epitopes for example further comprising sialic acid or fucose derivatisations according to the invention. In a preferred embodiment the invention is directed to fucosylated and/or non-substituted glycan non-reducing end forms of the terminal epitopes, more preferably to fucosylated and non-substituted forms. The invention is especially directed to non-reducing end terminal (non-substituted) natural Galβ4GlcNAc and/or Galβ3GlcNAc-structures from human stem cell glycomes. The invention is in a specific embodiment directed to non-reducing end terminal fucosylated natural Galβ4GlcNAc and/or Galβ3GlcNAc-structures from human stem cell glycomes.
Preferred Fucosylated N-Acetyllactosamines
The preferred fucosylated epitopes are according to the Formula TF:
(Fucα2)n1Galβ3/4(Fucα4/3)GlcNAcβ-R
Wherein
n1 is 0 or 1 indicating presence or absence of Fucα2;
n2 is 0 or 1, indicating the presence or absence of Fucα4/3 (branch), and
R is the reducing end core structure of N-glycan, O-glycan and/or glycolipid.
The preferred structures thus include type 1 lactosamines (Galβ3GlcNAc based):
Galβ3(Fucα4)GlcNAc (Lewis a), Fucα2Galβ3GlcNAc H-type 1, structure and, Fucα2Galβ3(Fucα4)GlcNAc (Lewis b) and
type 2 lactosamines (Galβ4GlcNAc based):
Galβ4(Fucα3)GlcNAc (Lewis x), Fucα2Galβ4GlcNAc H-type 2, structure and, Fucα2Galβ4(Fucα3)GlcNAc (Lewis y).
The type 2 lactosamines (fucosylated and/or terminal non-substituted) form an especially preferred group in context of adult stem cells and differentiated cells derived directly from these. Type 1 lactosamines (Galβ3GlcNAc-structures) arc especially preferred in context of embryonal-type stem cells.
Lactosamines Galβ3/4GlcNAc and Glycolipid Structures Comprising Lactose Structures (Galβ4 Glc)
The lactosamines form a preferred structure group with lactose-based glycolipids. The structures share similar features as products of β3/4Gal-transferases. The β3/4 galactose based structures were observed to produce characteristic features of protein linked and glycolipid glycomes.
The invention revealed that furthermore Galβ3/4GlcNAc-structures are a key feature of differentiation related structures on glycolipids of various stem cell types. Such glycolipids comprise two preferred structural epitopes according to the invention. The most preferred glycolipid types include thus lactosylceramide based glycosphingolipids and especially lacto-(Galβ3GlcNAc), such as
lactotetraosylceramide Galβ3GlcNAcβ3Galβ4GlcβCer, preferred structures further including its non-reducing terminal structures selected from the group: Galβ3(Fucα4)GlcNAc (Lewis a), Fucα2Galβ3GlcNAc (H-type 1), structure and, Fucα2Galβ3(Fucα4)GlcNAc (Lewis b) or sialylated structure SAα3Galβ3GlcNAc or SAα3Galβ3(Fucα4)GlcNAc, wherein SA is a sialic acid, preferably Neu5Ac preferably replacing Galβ3GlcNAc of lactotetraosylceramide and its fucosylated and/or elogated variants such as preferably according to the Formula:
(Sacα3)n5(Fucα2)n1Galβ3(Fucα4)n3GlcNAcβ3[Galβ3/4(Fucα4/3)GlcNAcβ3]n4Galβ4GlcβCer
wherein
n1 is 0 or 1, indicating presence or absence of Fucα2;
n2 is 0 or 1, indicating the presence or absence of Fucα4/3 (branch),
n3 is 0 or 1, indicating the presence or absence of Fucα4 (branch)
n4 is 0 or 1, indicating the presence or absence of (fucosylated) N-acetyllactosamine elongation;
n5 is 0 or 1, indicating the presence or absence of Sacα3 elongation;
Sac is terminal structure, preferably sialic acid, with α3-linkage, with the proviso that when Sac is present, n5 is 1, then n1 is 0
and
neolacto (Galβ4GlcNAc)-comprising glycolipids such as
neolactotetraosylceramide Galβ4GlcNAcβ3Galβ4GlcβCer, preferred structures further including its non-reducing terminal Galβ4(Fucα3)GlcNAc (Lewis x), Fucα2Galβ4GlcNAc H-type 2, structure and, Fucα2Galβ4(Fucα3)GlcNAc (Lewis y)
and
its fucosylated and/or elogated variants such as preferably (Sacα3/6)n5(Fucα2)n1Galβ4(Fucα3)n3GlcNAcβ3[Galβ4(Fucα3)n2GlcNAcβ3]n4Galβ4GlcβCer
n1 is 0 or 1 indicating presence or absence of Fucα2;
n2 is 0 or 1, indicating the presence or absence of Fucα3 (branch),
n3 is 0 or 1, indicating the presence or absence of Fucα3 (branch)
n4 is 0 or 1, indicating the presence or absence of (fucosylated) N-acetyllactosamine elongation,
n5 is 0 or 1, indicating the presence or absence of Sacα3/6 elongation;
Sac is terminal structure, preferably sialic acid (SA) with α3-linkage, or sialic acid with α6-linkage, with the proviso that when Sac is present, n5 is 1, then n1 is 0, and when sialic acid is bound by α6-linkage preferably also n3 is 0.
Preferred Stem Cell Glycosphingolipid Glycan Profiles, Compositions, and Marker Structures
The inventors were able to describe stem cell glycolipid glycomes by mass spectrometric profiling of liberated free glycans, revealing about 80 glycan signals from different stem cell types. The proposed monosaccharide compositions of the neutral glycans were composed of 2-7 Hex, 0-5 HexNAc, and 0-4 dHex. The proposed monosaccharide compositions of the acidic glycan signals were composed of 0-2 NeuAc, 2-9 Hex, 0-6 HexNAc, 0-3 dHex, and/or 0-1 sulphate or phosphate esters. The present invention is especially directed to analysis and targeting of such stem cell glycan profiles and/or structures for the uses described in the present invention with respect to stem cells.
The present invention is further specifically directed to glycosphingolipid glycan signals specific tostem cell types as described in the Examples. In a preferred embodiment, glycan signals typical to hESC, preferentially including 876 and 892 are used in their analysis, more preferentially FucHexHexNAcLac, wherein α1,2-Fuc is preferential to α1,3/4-Fuc, and Hex2HexNAc1Lac, and more preferentially to Galβ3[Hex1HexNAc1]Lac. In another preferred embodiment, glycan signals typical to MSC, especially CB MSC, preferentially including 1460 and 1298, as well as large neutral glycolipids, especially Hex2-3HexNAc3Lac, more preferentially poly-N-acetyllactosamine chains, even more preferentially β1,6-branched, and preferentially terminated with type II LacNAc epitopes as described above, are used in context of MSC according to the uses described in the present invention.
Terminal glycan epitopes that were demonstrated in the present experiments in stem cell glycosphingolipid glycans are useful in recognizing stem cells or specifically binding to the stem cells via glycans, and other uses according to the present invention, including terminal epitopes: Gal, Galβ4Glc (Lac), Galβ4GlcNAc (LacNAc type 2), Galβ3, Non-reducing terminal HexNAc, Fuc, α1,2-Fuc, α1,3-Fuc, Fucα2Gal, Fucα2Galβ4GlcNAc (H type 2), Fucα2Galβ4Glc (2′-fucosyllactose), Fucα3GlcNAc, Galβ4(Fucα3)GlcNAc (Lex), Fucα3Glc, Galβ4(Fucα3)Glc (3-fucosyllactose), Neu5Ac, Neu5Acα2,3, and Neu5Acα2,6. The present invention is further directed to the total terminal epitope profiles within the total stem cell glycosphingolipid glycomes and/or glycomes.
The inventors were further able to characterize in hESC the corresponding glycan signals to SSEA-3 and SSEA-4 developmental related antigens, as well as their molar proportions within the stem cell glycome. The invention is further directed to quantitative analysis of such stem cell epitopes within the total glycomes or subglycomes, which is useful as a more efficient alternative with respect to antibodies that recognize only surface antigens. In a further embodiment, the present invention is directed to finding and characterizing the expression of cryptic developmental and/or stem cell antigens within the total glycome profiles by studying total glycan profiles, as demonstrated in the Examples for α1,2-fucosylated antigen expression in hESC in contrast to SSEA-1 expression in mouse ES cells.
The present invention revealed characteristic variations (increased or decreased expression in comparison to similar control cell or a contaminating cell or like) of both structure types in various cell materials according to the invention. The structures were revealed with characteristic and varying expression in three different glycome types: N-glycans, O-glycans, and glycolipids. The invention revealed that the glycan structures are a characteristic feature of stem cells and are useful for various analysis methods according to the invention. Amounts of these and relative amounts of the epitopes and/or derivatives varies between cell lines or between cells exposed to different conditions during growing, storage, or induction with effector molecules such as cytokines and/or hormones.
The preferred glycome glycan structure(s) and/or glycomes from cells according to the invention comprise structure(s) according to the formula C1:
R1Hexβz{R3}n1HexNAcXyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
Hex is Gal or Man or GlcA,
HexNAc is GlcNAc or GalNAc,
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon,
z is linkage position 3 or 4, with the provision that when z is 4 then HexNAc is GlcNAc and then Hex is Man or Hex is Gal or Hex is GlcA, and
when z is 3 then Hex is GlcA or Gal and HexNAc is GlcNAc or GalNAc,
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein, or natural serine or threonine linked O-glycoside derivative such as serine or threonine linked O-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
R3 is nothing or a branching structure representing a GlcNAcβ6 or an oligosaccharide with GlcNAcβ6 at its reducing end linked to GalNAc (when HexNAc is GalNAc) or when Hex is Gal and HexNAc is GlcNAc the then when z is 3 R3 is Fucα4 or nothing and when z is 4 R3 is Fucα3 or nothing.
The preferred disaccharide epitopes in the glycan structures and glycomes according to the invention include structures Galβ4GlcNAc, Manβ4GlcNAc, GlcAβ4GlcNAc, Galβ3GlcNAc, Galβ3GalNAc, GlcAβ3GlcNAc and GlcAβ3GalNAc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues and is separate embodiment branched from the reducing end residue. Preferred branched epitopes include Galβ4(Fucα3)GlcNAc, Galβ3(Fucα4)GlcNAc, Galβ3(GlcNAcβ6)GalNAc, which may be further derivatized from reducing end carbon atom and non-reducing monosaccharide residues.
The preferred disaccharide epitopes of glycoprotein or glycolipid structures present on glycans of human cells according to the invention comprise structures based on the formula C2:
R1Hexβ4GlcNAcXyR2,
Wherein Hex is Gal OR Man and when Hex is Man then X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and when Hex is Gal then X is β3GalNAc of O-glycan core or β2/4/6Manα3/6 terminal of N-glycan core (as in formula NC3)
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon,
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
when Hex is Gal preferred R1 groups include structures SAα3/6, SAα3/6Galβ4GlcNAcβ3/6,
when Hex is Man preferred R1 groups include Manα3, Manα6, branched structure
Manα3 {Manα6} and elongated variants thereof as described for low mannose, high-mannose and complex type N-glycans below,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein, or natural serine or threonine linked O-glycoside derivative such as serine or threonine linked O-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
Structures of N-Linked Glycomes
The Minimum Formula
The present invention is directed to glycomes derived from stem cells and comprising a common N-glycosidic core structures. The invention is specifically directed to minimum formulas covering both GN1-glycomes and GN2-glycomes with difference in reducing end structures.
The minimum core structure includes glycans from which reducing end GlcNAc or Fucα6GlcNAc has been released. These are referred as GN1-glycomes and the components thereof as GN1-glycans. The present invention is specifically directed to natural N-glycomes from human stem cells comprising GN1-glycans. In a preferred embodiment the invention is directed to purified or isolated practically pure natural GN1-glycome from human stem cells. The release of the reducing end GlcNAc-unit completely or partially may be included in the production of the N-glycome or N-glycans from stem cells for analysis.
The glycomes including the reducing end GlcNAc or Fucα6GlcNAc are referred as GN2-glycomes and the components thereof as GN2-glycans. The present invention is also specifically directed to natural N-glycomes from human stem cells comprising GN2-glycans. In a preferred embodiment the invention is directed to purified or isolated practically pure natural GN2-glycome from human stem cells.
The preferred N-glycan core structure(s) and/or N-glycomes from stem cells according to the invention comprise structure(s) according to the formula NC1:
R1Mβ4GNXyR2,
Wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
It is realized that when the invention is directed to a glycome, the formula indicates mixture of several or typically more than ten or even higher number of different structures according to the Formulas describing the glycomes according to the invention.
The possible carbohydrate substituents R1 comprise at least one mannose (Man) residue, and optionally one or several GlcNAc, Gal, Fuc, SA and/GalNAc residues, with possible sulphate and or phosphate modifications.
When the glycome is released by N-glycosidase the free N-glycome saccharides comprise in a preferred embodiment reducing end hydroxyl with anomeric linkage A having structure α and/or β, preferably both α and β. In another embodiment the glycome is derivatized by a molecular structure which can be reacted with the free reducing end of a released glycome, such as amine, aminooxy or hydrazine or thiol structures. The derivatizing groups comprise typically 3 to 30 atoms in aliphatic or aromatic structures or can form terminal group spacers and link the glycomes to carriers such as solid phases or microparticels, polymeric carries such as oligosaccharides and/or polysaccharide, peptides, dendrimer, proteins, organic polymers such as plastics, polyethyleneglycol and derivatives, polyamines such as polylysines.
When the glycome comprises asparagine N-glycosides, A is preferably beta and R is linked asparagine or asparagine peptide. The peptide part may comprise multiple different aminoacid residues and typically multiple forms of peptide with different sequences derived from natural proteins carrying the N-glycans in cell materials according to the invention. It is realized that for example proteolytic release of glycans may produce mixture of glycopeptides. Preferably the peptide parts of the glycopeptides comprises mainly a low number of amino acid residues, preferably two to ten residues, more preferably two to seven amino acid residues and even more preferably two to five aminoacid residues and most preferably two to four amino acid residues when “mainly” indicates preferably at least 60% of the peptide part, more preferably at least 75% and most preferably at least 90% of the peptide part comprising the peptide of desired low number of aminoacid residues.
The Preferred GN2- N-Glycan Core Structures
The preferred GN2- N-glycan core structure(s) and/or N-glycomes from stem cells according to the invention comprise structure(s) according to the formula NC2:
R1Mβ4GNβ4(Fucα6)nGNyR2,
wherein n is 0 or 1 and
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein.
The preferred compositions thus include one or several of the following structures
NC2a: Mα3 {Mα6}Mβ4GNβ4{Fucα6}n1GNyR2
NC2b: Mα6Mβ4GNβ4{Fucα6}n1GNyR2
NC2c: Mα3Mβ4GNβ4{Fucα6}n1GNyR2
More preferably compositions comprise at least 3 of the structures or most preferably both structures according to the formula NC2a and at least both fucosylated and non-fucosylated with core structure(s) NC2b and/or NC2c.
The Preferred GN1- N-Glycan Core Structure(s)
The preferred GN1- N-glycan core structure(s) and/or N-glycomes from stem cells according to the invention comprise structure(s) according to the formula NC3:
R1Mβ4GNyR2,
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 indicates 1-4, preferably 1-3, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein.
Multi-Mannose GN1- N-Glycan Core Structure(s)
The invention is specifically directed glycans and/or glycomes derived from preferred cells according to the present invention when the natural glycome or glycan comprises Multi-mannose GN1- N-glycan core structure(s) structure(s) according to the formula NC4:
[R1Mα3]n3{R3Mα6}n2Mβ4GNXyR2,
R1 and R3 indicate nothing or one or two, natural type carbohydrate substituents linked to the core structures, when the substituents are α-linked mannose monosaccharide and/or oligosaccharides and the other variables are as described above.
Furthermore common elongated GN2- N-glycan core structures are preferred types of glycomes according to the invention
The preferred N-glycan core structures further include differently elongated GN2- N-glycan core structures according to the formula NC5:
[R1Mα3]n3{R3Mα6}n2Mβ4GNβ4{Fucα6}n1GNyR2,
wherein n1, n2 and n3 are either 0 or 1 and
wherein y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon and
R1 and R3 indicate nothing or 1-4, preferably 1-3, most preferably one or two, natural type carbohydrate substituents linked to the core structures,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagine N-glycoside aminoacids and/or peptides derived from protein,
GN is GlcNAc, M is mannosyl-, [ ] indicate groups either present or absent in a linear sequence.
{ }indicates branching which may be also present or absent.
with the provision that at least n2 or n3 is 1. Preferably the invention is directed to compositions comprising with all possible values of n2 and n3 and all saccharide types when R1 and/or are R3 are oligosaccharide sequences or nothing.
Preferred N-Glycan Types in Glycomes Comprising N-Glycans
The present invention is preferably directed to N-glycan glycomes comprising one or several of the preferred N-glycan core types according to the invention. The present invention is specifically directed to specific N-glycan core types when the compositions comprise N-glycan or N-glycans from one or several of the groups Low mannose glycans, High mannose glycans, Hybrid glycans, and Complex glycans, in a preferred embodiment the glycome comprise substantial amounts of glycans from at least three groups, more preferably from all four groups.
Major Subtypes of N-Glycans in N-Linked Glycomes
The invention revealed certain structural groups present in N-linked glycomes. The grouping is based on structural features of glycan groups obtained by classification based on the monosaccharide compositions and structural analysis of the structurel groups. The glycans were analysed by NMR, specific binding reagents including lectins and antibodies and specific glycosidases releasing monosaccharide residues from glycans. The glycomes are preferably analysed as neutral and acidic glycomes
The Major Neutral Glycan Tapes
The neutral glycomes mean glycomes comprising no acidic monosaccharide residues such as sialic acids (especially NcuNAc and NcuGc), HexA (cspecially GlcA, glucuronic acid) and acid modification groups such as phosphate and/or sulphate esters. There are four major types of neutral N-linked glycomes which all share the common N-glycan core structure: High-mannose N-glycans, low-mannose N-glycans, hydrid type and complex type N-glycans. These have characteristic monosaccharide compositions and specific substructures. The complex and hybrid type glycans may include certain glycans comprising monoantennary glycans.
The groups of complex and hybrid type glycans can be further analysed with regard to the presence of one or more fucose residues. Glycans containing at least one fucose units are classified as fucosylated. Glycans containing at least two fucose residues are considered as glycans with complex fucosylation indicating that other fucose linkages, in addition to the α1,6-linkage in the N-glycan core, arc present in the structure. Such linkages include α1,2-, α1,3-, and α1,4-linkage.
Furthermore the complex type N-glycans may be classified based on the relations of HexNAc (typically GlcNAc or GalNAc) and Hex residues (typically Man, Gal). Terminal HexNAc glycans comprise at least three HexNAc units and at least two Hexose units so that the number of Hex Nac residues is at least larger or equal to the number of hexose units, with the provision that for non branched, monoantennary glycans the number of HexNAcs is larger than number of hexoses.
This consideration is based on presence of two GlcNAc units in the core of N-glycan and need of at least two Mannose units to for a single complex type N-glycan branch and three mannose to form a trimannosyl core structure for most complex type structures. A specific group of HexNAc N-Glycans contains the same number of HexNAcs and Hex units, when the number is at least 5.
Preferred Mannose Type Structures
The invention is further directed to glycans comprising terminal Mannose such as Mα6-residue or both Manα6- and Manα3-residues, respectively, can additionally substitute other Mα2/3/6 units to form a Mannose-type structures including hydrid, low-Man and High-Man structures according to the invention.
Preferred high- and low mannose type structures with GN2-core structure are according to the Formula M2:
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6)]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNβ4[{Fucα6}]mGNyR2
wherein p, n1, n2, n3, n4, n5, n6, n7, n8, and m are either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0;
y is anomeric linkage structure (x and/or 0 or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacid and/or peptides derived from protein;
[ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, n5, n6, n7, n8, and m; and
{ } indicates a branch in the structure.
Preferred yR2-structures include [β-N-Asn]p, wherein p is either 0 or 1.
Preferred Mannose Type Glycomes Comprising GN1-Core Structures
As described above a preferred variant of N-glycomes comprising only single GlcNAc-residue in the core. Such structures are especially preferred as glycomes produced by endo-N-acetylglucosaminidase enzymes and Soluble glycomes. Preferred Mannose type glycomesnclude structures according to the Formula M2
[Mα2]n1[Mα3]n2{[Mα2]n3[Mα6)]n4}[Mα6]n5{[Mα2]n6[Mα2]n7[Mα3]n8}Mβ4GNyR2
Fucosylated high-mannose N-glycans according to the invention have molecular compositions Man5-9GlcNAc2Fuc1. For the fucosylated high-mannose glycans according to the formula, the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is an integer from 4 to 8 and m is 0.
The low-mannose structures have molecular compositions Man1-4GlcNAc2Fuc0-1. They consist of two subgroups based on the number of Fuc residues: 1) nonfucosylated low-mannose structures have molecular compositions Man1-4GlcNAc2 and 2) fucosylated low-mannose structures have molecular compositions Man1-4GlcNAc2Fuc1. For the low mannose glycans the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); and preferably n1, n3, n6, and n7 are 0 when m is 0.
Low Mannose Glycans
The invention revealed a very unusual group of glycans in N-glycomes of the invention defined here as low mannose N-glycans. These are not clearly linked to regular biosynthesis of N-glycans, but may represent unusual biosynthetic midproducts or degradation products. The low mannose glycans are especially characteristics changing during the changes of cell status, the differentiation and other changes according to the invention, for examples changes associated with differentiation status of embryonal-type stem cells and their differentiated products and control cell materials. The invention is especially directed to recognizing low amounts of low-mannose type glycans in cell types, such as stem cells, preferably embryonal type stem cells with low degree of differentiation.
The invention revealed large differences between the low mannose glycan expression in the early human blood cell glycomes, especially in different preferred cell populations from human cord blood.
The invention is especially directed to the use of specific low mannose glycan comprising glycomes for analysis of early human blood glycomes especially glycomes from cord blood.
The invention further revealed specific mannose directed recognition methods useful for recognizing the preferred glycomes according to the invention. The invention is especially directed to combination of glycome analysis and recognition by specific binding agents, most preferred binding agent include enzymes and theis derivatives. The invention further revealed that specific low mannose glycans of the low mannose part of the glycomes can be recognized by degradation by specific α-mannosidase (Man2-4GlcNAc2Fuc0-1) or β-mannosidase (Man1GlcNAc2Fuc0-1) enzymes and optionally further recognition of small low mannose structures, even more preferably low mannose structures comprising terminal Manβ4-structures according to the invention.
The low mannose N-glycans, and preferred subgroups and individual structures thereof, are especially preferred as markers of the novel glycome compositions of the cells according to the invention useful for characterization of the cell types.
The low-mannose type glycans includes a specific group of α3- and/or α6-linked mannose type structures according to the invention including a preferred terminal and core structure types according to the invention.
The inventions further revealed that low mannose N-glycans comprise a unique individual structural markers useful for characterization of the cells according to the invention by specific binding agents according to the invention or by combinations of specific binding agents according to the invention.
Neutral low-mannose type N-glycans comprise one to four or five terminal Man-residues, preferentially Manα structures; for example Man0-3Manβ4GlcNAcβ4GlcNAc(β-N-Asn) or Manα0-4Manβ4GlcNAcβ4(Fucα6)GlcNAc(β-N-Asn).
Low-mannose N-glycans are smaller and more rare than the common high-mannose N-glycans (Man5-9GlcNAc2). The low-mannose N-glycans detected in cell samples fall into two subgroups: 1) non-fucosylated, with composition MannGlcNAc2, where 1≦n≦4, and 2) core-fucosylated, with composition MannGlcNAc2Fuc1, where 1≦n≦5. The largest of the detected low-mannose structure structures is Man5GlcNAc2Fuc1 (m/z 1403 for the sodium adduct ion), which due to biosynthetic reasons most likely includes the structure below (in the figure the glycan is free oligosaccharide and β-anomer; in glycoproteins in tissues the glycan is N-glycan and β-anomer):
Preferred general molecular structural features of low Man glycans According to the present invention, low-mannose structures arc preferentially identified by mass spectrometry, preferentially based on characteristic Hex1-4HexNAc2dHex0-1 monosaccharide composition. The low-mannose structures are further preferentially identified by sensitivity to exoglycosidase digestion, preferentially α-mannosidase (Hex2-4HexNAc2dHexc0-1) or β-mannosidase (Hex1HexNAc2dHex0-1) enzymes, and/or to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins, Endoglycosidase H detachment from glycoproteins (only Hex1-4HexNAc2 liberated as Hex1-4HexNAc1), and/or Endoglycosidase F2 digestion (only Hex1-4HexNAc2dHex1 digested to Hex1-4HexNAc1). The low-mannose structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manβ4GlcNAcβ4GlcNAc N-glycan core structure and Manα residues attached to the Manβ4 residue.
Several preferred low Man glycans described above can be presented in a single Formula:
[Mα3]n2{[Mα6)]n4}[Mα6]n5{[Mα3]n8}Mβ4GNβ4[{Fucα6}]mGNyR2
wherein p, n2, n4, n5, n8, and m arc either independently 0 or 1; with the proviso that when n2 is 0, also n1 is 0; when n4 is 0, also n3 is 0; when n5 is 0, also n1, n2, n3, and n4 are 0; when n7 is 0, also n6 is 0; when n8 is 0, also n6 and n7 are 0; the sum of n1, n2, n3, n4, n5, n6, n7, and n8 is less than or equal to (m+3); [ ] indicates determinant either being present or absent depending on the value of n2, n4, n5, n8, and m; and
{ } indicates a branch in the structure;
y and R2 are as indicated above.
Preferred non-fucosylated low-mannose glycans are according to the formula:
[Mα3]n2([Mα6)]n4)[Mα6]n5{[Mα3]n8}Mβ4GNβ4GNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1,
with the provisio that when n5 is 0, also n2 and n4 are 0, and preferably either n2 or n4 is 0,
[ ] indicates determinant either being present or absent depending on the value of, n2, n4, n5, n8,
{ } and ( ) indicates a branch in the structure,
y and R2 are as indicated above.
Preferred Individual Structures of Non-Fucosylated Low-Mannose Glycans
Special Small Structures
Small non-fucosylated low-mannose structures are especially unusual among known N-linked glycans and characteristic glycans group useful for separation of cells according to the present invention. These include:
Mβ4GNβ4GNyR2
Mα6Mβ4GNβ4GNyR2
Mα3Mβ4GNβ4GNyR2 and
Mα6{Mα3}Mβ4GNβ4GNyR2.
Mβ4GNβ4GNyR2 trisaccharide epitope is a preferred common structure alone and together with its mono-mannose derivatives Mα6Mβ4GNβ4GNyR2 and/or Mα3Mβ4GNβ4GNyR2, because these are characteristic structures commonly present in glycomes according to the invention. The invention is specifically directed to the glycomes comprising one or several of the small non-fucosylated low-mannose structures. The tetrasaccharides are in a specific embodiment preferred for specific recognition directed to α-linked, preferably α3/6-linked Mannoses as preferred terminal recognition element.
Special Large Structures
The invention further revealed large non-fucosylated low-mannose structures that are unusual among known N-linked glycans and have special characteristic expression features among the preferred cells according to the invention. The preferred large structures include [Mα3]n2([Mα6]n4)Mα6{Mα3}Mβ4GNβ4GNyR2
more specifically
Mα6Mα6{Mα3}Mβ4GNβ4GNyR2
Mα3Mα6{Mα3}Mβ4GNβ4GNyR2 and
Mα3(Mα6)Mα6{Mα3}Mβ4GNβ4GNyR2.
The hexasaccharide epitopes are preferred in a specific embodiment as rare and characteristic structures in preferred cell types and as structures with preferred terminal epitopes. The heptasaccharide is also preferred as structure comprising a preferred unusual terminal epitope Mα3(Mα6)Mα useful for analysis of cells according to the invention.
Preferred fucosylated low-mannose glycans are derived according to the formula:
[Mα3]n2{[Mα6]n4}[Mα6]n5{[Mα3]n8}Mβ4GNβ4(Fucα6)GNyR2
wherein p, n2, n4, n5, n8, and m are either independently 0 or 1, with the provisio that when n5 is 0, also n2 and n4 are 0, [ ] indicates determinant either being present or absent depending on the value of n1, n2, n3, n4, ( ) indicates a branch in the structure;
and wherein n1, n2, n3, n4 and m are either independently 0 or 1,
with the provisio that when n3 is 0, also n1 and n2 are 0,
[ ] indicates determinant either being present or absent
depending on the value of n1, n2, n3, n4 and m,
{ } and ( ) indicate a branch in the structure.
Preferred Individual Structures of Fucosylated Low-Mannose Glycans
Small fucosylated low-mannose structures are especially unusual among known N-linked glycans and form a characteristic glycan group useful for separation of cells according to the present invention. These include:
Mβ4GNβ4(Fucα6)GNyR2
Mα6Mβ4GNβ4(Fucα6)GNyR2
Mα3Mβ4GNβ4(Fucα6)GNyR2 and
Mα6{Mα3}Mβ4GNβ4(Fucα6)GNyR2.
Mβ4GNβ4(Fucα6)GNyR2 tetrasaccharide epitope is a preferred common structure alone and together with its mono-mannose derivatives Mα6Mβ4GNβ4(Fucα6)GNyR2 and/or Mα3Mβ4GNβ4(Fucα6)GNyR2, because these are commonly present characteristics structures in glycomes according to the invention. The invention is specifically directed to the glycomes comprising one or several of the small non-fucosylated low-mannose structures. The tetrasaccharides are in a specific embodiment preferred for specific recognition directed to α-linked, preferably α3/6-linked Mannoses as preferred terminal recognition element.
Special Large Structures
The invention further revealed large fucosylated low-mannose structures are unusual among known N-linked glycans and have special characteristic expression features among the preferred cells according to the invention. The preferred large structure includes
[Mα3]n2([Mα6]n4)Mα6{Mα3}Mβ4GNβ4(Fucα6)GNyR2
more specifically
Mα6Mα6{Mα3}Mβ4GNβ4(Fucα6)GNyR2
Mα3Mα6{Mα3}Mβ4GNβ4(Fucα6)GNyR2 and
Mα3(Mα6)Mα6{Mα3}Mβ4GNβ4(Fucα6)GNyR2.
The heptasaccharide epitopes are preferred in a specific embodiment as rare and characteristic structures in preferred cell types and as structures with preferred terminal epitopes. The octasaccharide is also preferred as structure comprising a preferred unusual terminal epitope Mα3(Mα6)Mα useful for analysis of cells according to the invention.
Preferred Non-Reducing End Terminal Mannose-Epitopes
The inventors revealed that mannose-structures can be labeled and/or otherwise specifically recognized on cell surfaces or cell derived fractions/materials of specific cell types. The present invention is directed to the recognition of specific mannose epitopes on cell surfaces by reagents binding to specific mannose structures from cell surfaces.
The preferred reagents for recognition of any structures according to the invention include specific antibodies and other carbohydrate recognizing binding molecules. It is known that antibodies can be produced for the specific structures by various immunization and/or library technologies such as phage display methods representing variable domains of antibodies. Similarily with antibody library technologies, including aptamer technologies and including phage display for peptides, exist for synthesis of library molecules such as polyamide molecules including peptides, especially cyclic peptides, or nucleotide type molecules such as aptamer molecules.
The invention is specifically directed to specific recognition high-mannose and low-mannose structures according to the invention. The invention is specifically directed to recognition of non-reducing end terminal Manα-epitopes, preferably at least disaccharide epitopes, according to the formula:
[Mα2]m1[Mαx]m2[Mα6]m3{{[Mα2]m9[Mα2]m8[Mα3]m7}m10(Mβ4[GN]m4)m5}m6yR2
wherein m1, m2, m3, m4, m5, m6, m7, m8, m9 and m10 are independently either 0 or 1; with the proviso that when m3 is 0, then m1 is 0 and, when m7 is 0 then either m1-5 are 0 and m8 and m9 are 1 forming Mα2Mα2-disaccharide or both m8 and m9 are 0
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R2 is reducing end hydroxyl, chemical reducing end derivative
and x is linkage position 3 or 6 or both 3 and 6 forming branched structure,
{ } indicates a branch in the structure.
The invention is further directed to terminal Mα2-containing glycans containing at least one Mα2-group and preferably Mα2-group on each, branch so that m1 and at least one of m8 or m9 is 1. The invention is further directed to terminal Mα3 and/or Mα6-epitopes without terminal Mα2-groups, when all m1, m8 and m9 are 1.
The invention is further directed in a preferred embodiment to the terminal epitopes linked to a Mβ-residue and for application directed to larger epitopes. The invention is especially directed to Mβ4GN-comprising reducing end terminal epitopes.
The preferred terminal epitopes comprise typically 2-5 monosaccharide residues in a linear chain. According to the invention short epitopes comprising at least 2 monosaccharide residues can be recognized under suitable background conditions and the invention is specifically directed to epitopes comprising 2 to 4 monosaccharide units and more preferably 2-3 monosaccharide units, even more preferred epitopes include linear disaccharide units and/or branched trisaccharide non-reducing residue with natural anomeric linkage structures at reducing end. The shorter epitopes may be preferred for specific applications due to practical reasons including effective production of control molecules for potential binding reagents aimed for recognition of the structures.
The shorter epitopes such as Mα2M-may is often more abundant on target cell surface as it is present on multiple arms of several common structures according to the invention.
Preferred Disaccharide Epitopes Includes
Manα2Man, Manα3Man, Manα6Man, and more preferred anomeric forms Manα2Manα, Manα3Manβ, Manα6Manβ, Manα3Manα and Manα6Manα.
Preferred branched trisaccharides includes Manα3(Manα6)Man, Manα3(Manα6)Manβ, and Manα3(Manα6)Manα.
The invention is specifically directed to the specific recognition of non-reducing terminal Manα2-structures especially in context of high-mannose structures.
The invention is specifically directed to following linear terminal mannose epitopes:
a) preferred terminal Manα2-epitopes including following oligosaccharide sequences:
Manα2Man,
Manα2Manα,
Manα2Manα2Man, Manα2Manα3Man, Manα2Manα6Man,
Manα2Manα2Manα, Manα2Manα3Manβ, Manα2Manα6Manα,
Manα2Manα2Manα3Man, Manα2Manα3Manα6Man, Manα2Manα6Manα6Man
Manα2Manα2Manα3Manβ, Manα2Manα3Manα6Manβ, Manα2Manα6Manα6Manβ;
The invention is further directed to recognition of and methods directed to non-reducing end terminal Manα3- and/or Manα6-comprising target structures, which arc characteristic features of specifically important low-mannose glycans according to the invention. The preferred structural groups includes linear epitopes according to b) and branched epitopes according to the c3) especially depending on the status of the target material.
b) preferred terminal Manα3- and/or Manα6-epitopes including following oligosaccharide sequences:
Manα3Man, Manα6Man, Manα3Manβ, Manα6Manβ, Manα3Manα, Manα6Manα,
Manα3Manα6Man, Manα6Manα6Man, Manα3Manα6Manβ, Manα6Manα6Manβ
and to following
c) branched terminal mannose epitopes, are preferred as characteristic structures of especially high-mannose structures (c1 and c2) and low-mannose structures (c3), the preferred branched epitopes include:
c1) branched terminal Manα2-epitopes
Manα2Manα3(Manα2Manα6)Man, Manα2Manα3(Manα2Manα6)Manα,
Manα2Manα3(Manα2Manα6)Manα6Man, Manα2Manα3(Manα2Manα6)Manα6Manβ,
Manα2Manα3(Manα2Manα6)Manα6(Manα2Manα3)Man,
Manα2Manα3(Manα2Manα6)Manα6(Manα2Manα2Manα3)Man,
Manα2Manα3(Manα2Manα6)Manα6(Manα2Manα3)Manβ
Manα2Manα3(Manα2Manα6)Manα6(ManαManα2Manα3 )Manβ
c2) branched terminal Manα2- and Manα3 or Manα6-epitopes
according to formula when m1 and/or m8 and/m9 is 1 and the molecule comprise at least one nonreducing end terminal Manα3 or Manα6-epitope
c3) branched terminal Manα3 or Manα6-epitopes
Manα3(Manα6)Man, Manα3(Manα6)Manβ, Manα3(Manα6)Manα,
Manα3(Manα6)Manα6Man, Manα3(Manα6)Manα6Manβ,
Manα3(Manα6)Manα6(Manα3)Man, Manα3(Manα6)Manα6(Manα3)Manβ
The present invention is further directed to increase of selectivity and sensitivity in recognition of target glycans by combining recognition methods for terminal Manα2 and Manα3 and/or Manα6-comprising structures. Such methods would be especially useful in context of cell material according to the invention comprising both high-mannose and low-mannose glycans.
Complex Type N-Glycans
According to the present invention, complex-type structures are preferentially identified by mass spectrometry, preferentially based on characteristic monosaccharide compositions, wherein HexNAc≧4 and Hex≧3. In a more preferred embodiment of the present invention, 4≦HexNAc≦20 and 3≦Hex≦21, and in an even more preferred embodiment of the present invention, 4≦HexNAc≦10 and 3≦Hex≦11. The complex-type structures are further preferentially identified by sensitivity to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The complex-type structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc N-glycan core structure and GlcNAc residues attached to the Manα3 and/or Manα6 residues.
Beside Mannose-type glycans the preferred N-linked glycomes include GlcNAcβ2-type glycans including Complex type glycans comprising only GlcNAcβ2-branches and Hydrid type glycan comprising both Mannose-type branch and GlcNAcβ2-branch.
GlcNAcβ2-Type Glycans
The invention revealed GlcNAcβ2Man structures in the glycomes according to the invention. Preferably GlcNAcβ2Man-structures comprise one or several of GlcNAcβ2Manα-structures, more preferably GlcNAcβ2Manα3 or GlcNAcβ2Manα6-structure.
The Complex type glycans of the invention comprise preferably two GlcNAcβ2Manα structures, which are preferably GlcNAcβ2Manα3 and GlcNAcβ2Manα6-.
The Hybrid type glycans comprise preferably GlcNAcβ2Manα3-structure.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula GNβ2
[R1GNβ2]n1[Mα3]n2{[R3]n3[GNβ2]n4Mα6}n5Mβ4GNXyR2,
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein n1, n2, n3, n4, n5 and nx, are either 0 or 1, independently,
with the proviso that when n2 is 0 then n1 is 0 and when n3 is 1 or/and n4 is 1 then n5 is also 1, and at least n1 or n4 is 1, or n3 is 1,
when n4 is 0 and n3 is 1 then R3 is a mannose type substituent or nothing and
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1, Rx and R3 indicate independently one, two or three, natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
Elongation of GlcNAcβ2-Type Structures, Complex/Hydrid Type Structures
The substituents R1, Rx and R3 may form elongated structures. In the elongated structures R1, and Rx represent substituents of GlcNAc (GN) and R3 is either substituent of GlcNAc or when n4 is 0 and n3 is 1 then R3 is a mannose type substituent linked to mannosea6-branch forming a Hybrid type structure. The substituents of GN are monosaccharide Gal, GalNAc, or Fuc or and acidic residue such as sialic acid or sulfate or fosfate ester.
GlcNAc or GN may be elongated to N-acetyllactosaminyl also marked as GalβGN or di-N-acetyllactosdiaminyl GalNAcβGlcNAc preferably GalNAcβ4GlcNAc. LNβ2M can be further elongated and/or branched with one or several other monosaccharide residues such as by galactose, fucose, SA or LN-unit(s) which may be further substituted by SAα-structures,
and/or Mα6 residue and/or Mα3 residues can be further substituted one or two β6-, and/or β4-linked additional branches according to the formula,
and/or either of Mα6 residue or Mα3 residue may be absent
and/or Mα6-residue can be additionally substitutes other Manα units to form a hybrid type structures
and/or Manβ4 can be further substituted by GNβ4,
and/or SA may include natural substituents of sialic acid and/or it may be substituted by other SA-residues preferably by α8- or α9-linkages.
The SAα-groups are linked to either 3- or 6-position of neighboring Gal residue or on 6-position of GlcNAc, preferably 3- or 6-position of neighboring Gal residue. In separately preferred embodiments the invention is directed structures comprising solely 3-linked SA or 6-linked SA, or mixtures thereof.
Hybrid Type Structures
According to the present invention, hybrid-type or monoantennary structures are preferentially identified by mass spectrometry, preferentially based on characteristic monosaccharide compositions, wherein HexNAc=3 and Hex≧2. In a more preferred embodiment of the present invention 2≦Hex≦11, and in an even more preferred embodiment of the present invention 2≦Hex≦9. The hybrid-type structures are further preferentially identified by sensitivity to exoglycosidase digestion, preferentially a-mannosidase digestion when the structures contain non-reducing terminal α-mannose residues and Hex≧3, or even more preferably when Hex≧4, and to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The hybrid-type structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc N-glycan core structure, a GlcNAcβ residue attached to a Manα residue in the N-glycan core, and the presence of characteristic resonances of non-reducing terminal α-mannose residue or residues.
The monoantennary structures are further preferentially identified by insensitivity to α-mannosidase digestion and by sensitivity to endoglycosidase digestion, preferentially N-glycosidase F detachment from glycoproteins. The monoantennary structures are further preferentially identified in NMR spectroscopy based on characteristic resonances of the Manα3Manβ4GlcNAcβ4GlcNAc N-glycan core structure, a GlcNAcβ residue attached to a Manα residue in the N-glycan core, and the absence of characteristic resonances of further non-reducing terminal α-mannose residues apart from those arising from a terminal α-mannose residue present in a ManαManβ sequence of the N-glycan core.
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula HY1
R1GNβ2Mα3{[R3]n3Mα6}Mβ4GNXyR2,
wherein n3, is either 0 or 1, independently,
AND
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1 indicate nothing or substituent or substituents linked to GlcNAc, R3 indicates nothing or Mannose-substituent(s) linked to mannose residue, so that each of R1, and R3 may correspond to one, two or three, more preferably one or two, and most preferably at least one natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
Preferred Hybrid Type Structures
The preferred hydrid type structures include one or two additional mannose residues on the preferred core structure.
R1GNβ2Mα3{[Mα3]m1([Mα6])m2Mα6}Mβ4GNXyR2, Formula HY2
wherein n3, is either 0 or 1, and m1 and m2 are either 0 or 1, independently,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula HY1 .
Furthermore the invention is directed to structures comprising additional lactosamine type structures on GNβ2-branch. The preferred lactosamine type elongation structures includes N-acetyllactosamines and derivatives, galactose, GalNAc, GlcNAc, sialic acid and fucose.
Preferred structures according to the formula HY2 include:
Structures containing non-reducing end terminal GlcNAc
As a specific preferred group of glycans
GNβ2Mα3{Mα3Mα6}Mβ4GNXyR2,
GNβ2Mα3{Mα6Mα6}Mβ4GNXyR2,
GNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2,
and/or elongated variants thereof
R1GNβ2Mα3{Mα3Mα6}Mβ4GNXyR2,
R1GNβ2Mα3{Mα6Mα6}Mβ4GNXyR2,
R1GNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2,
[R1Gal[NAc]o2βz]o1GNβ2Mα3{[Mα3]m1[(Mα6)]m2Mα6}n5Mβ4GNXyR2, Formula HY3
wherein n1, n2, n3, n5, m1, m2, o1 and o2 are either 0 or 1, independently,
z is linkage position to GN being 3 or 4 in a preferred embodiment 4,
R1 indicates on or two a N-acetyllactosamine type elongation groups or nothing,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula HY1.
Preferred structures according to the formula HY3 include especially structures containing non-reducing end terminal Galβ, preferably Galβ3/4 forming a terminal N-acetyllactosamine structure. These are preferred as a special group of Hybrid type structures, preferred as a group of specific value in characterization of balance of Complex N-glycan glycome and High mannose glycome:
GalβzGNβ2Mα3{Mα3Mα6}Mβ4GNXyR2, GalβzGNβ2Mα3{Mα6Mα6}Mβ4GNXyR2, GalβzGNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2,
and/or elongated variants thereof preferred for carrying additional characteristic terminal structures useful for characterization of glycan materials
R1GalβzGNβ2Mα3{Mα3Mα6}Mβ4GNXyR2,
R1GalβzGNβ2Mα3{Mα6Mα6}Mβ4GNXyR2,
R1GalβzGNβ2Mα3{Mα3(Mα6)Mα6}Mβ4GNXyR2. Preferred elongated materials include structures wherein R1 is a sialic acid, more preferably NeuNAc or NeuGc.
Complex N-Glycan Structures
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula CO1
[R1GNβ2]n1[Mα3]n2{[R3GNβ2]n4Mα6}n5Mβ4GNXyR2
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein n1, n2, n4, n5 and nx, are either 0 or 1, independently,
with the proviso that when n2 is 0 then n1 is 0 and when n4 is 1 then n5 is also 1, and at least n1 is 1 or n4 is 1, and at least either of n1 and n4 is 1
and
wherein X is glycosidically linked disaccharide epitope β4(Fucα6)nGN, wherein n is 0 or 1, or X is nothing and
y is anomeric linkage structure α and/or β or linkage from derivatized anomeric carbon, and
R1, Rx and R3 indicate independently one, two or three, natural substituents linked to the core structure,
R2 is reducing end hydroxyl, chemical reducing end derivative or natural asparagine N-glycoside derivative such as asparagine N-glycosides including asparagines N-glycoside aminoacids and/or peptides derived from protein.
[ ] indicate groups either present or absent in a linear sequence. { } indicates branching which may be also present or absent.
Preferred Complex Type Structures
Incomplete Monoantennary N-Glycans
The present invention revealed incomplete Complex monoantennary N-glycans, which are unusual and useful for characterization of glycomes according to the invention. The most of the in complete monoantennary structures indicate potential degradation of biantennary N-glycan structures and are thus preferred as indicators of cellular status. The incomplete Complex type monoantennary glycans comprise only one GNβ2-structure.
The invention is specifically directed to structures are according to the Formula CO1 above when only n1 is 1 or n4 is one and mixtures of such structures.
The preferred mixtures comprise at least one monoantennary complex type glycans
A) with single branches from a likely degradative biosynthetic process:
R1GNβ2Mα3β4GNXyR2
R3GNβ2Mα6Mβ4GNXyR2 and
B) with two branches comprising mannose branches
B1) R1GNβ2Mα3{Mα6}n5Mβ4GNXyR2
B2) Mα3{R3GNβ2Mα6}n5Mβ4GNXyR2
The structure B2 is preferred with A structures as product of degradative biosynthesis, it is especially preferred in context of lower degradation of Manα3-structures. The structure B1 is useful for indication of either degradative biosynthesis or delay of biosynthetic process
Biantennary and Multiantennary Structures
The inventors revealed a major group of biantennary and multiantennary N-glycans from cells according to the invention, the preferred biantennary and multiantennary structures comprise two GNβ2 structures.
These are preferred as an additional characteristics group of glycomes according to the invention and are represented according to the Formula CO2:
R1GNβ2Mα3{R3GNβ2Mα6}Mβ4GNXyR2
with optionally one or two or three additional branches according to formula [RxGNβz]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein nx is either 0 or 1,
and other variables are according to the Formula CO1.
Preferred Biantennary Structure
A biantennary structure comprising two terminal GNβ-epitopes is preferred as a potential indicator of degradative biosynthesis and/or delay of biosynthetic process. The more preferred structures are according to the Formula CO2 when R1 and R3 are nothing.
Elongated Structures
The invention revealed specific elongated complex type glycans comprising Gal and/or GalNAc-structures and elongated variants thereof. Such structures are especially preferred as informative structures because the terminal epitopes include multiple informative modifications of lactosamine type, which characterize cell types according to the invention. The present invention is directed to at least one of natural oligosaccharide sequence structure or group of structures and corresponding structure(s) truncated from the reducing end of the N-glycan according to the Formula CO3
[R1Gal[NAc]o2βz2]o1GNβ2Mα3{[R1Gal[NAc]o4βz2]o3GNβ2Mα6}Mβ4GNXyR2,
with optionally one or two or three additional branches according to formula [RxGNβz1]nx linked to Mα6-, Mα3-, or Mβ4 and Rx may be different in each branch
wherein nx, o1, o2, o3, and o4 are either 0 or 1, independently,
with the provisio that at least o1 or o3 is 1, in a preferred embodiment both are 1
z2 is linkage position to GN being 3 or 4, in a preferred embodiment 4,
z1 is linkage position of the additional branches.
R1, Rx and R3 indicate on or two a N-acetyllactosamine type elongation groups or nothing,
{ } and ( ) indicates branching which may be also present or absent,
other variables are as described in Formula CO1.
Galactosylated Structures
The inventors characterized especially directed to digalactosylated structure
GalβzGNβ2Mα3{GalβzGNβ2Mα6}Mβ4GNXyR2,
and monogalactosylated structures:
GalβzGNβ2Mα3{GNβ2Mα6}Mβ4GNXyR2,
GNβ2Mα3{GalβzGNβ2Mα6}Mβ4GNXyR2,
and/or elongated variants thereof preferred for carrying additional characteristic terminal structures useful for characterization of glycan materials
R1GalβzGNβ2Mα3{R3GalβzGNβ2Mα6}Mβ4GNXyR2
R1GalβzGNβ2Mα3{GNβ2Mα6}Mβ4GNXyR2, and
GNβ2Mα3{R3GalβzGNβ2Mα6}Mβ4GNXyR2.
Preferred elongated materials include structures wherein R1 is a sialic acid, more preferably NeuNAc or NeuGc.
LacdiNAc-Structure Comprising N-Glycans
The present invention revealed for the first time LacdiNAc, GalNacbGlcNAc structures from the cell according to the invention. Preferred N-glycan lacdiNAc structures arc included in structures according to the Formula CO1, when at least one the variable o2 and o4 is 1.
The Major Acidic Glycan Types
The acidic glycomes mean glycomes comprising at least one acidic monosaccharide residue such as sialic acids (especially NeuNAc and NeuGc) forming silylated glycome, HexA (especially GlcA, glucuronic acid) and/or acid modification groups such as phosphate and/or sulphate esters.
According to the present invention, presence of phosphate and/or sulphate ester (SP) groups in acidic glycan structures is preferentially indicated by characteristic monosaccharide compositions containing one or more SP groups. The preferred compositions containing SP groups include those formed by adding one or more SP groups into non-SP group containing glycan compositions, while the most preferential compositions containing SP groups according to the present invention arc selected from the compositions described in the acidic N-glycan fraction glycan group tables. The presence of phosphate and/or sulphate ester groups in acidic glycan structures is preferentially further indicated by the characteristic fragments observed in fragmentation mass spectrometry corresponding to loss of one or more SP groups, the insensitivity of the glycans carrying SP groups to sialidase digestion. The presence of phosphate and/or sulphate ester groups in acidic glycan structures is preferentially also indicated in positive ion mode mass spectrometry by the tendency of such glycans to form salts such as sodium salts as described in the Examples of the present invention. Sulphate and phosphate ester groups are further preferentially identified based on their sensitivity to specific sulphatase and phosphatase enzyme treatments, respectively, and/or specific complexes they form with cationic probes in analytical techniques such as mass spectrometry.
Complex N-Glycan Glycomes, Sialylated
The present invention is directed to at least one of natural oligosaccharide sequence structures and structures truncated from the reducing end of the N-glycan according to the Formula
[{SAα3/6}s1LNβ2]r1Mα3{({SAα3/6}s2LNβ2)r2Mα6}r8{M[β4GN[β4{Fucα6}r3GN]r4]r5}r6 (I)
with optionally one or two or three additional branches according to formula
{SAα3/6}s3LNβ, (IIb)
wherein r1, r2, r3, r4, r5, r6, r7 and r8 are either 0 or 1, independently,
wherein s1, s2 and s3 are either 0 or 1, independently,
with the proviso that at least r1 is 1 or r2 is 1, and at least one of s1, s2 or s3 is 1.
LN is N-acetyllactosaminyl also marked as GalβGN or di-N-acetyllactosdiaminyl GalNAcβGlcNAc preferably GalNAcβ4GlcNAc, GN is GlcNAc, M is mannosyl-,
with the proviso LNβ2M or GNβ2M can be further elongated and/or branched with one or several other monosaccharide residues such as by galactose, fucose, SA or LN-unit(s) which may be further substituted by SAα-structures,
and/or one LNβ can be truncated to GNβ
and/or Mα6 residue and/or Mα3 residues can be further substituted one or two β6-, and/or β4-linked additional branches according to the formula,
and/or either of Mα6 residue or Mα3 residue may be absent
and/or Mα6-residue can be additionally substitutes other Manα units to form a hybrid type structures
and/or Manβ4 can be further substituted by GNβ4,
and/or SA may include natural substituents of sialic acid and/or it may be substituted by other SA-residues preferably by α8- or α9-linkages.
( ), { }, [ ] and [ ] indicate groups either present or absent in a linear sequence. { }indicates branching which may be also present or absent.
The SAα-groups are linked to either 3- or 6-position of neighboring Gal residue or on 6-position of GlcNAc, preferably 3- or 6-position of neighboring Gal residue. In separately preferred embodiments the invention is directed structures comprising solely 3-linked SA or 6-linked SA, or mixtures thereof. In a preferred embodiment the invention is directed to glycans wherein r6 is 1 and r5 is 0, corresponding to N-glycans lacking the reducing end GlcNAc structure.
The LN unit with its various substituents can in a preferred general embodiment represented by the formula:
[Gal(NAc)n1α3]n2{Fucα2}n3Gal(NAc)n4β3/4{Fucα4/3}n5GlcNAcβ
wherein n1, n2, n3, n4, and n5 are independently either 1 or 0,
with the provisio that
the substituents defined by n2 and n3 are alternative to presence of SA at the non-reducing end terminal
the reducing end GlcNAc-unit can be further β3- and/or β6-linked to another similar LN-structure forming a poly-N-acetyllactosamine structure
with the provision that for this LN-unit n2, n3 and n4 are 0,
the Gal(NAc)β and GlcNAcβ units can be ester linked a sulphate ester group,
( ), and [ ] indicate groups either present or absent in a linear sequence; { }indicates branching which may be also present or absent.
LN unit is preferably Galβ4GN and/or Galβ3GN. The inventors revealed that early human cells can express both types of N-acetyllactosamine, the invention is especially directed to mixtures of both structures. Furthermore the invention is directed to special relatively rear type 1 N-acetyllactosamines, Galβ3GN, without any non-reducing end/site modification, also called lewis c-structures, and substituted derivatives thereof, as novel markers of early human cells.
Occurrence of Structure Groups in Preferred Cell Types
In the present invention, glycan signals with preferential monosaccharide compositions can be grouped into structure groups based on classification rules described in the present invention. The present invention includes parallel and overlapping classification systems that are used for the classification of the glycan structure groups.
Glycan signals isolated from the N-glycan fractions from the cell types studied in the present invention are grouped into glycan structure groups based on their preferential monosaccharide compositions according to the invention, in Table 29 for neutral N-glycan fractions and Table 30 acidic N-glycan fractions. Taken together, the analyses revealed that all the structure groups according to the invention are present in the studied cell types.
The invention is specifically directed to terminal HexNAc groups and/or other structure groups and/or combinations thereof as shown in the Examples describing and analysis of stem cell including hESC glycan structure classification. Non-reducing terminal HexNAc residues could be liberated from the cell types studied in the present invention by specific combinations of β-hexosaminidase and β-glucosaminidase digestions, confining the structural group classification of the present invention, and identifying terminal HexNAc residues as β-GlcNAc and/or β-GalNAc residues in the studied cell types. According to the present invention, specifically in hESC and cells differentiated therefrom the terminal HexNAc residues preferentially include both β-GlcNAc and β-GalNAc residues, more preferentially terminal β-GlcNAc linkages including bisecting GlcNAc linkages and other hybrid-type and complex-type GlcNAc linkages according to the present invention, and terminal β-GalNAc linkages including β4-linked GalNAc and most preferentially GalNAcβ4GlcNAcβ (LacdiNAc) structures according to the present invention.
Integrated Glycome Analysis Technology
The invention is directed to analysis of present cell materials based on single or several glycans (glycome profile) of cell materials according to the invention. The analysis of multiple glycans is preferably performed by physical analysis methods such as mass spectrometry and/or NMR.
The invention is specifically directed to integrated analysis process for glycomes, such as total glycomes and cell surface glycomes. The integrated process represent various novel aspects in each part of the process. The methods are especially directed to analysis of low amounts of cells. The integrated analysis process includes
A) preferred preparation of substrate cell materials for analysis, including one or several of the methods: use of a chemical buffer solution, use of detergents, chemical reagents and/or enzymes.
B) release of glycome(s), including various subglycome type based on glycan core, charge and other structural features, use of controlled reagents in the process
C) purification of glycomes and various subglycomes from complex mixtures
D) preferred glycome analysis, including profiling methods such as mass spectrometry and/or NMR spectroscopy
E) data processing and analysis, especially comparative methods between different sample types and quantitative analysis of the glycome data.
Low Amounts of Cells for Glycome Analysis from Stem Cells
The invention revealed that its possible to produce glycome from very low amount of cells. The preferred embodiments amount of cells is between 1000 and 10 000 000 cells, more preferably between 10 000 and 1 000 000 cells. The invention is further directed to analysis of released glycomes of amount of at least 0.1 pmol, more preferably of at least to 1 pmol, more preferably at least of 10 pmol.
(a) Total asparagine-linked glycan (N-glycan) pool was enzymatically isolated from about 100 000 cells. (b) The total N-glycan pool (picomole quantities) was purified with microscale solid-phase extraction and divided into neutral and sialylated N-glycan fractions. The N-glycan fractions were analyzed by MALDI-TOF mass spectrometry either in positive ion mode for neutral N-glycans (c) or in negative ion mode for sialylated glycans (d). Over one hundred N-glycan signals were detected from each cell type revealing the surprising complexity of hESC glycosylation. The relative abundances of the observed glycan signals were determined based on relative signal intensities (Saarinen et al., 1999, Eur. J Biochem. 259, 829-840).
Methods for Low Sample Amounts
The present invention is specifically directed to methods for analysis of low amounts of samples.
The invention further revealed that it is possible to use the methods according to the invention for analysis of low sample amounts. It is realized that the cell materials are scarce and difficult to obtain from natural sources. The effective analysis methods would spare important cell materials. Under certain circumstances such as in context of cell culture the materials may be available from large scale. The required sample scale depends on the relative abundancy of the characteristic components of glycome in comparison to total amount of carbohydrates. It is further realized that the amount of glycans to be measured depend on the size and glycan content of the cell type to be measured and analysis including multiple enzymatic digestions of the samples would likely require more material. The present invention revealed especially effective methods for free released glycans.
The picoscale samples comprise preferably at least about 1000 cells, more preferably at least about 50 000 cells, even more more preferably at least 100 000 cells, and most preferably at least about 500 000 cells. The invention is further directed to analysis of about 1 000 000 cells. The preferred picoscale samples contain from at least about 1000 cells to about 10 000 000 cells according to the invention. The useful range of amounts of cells is between 50 000 and 5 000 000, even more preferred range of of cells is between 100 000 and 3 000 000 cells. A preferred practical range for free oligosaccharide glycomes is between about 500 000 and about 2 000 000 cells. It is realized that cell counting may have variation of less than 20%, more preferably 10% and most preferably 5%, depending on cell counting methods and cell sample, these variations may be used instead of term aboul It is further understood that the methods according to the present invention can be upscaled to much larger amounts of material and the pico/nanosoale analysis is a specific application of the technology. The invention is specifically directed to use of microcolumn technologies according to the invention for the analysis of the preferred picoscale and low amount samples according to the invention,
The invention is specifically directed to purification to level, which would allow production of high quality mass spectrum covering the broad size range of glycans of glycome compositions according to the invention.
The Binding Methods for Recognition of Structures from Cell Surfaces
Recognition of Structures from Glycome Materials and on Cell Surfaces by Binding Methods
The present invention revealed that beside the physicochemical analysis by NMR and/or mass spectrometry several methods are useful for the analysis of the structures. The invention is especially directed to a method:
The peptides and proteins are preferably recombinant proteins or corresponding carbohydrate recognition domains derived thereof, when the proteins are selected from the group monoclonal antibody, glycosidase, glycosyl transferring enzyme, plant lectin, animal lectin or a peptide mimetic thereof, and wherein the binder includes a detectable label structure.
Preferred Binder Molecules
The present invention revealed various types of binder molecules useful for characterization of cells according to the invention and more specifically the preferred cell groups and cell types according to the invention. The preferred binder molecules are classified based on the binding specificity with regard to specific structures or structural features on carbohydrates of cell surface. The preferred binders recognize specifically more than single monosaccharide residue.
It is realized that most of the current binder molecules such as all or most of the plant lectins are not optimal in their specificity and usually recognize roughly one or several monosaccharides with various linkages. Furthermore the specificities of the lectins are usually not well characterized with several glycans of human types.
The preferred high specificity binders recognize
The preferred binders includes natural human and or animal, or other proteins developed for specific recognition of glycans. The preferred high specificity binder proteins are specific antibodies preferably monoclonal antibodies; lectins, preferably mammalian or animal lectins; or specific glycosyltransferring enzymes more preferably glycosidase type enzymes, glycosyltransferases or transglycosylating enzymes.
Target Structures for Specific binders and Examples of the Binding Molecules
Combination of Terminal Structures in Combination with Specific Glycan Core Structures
It is realized that part of the structural elements are specifically associated with specific glycan core structure. The recognition of terminal structures linked to specific core structures are especially preferred, such high specificity reagents have capacity of recognition almost complete individual glycans to the level of physicochemical characterization according to the invention. For example many specific mannose structures according to the invention are in general quite characteristic for N-glycan glycomes according to the invention. The present invention is especially directed to recognition terminal epitopes.
Common Terminal Structures on Several Glycan Core Structures
The present invention revealed that there are certain common structural features on several glycan types and that it is possible to recognize certain common epitopes on different glycan structures by specific reagents when specificity of the reagent is limited to the terminal without specificity for the core structure. The invention especially revealed characteristic terminal features for specific cell types according to the invention. The invention realized that the common epitopes increase the effect of the recognition. The common terminal structures are especially useful for recognition in the context with possible other cell types or material, which do not contain the common terminal structure in substantial amount.
Specific Preferred Structural Groups
The present invention is directed to recognition of oligosaccharide sequences comprising specific terminal monosaccharide types, optionally further including a specific core structure. The preferred oligosaccharide sequences classified based on the terminal monosaccharide structures.
1. Structures with Terminal Mannose Monosaccharide
Preferred mannose-type target structures have been specifically classified by the invention. These include various types of high and low-mannose structures and hybrid type structures according to the invention.
Low or Uncharacterised Specificity Binders
preferred for recognition of terminal mannose structures includes mannose-monosaccharide binding plant lectins.
Preferred High Specific High Specificity Binders
include
i) Specific mannose residue releasing enzymes such as linkage specific mannosidases, more preferably an α-mannosidase or β-mannosidase.
Preferred α-mannosidases includes linkage specific α-mannosidases such as α-Mannosidases cleaving preferably non-reducing end terminal
α2-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα2-structures; or
α6-linked mannose residues specifically or more effectively than other linkages, more preferably cleaving specifically Manα6-structures;
Preferred β-mannosidases includes β-mannosidases capable of cleaving β4-linked mannose from non-reducing end terminal of N-glycan core Manβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes.
ii) Specific binding proteins recognizing preferred mannose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins. The invention is directed to antibodies recognizing MS2B1 and more preferably MS3B2-structures
2. Structures with Terminal Gal-Monosaccharide
Preferred galactose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal Gal
Preferred for recognition of terminal galactose structures includes plant lectins such as ricin lectin (ricinus communis agglutinin RCA), and peanut lectin(/agglutinin PNA).
Preferred High Specific High Specificity Binders Include
i) Specific galactose residue releasing enzymes such as linkage specific galactosidases, more preferably α-galactosidase or β-galactosidase.
Preferred α-galactosidases include linkage galactosidases capable of cleaving Galα3Gal-structures revealed from specific cell preparations
Preferred β-galactosidases includes β-galactosidases capable of cleaving
β4-linked galactose from non-reducing end terminal Galβ4GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes and
β3-linked galactose from non-reducing end terminal Galβ3GlcNAc-structure without cleaving other β-linked monosaccharides in the glycomes
ii) Specific binding proteins recognizing preferred galactose structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as galectins.
3. Structures with Terminal GalNAc-Monosaccharide
Preferred GalNAc-type target structures have been specifically revealed by the invention. These include especially LacdiNAc, GalNAcβGlcNAc-type structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal GalNAc
Several plant lectins has been reported for recognition of terminal GalNAc. It is realized that some GalNAc-recognizing lectins may be selected for low specificity recognition of the preferred LacdiNAc-structures.
Preferred High Specific High Specificity binders Include
i) The invention revealed that β-linked GalNAc can be recognized by specific β-N-acetylhexosaminidase enzyme in combination with β-N-acetylhexosaminidase enzyme.
This combination indicates the terminal monosaccharide and at least part of the linkage structure.
Preferred β-N-acetylehexosaminidase, includes enzyme capable of cleaving β-linked GalNAc from non-reducing end terminal GalNAcβ4/3-structures without cleaving α-linked HexNAc in the glycomes; preferred N-acetylglucosaminidases include enzyme capable of cleaving β-linked GlcNAc but not GalNAc.
ii) Specific binding proteins recognizing preferred GalNAcβ4, more preferably GalNAcβ4GlcNAc, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins, and a special plant lectin WFA (Wisteria floribunda agglutinin).
4. Structures with Terminal GlcNAc-Monosaccharide
Preferred GlcNAc-type target structures have been specifically revealed by the invention. These include especially GlcNAcβ-type structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal GlcNAc
Several plant lectins has been reported for recognition of terminal GlcNAc. It is realized that some GlcNAc-recognizing lectins may be selected for low specificity recognition of the preferred GlcNAc-structures.
Preferred High Specific High Specificity Binders Include
i) The invention revealed that C-linked GlcNAc can be recognized by specific β-N-acetyglucosaminidase enzyme.
Preferred β-N-acetyglucosaminidase includes enzyme capable of cleaving β-linked GlcNAc from non-reducing end terminal GlcNAcβ2/3/6-structures without cleaving β-linked GalNAc or α-linked HexNAc in the glycomes;
ii) Specific binding proteins recognizing preferred GlcNAcβ2/3/6, more preferably GlcNAcβ2Manα, structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins.
5. Structures with Terminal Fucose-Monosaccharide
Preferred fucose-type target structures have been specifically classified by the invention. These include various types of N-acetyllactosamine structures according to the invention.
Low or Uncharacterised Specificity Binders for Terminal Fuc
Preferred for recognition of terminal fucose structures includes fucose monosaccharide binding plant lectins. Lectins of Ulex europeaus and Lotus tetragonolobus has been reported to recognize for example terminal Fucoses with some specificity binding for α2-linked structures, and branching α3-fucose, respectively.
Preferred High Specific High Specificity Binders Include
i) Specific fucose residue releasing enzymes such as linkage fucosidases, more preferably α-fucosidase.
Preferred α-fucosidases include linkage fucosidases capable of cleaving Fucα2Gal-, and Galβ4/3(Fucα3/4)GlcNAc-structures revealed from specific cell preparations.
ii) Specific binding proteins recognizing preferred fucose structures according to the invention.
The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as Lewis x, Galβ4(Fucα3)GlcNAc, and sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc.
The preferred antibodies includes antibodies recognizing specifically Lewis type structures such as Lewis x, and sialyl-Lewis x. More preferably the Lewis x-antibody is not classic SSEA-1 antibody, but the antibody recognizes specific protein linked Lewis x structures such as Galβ4(Fucα3)GlcNAcβ2Manα-linked to N-glycan core.
6. Structures with Terminal Sialic Acid-Monosaccharide
Preferred sialic acid-type target structures have been specifically classified by the invention.
Low or Uncharacterised Specificity Binders for Terminal Fuc
Preferred for recognition of terminal sialic acid structures includes sialic acid monosaccharide binding plant lectins.
Preferred High Specific High Specificity Binders Include
i) Specific sialic acid residue releasing enzymes such as linkage sialidases, more preferably α-sialidases.
Preferred α-sialidases include linkage sialidases capable of cleaving SAα3Gal- and SAα6Gal-structures revealed from specific cell preparations by the invention.
Preferred lectins, with linkage specificity include the lectins, that are specific for SAα3Gal-structures, preferably being Maackia amurensis lectin and/or lectins specific for SAα6Gal-structures, preferably being Sambucus nigra agglutinin.
ii) Specific binding proteins recognizing preferred sialic acid oligosaccharide sequence structures according to the invention. The preferred reagents include antibodies and binding domains of antibodies (Fab-fragments and like), and other engineered carbohydrate binding proteins and animal lectins such as selectins recognizing especially Lewis type structures such as sialyl-Lewis x, SAα3Galβ4(Fucα3)GlcNAc or sialic acid recognizing Siglec-proteins. The preferred antibodies includes antibodies recognizing specifically sialyl-N-acetyllactosamines, and sialyl-Lewis x.
Preferred antibodies for NeuGc-structures includes antibodies recognizes a structure NeuGcα3Galβ4Glc(NAc)0 or 1 and/or GalNAcβ4[NeuGcα3]Galβ4Glc(NAc)0 or 1, wherein wherein [ ] indicates branch in the structure and ( )0 or 1 a structure being either present or absent. In a preferred embodiment the invention is directed recognition of the N-glycolyl-Neuraminic acid structures by antibody, preferably by a monoclonal antibody or human/humanized monoclonal antibody. A preferred antibody contains the variable domains of P3-antibody.
Preferred Epitopes and Antibody Binders
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 287 (H type 1). In a preferred embodiment, an antibody binds to Fucα2Galβ3GlcNAc epitope. A more preferred antibody comprises of the antibody of clone 17-206 (ab3355) by Abeam. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 279 (Lewis c, Galβ3GlcNAc). In a preferred embodiment, an antibody binds to Galβ3GlcNAc epitope in glycoconjugates, more preferably in glycoproteins and glycolipids such as lactotetraosylceramide. A more preferred antibody comprises of the antibody of clone K21 (ab3352) by Abeam. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 288 (Globo H). In a preferred embodiment, an antibody binds to Fucα2Galβ3GalNAcβ epitope, more preferably Fucα2Galβ3GalNAcβ3GalαLacCer epitope. A more preferred antibody comprises of the antibody of clone A69-A/E8 (MAB-S206) by Glycotope. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 284 (H type 2). In a preferred embodiment, an antibody binds to Fucα2Galβ4GlcNAc epitope. A more preferred antibody comprises of the antibody of clone B393 (DM3015) by Acris. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 283 (Lewis b). In a preferred embodiment, an antibody binds to Fucα2Galβ3(Fucα4)GlcNAc epitope. A more preferred antibody comprises of the antibody of clone 2-25LE (DM3122) by Acris. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 286 (H type 2). In a preferred embodiment, an antibody binds to Fucα2Galβ4GlcNAc epitope. A more preferred antibody comprises of the antibody of clone B393 (BM258P) by Acris. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other preferred binders and/or antibodies comprise of binders which bind to the same epitope than GF 290 (H type 2). In a preferred embodiment, an antibody binds to Fucα2Galβ4GlcNAc epitope. A more preferred antibody comprises of the antibody of clone A51-B/A6 (MAB-S204) by Glycotope. This epitope is suitable and can be used to detect, isolate and evaluate the differentiation stage, and/or plucipotency of stem cells, preferably human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells, preferably human embryonice stem cells from a mixture of cells comprising feeder and stem cells.
Other binders binding to feeder cells, preferably mouse feeder cells, comprise of binders which bind to the same epitope than GF 285 (H type 2). In a preferred embodiment, an antibody binds to Fucα2Galβ4GlcNAc, Fucα2Galβ3(Fucα4)GlcNAc, Fucα2Galβ4(Fucα3)GlcNAc epitope. A more preferred antibody comprises of the antibody of clone B389 (DM3014) by Acris. This epitope is suitable and can be used to detect, isolate and evaluate of feeder cells, preferably mouse feeder cells in culture with human embryonic stem cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich feeder cells (negatively select stem cells), preferably mouse embryonic feeder cells from a mixture of cells comprising feeder and stem cells.
Other binders binding to stem cells, preferably human stem cells, comprise of binders which bind to the same epitope than GF 289 (Lewis y). In a preferred embodiment, an antibody binds to Fucα2Galβ4(Fucα3)GlcNAc epitope. A more preferred antibody comprises of the antibody of clone A70-C/C8 (MAB-S201) by Glycotope. This epitope is suitable and can be used to detect, isolate and evaluate of stem cells, preferably human stem cells in culture with feeder cells. The detection can be performed in vitro, for FACS purposes and/or for cell lineage specific purposes. This antibody can be used to positively isolate and/or separate and/or enrich stem cells (negatively select feeder cells), preferably human stem cells from a mixture of cells comprising feeder and stem cells.
The staining intensity and cell number of stained stem cells, i.e. glycan structures of the present invention on stem cells indicates suitability and usefulness of the binder for isolation and differentiation marker. For example, low relative number of a glycan structure expressing cells may indicate lineage specificity and usefulness for selection of a subset and when selected/isolated from the colonies and cultured. Low number of expression is less than 5%, less than 10%, less than 15%, less than 20%, less than 30% or less than 40%. Further, low number of expression is contemplated when the expression levels are between 1-10%, 10%-20%, 15-25%, 20-40%, 25-35% or 35-50%. Typically, FACS analysis can be performed to enrich, isolate and/or select subsets of cells expressing a glycan structure(s).
High number of glycan expressing cells may indicate usefullness in pluripotency/multipotency marker and that the binder is useful in identifying, characterizing, selecting or isolating pluripotent or multipotent stem cells in a population of mammalian cells. High number of expression is more than 50%, more preferably more than 60%, even more preferably more than 70%, and most preferably more than 80%, 90 or 95%. Further, high number of expression is contemplated when the expression levels are between 50-60, 55%-65%, 60-70%, 70-80, 80-90%, 90-100 or 95-100%. Typically, FACS analysis can be performed to enrich, isolate and/or select subsets of cells expressing a glycan structure(s).
The epitopes recognized by the binders GF 279, GF 287, and GF 289 and the binders are particularly useful in characterizing pluripotency and multipotency of stem cells in a culture. The epitopes recognized by the binders GF 283, GF 284, GF 286, GF 288, and GF 290 and the binders are particularly useful for selecting or isolating subsets of stem cells. These subset or subpopulations can be further propagated and studied in vitro for their potency to differentiate and for differentiated cells or cell committed to a certain differentiation path.
The percentage as used herein means ratio of how many cells express a glycan structure to all the cells subjected to an analysis or an experiment. For example, 20% stem cells expressing a glycan structure in a stem cell colony means that a binder, eg an antibody staining can be observed in about 20% of cells when assessed visually.
In colonies a glycan structure bearing cells can be distnrbuted in a particular regions or they can be scattered in small patch like colonies. Patch like observed stem cells are useful for cell lineage specific studies, isolation and separation. Patch like characteristics were observed with GF 283, GF 284, GF 286, GF 288, and GF 290.
For positive selection of feeder cells, preferably mouse feeder cells, most preferably embryonic fibroblasts, GF 285 is useful. It stains almost all feeder cells whereas very little if at all staining is found in stem cells. For all percentages of expression, see Table 22.
The term “mainly” indicates preferably at least 60%, more preferably at least 75% and most preferably at least 90%. In the context of stem cells, the term “mainly” indicates preferably at least 60%, more preferably at least 75% and most preferably at least 90% of cells expressing a glycan structure and useful for identifying, characterizing, sclccting or isolating pluripotcnt or multipotent stem cells in a population of mammalian cells.
As used herein, “binder”, “binding agent” and “marker” are used interchangeably.
‘Antibodies
Various procedures known in the art may be used for the production of polyclonal antibodies to peptide motifs and regions or fragments thereof. For the production of antibodies, any suitable host animal (including but not limited to rabbits, mice, rats, or hamsters) are immunized by injection with a peptide (immunogenic fragment). Various adjuvants may be used to increase the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete) adjuvant, mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG {Bacille Calmette-Guerin) and Corynebacterium parvum.
A monoclonal antibody to a peptide motif(s) may be prepared by using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include but are not limited to the hybridoma technique originally described by Kδhler et al., (Nature, 256: 495-497, 1975), and the more recent human B-cell hybridoma technique (Kosbor et al., Immunology Today, 4: 72, 1983) and the EBV-hybridoma technique (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R Liss, Inc., pp. 77-96, 1985), all specifically incorporated herein by reference. Antibodies also may be produced in bacteria from cloned immunoglobulin cDNAs. With the use of the recombinant phage antibody system it may be possible to quickly produce and select antibodies in bacterial cultures and to genetically manipulate their structure.
When the hybridoma technique is employed, mycloma cell lines may be used. Such cell lines suited for use in hybridoma-producing fusion procedures preferably are non-antibody-producing, have high fusion efficiency, and exhibit enzyme deficiencies that render them incapable of growing in certain selective media which support the growth of only the desired fused cells (hybridomas). For example, where the immunized animal is a mouse, one may use P3-X63/Ag8, P3-X63-Ag8.653, NS1/1.Ag 4 1, Sp210-Ag14, FO, NSO/U, MPC-11, MPC11-X45-GTG 1.7 and S194/5XX1 BuI; for rats, one may use R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210; and U-266, GM1500-GRG2, LICR-LON-HMy2 and UC729-6 all may be useful in connection with cell fusions.
In addition to the production of monoclonal antibodies, techniques developed for the production of “chimeric antibodies”, the splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen specificity and biological activity, can be used (Morrison et al, Proc Natl Acad Sd 81: 6851-6855, 1984; Neuberger et al, Nature 312: 604-608, 1984; Takeda et al, Nature 314: 452-454; 1985). Alternatively, techniques described for the production of single-chain antibodies (U.S. Pat. No.4,946,778) can be adapted to produce influenza-specific single chain antibodies.
Antibody fragments that contain the idiotype of the molecule may be generated by known techniques. For example, such fragments include, but are not limited to, the F(ab′)2 fragment which may be produced by pepsin digestion of the antibody molecule; the Fab′ fragments which may be generated by reducing the disulfide bridges of the F(ab′)2 fragment, and the two Fab fragments which may be generated by treating the antibody molecule with papain and a reducing agent.
Non-human antibodies may be humanized by any methods known in the art. A preferred “humanized antibody” has a human constant region, while the variable region, or at least a complementarity determining region (CDR), of the antibody is derived from a non-human species. The human light chain constant region may be from either a kappa or lambda light chain, while the human heavy chain constant region may be from either an IgM, an IgG (IgG1, IgG2, IgG3, or IgG4) an IgD, an IgA, or an IgE immunoglobulin.
Methods for humanizing non-human antibodies are well known in the art (see U.S. Pat. Nos. 5,585,089, and 5,693,762). Generally, a humanized antibody has one or more amino acid residues introduced into its framework region from a source which is non-human. Humanization can be performed, for example, using methods described in Jones et al. {Nature 321: 522-525, 1986), Riechmann et al, {Nature, 332: 323-327, 1988) and Verhoeyen et al. Science 239:1534-1536, 1988), by substituting at least a portion of a rodent complementarity-determining region (CDRs) for the corresponding regions of a human antibody. Numerous techniques for preparing engineered antibodies are described, e.g., in Owens and Young, J. Immunol. Meth., 168:149-165, 1994. Further changes can then be introduced into the antibody framework to modulate affinity or immunogenicity.
Likewise, using techniques known in the art to isolate CDRs, compositions comprising CDRs are generated. Complementarity determining regions are characterized by six polypeptide loops, three loops for each of the heavy or light chain variable regions. The amino acid position in a CDR and framework region is set out by Kabat et al., “Sequences of Proteins of Immunological Interest,” U.S. Department of Health and Human Services, (1983), which is incorporated herein by reference. For example, hypervariable regions of human antibodies are roughly defined to be found at residues 28 to 35, from residues 49-59 and from residues 92-103 of the heavy and light chain variable regions (Janeway and Travers, Immunobiology, 2nd Edition, Garland Publishing, New York, 1996). The CDR regions in any given antibody may be found within several amino acids of these approximated residues set forth above. An immunoglobulin variable region also consists of “framework” regions surrounding the CDRs. The sequences of the framework regions of different light or heavy chains are highly conserved within a species, and are also conserved between human and murine sequences.
Compositions comprising one, two, and/or three CDRs of a heavy chain variable region or a light chain variable region of a monoclonal antibody are generated. Polypeptide compositions comprising one, two, three, four, five and/or six complementarity determining regions of a monoclonal antibody secreted by a hybridoma are also contemplated. Using the conserved framework sequences surrounding the CDRs, PCR primers complementary to these consensus sequences are generated to amplify a CDR sequence located between the primer regions. Techniques for cloning and expressing nucleotide and polypeptide sequences are well-established in the art [see e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor, New York (1989)]. The amplified CDR sequences are ligated into an appropriate plasmid. The plasmid comprising one, two, three, four, five and/or six cloned CDRs optionally contains additional polypeptide encoding regions linked to the CDR.
Preferably, the antibody is any antibody specific for a glycan structure of Formula (I) or a fragment thereof. The antibody used in the present invention encompasses any antibody or fragment thereof, either native or recombinant, synthetic or naturally-derived, monoclonal or polyclonal which retains sufficient specificity to bind specifically to the glycan structure according to Formula (I) which is indicative of stem cells. As used herein, the terms “antibody” or “antibodies” include the entire antibody and antibody fragments containing functional portions thereof. The term “antibody” includes any monospecific or bispecific compound comprised of a sufficient portion of the light chain variable region and/or the heavy chain variable region to effect binding to the epitope to which the whole antibody has binding specificity. The fragments can include the variable region of at least one heavy or light chain immunoglobulin polypeptide, and include, but are not limited to, Fab fragments, F(ab′).sub.2 fragments, and Fv fragments.
The antibodies can be conjugated to other suitable molecules and compounds including, but not limited to, enzymes, magnetic beads, colloidal magnetic beads, haptens, fluorochromes, metal compounds, radioactive compounds, chromatography resins, solid supports or drugs. The enzymes that can be conjugated to the antibodies include, but are not limited to, alkaline phosphatase, peroxidase, urease and .beta.-galactosidase. The fluorochromes that can be conjugated to the antibodies include, but are not limited to, fluorescein isothiocyanate, tetramethylrhodamine isothiocyanate, phycoerythrin, allophycocyanins and Texas Red. For additional fluorochromes that can be conjugated to antibodies see Haugland, R. P. Molecular Probes: Handbook of Fluorescent Probes and Research Chemicals (1992-1994). The metal compounds that can be conjugated to the antibodies include, but are not limited to, ferritin, colloidal gold, and particularly, colloidal superparamagnetic beads. The haptens that can be conjugated to the antibodies include, but are not limited to, biotin, digoxigenin, oxazalone, and nitrophenol. The radioactive compounds that can be conjugated or incorporated into the antibodies are known to the art, and include but are not limited to technetium 99m, sup.125 I and amino acids comprising any radionuclides, including, but not limited to .sup.14 C, .sup.3 H and .sup.35 S.
Antibodies to glycan structure(s) of Formula (I) may be obtained from any source. They may be commercially available. Effectively, any means which detects the presence of glycan structure(s) on the stem cells is with the scope of the present invention. An example of such an antibody is a H type 1 (clone 17-206; GF 287) antibody from Abeam.
The methods outlined herein are particularly useful for identifying HSCs or progeny thereof from a population of cells. However, additional markers may be used to further distinguish subpopulations within the general HSC, or stem cell, population.
The various sub-populations may be distinguished by levels of binders to glycan structures of Formula (I) on stem cells. This may manifest on the stem cell surface (or on feeder cell if feeder cell specific binder is used) which may be detected by the methods outlined herein. However, the present invention may be used to distinguish between various phenotypes of the stem cell or HSC population including, but not limited to, the CD34.sup.+, CD38.sup.−, CD90.sup.+ (thy1) and Lin.sup.− cells. Preferably the cells identified are selected from the group including, but not limited to, CD34.sup.+, CD38.sup.−, CD90+ (thy 1), or Lin.sup.−.
The present invention thus encompasses methods of enriching a population for stem and/or HSCs or progeny thereof. The methods involve combining a mixture of HSCs or progeny thereof with an antibody or marker or binding protein/agent or binder that recognizes and binds to glycan structure according to Formula (I) on stem cell(s) under conditions which allow the antibody or marker or binder to bind to glycan structure according to Formula (I) on stem cell(s) and separating the cells recognized by the antibody or marker to obtain a population substantially enriched in stem cells or progeny thereof. The methods can be used as a diagnostic assay for the number of HSCs or progeny thereof in a sample. The cells and antibody or marker are combined under conditions sufficient to allow specific binding of the antibody or marker to glycan structure according to Formula (I) on stem cell(s) which are then quantitated. The HSCs or stem cells or progeny thereof can be isolated or further purified.
As discussed above the cell population may be obtained from any source of stem cells or HSCs or progeny thereof including those samples discussed above.
The detection for the presence of glycan structure(s) according to Formula (I) on stem cell(s) may be conducted in any way to identify glycan structure according to Formula (I) on stem cell(s). Preferably the detection is by use of a marker or binding protein for glycan structure according to Formula (I) on stem cell(s). The binder/marker for glycan structure according to Formula (I) on stem cell(s) may be any of the markers discussed above. However, anti-bodies or binding proteins to glycan structure according to Formula (I) on stem cell(s) are particularly useful as a marker for glycan structure according to Formula (I) on stem cell(s). The above method is also suitable for feeder cell specific glycan structures according to Formula (I) which are useful for positive selection of feeder cells.
Various techniques can be employed to separate or enrich the cells by initially removing cells of dedicated lineage. Monoclonal antibodies, binding proteins and lectins are particularly useful for identifying cell lineages and/or stages of differentiation. The antibodies can be attached to a solid support to allow for crude separation. The separation techniques employed should maximize the retention of viability of the fraction to be collected. Various techniques of different efficacy can be employed to obtain “relatively crude” separations. The particular technique employed will depend upon efficiency of separation, associated cytotoxicity, ease and speed of performance, and necessity for sophisticated equipment and/or technical skill.
Procedures for separation or enrichment can include, but are not limited to, magnetic separation, using antibody-coated magnetic beads, affinity chromatography, cytotoxic agents joined to a monoclonal antibody or used in conjunction with a monoclonal antibody, including, but not limited to, complement and cytotoxins, and “panning” with antibody attached to a solid matrix, e.g., plate, elutriation or any other convenient technique.
The use of separation or enrichment techniques include, but are not limited to, those based on differences in physical (density gradient centrifugation and counter-flow centrifugal elutriation), cell surface (lectin and antibody affinity), and vital staining properties (mitochondria-binding dye rho123 and DNA-binding dye, Hoescht 33342).
Techniques providing accurate separation include, but are not limited to, FACS, which can have varying degrees of sophistication, e.g., a plurality of color channels, low angle and obtuse light scattering detecting channels, impedence channels, etc. Any method which can isolate and distinguish these cells according to levels of expression of glycan structure according to Formula (I) on stem cell(s) may be used.
In a first separation, typically starting with about 1.times.10.sup.10, preferably at about 5.times.10.sup.8-9 cells, antibodies or binding proteins or lectins to glycan structure according to Formula (I) on stem cell(s) can be labeled with at least one fluorochrome, while the antibodies or binding proteins for the various dedicated lineages, can be conjugated to at least one different fluorochrome. While each of the lineages can be separated in a separate step, desirably the lineages are separated at the same time as one is positively selecting for glycan structure according to Formula (I) on stem cell markers. The cells can be selected against dead cells, by employing dyes associated with dead cells (including but not limited to, propidium iodide (PI)).
To further enrich for any cell population, specific markers for those cell populations may be used. For instance, specific markers for specific cell lineages such as lymphoid, myeloid or erythroid lineages may be used to enrich for or against these cells. These markers may be used to enrich for HSCs or progeny thereof by removing or selecting out mesenchymal or keratinocyte stem cells.
The methods described above can include further enrichment steps for cells by positive selection for other stem cell specific markers. Suitable positive stem cell markers include, but are not limited to, SSEA-3, SSEA4, Tra 1-60, CD34.sup.+, Thy-1.sup.+, and c-kit.sup.+. By appropriate selection with particular factors and the development of bioassays which allow for self-regeneration of HSCs or progeny thereof and screening of the HSCs or progeny thereof as to their markers, a composition enriched for viable HSCs or progeny thereof can be produced for a variety of purposes.
Once the stem cells or HSC or progeny thereof population is isolated, further isolation techniques may be employed to isolate sub-populations within the HSCs or progeny thereof. Specific markers including cell selection systems such as FACS for cell lineages may be used to identify and isolate the various cell lineages.
In yet another aspect of the present invention there is provided a method of measuring the content of stem cells or HSC or their progeny said method comprising
obtaining a cell population comprising stem cells or progeny thereof;
combining the cell population with a binding protein or binder for glycan structure according to Formula (I) on stem cell(s) thereof;
selecting for those cells which are identified by the binding protein for glycan structure according to Formula (I) on stem cell(s) thereof, and
quantifying the amount of selected cells relative to the quantity of cells in the cell population prior to selection with the binding protein.
Binder-Label Conjugates
The present invention is specifically directed to the binding of the structures according to the present invention, when the binder is conjugated with “a label structure”. The label structure means a molecule observable in a assay such as for example a fluorescent molecule, a radioactive molecule, a detectable enzyme such as horse radish peroxidase or biotin/streptavidin/avidin. When the labelled binding molecule is contacted with the cells according to the invention, the cells can be monitored, observed and/or sorted based on the presence of the label on the cell surface. Monitoring and observation may occur by regular methods for observing labels such as fluorescence measuring devices, microscopes, scintillation counters and other devices for measuring radioactivity.
Use of Binder and labelled Binder-Conjugates for Cell Sorting
The invention is specifically directed to use of the binders and their labelled conjugates for sorting or selecting human stem cells from biological materials or samples including cell materials comprising other cell types. The preferred cell types includes cord blood, peripheral blood and embryonal stem cells and associated cells. The labels can be used for sorting cell types according to invention from other similar cells. In another embodiment the cells are sorted from different cell types such as blood cells or in context of cultured cells preferably feeder cells, for example in context of embryonal stem cells corresponding feeder cells such as human or mouse feeder cells. A preferred cell sorting method is FACS sorting. Another sorting methods utilized immobilized binder structures and removal of unbound cells for separation of bound and unbound cells.
Use of Immobilized Binder Structures
In a preferred embodiment the binder structure is conjugated to a solid phase. The cells are contacted with the solid phase, and part of the material is bound to surface. This method may be used to separation of cells and analysis of cell surface structures, or study cell biological changes of cells due to immobilization. In the analytics involving method the cells are preferably tagged with or labelled with a reagent for the detection of the cells bound to the solid phase through a binder structure on the solid phase. The methods preferably further include one or more steps of washing to remove unbound cells.
Preferred solid phases include cell suitable plastic materials used in contacting cells such as cell cultivation bottles, petri dishes and microtiter wells; fermentor surface materials
Specific Recognition between Preferred Stem Cells and Contaminating Cells
The invention is further directed to methods of recognizing stem cells from differentiated cells such as feeder cells, preferably animal feeder cells and more preferably mouse feeder cells. It is further realized, that the present reagents can be used for purification of stem cells by any fractionation method using the specific binding reagents.
Preferred fractionation methods includes fluoresce activated cell sorting (FACS), affinity chromatography methods, and bead methods such as magnetic bead methods.
Preferred reagents for recognition between preferred cells, preferably embryonal type cells, and contaminating cells, such as feeder cells, most preferably mouse feeder cells, include reagents according to the Table 31, more preferably proteins with similar specificity with lectins PSA, MAA, and PNA.
The invention is further directed to positive selection methods including specific binding to the stem cell population but not to contaminating cell population. The invention is further directed to negative selection methods including specific binding to the contaminating cell population but not to the stem cell population. In yet another embodiment of recognition of stem cells the stem cell population is recognized together with a homogenous cell population such as a feeder cell population, preferably when separation of other materials is needed. It is realized that a reagent for positive selection can be selected so that it binds stem cells as in the present invention and not to the contaminating cell population and a reagent for negative selection by selecting opposite specificity. In case of one population of cells according to the invention is to be selected from a novel cell population not studied in the present invention, the binding molecules according to the invention maybe used when verified to have suitable specificity with regard to the novel cell population (binding or not binding). The invention is specifically directed to analysis of such binding specificity for development of a new binding or selection method according to the invention.
The preferred specificities according to the invention include recognition of:
Manipulation of Cells by Binders
The invention is specifically directed to manipulation of cells by the specific binding proteins. It is realized that the glycans described have important roles in the interactions between cells and thus binders or binding molecules can be used for specific biological manipulation of cells. The manipulation may be performed by free or immobilized binders. In a preferred embodiment cells are used for manipulation of cell under cell culture conditions to affect the growth rate of the cells.
Preferred Cell Population to be Produced by Glycomodification According to the Present Invention
The present invention is directed to specific cell populations comprising in vitro enzymatically altered glycosylations according to the present invention. It is realized that special structures revealed on cell surfaces have specific targeting, and immune recognition properties with regard to cells carrying the structures. It is realized that sialylated and fucosylated terminal structures such as sialyl-lewis x structures target cells to selectins involved in bone marrow homing of cells and invention is directed to methods to produce such structures on cells surfaces. It is further realized that mannose and galactose terminal structures revealed by the invention target cells to liver and/or to immune recognition, which in most cases are harmful for effective cell therapy, unless liver is not targeted by the cells. NeuGc is target for immune recognition and has harmful effects for survival of cells expressing the glycans.
The invention revealed glycosidase methods for removal of the structures from cell surface while keeping the cells intact. The invention is especially directed to sialyltransferase methods for modification of terminal galactoses. The invention further revealed novel method to remove mannose residues from intact cells by alpha-manosidase.
The invention is further directed to metabolic regulation of glycosylation to alter the glycosylation for reduction of potentially harmful structures.
The present invention is directed to specific cell populations comprising in vitro enzymatically altered sialylation according to the present invention. The preferred cell population includes cells with decreased amount of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in a preferred embodiment decreased amounts of α3-linked sialic acids. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention.
Cell Populations with Altered Sialylated Structures
The invention is further directed to novel cell populations produced from the preferred cell populations according to the invention when the cell population comprises altered sialylation as described by the invention. The invention is specifically directed to cell populations comprising decreased sialylation as described by the invention. The invention is specifically directed to cell populations comprising increased sialylation of specific glycan structures as described by the invention. Furthermore invention is specifically directed to cell populations of specifically altered α3- and or α6-sialylation as described by the invention These cells are useful for studies of biological functions of the cell populations and role of sialylated, linkage specifically sialylated and non-sialylated structures in the biological activity of the cells.
Preferred Cell Populations with Decreased Sialylation
The preferred cell population includes cells with decreased amount of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in a preferred embodiment decreased amounts of α3-linked sialic or α6-linked sialic acid. In a preferred embodiment the cell populations comprise practically only α3-sialic acid, and in another embodiment only α6-linked sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably mesenchymal stem cell, embryonal-type cells or cord blood cell populations according to the invention.
Preferred Cell Populations with Increased Sialylation
The preferred cell population includes cells with increased amount of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in preferred embodiments increased amounts of α3-linked sialic or α6-linked sialic acid. In a preferred embodiment the cell populations comprise practically only α3-sialic acid, and in another embodiment only α6-linked sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably mesenchymal stem cells or embryonal-type cells or cord blood cell populations according to the invention.
Preferred Cell Populations with Altered Sialylation
The preferred cell population includes cells with altered linkage structures of sialic acids on the cell surfaces, preferably decreased from the preferred structures according to the present invention. The altered cell population contains in a preferred embodiments altered amount of α3-linked sialic and/or α6-linked sialic acid. The invention is specifically directed to cell populations having a sialylation level similar to the original cells but the linkages of structures are altered to α3-linkages and in another embodiment the linkages of structures are altered to α6-structures. In a preferred embodiment the cell populations comprise practically only α3-sialic acid, and in another embodiment only α6-linked sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably mesenchymal stem cells or embryonal-type cells or cord blood cell populations according to the invention.
Cell Populations Comprising Preferred Cell Populations with Preferred Sialic Acid Types
The preferred cell population includes cells with altered types of sialic acids on the cell surfaces, preferably on the preferred structures according to the present invention. The altered cell population contains in a preferred embodiment altered amounts of NeuAc and/or NeuGc sialic acid. The invention is specifically directed to cell populations having sialylation levels similar to original cells but the sialic acid structures altered to NeuAc and in another embodiment the sialic acid type structures altered to NeuGc. In a preferred embodiment the cell populations comprise practically only NeuAc, and in another embodiment only NeuGc sialic acids, preferably on the preferred structures according to the invention, most preferably on the preferred N-glycan structures according to the invention. The present invention is preferably directed to the cell populations when the cell populations are produced by the processes according to the present invention. The cell populations with altered sialylation are preferably mesenchymal stem cells or embryonal-type cells or cord blood cell populations according to the invention.
Low-Molecular Weight Glycan Marker Structures and Stem Cell Glycome Components
The invention describes novel low-molecular weight acidic glycan components within the acidic N-glycan and/or soluble glycan fractions with characteristic monosaccharide compositions SAxHex1-2HexNAc1-2, wherein x indicates that the corresponding glycans are preferentially sialylated with one or more sialic acid residues. The inventors realized that such glycans are novel and unusual with respect to N-glycan biosynthesis and described mammalian cell glycan components, as reveal also by the fact that they are classified as “other (N-)glycan types” in N-glycan classification scheme of the present invention. The invention is directed to analyzing, isolating, modifying, and/or binding to these novel glycan components according to the methods and uses of the present invention, and further to other uses of specific marker glycans as described here. As demonstrated in the Examples of the present invention, such glycan components were specific parts of total glycomes of certain cell types and preferentially to certain stem cell types, making their analysis and use beneficial with regard to stem cells. The invention is further directed to stem cell glycomes and subglycomes containing these glycan components.
Stem Cell Nomenclature
The present invention is directed to analysis of all stem cell types, preferably human stem cells. A general nomenclature of the stem cells is described in
Lectins for Manipulation of Stem Cells, Especially Under Cell Culture Conditions
The present invention is especially directed to use of lectins as specific binding proteins for analysis of status of stem cells and/or for the manipulation of stems cells.
The invention is specifically directed to manipulation of stem cells under cell culture conditions growing the stem cells in presence of lectins. The manipulation is preferably performed by immobilized lectins on surface of cell culture vessels. The invention is especially directed to the manipulation of the growth rate of stem cells by growing the cells in the presence of lectins, as show in Table 32.
The invention is in a preferred embodiment directed to manipulation of stem cells by specific lectins recognizing specific glycan marker structures according to invention from the cell surfaces. The invention is in a preferred embodiment directed to use of Gal recognizing lectins such as ECA-lectin or similar human lectins such as galectins for recognition of galectin ligand glycans identified from the cell surfaces. It was further realized that there is specific variations of galectin expression in genomic levels in stem cells, especially for galectins-1, -3, and -8. The present invention is especially directed to methods of testing of these lectins for manipulation of growth rates of embryonal type stem cells and for adult stem cells in bone marrow and blood and differentiating derivatives thereof.
Sorting of Stem Cells by Specific Lectins
The invention revealed use of specific lectin types recognizing cell surface glycan epitopes according to the invention for sorting of stem cells, especially by FACS methods, most preferred cell types to be sorted includes adult stem cells in blood and bone marrow, especially cord blood cells. Preferred lectins for sorting of cord blood cells include GNA, STA, GS-II, PWA, HMA, PSA, RCA, and others as shown in Example 16. The relevance of the lectins for isolating specific stem cell populations was demonstrated by double labeling with known stem cells markers, as described in Example 16.
Preferred Structures of O-Glycan Glycomes of Stem Cells
The present invention is especially directed to following O-glycan marker structures of stem cells:
Core 1 type O-glycan structures following the marker composition NeuAc2Hex1HexNAc1, preferably including structures SAα3Galβ3GalNAc and/or SAα3Galβ3(Saα6)GalNAc; and Core 2 type O-glycan structures following the marker composition NeuAc0-2Hex2HexNAc2dHex0-1, more preferentially further including the glycan series NeuAc0-1Hex2+nHexNAc2+ndHex0-1, wherein n is either 1, 2, or 3 and more preferentially n is 1 or 2, and even more preferentially n is 1;
more specifically preferably including R1Galβ4(R3)GlcNAcβ6(R2Galβ3)GalNAc,
wherein R1 and R2 are independently either nothing or sialic acid residue, preferably α2,3-linked sialic acid residue, or an elongation with HexnHexNAcn, wherein n is independently an integer at least 1, preferably between 1-3, most preferably between 1-2, and most preferably 1, and the elongation may terminate in sialic acid residue, preferably α2,3-linked sialic acid residue; and
R3 is independently either nothing or fucose residue, preferably α1,3-linked fucose residue.
It is realized that these structures correlate with expression of β6GlcNAc-transferases synthesizing core 2 structures.
Preferred branched N-Acetyllactosamine Type Glycosphingolipids
The invention further revealed branched, I-type, poly-N-acetyllactosamines with two terminal Galβ4-residues from glycolipids of human stem cells. The structures correlate with expression of β6GlcNAc-transferases capable of branching poly-N-acetyllactosamines and further to binding of lectins specific for branched poly-N-acetyllactosamines. It was further noticed that PWA-lectin had an activity in manipulation of stem cells, especially the growth rate thereof.
Preferred Qualitative and Quantitative Complete N-Glycomes of Stem Cells
High-Mannose Type and Glucosylated N-Glycans
The present invention is especially directed to glycan compositions (structures) and analysis of high-mannose type and glucosylated N-glycans according to the formula:
Hexn3HexNAcn4,
wherein n3 is 5, 6, 7, 8, 9, 10, 11, or 12, and n4=2.
According to the present invention, within total N-glycomes of stem cells the major high-mannose type and glucosylated N-glycan signals include the compositions with 5≦n3≦10: Hex5HexNAc2 (1257), Hex6HexNAc2 (1419), Hex7HexNAc2 (1581), Hex8HexNAc2 (1743), Hex9HexNAc2 (1905), and Hex10HexNAc2 (2067);
and more preferably with 5≦n3≦9: Hex5HexNAc2 (1257), Hex6HexNAc2 (1419), Hex7HexNAc2 (1581), Hex8HexNAc2 (1743), and Hex9HexNAc2 (1905).
As demonstrated in the present invention by glycan structure analysis, preferably this glycan group in stem cells includes the molecular structure (Manα)8Manβ4GlcNAcβ4GlcNAc within the glycan signal Hex9HexNAc2 (1905), and even more preferably Manα2Manα6(Manα2Manα3)Manα6(Manα2Manα2Manα3)Manβ4GlcNAcβ4GlcNAc.
Low-Mannose Type N-Glycans
The present invention is especially directed to glycan compositions (structures) and analysis of low-mannose type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is 1, 2, 3, or 4, n4=2, and n5 is 0 or 1.
According to the present invention, within total N-glycomes of stem cells the major low-mannose type N-glycan signals preferably include the compositions with 2≦n3≦4: Hex2HexNAc2 (771), Hex3HexNAc2 (933), Hex4HexNAc2 (1095), Hex2HexNAc2dHex (917), Hex3HexNAc2dHex (1079), and Hex4HexNAc2dHex (1241); and more preferably when n5 is 0: Hex2HexNAc2 (771), Hex3HexNAc2 (933), and Hex4HexNAc2 (1095).
As demonstrated in the present invention by glycan structure analysis of stem cells, preferably this glycan group in stem cells includes the molecular structures:
(Manα)hd 1-3Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc within the glycan signals 771, 917, 933, 1079, 1095, and 1095, and
the preferred low-Man structures includes structures common all stem cell types, tri-Man and tetra-Man structures according as indicated in Table 29 (Manα)0-1Manα6(Manα3)Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc, more preferably the tri-Man structures:
Manα6(Manα3)Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc
even more preferably the abundant molecular structure:
Manα6(Manα3)Manβ4GlcNAcβ4GlcNAc within the glycan signal 933.
The invention is further directed to analysis of presence and/or absence of structures varying characteristically between stem cells.
These include fucosylated and nonfucosylated di-Man structures,
specifically associated with certain blood associated stem cells
[Manα6]0-1(Manα3)0-1Manβ4GlcNAcβ4(Fucα6)0-1GlcNAc,
when either of the Manα-residues is present or absent.
The fucosylated structure was observed to be associated with specific blood related adult stem cells while the non-fucosylated structures was observed to have more varying expression in embryonal stem cells, embryoid bodies and more primitive cord blood stem cells (CD133+) and
cord blood mesenchymal cells. It is realized that the both di-Man structures reflect have specific qualitative analytical value with regard to specific cell populations.
Fucosylated High-Mannose Type N-Glycans
The present invention is especially directed to glycan compositions (structures) and analysis of fucosylated high-mannose type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is 5, 6, 7, 8, or 9, n4=2, and n5=1.
According to the present invention, within total N-glycomes of stem cells the major fucosylated high-mannose type N-glycan signal preferentially is the composition Hex5HexNAc2dHex (1403). As demonstrated in the present invention by glycan structure analysis of stem cells, more preferably this glycan signal in stem cells includes the molecular structure (Manα)4Manβ4GlcNAcβ4(Fucα6)GlcNAc.
Neutral Monoantennary or Hybrid-Type N-Glycans
The present invention is especially directed to glycan compositions (structures) and analysis of neutral monoantennary or hybrid-type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is an integer greater or equal to 2, n4=3, and n5 is an integer greater or equal to 0.
According to the present invention, within total N-glycomes of stem cells the major neutral monoantennary or hybrid-type N-glycan signals preferentially include the compositions with 2≦n3≦8 and 0≦n5≦2, more preferentially compositions with 3≦n3≦6 and 0≦n5≦1, with the proviso that when n3=6 also n5=0: Hex3HexNAc3 (1136), Hex3HexNAc3dHex (1282), Hex4HexNAc3 (1298), Hex4HexNAc3dHex (1444), Hex5HexNAc3 (1460), Hex5HexNAc3dHex (1606), and Hex6HexNAc3 (1622).
According to the present invention, the total N-glycomes of cultured human BM MSC, CB MSC, and cells differentiated from them preferentially additionally include the following structures: Hex2HexNAc3dHex (1120), Hex4HexNAc3dHex2 (1590), Hex 5HexNAc3dHex2 (1752), Hex6HexNAc3dHex (1768), and Hex7HexNAc3 (1784).
In a preferred embodiment of the present invention, the N-glycan signal Hex5HexNAc3 (1460), more preferentially also Hex6HexNAc3 (1622), and even more preferentially also Hex5HexNAc3dHex (1606), contain non-reducing terminal Manα.
Neutral Complex-Type N-Glycans
The present invention is especially directed to glycan compositions (structures) and analysis of neutral complex-type N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n3 is an integer greater or equal to 3, n4 is an integer greater or equal to 4, and n5 is an integer greater or equal to 0.
Within the total N-glycomes of stem cells the major neutral complex-type N-glycan signals preferentially include the compositions with 3≦n3≦8, 4≦n4≦7, and 0≦n5≦4, more preferentially the compositions with 3≦n3≦5, n4=4, and 0≦n5≦1, with the proviso that when n3 is 3 or 4, then n5=1: Hex3HexNAc4dHex (1485), Hex4HexNAc4dHex (1647), Hex5HexNAc4 (1663), Hex5HexNAc4dHex (1809); and even more preferentially also including the composition Hex3HexNAc5dHex (1688).
In another embodiment of the present invention, the total N-glycomes of cultured human BM MSC, CB MSC, and cells differentiated from them preferentially include in the major neutral complex-type N-glycan signals the compositions with 3≦n3≦5, n3=4, and 0≦n5≦1, as well as the compositions with 5≦n4≦6, n3=n4+1, and 0≦n5≦1, and even more preferentially also including the composition Hex3HexNAc5dHex: Hex3HexNAc4 (1339), Hex3HexNAc4dHex (1485), Hex4HexNAc4 (1501), Hex4HexNAc4dHex (1647), Hex5HexNAc4 (1663), Hex5HexNAc4dHex (1809), Hex6HexNAc5 (2028), Hex6HexNAc5dHex (2174), Hex7HexNAc6 (2393), Hex7HexNAc6dHex (2539), and Hex3HexNAc5dHex (1688).
In another embodiment of the present invention, the total N-glycomes of cultured hESC and cells differentiated from them preferentially further include in the major neutral complex-type N-glycan signal Hex4HexNAc5dHex (1850).
In another embodiment of the present invention, the N-glycan signal Hex3HexNAc4dHex (1485) contains non-reducing terminal GlcNAcβ, and more preferentially the total N-glycome includes the structure:
GlcNAcβ2Manα3(GlcNAcβ2Manα6)Manβ4GlcNAcβ4(Fucα6)GlcNAc (1485).
In yet another embodiment of the present invention, within the total N-glycome of stem cells, the N-glycan signal Hex5HexNAc4dHex (1809), more preferentially also Hex5HexNAc4 (1663), contain non-reducing terminal β1,4-Gal. Even more preferentially the total N-glycome includes the structure:
Galβ4GlcNAcβ2Manα3(Galβ4GlcNAcβ2Manα6)Manβ4GlcNAcβ4GlcNAc (1663); and in a further preferred embodiment the total N-glycome includes the structure:
Galβ4GlcNAcβ2Manα3(Galβ4GlcNAcβ2Manα6)Manβ4GlcNAcβ4(Fucα)GlcNAc (1809).
Neutral Fucosylated N-glycans
The present invention is especially directed to glycan compositions (structures) and analysis of neutral fucosylated N-glycans according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n5 is an integer greater than or equal to 1.
Within the total N-glycomes of stem cells the major neutral fucosylated N-glycan signals preferentially include glycan compositions wherein 1≦n5≦4, more preferentially 1≦n5≦3, even more preferentially 1≦n5≦2, and further more preferentially compositions Hex3HexNAc2dHex (1079), more preferentially also Hex2HexNAc2dHex (917), and even more preferentially also Hex5HexNAc4dHex (1809).
The inventors further found that within the total N-glycomes of stem cells a major fucosylation form is N-glycan core α1,6-fucosylation. In a preferred embodiment of the present invention, major fucosylated N-glycan signals contain GlcNAcβA(Fucα6)GlcNAc reducing end sequence.
The inventors further found that stem cell total N-glycomes contain α1,2-Fuc, α1,3-Fuc, and/or α1,4-Fuc epitopes in a differentiation stage dependent manner. In a preferred embodiment of the present invention, major fucosylated N-glycan signals of stem cells contain α1,2-Fuc, α1,3-Fuc, and/or α1,4-Fuc epitopes, more preferentially in multifucosylated N-glycans, wherein 2≦n5≦4.
Within the total N-glycomes of BM and CB MSC the major neutral multifucosylated N-glycan signals preferentially include the composition Hex5HexNAc4dHex2 (1955), more preferentially also Hex5HexNAc4dHex3 (2101), even more preferentially also Hex4HexNAc3dHex2 (1590), and further more preferentially also Hex6HexNAc5dHex2 (2320).
Within the total N-glycomes of hESC the major neutral multifucosylated N-glycan signals preferentially include the composition Hex5HexNAc4dHex2 (1955), more preferentially also Hex5HexNAc4dHex3 (2101), even more preferentially also Hex4HexNAc5dHex2 (1996), and further more preferentially also Hex4HexNAc5dHex3 (2142).
Neutral N-glycans with Non-Reducing Terminal HexNAc
The present invention is especially directed to glycan compositions (structures) and analysis of neutral N-glycans with non-reducing terminal HexNAc according to the formula:
Hexn3HexNAcn4dHexn5,
wherein n4≧n3.
Preferably these glycan signals include Hex3HexNAc4dHex (1485) in all stem cell types; additionally preferably including Hex3HexNAc4 (1339), Hex3HexNAc4 (1339), and/or Hex3HexNAc5 (1542) in CB and BM MSC as well as cells differentiated directly from them; additionally preferably including Hex4HexNAx5 (1704), Hex4HexNAc5dHex (1850), and/or Hex4HexNAc5dHex2 (1996) in hESC and cells differentiated directly from them; additionally preferably including Hex5HexNAc5 (1866) and/or Hex5HexNAc5dHex (2012) in EB and st.3 differentiated cells (from hESC), as well as adipocyte and osteoblast differentiated cells (from CB MSC and BM MSC, respectively).
Acidic Hybrid-Type or Monoantennary N-glycans
The present invention is especially directed to glycan compositions (structures) and analysis of acidic hybrid-type or monoantennary N-glycans according to the formula:
NeuAcn1NeuGcn2Hexn3HexNAcn4dHexn5SPn6,
wherein n1 and n2 are either independently 1, 2, or 3; n3 is an integer between 3-9; n4 is 3; n5 is an integer between 0-3; and n6 is an integer between 0-2; with the proviso that the sum n1+n2+n6 is at least 1.
Within the total N-glycomes of stem cells the major acidic hybrid-type or monoantennary N-glycan signals preferentially include glycan compositions wherein 3≦n3≦6, more preferentially 3≦n5≦5, and further more preferentially compositions NeuAcHex4HexNAc3dHex (1711), preferentially also NeuAcHex5HexNAc3dHex (1873).
Acidic Complex-Type N-glycans
The present invention is especially directed to glycan compositions (structures) and analysis of acidic complex-type N-glycans according to the formula:
NeuAcn1NeuGcn2Hexn3HexNAcn4dHexn5SPn6,
wherein n1 and n2 are either independently 1, 2, 3, or 4; n3 is an integer between 3-10; n4 is an integer between 4-9; n5 is an integer between 0-5; and n6 is an integer between 0-2; with the proviso that the sum n1+n2+n6 is at least 1.
Within the total N-glycomes of stem cells the major acidic complex-type N-glycan signals preferentially include glycan compositions wherein 4≦n4≦8, more preferentially 4≦n4≦6, more preferentially 4≦n4≦5, and further more preferentially compositions NeuAcHex5HexNAc4 (1930), NeuAcHex5HexNAc4dHex (2076), NeuAc2Hex5HexNAc4 (2221), NeuAcHex5HexNAc4Hex2 (2222), and NeuAc2Hex5HexNAc4dHex (2367); further more preferentially also NeuAc2Hex6HexNAc5dHex (2732), and more preferentially also NeuAcHex5HexNAc5dHex (2279);
and in BM and CB MSC as well as cells directly differentiated from them, further more preferentially also NeuAc2Hex6HexNAc5 (2586) and more preferentially also NeuAc2Hex7HexNAc6 (2952).
Modified Glycan Types
The inventors found that stem cell total N-glycomes; and soluble+N-glycomes further contain characteristic modified glycan signals, including sialylated fucosylated N-glycans, multifucosylated glycans, sialylated N-glycans with terminal HexNAc (the N>H and N═H subclasses), and sulphated or phosphorylated N-glycans, which are subclasses of the abovementioned glycan classes. According to the present invention, their quantitative proportions in different stem cell types have characteristic values as described in Table 33.
Phosphorylated and Sulphated Glycans
Specifically, major phosphorylated glycans typical to stem cells include Hex5HexNAc2(HPO3) (1313), Hex6HexNAc2(HPO3) (1475), and Hex7HexNAc2(HPO3) (1637);
and major sulphated glycans typical to stem cells include Hex5HexNAc4dHex(SO3) (1865) and more preferentially also Hex6HexNAc3(SO3) (1678).
According to the present invention, their quantitative proportions in different stem cell types preferentially have characteristic values as described in Table 33.
Preferred Binders for Stem Cell Sorting and Isolation
As described in the Examples, the inventors found that especially the mannose-specific and especially α1,3-linked mannose-binding lectin GNA was suitable for negative selection enrichment of CD34+ stem cells from CB MNC. In addition, the poly-LacNAc specific lectin STA and the fucose-specific and especially α1,2-linked fucose-specific lectin UEA were suitable for positive selection enrichment of CD34+ stem cells from CB MNC.
The present invention is specifically directed to stem cell binding reagents, preferentially proteins, preferentially mannose-binding or α1,3-linked mannose-binding, poly-LacNAc binding, LacNAc-binding, and/or fucose- or preferentially α1,2-linked fucose-binding; in a preferred embodiment stem cell binding or nonbinding lectins, more preferentially GNA, STA, and/or UEA; and in a further preferred embodiment combinations thereof; to uses described in the present invention taking advantage of glycan-binding reagents that selectively either bind to or do not bind to stem cells.
Preferred Uses for Stem Cell Type Specific Galectins and/or Galectin Ligands
As described in the Examples, the inventors also found that different stem cells have distinct galectin expression profiles and also distinct galectin (glycan) ligand expression profiles. The present invention is further directed to using galactose-binding reagents, preferentially galactose-binding lectins, more preferentially specific galectins; in a stem cell type specific fashion to modulate or bind to certain stem cells as described in the present invention to the uses described. In a further preferred embodiment, the present invention is directed to using galectin ligand structures, derivatives thereof, or ligand-mimicking reagents to uses described in the present invention in stem cell type specific fashion.
Examples of Cell Sample Production
Cord Blood Derived Mesenchymal Stem Cell Lines
Collection of umbilical cord blood. Human term umbilical cord blood (UCB) units were collected after delivery with informed consent of the mothers and the UCB was processed within 24 hours of the collection. The mononuclear cells (MNCs) were isolated from each UCB unit diluting the UCB 1:1 with phosphate-buffered saline (PBS) followed by Ficoll-Paque Plus (Amersham Biosciences, Uppsala, Sweden) density gradient centrifugation (400 g/40 min). The mononuclear cell fragment was collected from the gradient and washed twice with PBS.
Umbilical cord blood cell isolation and culture. CD45/Glycophorin A (GlyA) negative cell selection was performed using immunolabeled magnetic beads (Miltenyi Biotec). MNCs were incubated simultaneously with both CD45 and GlyA magnetic microbeads for 30 minutes and negatively selected using LD columns following the manufacturer's instructions (Miltenyi Biotec). Both CD45/GlyA negative elution fraction and positive fraction were collected, suspended in culture media and counted. CD45/GlyA positive cells were plated on fibronectin (FN) coated six-well plates at the density of 1×106/cm2. CD45/GlyA negative cells were plated on FN coated 96-well plates (Nunc) about 1×104 cells/well. Most of the non-adherent cells were removed as the medium was replaced next day. The rest of the non-adherent cells were removed during subsequent twice weekly medium replacements.
The cells were initially cultured in media consisting of 56% DMEM low glucose (DMEM-LG, Gibco, http://www.invitrogen.com) 40% MCDB-201 (Sigma-Aldrich) 2% fetal calf serum (FCS), 1×penicillin-streptomycin (both form Gibco), 1×ITS liquid media supplement (insulin-transferrin-selenium), 1×linoleic acid-BSA, 5×10−8 M dexamethasone, 0.1 mM L-ascorbic acid-2-phosphate (all three from Sigma-Aldrich), 10 nM PDGF (R&D systems, http://www.RnDSystems.com) and 10 nM EGF (Sigma-Aldrich). In later passages (after passage 7) the cells were also cultured in the same proliferation medium except the FCS concentration was increased to 10%.
Plates were screened for colonies and when the cells in the colonies were 80-90% confluent the cells were subcultured. At the first passages when the cell number was still low the cells were detached with minimal amount of trypsin/EDTA (0.25%/1 mM, Gibco) at room temperature and trypsin was inhibited with FCS. Cells were flushed with serum free culture medium and suspended in normal culture medium adjusting the serum concentration to 2 %. The cells were plated about 2000-3000/cm2. In later passages the cells were detached with trypsin/EDTA from defined area at defined time points, counted with hematocytometer and replated at density of 2000-3000 cells/cm2.
Bone Marrow Derived Mesenchymal Stem Cell Lines
Isolation and culture of bone marrow derived stem cells. Bone marrow (BM)-derived MSCs were obtained as described by Leskelä et al. (2003). Briefly, bone marrow obtained during orthopedic surgery was cultured in Minimum Essential Alpha-Medium (α-MEM), supplemented with 20 mM HEPES, 10% FCS, 1×penicillin-streptomycin and 2 mM L-glutamine (all from Gibco). After a cell attachment period of 2 days the cells were washed with Ca2+ and Mg2+ free PBS (Gibco), subcultured further by plating the cells at a density of 2000-3000 cells/cm2 in the same media and removing half of the media and replacing it with fresh media twice a week until near confluence.
Experimental Procedures
Flow cytometric analysis of mesenchymal stem cell phenotype. Both UBC and BM derived mesenchymal stem cells were phenotyped by flow cytometry (FACSCalibur, Becton Dickinson). Fluorescein isothicyanate (FITC) or phycoerythrin (PE) conjugated antibodies against CD13, CD14, CD29, CD34, CD44, CD45, CD49e, CD73 and HLA-ABC (all from BD Biosciences, San Jose, Calif., http://www.bdbiosciences.com), CD105 (Abeam Ltd., Cambridge, UK, http://www.abcam.com) and CD133 (Miltenyi Biotec) were used for direct labeling. Appropriate FITC- and PE-conjugated isotypic controls (BD Biosciences) were used. Unconjugated antibodies against CD90 and HLA-DR (both from BD Biosciences) were used for indirect labeling. For indirect labeling FITC-conjugated goat anti-mouse IgG antibody (Sigma-aldrich) was used as a secondary antibody.
The UBC derived cells were negative for the hematopoietic markers CD34, CD45, CD14 and CD133. The cells stained positively for the CD13 (aminopeptidase N), CD29 (β1-integrin), CD44 (hyaluronate receptor), CD73 (SH3), CD90 (Thy1), CD105 (SH2/endoglin) and CD 49e. The cells stained also positively for HLA-ABC but were negative for HLA-DR. BM-derived cells showed to have similar phenotype. They were negative for CD14, CD34, CD45 and HLA-DR and positive for CD13, CD29, CD44, CD90, CD105 and HLA-ABC.
Adipogenic differentiation. To assess the adipogenic potential of the UCB-derived MSCs the cells were seeded at the density of 3×103/cm2 in 24-well plates (Nunc) in three replicate wells. UCB-derived MSCs were cultured for five weeks in adipogenic inducing medium which consisted of DMEM low glucose, 2% FCS (both from Gibco), 10 μg/ml insulin, 0.1 mM indomethacin, 0.1 μM dexamethasone (Sigma-Aldrich) and penicillin-streptomycin (Gibco) before samples were prepared for glycome analysis. The medium was changed twice a week during differentiation culture.
Osteogenic differentiation. To induce the osteogenic differentiation of the BM-derived MSCs the cells were seeded in their normal proliferation medium at a density of 3×103/cm2 on 24-well plates (Nunc). The next day the medium was changed to osteogenic induction medium which consisted of α-MEM (Gibco) supplemented with 10% FBS (Gibco), 0.1 μM dexamethasone, 10 mM β-glycerophosphate, 0.05 mM L-ascorbic acid-2-phosphate (Sigma-Aldrich) and penicillin-streptomycin (Gibco). BM-derived MSCs were cultured for three weeks changing the medium twice a week before preparing samples for glycome analysis.
Cell harvesting for glycome analysis. 1 ml of cell culture medium was saved for glycome analysis and the rest of the medium removed by aspiration. Cell culture plates were washed with PBS buffer pH 7.2. PBS was aspirated and cells scraped and collected with 5 ml of PBS (repeated two times). At this point small cell fraction (10 μl) was taken for cell-counting and the rest of the sample centrifuged for 5 minutes at 400 g. The supernatant was aspirated and the pellet washed in PBS for an additional 2 times.
The cells were collected with 1.5 ml of PBS, transferred from 50 ml tube into 1.5 ml collection tube and centrifuged for 7 minutes at 5400 rpm. The supernatant was aspirated and washing repeated one more time. Cell pellet was stored at −70° C. and used for glycome analysis.
Lectin stainings. FITC-labeled Maackia amurensis agglutinin (MAA) was purchased from EY Laboratories (USA) and FITC-labeled Sambucus nigra agglutinin (SNA) was purchased from Vector Laboratories (UK). Bone marrow derived mesenchymal stem cell lines were cultured as described above. After culturing, cells were rinsed 5 times with PBS (10 mM sodium phosphate, pH 7.2, 140 mM NaCl) and fixed with 4% PBS-buffered paraformaldehyde pH 7.2 at room temperature (RT) for 10 minutes. After fixation, cells were washed 3 times with PBS and non-specific binding sites were blocked with 3% HSA-PBS (FRC Blood Service, Finland) or 3% BSA-PBS (>99% pure BSA, Sigma) for 30 minutes at RT. According to manufacturers' instructions cells were washed twice with PBS, TBS (20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 10 mM CaCl2) or HEPES-buffer (10 mM HEPES, pH 7.5, 150 mM NaCl) before lectin incubation. FITC-labeled lectins were diluted in 1% HSA or 1% BSA in buffer and incubated with the cells for 60 minutes at RT in the dark. Furthermore, cells were washed 3 times 10 minutes with PBS/TBS/HEPES and mounted in Vectashield mounting medium containing DAPI-stain (Vector laboratories, UK). Lectin stainings were observed with Zeiss Axioskop 2 plus-fluorescence microscope (Carl Zeiss Vision GmbH, Germany) with FITC and DAPI filters. Images were taken with Zeiss AxioCam MRc-camera and with AxioVision Software 3.1/4.0 (Carl Zeiss) with the 400× magnification.
Results
Glycan isolation from mesenchymal stem cell populations. The present results are produced from two cord blood derived mesenchymal stem cell lines and cells induced to differentiate into adipogenic direction, and two marrow derived mesenchymal stem cell lines and cells induced to differentiate into osteogenic direction. The caharacterization of the cell lines and differentiated cells derived from them are described above. N-glycans were isolated from the samples, and glycan profiles were generated from MALDI-TOF mass spectrometry data of isolated neutral and sialylated N-glycan fractions as described in the preceding examples.
Cord Blood Derived Mesenchymal Stem cell (CB MSC) Lines
Neutral N-glycan structural features. Neutral N-glycan groupings proposed for the two CB MSC lines resemble each other closely, indicating that there are no major differences in their neutral N-glycan structural features. However, CB MSCs differ from the CB mononuclear cell populations, and they have for example relatively high amounts of neutral complex-type N-glycans, as well as hybrid-type or monoantennary neutral N-glycans, compared to other structural groups in the profiles.
Identification of soluble glycan components. Similarly to CB mononuclear cell populations, in the present analysis neutral glycan components were identified in all the cell types that were assigned as soluble glycans based on their proposed monosaccharide compositions including components from the glycan group Hex2-12HexNAc1 (see Figures). The abundancies of these glycan components in relation to each other and in relation to the other glycan signals vary between individual samples and cell types.
Sialylated N-glycan profiles. Sialylated N-glycan profiles obtained from two CB MSC lines resemble closely each other with respect to their overall sialylated N-glycan profiles. However, minor differences between the profiles are observed, and some glycan signals can only be observed in one cell line, indicating that the two cell lines have glycan structures that differ them from each other. The analysis revealed in each cell type the relative proportions of about 50-70 glycan signals that were assigned as acidic N-glycan components. Typically, significant differences in the glycan profiles between cell populations are consistent throughout multiple experiments.
Differentiation-associated changes in glycan profiles. Neutral N-glycan profiles of CB MSCs change upon differentation in adipogenic cell culture medium. The present results indicate that relative abundancies of several individual glycan signals as well as glycan signal groups change due to cell culture in differentiation medium. The major change in glycan structural groups associated with differentation is increase in amounts of neutral complex-type N-glycans, such as signals at m/z 1663 and m/z 1809, corresponding to the Hex5HexNAc4 and Hex5HexNAc4dHex1 monosaccharide compositions, respectively. Changes were also observed in sialylated glycan profiles.
Glycosidase analyses of neutral N-glycans. Specific exoglycosidase digestions were performed on isolated neutral N-glycan fractions from CB MSC lines as described in Examples. The results of α-mannosidase analysis show in detail which of the neutral N-glycan signals in the neutral N-glycan profiles of CB MSC lines are susceptible to α-mannosidase digestion, indicating for the presence of non-reducing terminal α-mannose residues in the corresponding glycan structures. As an example, the major neutral N-glycan signals at m/z 1257, 1419, 1581, 1743, and 1905, which were preliminarily assigned as high-mannose type N-glycans according to their proposed monosaccharide compositions Hex5-9HexNAc2, were shown to contain terminal α-mannose residues thus confirming the preliminary assignment. The results indicate for the presence of non-reducing terminal β1,4-galactose residues in the corresponding glycan structures. As an example, the major neutral complex-type N-glycan signals at m/z 1663 and m/z 1809 were shown to contain terminal β1,4linked galactose residues.
Bone Marrow Derived Mesenchymal Stem Cell (BM MSC) Lines
Neutral N-glycan profiles and differentiation-associated changes in glycan profiles. Neutral N-glycan profiles obtained from a BM MSC line, grown in proliferation medium and in osteogenic medium resemble CB MSC lines with respect to their overall neutral N-glycan profiles. However, differences between cell lines derived from the two sources are observed, and some glycan signals can only be observed in one cell line, indicating that the cell lines have glycan structures that differ them from each other. The major characteristic structural feature of BM MSCs is even more abundant neutral complex-type N-glycans compared to CB MSC lines. Similarly to CB MSCs, these glycans were also the major increased glycan signal group upon differentiation of BM MSCs. The analysis revealed in each cell type the relative proportions of about 50-70 glycan signals that were assigned as non-sialylated N-glycan components. Typically, significant differences in the glycan profiles between cell populations are consistent throughout multiple experiments.
Sialylated N-glycan profiles. Sialylated N-glycan profiles obtained from a BM MSC line, grown in proliferation medium and in osteogenic medium. The undifferentiated and differentiated cells resemble closely each other with respect to their overall sialylated N-glycan profiles. However, minor differences between the profiles are observed, and some glycan signals can only be observed in one cell line, indicating that the two cell types have glycan structures that differ them from each other. The analysis revealed in each cell type the relative proportions of about 50 glycan signals that were assigned as acidic N-glycan components. Typically, significant differences in the glycan profiles between cell populations are consistent throughout multiple experiments.
Sialidase analysis. The sialylated N-glycan fraction isolated from BM MSCs was digested with broad-range sialidase as described in the preceding Examples. After the reaction, it was observed by MALDI-TOF mass spectrometry that the vast majority of the sialylated N-glycans were desialylated and transformed into corresponding neutral N-glycans, indicating that they had contained sialic acid residues (NeuAc and/or NeuGc) as suggested by the proposed mono saccharide compositions. Glycan profiles of combined neutral and desialylated (originally sialylated) N-glycan fractions of BM MSCs grown in proliferation medium and in osteogenic medium correspond to total N-glycan profiles isolated from the cell samples (in desialylated form). It is calculated that in undifferentiated BM MSCs (grown in osteogenic medium), approximately 53% of the N-glycan signals correspond to high-mannosc type N-glycan monosaccharide compositions, 8% to low-mannose type N-glycans, 31% to complex-type N-glycans, and 7% to hybrid-type or monoantennary N-glycan monosaccharide compositions. In differentiated BM MSCs (grown in osteogenic medium), approximately 28% of the N-glycan signals correspond to high-mannose type N-glycan monosaccharide compositions, 9% to low-mannose type N-glycans, 50% to complex-type N-glycans, and 11% to hybrid-type or monoantennary N-glycan monosaccharide compositions.
Lectin binding analysis of mesenchymal stem cells. As described under Experimental procedures, bone marrow derived mesenchymal stem cells were analyzed for the presence of ligands of α2,3-linked sialic acid specific (MAA) and α2,6-linked sialic acid specific (SNA) lectins on their surface. It was revealed that MAA bound strongly to the cells whereas SNA bound weakly, indicating that in the cell culture conditions, the cells had significantly more α2,3-linked than α2,6-linked sialic acids on their surface glycoconjugates. The present results suggest that lectin staining can be used as a further means to distinguish different cell types and complements mass spectrometric profiling results.
Detection of Potential Glycan Contaminations from Cell Culture Reagents
In the sialylated N-glycan profiles of MSC lines, specific N-glycan signals were observed that indicated contamination of mesenchymal stem cell glycoconjugates by abnormal sialic acid residues. First, when the cells were cultured in cell culture media with added animal sera, such as bovine of equine sera, potential contamination by N-glycolylneuraminic acid (Neu5Gc) was detected. The glycan signals at m/z 1946, corresponding to the [M−H]− ion of NeuGc1Hex5HexNAc4, as well as m/z 2237 and m/z 2253, corresponding to the [M−H]− ions of NeuGc1NeuAc1Hex5HexNAc4 and NeuGc2Hex5HexNAc4, respectively, were indicative of the presence of Neu5Gc, i.e. a sialic acid residue with 16 Da larger mass than N-acetylneuraminic acid (Neu5Ac). Moreover, when the cells were cultured in cell culture media with added horse serum, potential contamination by O-acetylated sialic acids was detected. Diagnostic signals used for detection of O-acetylated sialic acid containing sialylated N-glycans included [M−H]− ions of Ac1NeuAc1Hex5HexNAc4, Ac1NeuAc2Hex5HexNAc4, and Ac2NeuAc2Hex5HexNAc4, at calculated m/z 1972.7, 2263.8, and 2305.8, respectively.
Conclusions
Uses of the glycan profiling method. The results indicate that the present glycan profiling method can be used to differentiate CB MSC lines and BM MSC lines from each other, as well as from other cell types such as cord blood mononuclear cell populations. Differentation-induced changes as well as potential glycan contaminations from e.g. cell culture media can also be detected in the glycan profiles, indicating that changes in cell status can be detected by the present method. The method can also be used to detect MSC-specific glycosylation features including those discussed below.
Differences in glycosylation between cultured cells and native human cells. The present results indicate that BM MSC lines have more high-mannose type N-glycans and less low-mannose type N-glycans compared to the other N-glycan structural groups than mononuclear cells isolated from cord blood. Taken together with the results obtained from cultured human embryonal stem cells in the following Examples, it is indicated that this is a general tendency of cultured stem cells compared to native isolated stem cells. However, differentiation of BM MSCs in osteogenic medium results in significantly increased amounts of complex-type N-glycans and reduction in the amounts of high-mannose type N-glycans.
Mesenchymal stem cell line specific glycosylation features. The present results indicate that mesenchymal stem cell lines differ from the other cell types studied in the present study with regard to specific features of their glycosylation, such as:
Examples of Cell Material Production
Human Embryonic Stem Cell Lines (hESC)
Undifferentiated hESC. Processes for generation of hESC lines from blastocyst stage in vitro fertilized excess human embryos have been described previously (e.g. Thomson et al., 1998). Two of the analysed cell lines in the present work were initially derived and cultured on mouse embryonic fibroblasts feeders (MEF; 12-13 pc fetuses of the ICR strain), and two on human foreskin fibroblast feeder cells (HFF; CRL-2429 ATCC, Mananas, USA). For the present studies all the lines were transferred on HFF feeder cells treated with mitomycin-C (1 μg/ml; Sigma-Aldrich) and cultured in serum-free medium (Knockout™ D-MEM; Gibco® Cell culture systems, Invitrogen, Paisley, UK) supplemented with 2 mM L-Glutamin/Penicillin streptomycin (Sigma-Aldrich), 20% Knockout Serum Replacement (Gibco), 1×non-essential amino acids (Gibco), 0.1 mM β-mercaptoethanol (Gibco), 1×ITSF (Sigma-Aldrich) and 4 ng/ml bFGF (Sigma/Invitrogen).
Stage 2 differentiated hESC (embryoid bodies). To induce the formation of embryoid bodies (EB) the hESC colonies were first allowed to grow for 10-14 days whereafer the colonies were cut in small pieces and transferred on non-adherent Petri dishes to form suspension cultures. The formed EBs were cultured in suspension for the next 10 days in standard culture medium (see above) without bFGF.
Stage 3 differentiated hESC. For further differentiation EBs were transferred onto gelatin-coated (Sigma-Aldrich) adherent culture dishes in media consisting of DMEM/F12 mixture (Gibco) supplemented with ITS, Fibronectin (Sigma), L-glutamine and antibiotics. The attached cells were cultured for 10 days whereafter they were harvested.
Sample preparation. The cells were collected mechanically, washed, and stored frozen prior to glycan analysis.
Results
Neutral N-glycan profiles—effect of differentiation status. Neutral N-glycan profiles obtained from a human embryonal stem cell (hESC) line, its embryoid body (EB) differentiated form, and its stage 3 (st.3) differentiated form. Although the cell types resemble each other with respect to the major neutral N-glycan signals, the neutral N-glycan profiles of the two differentiated cell forms differ significantly from the undifferentiated hESC profile. In fact, the farther differentiated the cell type is, the more its neutral N-glycan profile differs from the undifferentiated hESC profile. Multiple differences between the profiles are observed, and many glycan signals can only be observed in one or two out of three cell types, indicating that differentiation induces the appearance of new glycan types. The analysis revealed in each cell type the relative proportions of about 40-55 glycan signals that were assigned as non-sialylated N-glycan components. Typically, significant differences in the glycan profiles between cell populations arc consistent throughout multiple experiments.
Neutral N-glycan profiles—comparison of hESC lines. Neutral N-glycan profiles obtained from four hESC lines closely resemble each other. Individual profile characteristics and cell line specific glycan signals are present in the glycan profiles, but it is concluded that hESC lines resemble more each other with respect to their neutral N-glycan profiles and are different from differentiated EB and st.3 cell types. hESC lines 3 and 4 are derived from sibling embryos, and their neutral N-glycan profiles resemble more each other and are different from the two other cell lines, i.e. they contain common glycan signals. The analysis revealed in each cell type the relative proportions of about 40-55 glycan signals that were assigned as non-sialylated N-glycan components. Typically, significant differences in the glycan profiles between cell populations are consistent throughout multiple experiments.
Neutral N-glycan structural features. Neutral N-glycan groupings proposed for analysed cell types are presented in Table 7. Again, the analysed three major cell types, namely undifferentiated hESCs, differentiated cells, and human fibroblast feeder cells, differ from each other significantly. Within each cell type, however, there are minor differences between individual cell lines. Moreover, differentiation-associated neutral N-glycan structural features are expressed more strongly in st.3 differentiated cells than in EB cells. Cell-type specific glycosylation features are discussed below in Conclusions.
Glycosidase analysis of neutral N-glycan fractions. Specific exoglycosidase digestions were performed on isolated neutral N-glycan fractions from hESC lines as described in the preceding Examples. In α-mannosidase analysis, several neutral glycan signals were shown to be susceptible to α-mannosidase digestion, indicating for potential presence of non-reducing terminal α-mannose residues in the corresponding glycan structures. In hESC and EB cells, these signals included m/z 917, 1079, 1095, 1241, 1257, 1378, 1393, 1403, 1444, 1555, 1540, 1565, 1581, 1606, 1622, 1688, 1743, 1768, 1905, 1996, 2041, 2067, 2158, and 2320. In β1,4-galactosidase analysis, several neutral glycan signals were shown to be susceptible to β1,4-galactosidase digestion, indicating for potential presence of non-reducing terminal β1,4-galactose residues in the corresponding glycan structures. In hESC and EB cells, these signals included m/z 609, 771, 892, 917, 1241, 1378, 1393, 1555, 1565, 1606, 1622, 1647, 1663, 1704, 1809, 1850, 1866, 1955, 1971, 1996, 2012, 2028, 2041, 2142, 2174, and 2320. In α1,3/4-fucosidase analysis, several neutral glycan signals were shown to be susceptible to α1,3/4-fucosidase digestion, indicating for potential presence of non-reducing terminal α1,3- and/or α1,4-fucose residues in the corresponding glycan structures. In hESC and EB cells, these signals included m/z 1120, 1590, 1784, 1793, 1955, 1996, 2101, 2117, 2142, 2158, 2190, 2215, 2247, 2263, 2304, 2320, 2393, and 2466.
Identification of soluble glycan components. Similarly to the cell types described in the preceding examples, in the present analysis neutral glycan components were identified in all the cell types that were assigned as soluble glycans based on their proposed monosaccharide compositions including components from the glycan group Hex2-12HexNAc1 (see Figures). The abundancies of these glycan components in relation to each other and in relation to the other glycan signals vary between individual samples and cell types.
Sialylated N-glycan profiles—effect of differentiation status. Sialylated N-glycan profiles obtained from a human embryonal stem cell (hESC) line, its embryoid body (EB) differentiated form, and its stage 3 (st.3) differentiated form. Although the cell types resemble each other with respect to the major sialylated N-glycan signals, the sialylated N-glycan profiles of the two differentiated cell forms differ significantly from the undifferentiated hESC profile. In fact, the farther differentiated the cell type is, the more its sialylated N-glycan profile differs from the undifferentiated hESC profile. Multiple differences between the profiles are observed, and many glycan signals can only be observed in one or two out of three cell types, indicating that differentiation induces the appearance of new glycan types as well as decrease in amounts of stem cell specific glycan types. For example, there is significant differentation-associated deercase in relative amounts of glycan signals at m/z 1946 and 2222, corresponding to monosaccharide compositions NeuGc1Hex5HexNAc4 and NeuAc1Hex5HexNAc4dHex2, respectively. The analysis revealed in each cell type the relative proportions of about 50-70 glycan signals that were assigned as acidic N-glycan components. Typically, significant differences in the glycan profiles between cell populations are consistent throughout multiple experiments.
Sialylated N-glycan profiles—comparison of hESC lines. Sialylated N-glycan profiles obtained from four hESC lines closely resemble each other. Individual profile characteristics and cell line specific glycan signals are present in the glycan profiles, but it is concluded that hESC lines resemble more each other with respect to their sialylated N-glycan profiles and are different from differentiated EB and st.3 cell types. The analysis revealed in each cell type the relative proportions of about 50-70 glycan signals that were assigned as acidic N-glycan components. Typically, significant differences in the glycan profiles between cell populations are consistent throughout multiple experiments.
Human fibroblast feeder cell lines. Sialylated N-glycan profiles obtained from human fibroblast feeder cell lines differ from hESC, EB, and st.3 differentiated cells, and that feeder cells grown separately and with hESC cells differ from each other.
Sialylated N-glycan structural features. Sialylated N-glycan groupings proposed for analysed cell types are presented in Table 8. Again, the analysed three major cell types, namely undifferentiated hESCs, differentiated cells, and human fibroblast feeder cells, differ from each other significantly. Within each cell type, however, there are minor differences between individual cell lines. Moreover, differentiation-associated sialylated N-glycan structural features are expressed more strongly in st.3 differentiated cells than in EB cells. Cell-type specific glycosylation features are discussed below in Conclusions.
Conclusions
Comparison of glycan profiles. Differences in the glycan profiles between cell types were consistent throughout multiple samples and experiments, indicating that the present method of glycan profiling and the differences in the present glycan profiles can be used to identify hESCs or cells differentiated therefrom, or other cells such as feeder cells, or to determine their purity, or to identify cell types present in a sample. The present method and the present results can also be used to identify cell-type specific glycan structural features or cell-type specific glycan profiles. The method proved especially useful in determination of differentiation stage, as demonstrated by comparing analysis results between hESC, EB, and st.3 differentiated cells. Furthermore, hESCs were shown to have unique glycosylation profiles, which can be differentiated from differentiated cell types as well as from other stem cell types such as MSCs, indicating that stem cells in general and also specific stem cell types can be identified by the present method. The present method could also detect glycan structures common to hESC lines derived from sibling embryos, indicating that related structural features can be identified in different cell lines or their similarity be estimated by the present method.
Comparison of neutral N-glycan structural features. Differences in glycosylation profiles between analyzed cell types were identified based on proposed structural features, which can be used to identify cell-type specific glycan structural features. Identified cell-type specific features of neutral N-glycan profiles are concluded below:
hESC lines:
EBs and st.3 differentiated cells (st.3 cells expressing the features more strongly):
Human fibroblast feeder cells:
Comparison of sialylated N-glycan structural features. Differences in glycosylation profiles between analyzed cell types were identified based on proposed structural features, which can be used to identify cell-type specific glycan structural features. Identified cell-type specific features of sialylated N-glycan profiles are concluded below:
hESC lines:
EBs and st.3 differentiated cells (st.3 cells expressing the features more strongly):
Human fibroblast feeder cells:
Results
N-glycans were isolated, divided into sialylated and neutral fractions, and analysed by MALDI-TOF mass spectrometry as described in the preceding Examples. Comparison of sialylated N-glycan profiles of human fibroblast feeder cells and mouse fibroblast feeder cells. There are numerous differences in the glycan profiles and it is concluded that human and murine feeder cells differ from each other significantly with respect to their overall glycan profiles as well as many individual glycan signals. The major differences are 2092 and 2238, corresponding to the monosaccharide compositions NeuAc1Hex6HexNAc4 and NeuAc1Hex6HexNAc4dHex1, respectively. These signals correspond to the major sialylated N-glycans that human embryonal stem cells interact with on the cell surfaces of their feeder cells. The present results indicate that the glycan analysis method can be used to study species-specific differences in stem cell to feeder cell interactions.
Methods
Reductive β-elimination. The procedure has been described (Nyman et al., 1998). Briefly, glycoproteins were dissolved in 1 M NaBH4 in 0.1 M NaOH and incubated at 37° C. for two days. Borohydride was destroyed by repeated evaporation from mild acetic acid in methanol. The resulting glycan alditols were purified by solid-phase extraction methods as described above.
Non-reductive β-elimination. The procedure has been described (Huang et al., 2001). Briefly, glycoproteins were dissolved in ammonium carbonate in concentrated ammonia and incubated at 60° C. for two days. The reagents were removed by evaporation and glycosylamines by brief incubation and evaporation from mild aqueous acetic acid. The resulting reducing glycans were purified by solid-phase extraction methods as described above.
Mass spectrometry and data analysis were performed as described in the preceding Examples.
Results and Discussion
O-glycans in cord blood mononuclear cells. O-glycan fraction was isolated by reductive β-elimination from total glycoprotein fractions of cord blood mononuclear cells. The glycan alditols were divided into neutral and acidic fractions and analyzed by MALDI-TOF mass spectrometry as described above. The glycan signals in the present example include both N- and O-glycan alditol signals.
O-glycans in human embryonic stem cells. O-glycans were isolated by non-reductive β-elimination from total glycoprotein fractions of human embryonic stem cells (hESC) grown on mouse feeder cell layers. The glycans were divided into neutral and acidic fractions and analyzed by MALDI-TOF mass spectrometry as described above. The most abundant potential O-glycan signals were Hex1HexNAc2, Hex2HexNAc2, Hex2HexNAc2dHex1, Hex3HexNAc3, Hex3HexNAc3dHex1, NeuAc2Hex1HexNAc1, NeuAc1Hex2HexNAc2, NeuAc1Hex2HexNAc2dHex1, NeuAc2Hex2HexNAc2, NeuAc1Hex3HexNAc3, NeuAc2Hex2HexNAc2dHex1, NeuAc1Hex3HexNAc3, Hex3HexNAc3SP, Hex4HexNAc4SP, and Hex4HexNAc4dHex1SP, wherein SP corresponds to a charged group with a mass of sulphate or phosphate such as sulphate ester linked to an N-acetyllactosamine structure.
N-glycan and soluble glycan fractions were prepared from human cord blood cell populations as described in the preceding Examples. In cord blood mononuclear cells as well as affinity-purified cord blood CD34+, CD34−, CD133−, and LIN+ cell populations, following glycan fragments were identified (approximate experimental m/z for [M−H]− ions in parenthesis): R1 (816), R1HexNAc1 (1019), R2 (1058), R1HexNAc1HexA1 (1195), R2HexA1 (1234), R1HexNAc2HexA1 (1398), R2HexNAc1HexA1 (1437), R1HexNAc2HexA2 (1574), R2HexNAc1HexA2 (1613), R1HexNAc3HexA2 (1777), R2HexNAc2HexA2 (1816), R2HexNAc2HexA3 (1992), and R2HexNAc3HexA3 (2195), wherein R1 is preferentially HexA1Hex2Pen1R3, R2 is preferentially HexA1Hex3Pen1R4, R3 is preferentially SO3Ser1 or HPO3Ser1, R4 is preferentially (SO3)2Ser1, SO3HPO3Ser1, or (HPO3)2Ser1. The identified glycans are indicated as being glycosaminoglycan fragments present in stem cell and mononuclear cell populations in human cord blood.
Experimental Procedures
Cell samples. Human embryonic stem cell (hESC) lines FES 22 and FES 30 (Family Federation of Finland) were propagated on mouse feeder cell (mEF) layers as described above.
FITC-labeled lectins. Fluorescein isotiocyanate (FITC) labeled lectins were purchased from several manufacturers: FITC-GNA, -HHA, -MAA, -PWA, -STA and -LTA were from EY Laboratories (USA); FITC-PSA and -UEA and biotin-labelled WFA were from Sigma (USA); and FITC-RCA, -PNA and -SNA were from Vector Laboratories (UK).
Fluorescence microscopy labeling experiments were conducted essentially as described in the preceding Examples. Biotin label was visualized by fluorescein-conjugated streptavidin.
Results
Table 22 shows the tested FITC-labelled lectins and antibodies, examples of their target saccharide sequences, and the graded lectin binding intensities as described in the Table legend, in fluorescence microscopy of fixed cells grown on microscopy slides. Multiple binding specificities for the used lectins are described in the art and in general the binding of a lectin in the present experiments means that the cells express specific ligands for the lectin on their surface, but does not exclude the presence of also other ligands that are recognized by the lectin. See Example 18 for specificities for GF antibodies.
α-linked mannose. Abundant labelling of mEF by Pisum sativum (PSA) lectins suggests that they express mannose, more specifically α-linked mannose residues on their surface glycoconjugates such as N-glycans. The results further suggest that the both hESC lines do not express these ligands at as high concentrations as mEF on their surface.
β-linked galactose. Abundant labelling of hESC by peanut lectin (PNA) and less intense labelling by Ricinus communis lectin I (RCA-I) suggests that hESC express β-linked non-reducing terminal galactose residues on their surface glycoconjugates such as N- and/or O-glycans. More specifically, RCA-I binding suggests that the cells contain high amounts of unsubstituted Galβ epitopes on their surface. PNA binding suggests for the presence of unsubstituted Galβ, and the absence of specific binding of PNA to mEF suggests that the binding epitopes for this lectin are less abundant in mEF.
Sialic acids. Specific labelling of hESC by both Maackia amurensis (MAA) and Sambucus nigra (SNA) lectins suggests that the cells express sialic acid residues on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. More specifically, the specific MAA binding of hESC suggests that the cells contain high amounts of α2,3-linked sialic acid residues. In contrast, the results suggest that these epitopes are less abundant in mEF. SNA binding in both cell types suggests for the presence of also α2,6-linkages in the sialic acid residues on the cell surface.
Poly-N-acetyllactosamine sequences. Labelling of the cells by pokeweed (PWA) and less intense labelling by Solanum tuberosum (STA) lectins suggests that the cells express poly-N-acetyllactosamine sequences on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. The results further suggest that cell surface poly-N-acetyllactosamine chains contain both linear and branched sequences.
β-linked N-acetylgalactosamine. Abundant labelling of hESC by Wisteria floribunda lectin (WFA) suggests that hESC express β-linked non-reducing terminal N-acetylgalactosamine residues on their surface glycoconjugates such as N- and/or O-glycans. The absence of specific binding of WFA to mEF suggests that the lectin ligand epitopes are less abundant in mEF.
Fucosylation. Labelling of the cells by Ulex europaeus (UEA) and less intense labelling by Lotus tetragonolobus (LTA) lectins suggests that the cells express fucose residues on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. More specifically, the UEA binding suggests that the cells contain α-linked fucose residues including α1,2-linked fucose residues. LTA binding suggests for the presence of α-linked fucose residues including α1,3- or α1,4-linked fucose residues on the cell surface.
The specific antibody anti-Lex and anti-sLex antibody binding results indicate that the hESC samples contain Galβ4(Fucα3)GlcNAcβ and SAα3Galβ4(Fucα3)GlcNAcβ carbohydrate epitopes on their surface, respectively.
Taken together, in the present experiments the lectins PNA, MAA, and WFA as well as the antibodies anti-Lex and anti-sLex bound specifically to hESC but not to mEF. In contrast, the lectin PSA bound specifically to mEF but not to hESC. This suggests that the glycan epitopes that these reagents recognize have hESC or mEF specific expression patterns. On the other hand, other reagents in the tested reagent panel bound differentially to the two hESC lines FES 22 and FES 30, indicating cell line specific glycosylation of the hESC cell surfaces (Table 22).
Discussion
Venable, A., et al. (2005 BMC Dev. Biol.) have previously described lectin binding profiles of SSEA-4 enriched human embryonic stem cells (hESC) grown on mouse feeder cells. The lectins used were Lycopersicon esculentum (LEA, TL), RCA, Concanavalin A (ConA), WFA, PNA, SNA, Hippeastrum hybrid (HHA, HHL), Vicia villosa (VVA), UEA, Phaseolus vulganis (PHA-L and PHA-E), MAA, LTA (LTL), and Dolichos biflorus (DBA) lectins. In FACS and cytochemistry analysis, four lectins were found to have similar binding percentage as SSEA-4 (LEA, RCA, ConA, and WFA) and in addition two lectins also had high binding percentage (PNA and SNA). Two lectins did not bind to hESCs (DBA and LTA). Six lectins were found to partially bind to hESC (PHA-E, VVA, UEA, PHA-L, MAA, and HHA). The authors suggested that the differential lectin binding specificities can be used to distinguish hESC and differentiated hESC types based on carbohydrate presentation.
Venable et al. (2005) discuss some carbohydrate structures that they claim to have high expression on the surface of pluripotent SSEA-4 hESC (corresponding lectins according to Venable et al. in parenthesis): α-Man (ConA, HHA), Glc (ConA), Galβ3GalNAcβ (PNA), non-reducing terminal Gal (RCA), non-reducing terminal β-GalNAc (RCA), GalNAcβ4Gal (WFA), GlcNAc (LEA), and SAα6GalNAc (SNA). In addition, Venable et al. discuss some carbohydrate structures that they claim to have expression on surface of a proportion of pluripotent SSEA-4 hESC (corresponding lectins according to Venable et al. in parenthesis): Gal (PHA-L, PHA-E, MAA), GalNAc (VVA) and Fuc (UEA). However, based on the monosaccharide specificities oligosaccharide specifificities on the target cannot be known e.g. ConA is not easily assigned to any specific to Glc or Man-structure and our MAA has no specificity to Gal residues, but SAα3-strcutures; it is realized that large differencies exist between often numerous isolectins of a plant species and Venable did not disclose the exact lectins used. Technical problems avoiding exact interpretation is Background section.
In the present experiments, RCA binding was observed on both hESC line FES 22 and mEF, but not on FES 30. This suggests that RCA binding specificity in hESC varies from cell line to another. The present experiments also show other lectins to be expressed on only one out of the two hESC lines (Table 22), suggesting that there is individual variation in binding of some lectins.
Based on LTA not binding to hESC in their experiments, Venable et al. (2005) suggest that on hESC surface there are no non-modified fucose residues that are α-linked to GlcNAc. However, in the present experiments LTA as well as anti-Lex and anti-sLex monoclonal antibodies were found to bind to the hESC line FES 22. The present antibody binding results indicate that FucαGlcNAc epitopes, specifically Galβ4(Fucα3)GlcNAc sequences, are present on hESC surface.
Venable et al. (2005) describe that PNA recognizes in their hESC samples specifically Galβ3GalNAc structures, wherein the GalNAcresidue is β-linked. In the present experiments, PNA was used to recognize carbohydrate structures generally including β-linked galactose residues and without β-linkage requirement for the GalNAc residue.
Venable et al. (2005) describe that SNA recognizes in their hESC samples specifically SAα6GalNAc structures. In the present experiments, SNA was used to recognize α2,6-linked sialic acids in general and its ligands were also found on mEF.
Inhibition of MAA binding by 200 mM lactose in the experiments described by Venable et al. (2005) suggests non-specific binding of their MAA with respect to sialic acids. According to the present experiments, our MAA can recognize α2,3-linked sialic acid residues on hESC surface and differentiate between hESC and mEF.
Experimental Procedures
Cell samples. Bone marrow derived human mesenchymal stem cell lines (MSC) were generated and cultured in proliferation medium as described above.
FITC-labeled lectins. Fluorescein isotiocyanate (FITC) labelled lectins were purchased from several manufacturers: FITC-GNA, -HHA, -MAA, -PWA, -STA and -LTA were from EY Laboratories (USA); FITC-PSA and -UEA were from Sigma (USA); and FITC-RCA, -PNA and -SNA were from Vector Laboratories (UK). Lectins were used in dilution of 5 μg/105 cells in 1% human serum albumin (HSA; FRC Blood Service, Finland) in phosphate buffered saline (PBS).
Flow cytometry. Flow cytometric analysis of lectin binding was used to study the cell surface carbohydrate expression of MSC. 90% confluent MSC layers on passages 9-11 were washed with PBS and harvested into single cell suspensions by 0.25% trypsin-1 mM EDTA solution (Gibco). Detached cells were centrifuged at 600 g for five minutes at room temperature. Cell pellet was washed twice with 1% HSA-PBS, centrifuged at 600 g and resuspended in 1% HSA-PBS. Cells were placed in conical tubes in aliquots of 70000-83000 cells each. Cell aliquots were incubated with one of the FITC labelled lectin for 20 minutes at room temperature. After incubation cells were washed with 1% HSA-PBS, centrifuged and resuspended in 1% HSA-PBS. Untreated cells were used as controls. Lectin binding was detected by flow cytometry (FACSCalibur, Becton Dickinson). Data analysis was made with Windows Multi Document Interface for Flow Cytometry (WinMDI 2.8). Two independent experiments were carried out.
Fluorescence microscopy labeling experiments were conducted as described in the preceding Examples.
Results and Discussion
Table 23 shows the tested FITC-labelled lectins, examples of their target saccharide sequences, and the amount of cells showing positive lectin binding (%) in FACS analysis after mild trypsin treatment. Table 24 shows the tested FITC-labelled lectins, examples of their target saccharide sequences, and the graded lectin binding intensities as described in the Table legend, in fluorescence microscopy of fixed cells grown on microscopy slides. Binding specificities of the used lectins are described in the art and in general the binding of a lectin in the present experiments means that the cells express specific ligands for the lectin on their surface. The examples of some of the specificities discussed below and those marked in the Tables are therefore non-exclusive in nature.
α-linked mannose. Abundant labelling of the cells by both Hippeastrum hybrid (HHA) and Pisum sativum (PSA) lectins suggests that they express mannose, more specifically α-linked mannose residues on their surface glycoconjugates such as N-glycans. Possible α-mannose linkages include α1→2, α1→3, and α1→6. The lower binding of Galanthus nivalis (GNA) lectin suggests that some α-mannose linkages on the cell surface are more prevalent than others.
β-linked galactose. Abundant labelling of the cells by Ricinus communis lectin I (RCA-I) and less intense labelling by peanut lectin (PNA) suggests that the cells express β-linked non-reducing terminal galactose residues on their surface glycoconjugates such as N- and/or O-glycans. More specifically, the intense RCA-I binding suggests that the cells contain high amounts of unsubstituted Galβ epitopes on their surface. The binding of RCA-I was increased by sialidase treatment of the cells before lectin binding, indicating that the ligands of RCA-I on MSC were originally partly covered by sialic acid residues. PNA binding suggests for the presence of another type of unsubstituted Galβ epitopes such as Core 1 O-glycan epitopes on the cell surface. The binding of PNA was also increased by sialidase treatment of the cells before lectin binding, indicating that the ligands of PNA on MSC were originally mostly covered by sialic acid residues. These results suggest that both RCA-I and PNA can be used to assess the amount of their specific ligands on the cell surface of BM MSC, and with or without conjunction with sialidase treatment to assess the sialylation level of their specific epitopes.
Sialic acids. Abundant labelling of the cells by Maackia amurensis (MAA) and less intense labelling by Sambucus nigra (SNA) lectins suggests that the cells express sialic acid residues on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. More specifically, the intense MAA binding suggests that the cells contain high amounts of α2,3-linked sialic acid residues on their surface. SNA binding suggests for the presence of also α2,6-linked sialic acid residues on the cell surface, however in lower amounts than α2,3-linked sialic acids. Both of these lectin binding activities could be reduced by sialidase treatment, indicating that the specificities of the lectins in BM MSC are mostly targeted to sialic acids.
Poly-N-acetyllactosamine sequences. Labelling of the cells by Solanum tuberosum (STA) and less intense labelling by pokeweed (PWA) lectins suggests that the cells express poly-N-acetyllactosamine sequences on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. Higher intensity labelling with STA than with PWA suggests that most of the cell surface poly-N-acetyllactosamine sequences are linear and not branched or substituted chains.
Fucosylation. Labelling of the cells by Ulex europaeus (UEA) and less intense labelling by Lotus tetragonolobus (LTA) lectins suggests that the cells express fucose residues on their surface glycoconjugates such as N- and/or O-glycans and/or glycolipids. More specifically, the UEA binding suggests that the cells contain α-linked fucose residues, including α1,2-linked fucose residues, on their surface. LTA binding suggests for the presence of also α-linked fucose residues, including α1,3-linked fucose residues on the cell surface, however in lower amounts than UEA ligand fucose residues.
Mannose-binding lectin labelling. Low labelling intensity was also detected with human serum mannose-binding lectin (MBL) coupled to fluorescein label, suggesting that ligands for this innate immunity system component may be expressed on in vitro cultured BM MSC cell surface.
Binding of a NeuGc polymeric probe (Lectinity Ltd., Russia) to non-fixed hESC indicates the presence of NeuGc-specific lectin on the cell surfaces. In contrast, polymeric NeuAc probe did not bind to the cells with same intensity in the present experiments.
The binding of the specific antibodies to hESC indicates the presence of Lex and sialyl-Lewis x epitopes on their surfaces, and binding of NeuGc-specific antibody to hESC indicates the presence of NeuGc epitopes on their surfaces.
Results and Discussion
Experimental Procedures
Cell and glycan samples were prepared as described in the preceding Examples.
Relative proportions of neutral and acidic N-glycan fractions were studied by desialylating isolated acidic glycan fraction with A. ureafaciens sialidase as described in the preceding Examples and then combining the desialylated glycans with neutral glycans isolated from the same sample. Then the combined glycan fractions were analyzed by positive ion mode MALDI-TOF mass spectrometry as described in the preceding Examples. The proportion of sialylated N-glycans of the combined N-glycans was calculated by calculating the percentual decrease in the relative intensity of neutral N-glycans in the combined N-glycan fraction compared to the original neutral N-glycan fraction, according to the equation:
wherein Ineutral and Icombined correspond to the sum of relative intensities of the five high-mannose type N-glycan [M+Na]+ ion signals at m/z 1257, 1419, 1581, 1743, and 1905 in the neutral and combined N-glycan fractions, respectively.
Results and Discussion
The relative proportions of acidic N-glycan fractions in studied stem cell types were as follows: in human embryonic stem cells (hESC) approximately 35% (proportion of sialylated and neutral N-glycans is approximately 1:2), in human bone marrow derived mesenchymal stem cells (BM MSC) approximately 19% (proportion of sialylated and neutral N-glycans is approximately 1:4), in osteoblast-differentiated BM MSC approximately 28% (proportion of sialylated and neutral N-glycans is approximately 1:3), and in human cord blood (CB) CD133+ cells approximately 38%
(proportion of sialylated and neutral N-glycans is approximately 2:3).
In conclusion, BM MSC differ from hESC and CB CD133+ cells in that they contain significantly lower amounts of sialylated N-glycans compared to neutral N-glycans. However, after osteoblast differentiation of the BM MSC the proportion of sialylated N-glycans increases.
Experimental Procedures
Human embryonic stem cell lines (hESC). Four Finnish hESC lines, FES 21, FES 22, FES 29, and FES 30, were used in the present study. Generation of the lines has been described (Skottman et al., 2005, and M. M., C. O., T. T., and T. O., manuscript submitted for publication). Two of the analysed cell lines in the present work were initially derived and cultured on mouse embryonic fibroblast feeders, and two on human foreskin fibroblast feeder cells. For the mass spectrometry studies all of the lines were transferred on HFF feeder cells treated with mitomycin-C (1 μg/ml, Sigma-Aldrich, USA) and cultured in serum-free medium (Knockout™ D-MEM; Gibco® Cell culture systems, Invitrogen, UK) supplemented with 2 mM L-Glutamin/Penicillin streptomycin (Sigma-Aldrich), 20% Knockout Serum Replacement (Gibco), 1×non-essential amino acids (Gibco), 0.1 mM β-mercaptoethanol (Gibco), 1×ITS (Sigma-Aldrich) and 4 ng/ml bFGF (Sigma/Invitrogen). To induce the formation of embryoid bodies (EB) the hESC colonies were first allowed to grow for 10-14 days whereafter the colonies were cut in small pieces and transferred on non-adherent Petri dishes to form suspension cultures. The formed EBs were cultured in suspension for the next 10 days in standard culture medium (see above) without bFGF. For further differentiation (into stage 3 differentiated cells) EBs were transferred onto gelatin-coated (Sigma-Aldrich) adherent culture dishes in media consisting of DMEM/F12 mixture (Gibco) supplemented with ITS, Fibronectin (Sigma), L-glutamine and antibiotics. The attached cells were cultured for 10 days whereafter they were harvested. For glycan analysis, the cells were collected mechanically, washed, and stored frozen until the analysis. In FACS analyses 70-90% of cells from mechanically isolated hESC colonies were typically Tra 1-60 and Tra 1-81 positive (not shown). Cells differentiated into embryoid bodies (EB) and further differentiated cells grown out of the EB as monolayers (stage 3 differentiated) were used for comparison against hESC. The differentiation protocol favors the development of neuroepithelial cells while not directing the differentiation into distinct terminally differentiated cell types (Okabe et al., 1996). Stage 3 cultures consisted of a heterogenous population of cells dominated by fibroblastoid and neuronal morphologies.
Glycan isolation. Asparagine-linked glycans were detached from cellular glycoproteins by F. meningosepticum N-glycosidase F digestion (Calbiochem, USA) essentially as described (Nyman et al., 1998). The detached glycans were divided into sialylated and non-sialylated fractions based on the negative charge of sialic acid residues. Cellular contaminations were removed by precipitating the glycans with 80-90% (v/v) aqueous acetone at −20° C. and extracting them with 60% (v/v) ice-cold methanol essentially as described previously (Verostek et al., 2000). The glycans were then passed in water through C18 silica resin (BondElut, Varian, USA) and adsorbed to porous graphitized carbon (Carbograph, Alltech, USA) based on previous method (Davies et al., 1993). The carbon column was washed with water, then the neutral glycans were eluted with 25% acetonitrile in water (v/v) and the sialylated glycans with 0.05% (v/v) trifluoroacetic acid in 25% acetonitrile in water (v/v). Both glycan fractions were additionally passed in water through strong cation-exchange resin (Bio-Rad, USA) and C18 silica resin (ZipTip, Millipore, USA). The sialylated glycans were further purified by adsorbing them to microcrystalline cellulose in n-butanol:ethanol:water (10:1:2, v/v), washing with the same solvent, and eluting by 50% ethanol:water (v/v). All the above steps were performed on miniaturized chromatography columns and small elution and handling volumes were used. The glycan analysis method was validated by subjecting human cell samples to analysis by five different persons. The results were highly comparable, especially by the terms of detection of individual glycan signals and their relative signal intensities, showing that the reliability of the present methods is suitable for comparing analysis results from different cell types.
Mass spectrometry and data analysis. MALDI-TOF mass spectrometry was performed with a Bruker Ultraflex TOF/TOF instrument (Bruker, Germany) essentially as described (Saarinen et al., 1999). Relative molar abundancies of both neutral and sialylated glycan components can be accurately assigned based on their relative signal intensities in the mass spectra (Naven and Harvey, 1996; Papac et al., 1996; Saarinen et al., 1999; Harvey, 1993). Each step of the mass spectrometric analysis methods were controlled for their reproducibility by mixtures of synthetic glycans or glycan mixtures extracted from human cells. The mass spectrometric raw data was transformed into the present glycan profiles by carefully removing the effect of isotopic pattern overlapping, multiple alkali metal adduct signals, products of elimination of water from the reducing oligosaccharides, and other interfering mass spectrometric signals not arising from the original glycans in the sample. The resulting glycan signals in the presented glycan profiles were normalized to 100% to allow comparison between samples. Quantitative difference between two glycan profiles (%) was calculated according to the equation:
wherein p is the relative abundance (%) of glycan signal i in profile a or b, and n is the total number of glycan signals.
Glycosidase analysis. The neutral N-glycan fraction was subjected to digestion with Jack bean α-mannosidase (Canavalia ensiformis; Sigma, USA) essentially as described (Saarinen et al., 1999). The specificity of the enzyme was controlled with glycans isolated from human tissues as well as purified otigosaccharides.
NMR methods. For NMR analysis, larger amounts of hESC were grown on mouse feeder cell (MEF) layers. The purity of the collected hESC sample (about 70%), was lower than in the mass spectrometry samples grown on HFF. However, the same H5-9N2 glycans were the major neutral N-glycan signals in both MEF and hESC. The isolated glycans were further purified for the analysis by gel filtration high-pressure liquid chromatography in a column of Superdex peptide HR 10/30 (Amersham), with water (neutral glycans) or 50 mM NH4HCO3 (sialylated glycans) as the eluant at a flow rate of 1 ml/min. The eluant was monitored at 214 nm, and oligosaccharides were quantified against external standards. The amount of N-glycans in NMR analysis was below five nanomoles.
Statistical procedures. Glycan score distributions of all three differentiation stages (hESC, EB, and st.3) were analyzed by the Kruskal-Wallis test. Pairwise comparisons were performed by the 2-tailed Student's t-test with Welch's approximation and 2-tailed Mann-Whitney U test. A p value less than 0.05 was considered significant.
Lectin staining. Fluorescein-labeled lectins were from EY Laboratories (USA) and the stainings were performed essentially after manufacturer's instructions. The specificity of the staining was controlled in parallel experiments by inhibiting lectin binding with specific oligo- and monosaccharides.
Results
Mass Spectrometric Profiling of the hESC N-glycome
In order to generate glycan profiles of hESC, embryonic bodies, and further differentiated cells, a MALDI-TOF mass spectrometry based analysis was performed. We focused on the most common type of protein post-translational modifications, the asparagine-linked glycans (N-glycans), which were enzymatically released from cellular glycoproteins. During glycan isolation and purification, the total N-glycan pool was separated by an ion-exchange step into neutral N-glycans and sialylated N-glycans. These two glycan fractions were then analyzed separately by mass spectrometric profiling (
The proposed monosaccharide compositions corresponding to the detected masses of each individual signal in
In most of the previous glycomic studies of other mammalian tissues the isolated glycans have been derivatized (permethylated) prior to mass spectrometric profiling (Sutton-Smith et al., 2002; Dell and Morris, 2001; Consortium for Functional Glycomics, http://www.functionalglycomics.org) or chromatographic separation (Callewaert et al., 2004). However, in the present study we chose to directly analyze picomolar quantities of unmodified glycans and increased sensitivity was attained by omitting the derivatization and the subsequent additional purification steps. Further, instead of studying the glycan signals one at a time, we were able to simultaneously study all the glycans present in the unmodified glycomes by nuclear magnetic resonance spectroscopy (NMR) and specific glycosidase enzymes. The present data demonstrate that mass spectrometric profiling can be used in the quantitative analysis of total glycomes, especially to pin-point the major glycosylation differences between related samples.
Overview of the hESC N-glycome: Neutral N-glycans
Neutral N-glycans comprised approximately two thirds of the combined neutral and sialylated N-glycan pools. The 50 most abundant neutral N-glycan signals of the hESC lines are presented in
Sialylated N-glycans
All N-glycan signals in the sialylated N-glycan fraction (
Importantly, we were able to detect N-glycans containing N-glycolylneuraminic acid (G), for example glycans G1H5N4, G1S1H5N4, and G2H5N4, in the hESC samples. N-glycolylneuraminic acid has previously been reported in hESC as an antigen transferred from culture media containing animal-derived materials (Martin et al., 2005). Accordingly, the serum replacement medium used in the present experiments contained bovine serum proteins.
Variation Between Individual Cell Lines
Although the four hESC lines shared the same overall N-glycan profile, there was cell line specific variation within the profiles. Individual glycan signals unique to each cell line were detected, indicating that every cell line was slightly different from each other with respect to the approximately one hundred most abundant N-glycan structures they synthesized.
In general, the 30 most common N-glycan signals in cach hESC fine accounted for circa 85% of the total detected N-glycans, and represent a useful approximation of the hESC N-glycome. In other words, more than five out of six glycoprotein molecules isolated from any of the present hESC lines would carry such N-glycan structures.
Transformation of the N-Glycome During hESC Differentiation
A major goal of the present study was to identify glycan structures that would be specific to either stem cells or differentiated cells, and could therefore serve as differentiation stage markers. In order to determine whether the hESC N-glycome undergoes changes during differentiation, the N-glycan profiles obtained from hESC, EB, and stage 3 differentiated cells were compared (
Taken together, differentiation induced the appearance of new N-glycan types while earlier glycan types disappeared. Further, we found that the major hESC-specific N-glycosylation features were not expressed as discrete glycan signals, but instead as glycan signal groups that were characterized by a specific monosaccharide composition feature (see below). In other words, differentiation of hESC into EB induced the disappearance of not only one but multiple glycan signals with hESC-associated features, and simultaneously also the appearance of glycan signal groups with other features associated with the differentiated cell types.
The N-glycan profiles of the differentiated cells were also quantitatively different from the undifferentiated hESC profiles. A practical way of quantifying the differences between individual glycan profiles is to calculate the sum of the signal intensity differences between two cell profiles (see Methods). According to this method, the EB neutral and sialylated N-glycan profiles had undergone a quantitative change of 14% and 29% from the hESC profiles, respectively. Similarly, the stage 3 differentiated cell neutral and sialylated N-glycan profiles had changed by 15% and 43% from the hESC profiles, respectively. This indicates that upon differentiation of hESC into stage 3 differentiated cells, nearly half of the total sialylated N-glycans present in the cells were transformed into different molecular structures, while significantly smaller proportion of the neutral N-glycan molecules were changed during the differentiation process. Taking into account that the proportion of sialylated to neutral N-glycans in hESC was approximately 1:2, the total N-glycome change was approximately 25% during the transition from hESC to stage 3 differentiated cells. Again, the N-glycan profile of EB appeared to lie between hESC and stage 3 differentiated cells.
The data indicated that the hESC N-glycome consisted of two discrete parts regarding propensity to change during hESC differentiation—a constant part of circa 75% and a changing part of circa 25%. In order to characterize the associated N-glycan structures, and to identify the potential biological roles of the constant and changing parts of the N-glycome, we performed structural analyses of the isolated hESC N-glycan samples.
Structural Analyses of the Major hESC N-glycans: Preliminary Structure Assignment Based on Monosaccharide Compositions
Human N-glycans can be divided into the major biosynthetic groups of high-mannose type, hybrid-type, and complex-type N-glycans. To determine the presence of these N-glycan groups in hESC and their progeny, assignment of probable structures matching the monosaccharide compositions of each individual signal was performed utilizing the established pathways of human N-glycan biosynthesis (Kornfeld and Kornfeld, 1985; Schachter, 1991). Here, the detected N-glycan signals were classified into four N-glycan groups according to the number of N and H residues: 1) high-mannose type and 2) low-mannose type N-glycans, which are both characterized by two N residues (N=2), 3) hybrid-type or monoantennary N-glycans, which are classified by three N residues (N=3), and 4) complex-type N-glycans, which are characterized by four or more N residues (N4) in their proposed monosaccharide compositions. This is an approximation: for example, in addition to complex-type N-glycans also hybrid-type and monoantennary N-glycans may contain more than three N residues.
The data was analyzed quantitatively by calculating the percentage of glycan signals in the total N-glycome belonging to each structure group (Table 25, rows A-E and J-L). The quantitative changes in the structural groups reflect the relative activities of different biosynthetic pathways in each cell type. For example, the proportion of hybrid-type or monoantennary N-glycans was increased when hESC differentiated into EB. In general, the relative proportions of most glycan structure classes remained approximately constant through the hESC differentiation process, which indicated that both hESC and the differentiated cell types were capable of equally sophisticated N-glycosylation. The high proportion of N-glycans classified as low-mannose N-glycans in all the studied cell types was somewhat surprising in the light of earlier published studies of human N-glycosylation. However, previous studies had not explored the total N-glycan profiles of living cells. We have detected significant amounts of low-mannose N-glycans also in other human cells and tissues, and they are not specific to hESC (T. S., A. H., M. B., A. O., J. H., J. N, J. S. et al., unpublished results).
Verification of Structure Assignments by Enzymatic Degradation and Nuclear Magnetic Resonance Spectroscopy
In order to verify the validity of the glycan structure assignments made based on the detected mass and the probable monosaccharide compositions we performed enzymatic degradation and proton nuclear magnetic resonance spectroscopic analyses (1H-NMR) of selected neutral and sialylated N-glycans.
For the validation of neutral N-glycans we chose glycans with 5-9 hexose (H) and two N-acetylhexosamine (N) residues in their monosaccharide compositions (H5N2, H6N2, H7N2, H8N2, and H9N2) which were the most abundant N-glycans in all studied cell types (
The neutral N-glycan fraction was further analyzed by nanoscale proton nuclear magnetic resonance spectroscopic analysis (1H-NMR). In the obtained 1H-NMR spectrum of the hESC neutral N-glycans signals consistent with high-mannose type N-glycans were detected, supporting the conclusion that they were the major glycan components in the sample.
Both α-mannosidase and NMR experiments indicated that the H5-9N2 glycan signals corresponded to high-mannose type N-glycans. From the data in
For the validation of structure assignments among the sialylated N-glycans we noted that the majority of the sialylated N-glycan signals isolated from hESC were characterized by the N≧4 monosaccharide composition (
Differentiation Stage Associated Structural Glycosylation Features
The glycan signal classification described above indicated changes in the core sequences of N-glycans. The present data also suggested that there were differences in variable epitopes added to the N-glycan core structures i.e. glycan features present in many individual glycan signals. In order to quantify such glycan structural features, the N-glycome data were further classified into glycan signal groups that share similar features in their proposed monosaccharide compositions (Table 25, rows F-I and M-P). As a result, the majority of the differentiation-associated glycan signals in the EB and stage 3 differentiated cell samples fell into different groups than the hESC specific glycans. Glycan signals with complex fucosylation (Table 25, row N) were associated with undifferentiated hESC, whereas glycan signals with potential terminal N-acetylhexosamine (Table 25, rows H and P) were associated with the differentiated cells.
Complex Fucosylation of N-glycans is Characteristic of hESC
Differentiation stage associated changes in the sialylated N-glycan profile were more drastic than in the neutral N-glycan fraction and the group of five most abundant sialylated N-glycan signals was different at every differentiation stage (
The most common fucosylation type in human N-glycans is α1,6-fucosylation of the N-glycan core structure. The NMR analysis of the sialylated N-glycan fraction of hESC also revealed α1,6-fucosylation of the N-glycan core as the most abundant type of fucosylation. In the N-glycans containing more than one fucose residue, there must have been other fucose linkages in addition to the α1,6-linkage (Staudacher et al., 1999). The F≧2 structural feature decreased as the cells differentiated, indicating that complex fucosylation was characteristic of undifferentiated hESC.
N-glycans with Terminal N-acetylhexosamine Residues Become more Common with Differentiation
A group of N-glycan signals which increased during differentiation contained equal amounts of N-acetylhexosamine and hexose residues (N═H) in their monosaccharide composition, e.g. S1H5N5F1. This was consistent with structures containing non-reducing terminal N-acetylhexosamine residues. Usually N-glycan core structures contain more hexose than N-acetylhexosamine residues. However, if complex-type N-glycans contain terminal N-acetylhexosamine residues that are not capped by hexoses, their monosaccharide compositions change to either the N═H or the N>H. EB and stage 3 differentiated cells showed increased amounts of potential terminal N-acetylhexosamine structures, of which the N═H structural feature was increased in both neutral and sialylated N-glycan pools (Table 25, rows I and P), whereas the N>H structural feature was elevated in the neutral N-glycan pool, but decreased in the sialylated N-glycan pool during differentiation (Table 25, rows H and O).
Glycome Profiling can Identify the Differentiation Stage of hESC
The analysis of glycome profiles indicated that the studied hESC lines and differentiated cells had differentiation stage specific N-glycan features. However, the data also demonstrated that N-glycan profiles of the individual hESC lines were different from each other and in particular the hESC line FES 22 was different from the other three stem cell lines (Table 25, rows C and I). To test whether the obtained N-glycan profiles could be used to generate an algorithm that would discriminate between hESC and differentiated cells even taking into account cell line specific variation, an analysis was performed using the data of Table 25. The hESC line FES 29 and embryoid bodies derived from it (EB 29) were selected as the training group for the calculation. The algorithm glycan score (equation 1) was defined as the sum of those structural features that were at least two times greater in FES 29 than in EB 29 (row N in Table 25), from which the sum of the structural feature percentages that were at least two times greater in EB 29 than in FES 29 was subtracted (rows C, I, J, and P in Table 25):
glycan score=N−(C+I+J+P), (1)
wherein the letters refer to the row numbering of Table 25.
The Identified hESC Glycans can be Targeted at the Cell Surface
From a practical perspective stem cell research would be best served by the identification of target structures on cell surface. To investigate whether individual glycan structures we had identified would be accessible to reagents targeting them at the cell surface we performed lectin labelling of two candidate structure types. Lectins are proteins that recognize glycans with specificity to certain glycan structures also in hESC (Venable et al., 2005). To study the localization of glycan components in hESC, stem cell colonies grown on mouse feeder cell layers were labeled in vitro by fluorescein-labelled lectins (
Comparative Analysis of the N-glycome
Although the N-glycan profiles of the four hESC lines share a similar overall profile shape, there was cell line specific variation in the N-glycan profiles. Individual glycan signals unique to each cell line were found, indicating that every cell line was slightly different from each other with respect to the approximately one hundred most abundant glycan structures they synthesize. This is represented in 0.34a as Venn diagrams combining all the detected glycan signals from both the neutral and the acidic N-glycan fractions. FES 29 and FES 30 were derived from sibling embryos, but their N-glycan profiles did not resemble each other more than they resembled FES 21 in the Venn diagram. Furthermore, FES 30 that has the karyotype XX did not differ significantly from the three XY hESC lines.
In order to determine whether the hESC N-glycome undergoes changes during differentiation, N-glycan profiles obtained from hESC, EB, and stage 3 differentiated cells were compared (
Discussion
In the present study, novel mass spectrometric methods were applied to the first structural analysis of human embryonic stem cell N-glycan profiles. Previously, such investigation of whole cell glycosylation has not been feasible due to the lack of methods with sufficiently high sensitivity to analyze the scarce stem cells. The present method was validated for samples of approximately 100 000 cells and the glycan profiles of the analyzed cell types were consistent throughout multiple samples. The objective in the use of the present method was to provide a global view on the glycome profile, or a “fingerprint” of hESC glycosylation, rather than to present the stem cell glycome in terms of the molecular structures of each glycan component. However, changes observed in the N-glycan profiles provide vast amount of information regarding hESC glycosylation and its changes during differentiation, and allows rational design of detailed structural studies of selected glycan components or glycan groups.
The results indicate that a defined group of N-glycan signals dominate the hESC N-glycome and form a unique stem cell glycan profile. It seems that specific monosaccharide compositions were favored over the possible alternatives by the hESC N-glycan biosynthetic machinery. For example, the fifteen most abundant neutral N-glycan signals and fifteen most abundant sialylated N-glycan signals in hESC together comprised over 85% of the N-glycome. Further, different glycan structures were favored during the differentiation of the cells. This suggests that N-glycan biosynthesis in hESC is a controlled and predetermined process. As hundreds of genes, consisting of up to 1% of the human genome, are involved in glycan biosynthesis (Haltiwanger and Lowe, 2004), a future challenge is to characterize the regulatory processes that control hESC glycosylation during differentiation into specialized cell types.
Based on our results the hESC N-glycome seems to contain both a constant part consisting of “housekeeping glycans”, and a changeable part that was altered when the hESC differentiated (
Our data show that the differentiation-associated change in the N-glycome was generated by addition of variable epitopes on similar N-glycan core compositions. For example, the present lectin staining experiments demonstrated that sialylated glycans were abundant on the cell surface of hESC, indicating that they are potential targets for development of more specific recognition reagents. In contrast, the constantly expressed mannosylated glycans were found to reside mainly inside the cells. It seems plausible that knowledge of the changing surface glycan epitopes could be utilized as a basis in developing reagents and culture systems that would allow improved identification, selection, manipulation, and culture of hESC and their progeny. We are currently characterizing the stem cell specific glycosylation changes at the level of individual molecular structures.
The specific cellular glycan structures perform their functions mainly by 1) acting as ligands for specific glycan receptors (Kilpatrick, 2002; Zanetta and Vergoten, 2003), 2) functioning as structural elements of the cell (Imperiali and O'Connor, 1999), and 3) modulating the activity of their carrier proteins and lipids (Varki, 1993;). More than half of all proteins are glycosylated. Consequently, a global change in protein-linked glycan biosynthesis can simultaneously modulate the properties of multiple proteins. It is likely that the large changes in N-glycans during hESC differentiation have major influences on a number of cellular signaling cascades and affect in profound fashion biological processes within the cells. Our data may provide insight into the regulation of some of these processes.
The major hESC specific glycosylation feature we identified was the presence of more than one deoxyhexose residue in N-glycans, indicating complex fucosylation. Fucosylation is known to be important in cell adhesion and signalling events (Becker and Lowe, 2003) as well as essential for embryonic development Knock-out of the N-glycan core α1,6-fucosyltransferase gene FUT8 leads to postnatal lethality in mice (Wang et al., 2005), and mice completely deficient in fucosylated glycan biosynthesis do not survive past early embryonic development (Smith et al., 2002). Fucosylation defects in humans cause a disease known as leukocyte adhesion deficiency (LAD; Luhn et al., 2001).
Fucosylated glycans such as the SSEA-1 antigen have previously been associated with both mouse embryonic stem cells (mESC) and human embryonic carcinoma cells (EC; Muramatsu and Muramatsu, 2004), but not with hESC. In addition, structurally related Lex oligosaecharides are able to inhibit embryonic compaction (Fenderson et al., 1984), suggesting that fucosylated glycans arc directly involved in cell-to-cell contacts during embryonic development The α1,3-fucosyltransferase genes indicated in the synthesis of the embryonic Lex and SSEA-1 antigens arc FUT4 and FU79 (Nakayama et al., 2001; Kudo et al., 2004). Interestingly, the published gene expression profiles for the same hESC lines as studied here (Skottman et al., 2005) have demonstrated that three human fucosyltransferase genes, FUT1, FUT4, and FUT8 are expressed in hESC, and that FUT1 and FUT4 are overexpressed in hESC when compared to EB. The known specificities of these fucosyltransferases (Mollicone et al., 1995) correlate with our findings of simple fucosylation in EB and complex fucosylation in hESC (
New N-glycan forms emerged in EB and stage 3 differentiated cells. These structural features included additional N-acetylhexosamine residues, potentially leading to new N-glycan terminal epitopes. Another differentiation-associated feature was an increase in the molar proportions of hybrid-type or monoantennary N-glycans. Biosynthesis of hybrid-type and complex-type N-glycans has been demonstrated to be biologically significant for embryonic and postnatal development in the mouse (loffe and Stanley, 1994 PNAS; Metzler et al., 1994 EMBO J; Wang et al., 2001 Glycobiology; Akama et al., 2006 PNAS). The preferential expression of complex-type N-glycans in hESC and then the change in the differentiating EB to express more hybrid-type or monoantennary N-glycans may thus be significant for the process of stem cell differentiation.
Human embryonic stem cell lines have previously been demonstrated to have a common genetic stem cell signature that can be identified using gene expression profiling techniques (Skottman et al., 2005; Sato et al., 2003; Abeyta et al., 2004; Bhattacharya et al., 2004). Such signatures have been proposed to be utilized in the characterization of cell lines. The present report provides the first glycomic signatures for hESC. The profile of the expressed N-glycans might be a useful tool for analyzing and classifying the differentiation stage in association with gene and protein expression analyses. Here we demonstrate that the glycan score algorithm was able to reliably differentiate cell samples of separate differentiation stage (
In conclusion, hESC have a unique glycome which undergoes major changes when the cells differentiate. Information regarding the specific glycome may be utilized in developing reagents for the targeting of these cells and their progeny. Future studies investigating the developmental and molecular regulatory processes resulting in the observed glycan profiles may provide significant insight into mechanisms of human development and regulation of glycosylation.
Abeyta, M. J., Clark, A. T., Rodriguez, R. T., Bodnar, M. S., Pera, R. A., and Firpo, M. T. (2004). Unique gene expression signatures of independently-derived human embryonic stem cell lines. Hum. Mol. Genet 13, 601-608.
Apweiler, R., Hermjakob, H., and Sharon, N. (1999). On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochem. Biophys. Acta 1473, 4-8.
Badcock, G., Pigott, C., Goepel, J., and Andrews, P. W. (1999). The human embryonal carcinoma marker antigen TRA-1-60 is a sialylated keratan sulfate proteoglycan. Cancer Res. 59, 4715-4719.
Becker, D. J., and Lowe, J. B. (2003). Fucose: biosynthesis and biological function in mammals. Glycobiology. 13:41R-5R.
Bhattacharya, B., Miura, T., Brandenberger, R., Mejido, J., Luo, Y., Yang, A. X., Joshi, B. H., Ginis, I., Thies, R. S., Amit, M., Lyons, I., Condie, B. G., Itskovitz-Eldor, J., Rao, M. S., and Puri, R. K. (2004). Gene expression in human embryonic stem cell lines: Unique molecular signature. Blood 103, 2956-2964.
Callewacrt, N., Van Vlicrberghe, H., Van Hecke, A., Laroy, W., Delanghe, J., and Contreras, R. (2004). Noninvasive diagnosis of liver cirrhosis using DNA sequencer-based total serum protein glycomics. Nat Med. 10:429-34.
Carson, D. D., and Lennarz, W. J. (1979). Inhibition of polyisoprenoid and glycoprotein biosynthesis causes abnormal embryonic development. Proc. Natl. Acad. Sci. U.S.A. 76, 5709-5713.
Cooper, D. K. (1998). Xenoantigens and xenoantibodies. Xenotransplantation 5, 6-17.
Davies, M. J., Smith, K. D., Carruthers, R. A., Chai, W., Lawson, A. M., and Hounsell, E. F. (1993). Use of a porous graphitised carbon column for the high-performance liquid chromatography of oligosaecharides, alditols and glycopeptides with subsequent mass spectrometry analysis. J. Chromatogr. 646, 317-326.
Dell, A., and Morris, H. R. (2001). Glycoprotein structure determination by mass spectrometry. Science 291, 2351-2356.
Fenderson, B. A., Zehavi, U., and IIakomori, S. (1984). A multivalent lacto-N-fucopentaose III lysyllysine conjugate decompacts preimplantation mouse embryos, while the free oligosaccharide is ineffective. J. Exp. Med. 160, 1591-1596.
Goldberg, D., Sutton-Smith, M., Paulson, J., and Dell, A. (2005). Automatic annotation of matrix-assisted laser desorption/ionization N-glycan spectra. Proteomics 5, 865-875.
Gooi, H. C., Feizi, T., Kapadia, A., Knowles, B. B., Solter, D., and Evans, M. J. (1981). Stage-specific embryonic antigen involves α1→3 fucosylated type 2 blood group chains. Nature 292, 156-158.
Haltiwanger, R. S., and Lowe, J. B. (2004). Role of glycosylation in development. Annu. Rev. Biochem. 73, 491-537.
Handel, T. M., Johnson, Z., Crown, S. E., Lau, E. K., and Proudfoot, A. E. (2005). Regulation of protein function by glycosaminoglycans—as exemplified by chemokines. Annu. Rev. Biochem. 74:385-410.
Harvey, D. J. (1993). Quantitative aspects of the matrix-assisted laser desorption mass spectrometry of complex oligosaccharides. Rapid Commun. Mass Spectrom. 7, 614-619.
Helenius, A., and Aebi, M. (2001). Intracellular functions of N-linked glycans. Science 291, 2364-2369.
Helenius, A., and Aebi, M. (2004). Roles of N-linked glycans in the endoplasmic reticulum. Annu. Rev. Biochem. 73, 1019-1049.
Homeister, J. W., Thall, A. D., Petryniak, B., Maly, P., Rogers, C. E., Smith, P. L., Kelly, R. J., Gersten, K. M., Askari, S. W., Cheng, G., Smithson, G., Marks, R. M., Misra, A. K., Hindsgaul, O., von Andrian, U. H., and Lowe, J. B. (2001). The α(1,3)fucosyltransferases FucT-IV and FucT-VII exert collaborative control over selectin-dependent leukocyte recruitment and lymphocyte homing. Immunity. 15:115-26.
Imperiali, B., and O'Connor, S. E. (1999). Effect of N-linked glycosylation on glycopeptide and glycoprotein structure. Curr. Opin. Chem. Biol. 3, 643-649.
Kannagi, R., Cochran, N. A., Ishigari, F., Hakomori, S., Andrews, P. W., Knowles, B. B., and Solter, D. (1983). Stage-specific embryonic antigens (SSEA-3 and -4) are epitopes of a unique globo-series ganglioside isolated from human teratocarcinoma cells. EMBO J. 2, 2355-2361.
Kilpatrick, D. C. (2002). Animal lectins: A historical introduction and overview. Biochem. Biophys. Acta 1572, 187-197.
Kornfeld, R., and Kornfeld, S. (1985). Assembly of asparagine-linked oligosaccharides. Annu. Rev. Biochem. 54, 631-664.
Kornfeld, S. (1986). Trafficking of lysosomal enzymes in normal and disease states. J. Clin. Invest. 77:1-6.
Kudo, T., Kaneko, M., Iwasaki, H., Togayachi, A., Nishihara, S., Abe, K., and Narimatsu, H. (2004). Normal embryonic and germ cell development in mice lacking α1,3-fucosyltransferase IX (Fut9) which show disappearance of stage-specific embryonic antigen 1. Mol. Cell. Biol. 24:4221-4228.
Lowe, J. B. (2002). Glycosylation in the control of selectin counter-receptor structure and function. Immunol. Rev. 186:19-36.
Luhn, K., Wild, M. K., Eckhardt, M., Gerardy-Sehahn, R., and Vestweber, D. (2001). The gene defective in leukocyte adhesion deficiency II encodes a putative GDP-fucose transporter. Nat. Genet. 28:69-72.
Martin, M. J., Muotri, A., Gage, F., and Varki, A. (2005). Human embryonic stem cells express an immunogenic nonhuman sialic acid. Nat. Med. 11, 228-232.
Mollicone, R., Cailleau, A., and Oriol, R. (1995). Molecular genetics of H, Se, Lewis, and other fucosyltransferase genes. Transfusion Clin. Biol. 4:235-242.
Muramatsu, T., and Muramatsu, H. (2004). Carbohydrate antigens expressed on stem cells and early embryonic cells. Glycoconj. J. 21, 41-45.
Nakayama, F., Nishihara, S., Iwasaki, H., Kudo, T., Okubo, R., Kaneko, M., Nakamura, M., Karube, M., Sasaki, K., and Narimatsu, H. (2001). CD15 expression in mature granulocytes is determined by α1,3-fucosyltransferase IX, but in promyelocytes and monocytes by α1,3-fucosyltransferase IV. J. Biol. Chem. 276:16100-16106.
Naven, T. J., and Harvey, D. J. (1996). Effect of structure on the signal strength of oligosaccharides in matrix-assisted laser desorption/ionization mass spectrometry on time-of-flight and magnetic sector instruments. Rapid Commun. Mass Spectrom. 10, 1361-1366.
Nguyen, D. H., Tangvoranuntakul, P., and Varki, A. (2005). Effects of natural human antibodies against a nonhuman sialic acid that metabolically incorporates into activated and malignant immune cells J. Immunol. 175, 228-236.
Nyman, T. A., Kallkinen, N., Tölö, H., and Helin, J. (1998). Structural characterisation of N-linked and O-linked oligosaccharides derived from interferon-α2b and interferon-α14c produced by Sendai-virus-induced human peripheral blood leukocytes. Eur. J. Biochem. 253, 485-493.
Okabe, S., Forsberg-Nilsson, K., Spiro, A. C., Segal, M., and McKay, R. D. (1996). Development of neuronal precursor cells and functional postmitotic neurons from embryonic stem cells in vitro. Mech. Dev. 59:89-102.
Papac, D. I., Wong, A., and Jones, A. J. (1996). Analysis of acidic oligosaecharides and glycopeptides by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Anal. Chem. 68, 3215-3223.
Saarinen, J., Welgus, H. G., Flizar, C. A., Kalkkeinen, N., and Helin, J. (1999). N-glycan structures of matrix metalloproteinase-1 derived from human fibroblasts and from HT-1080 fibrosarcoma cells. Eur. J. Biochem. 259, 829-840.
Sato, N., Sanjuan, I. M., Heke, M., Uchida, M., Naef, F., and Brivanlou, A. H. (2003). Molecular signature of human embryonic stem cells and its comparison with the mouse. Dev. Biol. 260, 404-413.
Schachter, H. (1991). The ‘yellow brick road’ to branched complex N-glycans. Glycobiology 1, 453-461.
Schneider, E. G., Nguyen, H. T., and Lennarz, W. J. (1978). The effect of tunicamycin, an inhibitor of protein glycosylation, on embryonic development in the sea urchin. J. Biol. Chem. 253, 2348-2355.
Shriver, Z., Raguram, S., and Sasisekharan, R. (2004). Glycomics: A pathway to a class of new and improved therapeutics. Nat. Rev. Drug Disc. 3, 863-873.
Skottman, H., Mikkola, M., Lundin, K., Olsson, C., Strömberg, A. M., Tuuri, T., Otonkoski, T., Hovatta, O., and Lahesmaa, R. (2005). Gene expression signatures of seven individual human embryonic stem cell lines. Stem cells 23, 1343-1356.
Smith, P. L., Myers, J. T., Rogers, C. E., Zhou, L., Petryniak, B., Becker, D. J., Homeister, J. W., and Lowe, J. B. (2002). Conditional control of selectin ligand expression and global fucosylation events in mice with a targeted mutation at the FX locus. J. Cell Biol. 158:801-815.
Solter, D., and Knowles, B. B. (1978). Monoclonal antibody defining a stage-specific mouse embryonic antigen (SSEA-1). Proc. Natl. Acad. Sci. U.S.A. 75, 5565-5569.
Staudacher, E., Altmann, F., Wilson, I. B. H., and März, L. (1999). Fucose in N-glycans: From plant to man. Biochim. Biophys. Acta 1473, 216-346.
Sutton-Smith, M., Morris, H. R., Grewal, P. K., Hewitt, J. E., Bittner, R. E., Goldin, E., Schiffmann, R., and Dell, A. (2002). MS screening strategies: Investigating the glycomes of knockout and myodystrophic mice and leukodystrophic human brains. Biochem. Soc. Symp. 69, 105-115.
Thomson, J. A., Itskovitz-Eldor, J., Shapiro, S. S., Waknitz, M. A., Swiergiel, J. J., Marshall, V. S., and Jones, J. M. (1998). Embryonic stem cell lines derived from human blastocysts. Science 282, 1145-1147.
Varki, A. (1993). Biological roles of oligosaccharides: All of the theories are correct. Glycobiology 3, 97-130.
Venable, A., Mitalipova, M., Lyons, I., Jones, K., Shin, S., Pierce, M., and Stice, S. (2005). Lectin binding profiles of SSEA-4 enriched, pluripotent human embryonic stem cell surfaces. BMC Dev. Biol. 2005 5, 15.
Verostek, M. F., Lubowski, C., and Trimble, R. B. (2000). Selective organic precipitation/extraction of released N-glycans following large-scale enzymatic deglycosylation of glycoproteins. Anal. Biochem. 278, 111-122.
Wang, X., Inouc, S., Gu, J., Miyoshi, E., Noda, K., Li, W., Mizuno-Horikawa, Y., Nakano, M., Asahi, M., Takahashi, M., Uozumi, N., Ihara, S., Lee, S. H., Ikeda, Y., Yamaguchi, Y., Aze, Y., Tomiyama, Y., Fujii, J., Suzuki, K., Kondo, A., Shapiro, S. D., Lopez-Otin, C., Kuwaki, T., Okabe, M., Honke, K., and Taniguchi, N. (2005). Dysregulation of TGF-β1 receptor activation leads to abnormal lung development and emphysema-like phenotype in core fucose-deficient mice. Proc. Natl. Acad. Sci. U.S.A. 102:15791-15796.
Wobus, A. M., and Boheler, K. R. (2005). Embryonic stem cells: Prospects for developmental biology and cell therapy. Physiol. Rev. 85, 635-678.
Zanetta, J. P., and Vergoten, G. (2003). Lectin domains on cytokines. Adv. Exp. Med. Biol. 535, 107-124.
Murine (mEF) and human (hEF) fibroblast feeder cells were prepared and their N-glycan fractions analyzed as described in the preceding Examples.
Results and Discussion
The results showed that mEF and hEF cellular N-glycan fractions differ significantly from each other. The differencies include differential proportions of glycan groups, major glycan signals, and the glycan profiles obtained from the cell samples. In addition, the major difference is the presence of Galα3Gal epitopes in the mEF cells, as discussed in the preceding Examples of the present invention.
Complex carbohydrate structures, glycans, are elementary components of glycoproteins, glycolipids, and proteoglycans. These glycoconjugates form a layer of glycans that covers all human cell surfaces and forms the first line of contact towards the cell's environment. Glycan structures called stage specific embryonic antigens (SSEA) are used to assess the undifferentiated stage of embryonic stem cells. However, the whole spectrum of stem cell glycan structures has remained unknown, largely due to lack of suitable analysis technology. We describe the first global study of glycoprotein glycans of human embryonic stem cells, embryoid bodies, and further differentiated cells by MALDI-TOF mass spectrometric profiling. The analysis reveals how certain asparagine-linked glycan structures characteristic to stem cells arc lost during differentiation white new structures emerge in the differentiated cells. The results indicate that human embryonic stem cells have a unique glycome and that their differentiation stage can be identified by glycome analysis. We suggest that knowledge about stem cell specific glycan structures can be used for e.g. purification, manipulation, and quality control of stem cells.
Materials & Methods
Human embryonic stem cell lines. Four Finnish hESC lines, FES 21, FES 22, FES 29, and FES 30 (Skottman et al., 2005. Stem cells 23:1343-56) were used in the present study. These lines are included in the International Stem Cell Initiative (Andrews et al., 2005. Nat. Biotechnol. 23:795-7). The cells were propagated on human foreskin fibroblast (hFF) feeder cells in serum-free medium (Knockout™, Gibco/Invitrogen). In FACS analyses 70-90% of cells from mechanically isolated colonies were typically Tra 1-60 and Tra 1-81 positive (not shown). Cells differentiated into embryoid bodies (EB, stage 2 differentiated) and further differentiated cells grown out of the EB as monolayers (stage 3 differentiated) were used for comparison against hESC. The differentiation protocol favors the development of neuroepithelial cells while not directing the differentiation into distinct terminally differentiated cell types (Okabe et al., 1996. Mech. Dev. 59:89-102). FB derived from FES 30 had less differentiated cell types than the other three EB. Stage 3 cultures consisted of a heterogenous population of cells dominated by fibroblastoid and neuronal morphologies. For the glycome studies the cells were collected mechanically, washed, and stored frozen until analysis.
In a preferred embodiment the invention is directed to the use of data obtained embryoid bodies or ESC-cell line cultivated under conditions favouring neuroepithelial cells for search of specific structures indicating neuroepithelial development, preferably by comparing the material with cell materials comprising neuronal and/or epithelial type cells.
Asparagine-linked glycome profiling. Total asparagine-linked glycan (N-glycan) pool was enzymatically isolated from about 100 000 cells. The total N-glycan pool (picomole quantities) was purified with microscale solid-phase extraction and divided into neutral and sialylated N-glycan fractions. The N-glycan fractions were analyzed by MALDI-TOF mass spectrometry either in positive ion mode for neutral N-glycans or in negative ion mode for sialylated glycans (Saarinen et al., 1999, Eur. J Biochem. 259, 829-840). Over one hundred N-glycan signals were detected from each cell type revealing the surprising complexity of hESC glycosylation. The relative abundances of the observed glycan signals were determined based on relative signal intensities (Harvey, 1993. Rapid Commun. Mass Spectrom. 7:614-9; Papac et al., 1996. Anal. Chem. 68:3215-23).
Results
In the present study, we analyzed the N-glycome profiles of hESC, EB, and st.3 differentiated cells (
The similarity of the N-glycan profiles within the group of four hESC lines suggested that the obtained N-glycan profiles are a description of the characteristic N-glycome of hESC. Overall, 10% of the 100 most abundant N-glycan signals present in hESC disappeared in st.3 differentiated cells, and 16% of the most abundant signals in st3 differentiated cells were not present in hESC. This indicates that differentiation induced the appearance of new N-glycan types while earlier glycan types disappeared. In quantitative terms, the differences between the glycan profiles of hESC, EB, and st.3 differentiated cells were: hESC vs. EB 19%, hESC vs. st.3 24%, and EB vs. st.3 12%.
The glycome profile data was used to design glycan-specific labeling reagents for hESC. The most interesting glycan types were chosen to study their expression profiles by lectin histochemistry as exemplified in
Table 31 further represent differential recognition feeder and stem cells by two other lectins, Ricinus communis agglutinin (RCA, ricin lectin), known to recognize especially terminal Galβ-structures, especially Galβ4Gle(NAc)-type structures and peanut agglutinin (PNA) reconnizing Gal/GalNAc structures. The cell surface expression of ligand for two other lectin RCA and PNA on hESC cells, but only RCA ligands of feeder cells.
The present results indicate and the invention is directed to the hESC glycans are potential targets for recognition by stem cell specific reagents. The invention is further directed to methods of specific recognition and/or separation of hESC and differentiated cells such as feeder cells by glycan structure specific reagents such as lectins. Human embryonic stem cells have a unique glycome that reflects their differentiation stage. The invention is specifically directed to analysis of cells according to the invention with regard to differentiation stage.
Conclusions
The present data represent the glycome profiling of hESC:
Utility of hESC glycome data:
Especially preferred uses of the data are
Use of the hESC glycome for identification of specific cell surface markers characteristic for the pluripotent hESCs.
The invention is directed to further analysis and production of present and analogous glycome data and use of the methods for further identification of novel stem cell specific glycosylation features and form the basis for studies of hESC glycobiology and its eventual applications according to the invention
To identify differentiation stage specific N-glycan signals in sialylated N-glycan profiles of hESC, EB, and stage 3 differentiated cells (see Example 12 above), major signals specific to either the undifferentiated (
To further analyze the data and to find the major glycan signals associated in given hESC differentiation stage, two variables were calculated for the comparison of glycan signals in the N-glycan profile dataset described above, between two samples:
1. absolute difference A=(S2−S1), and
2. relative difference R=A|S1,
wherein S1 and S2 are relative abundances of a given glycan signal in samples 1 (the four EB samples) and 2 (the four st.3 cell samples), respectively.
When A and R were calculated for the glycan profile datasets of the two cell types, and the glycan signals thereafter sorted according to the values of A and R, the most significant differing glycan signals between the two samples could be identified. Among the fifty most abundant neutral N-glycan signals in the data (
EB derived from the hESC line FES 30 were different in their overall N-glycan profiles compared to the other three EB samples (
Experimental Procedures
Lectins (EY laboratories, USA) were passively adsorbed on 48-well plates (Nunclon surface, catalog No 150687, Nunc, Denmark) by overnight incubation in phosphate buffered saline.
Human bone marrow derived mesenechymal stem cells (BM MSC) were cultured in minimum essential α-medium (α-MFM) supplemented with 20 mM HFPES, 10% FCS, penicillin-streptomycin, and 2 mM L-glutamine (all from Gibco) on 48-well plates coated with different lectins. Cells were cultivated in Cell IQ (ChipMan Technologies, Tampere, Finland) at +37° C. with 5% CO2. Images were taken every 15 minutes. Data were analyzed with Cell IQ Analyzer software by analyzer protocol built by Dr. Ulla Impola (Finnish Red Cross Blood Service, Helsinki, Finland).
Results and Discussion
The growth rates of BM MSC varied on different lectin-coated surfaces compared to each other and uncoated plastic surface (Table 32), indicating that proteins with different glycan binding specificities binding to stem cell surface glycans specifically influence their proliferation rate.
Lectins that had an enhancing effect on BM MSC growth rate included in order of relative efficacy:
GS II (β-GlcNAc)>ECA (LacNAc/β-Gal)>PWA (I-branched poly-LacNAc)>LTA (α1,3-Fuc)>PSA (α-Man),
wherein the preferred oligosaccharide specificities of the lectins are indicated in parenthesis. However, PSA was nearly equal to plastic in the present experiments.
Lectins that had an inhibitory effect on BM MSC growth rate included in order of relative efficacy:
RCA (βGal/LacNAc)>>UEA (α1,2-Fuc)>WFA (β-GalNAc)>STA (linear poly-LacNAc)>NPA (α-Man)>SNA (α2,6-linked sialic acids)=MAA (α2,3-linked sialic acids/(α3′-sialyl LacNAc),
wherein the preferred oligosaccharide specificities of the lectins arc indicated in parenthesis. However, NPA, SNA, and MAA were nearly equal to plastic in the present experiments.
Experimental Procedures
Samples from MSC, CB MNC, and hESC grown on mouse fibroblast feeder cells were produced as described in the preceding Examples. Neutral and acidic glycosphingolipid fractions were isolated from cells essentially as described (Miller-Podraza et al., 2000). Glycans were detached by Macrobdella decora endoglycoceramidase digestion (Calbiochem, USA) essentially according to manuacturer's instructions, yielding the total glycan oligosaccharide fractions from the samples. The oligosaccharides were purified and analyzed by MALDI-TOF mass spectrometry as described in the preceding Examples for the protein-linked oligosaccharide fractions.
Results and Discussion
Human Embryonic Stem Cells (hESC)
hESC neutral lipid glycans. The analyzed mass spectrometric profile of the hESC glycosphingolipid neutral glycan fraction is shown in
Structural analysis of the major neutral lipid glycans. The six major glycan signals, together comprising more than 90% of the total glycan signal intensity, corresponded to monosaccharide compositions Hex3HexNAc1 (730), Hex3HexNAc1dHex1 (876), Hex2HexNAc1 (568), Hex3HexNAc2 (933), Hex4HexNAc1 (892), and Hex4HexNAc2 (1095).
In β1,4-galactosidase digestion, the relative signal intensities of 1095 and 730 were reduced by about 30% and 10%, respectively. This suggests that 730 and 1095 contain minor components with non-reducing terminal β1,4-Gal epitopes, preferably including the structures Galβ4GlcNAcLac and Galβ4GlcNAc[Hex1HexNAc1]Lac. The other major components were thus shown to contain other terminal epitopes. Further, the glycan signal Hex5HexNAc3 (1460) was digested to Hex3HexNAc3 (1136), indicating that the original signal contained glycan structures containing two β1,4-Gal.
The major glycan signals were not sensitive to α-galactosidase digestion.
In α1,3/4-fucosidase digestion, the signal intensity of 876 was reduced by about 10%, indicating that only a minor proportion of the glycan signal corresponded to glycans with α1,3- or α1,4-linked fucose residue. The major affected signal in the total profile was Hex3HexNAc1dHex2 (1022), indicating that it included glycans with either α1,3-Fuc or α1,4-Fuc. 511 was reduced by about 30%, indicating that the signal contained a minor component with α1,2-Fuc, preferentially including Fucα2Galβ4Glc (Fucα2′Lac, 2′-fucosyllactose).
When the α1,3/4-fucosidase reaction product was further digested with α1,2-fucosidase, 876 was completely digested into 730, indicating that the structure of the majority of the signal intensity contained non-reducing terminal α1,2-Fuc, preferably including the structure Fucα2[Hex1HexNAc1]Lac, more preferably including Fucα2GalHexNAcLac. Another partly digested glycan signal was Hex4HexNAc2dHex1 (1241) that was thus indicated to contain α1,2-Fuc, preferably including the structure Fucα2[Hex2HexNAc2]Lac, more preferably including Fucα2Gal[Hex1HexNAc2]Lac. 511 was completely digested, indicating that the original signal contained a major component with α1,3/4-Fuc, preferentially including Galβ4(Fucα3)Glc (3-fucosyllactose).
When the α1,3/4-fucosidase and α1,2-fucosidase reaction product was further digested with β1,4-galactosidase, the majority of the newly formed 730 was not digested, i.e. the relative proportion of 568 was not increased compared to β1,4-galactosidase digestion without preceding fucosidase treatments. This indicated that the majority of 876 did not contain β1,4-Gal subterminal to Fuc. Further, 892 was not digested, indicating that it did not contain non-reducing terminal β1,4-Gal.
When the α1,3/4-fucosidase, α1,2-fucosidase, and β1,4-galactosidase reaction product was further digested with β1,3-galactosidase, the signal intensity of 892 was reduced, indicating that it included glycans with terminal β1,3-Gal. The signal intensity of 568 was increased relative to 730, indicating that also 730 included glycans with terminal β1,3-Gal.
The experimental structures of the major hESC glycosphingolipid neutral glycan signals were thus determined (‘>’ indicates the order of preference among the lipid glycan structures of hESC; ‘[ ]’ indicates that the oligosaccharide sequence in brackets may be either branched or unbranched; ‘( )’ indicates a branch in the structure):
Acidic lipid glycans. The analyzed mass spectrometric profile of the hESC glycosphingolipid sialylated glycan fraction is shown in
The acidic glycan fraction was subjected to a2,3-sialidase digestion and the resulting neutral and acidic glycan fractions were purified and analyzed separately. In the acidic fraction, signals 1159 and 1288 were digested and 835 was partly digested. In the neutral fraction, signals 730 and 892 were the major appeared signals. These results indicated that: 1159 consisted mainly of glycans with α2,3-NcuAc, 1288 contained at least one α2,3-NcuAc, a major proportion of glycans in 835 contained α2,3-NeuAc, and in the original sample a major proportion of NeuAc1-2Hex3HexNAc1 contained solely α2,3-linked NeuAc.
Human Mesenechymal Stem Cells (MSC)
Bone marrow derived (BM) MSC neutral lipid glycans. The analyzed mass spectrometric profile of the BM MSC glycosphingolipid neutral glycan fraction is shown in
Cord blood derived (CB) MSC neutral lipid glycans. The analyzed mass spectrometric profile of the CB MSC glycosphingolipid neutral glycan fraction is shown in
In β1,4-galactosidase digestion, the relative signal intensities of 1095, 1460, and 730 were reduced by about 90%, 95%, and 20%, respectively. This suggests that CB MSC contained major glycan components with non-reducing terminal β1,4-Gal epitopes, preferably including the structures Galβ4GlcNAcβ[Hex1HexNAc1]Lac, Galβ4GlcNAc[Hex2HexNAc2]Lac, and GalβGlcNAcLac. Further, the glycan signal Hex5HexNAc3 (1460) was digested into Hex4HexNAc3 (1298) and mostly into Hex3HexNAc3 (1136), indicating that the original signal contained glycan structures containing either one or two β1,4-Gal, and that the majority of the original glycans contained two β1,4-Gal, preferentially including the structure GalβGlcNAc(Galβ4GlcNAc)[Hex1HexNAc1]Lac. Similarly, 1095 was digested into Hex2HexNAc2 (771) in addition to 933, indicating that the original signal contained glycan structures containing either one or two β1,4-Gal, and that the minority of the original glycans contained two β1,4-Gal, preferentially including the structure Galβ4GlcNAc(Galβ4GlcNAc)Lac.
The experimental structures of the major CB MSC glycosphingolipid neutral glycan signals were thus determined (‘>’ indicates the order of preference among the lipid glycan structures of hESC; ‘[ ]’ indicates that the oligosaccharide sequence in brackets may be either branched or unbranched; ‘( )’ indicates a branch in the structure):
Sialylated lipid glycans. The analyzed mass spectrometric profile of the hESC glycosphingolipid sialylated glycan fraction is shown in
Human Cord Blood Mononuclear Cells (CB MNC)
CB MNC neutral lipid glycans. The analyzed mass spectrometric profile of the CB MNC glycosphingolipid neutral glycan fraction is shown in
In β1,4-galactosidase digestion, the relative signal intensities of 730 and 1095 were reduced by about 50% and 90%, respectively. This suggests that the signals contained major components with non-reducing terminal β1,4-Gal epitopes, preferably including the structures Galβ4GlcNAcβLac and Galβ4GlcNAcβ[Hex1HexNAc1]Lac. Further, the glycan signal Hex5HexNAc3 (1460) was digested to Hex4HexNAc3 (1298) and Hex3HexNAc3 (1136), indicating that the original signal contained glycan structures containing either one or two β1,4Gal.
The experimental structures of the major CB MNC glycosphingolipid neutral glycan signals were thus determined (‘>’ indicates the order of preference among the lipid glycan structures of hESC; ‘[ ]’ indicates that the oligosaccharide sequence in brackets may be either branched or unbranched; ‘( )’ indicates a branch in the structure):
Sialylated lipid glycans. The analyzed mass spectrometric profile of the CB MNC glycosphingolipid sialylated glycan fraction is shown in
Overview of Human Stem Cell Glycosphingolipid Glycan Profiles
The neutral glycan fractions of all the present sample types altogether comprised 45 glycan signals. The proposed monosaccharide compositions of the signals were composed of 2-7 Hex, 0-5 HexNAc, and 0-4 dHex. Glycan signals were detected at monoisotopic m/z values between 511 and 2263 (for [M+Na]+ ion).
Major neutral glycan signals common to all the sample types were 730, 568, 1095, and 933, corresponding to the glycan structure groups Hex0-1HexNAc1Lac (568 or 730) and Hex1-2HexNAc2Lac (933 or 1095), of which the former glycans were more abundant and the latter less abundant. A general formula of these common glycans is HexmHexNAcnLac, wherein m is either n or n-1, and n is either 1 or 2.
Neutral Glycolipid Profiles of Human Stem Cell Types:
Glycan signals typical to hESC preferentially include 876 and 892 (especially compared to MSC); the former preferentially corresponds to FucHexHexNAcLac, wherein α1,2-Fuc is preferential to α1,3/4-Fuc, and the latter preferentially corresponds to Hex2HexNAc1Lac, and more preferentially to Galβ3[Hex1HexNAc1]Lac; the glycan core composition Hex4HexNAc1 was especially characteristic of hESC compared to other human stem cell types, in addition to fucosylation and more preferentially α1,2-linked fucosylation.
Glycan signals typical to both CB and BM MSC preferentially include 771, 1063, 1225; more preferentially including compositions dHex0/2Hex0-1HexNAc2Lac.
Glycan signals typical to especially BM MSC preferentially include 511 and fucosylated structures, preferentially multifucosylated structures.
Glycan signals typical to especially CB MSC preferentially include 1460 and 1298, as well as large neutral glycolipids, especially Hex2-3HexNAc3Lac. In addition, low fucosylation and/or high expression of terminal β1,4-Gal was typical to especially CB MSC.
Glycan signals typical to CB MNC preferentially include compositions dHex0-1[HexHexNAc]1-2Lac, more preferentially high relative amounts of 730 compared to other signals; and fucosylated structures; and glycan profiles with less variability and/or complexity than other stem cell types.
The acidic glycan fractions of all the present sample types altogether comprised 38 glycan signals. The proposed monosaccharide compositions of the signals were composed of 0-2 NeuAc, 2-9 Hex, 0-6 HexNAc, 0-3 dHex, and/or 0-1 sulphate or phosphate esters. Glycan signals were detected at monoisotopic m/z values between 786 and 2781 (for [M−H]− ion).
The acidic glycosphingolipid glycans of CB MNC were mainly composed of NeuAc1Hexn+2HexNAcn, wherein 1≦n≦3, indicating that their structures were NeuAc1[HexHexNAc]1-3Lac.
Terminal glycan epitopes that were demonstrated in the present experiments in stem cell glycosphingolipid glycans include:
Gal
Galβ4Glc (Lac)
Galβ4GlcNAc (LacNAc type 2)
Galβ3
Non-reducing terminal HexNAc
Fuc
α1,2-Fuc
α1,3-Fuc
Fucα2Gal
Fucα2Galβ4GlcNAc (H type 2)
Fucα2Galβ4Glc (2′-fucosyllactose)
Fucα3GlcNAc
Galβ4(Fucα3)GlcNAc (Lex)
Fucα3Glc
Galβ4(Fucα3)Glc (3-fucosyllactose)
Neu5Ac
Neu5Acα2,3
Neu5Acα2,6
Development-related glycan epitope expression. According to the present invention, the glycosphingolipid glycan composition Hex4HexNAc1 preferentially corresponds to (iso)globo structures. The glycan sequence of the SSEA-3 glycolipid antigen has been determined to be Galβ3GalNAcβ3Galα4Galβ4Glc, which corresponds to the glycan signal Hex4HexNAc1 (892) detected in the present experiments only in hESC. Similarly, the glycan sequence of the SSEA-4 glycolipid antigen has been determined to be NeuAcα3Galβ3GalNAcβ3Galα4Galβ4Glc, which corresponds to the glycan signal NeuAc1Hex4HexNAc1 (1159) detected in the present experiments only in hESC. Consistent with the present glycan structure analyses, the hESC samples were determined to be SSEA-3 and SSEA-4 positive by monoclonal antibody staining as described in the preceding Examples. In contrast to mouse ES cells, hESC do not express the SSEA-1 antigen; consistent with this we found only low expression levels of α1,3/4-fucosylated neutral glycolipid glycans. In contrast, we were able to show that the major fucosylated structures of hESC glycosphingolipid glycans contain α1,2-Fuc, which is a molecular level explanation to the mouse-human difference in SSEA-1 reactivity.
The FACS experiments with fluorescein-labeled lectins and CB MNC were performed essentially similarly to Example 7. Double stainings were performed with CD34 specific monoclonal antibody (Jaatinen et al., 2006) with complementary fluorescent dye. Erythroblast depletion from CD MNC fraction was performed by anti-glycophorin A (GlyA) monoclonal antibody negative selection.
Results and Discussion
Compared to the CB MNC fraction, GlyA depleted CB MNC showed decreased staining in FACS with the following lectins (the decrease in % in parenthesis): PWA (48%), LTA (59%), UEA (34%), STA, MAA, and PNA (all latter three less than 23%); indicating that GlyA depletion increased the resolving power of the lectins in cell sorting.
In FACS double staining with both fluorescein-labeled lectins and anti-CD34 antibody, the following lectins colocalized with CD34+ cells: STA (3/3 samples), HHA (3/3 samples), PSA (3/3 samples), RCA (3/3 samples), and partly also NPA (2/3 samples). In contrast, the following lectins did not colocalize with CD34+ cells: GNA (3/3 samples) and PWA (3/3 samples), and partly also LTA (2/3 samples), WFA (2/3 samples), and GS-II (2/3 samples).
Taken together with the results of Example 8, the present results indicate that lectins can enrich CD34+ cells from CB MNC by both negative and positive selection, for example:
Experimental Procedures
Gene expression analysis of CB CD133+ cells has been described (Jaatinen et al., 2006) and the present analysis was performed essentially similarly. The galectins whose gene expression profile was analyzed included (corresponding Affymetrix codes in parenthesis): Galectin-1 (201105_at), galectin-2 (208450_at), galectin-3 (208949_s_at), galectin-4 (204272_at), galectin-6 (200923_at), galectin-7 (206400_at), galectin-8 (208933_s_at), galectin-9 (203236_s_at), galectin-10 (206207_at), galectin-13 (220158_at).
Results and Discussion
In CB CD133+ versus CD133−, as well as CD34+ versus CD34− CB MNC cells, the galectin gene expression profile was as follows: Overall, galectins 1, 2, 3, 6, 8, 9, and 10 showed gene expression in both CD34+/CD133+ cells. Galectins 1, 2, and 3 were downregulated in both CD34+/CD133+ cells with respect to CD34−/CD133− cells, and in addition galectin 10 was downregulated in CD133+ cells with respect to CD133− cells. In contrast, in both CD34+/CD133+ cells galectin 8 was upregulated with respect to CD34−/CD133− cells.
In hESC versus EB samples, the galectin gene expression profile was as follows: Overall, galectins 1, 3, 6, 8, and 13 showed gene expression in hESC. Galectin 3 was clearly downregulated with respect to EB, and in addition galectin 13 was downregulated in 2 out of 4 hESC lines. In contrast, galectin 1 was clearly upregulated in all hESC lines.
The results indicate that both CB CD34+/CD133+ stem cell populations and hESC have an interesting and distinct galectin expression profiles, leading to different galectin ligand affinity profiles (Hirabayashi et al., 2002). The results further correlate with the glycan analysis results showing abundant galectin ligand expression in these stem cells, especially non-reducing terminal β-Gal and type II LacNAc, poly-LacNAc, β1,6-branched poly-LacNAc, and complex-type N-glycan expression.
Immunohistochemical Studies of Stem Cells (GF Series of Stainings)
After rinsing with PBS the sections were incubated in 3% highly purified BSA in PBS for 30 minutes at RT to block nonspecific binding sites. Primary antibodies (GF279, 288, 287, 284, 285, 283,286,290 and 289) were diluted (1:10) in PBS containing 1% BSA-PBS and incubated 1 hour at RT. After rinsing three times with PBS, the sections were incubated with biotinylated rabbit anti-mouse, secondary antibody (Zymed Laboratories, San Francisco, Calif., USA) in PBS for 30 minutes at RT, rinsed in PBS and incubated with peroxidase conjugated streptavidin (Zymed Laboratories) diluted in PBS. The sections were finally developed with AEC substrate (3-amino-9-ethyl carbazole; Lab Vision Corporation, Fremont, Calif., USA). After rinsing with water counterstaining was performed with Mayer's hemalum solution.
Antibodies used in the immunostainings. See also Table 22 for results.
Experimental Procedures
Exoglycosidase digestions. Neutral N-glycan fractions were isolated from cord blood mononuclear cell populations as described above. Exoglycosidase reactions were performed essentially after manufacturers' instructions and as described in (Saarinen et al., 1999). The different reactions were; α-Man: α-mannosidase from Jack beans (C. ensiformis; Sigma, USA); β1,4-Gal: β1,4-galactosidase from S. pneumoniae (recombinant in E. coli; Calbiochem, USA); β1,3-Gal: recombinant β1,3-galactosidase (Calbiochem, USA); β-GlcNAc: β-glucosaminidase from S. pneumoniae (Calbiochem, USA); α2,3-SA: α2,3-sialidase from S. pneumoniae (Calbiochem, USA). The analytical reactions were carefully controlled for specificity with synthetic oligosaccharides in parallel control reactions that were analyzed by MALDI-TOF mass spectrometry. The sialic acid linkage specificity of α2,3-SA was controlled with synthetic oligosaccharides in parallel control reactions, and it was confirmed that in the reaction conditions the enzyme hydrolyzed α2,3-linked but not α2,6-linked sialic acids. The analysis was performed by MALDI-TOF mass spectrometry as described in the preceding examples. Digestion results were analyzed by comparing glycan profiles before and after the reaction.
RESULTS Glycosidase profiling of neutral N-glycans. Neutral N-glycan fractions from affinity-purified CD34+, CD34−, CD133+, CD133−, Lin+, and Lin− cell samples from cord blood mononuclear cells were isolated as described above. The glycan samples were subjected to parallel glycosidase digestions as described under Experimental procedures. Profiling results are summarized in Table 3 (CD34+ and CD34− cells), Table 4 (CD133+ and CD133− cells), and Table 5 (Lin− and Lin+ cells). The present results show that several neutral N-glycan signals are individually sensitive towards all the exoglycosidases, indicating that in all the cell types several neutral N-glycans contain specific substrate glycan structures in their non-reducing termini. The results also show clear differences between the cell types in both the sensitivity of individual glycan signals towards each enzyme and also profile-wide differences between cell types, as detailed in the Tables cited above.
Glycosidase profiling of sialylated N-glycans. Sialylated N-glycan fractions from affinity-purified CD133+ and CD133− cell samples from cord blood mononuclear cells were isolated as described above. The glycan samples were subjected to parallel glycosidase digestions as described under Experimental procedures. Profiling results by α2,3-sialidase are shown in Table 6. The results show significant differences between the glycan profiles of the analyzed cell types in the sialylated and neutral glycan fractions resulting in the reaction. The present results show that differences are seen in multiple signals in a profile-wide fashion. Also individual signals differ between cell types, as discussed below.
Cord blood CD133+ and CD133− cell N-glycans are differentially α2,3-sialylated. Sialylated N-glycans from cord blood CD133+ and CD133− cells were treated with c2,3-sialidase, after which the resulting glycans were divided into sialylated and non-sialylated fractions, as described under Experimental procedures. Both α2,3-sialidase resistant and sensitive sialylated N-glycans were observed, i.e. after the sialidase treatment sialylated glycans were observed in the sialylated N-glycan fraction and desialylated glycans were observed in the neutral N-glycan fraction. The results indicate that cord blood CD133+ and CD133− cells are differentially α2,3-sialylated. For example, after α2,3-sialidase treatment the relative proportions of monosialylated (SA1) glycan signal at m/z 2076, corresponding to the [M−H]− ion of NeuAc1Hex5HexNAc4dHex1, and the disialylated (SA2) glycan signal at m/z 2367, corresponding to the [M−H]− ion of NeuAc2Hex5HexNAc4dHex1, indicate that α2,3-sialidase resistant disialylated N-glycans are relatively more abundant in CD133− than in CD133+ cells, when compared to α2,3-sialidase resistant monosialylated N-glycans. It is concluded that N-glycan α2,3-sialylation in relation to other sialic acid linkages including especially α2,6-sialylation, is more abundant in cord blood CD133+ cells than in CD133− cells.
In cord blood CD133− cells, several sialylated N-glycans were observed that were resistant to α2,3-sialidase treatment, i.e. neutral glycans were not observed that would correspond to the desialylated forms of the original sialylated glycans. The results revealing differential α2,3-sialylation of individual N-glycan structures between cord blood CD133+ and CD133− cells are presented in Table 6. The present results indicate that N-glycan α2,3-sialylation in relation to other sialic acid linkages is more abundant in cord blood CD133+ cells than in CD133− cells.
Sialidase analysis. The sialylated N-glycan fraction isolated from a cord blood mononuclear cell population (CB MNC) was digested with broad-range sialidase as described in the preceding Examples. After the reaction, it was observed by MALDI-TOF mass spectrometry that the vast majority of the sialylated N-glycans were desialylated and transformed into corresponding neutral N-glycans, indicating that they had contained sialic acid residues (NeuAc and/or NeuGc) as suggested by the proposed monosaccharide compositions. Combined glycan profiles of neutral and desialylated (originally sialylated) N-glycan fractions of a CB MNC population was produced. The profiles correspond to total N-glycan profiles isolated from the cell samples (in desialylated form). It is calculated that approximately 25% of the N-glycan signals correspond to high-mannosc type N-glycan monosaccharide compositions, and 28% to low-mannose type N-glycans, 34% to complex-type N-glycans, and 13% to hybrid-type or monoantennary N-glycans monosaccharide compositions.
CONCLUSIONS The present results suggest that 1) the glycosidase profiling method can be used to analyze structural features of individual glycan signals, as well as differences in individual glycans between cell types, 2) different cell types differ from each other with respect to both individual glycan signals' and glycan profiles' susceptibility to glycosidases, and 3) glycosidase profiling can be used as a further means to distinguish different cell types, and in such case the parameters for comparison are both individual signals and profile-wide differences.
Experimental Procedures
Enzymatic modifications. Sialyltransferase reaction: Human cord blood mononuclear cells (3×106 cells) were modified with 60 mU α2,3-(N)-sialyltransferase (rat, recombinant in S. frugiperda, Calbiochem), 1.6 μmol CMP-Neu5Ac in 50 mM sodium 3-morpholinopropanesulfonic acid (MOPS) buffer pH 7.4, 150 mM NaCl at total volume of 100 μl for up to 12 hours. Fucosyltransferase reaction: Human cord blood mononuclear cells (3×106 cells) were modified with 4 mU α1,3-fucosyltransferase VI (human, recombinant in S. frugiperda, Calbiochem), 1 μmol GDP-Fuc in 50 mM MOPS buffer pH 7.2, 150 mM NaCl at total volume of 100 μl for up to 3 hours. Broad-range sialidase reaction: Human cord blood mononuclear cells (3×106 cells) were modified with 5 mU sialidase (A. ureafaciens, Glyko, UK) in 50 mM sodium acetate buffer pH 5.5, 150 mM NaCl at total volume of 100 μl for up to 12 hours. α2,3-specific sialidase reaction: Cells were modified with α2,3-sialidase (S. pneumoniae, recombinant in E. coli) in 50 mM sodium acetate buffer pH 5.5, 150 mM NaCl at total volume of 100 μl. α-mannosidase reaction: α-mannosidase was from Jack beans and reaction was performed essentially similarly as with other enzymes described above. Sequential enzymatic modifications: Between sequential reactions cells were pelleted with centrifugation and supernatant was discarded, after which the next modification enzyme in appropriate buffer and substrate solution was applied to the cells as described above. Washing procedure: After modification, cells were washed with phosphate buffered saline.
Glycan analysis. After washing the cells, total cellular glycoproteins were subjected to N-glycosidase digestion, and sialylated and neutral N-glycans isolated and analyzed with mass spectrometry as described above. For O-glycan analysis, the glycoproteins were subjected to reducing alkaline β-elimination essentially as described previously (Nyman et al., 1998), after which sialylated and neutral glycan alditol fractions were isolated and analyzed with mass spectrometry as described above.
Results
Sialidase digestion. Upon broad-range sialidase catalyzed desialylation of living cord blood mononuclear cells, sialylated N-glycan structures as well as O-glycan structures (data not shown) were desialylated, as indicated by increase in relative amounts of corresponding neutral N-glycan structures, for example Hex6HexNAc3, Hex5HexNAc4dHex0-2, and Hex6HexNAc5dHex0-1 monosaccharide compositions (Table 10). In general, a shift in glycosylation profiles towards glycan structures with less sialic acid residues was observed in sialylated N-glycan analyses upon broad-range sialidase treatment. The shift in glycan profiles of the cells upon the reaction served as an effective means to characterize the reaction results. It is concluded that the resulting modified cells contained less sialic acid residues and more terminal galactose residues at their surface after the reaction.
α2,3-specific sialidase digestion. Similarly, upon α2,3-specific sialidase catalyzed desialylation of living mononuclear cells, sialylated N-glycan structures were desialylated, as indicated by increase in relative amounts of corresponding neutral N-glycan structures (data not shown). In general, a shift in glycosylation profiles towards glycan structures with loss sialic acid residues was observed in sialylated N-glycan analyses upon α2,3-specific sialidase treatment. The shift in glycan profiles of the cells upon the reaction served as an effective means to characterize the reaction results. It is concluded that the resulting modified cells contained less α2,3-linked sialic acid residues and more terminal galactose residues at their surface after the reaction.
Sialyltransferase reaction. Upon α2,3-sialyltransferase catalyzed sialylation of living cord blood mononuclear cells, numerous neutral (Table 10) and sialylated N-glycan (Table 9) structures as well as O-glycan structures (data not shown) were sialylated, as indicated by decrease in relative amounts of neutral N-glycan structures (Hex5HexNAc4dHex0-3 and Hex6HexNAc5dHex0-2 monosaccharide compositions in Table 10) and increase in the corresponding sialylated structures (for example the NeuAc2Hex5HexNAc4dHex1 glycan in Table 9). In general, a shift in glycosylation profiles towards glycan structures with more sialic acid residues was observed both in N-glycan and O-glycan analyses. It is concluded that the resulting modified cells contained more α2,3-linked sialic acid residues and less terminal galactose residues at their surface after the reaction.
Fucosyltransferase reaction. Upon α1,3-fucosyltransferase catalyzed fucosylation of living cord blood mononuclear cells, numerous neutral (Table 10) and sialylated N-glycan structures as well as O-glycan structures (see below) were fucosylated, as indicated by decrease in relative amounts of nonfucosylated glycan structures (without dHex in the proposed monosaccharide compositions) and increase in the corresponding fucosylated structures (with ndHex>0 in the proposed monosaccharide compositions). For example, before fucosylation O-glycan alditol signals at m/z 773, corresponding to the [M+Na]+ ion of Hex2HexNAc2 alditol, and at m/z 919, corresponding to the [M+Na]+ ion of Hex2HexNAc2dHex1 alditol, were observed in approximate relative proportions 9:1, respectively (data not shown). After fucosylation, the approximate relative proportions of the signals were 3:1, indicating that significant fucosylation of neutral O-glycans had occurred. Some fucosylated N-glycan structures were even observed after the reaction that had not been observed in the original cells, for example neutral N-glycans with proposed structures Hex6HexNAc5dHex1 and Hex6HexNAc5dHex2 (Table 10), indicating that in α1,3-fucosyltransferase reaction the cell surface of living cells can be modified with increased amounts or extraordinary structure types of fucosylated glycans, especially terminal Lewis x epitopes in protein-linked N-glycans as well as in O-glycans.
Sialidase digestion followed by sialyltransferase reaction. Cord blood mononuclear cells were subjected to broad-range sialidase reaction, after which α2,3-sialyltransferase and CMP-Neu5Ac were added to the same reaction, as described under Experimental procedures. The effects of this reaction sequence were observable on the N-glycan profiles. The sialylated N-glycan profile was also analyzed between the reaction steps, and the result clearly indicated that sialic acids were first removed from the sialylated N-glycans (indicated for example by appearance of increased amounts of neutral N-glycans), and then replaced by α2,3-linked sialic acid residues (indicated for example by disappearance of the newly formed neutral N-glycans; data not shown). It is concluded that the resulting modified cells contained more α2,3-linked sialic acid residues after the reaction.
Sialyltransferase reaction followed by fucosyltransferase reaction. Cord blood mononuclear cells were subjected to α2,3-sialyltransferase reaction, after which α1,3-fucosyltransferase and GDP-fucose were added to the same reaction, as described under Experimental procedures. The effects of this reaction sequence were observable on the sialylated N-glycan profiles of the cells. The results show that a major part of the glycan signals (examples in Tables 9 and 110) have undergone changes in their relative intensities, indicating that a major part of the sialylated N-glycans present in the cells were substrates of the enzymes. It was also clear that the combination of the enzymatic reaction steps resulted in different result than either one of the reaction steps alone.
Different from the α1,3-fucosyltransferase reaction described above, sialylation before fucosylation apparently sialylated the neutral fucosyltransferase acceptor glycan structures present on cord blood mononuclear cell surfaces, resulting in no detectable formation of the neutral fucosylated N-glycan structures that had emerged after α1,3-fucosyltransferase reaction alone (discussed above; Table 10).
α-mannosidase reaction. α-mannosidase reaction of whole cells showed a minor reduction of glycan signals including those indicated to contain α-mannose residues in the preceding examples.
Glycosyltransferase-derived glycan structures. We detected that glycosylated glycosyltransferase enzymes can contaminate cells in modification reactions. For example, when cells were incubated with recombinant fucosyltransferase or sialyltransferase enzymes produced in S. frugiperda cells, N-glycosidase and mass spectrometric analysis of cellular and/or cell-associated glycoproteins resulted in detection of an abundant neutral N-glycan signal at m/z 1079, corresponding to [M+Na]+ ion of Hex3HexNAc2dHex1 glycan component (calc. m/z 1079.38). Typically, in recombinant glycosyltransferase treated cells, this glycan signal was more abundant than or at least comparable to the cells' own glycan signals, indicating that insect-derived glycoconjugates are a very potent contaminant associated with recombinant glycan-modified enzymes produced in insect cells. Moreover, this glycan contamination persisted even after washing of the cells, indicating that the insect-type glycoconjugate corresponding to or associated with the glycosyltransferase enzymes has affinity towards cells or has tendency to resist washing from cells. To confirm the origin of the glycan signal, we analyzed glycan contents of commercial recombinant fucosyltransferase and sialyltransferase enzyme preparations and found that the m/z 1079 glycan signal was a major N-glycan signal associated with these enzymes. Corresponding N-glycan structures, e.g. Manα3(Manα6)Manβ4GlcNAc(Fucα3/6)GlcNAc(β-N-Asn), have been described previously from glycoproteins produced in S. frugiperda cells (Staudacher et al., 1992; Kretzchmar et al., 1994; Kubelka et al., 1994; Altmann et al., 1999). As described in the literature, these glycan structures, as well as other glycan structures potentially contaminating cells treated with recombinant or purified enzymes, especially insect-derived products, are potentially immunogenic in humans and/or otherwise harmful to the use of the modified cells. It is concluded that glycan-modifying enzymes must be carefully selected for modification of human cells, especially for clinical use, not to contain immunogenic glycan epitopes, non-human glycan structures, and/or other glycan structures potentially having unwanted biological effects.
Experimental Procedures
N-glycans were isolated from human embryonic stem cell (hESC) line (25 million cells) and fractionated into neutral and acidic N-glycan fractions as described above. The final purification prior to NMR analysis was performed by gel filtration high-performance liquid chromatography (HPLC) on a Superdex Peptide HR10/300 column in water or 50 mM ammonium bicarbonate for the neutral and acidic fractions, respectively. Fractions were collected and MALDI-TOF mass spectra were recorded from each fraction as described above (data not shown). All fractions containing N-glycans were pooled and prepared for the NMR experiment. The yields of neutral and acidic glycans were 4.0 and 6.6 nmol, respectively. Prior to NMR analysis the purified glycome fractions were repeatedly dissolved in 99.996% deuterium oxide and dried to omit H2O and to exchange sample protons. The 1H-NMR spectra at 800 MHz were recorded using a cryo-probe for enhanced sensitivity. Chemical shifts are expressed in parts per million (ppm) by reference to internal standard acetone (2.225 ppm).
Results and Discussion
Neutral N-glycan fraction. The identified signals in the neutral N-glycan spectrum are described in Table 11. The identified signals were consistent with N-glycan structures, more specifically high-mannose type N-glycan structures such as the structures A-D in
Neutral N-glycan core sequences. The identified N-glycan core structure common to all the identified glycan structures in the NMR spectrum includes the following glycan sequences: the internal core sequences Manβ4GlcNAc, Manα3Manβ4GlcNAc, Manα6Manβ4GlcNAc, and Manα3(Manα6)Manβ4GlcNAc, and the reducing terminal glycan core sequences GlcNAcβ4GlcNAc, Manα4GlcNAcβ4GlcNAc, Manα3Manβ4GlcNAcβ4GlcNAc, Manα6Manβ4GlcNAcβ4GlcNAc, and Manα3(Manα6)Manβ4GlcNAcβ4GlcNAc. The N-glycans in the sample were liberated by N-glycosidase F enzyme indicating that the reducing terminal core sequences were β-N-linked to asparagine residues in the original sample glycoproteins. Other glycan core structures could not be identified in the spectrum.
Neutral N-glycan antennae. In the identified structures A-D, the common reducing terminal N-glycan core sequence Manα3(Manα6)Manβ4GlcNAc4GlcNAc is further elongated by the following antennae: Manα2Manα2 or Manα2 to the α3-linked Man; and/or Manα2Manα3, Manα2Manα6, Manα3, and/or Manα6 to the α6-linked Man. Other glycan antennae could not be identified in the spectrum.
Acidic N-glycan fraction. The identified signals in the acidic N-glycan spectrum are described in Table 12. The identified signals were consistent with N-glycan structures, more specifically complex type N-glycan structures such as the reference structures A-E in
Acidic N-glycan core sequences. The identified N-glycan core structure common to all the identified glycan structures in the NMR spectrum includes the following glycan sequences: the reducing terminal glycan core sequences GlcNAcβ4(±Fucα6)GlcNAc, Manβ4GlcNAcβ4(±Fucα6)GlcNAc, Manα3Manβ4GlcNAcβ4(±Fucα6)GlcNAc, Manα6Manβ4GlcNAcβ4(±Fucα6)GlcNAc, and Manα3(Manα6)Manβ4GlcNAcβ4(±Fucα6)GlcNAc, wherein ±Fucα6 indicates the site of N-glycan core fucosylation. The N-glycans in the sample were liberated by N-glycosidase F enzyme indicating that the reducing terminal core sequences were β-N-linked to asparagine residues in the original sample glycoproteins. Other glycan core structures could not be identified in the spectrum.
Acidic N-glycan antennae. In the reference structures A-D, the reducing terminal N-glycan core sequences are further elongated by the following antennae, which were also identified in the recorded spectrum: Neu5Acα3Galβ4GlcNAcβ2, Neu5Acα6Galβ4GlcNAcβ2, Galβ4GlcNAcβ2, and/or Galα3Galβ4GlcNAcβ2 to either α3-linked Man or α6-linked Man. The identified antennae in the NMR spectrum include the internal glycan sequence GlcNAc β-linked or more specifically β2-linked to the N-glycan core structure. Other glycan antennae could not be identified in the spectrum, indicating that these antennae were the most abundant antenna structures in the sample.
Galα3Gal sequences. In the mass spectrum recorded from the pooled acidic N-glycan fraction, the signals corresponding to glycan structures containing the Hex6HexNAc4 composition feature together accounted for about 16% of the total signal intensity, which is consistent with the NMR result that these signals correspond to major glycans in the sample.
Comparison of NMR profiling and mass spectrometric profiling results. As described above, the 1H-NMR spectra were consistent with the mass spectra recorded from the hESC samples and support the quantitative and structural assignments made based on the mass spectrometric profiles in the preceding Examples.
Fu D., Chen L. and O'Neill R. A. (1994) Carbohydr. Res. 261, 173-186
Helin J., Maaheimo H., Seppo A., Keane A. and Renkonen O. (1995) Carbohydr. Res. 266, 191-209
Hård K., Mekking A., Kamerling J. P., Dacremont G. A. A. and Vliegenthart J. F. G. (1991) Glycoconjugate J. 8, 17-28
Hård K., Van Zadelhoff G., Moonen P., Kamerling J. P. and Vliegenthart J. F. G. (1992) Eur. J. Biochem. 209, 895-915
Experimental Procedures
hESC and differentiated cell samples. The human embryonic stem cell (hESC) and embryoid body (EB) samples were prepared from hESC line FES 29 (Skottman et al., 2005) essentially as described in the preceding Examples, however in the present Example the hESCs were propagated on murine fibroblast feeder cells (mEF) and the hESC samples contained some mEF cells.
Exoglycosidase digestions were performed essentially as described (Saarinen et al., 1999) and as described in the preceding Examples. The enzymes used were α-mannosidase and β-hexosaminidase from Jack beans (C. ensiformis, Sigma, USA), β-glucosaminidase and β1,4-galactosidase from S. pneumoniae (rec. in E. coli, Calbiochem, USA), α2,3-sialidase from S. pneumoniae (Glyko, UK), α1,3/4-fucosidase from Xanthomonas sp. (Calbiochem, USA), α1,2-fucosidase from X. manihotis (Glyko), β1,3-galactosidase (rec. in E. coli, Calbiochem), and α2,3/6/8/9-sialidase from A. ureafaciens (Glyko). The specific activities of the enzymes were controlled in parallel reactions with purified oligosaccharides or oligosaccharide mixtures, and analyzed similarly as the analytic reactions. The changes in the exoglycosidase digestion result Tables are relative changes in the recorded mass spectra and they do not reflect absolute changes in the glycan profiles resulting from glycosidase treatments.
Results and Discussion
hESC. Neutral and acidic N-glycan fractions were isolated from hESC grown on both murine and human fibroblast feeder cells as described in the preceding Examples. The results of parallel exoglycosidase digestions of the neutral (Tables 13 and 14) and acidic (Table 15) glycan fractions are discussed below. In the following chapters, the glycan signals are referred to by their proposed monosaccharide compositions according to the Tables of the present invention and the corresponding m/z values can be read from the Tables.
α-mannosidase sensitive structures. All the glycan signals that showed decrease upon α-mannosidase digestion of the neutral N-glycan fraction (Tables 13 and 14) are indicated to correspond to glycans that contain terminal α-mannose residues. The present results indicate that the majority of the neutral N-glycans of hESC contain terminal α-mannose residues. On the other hand, increased signals correspond to their reaction products. Structure groups that form series of α-mannosylated glycans in the neutral N-glycan fraction as well as individual α-mannosylated glycans are discussed below in detail.
The Hex1-9HexNAc1 glycan series was digested so that Hex3-9HexNAc1 were digested and transformed into Hex1HexNAc1 (data not shown), indicating that they had contained terminal α-mannose residues. Because they were transformed into Hex1HexNAc1, their experimental structures were (Manα)1-8Hex1HexNAc1.
The Hex1-12HexNAc2 glycan series was digested so that Hex3-12HexNAc2 were digested and transformed into Hex1-7HexNAc2 and especially into Hex1HexNAc2 that had not existed before the reaction and was the major reaction product. This indicates that 1) glycans Hex3-12HexNAc2 include glycans containing terminal α-mannose residues, 2) glycans Hex1-7HexNAc2 could be formed from larger α-mannosylated glycans, and 3) majority of the glycans Hex3-12HexNAc2 were transformed into newly formed Hex1HexNAc2 and therefore had the experimental structures (Manα)nHex1HexNAc2, wherein n≧1. The fact that the α-mannosidase reaction was only partially completed for many of the signals suggests that also other glycan components are included in the the Hex1-12HexNAc2 glycan series. In particular, the Hex10-12HexNAc2 components contain 1-3 hexose residues more than the largest typical mammalian high-mannose type N-glycan, suggesting that they contains glucosylated structures including (Glcα)1-3Hex8HexNAc2, preferentially α2- and/or α3-linked Glc and even more preferentially present in the glucosylated N-glycans Glcα3→Man9GlcNAc2, Glcα2Glcα3→Man9GlcNAc2, and/or Glcα2Glcα2Glcα3→Man9GlcNAc2. The corresponding glucosylated fragments were observed after the α-mannosidase digestion, preferentially corresponding to Glc1-3Man4GlcNAc2 (Hex5-7HexNAc2).
The Hex1-6HexNAc1dHex1 glycan series was digested so that Hex3-9HexNAc1dHex1 were digested and transformed into Hex1HexNAc1dHex1, indicating that they had contained terminal α-mannose residues and their experimental structures were (Manα)2-5Hex1HexNAc1dHex1. Hex1HexNAc1dHex1 appeared as a new signal indicating that glycans with structures (Manα)nHex1HexNAc1dHex1, wherein n≧1, had existed in the sample.
The Hex2-7HexNAc3 glycan series was digested so that Hex5-7HexNAc3 were digested and transformed into other glycans in the series, indicating that they had contained terminal α-mannose residues. Hex2HexNAc3 appeared as a new signal indicating that glycans with structures (Manα)nHex2HexNAc3, wherein n≧1, had existed in the sample.
The Hex2-7HexNAc3dHex1 glycan series was digested so that Hex5-7HexNAc3dHex1 were digested and transformed into other glycans in the series, indicating that they had contained terminal α-mannose residues. Hex2HexNAc3dHex1 was increased significantly indicating that glycans with structures (Manα)nHex2HexNAc3dHex1, wherein n≧1, had existed in the sample.
Hex3HexNAc3dHex2 appeared as a new signal indicating that glycans with structures (Manα)nHex3HexNAc3dHex2, wherein n≧1, had existed in the sample.
β-glucosaminidase sensitive structures. The Hex3HexNAc2-5 and Hex3HexNAc2-5dHex1 glycan series were digested so that Hex3-5HexNAc1dHex0-1 were digested and transformed into Hex3HexNAc2dHex0-1, indicating that they had contained terminal β-GlcNAc residues and their experimental structures were (GlcNAcβ→)1-3Hex3HexNAc2 and (GlcNAcβ→)1-3Hex3HexNAc2dHex1, respectively.
Hex4HexNAc4, Hex4HexNAc4dHex1, Hex4HexNAc4dHex2, and Hex5HexNAc5dHex1 were also digested indicating they contained structures including (GlcNAcβ→)Hex4HexNAc3, (GlcNAcβ→)Hex4HexNAc3dHex1, (GlcNAcβ→)Hex4HexNAc3dHex2, and (GlcNAcβ→)Hex5HexNAc4dHex1, respectively.
Hex4HexNAc5dHex1 and Hex4HexNAc5dHex2 were digested by β-glucosaminidase and indicated to contain two β-GlcNAc residues each. In contrast, Hex4HexNAc5 was not digested with β-glucosaminidase.
β-hexosaminidase sensitive structures. The Hex4HexNAc5 glycan signal was sensitive to β-hexosaminidase but not to β-glucosaminidase indicating that it corresponded to glycan structures containing terminal β-N-acetylhexosamine residues other than β-GlcNAc, preferentially β-GalNAc. Upon β-hexosaminidase digestion, the signal was transformed into Hex4HexNAc3 indicating that the enzyme liberated two HexNAc residues from the corresponding glycan structures.
β1,4-galactosidase sensitive structures. Glycan signals that were sensitive to β1,4-galactosidase comprised a major proportion of hESC glycans, indicating that β1,4-linked galactose is a common terminal epitope in hESC neutral N-glycans.
Hex5HexNAc4 and Hex5HexNAc4dHex1 were digested into Hex3HexNAc4 and Hex3HexNAc4dHex1 indicating they had the structures (Galβ4GlcNAcβ→)2Hex3HexNAc2 and (Galβ4GlcNAcβ→)2Hex3HexNAc2dHex1, respectively. In contrast, Hex5HexNAc4dHex2 was digested into Hex4HexNAc4dHex2 indicating that it had the structure (Galβ4GlcNAcβ→)Hex4HexNAc3dHex2, and Hex5HexNAc4dHex3 was not digested at all. Taken together, in hESC, hexose residues are protected by deoxyhexose residues from the action of β1,4-galactosidase in the N-glycan structures. Such dHex-protected structures containing β1,4-linked galactose include Galβ4(Fucα3)GlcNAc and Fucα2Galβ4GlcNAc.
Hex4HexNAc5 that also included a β-hexosaminidase sensitive component was digested by β1,4-galactosidase. Taken together, the results suggest that the Hex4HexNAc5 glycan signal includes glycan structures including Galβ4GlcNAc(GalNAcβHexNAcβ)Hex3HexNAc2.
β1,3-galactosidase sensitive structures. Because only few structures in hESC neutral N-glycan fraction were sensitive to the action of β1,3-galactosidase, the majority of terminal galactose residues appear to be β1,4-linked.
Glycosidase resistant structures. In the present experiments, Hex4HexNAc3, Hex4HexNAc3dHex2, and Hex5HexNAc5 were resistant to the tested exoglycosidases. The second monosaccharide composition contains more than one deoxyhexose residues suggesting that it is protected from glycosidase digestions by d&ex residues such as α2-, α3-, or α4-linked fucose residues, preferentially present in Fucα2Gal, Fucα3GlcNAc, and/or Fucα4GlcNAc epitopes.
The compiled neutral N-glycan fraction glycan structures based on the exoglycosidase digestions of hESC are presented in Table 16.
Acidic N-glycan fraction. The acidic N-glycan fraction of hESC grown on mEF cell layers were characterized by parallel α2,3-sialidase and A. ureafaciens sialidase treatments as well as sequential digestions with α1,3/4-fucosidase and α1,2-fucosidase. The results from these reactions as analyzed by MALDI-TOF mass spectrometry are described in Table 15. The results suggest that multiple N-glycan components in the hESC sample contain the specific glycan substrates for these enzymes, namely α2,3-linked and other sialic acid residues, and both α1,2- and α1,3/4-linked fucose residues. Some glycan signals showed the presence of many of these epitopes, such as the glycan signal at m/z 2222 (corresponding to NeuAc1Hex5HexNAc4dHex2) that was suggested to contain all these epitopes, preferentially in multiple glycan structures. The compiled acidic N-glycan fraction glycan structures based on the exoglycosidase digestions of hESC are presented in Table 34.
EB. Differentiation specific changes between embryoid bodies (EB; FES 29 st 2 in Table 13) and hESC (FES 29 st 1 in Table 13) were reflected in their neutral N-glycan fraction exoglycosidase digestion profiles, as described in Table 13. Differential exoglycosidase digestion results were observed in glycan signals including m/z 1688, 1704, 1793, 1866, 1955, 1971, 2012, 2028, 2142, 2158, and 2320, corresponding to different neutral N-glycan fraction glycan profiles.
mEF. By comparison of Table 35 and Table 13, murine feeder cell (mEF) specific neutral N-glycan fraction glycan components were identified and they are listed in Table 36. These glycan components are characterized by additional hexose residues compared to hESC or hEF specific structures according to the present invention. The exoglycosidase experiments also suggest that β1,4-linked galactose epitopes are protected from β1,4-galactosidase digestion by any additional hexose residues in the monosaccharide compositions. Taken together with the NMR analysis results of the present invention, the additional hexose residues are suggested to be α-linked galactose residues, more specifically including Galα3Gal epitopes in the N-glycan antennae, as described in Table 36.
The changes in the exoglycosidase digestion result Tables are relative changes in the recorded mass spectra and they do not reflect absolute changes in the glycan profiles resulting from glycosidase treatments. The experimental procedures are described in the preceding Example.
Results
Undifferentiated BM MSC
Neutral and acidic N-glycan fractions were isolated from BM MSC as described. The results of parallel exoglycosidase digestions of the neutral (Table 17) and acidic (data not shown) glycan fractions are discussed below. In the following chapters, the glycan signals are referred to by their proposed monosaccharide compositions according to the Tables of the present invention and the corresponding m/z values can be read from the Tables.
α-mannosidase sensitive structures. All the glycan signals that showed decrease upon α-mannosidase digestion of the neutral N-glycan fraction (Table 17) are indicated to correspond to glycans that contain terminal α-mannose residues. The present results indicate that the majority of the neutral N-glycans of BM MSC contain terminal α-mannose residues. On the other hand, increased signals correspond to their reaction products. Structure groups that form series of α-mannosylated glycans in the neutral N-glycan fraction as well as individual α-mannosylated glycans are discussed below in detail.
The Hex1-9HexNAc1 glycan series was digested so that Hex3-9HexNAc1 were digested and transformed into Hex1HexNAc1 (data not shown), indicating that they had contained terminal α-mannose residues. Because they were transformed into Hex1HexNAc1, their experimental structures were (Manα)1-8Hex1HexNAc1.
The Hex1-10HexNAc2 glycan series was digested so that Hex4-10HexNAc2 were digested and transformed into Hex1-4HexNAc2 and especially into Hex1HexNAc2 that had not existed before the reaction and was the major reaction product. This indicates that 1) glycans Hex1-10HexNAc2 include glycans containing terminal α-mannose residues, 2) glycans Hex1-4HexNAc2 could be formed from larger α-mannosylated glycans, and 3) majority of the glycans Hex4-10HexNAc2 were transformed into newly formed Hex1HexNAc2 and therefore had the experimental structures (Manα)nHex1HexNAc2, wherein n≧1. The fact that the α-mannosidase reaction was only partially completed for many of the signals suggests that also other glycan components are included in the the Hex1-10HexNAc2 glycan series. In particular, the Hex10HexNAc2 component contains one hexose residue more than the largest typical mammalian high-mannose type N-glycan, suggesting that it contains glucosylated structures including (Glcα→)Hex8HexNAc2, preferentially α3-linked Glc and even more preferentially present in the glucosylated N-glycan (Glcα3)Man9GlcNAc2.
The Hex1-6HexNAc1dHex1 glycan series was digested so that Hex3-9HexNAc1dHex1 were digested and transformed into Hex1HexNAc1dHex1, indicating that they had contained terminal α-mannose residues and their experimental structures were (Manα)2-5Hex1HexNAc1dHex1. Hex1HexNAc1dHex1 appeared as a new signal indicating that glycans with structures (Manα)nHex1HexNAc1dHex1, wherein n≧1, had existed in the sample.
The Hex2-7HexNAc3 glycan series was digested so that Hex6-7HexNAc3 were digested and transformed into other glycans in the series, indicating that they had contained terminal α-mannose residues. Hex2HexNAc3 appeared as a new signal indicating that glycans with structures (Manα)nHex2HexNAc3, wherein n≧1, had existed in the sample.
The Hex2-7HexNAc3dHex1 glycan series was digested so that Hex6-7HexNAc3dHex1 were digested and transformed into other glycans in the series, indicating that they had contained terminal α-mannose residues. Hex2HexNAc3dHex1 appeared as a new signal indicating that glycans with structures (Manα)nHex2HexNAc3dHex1, wherein n≧1, had existed in the sample. Hex3HexNAc3dHex2 and Hex3HexNAc4 appeared as new signals indicating that glycans with structures (Manα)nHex3HexNAc3dHex2 and (Manα)nHex3HexNAc4, respectively, wherein n≧1, had existed in the sample.
β-glucosaminidase sensitive structures. The Hex3HexNAc2-5dHex1 glycan series was digested so that Hex3-9HexNAc1dHex1 were digested and transformed into Hex1HexNAc1dHex1, indicating that they had contained terminal α-mannose residues and their experimental structures were (Manα)2-5Hex1HexNAc1dHex1. Hex1HexNAc1dHex1 appeared as a new signal indicating that glycans with structures (Manα)nHex1HexNAc1dHex1, wherein n≧1, had existed in the sample. However, Hex3HexNAc6dHex1 was not digested indicating that it contained other terminal HexNAc residues than β-linked GlcNAc residues.
Hex2HexNAc3 and Hex2HexNAc3dHex1 were digested into Hex2HexNAc2 and Hex2HexNAc2dHex1 indicating they had the structures (GlcNAcβ→)Hex2HexNAc2 and (GlcNAcβ→)Hex2HexNAc2dHex1, respectively.
Hex4HexNAc4dHex1, Hex4HexNAc4dHex2, Hex4HexNAc5dHex2, and Hex5HexNAc5dHex1 were also digested indicating they contained structures including (GlcNAcβ→)Hex4HexNAc3dHex1, (GlcNAc⊖→)Hex4HexNAc3dHex2, (GlcNAcβ→)Hex4HexNAc4dHex2, and (GlcNAcβ→)Hex5HexNAc4dHex1, respectively.
β1,4-galactosidase sensitive structures. Glycan signals that were sensitive to β1,4-galactosidase comprised a major proportion of BM MSC glycans, indicating that β1,4-linked galactose is a common terminal epitope in BM MSC neutral N-glycans.
Hex5HexNAc4 and Hex5HexNAc4dHex1 were digested into Hex3HexNAc4 and Hex3HexNAc4dHex1 indicating they had the structures (Galβ4GlcNAcβ→)2Hex3HexNAc2 and (Galβ4GlcNAcβ→)2Hex3HexNAc2dHex1, respectively. In contrast, Hex5HexNAc4dHex2 was digested into Hex4HexNAc4dHex2 indicating that it had the structure (Galβ4GlcNAcβ→)Hex4HexNAc3dHex2, respectively, and Hex5HexNAc4dHex3 was not digested at all. Taken together, in BM MSC, n-1 hexose residues are protected by deoxyhexose residues from the action of β1,4-galactosidase in the N-glycan structures Hex5HexNAc4dHexn, wherein 0≦n≦3. Such dHex-protected structures containing β1,4-linked galactose include Galβ4(Fucα3)GlcNAc and Fucα2Galβ4GlcNAc.
Similarly, Hex6HexNAc5, Rex5HexNAc5dHex1, Hex6HexNAc5, and Hex5HexNAc5dHex1 were digested into Hex3HexNAc5, Hex3HexNAc5dHex1, and Hex3HexNAc6dHex1 indicating they had the structures (Galβ4GlcNAcβ→)3Hex3HexNAc2, (Galβ4GlcNAcβ→)2Hex3HexNAc3dHex1, and (Galβ4GlcNAc⊖→)3Hex3HexNAc3dHex1, respectively. In contrast, Hex4HexNAc5dHex2, Hex5HexNAc5dHex3, Hex6HexNAc5dHex2, and Hex6HexNAc5dHex3 were not digested, indicating that hexose residues in these structures were protected by deoxyhexose residues. Such dHex-protected structures containing β1,4-linked galactose include Galβ4(Fucα3)GlcNAc and Fucα2Galβ4GlcNAc. However, Hex4HexNAc5dHex3 was digested indicating that it contained one or more terminal β1,4-linked galactose residues.
Hex7HexNAc3, Hex6HexNAc3dHex1, Hex6HexNAc3, and Hex5HexNAc3dHex1 were digested into products including Hex5HexNAc3 and Hex4HexNAc3dHex1, indicating they had the structures (Galβ4GlcNAcβ→)Hex5-6HexNAc2 and (Galβ4GlcNAcβ→)Hex4-5HexNAc3dHex1, respectively. The relative amounts of Hex3HexNAc3, and Hex3HexNAc3dHex1 were increased indicating that they were products of (Galβ4GlcNAcβ→)Hex3HexNAc2 and (Galβ4GlcNAcβ→)Hex3HexNAc2dHex1, respectively.
β1,3-galactosidase sensitive structures. Because only few structures in BM MSC neutral N-glycan fraction are sensitive to the action of β1,3-galactosidase, the majority of terminal galactose residues appear to be β1,4-linked. The glycan signals corresponding to β1,3-galactosidase sensitive glycans include Hex5HexNAc5dHex1 and Hex4HexNAc5dHex3.
Glycosidase resistant structures. In the present experiments, Hex2HexNAc3dHex2, Hex4HexNAc3dHex2, and Hex11HexNAc2 were resistant to the tested exoglycosidases. The first two proposed monosaccharide compositions contain more than one deoxyhexose residues suggesting that they are protected from glycosidase digestions by the second dHex residues such as α2-, α3-, or α4-linked fucose residues, preferentially present in Fucα2Gal, Fucα3GlcNAc, and/or Fucα4GlcNAc epitopes. The last proposed monosaccharide composition contains two hexose residues more than the largest typical mammalian high-mannose type N-glycan, suggesting that it contains glucosylated structures including (Glcα→)2Hex9HexNAc2, preferentially α2- and/or α3-linked Glc and even more preferentially present in the diglucosylated N-glycan (GlcαGlcα→)Man9GlcNAc2.
The compiled neutral N-glycan fraction glycan structures based on the exoglycosidase digestions of BM MSC are presented in Table 18.
Osteoblast-Differentiated BM MSC
The analysis of osteoblast differentiated BM MSC are presented in Table 19, allowing comparison of differentiation specific changes in CB MSC. The exoglycosidase profiles produced for BM MSC and osteoblast differentiated BM MSC are characteristic for the two cell types. For example, signals at m/z 1339, 1784, and 2466 are digested differentially in the two experiments. Specifically, the presence of β1,3-galactosidase sensitive neutral N-glycan signals in osteoblast differentiated BM MSC indicate that the differentiated cells contain more β1,3-linked galactose residues than the undifferentiated cells.
The sialidase analysis performed for the acidic N-glycan fraction of BM MSC supported the proposed monosaccharide compositions based on sialylated (NeuAc or NeuGc containing) N-glycans in the acidic N-glycan fraction.
Analysis of CB MSC Neutral Glycan Fraction by Exoglycosidases
The results of the analysis by β1,4-galactosidase and β-glucosaminidase are presented in Table 20. The results suggest that also in CB MSC neutral N-glycans containing non-reducing terminal β1,4-linked galactose residues are abundant, and they suggest the presence of characteristic non-reducing terminal epitopes for most of the observed glycan signals. The analysis of adipocyte differentiated CB MSC are presented in Table 21, allowing comparison of differentiation specific changes in CB MSC, similarly as described above for BM MSC. The sialidase analysis performed for the acidic N-glycan fraction of CB MSC supported the proposed monosaccharide compositions based on sialylated (NeuAc or NeuGc containing) N-glycans in the acidic N-glycan fraction.
Results and Discussion
Obtaining of the gene expression data from the hESC lines FES 21, 22, 29, and 30 has been described (Skottman et al., 2005) and the present data was produced essentially similarity. The results of the gene expression profiling analysis with regard to a selection of potentially glycan-processing and accessory enzymes are presented in Table 26, where gene expression is both qualitatively determined as being present (P) or absent (A) and quantitatively measured in comparison to embryoid bodies (EB) derived from the same cell lines.
Fucosyltransferase expression levels. Three fucosyltransferase transcripts were detected in hESC: FUT1 (α1,2-fucosyltransferase; increased in all FES cell lines), FUT4 (α1,3-fucosyltransferase IV; increased in all FES cell lines), and FUT8 (N-glycan core α1,6-fucosyltransferase).
Hexosaminyltransferase expression levels. The following transcripts in the selection of Table 26 were detected in hESC: MGAT3, MGAT2 (increased in three FES cell lines), MGAT1, GNT4b, β3GlcNAc-T5, β3GlcNAc-T7, β3GlcNAc-T4 (present in two FES cell lines), β6GlcNAcT (increased in one FES cell line), iβ3GlcNAcT, globosideT, and α4GlcNAcT (present in two FES cell lines).
Other gene expression levels. The following transcripts in the selection of Table 26 were detected in hESC: AER1 (increased in all FES cell lines), AGO61, β3GALT3, MAN1C1, and LGALS3.
Isolation of Subset Expressing Glycan Structures of Formula (I) on Human Embryonic Stem Cells
Cell Culture and Passaging
FES hESC lines with normal karyotypes are obtained and grown as described in Mikkola et al. (2006; Distinct differentiation characteristics of individual human embryonic stem cell lines, BMC Dev Biol. 2006; 6: 40).
Human ESCs are maintained on mitotically inactivated primary mouse embryonic fibroblasts (MEF) feeder layers for routine maintenance. Cells are grown in tissue culture treated dishes (Corning Incorporated). Cells are passaged every 6 days using either a pretreatment with 10 mg/ml collagenase 5 minutes or manual dissection with a fire pulled Pasteur pipette.
Immunocytochemistry is performed on routinely maintained adherent hESC colonies, and flow cytometry is performed using routinely maintained hESC colonies that are stained for antibodies, lectins or glycosidases of the present invention.
Enrichment of Glycan Structure of Formula (1) Expressing Stem Cells
The FACS analysis is performed essentially as described in Venable et al. (2005) but living cells are used instead and FACSAria™ cell sorter (BD).
Human ESCs are harvested into single cell suspensions using collagenase and cell dissociation solution (Sigma). Then, cells are placed in sterile tube in aliquots 106 cells each and stained with one of the GF antibody in 1:100 solution. Cells are washed 3 times with PBS and then stained with secondary antibodies (antigoat mouse IgG or IgM FITC conjugated). Unstained FES used as control. The FITC positive cells are collected into cell culture media (in +4° C.) (according to BD instructions).
Then, cells are placed on MEF or HHF feeder layers and monitored for clonal or cell lineage. To check the undifferentiation stage, the gene expression of sorted cells are analyzed with real-time PCR.
Alternatively, FACS enriched cells are let to spontaneously differentiate on gelatin. Immunohistochemistry is performed with various tissue specific antibodies as described in Mikkola et al. (2006) or analyzed with PCR.
Experimental Procedures
Human embryonic stem cells (hESC), human bone marrow derived (BM) and cord blood derived (CB) mesenchymal stem cells (MSC), and human cord blood mononuclear cells (CB MNC) were produced as described in the preceding Examples. Glycosphingolipid glycans were isolated from glycolipid fractions isolated from these cells by endoglycoceramidase digestion; O-glycans were isolated by non-reductive alkaline β-elimination with concentrated ammonia in saturated ammonium carbonate; all glycan fractions were isolated with miniaturized solid-phase extraction steps; glycans were analyzed by MALDI-TOF mass spectrometry; terminal glycan epitopes were analyzed by specific exoglycosidase enzymatic digestions combined with analysis by mass spectrometry; the analysis steps were performed as described in the present invention.
Results and Discussion
Mass spectrometric profiles providing relative quantitative information about glycan signals and specific exoglycosidase digestions, together with antibody, lectin, and biochemical characterization of the cell types as described above, was used to further characterize different stem cell types and differentiated cell types. Tables 37 and 38 describe examples of combinatiorial characterization of glycan types associated with each cell type. Analysis of glycolipid and/or O-glycan structures and classes in addition to N-glycan structures and classes yielded a more complete characterization of the cell types, revealed further differences between cell types, and provided more glycan epitopes and classes associated with each cell type. In conclusion, combination of analysis of different glycan types and epitopes was useful in analysis and identification of cell types.
Glycans were isolated from hESC glycosphingolipid fraction by endoglycoceramidase digestion, purified, and permethylated according to the methods described in the present invention. Mass spectrometric fragmentation of permethylated glycans was performed using Bruker Ultraflex TOF/TOF instrument essentially after manufacturer's instructions. In the following, all fragments are sodiated unless otherwise indicated. Naming of fragments is according to Domon and Costello, 1998 (Glycoconj. J. 5, 397-409).
Glycosphingolipid Glycans of hESC
Based on the resulting fragment ions, 1130.6 (mother), 912.5 (Y4), 708.1 (C3), 667.0 (Y3), 485.8 (B2), 462.8 (Y2), 258.7 (Y1), a major glycan included in Hex4HexNAc1 composition had the following linear sequence: Hex-HexNAc-Hex-Hex-Hex. Fragment C3 suggests that corresponding glycans include structures with 3-substitution of the second Hex from the reducing end, indicative of isoglobo-type structure.
Based on the resulting fragment ions, 926.5 (mother), 718 (unknown), 690.6 (B3), 667.5 (Y3), 486.2 (B2), 463.2 (Y2), 282.1 (B1), 259.1 (Y1), 227.0 (B2/Y3 or Y2/B3), a major glycan included in Hex3HexNAc1 composition had the following linear sequence: HexNAc-Hex-Hex-Hex, indicative of globo-type structure.
Based on the resulting fragment ions, 1100.6 (mother), 912.6 (Y4), 708.2 (Y3), 690.3 (Z3), 660.2 (B3),462 (Y2), 432.9 (C2), 415 (B2), a major glycan included in Hex3HexNAc1dHex1 composition had the following linear sequence: dHex-Hex-HexNAc-Hex-Hex. Fragments Z3 and C2 suggest that corresponding glycans include structures with 3-substitution of HexNAc, indicative of lacto-type structure.
Based on the resulting fragment ions, 1304.6 (mother), 666.9 (Y3), 660.2 (B2) 432.6 (C2), a major glycan included in Hex4HexNAc1dHex1 composition had the following linear sequence: dHex-Hex-HexNAc-Hex-Hex-Hex. Fragment C2 suggests that corresponding glycans include structures with 3-substitution of HexNAc.
Similarly, fragmentation analysis of sialylated glycosphingolipid glycans indicated ganglio-type structures including branched sequence Hex-HexNAc-(NeuAc-)Hex-Hex, wherein the branch is indicated by brackets.
1)Under HDO.
1)n.a., not assigned.
2)—, not present.
3)Overlap, overlapping signals at 4.139-4.088 ppm.
1)Code: +++ new signal appeared, ++ highly increased relative signal intensity, ++ increased relative signal intensity, − decreased relative signal intensity, −− greatly decreased relative signal intensity, −−− signal disappeared, blank: no change.
§“→” indicates linkage to a monosaccharide in the rest of the structure; “[ ]” indicates branch in the structure.
#Preferred structure group based on monosaccharide compositions according to the present invention. HI, high-mannose; LO, low-mannose; S, soluble mannosylated; HF, fucosylated high-mannose; G, glucosylated high-mannose; HY, hybrid-type or monoantennary; CO, complex-type; F, fucosylation; FC, complex fucosylation; N═H, terminal HexNAc (HexNAc = Hex); N > H, terminal HexNAc (HexNAc > Hex).
§“→” indicates linkage to a monosaccharide in the rest of the structure; “[ ]” indicates branch in the structure.
#Preferred structure group based on monosaccharide compositions according to the present invention. HI, high-mannose; LO, low-mannose; S, soluble mannosylated; HF, fucosylated high-mannose; G, glucosylated high-mannose; HY, hybrid-type or monoantennary; CO, complex-type; F, fucosylation; FC, complex fucosylation; N═H, terminal HexNAc (HexNAc = Hex); N > H, terminal HexNAc (HexNAc > Hex).
1)Grading of staining/labelling: +++ very intense, ++ intense, + low, +/− barely detectable, − not labelled.
1)Data reference: Skottman, H., et al. (2005).
2)EB, embryoid bodies used as reference in calculation of fold changes.
3)Det. (detection) codes: P, present; A, absent; M, medium.
4)Ch. (fold change) codes: I, increased; D, decreased; NC, no change.
1)N, HexNAc; H, Hex.
1)N, HexNAc; H, Hex.
#Preferred structure group based on monosaccharide compositions according to the present invention. HY, hybrid-type or monoantennary; CO, complex-type; F, fucosylation; FC, complex fucosylation; N═H, terminal HexNAc (HexNAc = Hex); N > H, terminal HexNAc (HexNAc > Hex); SP, sulphate and/or phosphate ester; “( )” indicates that the glycan signal includes also other structure types.
§“→” indicates linkage to a monosaccharide in the rest of the structure; “[ ]” indicates branch in the structure.
#Preferred structure group based on monosaccharide compositions according to the present invention. HI, high-mannose; LO, low-mannose; S, soluble mannosylated; HF, fucosylated high-mannose; G, glucosylated high-mannose; HY, hybrid-type or monoantennary; CO, complex-type; F, fucosylation; FC, complex fucosylation; N═H, terminal HexNAc (HexNAc = Hex); N > H, terminal HexNAc (HexNAc > Hex).
Altmann, F., et al (1999) Glycoconj. J. 16:109-23
Harvey, D. J., et al. (1993) Rapid Commun. Mass Spectrom. 7(7):614-9
Hirabayashi, J., et al. (2002) Biochim. Biophys. Acta. 1572:232-54.
Jaatinen, T., et al. (2006) Stem cells. 24:631-41.
Karlsson, H., et al. (2000) Glycobiology 10(12):1291-309
Kretzchmar, E., et al. (1994) Biol. Chem. Hoppe Seyler 375(5):23-7
Kubelka, V., et al. (1994) Arch. Biochem. Biophys. 308(1):148-57
Leskelä, H., et al. (2003) Biochem. Biophys. Res. Commun. 311:1008-13
Miller-Podraza, H., et al. (2000) Glycobiology. 10:975-982
Moore (1999) Trends Cell Biol. 9:441-6
Naven, T. J. & Harvey, D. J. (1996) Rapid Commun. Mass Spectrom. 10(11):1361-6
Nyman, T. A., et al. (1998) Eur. J. Biochem. 253(2):485-93
Papac, D., et al. (1996) Anal. Chem. 68(18):3215-23
Saarinen, J., et al. (1999) Eur. J. Biochem. 259(3):829-40
Skottman, H. et al. (2005) Stem cells
Staudacher, E., et al. (1992) Eur. J. Biochem. 207(3):987-93
Thomson, J. A., et al. (1998) Science 282:1145-7
Venable et al. (2005) BMC Developmental biology.
Number | Date | Country | Kind |
---|---|---|---|
20051130 | Nov 2005 | FI | national |
20060452 | May 2006 | FI | national |
20060630 | Jun 2006 | FI | national |
2006/050336 | Jul 2006 | FI | national |
This application is claims the benefit of FI 20051130, filed 8 Nov. 2005, FI 20060452, filed 9 May 2006, FI 20060630, filed 29 Jun. 2006 and PCT/FI2006/050336, filed 11 Jul. 2006. The entire content of each application is expressly incorporated herein by reference thereto.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FI2006/050483 | 11/8/2006 | WO | 00 | 1/5/2009 |