Information
-
Patent Application
-
20030224411
-
Publication Number
20030224411
-
Date Filed
March 13, 200321 years ago
-
Date Published
December 04, 200321 years ago
-
CPC
-
US Classifications
-
International Classifications
- C12Q001/68
- G01N033/53
- G01N033/567
- C12N005/08
Abstract
Genes that are up- or down-regulated during differentiation provide important leverage by which to characterize and manipulate early-stage pluripotent stem cells. Over 35,000 unique transcripts have been amplified and sequenced from undifferentiated human embryonic stem cells, and three types of differentiated progeny. Statistical analysis of the assembled transcripts identified genes that alter expression levels as differentiation proceeds. The expression profile provides a marker system that has been used to identify particular culture components for maintaining the undifferentiated phenotype. The gene products can also be used to promote differentiation; to assess other relatively undifferentiated cells (such as cancer cells); to control gene expression; or to separate cells having desirable characteristics. Manipulation of particular genes can be used to forestall or focus the differentiation process, en route to producing a specialized homogenous cell population suitable for human therapy.
Description
TECHNICAL FIELD
[0001] This invention relates generally to the field of cell biology of stem cells. More specifically, it relates to phenotypic markers that can be used to characterize, qualify, and control differentiation of pluripotent cells, and to evaluate clinical conditions associated with marker expression.
BACKGROUND
[0002] A promising development in the field of regenerative medicine has been the isolation and propagation of human stem cells from the early embryo. These cells have two very special properties: First, unlike other normal mammalian cell types, they can be propagated in culture almost indefinitely, providing a virtually unlimited supply. Second, they can be used to generate a variety of tissue types of interest as a source of replacement cells and tissues for use in therapy.
[0003] Thomson et al. (Science 282:114, 1998; U.S. Pat. No. 6,200,806) were the first to successfully isolate and propagate embryonic stem cells from human blastocysts. Gearhart and coworkers derived human embryonic germ cell lines from fetal gonadal tissue (Shamblott et al., Proc. Natl. Acad. Sci. USA 95:13726, 1998;U.S. Pat. No. 6,090,622).
[0004] International Patent Publication WO 99/20741 (Geron Corp.) describes methods and materials for the growth of primate-derived primordial stem cells. International Patent Publication WO 01/51616 (Geron Corp.) provides techniques for growth and differentiation of human pluripotent stem cells. An article by Xu et al. (Nature Biotechnology 19:971, 2001) describes feeder-free growth of undifferentiated human embryonic stem cells. Lebkowski et al. (Cancer J. 7 Suppl. 2:S83, 2001) discuss the culture, differentiation, and genetic modification of human embryonic stem cell for regenerative medicine applications. These publications report exemplary culture methods for propagating human embryonic stem cells in an undifferentiated state, and their use in preparing cells for human therapy.
[0005] Markers for identifying undifferentiated pluripotent stem cells include SSEA-4, Tra-1-60, and Tra-1-81 (Thomson et al. and Gearhart et al., supra). They also express human telomerase reverse transcriptase, and the POU transcription factor Oct 3/4 (WO 01/51616; Amit et al., Dev. Biol. 227:271, 2000; Xu et al., supra).
[0006] Loring et al. (Restor. Neurol. Neurosci. 18:81, 2001) review gene expression profiles of embryonic stem cells and ES-derived neurons. Pesce et al. (Bioessays 20:722, 1998) comment on the potential role of transcription factor Oct-4 in the totipotent germ-line cycle of mice. Gajovic et al. (Exp. Cell Res. 242:138, 1998) report that genes expressed after retinoic acid-mediated differentiation of embryoid bodies are likely to be expressed during embryo development. Zur Nieden et al. (Toxicol. in Vitro 15:455, 2001) propose certain molecular markers for embryonic stem cells. Henderson et al. (Stem Cells 20:329, 2002) report that pre-implantation human embryos and ES cells have comparable expression of SSEAs. Tanaka et al. (Genome Res. 12:1921, 2002) profile gene expression in mouse ES cells to identify candidate genes associated with pluripotency and lineage specificity. Draper et al. (J. Anat. 299:249, 2002) review change of surface antigens of human embryonic stem cells upon differentiation in culture.
[0007] Kelly et al. (Mol Reprod. Dev. 56:113, 2000) report DNA microarray analyses of genes regulated during the differentiation of embryonic stem cells. Woltjen et al. (Nucl. Acids Res. 28:E41, 2000) report retro-recombination screening of a mouse embryonic stem cell genomic library. Monk et al. (Oncogene 20:8085, 2001) list human embryonic genes re-expressed in cancer cells. Tanaka et al. (Genome Res. 12:1921, 2002) discuss gene expression profiling of embryo-derived stem cells, and candidate genes putatively associated with pluripotency and lineage specificity. Monk et al. report developmental genes identified by differential display (Reprod. Fertil. Dev. 13:51, 2001). Natale et al. (Reprod. 122:687, 2001) characterize bovine blastocyst gene expression patterns by differential display RT-PCR.
[0008] Fan et al. (Dev. Biol. 210:481,1999) propose that forced expression of the homeobox-containing gene Pem blocks differentiation of embryonic stem cells. Abdel-Rahman et al. (Hum. Reprod. 10:2787, 1995) report the effect of expressing transcription regulating genes in human preimplantation embryos. Jackson et al. (J. Biol. Chem. 277:38683, 2002) describe the cloning and characterization of Ehox, a homeobox gene that reportedly plays a role in ES cell differentiation.
[0009] The following disclosure provides new markers and marker combinations that are effective means to identify, characterize, qualify, and control differentiation of pluripotent cells.
SUMMARY OF THE INVENTION
[0010] This invention identifies a number of genes that are up- or down-regulated during the course of differentiation of early-stage pluripotent stem cells obtained from primates, exemplified by human embryonic stem cells. As a consequence, the genes are differentially expressed in undifferentiated versus differentiated cells. This property confers special benefit on these genes for identification, characterization, culturing, differentiation, and manipulation of stem cells and their progeny, and other cells that express the same markers.
[0011] One aspect of this invention is a system for assessing a culture of undifferentiated primate pluripotent stem (pPS) cells or their progeny, in which expression of one or more of the identified markers listed in the disclosure is detected or measured. The level of expression can be measured in isolation or compared with any suitable standard, such as undifferentiated pPS cells maintained under specified conditions, progeny at a certain stage of differentiation, or stable end-stage differentiated cells, such as may be obtained from the ATCC. Depending on whether the marker(s) are up- or down-regulated during differentiation, presence of the markers is correlated with the presence or proportion of undifferentiated or differentiated cells in the population.
[0012] An exemplary (non-limiting) combination suitable for qualifying cultures of undifferentiated pPS cells is a marker selected from the list of Cripto, gastrin-releasing peptide (GRP) receptor, and podocalyxin-like protein, in combination with either hTERT and/or Oct 3/4 (POU domain, class 5 transcription factor), or a second marker from the list. Additional markers can also be measured as desired. Markers can be detected at the mRNA level by PCR amplification, at the protein or enzyme product level by antibody assay, or by any suitable technique.
[0013] The marker system of this invention can be used for quantifying the proportion of undifferentiated pPS cells or differentiated cells in the culture; for assessing the ability of a culture system or component thereof (such as a soluble factor, culture medium, or feeder cell) to maintain pPS cells in an undifferentiated state; for assessing the ability of a culture system or component thereof to cause differentiation of pPS cells into a culture of lineage-restricted precursor cells or terminally differentiated cells; or for any other worthwhile purpose. This invention includes kits and the use of specific reagents in order to measure the expression of the markers whenever appropriate.
[0014] This invention also provides a system assessing the growth characteristics of a cell population by detecting or measuring expression of one or more of the differentially expressed marker genes identified in this disclosure. This can be applied not only to various types of pPS cells and progenitor cells in various stages of differentiation, but also to clinical samples from a disease condition associated with abnormal cell growth. Renewed expression of markers of a relatively undifferentiated phenotype may be diagnostic of disease conditions such as cancer, and can serve as a means by which to target therapeutic agents to the disease site.
[0015] The marker system can also be used to regulate gene expression. Transcriptional control elements for the markers will cause an operatively linked encoding region to be expressed preferentially in undifferentiated or differentiated cells. For example, the encoding sequence can be a reporter gene (such as a gene that causes the cells to emit fluorescence), a positive selection marker (such as a drug resistance gene), or a negative selection marker. Vector constructs comprising recombinant elements linked in this fashion can be used to positively select or deplete undifferentiated, differentiated, or cancerous cells from a mixed population or in vivo, depending on the nature of the effector gene and whether transcription is up- or down-regulated during differentiation. They can also be used to monitor culture conditions of pPS cells, differentiation conditions, or for drug screening.
[0016] The marker system of this invention can also be used to sort differentiated cells from less differentiated cells. The marker can be used directly for cell separation by adsorption using an antibody or lectin, or by fluorescence activated cell sorting. Alternatively, these separation techniques can be effected using a transcription promoter from the marker gene in a promoter-reporter construct.
[0017] The marker system of this invention can be used to map differentiation pathways or influence differentiation. Markers suited for this purpose may act as transcription regulators, or encode products that enhance cell interaction in some fashion. pPS cells or their differentiated progeny are genetically altered to increase expression of one or more of the identified genes using a transgene, or to decrease expression, for example, using an antisense or siRNA construct. Alternatively, gene products involved in cell interaction or signaling can be added directly to the culture medium. The effect of this can be to help maintain the transfected cell in the undifferentiated state, promote differentiation in general, or direct differentiation down a particular pathway.
[0018] Another aspect of the invention are methods for identifying these and other genes that are up- or down-regulated upon differentiation of any cell type. The methods involve comparing expression libraries obtained from the cells before and after differentiation, by sequencing transcripts in each of the libraries, and identifying genes that have statistically significant differences in the relative number of transcripts (as a percentage of transcripts in each library) at a confidence level of 67%, 95%, or 98%. The method can be enhanced by creating assemblies in which different sequences are counted for the same transcript if they are known to correspond to a single transcript according to previously compiled data.
[0019] Amongst the differentially expressed markers identified in this disclosure are 39 nucleotide sequences which are not present in their entirety in the UniGene database. These are listed in this disclosure as SEQ. ID NOs:101 to 139. This invention includes novel nucleic acids consisting of or containing any of these sequences or the complementary sequences, and novel fragments thereof. This invention also includes novel polypeptides encoded in these sequences (made either by expressing the nucleic acid or by peptide synthesis), antibodies specific for the polypeptides (made by conventional techniques or through a commercial service), and use of these nucleic acids, peptides, and antibodies for any industrial application.
[0020] Also embodied in this invention are culture conditions and other cell manipulations identified using the marker system of this invention that are suitable for maintaining or proliferating pPS cells without allowing differentiation, or causing them to differentiate in a certain fashion. Culture conditions tested and validated according to this invention are illustrated in the example section.
[0021] Other embodiments of the invention will be apparent from the description that follows.
DRAWINGS
[0022]
FIG. 1 shows the profile of genes preferentially expressed in undifferentiated pluripotent stem cells, upon preliminary differentiation of the cells by culturing in retinoic acid or DMSO. Level of gene expression at the mRNA level was measured by real-time PCR assay. Any of the genes showing substantial down-regulation upon differentiation can be used to characterize the undifferentiated cell population, and culture methods suitable for maintaining them in an undifferentiated state.
[0023]
FIG. 2 shows the level of expression of five genes in hES cells, compared with fully differentiated cells. This five-marker panel provides robust qualification of the undifferentiated phenotype.
[0024]
FIG. 3 show results of an experiment in which hES cells of the H1 line were maintained for multiple passages in different media. Medium conditioned with feeder cells provides factors effective to allow hES cells to proliferate in culture without differentiating. However, culturing in unconditioned medium leads to decreased percentage of cells expressing CD9, and the classic hES cell marker SSEA-4.
[0025]
FIG. 4 illustrates the sensitivity of hTERT, Oct 3/4, Cripto, GRP receptor, and podocalyxin-like protein (measured by real-time PCR) as a means of determining the degree of differentiation of the cells. After multiple passages in unconditioned medium, all five markers show expression that has been downregulated by 10 to 104-fold.
[0026]
FIG. 5 shows results of an experiment in which the hES cell line H1 was grown on different feeder cell lines: mEF=mouse embryonic fibroblasts; hMSC=human mesenchymal stem cells; UtSMC =uterine smooth muscle cells; WI-38=human lung fibroblasts. As monitored using Cripto, the hMSC is suitable for use as feeder cells to promote hES cell proliferation without differentiation.
[0027]
FIG. 6 shows results of an experiment in which different media were tested for their ability to promote growth of hES cells without proliferation. The test media were not preconditioned, but supplemented with 8-40 ng/mL bFGF, with or without stem cell factor, Flt3 ligand, or LIF. Effective combinations of factors (Conditions 4 to 8) were identified by following the undifferentiated phenotype using the markers of this invention. Alterations in expression profiles were temporary and reversible, showing that the cells are still undifferentiated.
DETAILED DESCRIPTION
[0028] The propensity of pluripotent stem cells to differentiate spontaneously has made it challenging for investigators to work with these cells. Consistent cultures of undifferentiated stem cells are required to compare results obtained from multiple experiments performed within or between laboratories. Unfortunately, morphological characterization is subjective and especially difficult for cultures that often contain 10-20% differentiated cells. Nevertheless, having a set of standardized criteria will be important in qualifying these cells for use in clinical therapy.
[0029] The marker system identified in this disclosure provides the basis for establishing these standards. 148,453 different transcripts were amplified and sequenced from undifferentiated human embryonic stem cells, and three types of progeny. As a result of this sequencing effort, 532 genes were identified having substantially higher EST counts in undifferentiated cells, and 142 genes were identified having substantially higher EST counts after differentiation. Other differentially expressed genes were identified by microarray analysis of undifferentiated cells, compared with cells at the beginning of the differentiation process.
[0030] The system provided by this invention can be used not only to qualify populations of undifferentiated cells, but in other powerful ways of maintaining and manipulating cells described later in this disclosure. Culture systems have been identified and protocols have been developed to expand cultures of undifferentiated cells and produce commercially viable quantities of cells for use in research, drug screening, and regenerative medicine.
Definitions
[0031] “Pluripotent Stem cells” (pPS cells) are pluripotent cells that have the characteristic of being capable under appropriate conditions of producing progeny of several different cell types that are derivatives of all of the three germinal layers (endoderm, mesoderm, and ectoderm), according to a standard art-accepted test, such as the ability to form a teratoma in 8-12 week old SCID mice. The term includes both established lines of stem cells of various kinds, and cells obtained from primary tissue that are pluripotent in the manner described. For the purposes of this disclosure, the pPS cells are not embryonal carcinoma (EC) cells, and are not derived from a malignant source. It is desirable (but not always necessary) that the cells be euploid. Exemplary pPS cells are obtained from embryonic or fetal tissue at any time after fertilization.
[0032] “Human Embryonic Stem cells” (hES cells) are pluripotent stem cells derived from a human embryo in the blastocyst stage, or human pluripotent cells produced by artificial means (such as by nuclear transfer) that have equivalent characteristics. Exemplary derivation procedures and features are provided in a later section.
[0033] hES cell cultures are described as “undifferentiated” when a substantial proportion (at least 20%, and possibly over 50% or 80%) of stem cells and their derivatives in the population display morphological characteristics of undifferentiated cells, distinguishing them from differentiated cells of embryo or adult origin. It is understood that colonies of undifferentiated cells within the population will often be surrounded by neighboring cells that are differentiated. It is also understood that the proportion of cells displaying the undifferentiated phenotype will fluctuate as the cells proliferate and are passaged from one culture to another. Cells are recognized as proliferating in an undifferentiated state when they go through at least 4 passages and/or 8 population doublings while retaining at least about 50%, or the same proportion of cells bearing characteristic markers or morphological characteristics of undifferentiated cells.
[0034] A “differentiated cell” is a cell that has progressed down a developmental pathway, and includes lineage-committed progenitor cells and terminally differentiated cells.
[0035] “Feeder cells” or “feeders” are terms used to describe cells of one type that are co-cultured with cells of another type, to provide an environment in which the cells of the second type can grow. hES cell populations are said to be “essentially free” of feeder cells if the cells have been grown through at least one round after splitting in which fresh feeder cells are not added to support the growth of pPS cells.
[0036] The term “embryoid bodies” refers to aggregates of differentiated and undifferentiated cells that appear when pPS cells overgrow in monolayer cultures, or are maintained in suspension cultures. Embryoid bodies are a mixture of different cell types, typically from several germ layers, distinguishable by morphological criteria and cell markers detectable by immunocytochemistry.
[0037] A cell “marker” is any phenotypic feature of a cell that can be used to characterize it or discriminate it from other cell types. A marker of this invention may be a protein (including secreted, cell surface, or internal proteins; either synthesized or taken up by the cell); a nucleic acid (such as an mRNA, or enzymatically active nucleic acid molecule) or a polysaccharide. Included are determinants of any such cell components that are detectable by antibody, lectin, probe or nucleic acid amplification reaction that are specific for the cell type of interest. The markers can also be identified by a biochemical or enzyme assay that depend on the function of the gene product. Associated with each marker is the gene that encodes the transcript, and the events that lead to marker expression.
[0038] The terms “polynucleotide” and “nucleic acid” refer to a polymeric form of nucleotides of any length. Included are genes and gene fragments, mRNA, cDNA, plasmids, viral and non-viral vectors and particles, nucleic acid probes, amplification primers, and their chemical equivalents. As used in this disclosure, the term polynucleotide refers interchangeably to double- and single-stranded molecules. Unless otherwise specified, any embodiment of the invention that is a polynucleotide encompasses both a double-stranded form, and each of the two complementary single-stranded forms known or predicted to make up the double-stranded form.
[0039] A cell is said to be “genetically altered” or “transtected” when a polynucleotide has been transferred into the cell by any suitable means of artificial manipulation, or where the cell is a progeny of the originally altered cell that has inherited the polynucleotide.
[0040] A “control element” or “control sequence” is a nucleotide sequence involved in an interaction of molecules that contributes to the functional regulation of a polynucleotide, including replication, duplication, transcription, splicing, translation, or degradation of the polynucleotide. “Operatively linked” refers to an operative relationship between genetic elements, in which the function of one element influences the function of another element. For example, an expressible encoding sequence may be operatively linked to a promoter that drives gene transcription.
[0041] The term “antibody” as used in this disclosure refers to both polyclonal and monoclonal antibody. The ambit of the term deliberately encompasses not only intact immunoglobulin molecules, but also such fragments and derivatives of immunoglobulin molecules that retain a desired binding specificity.
General Techniques
[0042] Methods in molecular genetics and genetic engineering are described generally in the current editions of Molecular Cloning: A Laboratory Manual, (Sambrook et al.); Oligonucleotide Synthesis (M. J. Gait, ed.); Animal Cell Culture (R.l. Freshney, ed.); Gene Transfer Vectors for Mammalian Cells (Miller & Calos, eds.); Current Protocols in Molecular Biology and Short Protocols in Molecular Biology, 3rd Edition (F. M. Ausubel et al., eds.); and Recombinant DNA Methodology (R. Wu ed., Academic Press). Antibody production is described in Basic Methods in Antibody Production and Characterization (Howard & Bethell eds., CRC Press, 2000).
[0043] A survey of relevant techniques is provided in such standard texts as DNA Sequencing (A. E. Barron, John Wiley, 2002), and DNA Microarrays and Gene Expression (P. Baldi et al., Cambridge U. Press, 2002). For a description of the molecular biology of cancer, the reader is referred to Principles of Molecular Oncology (M. H. Bronchud et al. eds., Humana Press, 2000); The Biological Basis of Cancer (R. G. McKinnel et al. eds., Cambridge University Press, 1998); and Molecular Genetics of Cancer (J. K. Cowell ed., Bios Scientific Publishers, 1999).
[0044] Sources of Stem Cells
[0045] This invention is based on observations made with established lines of hES cells. The markers are suitable for identifying, characterizing, and manipulating related types of undifferentiated pluripotent cells. They are also suitable for use with pluripotent cells obtained from primary embryonic tissue, without first establishing an undifferentiated cell line. It is contemplated that the markers described in this application will in general be useful for other types of pluripotent cells, including embryonic germ cells (U.S. Pat. Nos. 6,090,622 and 6,251,671), and ES and EG cells from other mammalian species, such as non-human primates.
[0046] Embryonic Stem Cells
[0047] Embryonic stem cells can be isolated from blastocysts of members of primate species (U.S. Pat. No. 5,843,780; Thomson et al., Proc. Natl. Acad. Sci. USA 92:7844, 1995). Human embryonic stem (hES) cells can be prepared from human blastocyst cells using the techniques described by Thomson et al. (U.S. Pat. No. 6,200,806; Science 282:1145, 1998; Curr. Top. Dev. Biol. 38:133 ff., 1998) and Reubinoff et al, Nature Biotech. 18:399, 2000. Equivalent cell types to hES cells include their pluripotent derivatives, such as primitive ectoderm-like (EPL) cells, outlined in WO 01/51610 (Bresagen).
[0048] hES cells can be obtained from human preimplantation embryos. Alternatively, in vitro fertilized (IVF) embryos can be used, or one-cell human embryos can be expanded to the blastocyst stage (Bongso et al., Hum Reprod 4: 706, 1989). Embryos are cultured to the blastocyst stage in G1.2 and G2.2 medium (Gardner et al., Fertil. Steril. 69:84, 1998). The zona pellucida is removed from developed blastocysts by brief exposure to pronase (Sigma). The inner cell masses are isolated by immunosurgery, in which blastocysts are exposed to a 1:50 dilution of rabbit anti-human spleen cell antiserum for 30 min, then washed for 5 min three times in DMEM, and exposed to a 1:5 dilution of Guinea pig complement (Gibco) for 3 min (Solter et al., Proc. Natl. Acad. Sci. USA 72:5099, 1975). After two further washes in DMEM, lysed trophectoderm cells are removed from the intact inner cell mass (ICM) by gentle pipetting, and the ICM plated on mEF feeder layers.
[0049] After 9 to 15 days, inner cell mass derived outgrowths are dissociated into clumps, either by exposure to calcium and magnesium-free phosphate-buffered saline (PBS) with 1 mM EDTA, by exposure to dispase or trypsin, or by mechanical dissociation with a micropipette; and then replated on mEF in fresh medium. Growing colonies having undifferentiated morphology are individually selected by micropipette, mechanically dissociated into clumps, and replated. ES-like morphology is characterized as compact colonies with apparently high nucleus to cytoplasm ratio and prominent nucleoli. Resulting ES cells are then routinely split every 1-2 weeks by brief trypsinization, exposure to Dulbecco's PBS (containing 2 mM EDTA), exposure to type IV collagenase (˜200 U/mL; Gibco) or by selection of individual colonies by micropipette. Clump sizes of about 50 to 100 cells are optimal.
[0050] Propagation of pPS Cells in an Undifferentiated State
[0051] pPS cells can be propagated continuously in culture, using culture conditions that promote proliferation without promoting differentiation. Exemplary serum-containing ES medium is made with 80% DMEM (such as Knock-Out DMEM, Gibco), 20% of either defined fetal bovine serum (FBS, Hyclone) or serum replacement (US 20020076747 A1, Life Technologies Inc.), 1% non-essential amino acids, 1 mM L-glutamine, and 0.1 mM β-mercaptoethanol. Just before use, human bFGF is added to 4 ng/mL (WO 99/20741, Geron Corp.).
[0052] Traditionally, ES cells are cultured on a layer of feeder cells, typically fibroblasts derived from embryonic or fetal tissue. Embryos are harvested from a CF1 mouse at 13 days of pregnancy, transferred to 2 mL trypsin/EDTA, finely minced, and incubated 5 min at 37° C. 10% FBS is added, debris is allowed to settle, and the cells are propagated in 90% DMEM, 10% FBS, and 2 mM glutamine. To prepare a feeder cell layer, cells are irradiated to inhibit proliferation but permit synthesis of factors that support ES cells (˜4000 rads γ-irradiation). Culture plates are coated with 0.5% gelatin overnight, plated with 375,000 irradiated mEFs per well, and used 5 h to 4 days after plating. The medium is replaced with fresh hES medium just before seeding pPS cells.
[0053] Scientists at Geron have discovered that pPS cells can be maintained in an undifferentiated state even without feeder cells. The environment for feeder-free cultures includes a suitable culture substrate, particularly an extracellular matrix such as Matrigel® or laminin. The pPS cells are plated at >15,000 cells cm−2 (optimally 90,000 cm−2 to 170,000 cm−2). Typically, enzymatic digestion is halted before cells become completely dispersed (say, ˜5 min with collagenase IV). Clumps of ˜10 to 2,000 cells are then plated directly onto the substrate without further dispersal. Alternatively, the cells can be harvested without enzymes before the plate reaches confluence by incubating ˜5 min in a solution of 0.5 mM EDTA in PBS. After washing from the culture vessel, the cells are plated into a new culture without further dispersal. In a further illustration, confluent human embryonic stem cells cultured in the absence of feeders are removed from the plates by incubating with a solution of 0.05% (wt/vol) trypsin (Gibco) and 0.053 mM EDTA for 5-15 min at 37° C. The remaining cells in the plate are removed and the cells are triturated into a suspension comprising single cells and small clusters, and then plated at densities of 50,000-200,000 cells cm−2 to promote survival and limit differentiation.
[0054] Feeder-free cultures are supported by a nutrient medium containing factors that support proliferation of the cells without differentiation. Such factors may be introduced into the medium by culturing the medium with cells secreting such factors, such as irradiated (˜4,000 rad) primary mouse embryonic fibroblasts, telomerized mouse fibroblasts, or fibroblast-like cells derived from pPS cells. Medium can be conditioned by plating the feeders at a density of ˜5-6×104 cm−2 in a serum free medium such as KO DMEM supplemented with 20% serum replacement and 4 ng/mL bFGF. Medium that has been conditioned for 1-2 days is supplemented with further bFGF, and used to support pPS cell culture for 1-2 days. Alternatively or in addition, other factors can be added that help support proliferation without differentiation, such as ligands for the FGF-2 or FGF-4 receptor, ligands for c-kit (such as stem cell factor), ligands for receptors associated with gp 130, insulin, transferrin, lipids, cholesterol, nucleosides, pyruvate, and a reducing agent such as β-mercaptoethanol. Aspects of the feeder-free culture method are further discussed in International Patent Publications WO 99/20741, WO 01/51616; Xu et al., Nat. Biotechnol. 19:971, 2001; and PCT application PCT/US02/28200. Exemplary culture conditions tested and validated using the marker system of this invention are provided below in Example 6.
[0055] Under the microscope, ES cells appear with high nuclear/cytoplasmic ratios, prominent nucleoli, and compact colony formation with poorly discernable cell junctions. Conventional markers for hES cells are stage-specific embryonic antigen (SSEA) 3 and 4, and markers detectable using antibodies Tra-1-60 and Tra-1-81 (Thomson et al., Science 282:1145, 1998). Differentiation of pPS cells in vitro results in the loss of SSEA-4, Tra-1-60, and Tra-1-81 expression, and increased expression of SSEA-1.
Markers of Undifferentiated pPS Cells and Their Differentiated Progeny
[0056] The tables and description provided later in this disclosure provide markers that distinguish undifferentiated pPS cells from their differentiated progeny.
[0057] Expression libraries were made from ES cells (WO 01/51616), embryoid bodies (WO 01/51616), and cells differentiated towards the hepatocyte (WO 01/81549) or neural cell (WO 01/88104) lineage. mRNA was reverse transcribed and amplified, producing expressed sequence tags (ESTs) occurring in frequency proportional to the level of expression in the cell type being analyzed. The ESTs were subjected to automatic sequencing, and counted according to the corresponding unique (non-redundant) transcript. A total of 148,453 non-redundant transcripts were represented in each of the 4 libraries. Genes were then identified as having a differential expression pattern if the number of EST counts of the transcript was statistically different between the libraries being compared.
[0058] In a parallel set of experiments, mRNA from each of the cell types was analyzed for binding to a broad-specificity EST-based microarray, performed according to the method described in WO 01/51616. Genes were identified as having a differential expression pattern if they showed a comparatively different signal on the microarray.
[0059] Significant expression differences determined by EST sequencing, microarray analysis, or other observations were confirmed by real-time PCR analysis. The mRNA was amplified by PCR using specific forward and reverse primers designed from the GenBank sequence, and the amplification product was detected using labeled sequence-specific probes. The number of amplification cycles required to reach a threshold amount was then compared between different libraries.
[0060] Distinguishing markers fall into several categories. Those of particular interest include the following:
[0061] Markers characteristically expressed at a higher level in undifferentiated pPS cells than any of the differentiated cells, indicating down-regulation during differentiation. The gene products may be involved in maintaining the undifferentiated phenotype.
[0062] Markers characteristically expressed at a higher level in the three differentiated cell types than in the undifferentiated cells, indicating up-regulation during differentiation. The gene products may be involved in the general differentiation process.
[0063] Markers characteristically expressed at a higher level in one of the differentiated cell types. The encoded genes may be involved in differentiation down restricted lineages.
[0064] Markers can also be classified according to the function of the gene product or its location in the cell. Where not already indicated, protein gene products can be predicted by referencing public information according to the GenBank accession number, or by translating the open reading frame after the translation start signal though the genetic code. Features of the markers listed can be determined by the descriptors give in the tables below, or by using the accession number or sequence data to reference public information. Marker groups of particular interest include the following:
[0065] Secreted proteins—of interest, for example, because they can be detected by immunoassay of the culture supernatant, and may transmit signals to neighboring cells. Secreted proteins typically have an N-terminal signal peptides, and may have glycosylation sites.
[0066] Surface membrane proteins—of interest, for example, because they can be used for cell-surface labeling and affinity separation, or because they act as receptors for signal transduction. They may have glycosylation sites and a membrane spanning region. A Markov model for predicting transmembrane protein topology is described by Krogh et al., J. Mol Biol. 305:567, 2001.
[0067] Enzymes with relevant function. For example, enzymes involved in protein synthesis and cleavage or in apoptosis may influence differentiation. Glycosyltransferases decorate the cell membrane with distinguishing carbohydrate epitopes that may play a role in cellular adhesion or localization.
[0068] Transcription regulatory factors—of interest for their potential to influence differentiation, as explained later in this disclosure. These factors sometimes have zinc fingers or other identifiable topological features involved in the binding or metabolism of nucleic acids.
[0069] Through the course of this work, the key signaling pathways Wnt, Sonic hedgehog (Shh), and Notch emerged as regulators of growth of pPS cells. Interestingly, these pathways have also been shown to play a role in the growth of tumor cells of various kinds, and in embryonic development of lower species.
[0070] Now that genes have been identified that are up-regulated or down-regulated upon differentiation, a number of commercial applications of these markers will be apparent to the skilled reader. The sections that follow provide non-limiting illustrations of how some of these embodiments can be implemented.
Use of Cell Markers to Characterize DPS Cells and Their Differentiated Progeny
[0071] The markers provided in this disclosure can be used as a means to identify both undifferentiated and differentiated cells—either a population as a whole, or as individual cells within a population. This can be used to evaluate the expansion or maintenance of pre-existing cell populations, or to characterize the pluripotent nature (or lineage commitment) of newly obtained populations.
[0072] Expression of single markers in a test cell will provide evidence of undifferentiated or differentiated phenotype, according to the expression pattern listed later in this disclosure. A plurality of markers (such as any 2, 3, 4, 5, 6, 8, 10, 12, 15, or 20 markers from Tables 2-3 or 5-9) will provide a more detailed assessment of the characteristics of the cell. Expression of genes that are down-regulated and/or lack of expression of genes that are up-regulated upon differentiation correlates with a differentiated phenotype. Expression of genes that are up-regulated and/or lack of expression of genes that are down-regulated upon differentiation correlates with an undifferentiated phenotype. The markers newly identified in this disclosure may be analyzed together (with or without markers that were previously known) in any combination effective for characterizing the cell status or phenotype.
[0073] Tissue-specific markers can be detected using any suitable immunological technique—such as flow cytochemistry for cell-surface markers, or immunocytochemistry (for example, of fixed cells or tissue sections) for intracellular or cell-surface markers. Expression of a cell-surface antigen is defined as positive if a significantly detectable amount of antibody will bind to the antigen in a standard immunocytochemistry or flow cytometry assay, optionally after fixation of the cells, and optionally using a labeled secondary antibody or other conjugate to amplify labeling.
[0074] The expression of tissue-specific gene products can also be detected at the mRNA level by Northern blot analysis, dot-blot hybridization analysis, or by reverse transcriptase initiated polymerase chain reaction (RT-PCR) using sequence-specific primers in standard amplification methods. See U.S. Pat. No. 5,843,780 for further details. Sequence data for particular markers listed in this disclosure can be obtained from public databases such as GenBank.
[0075] These and other suitable assay systems are described in standard reference texts, such as the following: PCR Cloning Protocols, 2nd Ed (James & Chen eds., Humana Press, 2002); Rapid Cycle Real-Time PCR: Methods and Applications (C. Wittwer et al. eds., Springer-Verlag NY, 2002); Immunoassays: A Practical Approach (James Gosling ed., Oxford Univ Press, 2000); Cytometric Analysis of Cell Phenotype and Function (McCarthy et al. eds., Cambridge Univ Press, 2001). Reagents for conducting these assays, such as nucleotide probes or primers, or specific antibody, can be packaged in kit form, optionally with instructions for the use of the reagents in the characterization or monitoring of pPS cells, or their differentiated progeny.
Use of Cell Markers for Clinical Diagnosis
[0076] Stem cells regulate their own replenishment and serve as a source of cells that can differentiate into defined cell lineages. Cancer cells also have the ability to self-renew, but lack of regulation results in uncontrolled cellular proliferation. Three key signaling pathways, Wnt, Sonic hedgehog (Shh), and Notch, are known growth regulators of tumor cells. The genomics data provided in this disclosure indicate that all three of these pathways are active in hES cells.
[0077] It is a hypothesis of this invention that many of the markers discovered to be more highly expressed in undifferentiated pPS cells can also be up-regulated upon dedifferentiation of cells upon malignant transformation. Accordingly, this disclosure provides a system for evaluating clinical conditions associated with abnormal cell growth, such as hyperplasia or cancers of various kinds. Markers meeting the desired criteria include those contained in Tables 2, 5, 7 and 9.
[0078] Expression of each marker of interest is determined at the mRNA or protein level using a suitable assay system such as those described earlier; and then the expression is correlated with the clinical condition that the patient is suspected of having. As before, combinations of multiple markers may be more effective in doing the assessment. Presence of a particular marker may also provide a means by which a toxic agent or other therapeutic drug may be targeted to the disease site.
[0079] In a similar fashion, the markers of this invention can be used to evaluate a human or non-human subject who has been treated with a cell population or tissue generated by differentiating pPS cells. A histological sample taken at or near the site of administration, or a site to which the cells would be expected to migrate, could be harvested at a time subsequent to treatment, and then assayed to assess whether any of the administered cells had reverted to the undifferentiated phenotype. Reagents for conducting diagnostic tests, such as nucleotide probes or primers, or specific antibody, can be packaged in kit form, optionally with instructions for the use of the reagents in the determination of a disease condition.
Use of Cell Markers to Assess and Manipulate Culture Conditions
[0080] The markers and marker combinations of this invention provide a system for monitoring undifferentiated pPS cells and their differentiated progeny in culture. This system can be used as a quality control, to compare the characteristics of undifferentiated pPS cells between different passages or different batches. It can also be used to assess a change in culture conditions, to determine the effect of the change on the undifferentiated cell phenotype.
[0081] Where the object is to produce undifferentiated cells, a decrease in the level of expression of an undifferentiated marker because of the alteration by 3-, 10-, 25-, 100- and 1000-fold is progressively less preferred. Corresponding increases in marker expression may be more beneficial. Moderate decreases in marker expression may be quite acceptable within certain boundaries, if the cells retain their ability to form progeny of all three germ layers is retained, and/or the level of the undifferentiated marker is relatively restored when culture conditions are returned to normal.
[0082] In this manner, the markers of this invention can be used to evaluate different feeder cells, extracellular matrixes, base media, additives to the media, culture vessels, or other features of the culture as illustrated in WO 99/20741 and PCT application PCT/US02/28200. Illustrations of this technique are provided below in Example 6 (FIGS. 3 to 6).
[0083] In a similar fashion, the markers of this invention can also be used to monitor and optimize conditions for differentiating cells. Improved differentiation procedures will lead to higher or more rapid expression of markers for the differentiated phenotype, and/or lower or more rapid decrease in expression of markers for the undifferentiated phenotype.
Use of Cell Markers to Regulate Gene Expression
[0084] Differential expression of the markers listed in this disclosure indicates that each marker is controlled by a transcriptional regulatory element (such as a promoter) that is tissue specific, causing higher levels of expression in undifferentiated cells compared with differentiated cells, or vice versa. When the corresponding transcriptional regulatory element is combined with a heterologous encoding region to drive expression of the encoding region, then the expression pattern in different cell types will mimic that of the marker gene.
[0085] Minimum promoter sequences of many of the genes listed in this disclosure are known and further described elsewhere. Where a promoter has not been fully characterized, specific transcription can usually be driven by taking the 500 base pairs immediately upstream of the translation start signal for the marker in the corresponding genomic clone.
[0086] To express a heterologous encoding region according to this embodiment of the invention, a recombinant vector is constructed in which the specific promoter of interest is operatively linked to the encoding region in such a manner that it drives transcription of the encoding region upon transfection into a suitable host cell. Suitable vector systems for transient expression include those based on adenovirus and certain types of plasmids. Vectors for long-term expression include those based on plasmid lipofection or electroporation, episomal vectors, retrovirus, and lentivirus.
[0087] One application of tissue-specific promoters is expression of a reporter gene. Suitable reporters include fluorescence markers such as green fluorescent protein, luciferase, or enzymatic markers such as alkaline phosphatase and β-galactosidase. Other reporters such as a blood group glycosyltransferase (WO 02/074935), or Invitrogen's pDisplay™, create a cell surface epitope that can be counterstained with labeled specific antibody or lectin. pPS cells labeled with reporters can be used to follow the differentiation process directly, the presence or absence of the reporter correlating with the undifferentiated or differentiated phenotype, depending on the specificity of the promoter. This in turn can be used to follow or optimize culture conditions for undifferentiated pPS cells, or differentiation protocols. Alternatively, cells containing promoter-reporter constructs can be used for drug screening, in which a test compound is combined with the cell, and expression or suppression of the promoter is correlated with an effect attributable to the compound.
[0088] Another application of tissue-specific promoters is expression of a positive or negative drug selection marker. Antibiotic resistance genes such as neomycin phosphotransferase, expressed under control of a tissue-specific promoter, can be used to positively select for undifferentiated or differentiated cells in a medium containing the corresponding drug (geneticin), by choosing a promoter with the appropriate specificity. Toxin genes, genes that mediate apoptosis, or genes that convert a prodrug into a toxic compound (such as thymidine kinase) can be used to negatively select against contaminating undifferentiated or differentiated cells in a population of the opposite phenotype (WO 02/42445; GB 2374076).
[0089] Promoters specific for the undifferentiated cell phenotype can also be used as a means for targeting cancer cells—using the promoter to drive expression of a gene that is toxic to the cell (WO 98/14593, WO 02/42468), or to drive a replication gene in a viral vector (WO 00/46355). For example, an adenoviral vector in which the GRPR promoter (AY032865) drives the E1a gene should specifically lyse cancer cells in the manner described in Majumdar et al., Gene Ther. 8:568, 2001. Multiple promoters for the undifferentiated phenotype can be linked for improved cancer specificity (U.S. Ser. No. 10/206,447).
[0090] Other useful applications of tissue-specific promoters of this invention will come readily to the mind of the skilled reader.
Use of Markers for Cell Separation or Purification
[0091] Differentially expressed markers provided in this disclosure are also a means by which mixed cell populations can be separated into populations that are more homogeneous. This can be accomplished directly by selecting a marker of the undifferentiated or differentiated phenotype, which is itself expressed on the cell surface, or otherwise causes expression of a unique cell-surface epitope. The epitope is then used as a handle by which the marked cells can be physically separated from the unmarked cells. For example, marked cells can be aggregated or adsorbed to a solid support using an antibody or lectin that is specific for the epitope. Alternatively, the marker can be used to attach a fluorescently labeled antibody or lectin, and then the cell suspension can be subject to fluorescence-activated cell sorting.
[0092] An alternative approach is to take a tissue-specific promoter chosen based on its expression pattern (as described in the last section), and use it to drive transcription of a gene suitable for separating the cells. In this way, the marker from which the promoter is chosen need not itself be a cell surface protein. For example, the promoter can drive expression of a fluorescent gene, such as GFP, and then cells having the marked phenotype can be separated by FACS. In another example, the promoter drives expression of a heterologous gene that causes expression of a cell-surface epitope. The epitope is then used for adsorption-based separation, or to attach a fluorescent label, as already described.
Use of Cell Markers to Influence Differentiation
[0093] In another embodiment of this invention, the differentially expressed genes of this invention are caused to increase or decrease their expression level, in order to either inhibit or promote the differentiation process. Suitable genes are those that are believed in the normal case of ontogeny to be active in maintaining the undifferentiated state, active in the general process of differentiation, or active in differentiation into particular cell lineages. Markers of interest for this application are the following:
[0094] Transcription factors and other elements that directly affect transcription of other genes, such as Forkhead box O1A (FOXO1A); Zic family member 3 (ZIC3); Hypothetical protein FLJ20582; Forkhead box H1 (FOXH1); Zinc finger protein, Hsal2; KRAB-zinc finger protein SZF1-1; Zinc finger protein of cerebellum ZIC2; and Coup transcription factor 2 (COUP-TF2). Other candidates include those marked in Tables 5 and 6 with the symbol “{circle over (x)}”, and other factors with zinc fingers or nucleic acid binding activity.
[0095] Genes that influence cell interaction, such as those that encode adhesion molecules, and enzymes that make substrates for adhesion molecules
[0096] Genes encoding soluble factors that transmit signals within or between cells, and specific receptors that recognize them and are involved in signal transduction.
[0097] One way of manipulating gene expression is to induce a transient or stable genetic alteration in the cells using a suitable vector, such as those already listed. Scientists at Geron Corp. have determined that the following constitutive promoters are effective in undifferentiated hES cells: for transient expression CMV, SV40, EF1α, UbC, and PGK; for stable expression, SV40, EF1α, UbC, MND and PGK. Expressing a gene associated with the undifferentiated phenotype may assist the cells to stay undifferentiated in the absence of some of the elements usually required in the culture environment. Expressing a gene associated with the differentiated phenotype may promote early differentiation, and/or initiate a cascade of events beneficial for obtaining a desired cell population. Maintaining or causing expression of a gene of either type early in the differentiation process may in some instances help guide differentiation down a particular pathway.
[0098] Another way of manipulating gene expression is to alter transcription from the endogenous gene. One means of accomplishing this is to introduce factors that specifically influence transcription through the endogenous promoter. Another means suitable for down-regulating expression at the protein level is to genetically alter the cells with a nucleic acid that removes the mRNA or otherwise inhibits translation (for example, a hybridizing antisense molecule, ribozyme, or small interfering RNA). Dominant-negative mutants of the target factor can reduce the functional effect of the gene product. Targeting a particular factor associated with the undifferentiated phenotype in this fashion can be used to promote differentiation. In some instances, this can lead to de-repression of genes associated with a particular cell type.
[0099] Where the gene product is a soluble protein or peptide that influences cell interaction or signal transduction (for example, cytokines like osteopontin and Cripto), then it may be possible to affect differentiation simply by adding the product to the cells—in either recombinant or synthetic form, or purified from natural sources. Products that maintain the undifferentiated phenotype can then be withdrawn from the culture medium to initiate differentiation; and products that promote differentiation can be withdrawn once the process is complete.
[0100] Since differentiation is a multi-step process, changing the level of gene product on a permanent basis may cause multiple effects. In some instances, it may be advantageous to affect gene expression in a temporary fashion at each sequential step in the pathway, in case the same factor plays different effects at different steps of differentiation. For example, function of transcription factors can be evaluated by changing expression of individual genes, or by invoking a high throughput analysis, using cDNAs obtained from a suitable library such as exemplified in Example 1. Cells that undergo an alteration of interest can be cloned and pulled from multi-well plates, and the responsible gene identified by PCR amplification.
[0101] The effect of up- or down-regulating expression of a particular gene can be determined by evaluating the cell for morphological characteristics, and the expression of other characteristic markers. Besides the markers listed later in this disclosure, the reader may want to follow the effect on particular cell types, using markers for later-stage or terminally differentiated cells. Tissue-specific markers suitable for this purpose are listed in WO 01/81549 (hepatocytes), WO 01/88104 (neural cells), PCT/US02/20998 (osteoblasts and mesenchymal cells), PCT/US02/22245 (cardiomyocytes), PCT/US02/39091 (hematopoietic cells), PCT/US02/39089 (islet cells), and PCT/US02/39090 (chondrocytes). Such markers can be analyzed by PCR amplification, fluorescence labeling, or immunocytochemistry, as already described. Promoter-reporter constructs based on the same markers can facilitate analysis when expression is being altered in a high throughput protocol.
[0102] The examples that follow are provided for further illustration, and are not meant to limit the claimed invention.
An EST Database of Undifferentiated hES Cells and Their Differentiated Progeny
[0103] cDNA libraries were prepared from human embryonic stem (hES) cells cultured in undifferentiated form. cDNA libraries were also prepared from progeny, subject to non-specific differentiation as embryoid bodies (EBs), or taken through the preliminary stages of established differentiation protocols for neurons (preNEU) or hepatocytes (preHEP).
[0104] The hES cell lines H1, H7, and H9 were maintained under feeder-free conditions. Cultures were passaged every 5-days by incubation in 1 mg/mL collagenase IV for 5-10 min at 37° C., dissociated and seeded in clumps at 2.5 to 10×105 cells/well onto Matrigel™-coated six well plates in conditioned medium supplemented with 8 mg/mL bFGF. cDNA libraries were made after culturing for 5 days after the last passage.
[0105] EBs were prepared as follows. Confluent plates of undifferentiated hES cells were treated briefly with collagenase IV, and scraped to obtain small clusters of cells. Cell clusters were resuspended in 4 mL/well differentiation medium (KO DMEM containing 20% fetal bovine serum in place of 20% SR, and not preconditioned) on low adhesion 6-well plates (Costar). After 4 days in suspension, the contents of each well was transferred to individual wells pre-coated with gelatin. Each well was re-fed with 3 mL fresh differentiation medium every two days after replating. Cells were used for the preparation of cytoplasmic RNA on the eighth day after plating.
[0106] PreHEP cells were prepared based on the hepatocyte differentiation protocol described in WO 01/81549. Confluent wells of undifferentiated cells were prepared, and medium was changed to KO DMEM plus 20% SR+1% DMSO. The medium was changed every 24 h, and cells were used for preparation of cytoplasmic RNA on day 5 of DMSO treatment.
[0107] PreNEU cells were prepared based on the neural differentiation protocol described in WO 01/88104. hES cells of the H7 line (p29) were used to generate EBs as described above except that 10 μM all-trans RA was included in the differentiation medium. After 4 days in suspension, EBs were transferred to culture plate precoated with poly-L-lysine and laminin. After plating, the medium was changed to EPFI medium. Cells were used for the preparation of cytoplasmic RNA after 3 days of growth in EPFI.
[0108] Partial 5′ end sequences (an expressed sequence tag, or EST) were determined by conventional means for independent clones derived from each cDNA library. Overlapping ESTs were assembled into conjoined sequences.
1TABLE 1
|
|
Non-redundant EST sequences
Number
Libraryof ESTs
|
hESC 37,081
EB 37,555
preHEP 35,611
preNEU 38,206
Total148,453
|
[0109] All of the stem cell lines used for preparation of the expression libraries were originally isolated and initially propagated on mouse feeder cells. Accordingly, the libraries were analyzed to determine whether they were contaminated with murine retroviruses that had shed from the feeder cells and subsequently infected the stem cells. Three complete viral genomes were used in a BLAST search: Moloney murine leukemia virus, Friend murine leukemia virus, and murine type C retrovirus. No matches with a high score were found against any of the ESTs.
[0110] The sequences were then compared to the Unigene database of human genes. ESTs that were at least 98% identical, over a stretch of at least 150 nucleotides each, to a common reference sequence in Unigene, were assumed to be transcribed from the same gene, and placed into a common assembly. The complete set of 148,453 ESTs collapsed to a non-redundant set of 32,764 assemblies.
Selection of Marker Genes Specific for Undifferentiated and Differentiated Cells
[0111] Candidate markers were selected from a database based on the imputed level of gene expression. The frequency of ESTs for any particular gene correlates with the abundance of that mRNA in the cells used to generate the cDNA library. Thus, a comparison of frequencies of ESTs among the libraries indicates the relative abundance of the associated mRNA in the different cell types.
[0112] Candidate molecular markers were selected from the expressed gene (EST) database from their greater abundance in undifferentiated hES cells, relative to differentiated hES cells. Genes were identified as having a differential expression pattern (being up- or down-regulated) during the differentiation process, if the count of ESTs sequenced in the undifferentiated cells was substantially different from the sum of ESTs in the three differentiated libraries.
[0113] Oct 3/4 (a POU domain-containing transcription factor) and telomerase reverse transcriptase (hTERT) are known to be expressed preferentially in undifferentiated hES cells (WO 01/51616). Other genes suitable for characterizing or manipulating the undifferentiated phenotype are those that are down-regulated upon differentiation with a significance of p≦0.05, as determined by the Fisher Exact Test (explained below). 193 genes were found to have 4-fold more ESTs in hES cells, relative to each of the three cell types. 532 genes were found that were 2-fold greater hES cells, with a confidence of over 95% as determined by the Fisher Exact Test, relative to the sum of ESTs of the three cell types (minimum of 4 ESTs in hES cells). The following markers are of particular interest:
2TABLE 2
|
|
EST Frequency of Genes that are Down-regulated upon Differentiation of hES cells
EST counts
Geron IDGenBank IDNameESEBpreHEPpreNEU
|
GA_10902NM_024504Pr domain containing 14 (PRDM14)12100
GA_11893NM_032805Hypothetical protein FLJ1454925000
GA_12318NM_032447Fibrillin36000
GA_1322NM_000142Fibroblast growth factor receptor 3 precursor9151
(FGFR-3)
GA_34679NM_002015Forkhead box o1a (FOXO1a)4011
GA_1470NM_003740potassium channel, subfamily K, member 54001
(KCNK5), mRNA
GA_1674NM_002701Octamer-Binding Transcription Factor 3a24120
(OCT-3A) (OCT-4)
GA_2024NM_003212Teratocarcinoma-derived growth factor 120100
(CRIPTO)
GA_2149NM_003413Zic family member 3 (ZIC3)7010
GA_2334NM_000216Kallmann syndrome 1 sequence (KAL1)5010
GA_23552NM_152742hypothetical protein DKFZp547M1096012
(DKFZp547M109), mRNA
GA_2356NM_002851Protein tyrosine phosphatase, receptor-type,10000
z polypeptide 1 (PTPRZ1),
GA_2357NM_001670Armadillo repeat protein deleted in6000
velo-cardio-facial syndrome (ARVCF)
GA_23578BM454360AGENCOURT_6402318 NIH_MGC_856000
Homo sapiens cDNA clone IMAGE: 5497491
5′, mRNA sequence
GA_2367NM_003923Forkhead box H1 (FOXH1)5000
GA_2436NM_004329Bone morphogenetic protein receptor, type la7311
(BMPR1A) (ALK-3)
GA_2442NM_004335Bone marrow stromal antigen 2 (BST-2)13023
GA_2945NM_005232Ephrin type-a receptor 1 (EPHA1)5111
GA_2962NM_005314Gastrin-releasing peptide receptor (GRP-R)4000
GA_2988NM_005397Podocalyxin-like (PODXL)592358
GA_3337NM_006159NELL2 (nel-like protein 2)5320
GA_3559NM_005629Solute carrier family 6, member 8 (SLC6A8)5101
GA_3898NM_006892DNA (cytosine-5-)-methyltransferase 3 beta49231
(DNMT3B)
GA_5391NM_002968Sal-like 1 (SALL1),7110
GA_33680NM_016089Krab-zinc finger protein SZF1-115010
GA_36977NM_020927KIAA1576 protein9210
GA_8723NM_152333Homo sapiens chromosome 14 open reading14113
frame 69 (C14orf69), mRNA
GA_9167AF308602Notch 1 (N1)6210
GA_9183NM_007129Homo sapiens Zic family member 2 (odd-8110
paired homolog, Drosophila) (ZIC2), mRNA
GA_35037NM_004426Homo sapiens polyhomeotic-like 134954
(Drosophila) (PHC1), mRNA
|
[0114] Only one EST for hTERT was identified in undifferentiated hES cells and none were detected from the differentiated cells, which was not statistically significant. Thus, potentially useful markers that are expressed at low levels could have been omitted in this analysis, which required a minimum of four ESTs. It would be possible to identify such genes by using other techniques described elsewhere in this disclosure.
[0115] Three genes were observed from EST frequency queries that were of particular interest as potentially useful markers of hES cells. They were Teratocarcinoma-derived growth factor (Cripto), Podocalyxin-like (PODXL), and gastrin-releasing peptide receptor (GRPR). These genes were not only more abundant in undifferentiated cells, relative to differentiated hES cells, but also encoded for proteins expressed on the surface of cells. Surface markers have the added advantage that they could be easily detected with immunological reagents. ESTs for Cripto and GRPR were quite restricted to hES cells, with one or zero ESTs, respectively, scored in any of the differentiated cells. PODXL ESTs were detected in all 4-cell types, but substantially fewer (2.5×-12×) in differentiated cells. All three markers retained a detectable level of expression in differentiated cultures of hES cells. There may be a low level of expression of these markers in differentiated cells, or the expression detected may be due to a small proportion of undifferentiated cells in the population. GABA(A) receptor, Lefty B, Osteopontin, Thy-1 co-transcribed, and Solute carrier 21 are other significant markers of the undifferentiated phenotype.
[0116] By similar reasoning, genes that show a higher frequency of ESTs in differentiated cells can be used as specific markers for differentiation. ESTs that are 2-fold more abundant in the sum of all three differentiated cell types (EBs, preHEP and preNEU cells) and with a p-value<0.05 as determined by the Fisher Exact Test, compared with undifferentiated hES cells are candidate markers for differentiation down multiple pathways. ESTs that are relatively abundant in only one of the differentiated cell types are candidate markers for tissue-specific differentiation. The following markers are of particular interest:
3TABLE 3
|
|
EST Frequency of Genes that are Upregulated upon Differentiation
EST counts
Geron IDGenBank IDNameESEBpreHEPpreNEU
|
GA_35463NM_024298Homo sapiens leukocyte receptor cluster (LRC)0498
member 4 (LENG4), mRNA
GA_10492NM_006903Inorganic pyrophosphatase (PPASE)0556
GA_38563NM_021005Homo sapiens nuclear receptor subfamily 2,0989
group F, member 2 (NR2F2), mRNA
GA_38570NM_001844Collagen, type II, alpha 1 (COL2A1), transcript15315
variant 1
GA_1476NM_002276Keratin type I cytoskeletal 19 (cytokeratin 19)1261438
GA_34776NM_002273Keratin type II cytoskeletal 8 (cytokeratin 8)971144156
(CK 8)
GA_1735NM_002806Homo sapiens proteasome (prosome,1778
macropain) 26S subunit, ATPase, 6 (PSMC6),
mRNA
GA_1843NM_00098260 s ribosomal protein I21174842
GA_35369NM_003374Voltage-dependent anion-selective channel15610
(VDAC-1)
GA_23117NM_004772P311 protein [Homo sapiens]1576
GA_2597NM_138610Homo sapiens H2A histone family, member Y15514
(H2AFY), transcript variant 3, mRNA
GA_3283NM_004484Homo sapiens glypican 3 (GPC3), mRNA16712
GA_3530NM_002539Homo sapiens ornithine decarboxylase 111089
(ODC1), mRNA
GA_4145NM_002480Protein phosphatase 1, regulatory(inhibitor)1666
subunit 12A (PPP1R12A)
GA_5992NM_014899Homo sapiens Rho-related BTB domain010713
containing 3 (RHOBTB3), mRNA
GA_6136NM_016368Homo sapiens myo-inositol 1-phosphate17516
synthase A1 (ISYNA1), mRNA
GA_6165NM_015853Orf (LOC51035)1595
GA_6219NM_01613916.7 Kd protein (LOC51142),151314
GA_723NM_005801Homo sapiens putative translation initiation1141519
factor (SUI1), mRNA
GA_9196NM_000404Homo sapiens galactosidase, beta 1 (GLB1),06107
transcript variant 179423, mRNA
GA_9649NM_014604Tax interaction protein 1 (TIP-1)0855
|
[0117] The relative expression levels were calculated as follows:
1
[0118] The es value is substantially >1 for genes marking the undifferentiated phenotype, and <1 for genes indicating differentiation.
[0119] The Fisher Exact Test was used to determine whether changes were statistically significant. S. Siegel & N. J. Castellan. Nonparametric Statistics for the Behavioral Sciences (2nd ed., McGraw-Hill NJ, 1988). This is a standard test that can be used for 2×2 tables, and is conservative in declaring significance if the data are sparse. For analysis of EST sequences, the tables were of the following form:
4TABLE 4
|
|
Fisher Exact Test for Statistical Analysis of Differential Expression
Gene XAll Other GenesTotal
|
Poola = number ofA = number of sequencesN = a + A
Asequences in Pool Ain Pool A NOT assignedtotal number of
assigned to Gene Xto Gene Xsequences in Pool
A
Poolb = number ofB = number of sequencesM = b + B
Bsequences in Pool Bin Pool B NOT assignedtotal number of
assigned to Gene Xto Gene Xsequences in Pool
B
Totalc = a + bC = A + BN + M = c + C
|
[0120] where Pool A contains the sequences derived from the undifferentiated hES cells and Pool B contains the sequences from the other three cell types (EB, preHep, preNeu). N is equal to the number of sequences derived from the undifferentiated hES cells (37,081) and M is equal to the sum of all ESTs from the three differentiated cell types (111,372). For any given pair of pool sizes (N, M) and gene counts (c and C), the probability p of the table being generated by chance is calculated where:
p=[N! M! c! C!]/[
(N+M)! a! b! A! B!]
[0121] and where 0! by default is set to 1. The null hypothesis of a gene being equally represented in two pools is rejected when probability p≦0.05, where 0.05 is the level of statistical certainty. Thus, genes with p≦0.05 are considered to be differentially represented.
[0122] The following markers were identified as changing their expression levels significantly upon differentiation. The markers identified with the symbol “{circle over (x)}” may play a role in the regulation of gene transcription.
5TABLE 5
|
|
EST Frequency of Genes that Down-regulate upon Differentiation
EST counts
Geron IDGenBank IDNameESEBpreHEPpreNeuTotalRelative Expression
|
GA_10021NM_018124hypothetical protein FLJ10520 (FLJ10520)10310es 4.51p = 0.02
GA_10053NM_033427cortactin binding protein 2 (CORTBP2)40004es > 4p = 0.00
GA_10057AB051540KIAA1753 protein sequence41106es 6.01p = 0.04
GA_10082NM_030645KIAA1720 protein (KIAA1720)60107es 18.02p = 0.00
GA_10153NM_015039chromosome 1 open reading frame 15 (C1orf15),41106es 6.01p = 0.04
transcript variant 1
GA_102NM_015043KIAA0676 protein (KIAA0676)640111es 3.60p = 0.03
GA_10252NM_003376vascular endothelial growth factor (VEGF)52029es 3.75p = 0.05
GA_10258AK091948cDNA FLJ34629 fis, clone KIDNE2015515, highly40004es > 4p = 0.00
similar to NADP-dependent leukotriene b4 12-
hydroxydehydrogenase (EC 1.1.1.-) sequence
GA_10308NM_024046hypothetical protein MGC8407 (MGC8407)40004es > 4p = 0.00
GA_10327NM_024077SECIS binding protein 2 (SBP2)923216es 3.86p = 0.01
GA_10334NM_024090long-chain fatty-acyl elongase (LCE)50027es 7.51p = 0.01
GA_10513NM_033209Thy-1 co-transcribed (LOC94105)722112es 4.20p = 0.01
GA_10528NM_030622cytochrome P450, subfamily IIS, polypeptide 160107es 18.02p = 0.00
(CYP2S1)
GA_1053NM_001618ADP-ribosyltransferase (NAD+; poly (ADP-ribose)251314961es 2.09p = 0.01
polymerase) (ADPRT)
GA_10531NM_015271tripartite motif-containing 2 (TRIM2)620210es 4.51p = 0.02
GA_10603NM_025215pseudouridylate synthase 1 (PUS1)50229es 3.75p = 0.05
GA_10641NM_025108hypothetical protein FLJ13909 (FLJ13909)60017es 18.02p = 0.00
GA_10649NM_025082hypothetical protein FLJ13111 (FLJ13111)830011es 8.01p = 0.00
GA_1067NM_020977ankyrin 2, neuronal (ANK2), transcript variant 240004es > 4p = 0.00
GA_10696NM_024888hypothetical protein FLJ11535 (FLJ11535)52007es 7.51p = 0.01
GA_10713NM_024844pericentrin 1 (PCNT1)811010es 12.01p = 0.00
GA_1076NM_001659ADP-ribosylation factor 3 (ARF3)1985436es 3.36p = 0.00
GA_10831NM_024619hypothetical protein FLJ12171 (FLJ12171)40116es 6.01p = 0.04
GA_1085NM_000048argininosuccinate lyase (ASL)62008es 9.01p = 0.00
GA_10902NM_024504PR domain containing 14 (PRDM14)1210013es 36.04p = 0.00
GA_10905NM_022362MMS19-like (MET18 homolog, S. cerevisiae)1054120es 3.00p = 0.02
(MMS19L)
GA_10935NM_032569cytokine-like nuclear factor n-pac (N-PAC)831113es 4.81p = 0.01
GA_11047NM_004728DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 211893535es 3.18p = 0.00
(DDX21)
GA_11103NM_138347hypothetical protein BC005868 (LOC90233)40206es 6.01p = 0.04
GA_1119NM_001217carbonic anhydrase XI (CA11)51219es 3.75p = 0.05
GA_11368NM_032147hypothetical protein DKFZp434D012771008es 21.02p = 0.00
(DKFZP434D0127)
GA_11398NM_015471DKFZP566O1646 protein (DC8)51107es 7.51p = 0.01
GA_11528NM_021633kelch-like protein C3IP1 (C3IP1)51017es 7.51p = 0.01
GA_11532NM_024900PHD protein Jade-1 (Jade-1)61029es 6.01p = 0.01
GA_11552NM_024086hypothetical protein MGC3329 (MGC3329)630110es 4.51p = 0.02
GA_11577AB058780KIAA1877 protein sequence42006es 6.01p = 0.04
GA_1160NM_052988cyclin-dependent kinase (CDC2-like) 10 (CDK10),40116es 6.01p = 0.04
transcript variant 3
GA_11600NM_002883Ran GTPase activating protein 1 (RANGAP1)1273527es 2.40p = 0.03
GA_11656NM_018425phosphatidylinositol 4-kinase type II (PI4KII)51129es 3.75p = 0.05
GA_11773NM_025109hypothetical protein FLJ22865 (FLJ22865)60006es > 4p = 0.00
GA_11790NM_013432nuclear factor of kappa light polypeptide gene52007es 7.51p = 0.01
enhancer in B-cells inhibitor-like 2 (NFKBIL2)
GA_11868NM_032844hypothetical protein FLJ14813 (FLJ14813)621110es 4.51p = 0.02
GA_11893NM_032805hypothetical protein FLJ14549 (FLJ14549)2500025es > 4p = 0.00
GA_11964NM_032620mitochondrial GTP binding protein (GTPBG3)51129es 3.75p = 0.05
GA_11971NM_138575hypothetical protein MGC5352 (MGC5352)41106es 6.01p = 0.04
GA_12025NM_020465NDRG family member 4 (NDRG4)41005es 12.01p = 0.02
GA_1206441005es 12.01p = 0.02
GA_1212NM_001313collapsin response mediator protein 1 (CRMP1)711211es 5.26p = 0.01
GA_12167NM_138357hypothetical protein BC010682 (LOC90550)40004es > 4p = 0.00
GA_1217NM_001316CSE1 chromosome segregation 1-like (yeast)2375237es 4.93p = 0.00
(CSE1L)
GA_12173NM_021912gamma-aminobutyric acid (GABA) A receptor, beta40004es > 4p = 0.00
3 (GABRB3), transcript variant 2
GA_12253NM_032420protocadherin 1 (cadherin-like 1) (PCDH1),50027es 7.51 p = 0.01
transcript variant 2
GA_12279NM_033019PCTAIRE protein kinase 1 (PCTK1), transcript1172424es 2.54p = 0.03
variant 3
GA_12318NM_032447fibrillin3 (KIAA1776)60006es > 4p = 0.00
GA_1236NM_003611oral-facial-digital syndrome 1 (OFD1)40105es 12.01p = 0.02
GA_12367NM_033317hypothetical gene ZD52F10 (ZD52F10)814417es 2.67p = 0.05
GA_12386AB002336KIAA0338 sequence41005es 12.01p = 0.02
GA_12440NM_032383Hermansky-Pudlak syndrome 3 (HPS3)71008es 21.02p = 0.00
GA_12522NM_052860kruppel-like zinc finger protein (ZNF300)622111es 3.60p = 0.03
GA_1260NM_000791dihydrofolate reductase (DHFR)1542425es 4.51p = 0.00
GA_12630NM_015356scribble (SCRIB)1240218es 6.01p = 0.00
GA_12635NM_002913replication factor C (activator 1) 1, 145 kDa (RFC1)80109es 24.03p = 0.00
GA_12640NM_004741nucleolar and coiled-body phosphoprotein 11697638es 2.18p = 0.02
(NOLC1)
GA_1265NM_001387dihydropyrimidinase-like 3 (DPYSL3)391331469es 3.90p = 0.00
GA_12672D86976similar to C.elegans protein (Z37093) sequence52018es 5.01p = 0.03
GA_12767NM_015360KIAA0052 protein (KIAA0052)822113es 4.81p = 0.01
GA_12899BC039246clone IMAGE: 527851752119es 3.75p = 0.05
GA_12900NM_003302thyroid hormone receptor interactor 6 (TRIP6)1233422es 3.60p = 0.00
GA_12949BC033781PAX transcription activation domain interacting40015es 12.01p = 0.02
protein 1 like sequence
GA_12954NM_003972BTAF1 RNA polymerase II, B-TFIID transcription732012es 4.20p = 0.01
factor-associated, 170 kDa (Mot1 homolog, S.
cerevisiae) (BTAF1)
GA_1322NM_000142fibroblast growth factor receptor 3 (achondroplasia,915116es 3.86p = 0.01
thanatophoric dwarfism) (FGFR3), transcript variant 1
GA_1378NM_000178glutathione synthetase (GSS)40116es 6.01p = 0.04
GA_1386NM_001517general transcription factor IIH, polypeptide 4 (52 kD812213es 4.81p = 0.01
subunit) (GTF2H4)
GA_1470NM_003740potassium channel, subfamily K, member 540015es 12.01p = 0.02
(KCNK5)
GA_1523NM_002442musashi homolog 1 (Drosophila) (MSI1)41005es 12.01p = 0.02
GA_1529NM_172164nuclear autoantigenic sperm protein (histone-5873215112es 3.23p = 0.00
binding) (NASP), transcript variant 1
GA_1634NM_002647phosphoinositide-3-kinase, class 3 (PIK3C3)51129es 3.75p = 0.05
GA_1650NM_002660phospholipase C, gamma 1 (formerly subtype 148)1044119es 3.34p = 0.01
(PLCG1)
GA_1662AF195139pinin (PNN) gene, complete cds2397544es 3.29p = 0.00
GA_1665NM_002691polymerase (DNA directed), delta 1, catalytic subunit962118es 3.00p = 0.02
125 kDa (POLD1)
GA_1674NM_002701POU domain, class 5, transcription factor 12412027es 24.03p = 0.00
(POU5F1)
GA_1696NM_000947primase, polypeptide 2A, 58 kDa (PRIM2A)40015es 12.01p = 0.02
GA_1702NM_002740protein kinase C, iota (PRKCI)822113es 4.81p = 0.01
GA_171BC013923Similar to SRY-box containing gene 2 sequence1211317es 7.21p = 0.00
GA_1710NM_002764phosphoribosyl pyrophosphate synthetase 1732113es 3.50p = 0.02
(PRPS1)
GA_1752NM_152881PTK7 protein tyrosine kinase 7 (PTK7), transcript15145337es 2.05p = 0.04
variant 3
GA_1777NM_002862phosphorylase, glycogen; brain (PYGB), nuclear1381224es 3.55p = 0.00
gene encoding mitochondrial protein
GA_1794NM_003610RAE1 RNA export 1 homolog (S. pombe) (RAE1)50027es 7.51p = 0.01
GA_1814NM_002907RecQ protein-like (DNA helicase Q1-like) (RECQL),42006es 6.01p = 0.04
transcript variant 1
GA_1820NM_002916replication factor C (activator 1) 4, 37 kDa (RFC4)602210es 4.51p = 0.02
GA_1865NM_002949mitochondrial ribosomal protein L12 (MRPL12),40026es 6.01p = 0.04
nuclear gene encoding mitochondrial protein
GA_1909NM_003012secreted frizzled-related protein 1 (SFRP1)1281728es 2.25p = 0.05
GA_1938NM_003601SWI/SNF related, matrix associated, actin19104538es 3.00p = 0.00
dependent regulator of chromatin, subfamily a,
member 5 (SMARCA5)
GA_1942NM_003076SWI/SNF related, matrix associated, actin1033319es 3.34p = 0.01
dependent regulator of chromatin, subfamily d,
member 1 (SMARCD1), transcript variant 1
GA_1962NM_152826sorting nexin 1 (SNX1), transcript variant 340015es 12.01p = 0.02
GA_1963NM_003100sorting nexin 2 (SNX2)824115es 3.43p = 0.02
GA_2024NM_003212teratocarcinoma-derived growth factor 1 (TDGF1)2010021es 60.07p = 0.00
GA_2031NM_003234transferrin receptor (p90, CD71) (TFRC)1393429es 2.44p = 0.02
GA_2066NM_003283troponin T1, skeletal, slow (TNNT1)51107es 7.51p = 0.01
GA_2091NM_001069tubulin, beta polypeptide (TUBB)4013111781es 2.93p = 0.00
GA_2123NM_003481ubiquitin specific protease 5 (isopeptidase T) (USP5)1365125es 3.25p = 0.00
GA_2149NM_003413Zic family member 3 heterotaxy 1 (odd-paired70108es 21.02p = 0.00
homolog, Drosophila) (ZIC3)
GA_2175NM_001605alanyl-tRNA synthetase (AARS)2361333es 6.91p = 0.00
GA_2178NM_001104actinin, alpha 3 (ACTN3)61007es 18.02p = 0.00
GA_2234NM_000107damage-specific DNA binding protein 2, 48 kDa810211es 8.01p = 0.00
(DDB2)
GA_2235NM_001358DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 151373124es 3.55p = 0.00
(DDX15)
GA_2240NM_001384diptheria toxin resistance protein required for61209es 6.01p = 0.01
diphthamide biosynthesis-like 2 (S. cerevisiae)
(DPH2L2)
GA_2271NM_001533heterogeneous nuclear ribonucleoprotein L (HNRPL)1014520es 3.00p = 0.02
GA_2289NM_000234ligase I, DNA, ATP-dependent (LIG1)1025320es 3.00p = 0.02
GA_2319NM_000456sulfite oxidase (SUOX), nuclear gene encoding51107es 7.51p = 0.01
mitochondrial protein
GA_2323NM_002164indoleamine-pyrrole 2,3 dioxygenase (INDO)60006es > 4p = 0.00
GA_2334NM_000216Kallmann syndrome 1 sequence (KAL1)50106es 15.02p = 0.00
GA_2337NM_003501acyl-Coenzyme A oxidase 3, pristanoyl (ACOX3)40015es 12.01p = 0.02
GA_23430NM_006474lung type-I cell membrane-associated glycoprotein52108es 5.01p = 0.03
(T1A-2)
GA_23457AK055600cDNA FLJ31038 fis, clone HSYRA2000159620210es 4.51p = 0.02
sequence
GA_23467AK092578cDNA FLJ35259 fis, clone PROST200425140004es > 4p = 0.00
sequence
GA_23468620210es 4.51p = 0.02
GA_2347650207es 7.51p = 0.01
GA_234844301044es 129.15p = 0.00
GA_234852511027es 37.54p = 0.00
GA_2348670007es > 4p = 0.00
GA_234874900049es > 4p = 0.00
GA_2348890009es > 4p = 0.00
GA_234891300013es > 4p = 0.00
GA_234901211014es 18.02p = 0.00
GA_2351451028es 5.01p = 0.03
GA_2351540004es > 4p = 0.00
GA_23525830011es 8.01p = 0.00
GA_2356NM_002851protein tyrosine phosphatase, receptor-type, Z1000010es > 4p = 0.00
polypeptide 1 (PTPRZ1)
GA_2357NM_001670armadillo repeat gene deletes in velocardiofacial60006es > 4p = 0.00
syndrome (ARVCF)
GA_2357241106es 6.01p = 0.04
GA_2357742006es 6.01p = 0.04
GA_23578BM454360AGENCOURT_6402318 NIH_MGC_85cDNA clone60006es > 4p = 0.00
IMAGE: 5497491 5′sequence
GA_2357940004es > 4p = 0.00
GA_23585801110es 12.01p = 0.00
GA_2359640105es 12.01p = 0.02
GA_23612NM_005762tripartite motif-containing 28 protein; KRAB-62109es 6.01p = 0.01
associated protein 1; transcriptional intermediary
factor 1-beta; nuclear corepressor KAP-1 sequence
GA_2361541005es 12.01p = 0.02
GA_2363441005es 12.01p = 0.02
GA_2367NM_003923forkhead box H1 (FOXH1)50005es > 4p = 0.00
GA_2367351006es 15.02p = 0.00
GA_2368341106es 6.01p = 0.04
GA_23981AK057602cDNA FLJ33040 fis, clone THYMU2000382, weakly40004es > 4p = 0.00
similar to 60S RIBOSOMAL PROTEIN L12
GA_2418NM_004317arsA arsenite transporter, ATP-binding, homolog 1631111es 3.60p = 0.03
(bacterial) (ASNA1)
GA_2436NM_004329bone morphogenetic protein receptor, type la731112es 4.20p = 0.01
(BMPR1A)
GA_2442NM_004335bone marrow stromal cell antigen 2 (BST2)1302318es 7.81p = 0.00
GA_2443NM_004336BUB1 budding uninhibited by benzimidazoles 11054221es 2.73p = 0.02
homolog (yeast) (BUB1)
GA_2444NM_004725BUB3 budding uninhibited by benzimidazoles 31247427es 2.40p = 0.03
homolog (yeast) (BUB3)
GA_2447NM_004341carbamoyl-phosphate synthetase 2, aspartate1182122es 3.00p = 0.01
transcarbamylase, and dihydroorotase (CAD),
nuclear gene encoding mitochondrial protein
GA_2467NM_004804WD40 protein Ciao1 (CIAO1)801211es 8.01p = 0.00
GA_2496NM_004229cofactor required for Sp1 transcriptional activation,711211es 5.26p = 0.01
subunit 2, 150 kDa (CRSP2)
GA_2501NM_080598HLA-B associated transcript 1 (BAT1), transcript241313959es 2.06p = 0.01
variant 2
GA_2621NM_004135isocitrate dehydrogenase 3 (NAD+) gamma (IDH3G)52018es 5.01p = 0.03
GA_2641NM_017522low density lipoprotein receptor-related protein 8,70029es 10.51p = 0.00
apolipoprotein e receptor (LRP8), transcript variant 3
GA_2643NM_004635mitogen-activated protein kinase-activated protein60129es 6.01p = 0.01
kinase 3 (MAPKAPK3)
GA_2644NM_004526MCM2 minichromosome maintenance deficient 2,2386441es 3.84p = 0.00
mitotin (S. cerevisiae) (MCM2)
GA_2717NM_004703rabaptin-5 (RAB5EP)51107es 7.51p = 0.01
GA_2728NM_004168succinate dehydrogenase complex, subunit A,52029es 3.75p = 0.05
flavoprotein (Fp) (SDHA), nuclear gene encoding
mitochondrial protein
GA_2751NM_004596small nuclear ribonucleoprotein polypeptide A1134523es 2.75p = 0.02
(SNRPA)
GA_2762NM_004819symplekin; Huntingtin interacting protein I (SPK)1056122es 2.50p = 0.04
GA_2784NM_004818prp28, U5 snRNP 100 kd protein (U5-100 K)16143336es 2.40p = 0.01
GA_2791NM_004652ubiquitin specific protease 9, X chromosome (fat1022115es 6.01p = 0.00
facets-like Drosophila) (USP9X), transcript variant 1
GA_2800NM_004629Fanconi anemia, complementation group G50218es 5.01p = 0.03
(FANCG)
GA_2840NM_004960fusion, derived from t(12; 16) malignant liposarcoma1424121es 6.01p = 0.00
(FUS)
GA_2857NM_004987LIM and senescent cell antigen-like domains 152018es 5.01p = 0.03
(LIMS1)
GA_2868NM_005006NADH dehydrogenase (ubiquinone) Fe-S protein 1,612211es 3.60p = 0.03
75 kDa (NADH-coenzyme Q reductase) (NDUFS1)
GA_2889NM_005032plastin 3 (T isoform) (PLS3)351871979es 2.39p = 0.00
GA_2897NM_005044protein kinase, X-linked (PRKX)630110es 4.51p = 0.02
GA_2898NM_005049PWP2 periodic tryptophan protein homolog (yeast)60129es 6.01p = 0.01
(PWP2H)
GA_2937NM_005207v-crk sarcoma virus CT10 oncogene homolog61007es 18.02p = 0.00
(avian)-like (CRKL)
GA_2945NM_005232EphA1 (EPHA1)51118es 5.01p = 0.03
GA_2962NM_005314gastrin-releasing peptide receptor (GRPR)40004es > 4p = 0.00
GA_2984NM_005474histone deacetylase 5 (HDAC5), transcript variant 1641011es 3.60p = 0.03
GA_2988NM_005397podocalyxin-like (PODXL)59235895es 4.92p = 0.00
GA_3017NM_000098carnitine palmitoyltransferase II (CPT2), nuclear41106es 6.01p = 0.04
gene encoding mitochondrial protein
GA_3024NM_003902far upstream element (FUSE) binding protein 11346326es 3.00p = 0.01
(FUBP1)
GA_3042NM_005760CCAAT-box-binding transcription factor (CBF2)922316es 3.86p = 0.01
GA_3055NM_005864signal transduction protein (SH3 containing) (EFS2),61018es 9.01p = 0.00
transcript variant 1
GA_3112NM_005789proteasome (prosome, macropain) activator subunit1226222es 3.60p = 0.00
3 (PA28 gamma; Ki) (PSME3)
GA_3118NM_005778RNA binding motif protein 5 (RBM5)1164425es 2.36p = 0.04
GA_3130NM_005785hypothetical SBBI03 protein (SBB103)41005es 12.01p = 0.02
GA_3134NM_005877splicing factor 3a, subunit 1, 120 kDa (SF3A1)1014318es 3.75p = 0.01
GA_3137NM_005628solute carrier family 1 (neutral amino acid231121349es 2.66p = 0.00
transporter), member 5 (SLC1A5)
GA_3144NM_005839serine/arginine repetitive matrix 1 (SRRM1)1665835es 2.53p = 0.01
GA_3150NM_139315TAF6 RNA polymerase II, TATA box binding protein40004es > 4p = 0.00
(TBP)-associated factor, 80 kDa (TAF6), transcript
variant 2
GA_3175NM_005741zinc finger protein 263 (ZNF263)740112es 4.20p = 0.01
GA_3178NM_006017prominin-like 1 (mouse) (PROML1)722011es 5.26p = 0.01
GA_3183NM_006035CDC42 binding protein kinase beta (DMPK-like)1350321es 4.88p = 0.00
(CDC42BPB)
GA_3219NM_005928milk fat globule-EGF factor 8 protein (MFGE8)3011111466es 2.50p = 0.00
GA_32806BE568403601341979F1 NIH_MGC_53cDNA clone925218es 3.00p = 0.02
IMAGE: 3684283 5′ sequence
GA_32836AK055259cDNA FLJ30697 fis, clone FCBBF2000815, weakly40116es 6.01p = 0.04
similar to ZYXIN
GA_32842830011es 8.01p = 0.00
GA_3286070007es > 4p = 0.00
GA_32868AK091598cDNA FLJ34279 fis, clone FEBRA200383340004es > 4p = 0.00
sequence
GA_32887NM_006141dynein, cytoplasmic, light intermediate polypeptide 2720211es 5.26p = 0.01
(DNCLI2)
GA_3289554009es 3.75p = 0.05
GA_32908AL832758mRNA; cDNA DKFZp686C0927 (from clone40004es > 4p = 0.00
DKFZp686C0927) sequence
GA_3291340004es > 4p = 0.00
GA_3291740004es > 4p = 0.00
GA_3292670007es > 4p = 0.00
GA_3294740206es 6.01p = 0.04
GA_3297940004es > 4p = 0.00
GA_3298540004es > 4p = 0.00
GA_3321NM_006345chromosome 4 open reading frame 1 (C4orf1)1054221es 2.73p = 0.02
GA_33423NM_002537ornithine decarboxylase antizyme 2 (OAZ2)1817329es 4.91p = 0.00
GA_3343NM_006392nucleolar protein 5A (56 kDa with KKE/D repeat)16511537es 2.29p = 0.02
(NOL5A)
GA_33455NM_006047RNA binding motif protein 12 (RBM12), transcript1743428es 4.64p = 0.00
variant 1
GA_33475NM_004902RNA-binding region (RNP1, RRM) containing 21228224es 3.00p = 0.01
(RNPC2)
GA_33503NM_018135mitochondrial ribosomal protein S18A (MRPS18A),41106es 6.01p = 0.04
nuclear gene encoding mitochondrial protein
GA_33528NM_032803solute carrier family 7 (cationic amino acid40105es 12.01p = 0.02
transporter, y+ system), member 3 (SLC7A3)
GA_33533BC037428Unknown (protein for MGC: 46327) sequence741113es 3.50p = 0.02
GA_33548NM_015638chromosome 20 open reading frame 188730111es 5.26p = 0.01
(C20orf188)
GA_33588AL832967mRNA; cDNA DKFZp666B082 (from clone50218es 5.01p = 0.03
DKFZp666B082) sequence
GA_33680NM_016089KRAB-zinc finger protein SZF1-1 (SZF1)1501016es 45.05p = 0.00
GA_33684NM_005186calpain 1, (mu/l) large subunit (CAPN1)1381527es 2.79p = 0.01
GA_33691AL117507mRNA; cDNA DKFZp434F1935 (from clone41106es 6.01p = 0.04
DKFZp434F1935); partial cds
GA_33704AL833549mRNA; cDNA DKFZp686N183 (from clone41106es 6.01p = 0.04
DKFZp686N183) sequence
GA_33730AL832779mRNA; cDNA DKFZp686H157 (from clone40116es 6.01p = 0.04
DKFZp686H157) sequence
GA_33747NM_032737lamin B2 (LMNB2)1183325es 2.36p = 0.04
GA_33755NM_033547hypothetical gene MGC16733 similar to CG1211350016es 15.02p = 0.00
(MGC16733)
GA_33772BF2230237q27f09.x1 NCI_CGAP_GC6cDNA clone50005es > 4p = 0.00
IMAGE: 3699616 3′ sequence
GA_33816NM_015850fibroblast growth factor receptor 1 (fms-related35129561es 4.04p = 0.00
tyrosine kinase 2, Pfeiffer syndrome) (FGFR1),
transcript variant 2
GA_33874NM_017730hypothetical protein FLJ20259 (FLJ20259)1964433es 4.08p = 0.00
GA_33876NM_148904oxysterol binding protein-like 9 (OSBPL9), transcript51028es 5.01p = 0.03
variant 1
GA_33877NM_020796sema domain, transmembrane domain (TM), and16111432es 3.00p = 0.00
cytoplasmic domain, (semaphorin) 6A (SEMA6A)
GA_33959NM_030964sprouty homolog 4 (Drosophila) (SPRY4)41005es 12.01p = 0.02
GA_34010AK000089cDNA FLJ20082 fis, clone COL03245803011es 8.01p = 0.00
GA_34047NM_170752chromodomain protein, Y chromosome-like (CDYL),811111es 8.01p = 0.00
transcript variant 3
GA_34061NM_152429hypothetical protein MGC39320 (MGC39320)71019es 10.51p = 0.00
GA_3407NM_006328RNA binding motif protein 14 (RBM14)1634326es 4.81p = 0.00
GA_34077NM_133457likely ortholog of mouse type XXVI collagen704213es 3.50p = 0.02
(COL26A1)
GA_34137NM_020314esophageal cancer associated protein (MGC16824)61007es 18.02p = 0.00
GA_34200NM_005763aminoadipate-semialdehyde synthase (AASS)1000212es 15.02p = 0.00
GA_34219NM_018449ubiquitin associated protein 2 (UBAP2), transcript62109es 6.01p = 0.01
variant 1
GA_34245NM_004922SEC24 related gene family, member C (S.1060117es 4.29p = 0.00
cerevisiae) (SEC24C)
GA_34270NM_152758hypothetical protein FLJ31657 (FLJ31657)52108es 5.01p = 0.03
GA_34280NM_000702ATPase, Na+/K+ transporting, alpha 2 (+)40004es > 4p = 0.00
polypeptide (ATP1A2)
GA_34320NM_006461sperm associated antigen 5 (SPAG5)1465227es 3.23p = 0.00
GA_34322NM_023926hypothetical protein FLJ12895 (FLJ12895)50128es 5.01p = 0.03
GA_3436NM_018062hypothetical protein FLJ10335 (FLJ10335)51309es 3.75p = 0.05
GA_34419NM_002952ribosomal protein S2 (RPS2)19511742es 2.48p = 0.00
GA_34438NM_006521transcription factor binding to IGHM enhancer 352029es 3.75p = 0.05
(TFE3)
GA_34480NM_012218interleukin enhancer binding factor 3, 90 kDa (ILF3),41261320100es 2.09p = 0.00
transcript variant 1
GA_34503NM_005762tripartite motif-containing 28 (TRIM28)1368229es 2.44p = 0.02
GA_34505NM_002065glutamate-ammonia ligase (glutamine synthase)2118232es 5.73p = 0.00
(GLUL)
GA_34522NM_000071cystathionine-beta-synthase (CBS)721212es 4.20p = 0.01
GA_34539NM_002880v-raf-1 murine leukemia viral oncogene homolog 11473024es 4.20p = 0.00
(RAF1)
GA_34563NM_007192suppressor of Ty 16 homolog (S. cerevisiae)911314es 5.41p = 0.00
(SUPT16H)
GA34594NM_004426polyhomeotic-like 1 (Drosophila) (PHC1)60006es > 4p = 0.00
GA_34606NM_015570autism susceptibility candidate 2 (AUTS2)70029es 10.51p = 0.00
GA_34626NM_004911protein disulfide isomerase related protein (calcium-52119es 3.75p = 0.05
binding protein, intestinal-related) (ERP70)
GA_34655X74794P1 Cdc21 protein sequence3495452es 5.67p = 0.00
GA_34679NM_002015forkhead box O1A (rhabdomyosarcoma) (FOXO1A)40116es 6.01p = 0.04
GA_34715NM_002421matrix metalloproteinase 1 (interstitial collagenase)51028es 5.01p = 0.03
(MMP1)
GA_34820NM_024656hypothetical protein FLJ22329 (FLJ22329)51118es 5.01p = 0.03
GA_34875NM_004459fetal Alzheimer antigen (FALZ)52029es 3.75p = 0.05
GA_35037NM_004426polyhomeotic-like 1 (Drosophila) (PHC1)3432544es 10.21p = 0.00
GA_35125NM_005386neuronatin (NNAT)53019es 3.75p = 0.05
GA_35141NM_018555zinc finger protein 331; zinc finger protein 4631325222es 4.34p = 0.00
(ZNF361)
GA_35150AB014542KIAA0642 protein sequence51219es 3.75p = 0.05
GA_35158NM_015327KIAA1089 protein (KIAA1089)1062220es 3.00p = 0.02
GA_3520NM_005915MCM6 minichromosome maintenance deficient 61255224es 3.00p = 0.01
(MIS5 homolog, S. pombe) (S. cerevisiae) (MCM6)
GA_35206NM_005678SNRPN upstream reading frame (SNURF),20109948es 2.15p = 0.01
transcript variant 1
GA_35221NM_020442KIAA1885 protein (DKFZP434L1435)60006es > 4p = 0.00
GA_35231NM_014389proline and glutamic acid rich nuclear protein14113129es 2.80p = 0.01
(PELP1)
GA_35233NM_138615DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 301134523es 2.75p = 0.02
(DDX30), transcript variant 1
GA_35239NM_014633KIAA0155 gene product (KIAA0155)51208es 5.01p = 0.03
GA_35260NM_004104fatty acid synthase (FASN)62019es 6.01p = 0.01
GA_35393NM_006861RAB35, member RAS oncogene family (RAB35)722112es 4.20p = 0.01
GA_35395NM_024662hypothetical protein FLJ10774 (FLJ10774)640111es 3.60p = 0.03
GA_354051283124es 3.00p = 0.01
GA_35422NM_021211transposon-derived Buster1 transposase-like protein40026es 6.01p = 0.04
(LOC58486)
GA_35457AJ459424JEMMA protein sequence712111es 5.26p = 0.01
GA_35481NM_006452phosphoribosylaminoimidazole carboxylase,361413972es 3.00p = 0.00
phosphoribosylaminoimidazole succinocarboxamide
synthetase (PAICS)
GA_35495NM_003472DEK oncogene (DNA binding) (DEK)16381037es 2.29p = 0.02
GA_35547NM_032202hypothetical protein KIAA1109 (KIAA1109)40026es 6.01p = 0.04
GA_35558AL831917hypothetical protein sequence61119es 6.01p = 0.01
GA_3559NM_005629solute carrier family 6 (neurotransmitter transporter,51017es 7.51p = 0.01
creatine), member 8 (SLC6A8)
GA_35606NM_024586oxysterol binding protein-like 9 (OSBPL9), transcript41106es 6.01p = 0.04
variant 6
GA_35607AB002366KIAA0368 sequence842317es 2.67p = 0.05
GA_35615NM_000251mutS homolog 2, colon cancer, nonpolyposis type 11666028es 4.00p = 0.00
(E. coli) (MSH2)
GA_35687NM_033502transcriptional regulating protein 132 (TReP-132),50005es > 4p = 0.00
transcript variant 1
GA_35693NM_014782armadillo repeat protein ALEX2 (ALEX2)1284327es 2.40p = 0.03
GA_35762NM_020765retinoblastoma-associated factor 600 (RBAF600)1243120es 4.51p = 0.00
GA_35833NM_015878ornithine decarboxylase antizyme inhibitor (OAZIN),17810641es 2.13p = 0.02
transcript variant 1
GA_35852AK056479cDNA FLJ31917 fis, clone NT2RP7004925, weakly42006es 6.01p = 0.04
similar to VASODILATOR-STIMULATED
PHOSPHOPROTEIN
GA_35869AB011112KIAA0540 protein sequence52108es 5.01p = 0.03
GA_35905NM_006640MLL septin-like fusion (MSF)28256665es 2.27p = 0.00
GA_35913NM_018265hypothetical protein FLJ10901 (FLJ10901)50117es 7.51p = 0.01
GA_3593NM_000270nucleoside phosphorylase (NP)51118es 5.01p = 0.03
GA_35955NM_022754sideroflexin 1 (SFXN1)51107es 7.51p = 0.01
Gk_35984NM_015340leucyl-tRNA synthetase, mitochondrial (LARS2),40206es 6.01p = 0.04
nuclear gene encoding mitochondrial protein
GA_36015NM_015341barren homolog (Drosophila) (BRRN1)911213es 6.76p = 0.00
GA_36017AK074137FLJ00210 protein sequence40105es 12.01p = 0.02
GA_36019NM_012426splicing factor 3b, subunit 3, 130 kDa (SF3B3)1132319es 4.13p = 0.00
GA_36080NM_152333chromosome 14 open reading frame 69 (C14orf69)1411319es 8.41p = 0.00
GA_36090NM_020444KIAA1191 protein (KIAA1191)971219es 2.70p = 0.03
GA_3611NM_001211BUB1 budding uninhibited by benzimidazoles 11344425es 3.25p = 0.00
homolog beta (yeast) (BUB1B)
GA_36126NM_004286GTP binding protein 1 (GTPBP1)41005es 12.01p = 0.02
GA_36127NM_016121NY-REN-45 antigen (NY-REN-45)51219es 3.75p = 0.05
GA_36129NM_018353hypothetical protein FLJ11186 (FLJ11186)1003316es 5.01p = 0.00
GA_36133NM_020428CTL2 gene (CTL2)960015es 4.51p = 0.00
GA_36137NM_007363non-POU domain containing, octamer-binding3912221487es 2.44p = 0.00
(NONO)
GA_36139NM_004990methionine-tRNA synthetase (MARS)1131015es 8.26p = 0.00
GA_36155AB020719KIAA0912 protein sequence51107es 7.51p = 0.01
GA_36183NM_016333serine/arginine repetitive matrix 2 (SRRM2)23219154es 2.23p = 0.00
GA_36184NM_020151START domain containing 7 (STARD7), transcript1760124es 7.29p = 0.00
variant 1
GA_36219NM_152392hypothetical protein DKFZp564C236712111es 5.26p = 0.01
(DKFZp564C236)
GA_36221NM_000966retinoic acid receptor, gamma (RARG)620210es 4.51p = 0.02
GA_36241NM_018031WD repeat domain 6 (WDR6), transcript variant 1292011767es 2.29p = 0.00
GA_36270NM_003715vesicle docking protein p115 (VDP)1254223es 3.28p = 0.01
GA_3628NM_006579emopamil binding protein (sterol isomerase) (EBP)713011es 5.26p = 0.01
GA_36307NM_015897protein inhibitor of activated STAT protein PIASy52209es 3.75p = 0.05
(PIASY)
GA_36389NM_025256HLA-B associated transcript 8 (BAT8), transcript1156224es 2.54p = 0.03
variant NG36/G9a-SPI
GA_36450NM_003051solute carrier family 16 (monocarboxylic acid2277541es 3.48p = 0.00
transporters), member 1 (SLC16A1)
GA_36474X87832NOV54009es 3.75p = 0.05
GA_36491NM_024611similar to NMDA receptor-regulated gene 2 (mouse)640111es 3.60p = 0.03
(FLJ11896)
GA_36526NM_033557similar to putative transmembrane protein; homolog632011es 3.60p = 0.03
of yeast Golgi membrane protein Yif1p (Yip1p-
interacting factor) (LOC90522)
GA_36545AB014600KIAA0700 protein sequence841316es 3.00p = 0.04
GA_36581NM_018071hypothetical protein FLJ10357 (FLJ10357)63009es 6.01p = 0.01
GA_36592AB002363KIAA0365 sequence61018es 9.01p = 0.00
GA_36595NM_024718hypothetical protein FLJ10101 (FLJ10101)842317es 2.67p = 0.05
GA_36643NM_003918glycogenin 2 (GYG2)51006es 15.02p = 0.00
GA_36675NM_003605O-linked N-acetylglucosamine (GIcNAc) transferase940114es 5.41p = 0.00
(UDP-N-acetylglucosamine: polypeptide-N-
acetylglucosaminyl transferase) (OGT)
GA_36692NM_015902progestin induced protein (DD5)841215es 3.43p = 0.02
GA_36707NM_021627sentrin-specific protease (SENP2)40105es 12.01p = 0.02
GA_36730AF164609endogenous retrovirus HERV-K101, complete50005es > 4p = 0.00
sequence
GA_36734AF376802neuroligin 2 sequence63009es 6.01p = 0.01
GA_36771NM_016238anaphase-promoting complex subunit 7 (ANAPC7)60107es 18.02p = 0.00
GA_36788NM_000141fibroblast growth factor receptor 2 (bacteria-951217es 3.38p = 0.02
expressed kinase, keratinocyte growth factor
receptor, craniofacial dysostosis 1, Crouzon
syndrome, Pfeiffer syndrome, Jackson-Weiss
syndrome) (FGFR2), transcript variant 1
GA_36798NM_000071cystathionine-beta-synthase (CBS)1101214es 11.01p = 0.00
GA_36842NM_006197pericentriolar material 1 (PCM1)631111es 3.60p = 0.03
GA_36897NM_006773DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 18732113es 3.50p = 0.02
(Myc-regulated) (DDX18)
GA_36933NM_016424cisplatin resistance-associated overexpressed1914731es 4.76p = 0.00
protein (LUC7A)
GA_36936NM_149379Williams Beuren syndrome chromosome region 20C1164122es 3.00p = 0.01
(WBSCR20C), transcript variant 4
GA_36951NM_005916MCM7 minichromosome maintenance deficient 7 (S.19361139es 2.85p = 0.00
cerevisiae) (MCM7)
GA_36957NM_024642UDP-N-acetyl-alpha-D-galactosamine: polypeptide40116es 6.01p = 0.04
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
(GALNT12)
GA_36964NG_001332T cell receptor alpha delta locus (TCRA/TCRD) on1620018es 24.03p = 0.00
chromosome 14
GA_36974AL834155mRNA; cDNA DKFZp761O0611 (from clone41016es 6.01p = 0.04
DKFZp761O0611) sequence
GA_36977NM_020927KIAA1576 protein (KIAA1576)921012es 9.01p = 0.00
GA_37071NM_153759DNA (cytosine-5-)-methyltransferase 3 alpha921113es 6.76p = 0.00
(DNMT3A), transcript variant 2
GA_37078NM_014977apoptotic chromatin condensation inducer in the1062220es 3.00p = 0.02
nucleus (ACINUS)
GA_37079NM_032156EEG1 (EEG1), transcript variant S70007es > 4p = 0.00
GA_37094AL832758mRNA; cDNA DKFZp686C0927 (from clone1113318es 4.72p = 0.00
DKFZp686C0927) sequence
GA_37215NM_019023hypothetical protein FLJ10640 (FLJ10640)713011es 5.26p = 0.01
GA_3723NM_003750eukaryotic translation initiation factor 3, subunit 10301561768es 2.37p = 0.00
theta, 150/170 kDa (EIF3S10)
GA_37251NM_000604fibroblast growth factor receptor 1 (fms-related715013es 3.50p = 0.02
tyrosine kinase 2, Pfeiffer syndrome) (FGFR1),
transcript variant 1
GA_3730NM_003751eukaryotic translation initiation factor 3, subunit 91352323es 3.90p = 0.00
eta, 116 kDa (EIF3S9)
GA_37314NM_003169suppressor of Ty 5 homolog (S. cerevisiae)1461122es 5.26p = 0.00
(SUPT5H)
GA_37354NM_015726H326 (H326)51107es 7.51p = 0.01
GA_37372NM_024658importin 4 (FLJ23338)1270322es 3.60p = 0.00
GA_37389NM_017647FtsJ homolog 3 (E. coli) (FTSJ3)1375126es 3.00p = 0.01
GA_37391NM_004938death-associated protein kinase 1 (DAPK1)60017es 18.02p = 0.00
GA_37399NM_148842Williams-Beuren syndrome chromosome region 161001213es 10.01p = 0.00
(WBSCR16), transcript variant 2
GA_37409NM_021145cyclin D binding myb-like transcription factor 151028es 5.01p = 0.03
(DMTF1)
GA_37424NM_152742hypothetical protein DKFZp547M10960129es 6.01p = 0.01
(DKFZp547M109)
GA_37431NM_006034p53-induced protein (PIG11)741012es 4.20p = 0.01
GA_37478NM_014670basic leucine zipper and W2 domains 1 (BZW1)241311957es 2.18p = 0.01
GA_37504NM_153613PISC domain containing hypothetical protein51039es 3.75p = 0.05
(LOC254531)
GA_37536AK026970cDNA: FLJ23317 fis, clone HEP12062, highly similar52108es 5.01p = 0.03
to AF008936syntaxin-16B mRNA
GA_37538NM_080797death associated transcription factor 1 (DATF1),60107es 18.02p = 0.00
transcript variant 3
GA_37589AL834216hypothetical protein sequence40105es 12.01p = 0.02
GA_37595NM_015062KIAA0595 protein (KIAA0595)730111es 5.26p = 0.01
GA_37606NM_019012phosphoinositol 3-phosphate-binding protein-242006es 6.01p = 0.04
(PEPP2)
GA_37707NM_022574PERQ amino acid rich, with GYF domain 1 (PERQ1)40105es 12.01p = 0.02
GA_37729NM_005436DNA segment on chromosome 10 (unique) 170841316es 3.00p = 0.04
(D10S170)
GA_37737NM_003707RuvB-like 1 (E. coli) (RUVBL1)52029es 3.75p = 0.05
GA_37755NM_015044golgi associated, gamma adaptin ear containing,1350220es 5.58p = 0.00
ARF binding protein 2 (GGA2), transcript variant 1
GA_37788NM_133631roundabout, axon guidance receptor, homolog 1741012es 4.20p = 0.01
(Drosophila) (ROBO1), transcript variant 2
GA_37800NM_032701hypothetical protein MGC2705 (MGC2705)41016es 6.01p = 0.04
GA_37805NM_025222hypothetical protein PRO2730 (PRO2730)613111es 3.60p = 0.03
GA_37866NM_138927SON DNA binding protein (SON), transcript variant f632011es 3.60p = 0.03
GA_37877NM_012215meningioma expressed antigen 5 (hyaluronidase)1043320es 3.00p = 0.02
(MGEA5)
GA_37884AB032993KIAA1167 protein sequence52108es 5.01p = 0.03
GA_37904NM_000478alkaline phosphatase, liver/bone/kidney (ALPL)41106es 6.01p = 0.04
GA_37914NM_153464interleukin enhancer binding factor 3, 90 kDa (ILF3),911011es 13.52p = 0.00
transcript variant 3
GA_38001NM_152312hypothetical protein FLJ35207 (FLJ35207)41005es 12.01p = 0.02
GA_38023NM_015846methyl-CpG binding domain protein 1 (MBD1),70108es 21.02p = 0.00
transcript variant 1
GA_3802941005es 12.01p = 0.02
GA_38084NM_015658DKFZP564C186 protein (DKFZP564C186)1353526es 3.00p = 0.01
GA_3818NM_006833COP9 subunit 6 (MOV34 homolog, 34 kD) (COPS6)811616es 3.00p = 0.04
GA_38225NM_007152zinc finger protein 195 (ZNF195)40206es 6.01p = 0.04
GA_38238AL133439mRNA full length insert cDNA clone EUROIMAGE40206es 6.01p = 0.04
200978
GA_38243BM920378AGENCOURT_6709352 NIH_MGC_122cDNA52119es 3.75p = 0.05
clone IMAGE: 5750332 5′ sequence
GA_3826NM_006875pim-2 oncogene (PIM2)50106es 15.02p = 0.00
GA_38266NM_144504junctional adhesion molecule 1 (JAM1), transcript1843833es 3.60p = 0.00
variant 5
GA_38278NM_019852methyltransferase like 3 (METTL3)804315es 3.43p = 0.02
GA_38283NM_013411adenylate kinase 2 (AK2), nuclear gene encoding1666331es 3.20p = 0.00
mitochondrial protein, transcript variant AK2B
GA_38292NM_005455zinc finger protein 265 (ZNF265)623011es 3.60p = 0.03
GA_38304NM_002394solute carrier family 3 (activators of dibasic and40105es 12.01p = 0.02
neutral amino acid transport), member 2 (SLC3A2)
GA_38370NM_024923nucleoporin 210 (NUP210)802111es 8.01p = 0.00
GA_38371NM_018003uveal autoantigen with coiled-coil domains and51129es 3.75p = 0.05
ankyrin repeats (UACA)
GA_38377NM_033288KRAB zinc finger protein KR18 (KR18)52108es 5.01p = 0.03
GA_38426NG_001332T cell receptor alpha delta locus (TCRA/TCRD) on712010es 7.01p = 0.00
chromosome 14
GA_38431NM_021238TERA protein (TERA)2652841es 5.21p = 0.00
GA_38500AB040903KIAA1470 protein sequence21127747es 2.43p = 0.00
GA_3851NM_006759UDP-glucose pyrophosphorylase 2 (UGP2)1745228es 4.64p = 0.00
GA_38548AB033107KIAA1281 protein sequence620311es 3.60p = 0.03
GA_3861NM_006845kinesin family member 2C (KIF2C)914115es 4.51p = 0.00
GA_38627AL831836hypothetical protein sequence51129es 3.75p = 0.05
GA_38635NM_133370KIAA1966 protein (KIAA1966)944219es 2.70p = 0.03
GA_38666BC000401splicing factor 3b, subunit 2, 145 kD sequence1699640es 2.00p = 0.04
GA_38677NM_153280ubiquitin-activating enzyme E1 (A1S9T and BN7544411014109es 2.03p = 0.00
temperature sensitivity complementing) (UBE1),
transcript variant 2
GA_38691NM_004550NADH dehydrogenase (ubiquinone) Fe-S protein 2,912618es 3.00p = 0.02
49 kDa (NADH-coenzyme Q reductase) (NDUFS2)
GA_387AB020648KIAA0841 protein sequence41106es 6.01p = 0.04
GA_38786NM_138769mitochondrial Rho 2 (MIRO-2)802313es 4.81p = 0.01
GA_38804NM_018249CDK5 regulatory subunit associated protein 253109es 3.75p = 0.05
(CDK5RAP2)
GA_38826NM_133171engulfment and cell motility 2 (ced-12 homolog, C.41016es 6.01p = 0.04
elegans) (ELMO2), transcript variant 1
GA_38854NM_032228hypothetical protein FLJ22728 (FLJ22728)52029es 3.75p = 0.05
GA_38867NM_018189hypothetical protein FLJ10713 (FLJ10713)3426143es 11.35p = 0.00
GA_3897NM_007015chondromodulin I precursor (CHM-I)40105es 12.01p = 0.02
GA_3898NM_006892DNA (cytosine-5-)-methyltransferase 3 beta4923155es 24.53p = 0.00
(DNMT3B)
GA_3899NM_144733E1B-55 kDa-associated protein 5 (E1B-AP5),23166752es 2.38p = 0.00
transcript variant 2
GA_3938NM_006925splicing factor, arginine/serine-rich 5 (SFRS5)29424663es 2.56p = 0.00
GA_3984NM_006114translocase of outer mitochondrial membrane 40712212es 4.20p = 0.01
homolog (yeast) (TOMM40)
GA_4038NM_007223putative G protein coupled receptor (GPR)52007es 7.51p = 0.01
GA_4059NM_007221polyamine-modulated factor 1 (PMF1)622111es 3.60p = 0.03
GA_4148NM_003826N-ethylmaleimide-sensitive factor attachment41016es 6.01p = 0.04
protein, gamma (NAPG)
GA_4176NM_004448v-erb-b2 erythroblastic leukemia viral oncogene15112533es 2.50p = 0.01
homolog 2, neuro/glioblastoma derived oncogene
homolog (avian) (ERBB2)
GA_4247NM_001975enolase 2, (gamma, neuronal) (ENO2)50207es 7.51p = 0.01
GA_4251NM_002528nth endonuclease III-like 1 (E. coli) (NTHL1)40015es 12.01p = 0.02
GA_4253NM_004761RAB2, member RAS oncogene family-like (RAB2L)632011es 3.60p = 0.03
GA_4255NM_006929superkiller viralicidic activity 2-like (S. cerevisiae)54009es 3.75p = 0.05
(SKIV2L)
GA_4258NM_080911uracil-DNA glycosylase (UNG), nuclear gene936018es 3.00p = 0.02
encoding mitochondrial protein, transcript variant 2
GA_4263NM_006247protein phosphatase 5, catalytic subunit (PPP5C)613111es 3.60p = 0.03
GA_4268NM_003852transcriptional intermediary factor 1 (TIF1)1344122es 4.34p = 0.00
GA_4295NM_005255cyclin G associated kinase (GAK)632011es 3.60p = 0.03
GA_4302NM_005054RAN binding protein 2-like 1 (RANBP2L1), transcript40015es 12.01p = 0.02
variant 1
GA_4332NM_019900ATP-binding cassette, sub-family C (CFTR/MRP),832114es 4.00p = 0.01
member 1 (ABCC1), transcript variant 5
GA_4446NM_002388MCM3 minichromosome maintenance deficient 3 (S.3848757es 6.01p = 0.00
cerevisiae) (MCM3)
GA_4478AK074826cDNA FLJ90345 fis, clone NT2RP2002974, highly40004es > 4p = 0.00
similar to HOMEOBOX PROTEIN SIX5 sequence
GA_4551NM_007375TAR DNA binding protein (TARDBP)17114537es 2.55p = 0.01
GA_4568NM_012100aspartyl aminopeptidase (DNPEP)811111es 8.01p = 0.00
GA_458AF080158lkB kinase-b sequence40004es > 4p = 0.00
GA_4619NM_012295calcineurin binding protein 1 (CABIN1)641011es 3.60p = 0.03
GA_4659NM_134434RAD54B homolog (RAD54B), transcript variant 240206es 6.01p = 0.04
GA_4689NM_012470transportin-SR (TRN-SR)1143119es 4.13p = 0.00
GA_4693NM_012256zinc finger protein 212 (ZNF212)50128es 5.01p = 0.03
GA_4694NM_012482zinc finger protein 281 (ZNF281)40004es > 4p = 0.00
GA_4788NM_016263Fzr1 protein (FZR1)51039es 3.75p = 0.05
GA_4802AB033092KIAA1266 protein sequence942015es 4.51p = 0.00
GA_4973NM_015503SH2-B homolog (SH2B)52119es 3.75p = 0.05
GA_5037AB037847KIAA1426 protein sequence623011es 3.60p = 0.03
GA_5052NM_015705hypothetical protein DJ1042K10.2 (DJ1042K10.2)922114es 5.41p = 0.00
GA_5301NM_145251serine/threonine/tyrosine interacting protein (STYX)40004es > 4p = 0.00
GA_5391NM_002968sal-like 1 (Drosophila) (SALL1)71109es 10.51p = 0.00
GA_5470NM_002610pyruvate dehydrogenase kinase, isoenzyme 140116es 6.01p = 0.04
(PDK1), nuclear gene encoding mitochondrial
protein
GA_5475NM_012280FtsJ homolog 1 (E. coli) (FTSJ1)60107es 18.02p = 0.00
GA_5493NM_005415solute carrier family 20 (phosphate transporter),610310es 4.51p = 0.02
member 1 (SLC20A1)
GA_5504NM_007318presenilin 1 (Alzheimer disease 3) (PSEN1),51129es 3.75p = 0.05
transcript variant I-463
GA_5513NM_014324alpha-methylacyl-CoA racemase (AMACR)40105es 12.01p = 0.02
GA_5534NM_014316calcium regulated heat stable protein 1, 24 kDa813113es 4.81p = 0.01
(CARHSP1)
GA_5620NM_014516CCR4-NOT transcription complex, subunit 3851216es 3.00p = 0.04
(CNOT3)
GA_5622NM_014434NADPH-dependent FMN and FAD containing50106es 15.02p = 0.00
oxidoreductase (NR1)
GA_5665NM_014264serine/threonine kinase 18 (STK18)51129es 3.75p = 0.05
GA_5703NM_134264SOCS box-containing WD protein SWiP-1 (WSB1),442991294es 2.64p = 0.00
transcript variant 3
GA_5729NM_015456cofactor of BRCA1 (COBRA1)722011es 5.26p = 0.01
GA_5735NM_015537DKFZP586J1624 protein (DKFZP586J1624)41016es 6.01p = 0.04
GA_5811NM_014669KIAA0095 gene product (KIAA0095)1034017es 4.29p = 0.00
GA_5829NM_014773KIAA0141 gene product (KIAA0141)812314es 4.00p = 0.01
GA_5836NM_014865chromosome condensation-related SMC-associated1254223es 3.28p = 0.01
protein 1 (KIAA0159)
protein 1 (KIAA0159)
GA_5906NM_014675KIAA0445 gene product (KIAA0445)53109es 3.75p = 0.05
GA_5911NM_014857KIAA0471 gene product (KIAA0471)40026es 6.01p = 0.04
GA_5954NM_014871KIAA0710 gene product (KIAA0710)52007es 7.51p = 0.01
GA_5961NM_014828chromosome 14 open reading frame 92 (C14orf92)730313es 3.50p = 0.02
GA_5981NM_014921lectomedin-2 (KIAA0821)1150117es 5.51p = 0.00
GA_6007NM_014962BTB (POZ) domain containing 3 (BTBD3)703313es 3.50p = 0.02
GA_6011NM_014963KIAA0963 protein (KIAA0963)41005es 12.01p = 0.02
GA_6106NM_015888hook1 protein (HOOK1)50016es 15.02p = 0.00
GA_6133NM_016335proline dehydrogenase (oxidase) 1 (PRODH),51208es 5.01p = 0.03
nuclear gene encoding mitochondrial protein
GA_6139NM_016448RA-regulated nuclear matrix-associated protein61209es 6.01p = 0.01
(RAMP)
GA_6232NM_016223protein kinase C and casein kinase substrate in51118es 5.01p = 0.03
neurons 3 (PACSIN3)
GA_6271NM_016518pipecolic acid oxidase (PIPOX)40004es > 4p = 0.00
GA_6317NM_015935CGI-01 protein (CGI-01)721313es 3.50p = 0.02
GA_638AB024494huntingtin interacting protein 3 sequence40206es 6.01p = 0.04
GA_6438NM_002889retinoic acid receptor responder (tazarotene40015es 12.01p = 0.02
induced) 2 (RARRES2)
GA_6445NM_017424cat eye syndrome chromosome region, candidate 11022418es 3.75p = 0.01
(CECR1)
GA_6460NM_017415kelch-like 3 (Drosophila) (KLHL3)40004es > 4p = 0.00
GA_6649NM_148956Williams Beuren syndrome chromosome region 20A40004es > 4p = 0.00
(WBSCR20A), transcript variant 1
GA_6665NM_018077hypothetical protein FLJ10377 (FLJ10377)702312es 4.20p = 0.01
GA_6669NM_018085importin 9 (FLJ10402)1203318es 6.01p = 0.00
GA_6673NM_018093hypothetical protein FLJ10439 (FLJ10439)52029es 3.75p = 0.05
GA_6731NM_018182hypothetical protein FLJ10700 (FLJ10700)702110es 7.01p = 0.00
GA_6742NM_018198hypothetical protein FLJ10737 (FLJ10737)843015es 3.43p = 0.02
GA_6760NM_018228chromosome 14 open reading frame 1151310014es 39.05p = 0.00
(C14orf115)
GA_6806NM_018303homolog of yeast Sec5 (SEC5)51118es 5.01p = 0.03
GA_6905NM_017722hypothetical protein FLJ20244 (FLJ20244)41016es 6.01p = 0.04
GA_6957NM_017815chromosome 14 open reading frame 94 (C14orf94)40015es 12.01p = 0.02
GA_6975NM_017840mitochondrial ribosomal protein L16 (MRPL16),602210es 4.51p = 0.02
nuclear gene encoding mitochondrial protein
GA_7078NM_015148PAS domain containing serine/threonine kinase50005es > 4p = 0.00
(PASK)
GA_7155NM_007098clathrin, heavy polypeptide-like 1 (CLTCL1),40105es 12.01p = 0.02
transcript variant 2
GA_7158NM_017489telomeric repeat binding factor (NIMA-interacting) 11432322es 5.26p = 0.00
(TERF1), transcript variant 1
GA_7170NM_019013hypothetical protein FLJ10156 (FLJ10156)713213es 3.50p = 0.02
GA_7178NM_019079hypothetical protein FLJ10884 (FLJ10884)3424141es 14.59p = 0.00
GA_7334NM_020347leucine zipper transcription factor-like 1 (LZTFL1)62109es 6.01p = 0.01
GA_7382AB040878KIAA1445 protein sequence710210es 7.01p = 0.00
GA_75422104025es 15.77p = 0.00
GA_7691D42046The ha3631 gene product is related to S.cerevisiae41106es 6.01p = 0.04
protein encoded in chromosome VIII. sequence
GA_8100NM_054013mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-51129es 3.75p = 0.05
acetylglucosaminyltransferase, isoenzyme B
(MGAT4B), transcript variant 2
GA_8103NM_144570HN1 like (HN1L)1424424es 4.20p = 0.00
GA_8119NM_012266DnaJ (Hsp40) homolog, subfamily B, member 541016es 6.01p = 0.04
(DNAJB5)
GA_8152AK095108cDNA FLJ37789 fis, clone BRHIP300008162109es 6.01p = 0.01
sequence
GA_82NM_015545KIAA0632 protein (KIAA0632)51118es 5.01p = 0.03
GA_8484AK026658cDNA: FLJ23005 fis, clone LNG00396, highly similar40004es > 4p = 0.00
to AF055023clone 24723 mRNA sequence
GA_8559NM_022497mitochondrial ribosomal protein S25 (MRPS25),613111es 3.60p = 0.03
nuclear gene encoding mitochondrial protein
GA_8603NM_007175chromosome 8 open reading frame 2 (C8orf2)731112es 4.20p = 0.01
GA_866740004es > 4p = 0.00
GA_8686Z24725mitogen inducible gene mig-2 sequence1030316es 5.01p = 0.00
GA_8730AK098833cDNA FLJ25967 fis, clone CBR01929 sequence1032015es 6.01p = 0.00
GA_8803NM_000533proteolipid protein 1 (Pelizaeus-Merzbacher63009es 6.01p = 0.01
disease,
spastic paraplegia 2, uncomplicated) (PLP1)
GA_8862AK091593cDNA FLJ34274 fis, clone FEBRA200332750005es > 4p = 0.00
sequence
GA_901460118es 9.01p = 0.00
GA_9162AF311912pancreas tumor-related protein sequence710412es 4.20p = 0.01
GA_9163NM_138639BCL2-like 12 (proline rich) (BCL2L12), transcript813012es 6.01p = 0.00
variant 1
GA_9167AF308602NOTCH 1 sequence62109es 6.01p = 0.01
GA_9183NM_007129Zic family member 2 (odd-paired homolog,811010es 12.01p = 0.00
Drosophila) (ZIC2)
GA_9257NM_005088DNA segment on chromosome X and Y (unique)41016es 6.01p = 0.04
155 expressed sequence (DXYS155E)
GA_9338NM_020436similar to SALL1 (sal (Drosophila)-like (LOC57167)1123016es 6.61p = 0.00
GA_9365NM_021078GCN5 general control of amino-acid synthesis 5-like712111es 5.26p = 0.01
2 (yeast) (GCN5L2)
GA_9384NM_020997left-right determination, factor B (LEFTB)40105es 12.01p = 0.02
GA_9388NM_021643GS3955 protein (GS3955)710210es 7.01p = 0.00
GA_9488NM_007372RNA helicase-related protein (RNAHP)1271626es 2.57p = 0.02
GA_9571NM_022130golgi phosphoprotein 3 (coat-protein) (GOLPH3)622111es 3.60p = 0.03
GA_9593NM_022372G protein beta subunit-like (GBL)60118es 9.01p = 0.00
GA_96NM_012297Ras-GTPase activating protein SH3 domain-binding1996842es 2.48p = 0.00
protein 2 (KIAA0660)
GA_9664NM_015339activity-dependent neuroprotector (ADNP)712212es 4.20p = 0.01
GA_9688NM_022767hypothetical protein FLJ12484 (FLJ12484)1431321es 6.01p = 0.00
GA_9697NM_022778hypothetical protein DKFZp434L011762109es 6.01p = 0.01
(DKFZP434L0117)
GA_9784NM_021873cell division cycle 25B (CDC25B), transcript variant 352018es 5.01p = 0.03
GA_9829BM454622AGENCOURT_6406365 NIH_MGC_92cDNA clone61108es 9.01p = 0.00
IMAGE: 5583082 5′ sequence
GA_9952BC003542Unknown (protein for IMAGE: 3611719) sequence60107es 18.02p = 0.00
GA_9996NM_005911methionine adenosyltransferase II, alpha (MAT2A)27891458es 2.62p = 0.00
|
[0123]
6
TABLE 6
|
|
|
EST Frequency of Genes that Up-regulate upon Differentiation
|
EST counts
|
Geron ID
GenBank ID
Name
ES
EB
preHEP
preNeu
Total
Relative Expression
|
|
GA_10484
AK056774
unnamed protein product sequence
4
153
17
34
208
es 0.06
p = 0.00
|
GA_10493
NM_023009
MARCKS-like protein (MLP)
6
7
15
32
60
es 0.33
p = 0.01
|
GA_1071
NM_001641
APEX nuclease (multifunctional DNA repair
5
13
15
12
45
es 0.38
p = 0.04
|
enzyme) 1 (APEX1), transcript variant 1
|
GA_11334
NM_032272
homolog of yeast MAF1 (MAF1)
0
4
7
1
12
es 0.00
p = 0.05
|
GA_11407
NM_015070
KIAA0853 protein (KIAA0853)
0
2
2
8
12
es 0.00
p = 0.05
|
GA_12217
BC009917
Unknown (protein for MGC: 2764) sequence
0
7
3
5
15
es 0.00
p = 0.03
|
GA_1222
NM_001901
connective tissue growth factor(CTGF)
2
26
4
14
46
es 0.14
p = 0.00
|
GA_12727
NM_004926
zinc finger protein 36, C3H type-like 1 (ZFP36L1)
3
8
12
22
45
es 0.21
p = 0.00
|
GA_1336
NM_002024
fragile X mental retardation 1 (FMR1)
0
3
4
7
14
es 0.00
p = 0.03
|
GA_1353
NM_002051
GATA binding protein 3 (GATA3)
0
2
8
2
12
es 0.00
p = 0.05
|
GA_1403
NM_001530
hypoxia-inducible factor 1, alpha subunit (basic
4
22
5
8
39
es 0.34
p = 0.04
|
helix-loop-helix transcription factor) (HIF1A)
|
GA_1432
NM_002166
inhibitor of DNA binding 2, dominant negative helix-
1
3
17
4
25
es 0.13
p = 0.01
|
loop-helix protein (ID2)
|
GA_1476
NM_002276
keratin 19 (KRT19)
1
26
14
38
79
es 0.04
p = 0.00
|
GA_1545
NM_002512
non-metastatic cells 2, protein (NM23B) expressed
3
6
7
16
32
es 0.31
p = 0.04
|
in (NME2), nuclear gene encoding mitochondrial
|
protein
|
GA_1556
NM_003633
ectodermal-neural cortex (with BTB-like domain)
1
5
2
28
36
es 0.09
p = 0.00
|
(ENC1)
|
GA_1735
NM_002806
proteasome (prosome, macropain) 26S subunit,
1
7
7
8
23
es 0.14
p = 0.03
|
ATPase, 6 (PSMC6)
|
GA_1736
NM_002814
proteasome (prosome, macropain) 26S subunit,
0
4
10
5
19
es 0.00
p = 0.01
|
non-ATPase, 10 (PSMD10)
|
GA_1841
NM_000979
ribosomal protein L18 (RPL18)
4
6
36
35
81
es 0.16
p = 0.00
|
GA_1843
NM_000982
ribosomal protein L21 (RPL21)
1
7
48
42
98
es 0.03
p = 0.00
|
GA_1850
BC020169
clone IMAGE: 3543815, partial cds
0
2
8
11
21
es 0.00
p = 0.00
|
GA_1857
NM_000999
ribosomal protein L38 (RPL38)
1
2
12
10
25
es 0.13
p = 0.01
|
GA_1866
NM_002950
ribophorin I (RPN1)
3
12
10
14
39
es 0.25
p = 0.01
|
GA_1886
NM_001009
ribosomal protein S5 (RPS5)
8
14
46
30
98
es 0.27
p = 0.00
|
GA_1977
NM_003134
signal recognition particle 14 kDa (homologous Alu
1
4
18
12
35
es 0.09
p = 0.00
|
RNA binding protein) (SRP14)
|
GA_2014
NM_003564
transgelin 2 (TAGLN2)
5
31
8
28
72
es 0.22
p = 0.00
|
GA_2039
NM_003246
thrombospondin 1 (THBS1)
0
3
2
7
12
es 0.00
p = 0.05
|
GA_23018
NM_005336
high density lipoprotein binding protein; vigilin
11
37
17
21
86
es 0.44
p = 0.01
|
sequence
|
GA_23176
2
18
3
7
30
es 0.21
p = 0.02
|
GA_23180
AB009010
polyubiquitin UbC, complete cds
7
16
23
26
72
es 0.32
p = 0.00
|
GA_23653
NM_003289
tropomyosin 2 (beta) (TPM2)
2
14
7
8
31
es 0.21
p = 0.01
|
GA_23969
0
1
181
20
202
es 0.00
p = 0.00
|
GA_24037
0
1
6
5
12
es 0.00
p = 0.05
|
GA_2524
NM_004415
desmoplakin (DPI, DPII) (DSP)
3
14
5
23
45
es 0.21
p = 0.00
|
GA_2597
NM_138610
H2A histone family, member Y (H2AFY), transcript
1
5
5
14
25
es 0.13
p = 0.01
|
variant 3
|
GA_2627
NM_004905
anti-oxidant protein 2 (non-selenium glutathione
3
6
11
17
37
es 0.27
p = 0.01
|
peroxidase, acidic calcium-independent
|
phospholipase A2) (AOP2)
|
GA_2702
NM_000942
peptidylprolyl isomerase B (cyclophilin B) (PPIB)
5
6
7
26
44
es 0.39
p = 0.04
|
GA_2752
NM_004175
small nuclear ribonucleoprotein D3 polypeptide
0
1
9
4
14
es 0.00
p = 0.03
|
18 kDa (SNRPD3)
|
GA_2782
NM_004786
thioredoxin-like, 32 kDa (TXNL)
0
4
1
10
15
es 0.00
p = 0.03
|
GA_2808
NM_001154
annexin A5 (ANXA5)
2
14
4
11
31
es 0.21
p = 0.01
|
GA_2968
BC007090
histidine triad nucleotide-binding protein, clone
0
1
11
9
21
es 0.00
p = 0.00
|
MGC: 14708 IMAGE: 4250172, complete cds
|
GA_3016
NM_001873
carboxypeptidase E (CPE)
1
8
4
9
22
es 0.14
p = 0.02
|
GA_3026
NM_005722
ARP2 actin-related protein 2 homolog (yeast)
6
19
7
19
51
es 0.40
p = 0.03
|
(ACTR2)
|
GA_3033
NM_005717
actin related protein 2/3 complex, subunit 5, 16 kDa
3
10
8
19
40
es 0.24
p = 0.01
|
(ARPC5)
|
Gk_3036
NM_152862
actin related protein 2/3 complex, subunit 2, 34 kDa
1
9
3
7
20
es 0.16
p = 0.04
|
(ARPC2), transcript variant 1
|
GA_3126
NM_005620
S100 calcium binding protein A11 (calgizzarin)
0
1
7
37
45
es 0.00
p = 0.00
|
(S100A11)
|
GA_3132
NM_005625
syndecan binding protein (syntenin) (SDCBP)
1
3
10
10
24
es 0.13
p = 0.02
|
GA_3260
NM_006004
ubiquinol-cytochrome c reductase hinge protein
1
4
12
5
22
es 0.14
p = 0.02
|
(UQCRH)
|
GA_3283
NM_004484
glypican 3 (GPC3)
1
6
7
12
26
es 0.12
p = 0.01
|
GA_3294
NM_006476
ATP synthase, H+ transporting, mitochondrial F0
0
1
3
11
15
es 0.00
p = 0.03
|
complex, subunit g (ATP5L)
|
GA_33625
NM_058179
phosphoserine aminotransferase (PSA), transcript
2
8
5
14
29
es 0.22
p = 0.03
|
variant 1
|
GA_33660
BF528488
602043661F1 NCl_CGAP_Brn67cDNA clone
0
7
7
2
16
es 0.00
p = 0.02
|
IMAGE: 4181462 5′ sequence
|
GA_33787
AL832673
mRNA; cDNA DKFZp313B1017 (from clone
0
3
4
6
13
es 0.00
p = 0.05
|
DKFZp313B1017) sequence
|
GA_3403
NM_006142
stratifin (SFN)
0
2
1
14
17
es 0.00
p = 0.01
|
GA_3431
NM_006294
ubiquinol-cytochrome c reductase binding protein
0
2
9
7
18
es 0.00
p = 0.01
|
(UQCRB)
|
GA_3435
NM_006472
thioredoxin interacting protein (TXNIP)
4
14
16
11
45
es 0.29
p = 0.01
|
GA_34569
NM_003299
tumor rejection antigen (gp96) 1 (TRA1)
3
9
27
20
59
es 0.16
p = 0.00
|
GA_34776
NM_002273
keratin 8 (KRT8)
9
71
144
156
380
es 0.07
p = 0.00
|
GA_34912
NM_006367
adenylyl cyclase-associated protein (CAP)
9
24
10
31
74
es 0.42
p = 0.01
|
GA_34930
NM_000700
annexin A1 (ANXA1)
2
12
3
15
32
es 0.20
p = 0.01
|
GA_35086
NM_002128
high-mobility group box 1 (HMGB1)
1
3
8
8
20
es 0.16
p = 0.04
|
GA_35179
NM_001402
eukaryotic translation elongation factor 1 alpha 1
16
29
43
63
151
es 0.36
p = 0.00
|
(EEF1A1)
|
GA_3530
NM_002539
ornithine decarboxylase 1 (ODC1)
1
10
8
9
28
es 0.11
p = 0.01
|
GA_35369
NM_003374
voltage-dependent anion channel 1 (VDAC1)
1
5
6
10
22
es 0.14
p = 0.02
|
GA_35434
NM_006094
deleted in liver cancer 1 (DLC1)
0
8
1
5
14
es 0.00
p = 0.03
|
GA_35463
NM_024298
leukocyte receptor cluster (LRC) member 4
0
4
9
8
21
es 0.00
p = 0.00
|
(LENG4)
|
GA_3560
NM_003079
SWI/SNF related, matrix associated, actin
2
5
11
11
29
es 0.22
p = 0.03
|
dependent regulator of chromatin, subfamily e,
|
member 1 (SMARCE1)
|
GA_35641
BC029424
similar to weakly similar to glutathione peroxidase 2
1
11
5
3
20
es 0.16
p = 0.04
|
sequence
|
GA_35978
NM_006830
ubiquinol-cytochrome c reductase (6.4 kD) subunit
0
1
4
7
12
es 0.00
p = 0.05
|
(UQCR)
|
GA_3617
NM_000391
ceroid-lipofuscinosis, neuronal 2, late infantile
1
4
15
2
22
es 0.14
p = 0.02
|
(Jansky-Bielschowsky disease) (CLN2)
|
GA_36322
NM_001554
cysteine-rich, angiogenic inducer, 61 (CYR61)
0
3
3
7
13
es 0.00
p = 0.05
|
GA_36460
NM_001300
core promoter element binding protein (COPEB)
0
6
2
7
15
es 0.00
p = 0.03
|
GA_3652
NM_005556
keratin 7 (KRT7)
0
9
1
14
24
es 0.00
p = 0.00
|
GA_36638
NM_002954
ribosomal protein S27a (RPS27A)
3
5
37
35
80
es 0.12
p = 0.00
|
GA_36721
NM_005134
protein phosphatase 4, regulatory subunit 1
0
8
2
6
16
es 0.00
p = 0.02
|
(PPP4R1)
|
GA_36891
NM_001019
ribosomal protein S15a (RPS15A)
0
2
50
32
84
es 0.00
p = 0.00
|
GA_36932
NM_015338
KIAA0978 protein (KIAA0978)
0
5
3
5
13
es 0.00
p = 0.05
|
GA_3707
NM_003816
a disintegrin and metalloproteinase domain 9
0
8
1
3
12
es 0.00
p = 0.05
|
(meltrin gamma) (ADAM9)
|
GA_37238
NM_021019
myosin, light polypeptide 6, alkali, smooth muscle
0
2
2
12
16
es 0.00
p = 0.02
|
and non-muscle (MYL6), transcript variant 1
|
GA_37377
NM_000516
GNAS complex locus (GNAS), transcript variant 1
12
16
27
38
93
es 0.44
p = 0.01
|
GA_37494
NM_001305
claudin 4 (CLDN4)
1
2
10
12
25
es 0.13
p = 0.01
|
GA_37508
NM_000994
ribosomal protein L32 (RPL32)
2
6
26
35
69
es 0.09
p = 0.00
|
GA_37557
NM_152437
hypothetical protein DKFZp761B128
1
7
13
3
24
es 0.13
p = 0.02
|
(DKFZp761B128)
|
GA_37660
NM_001749
calpain, small subunit 1 (CAPNS1)
4
7
11
20
42
es 0.32
p = 0.02
|
GA_37689
AK022962
cDNA FLJ12900 fis, clone NT2RP2004321
0
4
6
2
12
es 0.00
p = 0.05
|
sequence
|
GA_37776
NM_000366
tropomyosin 1 (alpha) (TPM1)
24
46
37
74
181
es 0.46
p = 0.00
|
GA_3782
NM_003968
ubiquitin-activating enzyme E1C (UBA3 homolog,
0
1
5
6
12
es 0.00
p = 0.05
|
yeast) (UBE1C)
|
GA_3789
NM_006818
ALL1-fused gene from chromosome 1q (AF1Q)
0
17
1
11
29
es 0.00
p = 0.00
|
GA_38037
NM_033480
F-box only protein 9 (FBXO9), transcript variant 2
0
4
4
4
12
es 0.00
p = 0.05
|
GA_3812
NM_006854
KDEL (Lys-Asp-Glu-Leu) endoplasmic reticulum
3
12
5
17
37
es 0.27
p = 0.01
|
protein retention receptor 2 (KDELR2)
|
GA_38124
NM_000269
non-metastatic cells 1, protein (NM23A) expressed
1
2
8
13
24
es 0.13
p = 0.02
|
in (NME1)
|
GA_38191
NM_000224
keratin 18 (KRT18)
8
46
50
119
223
es 0.11
p = 0.00
|
GA_38341
NM_006931
solute carrier family 2 (facilitated glucose
28
49
45
85
207
es 0.47
p = 0.00
|
transporter), member 3 (SLC2A3)
|
GA_38503
NM_000612
insulin-like growth factor 2 (somatomedin A) (IGF2)
0
17
4
21
42
es 0.00
p = 0.00
|
GA_38528
NM_012062
dynamin 1-like (DNM1L), transcript variant 1
0
5
4
3
12
es 0.00
p = 0.05
|
GA_38545
NM_005801
putative translation initiation factor (SUI1)
1
14
15
19
49
es 0.06
p = 0.00
|
GA_38563
NM_021005
nuclear receptor subfamily 2, group F, member 2
0
9
8
9
26
es 0.00
p = 0.00
|
(NR2F2)
|
GA_3857
NM_006644
heat shock 105 kD (HSP105B)
1
11
3
7
22
es 0.14
p = 0.02
|
GA_38570
NM_033150
collagen, type II, alpha 1 (primary osteoarthritis,
0
15
31
5
51
es 0.00
p = 0.00
|
spondyloepiphyseal dysplasia, congenital)
|
(COL2A1), transcript variant 2
|
GA_38790
NM_001743
calmodulin 2 (phosphorylase kinase, delta)
15
23
36
37
111
es 0.47
p = 0.00
|
(CALM2)
|
GA_38817
NM_013341
hypothetical protein PTD004 (PTD004)
0
4
5
3
12
es 0.00
p = 0.05
|
GA_38830
NM_006013
ribosomal protein L10 (RPL10)
12
13
71
81
177
es 0.22
p = 0.00
|
GA_3892
NM_006888
calmodulin 1 (phosphorylase kinase, delta)
1
3
11
9
24
es 0.13
p = 0.02
|
(CALM1)
|
GA_3973
NM_144497
A kinase (PRKA) anchor protein (gravin) 12
0
17
1
20
38
es 0.00
p = 0.00
|
(AKAP12), transcript variant 2
|
GA_3977
NM_005139
annexin A3 (ANXA3)
0
3
4
10
17
es 0.00
p = 0.01
|
GA_4045
NM_003897
immediate early response 3 (IER3), transcript
1
14
2
4
21
es 0.15
p = 0.04
|
variant short
|
GA_4132
NM_002305
lectin, galactoside-binding, soluble, 1 (galectin 1)
0
5
2
7
14
es 0.00
p = 0.03
|
(LGALS1)
|
GA_4182
NM_001202
bone morphogenetic protein 4 (BMP4), transcript
0
7
6
4
17
es 0.00
p = 0.01
|
variant 1
|
GA_4395
NM_003145
signal sequence receptor, beta (translocon-
6
17
12
14
49
es 0.42
p = 0.05
|
associated protein beta) (SSR2)
|
GA_4418
NM_004800
transmembrane 9 superfamily member 2 (TM9SF2)
0
7
2
8
17
es 0.00
p = 0.01
|
GA_4615
NM_012286
MORF-related gene X (MRGX)
10
22
16
23
71
es 0.49
p = 0.04
|
GA_4640
NM_012342
putative transmembrane protein (NMA)
1
8
3
10
22
es 0.14
p = 0.02
|
GA_4914
NM_016282
adenylate kinase 3 like 1 (AK3L1)
0
2
6
4
12
es 0.00
p = 0.05
|
GA_5243
NM_139207
nucleosome assembly protein 1-like 1 (NAP1L1),
7
19
28
25
79
es 0.29
p = 0.00
|
transcript variant 1
|
GA_5387
NM_002047
glycyl-tRNA synthetase (GARS)
8
9
34
34
85
es 0.31
p = 0.00
|
GA_5557
NM_014211
gamma-aminobutyric acid (GABA) A receptor, pi
1
3
4
13
21
es 0.15
p = 0.04
|
(GABRP)
|
GA_5730
NM_015641
testis derived transcript (3 LIM domains) (TES),
0
2
2
9
13
es 0.00
p = 0.05
|
transcript variant 1
|
GA_5992
NM_014899
Rho-related BTB domain containing 3 (RHOBTB3)
0
10
7
13
30
es 0.00
p = 0.00
|
GA_6118
NM_016403
hypothetical protein HSPC148 (HSPC148)
0
2
7
3
12
es 0.00
p = 0.05
|
GA_6136
NM_016368
myo-inositol 1-phosphate synthase A1 (ISYNA1)
1
7
5
16
29
es 0.11
p = 0.00
|
GA_6165
NM_015853
ORF (LOC51035)
1
5
9
5
20
es 0.16
p = 0.04
|
GA_6219
NM_016139
16.7 Kd protein (LOC51142)
1
5
13
14
33
es 0.09
p = 0.00
|
GA_6381
NM_016641
membrane interacting protein of RGS16 (MIR16)
0
2
3
7
12
es 0.00
p = 0.05
|
GA_6388
NM_016145
PTD008 protein (PTD008)
0
1
2
10
13
es 0.00
p = 0.05
|
GA_6437
NM_016732
RNA binding protein (autoantigenic, hnRNP-
2
6
7
12
27
es 0.24
p = 0.04
|
associated with lethal yellow) (RALY), transcript
|
variant 1
|
GA_6481
NM_014380
nerve growth factor receptor (TNFRSF16)
1
4
8
17
30
es 0.10
p = 0.00
|
associated protein 1 (NGFRAP1)
|
GA_7280
NM_020199
HTGN29 protein (HTGN29)
0
6
2
6
14
es 0.00
p = 0.03
|
GA_7286
NM_172316
Meis1, myeloid ecotropic viral integration site 1
0
4
2
10
16
es 0.00
p = 0.02
|
homolog 2 (mouse) (MEIS2), transcript variant h
|
GA_749
BC015794
Unknown (protein for MGC: 8837) sequence
0
4
4
9
17
es 0.00
p = 0.01
|
GA_7520
NM_003486
solute carrier family 7 (cationic amino acid
2
20
3
20
45
es 0.14
p = 0.00
|
transporter, y+ system), member 5 (SLC7A5)
|
GA_7635
NM_170746
selenoprotein H (SELH)
0
1
10
2
13
es 0.00
p = 0.05
|
GA_8275
NM_012203
glyoxylate reductase/hydroxypyruvate reductase
0
3
2
12
17
es 0.00
p = 0.01
|
(GRHPR)
|
GA_8627
NM_006868
RAB31, member RAS oncogene family (RAB31)
0
5
1
7
13
es 0.00
p = 0.05
|
GA_8674
NM_000598
insulin-like growth factor binding protein 3 (IGFBP3)
1
15
4
3
23
es 0.14
p = 0.03
|
GA_8980
NM_005347
heat shock 70 kDa protein 5 (glucose-regulated
10
29
15
30
84
es 0.41
p = 0.01
|
protein, 78 kDa) (HSPA5)
|
GA_9152
NM_005324
H3 histone, family 3B (H3.3B) (H3F3B)
20
26
57
49
152
es 0.46
p = 0.00
|
GA_9196
NM_000404
galactosidase, beta 1 (GLB1), transcript variant
0
6
10
7
23
es 0.00
p = 0.00
|
179423
|
GA_9251
NM_004373
cytochrome c oxidase subunit VIa polypeptide 1
0
3
7
8
18
es 0.00
p = 0.01
|
(COX6A1), nuclear gene encoding mitochondrial
|
protein
|
GA_9266
NM_021104
ribosomal protein L41 (RPL41)
6
9
70
75
160
es 0.12
p = 0.00
|
GA_9649
NM_014604
Tax interaction protein 1 (TIP-1)
0
8
5
5
18
es 0.00
p = 0.01
|
GA_9734
NM_022908
hypothetical protein FLJ12442 (FLJ12442)
0
3
2
14
19
es 0.00
p = 0.01
|
|
Microarray Analysis for Other Differentially Expressed Genes
[0124] In another series of experiments, the level of gene expression was tested at the mRNA level in microarrays.
[0125] Genes were selected from the non-redundant set of gene assemblies from the four cDNA libraries described in Example 1, based on their novelty and possible interest as markers. An additional 7,000 sequence-verified clones were obtained from Research Genetics (Huntsville Ala.) and incorporated into an array with a control set of ˜200 known housekeeping genes. Each clone was grown overnight in 96-well format and DNA purified using the Qiagen 96-well DNA kit. The DNA templates were PCR amplified in 100 μL reactions. PCR product was then purified using the Arraylt™ PCR Purification Kit (Telechem, Sunnyvale Calif.) according to manufacturer instructions. Product was dried down, resuspended in 50% DMSO and Arraylt™ Microprinting solution (Telechem, Sunnyvale Calif.) and arrayed onto GAPS™ amino silane coated slides (Corning Inc., Acton Mass.) using a GMS 417 Arrayer (Affymetrix, Santa Clara, Calif.). After printing, slides were humidified and snap heated, baked at 80° for 4 h, then blocked with succinic anhydride.
[0126] Total RNA from undifferentiated ES cells, embryoid body cells (EB), retinoic acid treated (preNeu), and DMSO treated (PreHep) cells S, EB, RA-treated, and DMSO-treated cells (10 μg, 15 μg, and 20 μg for sensitivity) was then reverse transcriptase labeled with Cy3 or Cy5 fluorophores, and competitively hybridized to the microarrays overnight at 42° C. in 50% formamide and Sigma hybridization buffer. Undifferentiated ES RNA was directly and indirectly compared with RNA from all other cell types. Experiments were repeated at least 5 times each, and dye reversed. Stratagene Universal Human Reference RNA (Cat. #740000) was used as the indirect comparator. Arrays were washed repeatedly and scanned using a Genepix™ 4000A microarray scanner (Axon Instruments, Fremont Calif.).
[0127] Image processing, data extraction and preliminary quality control were performed using Genepix™ Pro 3.0.6 (Axon Instruments). Quality control calculations involved quantifying overall signal intensities, statistical means and medians of pixel intensities and spot morphologies. Extracted data was further analyzed based on statistical algorithms of signal-to-noise, sensitivity range, and reproducibility. Data was then loaded into the GeneSpring™ database and analysis program. Of particular interest were genes that showed reproducible expression differences of 2-fold in either direction, especially when the change occurred upon differentiation to all three differentiated cell types.
[0128] The following table lists genes that were identified as being downregulated or upregulated in their expression level upon differentiation into EB, preHEP, or preNEU cells. EST counts are provided from the data generated in the previous example.
7TABLE 7
|
|
Microarray Analysis - Genes that Decrease Expression upon Differentiation
Fold ChangeEST Counts
Geron IDGenBank IDNameRADMSOESEBpreHeppreNeu
|
GA_1674NM_002701POU domain, class 5, transcription factor−3.61−10.6824120
1 (POU5F1)
GA_9384NM_020997left-right determination, factor B (LEFTB)−4.88−5.484010
GA_37788NM_133631roundabout, axon guidance receptor,−7.93−2.97410
homolog 1
GA_12173NM_021912gamma-aminobutyric acid (GABA) A−3.37−2.164000
receptor, beta 3 (GABRB3)
GA_37606NM_019012phosphoinositol 3-phosphate-binding−2.96−9.994200
protein-2 (PEPP2)
GA_1470NM_003740potassium channel, subfamily K, member−2.93−2.474001
5 (KCNK5)
GA_2937NM_005207v-crk sarcoma virus CT10 oncogene−2.29−3.786100
homolog (avian)-like (CRKL)
GA_10513NM_033209Thy-1 co-transcribed (LOC94105)−2.21−3.397221
GA_36957NM_024642N-acetylgalactosaminyltransferase 12−3.24−5.054011
(GaINAc-T12) (GALNT12)
GA_36420NM_001064transketolase (Wernicke-Korsakoff−2.25−2.2814171117
syndrome) (TKT)
GA_1677NM_003712phosphatidic acid phosphatase type 2C−2.46−2.711000
(PPAP2C)
GA_36793NM_152295threonyl-tRNA synthetase (TARS)−2.18−3.58416
GA_7151NM_017488adducin 2 (beta) (ADD2), transcript−4.21−2.034220
variant beta-4
GA_12053NM_001986ets variant gene 4 (E1A enhancer binding−2.76−2.040104
protein, E1AF) (ETV4)
GA_1798NM_000964retinoic acid receptor, alpha (RARA)−2.76−3.33200
GA_5617NM_014502nuclear matrix protein NMP200 related to−2.19−2.335342
splicing factor PRP19 (NMP200)
GA_2753NM_000582secreted phosphoprotein 1 (osteopontin)−3.78−3.3236239
(SPP1)
GA_7151NM_017486adducin 2 (beta) (ADD2), transcript−3.34−2.134220
variant beta-6a
GA_36775NM_000918procollagen-proline, thyroid hormone−2.01−2.6512281022
binding protein p55) (P4HB)
GA_1086NM_133436asparagine synthetase (ASNS), transcript−2.27−2.5365313
variant 1
GA_2928NM_005163v-akt murine thymoma viral oncogene−2.79−3.4521025
homolog 1 (AKT1)
GA_33799NM_003250thyroid hormone receptor (THRA)−4.28−4.440201
GA_37861NM_021784forkhead box A2 (FOXA2), transcript−3.56−2.992000
variant 1
GA_34109NM_002026fibronectin 1 (FN1), transcript variant 1−2.91−2.0117166527
GA_38641NM_004309Rho GDP dissociation inhibitor (GDI)−2.72−2.3578914
alpha (ARHGDIA)
GA_33829NM_002081glypican 1 (GPC1)−2.61−2.323941
GA_5549NM_014600EH-domain containing 3 (EHD3)−2.39−2.811511
GA_9269NM_021074NADH dehydrogenase (ubiquinone)−2.26−2.010096
flavoprotein 2, 24 kDa (NDUFV2)
GA_2934NM_005180B lymphoma Mo-MLV insertion region−2.11−3.241201
(mouse) (BMI1)
GA_3522NM_002415macrophage migration inhibitory factor−2.04−2.054289
(glycosylation-inhibiting factor) (MIF)
GA_2465NM_004364CCAAT/enhancer binding protein−2.79−40100
(C/EBP), alpha (CEBPA)
GA_36793NM_152295threonyl-tRNA synthetase (TARS)−5.34−2.988416
GA_9259NM_005539inositol polyphosphate-5-phosphatase,−4.37−6.541002
40 kDa (INPP5A)
GA_2232NM_001348death-associated protein kinase 3−2.9−3.563312
(DAPK3)
GA_37240NM_007029stathmin-like 2 (STMN2)−4.37−2.370401
GA_4617NM_012289Kelch-like ECH-associated protein 1−11.88−2.592422
(KEAP1)
GA_38021NM_002111huntingtin (Huntington disease) (HD)−10.84−2.161502
GA_9227NM_001552insulin-like growth factor binding protein 4−6.13−3.065402
(IGFBP4)
GA_267NM_007041arginyltransferase 1 (ATE1)−3.03−3.221102
GA_38392NM_006597heat shock 70 kDa protein 8 (HSPA8),−8.8−2.739204862
transcript variant 1
GA_1829NM_002936ribonuclease H1 (RNASEH1)−2.81−2.111012
GA_9228NM_001664ras homolog gene family, member A−3.21−2.481118817
(ARHA)
GA_1495NM_002347lymphocyte antigen 6 complex, locus H−2.33−2.570001
(LY6H)
GA_3840NM_006749solute carrier family 20 (phosphate−5.4−2.830113
transporter), member 2 (SLC20A2)
GA_1045NM_001105activin A receptor, type I (ACVR1)−2.7−2.370313
GA_36361NM_020636zinc finger protein 275 (ZNF275)−4.09−2.070003
GA_2445NM_004337chromosome 8 open reading frame 1−3.02−2.21000
(C8orf1)
GA_4652NM_012228pilin-like transcription factor (PILB)−2.73−2.460010
GA_10567NM_025195phosphoprotein regulated by mitogenic−4.74−3.640201
pathways (C8FW)
GA_9258NM_005393plexin B3 (PLXNB3)−3.56−3.040200
GA_35992NM_001402eukaryotic translation elongation factor 1−5.55−2.22419467454428
alpha 1 (EEF1A1)
GA_33537NM_133259leucine-rich PPR-motif containing−2.47−3.418753
(LRPPRC)
GA_6367NM_016354solute carrier family 21 (organic anion−2.08−3.260001
transporter), member 12 (SLC21A12)
GA_667AB028976mRNA for KIAA1053 protein, partial cds−7.55−3.520202
BQ023180NCI_CGAP_PI6 cDNA clone UI-1-BB1p-−2.96−2.1
aui-g-05-0-UI 3' sequence
AA419281Soares ovary tumor NbHOT cDNA clone−3.36−2.59
IMAGE: 755641 3' sequence
NM_006604ret finger protein-like 3 (RFPL3)−2.69−2.5
NM_012155echinoderm microtubule associated−9.82−6.65
protein like 2 (EML2)
NM_000160glucagon receptor (GCGR)−3.94−2.18
NM_003181T, brachyury homolog (mouse) (T)−9.15−2.11
NM_014620homeo box C4 (HOXC4), transcript−9.54−2.1
variant 1
NM_005583lymphoblastic leukemia derived sequence−4.36−2.79
1 (LYL1)
NM_014310RASD family, member 2 (RASD2)−2.72−3.13
NM_012467tryptase gamma 1 (TPSG1)−2.63−2.55
NM_000539rhodopsin (opsin 2, rod pigment) (retinitis−4.84−5.53
pigmentosa 4, autosomal dominant)
(RHO)
NM_021076neurofilament, heavy polypeptide (200 kD)−2.03−2.41
(NEFH)
NM_012407protein kinase C, alpha binding protein−5.44−2.56
(PRKCABP)
NM_000201intercellular adhesion molecule 1 (CD54),−2.18−2.06
human rhinovirus receptor (ICAM1)
|
[0129]
8
TABLE 8
|
|
|
Microarray Analysis - Genes that Increase Expression upon Differentiation
|
Fold Change
EST Counts
|
Geron ID
GenBank ID
Name
RA
DMSO
ES
EB
preHep
preNeu
|
|
GA_1055
NM_001134
alpha-fetoprotein (AFP)
8.02
5.07
0
4
0
0
|
GA_1055
NM_001134
alpha-fetoprotein (AFP)
6.45
3.71
0
4
0
0
|
GA_1055
NM_001134
alpha-fetoprotein (AFP)
2.58
2.67
0
4
0
0
|
GA_1213
NM_001884
cartilage linking protein 1 (CRTL1)
4.57
8.71
3
1
17
3
|
GA_1476
NM_002276
keratin 19 (KRT19)
2.09
5.21
1
26
14
38
|
GA_8674
NM_000598
insulin-like growth factorn binding protein
3.16
3.59
1
15
4
3
|
3 (IGFBP3)
|
GA_3283
NM_004484
glypican 3 (GPC3)
2.6
3.29
1
6
7
12
|
GA_37735
NM_058178
neuronal pentraxin receptor (NPTXR)
3.77
4.04
1
0
0
1
|
GA_1280
NM_001957
endothelin receptor type A(EDNRA)
3.05
6.37
2
2
1
0
|
GA_37308
NM_003068
snail homolog 2 (Drosophila) (SNAI2)
2.24
4.68
4
3
0
0
|
GA_5909
NM_014851
KIAA0469 gene product
2.77
2.03
3
3
0
1
|
GA_23450
XM_027313
ATP synthase mitochondrial F1 complex
2.48
3.55
3
1
1
1
|
assembly factor 1 (ATPAF1),
|
GA_7286
NM_020119
likely ortholog of rat zinc-finger antiviral
2.5
3.55
1
0
0
0
|
protein (ZAP)
|
|
Specificity of Expression Confirmed by Real-time PCR
[0130] To verify the expression patterns of particular genes of interest at the mRNA level, extracts of undifferentiated hES cells and their differentiated progeny were assayed by real-time PCR. Cells were cultured for 1 week with 0.5% dimethyl sulfoxide (DMSO) or 500 nM retinoic acid (RA). The samples were amplified using sequence-specific primers, and the rate of amplification was correlated with the expression level of each gene in the cell population.
[0131] Taqman™ RT-PCR was performed under the following conditions: 1×RT Master Mix (ABI), 300 nM for each primer, and 80 nM of probe, and 10 pg to 100 ng of total RNA in nuclease-free water. The reaction was conducted under default RT-PCR conditions of 48° C. hold for 30 min, 95° C. hold for 10 min, and 40 cycles of 95° C. at 15 sec and 60° C. hold for 1 min. RNA was isolated by a guanidinium isothiocyanate method (RNAeasy™ kit, Qiagen) according to manufacturer's instructions, and subsequently DNAse treated (DNAfree™ kit, Ambion). Gene-specific primers and probes were designed by PrimerExpress™ software (Ver. 1.5, ABI). Probe oligonucleotides were synthesized with the fluorescent indicators 6-carboxytluorescein (FAM) and 6-carboxy-tetramethylrhodamine (TAMRA) at the 5′ and 3′ ends, respectively. Relative quantitation of gene expression between multiple samples was achieved by normalization against endogenous 18S ribosomal RNA (primer and probe from ABI) using the ΔΔCT method of quantitation (ABI). Fold change in expression level was calculated as 2 −ΔΔCT.
[0132] The table below shows the results of this analysis. Since the cells have been cultured in RA and DMSO for a short period, they are at the early stages of differentiation, and the difference in expression level is less dramatic than it would be after further differentiation. Of particular interest for following or modulating the differentiation process are markers that show modified expression within the first week of differentiation by more than 2-fold (*), 5-fold (**), 10-fold (***), or 100-fold (****)
9TABLE 9
|
|
Quantitative RT-PCR analysis of gene expression in hESC differentiation
Fold Change
Geron IDGenBank IDNameRADMSO
|
A.GA_10902NM_024504Pr domain containing 14 (PRDM14)**−1.9−8.3
GA_11893NM_032805Hypothetical protein FLJ14549***−2.3−10.0
GA_12318NM_032447Fibrillin3
GA_1322NM_000142Fibroblast growth factor receptor 3 precursor 1.5 2.3
(FGFR-3)*
GA_1329NM_002015Forkhead box o1a (foxo1a)*−1.6−2.9
GA_1470NM_003740Potassium channel subfamily k member 5 (TASK-2)−1.6 1.0
GA_1674NM_002701Octamer-binding transcription factor 3a (OCT-3A)−3.7−7.7
(OCT-4)**
GA_2024NM_003212Teratocarcinoma-derived growth factor 1−4.0−12.5
(CRIPTO)***
GA_2149NM_003413Zic family member 3 (ZIC3)**−1.7−5.3
GA_2334NM_000216Kallmann syndrome 1 sequence (KAL1)*−1.1−2.5
GA_23552BC027972Glypican-2 (cerebroglycan)−1.5−1.2
GA_2356NM_002851Protein tyrosine phosphatase, receptor-type, z−1.7−3.3
polypeptide 1 (PTPRZ1)*
GA_2367NM_003923Forkhead box h1 (FOXH1)**−1.8−5.6
GA_2436NM_004329Bone morphogenetic protein receptor, type Ia−2.4−2.4
(BMPR1A) (ALK-3)*
GA_2442NM_004335Bone marrow stromal antigen 2 (BST-2) 1.1−1.9
GA_2945NM_005232Ephrin type-a receptor 1 (EPHA1)−1.3−1.9
GA_2962NM_005314Gastrin-releasing peptide receptor (GRP-R)**−6.3−9.1
GA_2988NM_005397Podocalyxin-like (PODXL)*−2.6−4.3
GA_3337NM_006159Nell2 (NEL-like protein 2)−1.3−1.3
GA_3559NM_005629Solute carrier family 6, member 8 (SLC6A8)−1.1−1.1
GA_420X98834Zinc finger protein, HSAL2*−1.4−2.8
GA_5391NM_002968Sal-like 1 (SALL1), 1.4−1.3
GA_6402NM_016089Krab-zinc finger protein SZF1-1*−1.8−3.1
GA_9167AF308602Notch 1 (N1) 1.3 1.0
GA_9183AF193855Zinc finger protein of cerebellum ZIC2* 1.0−2.9
GA_9443NM_004426Early development regulator 1 (polyhomeotic 1−1.8−5.6
homolog) (EDR1)**
B.GA_9384NM_020997Left-right determination, factor b (LEFTB)**−16.7 −25.0
GA_12173BC010641Gamma-aminobutyric acid (GABA) A receptor,−2.8−5.6
beta 3**
GA_10513NM_033209Thy-1 co-transcribed***−12.5 −11.1
GA_1831NM_002941Roundabout, axon guidance receptor, homolog 1 1.1 1.0
(ROBO1),
GA_2753NM_000582Secreted phosphoprotein 1 (osteopontin)***−3.8−10.0
GA_32919NM_133259130 kDa leucine-rich protein (LRP 130)−1.9−1.9
GA_28290AK055829FLJ31267 (acetylglucosaminyltransferase-like−2.3−4.5
protein)*
C.GA_28053T24677EST****<−100* <−100*
GA_26303NM_138815Hypothetical protein BC018070***−3.2−10.0
GA_2028NM_003219Telomerase reverse transcriptase (TERT)*−2.1−2.3
|
Selection of Markers for Monitoring ES Cell Differentiation
[0133] Genes that undergo up- or down-regulation in expression levels during differentiation are of interest for a variety of different commercial applications, as described earlier. This experiment provides an example in which certain genes were selected as a means to monitor the ability of culture conditions to maintain the undifferentiated cell phenotype—and hence, the pluripotent differentiation capability of the cells.
[0134] Particular genes were chosen from those identified as having differential expression patterns, because they are khown or suspected of producing a protein gene product that is expressed at the cell surface, or is secreted. These attributes are helpful, because they allow the condition of the cells to be monitored easily either by antibody staining of the cell surface, or by immunoassay of the culture supernatant. Genes were chosen from the EST database (Groups 1), microarray analysis (Group 2), and other sources (Group 3).
10TABLE 10
|
|
Additional Genes analyzed by real-time PCR
GenBank or
NameID No.
|
Group 1Bone marrow stromal antigenNM_004335
Podocalyxin-likeNM_005397
Rat GPC/ glypican-2 (cerebroglycan)TA_5416486
Potassium channel subfamily k member 5 (TASK-2)NM_003740
Notch 1 proteinAF308602
Teratocarcinoma-derived growth factor 1 (Cripto)NM_003212
Nel 1 like / NELL2 (Nel-like protein 2)NM_006159
Gastrin releasing peptide receptorNM_005314
Bone morphogenetic protein receptorNM_004329
ABCG2-ABC transporterAY017168
Solute carrier family 6, member 8 (SLC6A8)NM_005629
hTERTNM_003219
Oct 3/4 octamer-binding transcription factor 3a (oct-3a) (oct-4)NM_002701
Group 2Left-right determination factor b (LEFTB)NM_020997
Secreted phosphoprotein 1 (osteopontin)NM_000582
Gamma-aminobutyric acid (GABA) A receptor, beta 3NM_021912
Roundabout, axon guidance receptor, homologue 1 (ROBO1),NM_002941
Glucagon receptorNM_00160
Leucine-rich PPR-motif hum 130 kDa hum130leu 130 kd LeuM92439
Thy-1 co-transcribedNM_033209
Solute carrier family 21NM_016354
LY6H lymphocyte antigen 6 complex locus HNM_002347
Plexin (PLXNB3)NM_005393
ICAMNM_000201
Group 3RhodopsinNM_000539
Kallmann syndrome 1 sequence (KAL1)NM_000216
Armadillo repeat protein deleted in velo-cardio-facial syndromeNM_001670
(ARVCF)
Ephrin type-a receptor 1 (EPHA1)NM_005232
|
[0135]
FIG. 1 shows the decrease in expression of the genes in Group I (Upper Panel) and Group II (Lower Panel) in H9 hES cells after culturing for 7 days with RA or DM. Gene expression of rhodopsin and ICAM was below the limit of detection in differentiated cells. KAL1 and EPHA1 were not tested.
[0136] Besides hTERT and Oct 3/4, three other genes were selected as characteristic of the undifferentiated hES cell phenotype. They were Teratocarcinoma-derived growth factor (Cripto), Podocalyxin-like (PODXL), and gastrin-releasing peptide receptor (GRPR).
[0137]
FIG. 2 compares the level of expression of these five genes in hES cells with fully differentiated cells: BJ fibroblasts, BJ fibroblasts transfected to express hTERT (BJ-5TA), and 293 (human embryonic kidney) cells. The level of all markers shown was at least 10-fold higher, and potentially more than 102, 103, 104, 105, or 106-fold higher in pluripotent stem cells than fully differentiated cells. All five markers retained a detectable level of expression in differentiated cultures of hESC. It is not clear if there is lower level of expression of these markers in differentiated cells, or if the detectable expression derived from the undifferentiated cells in the population. The one exception observed in this experiment was the hTERT transgene, expressed at an elevated level as expected in the BJ-5TA cells.
[0138] High-level expression of Cripto, GRPR and PODXL in undifferentiated hES cells reveals interesting aspects of the biology of these cells. Cripto has been implicated in normal mammalian development and tumor growth. Cripto encodes a glycosylphosphoinositol anchored protein that contains an EGF repeat and a cysteine rich motif, which makes it a member of the EGF-CFC family. It has been demonstrated that Cripto serves as a co receptor for Nodal, which is essential for mesoderm and endoderm formation in vertebrate development (Yeo et al., Molecular Cell 7:949, 2001). The finding that Cripto is expressed preferentially on undifferentiated hESC suggests that Nodal is an important signaling molecule for stem cells, perhaps to promote survival and/or proliferation.
[0139] PODXL encodes for transmembrane sialoprotein that is physically linked to the cytoskeleton. PODXL is suspected to act as an inhibitor of cell-cell adhesion and has been implicated in the embryonic development of the kidney podocyte. The anti-adhesion properties of PODXL when expressed on undifferentiated hESC may be an important feature related to stem cell migration.
[0140] The receptor for gastrin releasing peptide (GRP) is a G-protein coupled receptor that mediates numerous biological effects of Bombesin-like peptides, including regulation of gut acid secretion and satiety. A critical role has also been established for GRP and GRPR in control growth of cultured cells and normal mammalian development. GRP and GRPR may be oncofetal antigens that act as morphogens in normal development and cancer.
Use of Cell Markers to Modify ES Cell Culture Conditions
[0141] This example illustrates the utility of the differentially expressed genes identified according to this invention in the evaluation of culture environments suitable for maintaining pluripotent stem cells.
[0142]
FIG. 3 show results of an experiment in which hES cells of the H1 line were maintained for multiple passages in different media. Medium conditioned with feeder cells provides factors effective to allow hES cells to proliferate in culture without differentiating. However, culturing in unconditioned medium leads to loss of the undifferentiated phenotype, with an increasing percentage of the cells showing decreased expression of CD9 (a marker for endothelial cells, fibroblasts, and certain progenitor cells), and the classic hES cell marker SSEA-4.
[0143]
FIG. 4 illustrates the sensitivity of hTERT, Oct 3/4, Cripto, GRP receptor, and podocalyxin-like protein (measured by real-time PCR assay) as a means of determining the degree of differentiation of the cells. After 4 passages in unconditioned X-VIVO™ 10 medium containing 8 ng/mL bFGF, all 5 markers show expression that has been downregulated by about 10-fold. After 8 passages, expression has decreased by 102, 103, or 104-fold.
[0144]
FIG. 5 shows results of an experiment in which the hES cell line H1 was grown on different feeder cell lines: mEF=mouse embryonic fibroblasts; hMSC=human mesenchymal stem cells; UtSMC =human uterine smooth muscle cells; WI-38=an established line of human lung fibroblasts. As monitored by RT-PCR assay of Cripto, Oct 3/4, and hTERT, at least under the conditions used in this experiment, the hMSC are better substitutes for mEF feeders than the other cell lines tested.
[0145]
FIG. 6 shows results of an experiment in which different media were tested for their ability to promote growth of hES cells without differentiation. Expression of Podocalyxin-like protein, Cripto, GFP Receptor, and hTERT were measured by RT-PCR. The test media were not preconditioned, but supplemented with the growth factors as follows:
11TABLE 11
|
|
Growth Conditions Tested for Marker Expression
DMEM preconditioned with
Standard conditions:mEF + bFGF (8 ng/mL)
|
Condition 3X-VIVO ™ 10 + bFGF (8 ng/mL)
Condition 4X-VIVO ™ 10 + bFGF (40 ng/mL)
Condition 5X-VIVO ™ 10 + bFGF (40 ng/mL) + stem cell
factor (SCF, 15 ng/mL)
Condition 6X-VIVO ™ 10 + bFGF (40 ng/mL) + FIt3 ligand
(75 ng/mL)
Condition 7X-VIVO ™ 10 + bFGF (40 ng/mL) + LIF
(100 ng/mL)
Condition 8QBSF ™−60 + bFGF (40 ng/mL)
|
[0146] The results show that the markers selected to monitor the undifferentiated phenotype showed similar changes in each of these culture conditions. By all criteria, XVIVO 10™ supplemented according to Condition 6 was found to be suitable for culturing hES cells without having to be preconditioned. As shown on the right side, when cells were put back into standard conditioned medium after 8 passages in the test conditions, expression of all four markers returned essentially to original levels. This shows that alterations in expression profiles in media Conditions 4 to 8 are temporary and reversible—consistent with the cells retaining full pluripotency.
Sequence Data
[0147]
12
TABLE 12
|
|
|
Sequences Listed in this Disclosure
|
SEQ. ID NO:
Designation
Reference
|
|
1
hTERT mRNA sequence
GenBank Accession NM_003129
|
2
hTERT protein sequence
GenBank Accession NM_003129
|
3
Oct 3/4 mRNA sequence
GenBank Accession NM_002701
|
4
Oct 3/4 protein sequence
GenBank Accession NM_002701
|
5
Cripto mRNA sequence
GenBank Accession NM_003212
|
6
Cripto protein sequence
GenBank Accession NM_003212
|
7
podocalyxin-like protein mRNA sequence
GenBank Accession NM_005397
|
8
podocalyxin-like protein amino acid sequence
GenBank Accession NM_005397
|
9
GRP receptor mRNA sequence
GenBank Accession NM_005314
|
10
GRP receptor proteins sequence
GenBank Accession NM_005314
|
11 to 81
Primers & probes for real-time PCR assay
This disclosure
|
82-100
Human telomeric repeats
U.S. Pat. No. 5,583,016
|
101
Geron sequence designation GA_12064
This disclosure
|
102
Geron sequence designation GA_23176
This disclosure
|
103
Geron sequence designation GA_23468
This disclosure
|
104
Geron sequence designation GA_23476
This disclosure
|
105
Geron sequence designation GA_23484
This disclosure
|
106
Geron sequence designation GA_23485
This disclosure
|
107
Geron sequence designation GA_23486
This disclosure
|
108
Geron sequence designation GA_23487
This disclosure
|
109
Geron sequence designation GA_23488
This disclosure
|
110
Geron sequence designation GA_23489
This disclosure
|
111
Geron sequence designation GA_23490
This disclosure
|
112
Geron sequence designation GA_23514
This disclosure
|
113
Geron sequence designation GA_23515
This disclosure
|
114
Geron sequence designation GA_23525
This disclosure
|
115
Geron sequence designation GA_23572
This disclosure
|
116
Geron sequence designation GA_23577
This disclosure
|
117
Geron sequence designation GA_23579
This disclosure
|
118
Geron sequence designation GA_23585
This disclosure
|
119
Geron sequence designation GA_23596
This disclosure
|
120
Geron sequence designation GA_23615
This disclosure
|
121
Geron sequence designation GA_23634
This disclosure
|
122
Geron sequence designation GA_23673
This disclosure
|
123
Geron sequence designation GA_23683
This disclosure
|
124
Geron sequence designation GA_23969
This disclosure
|
125
Geron sequence designation GA_24037
This disclosure
|
126
Geron sequence designation GA_32842
This disclosure
|
127
Geron sequence designation GA_32860
This disclosure
|
128
Geron sequence designation GA_32895
This disclosure
|
129
Geron sequence designation GA_32913
This disclosure
|
130
Geron sequence designation GA_32917
This disclosure
|
131
Geron sequence designation GA_32926
This disclosure
|
132
Geron sequence designation GA_32947
This disclosure
|
133
Geron sequence designation GA_32979
This disclosure
|
134
Geron sequence designation GA_32985
This disclosure
|
135
Geron sequence designation GA_35405
This disclosure
|
136
Geron sequence designation GA_38029
This disclosure
|
137
Geron sequence designation GA_7542
This disclosure
|
138
Geron sequence designation GA_8667
This disclosure
|
139
Geron sequence designation GA_9014
This disclosure
|
|
[0148]
13
|
|
LOCUS
TERT 4015 bp mRNA linear PRI 31-OCT-2000
SEQ. ID NO: 1
|
DEFINITION
Homo sapiens
telomerase reverse transcriptase (TERT), mRNA.
|
ACCESSION
NM_003219
|
AUTHORS
Nakamura, T. M., Morin, G. B., Chapman, K. B., Weinrich, S. L.,
|
Andrews, W. H., Lingner, J., Harley, C. B. and Cech, T. R.
|
TITLE
Telomerase catalytic subunit homologs from fission yeast and human
|
JOURNAL
Science 277 (5328), 955-959 (1997)
|
CDS
56..3454
|
LOCUS
POU5F1 1158 bp mRNA linear PRI 31-OCT-2000
SEQ. ID NO: 3
|
DEFINITION
Homo sapiens
POU domain, class 5, transcription factor 1 (POU5F1),
|
mRNA.
|
ACCESSION
NM_002701
|
AUTHORS
Takeda, J., Seino, S. and Bell, G. I.
|
TITLE
Human Oct3 gene family: cDNA sequences, alternative splicing, gene
|
organization, chromosomal location, and expression at low levels in
|
adult tissues
|
JOURNAL
Nucleic Acids Res. 20 (17), 4613-4620 (1992)
|
CDS
102..899
|
LOCUS
TDGF1 2033 bp mRNA linear PRI 05-NOV-2002
SEQ. ID NO: 5
|
DEFINITION
Homo sapiens
teratocarcinoma-derived growth factor 1 (TDGF1), mRNA.
|
ACCESSION
NM_003212
|
AUTHORS
Dono, R., Montuori, N., Rocchi, M., De Ponti-Zilli, L., Ciccodicola, A.
|
and Persico, M. G.
|
TITLE
Isolation and characterization of the CRIPTO autosomal gene and its
|
X-linked related sequence
|
JOURNAL
Am. J. Hum. Genet. 49 (3), 555-565 (1991)
|
CDS
248..814
|
LOCUS
PODXL 5869 bp mRNA linear PRI 01-NOV-2000
SEQ. ID NO: 7
|
DEFINITION
Homo sapiens
podocalyxin-like (PODXL), mRNA.
|
ACCESSION
NM_005397
|
AUTHORS
Kershaw, D. B., Beck, S. G., Wharram, B. L., Wiggins, J. E., Goyal,
|
M., Thomas, P. E. and Wiggins, R. C.
|
TITLE
Molecular cloning and characterization of human podocalyxin-like
|
protein. Orthologous relationship to rabbit PCLP1 and rat
|
podocalyxin
|
JOURNAL
J. Biol. Chem. 272 (25), 15708-15714 (1997)
|
CDS
251..1837
|
LOCUS
GRPR 1726 bp mRNA linear PRI 05-NOV-2002
SEQ. ID NO: 9
|
DEFINITION
Homo sapiens
gastrin-releasing peptide receptor (GRPR), mRNA.
|
ACCESSION
NM_005314
|
AUTHORS
Xiao, D., Wang, J , Hampton, L. L. and Weber, H. C.
|
TITLE
The human gastrin-releasing peptide receptor gene structure, its
|
tissue expression and promoter
|
JOURNAL
Gene 264 (1), 95-103 (2001)
|
CDS
399..1553
|
|
Bone Marrow Stromal antigen
|
Forward primer: ACCTGCAACCACACTGTGATG
SEQ. ID NO: 11
|
Probe: 6fam-CCCTAATGGCTTCCCTGGATGCAGA-tam
SEQ. ID NO: 12
|
Reverse Primer: TTTCTTTTGTCCTTGGGCCTT
SEQ. ID NO: 13
|
Podocalyxin-like
|
Forward primer: GCTCGGCATATCAGTGAGATCA
SEQ. ID NO: 14
|
Probe: 6fam-TCTCATCCGAAGCGCCCCCTG-tam
SEQ. ID NO: 15
|
Reverse Primer: AGCTCGTCCTGAACCTCACAG
SEQ. ID NO: 16
|
Rat GPC/glpican-2 (cerebroglycan)
|
Forward primer: CTGGAAGAAATGTGGTCAGCG
SEQ. ID NO: 17
|
Probe: 6fam-AGCGCTTAAGGTGCCGGTGTCTGAAG-tam
SEQ. ID NO: 18
|
Reverse Primer: CATCAGAGCCTGGCTGCAG
SEQ. ID NO: 19
|
Potassium channel subfamily k member 5 (TASK-2)
|
Forward primer: ACCATCGGCTTCGGTGAC
SEQ. ID NO: 20
|
Probe: 6fam-TGTGGCCGGTGTGAACCCCA-tam
SEQ. ID NO: 21
|
Reverse Primer: TACAGGGCGTGGTAGTTGGC
SEQ. ID NO: 22
|
Notch 1 protein
|
Forward primer: TGAGAGCTTCTCCTGTGTCTGC
SEQ. ID NO: 23
|
Probe: 6fam-CAAGGGCAGACCTGTGAGGTCGACA-tam
SEQ. ID NO: 24
|
Reverse Primer: GGGCTCAGAACGCACTCGT
SEQ. ID NO: 25
|
Teratocarcinoma-derived growth factor 1 (Cripto)
|
Forward primer: TGAGCACGATGTGCGCA
SEQ. ID NO: 26
|
Probe: 6fam-AGAGAACTGTGGGTCTGTGCCCCATG-tam
SEQ. ID NO: 27
|
Reverse Primer: TTCTTGGGCAGCCAGGTG
SEQ. ID NO: 28
|
Nel 1 like/NELL2 (Nel-like protein 2)
|
Forward primer: CTTAAGTCGGCTCTTGCGTATGT
SEQ. ID NO: 29
|
Probe: 6fam-ATGGCAAATGCTGTAAGGAATGCAAATCG-tam
SEQ. ID NO: 30
|
Reverse Primer: AAGTAGGTTCGTCCTTGAAATTGG
SEQ. ID NO: 31
|
Gastrin releasing peptide receptor
|
Forward primer: CCGTGGAAGGGAATATACATGTC
SEQ. ID NO: 32
|
Probe: 6fam-AGAAGCAGATTGAATCCCGGAAGCGA-TAM
SEQ. ID NO: 33
|
Reverse Primer: CACCAGCACTGTCTTGGCAA
SEQ. ID NO: 34
|
Bone morphogenetic protein receptor
|
Forward primer: CAGATTATTGGGAGCCTATTTGTTC
SEQ. ID NO: 35
|
Probe: 6fam-TCATTTCTCGTGTTCAAGGACAGAATCTGGAT-tam
SEQ. ID NO: 36
|
Reverse Primer: CATCCCAGTGCCATGAAGC
SEQ. ID NO: 37
|
ABC G2-ABC transporter
|
Forward primer: GGCCTCAGGAAGACTTATGT
SEQ. ID NO: 38
|
Probe: SYBR Green Detection Method
|
Reverse Primer: AAGGAGGTGGTGTAGCTGAT
SEQ. ID NO: 39
|
Solute carrier family 6, member 8 (SLC6A8)
|
Forward primer: CCGGCAGCAT CAATGTCTG
SEQ. ID NO: 40
|
Probe: 6fam-TCAAAGGCCTGGGCTACGCCTCC-tam
SEQ. ID NO: 41
|
Reverse Primer: GTGTTGCAGTAGAAGACGATCACC
SEQ. ID NO: 42
|
Oct 3/4 octamer-binding trasncription factor 3a (oct3a) (oct-4)
|
Forward primer: GAAACCCACACTGCAGCAGA
SEQ. ID NO: 43
|
Probe: 6fam-CAGCCACATCGCCCAGCAGC-TAM
SEQ. ID NO: 44
|
Reverse Primer: CACATCCTTCTCGAGCCCA
SEQ. ID NO: 45
|
Left-right determination factor b (LEFTB)
|
Forward primer: TGCCGCCAGGAGATGTACA
SEQ. ID NO: 46
|
Probe: 6fam-TGGGCCGAGAACTGGGTGCTG-tam
SEQ. ID NO: 47
|
Reverse Primer: TCATAAGCCAGGAAGCCCG
SEQ. ID NO: 48
|
Secreted phosphoprotein 1 (osteopontin)
|
Forward primer: TTGCAGCCTTCTCAGCCAA
SEQ. ID NO: 49
|
Probe: 6fam-CGCCGACCAAGGAAAACTCACTACCA-tam
SEQ. ID NO: 50
|
Reverse Primer: GGAGGCAAAAGCAAATCACTG
SEQ. ID NO: 51
|
Gamma-aminobutyric aci (GABA) A receptor, beta 3
|
Forward primer: CCGTCTGGTCTCGAGGAATG
SEQ. ID NO: 52
|
Probe: 6fam-TCTTCGCCACAGGTGCCTATCCTCG-tam
SEQ. ID NO: 53
|
Reverse Primer: TCAACCGAAAGCTCAGTGACA
SEQ. ID NO: 54
|
Roundabout, axon guidance receptor, homologue 1 (ROBO1)
|
Forward primer: GAGAGGAGGCGAAGCTGTCA
SEQ. ID NO: 55
|
Probe: 6fam-CAGTGGAGGGAGGCCTGGACTTCTC-tam
SEQ. ID NO: 56
|
Reverse Primer: GCGGCAGGTTCACTGATGT
SEQ. ID NO: 57
|
Glucagon receptor
|
Forward primer: CCACACAGACTACAAGTTCCGG
SEQ. ID NO: 58
|
Probe: 6fam-TGGCCAAGTCCACGCTGACCCT-tam
SEQ. ID NO: 59
|
Reverse Primer: CTTCGTGGACGCCCAGC
SEQ. ID NO: 60
|
Leucine-rich PPR-motif hum 130 kda hum 130 kd leu
|
Forward primer: GCAGCAGACCCCTTCTAGGTTAG
SEQ. ID NO: 61
|
Probe: 6fam-ACCCGTGTCATCCAGGCATTGGC-tam
SEQ. ID NO: 62
|
Reverse Primer: TGAACTACTTCTATGTTTTCAACATCACC
SEQ. ID NO: 63
|
Thy-1 co-transcribed
|
Forward primer: AGCCTCCAAGTCAGGTGGG
SEQ. ID NO: 64
|
Probe: 6fam-CAGAGCTGCACAGGGTTTGGCCC-TAM
SEQ. ID NO: 65
|
Reverse Primer: GGAGGAAGTGCCTCCCTTAGA
SEQ. ID NO: 66
|
Solute carrier family 21
|
Forward primer: GCGTCACCTACCTGGATGAGA
SEQ. ID NO: 67
|
Probe: 6fam-CCAGCTGCTCGCCCGTCTACATTG-tam
SEQ. ID NO: 68
|
Reverse Primer: TGGCCGCTGTGTAGAAGATG
SEQ. ID NO: 69
|
LY6H lympohocyte antigen 6 complex locus H
|
Forward primer: CGAATCACCGATCCCAGC
SEQ. ID NO: 70
|
Probe: 6fam-CAGCAGGAAGGATCACTCGGTGAACAA-tam
SEQ. ID NO: 71
|
Reverse Primer: CGAAGTCACAGGAGGAGGCA
SEQ. ID NO: 72
|
Plexin (PLXNB3)
|
Forward primer: GAGAAGGTGTTGGACCAAGTCTACA
SEQ. ID NO: 73
|
Probe: 6fam-CCTCAGTGCATGCCCTAGACCTTGAGTG-tam
SEQ. ID NO: 74
|
Reverse Primer: CTTCGTCCGATAGGGTCAGG
SEQ. ID NO: 75
|
ICAM
|
Forward primer: ACTCCAGAACGGGTGGAACTG
SEQ. ID NO: 76
|
Probe: 6fam-ACCCCTCCCCTCTTGGCAGCC-tam
SEQ. ID NO: 77
|
Reverse Primer: CGTAGGGTAAGGTTCTTGCCC
SEQ. ID NO: 78
|
Rhodopsin
|
Forward primer: CCGGCTGGTCCAGGTACAT
SEQ. ID NO: 79
|
Probe: 6fam-CCGAGGGCCTGCAGTGCTCG-tam
SEQ. ID NO: 80
|
Reverse Primer: TTGAGCGTGTAGTAGTCGATTCCA
SEQ. ID NO: 81
|
|
[0149] The subject matter provided in this disclosure can be modified as a matter of routine optimization, without departing from the spirit of the invention, or the scope of the appended claims.
Claims
- 1. A method for assessing a culture of undifferentiated primate pluripotent stem (pPS) cells or their progeny, comprising detecting or measuring expression of two or more of the markers in any of Tables 5 to 9, other than hTERT or Oct 3/4.
- 2. The method of claim 1, comprising measuring expression of two or more of the markers in Tables 2, 7, and 9(C), and correlating the expression measured with the presence of undifferentiated embryonic stem (ES) cells in the culture.
- 3. The method of claim 1, comprising measuring expression of two or more of the markers in Tables 3 and 8, and correlating the expression measured with the presence of differentiated cells in the culture.
- 4. The method of claim 1, comprising detecting or measuring expression of one or more of the following markers: bone marrow stromal antigen; Podocalyxin-like; Rat GPC/glypican-2 (cerebroglycan); Potassium channel subfamily k member 5 (TASK-2); Notch 1 protein; Teratocarcinoma-derived growth factor 1 (Cripto); Nel 1 like/NELL2 (Nel-like protein 2); Gastrin releasing peptide receptor; Bone morphogenetic protein; ABCG2- ABC transporter; Solute carrier family 6, member 8 (SLC6A8); hTERT; Oct 3/4 Octamer-binding transcription factor 3a (Oct-3a) (Oct-4); Left-right determination factor b (LEFT); Secreted phosphoprotein 1 (osteopontin); Gamma-aminobutyric acid (GABA) A receptor, beta 3; Roundabout, axon guidance receptor, homologue 1 (ROBO1); Glucagon receptor; Leucine-rich ppr-motif hum 130 kDa hum130leu 130 kd leu; Thy-1 co-transcribed; Solute carrier family 21; LY6H lymphocyte antigen 6 complex locus H; Plexin (PLXNB3); Armadillo repeat protein deleted in velo-cardio-facial syndrome; and Ephrin type-a receptor 1 (EPHA1).
- 5. The method of claim 1, comprising detecting or measuring expression of three or more of said markers.
- 6. The method of claim 1 further comprising detecting or measuring expression of hTERT and/or Oct 3/4.
- 7. A method for assessing a culture of undifferentiated primate pluripotent stem (pPS) cells or their progeny, comprising detecting or measuring:
a marker from the following list: Cripto, gastrin-releasing peptide (GRP) receptor, and podocalyxin-like protein; and either hTERT and/or Oct 3/4, or a second marker from the list.
- 8. The method of claim 7, comprising detecting or measuring at least two markers from the list.
- 9. The method of claim 7, comprising detecting or measuring at least two markers from the list, and detecting or measuring hTERT and/or Oct 3/4.
- 10. The method of claim 7, comprising detecting or measuring Cripto, gastrin-releasing peptide (GRP) receptor podocalyxin-like protein, hTERT, and Oct 3/4.
- 11. The method of claim 1, wherein expression of the marker(s) is detected or measured at the mRNA level by PCR amplification.
- 12. The method of claim 1, wherein expression of the marker(s) is detected or measured at the protein or enzyme product level by antibody assay.
- 13. The method of claim 1, comprising quantifying the proportion of undifferentiated pPS cells or differentiated cells in the culture from said marker expression.
- 14. The method of claim 1, comprising assessing the ability of a culture system or component thereof to maintain pPS cells in an undifferentiated state from said marker expression.
- 15. The method of claim 14, comprising assessing the ability of a soluble factor to maintain pPS cells in an undifferentiated state from said marker expression.
- 16. The method of claim 14, comprising assessing the ability of a culture medium to maintain pPS cells in an undifferentiated state from said marker expression.
- 17. The method of claim 14, comprising assessing the ability of a preparation of feeder cells to maintain pPS cells in an undifferentiated state from said marker expression.
- 18. The method of claim 1, comprising assessing the ability of a culture system or component thereof to cause differentiation of pPS cells into a culture of lineage-restricted precursor cells and/or terminally differentiated cells.
- 19. The method of claim 1, comprising assessing the suitability of a pPS cell culture for preparing cells for human administration.
- 20. The method of claim 7, wherein the level of the marker is determined to be at least 100-fold higher than the level of the marker in BJ fibroblasts.
- 21. A method for assessing the growth characteristics of a cell population, comprising detecting or measuring expression of two or more of the markers in any of Tables 5 to 9, at least one of which is neither hTERT nor Oct 3/4.
- 22. The method of claim 21, comprising detecting or measuring:
a marker from the following list: Cripto, gastrin-releasing peptide (GRP) receptor, and podocalyxin-like protein; and either hTERT and/or Oct 3/4, or a second marker from the list.
- 23. The method of claim 21, wherein the cell population has been obtained by culturing cells from a human blastocyst.
- 24. The method of claim 23, comprising determining whether the cell population is pluripotent from said marker expression.
- 25. The method of claim 21, wherein the cell population has been obtained from a human patient suspected of having a clinical condition related to abnormal cell growth.
- 26. The method of claim 25, comprising assessing whether the patient has a malignancy from said marker expression.
- 27. A method for maintaining pPS cells in a pluripotent state, comprising causing them to express one of the following markers at a higher level:
Forkhead box O1A (FOXO1A); Zic family member 3 (ZIC3); Hypothetical protein FLJ20582; Forkhead box H1 (FOXH1); Zinc finger protein, Hsal2; KRAB-zinc finger protein SZF1-1; or Zinc finger protein of cerebellum ZIC2; or any other marker listed in Table 5 with the symbol “{circle over (x)}”.
- 28. The method of claim 27, wherein the cells are caused to express the marker by genetically altering it with a gene that encodes the marker.
- 29. A method for causing pPS cells to differentiate into a particular tissue type, comprising causing them to express one of the following markers at a lower level:
Forkhead box O1A (FOX01A); Zic family member 3 (ZIC3); Hypothetical protein FLJ20582; Forkhead box H1 (FOXH1); Zinc finger protein, Hsal2; KRAB-zinc finger protein SZF1-1; or Zinc finger protein of cerebellum ZIC2; or any other maker listed in Table 5 with the symbol “{circle over (x)}”; or by causing them to express one of the markers listed in Table 6 with the symbol “{circle over (x)}” at a higher level.
- 30. The method of claim 29, wherein the cells are caused to express the marker by genetically altering it with a gene that encodes the marker.
- 31. A method for maintaining pPS cells in a pluripotent state, comprising culturing pPS cells or their progeny in the presence of a normally secreted protein that is encoded by a gene listed in Table 2, 5, 7, or 9.
- 32. A method for causing pPS cells to differentiate, comprising culturing pPS cells or their progeny in the presence of a normally secreted protein that is encoded by a gene listed in Table 3, 6, or 8.
- 33. A method for causing an encoding sequence to be preferentially expressed in undifferentiated pPS cells, comprising genetically altering pPS cells with the encoding sequence under control of a promoter for one of the markers listed in Table 2, 5, or 7.
- 34. The method of claim 33, further comprising selecting undifferentiated cells, wherein the encoding sequence is a reporter gene (such as a gene that causes the cells to emit fluorescence), or a positive selection marker (such as a drug resistance gene).
- 35. The method of claim 33, further comprising depleting undifferentiated cells from a population of differentiated cells, wherein the encoding sequence is a negative selection marker (such as a gene that activates apoptosis or converts a prodrug into a compound that is toxic to the cell).
- 36. A method for causing an encoding sequence to be preferentially expressed in differentiated cells, comprising genetically altering the pPS cells with the encoding sequence under control of a promoter for one of the markers listed in Table 3, 6, or 8.
- 37. The method of claim 36, further comprising selecting differentiated cells, wherein the encoding sequence is a reporter gene (such as a gene that causes the cells to emit fluorescence), or a positive selection marker (such as a drug resistance gene).
- 38. The method of claim 36, further comprising depleting differentiated cells from a population of undifferentiated cells, wherein the encoding sequence is a negative selection marker (such as a gene that activates apoptosis or converts a prodrug into a compound that is lethal to the cell).
- 39. A method for sorting differentiated cells from less differentiated cells, comprising separating cells expressing a surface marker in any of Tables 5 to 9 from cells not expressing the marker.
- 40. The method of claim 39, wherein the cells are sorted using an antibody or lectin that binds the marker or product thereof on the cell surface.
- 41. A method for causing pPS cells to proliferate without differentiation, comprising culturing them in a culture system assessed according to the method of claim 6.
- 42. A method for causing pPS cells to proliferate without differentiation, comprising culturing them with mesenchymal stem cells.
- 43. A method for identifying genes that are up- or down-regulated during differentiation of pPS cells, comprising:
a) sequencing transcripts in an expression library from undifferentiated pPS cells; b) sequencing transcripts in one or more expression libraries from one or more cell types that have differentiated from the same line of pPS cells; c) determining the frequency of transcripts from each gene sequenced in each of the libraries; and d) identifying the gene as being up- or down-regulated during differentiation of the pPS cells if the frequency of transcripts in the library from the undifferentiated pPS cells is statistically different from the frequency of transcripts in one or more libraries from the differentiated cell types.
- 44. The method of claim 43, further comprising assessing a culture of pPS cells depending on the expression level in the culture of a marker identified in step d).
- 45. A kit for assessing a culture of pPS cells according to claim 1, comprising polynucleotide probes and/or primers for specifically amplifying a transcript for two or more markers in any of Tables 5 to 9, accompanied by written instructions for assessing the pPS cells or their progeny according to the expression of said markers measured using the probes or primers in the kit.
- 46. A kit for assessing a culture of pPS cells according to claim 1, comprising antibodies specific for each gene product of two or more markers in any of Tables 5 to 9, accompanied by written instructions for assessing the pPS cells or their progeny according to the expression of said markers measured using the antibodies in the kit.
- 47. The method of claim 1, wherein the pPS cells are obtained from a human blastocyst, or are the progeny of such cells.
- 48. The method of claim 1, wherein the pPS cells are human embryonic stem cells.