The present invention relates to methods and compositions for performing functional genomics and gene trapping using haploid or hypodiploid cells, including haploid or hypodiploid vertebrate cells, in combination with high throughput imaging. In a particular aspect, the present invention relates to methods for identifying genes involved in cellular signaling pathways.
Gene trapping or random insertional mutagenesis is a method used to discover genes responsible for a particular phenotypic characteristic of an organism. Traditionally a mutagenic element, sometimes also containing a reporter element, is introduced in a stochastic and random way into the genome of embryonic stem (ES) cells by means of a viral vector and/or electroporation. The randomly-mutagenized ES cell lines are characterized and then possibly selected on the basis of some morphological, biochemical or other criterion, then injected into blastocysts, which are implanted into females and go on to form chimaeric animals. Animal lines harboring the mutation of interest in the germline tissue are then bred to homozygosity and the resulting phenotype studied in the whole, mutant animal, or in some tissue or cell of interest taken from the mutant animal.
The process of generating mutant animals in this fashion is very time consuming, as well as being labor and cost intensive to the point of being prohibitive for many research facilities. Likewise, the time involved is also substantial requiring many months before experiments on the “gene-trapped” animal can begin. The motivation to use such a system is that most commonly used cell lines are diploid and as such insertion of a mutagenic element will at most probably affect only one of two alleles of a gene present in the genome. It is unsafe to assume, and indeed very unlikely, that inactivation of a single allele will be sufficient to eliminate totally the function of a particular gene, thereby necessitating elimination of both alleles of a diploid cell line.
Accordingly, there is a need in the art for high-throughput screening methods which allow the use of gene-trapping and functional genomics but do not require the generation of live animals.
In accordance with the present invention, it has been determined that, if one applies insertional mutagenesis to a cell line that is haploid or hypodiploid, mutation and inactivation of any single gene results in elimination of the function of that gene as there is only a single copy of the gene represented in the haploid or hypodiploid cell line. This is of substantial benefit because performing gene-trapping (e.g., insertional mutagenesis) and functional genomics methods in diploid cells is impractical because of the near-impossibility of knocking out both copies of any particular gene.
Gene trapping in haploid or hypodiploid cells can be used to look at any interesting morphological response which can include such responses as changes in cell size, changes in cell shape, changes in cell number, changes in cell migration, changes in the subcellular distribution or concentration of anything that can be visualized, and the like. Indeed, there are already algorithms commercially available to quantify most any type of morphology which one might choose to study. One of the main differences is in the quality of the algorithms.
As will be readily recognized by those of skill in the art, the utility of the present invention extends to any of the above-referenced applications, morphologies or interests. While a focus of the present specification may be directed to the specific chemical conversion of palmitoylation, this is merely a useful model system for demonstration of the concepts embraced by the invention methods.
The introduction of the mutagenic element into a haploid or hypodiploid cell can occur via a variety of mechanisms, e.g., employing retrovirus or electroporation. For example, a stable haploid cell line exists and is available commercially from ATCC (Accession No. CCL-145). This cell line is fibroblast-like, and adherent, two properties that make it useful for microscopic imaging. These cells provide a genomic composition which facilitates carrying out functional genomics experiments such as random insertional mutagenesis using high-throughput/high content microscopy (HCM).
In accordance with another aspect of the present invention, haploid or hypodiploid lines have been developed from other animals, including mouse and human. Following mutagenesis, cellular morphological or physiological readouts selected to identify specific genes that alter the morphology or physiology of interest can be carried out using HCM.
Other features and advantages of the invention will be apparent from the following detailed description and claims.
In accordance with the present invention, there are provided assays which result from combining gene trapping in haploid (or hypodiploid) cells with HTM. As will be appreciated by those of skill in the art, numerous biological phenomena can be beneficially explored using invention methodology. Such phenomena include prenylation (e.g., famesylation and geranylgeranylation), acylation (e.g., palmitoylation and myristoylation), regulation of transcription, the action of steroids and steroid-like compounds, neuroregeneration and spinal cord repair, cell cycle regulation, stem cells and neuronal stem cells (i.e., to enable an understanding of signal transduction pathways that determine/regulate cell developmental fate and provide a specific means by which to regulate or manipulate cell fate, cell migration, filopodia, GPCR-related signaling, phosphorylation, contact-inhibition of cell growth, tumorigenesis, identification of the molecular targets of drugs with unknown mechanism(s) of action, and the like, as well as any signaling pathway or cellular process in which a unique morphological metric can be tied (either directly or indirectly) to that process. As readily recognized by those of skill in the art, there is with near absolute certainty, some such marker for every pathway or process in a cell. By way of illustration, exemplary metrics which may be monitored to investigate the biological phenomena set forth above are presented in the following table.
All of the above-described metrics can be monitored in a variety of ways, e.g., by selective labeling of the moving/translocating, elongating/granulating, component with fluorescent proteins or antibodies or affinity tag labels (such as FlAsH, ReAsH or Hoechst; see, for example, Adams et al., 2002; Leubke, 1998; Marek and Davis, 2002; Tour et al., 2003; Griffin et al., 2000; and Park et al., 2004). The homogeneity of a given response can be dramatically increased (i.e. reduction in the biological noise component of the assay).
In accordance with a specific embodiment of the present invention, there are provided methods for identifying a gene that modulates subcellular localization of a protein, said methods comprising:
As used herein, the term “contacting” refers to any suitable means of bringing a DNA- or RNA-based mutagenic element into physical contact with the cell and passing the mutagenic element from the exterior of the cell to the interior. As readily recognized by those of skill in the art, this can be accomplished in a variety of ways, such as, for example, by use of a viral infective particle, electroporation of naked DNA or RNA, transfection of naked DNA or RNA employing a lipid-based transfection reagent, transfection of naked DNA or RNA employing high-intensity sound or a high-density microscopic particle (e.g., gold) introduced at high-velocity (biolistic transfection), and the like.
As used herein, the term “detecting” refers to any of a variety of means that can be used to identify (e.g., quantitatively or semi-quantitatively) the occurrence of any change (or plurality of changes) of any type, no matter how subtle, that can be captured by a recording device such as a camera, fluorimeter, luminometer, photodiode, PMT, methods/photophysical phenomena (such as fluorescence resonance energy transfer, fluorescence polarization, anisotropy, fluorescence lifetimes, coloration or fluorescence resulting from an enzymatic reaction with a chromogenic or fluorigenic substrate), and the like.
As used herein, the term “identifying” refers to the delineation of any of a variety of parameters (or sets of parameters) that describe the normal or basal state of a cell population, relative to cell(s) that fall outside of the normal range. Such cells are typically subjected to further analysis. For example, cells that fall outside of the normal range may do so because of a change in any one or more morphometric properties (as described in greater detail herein). Subcategorization of the cells that have entered the non-normal population will determine whether the mutation has introduced a desired affect or not.
Stated more generally, the present invention provides methods for identifying a gene that modulates cell morphology, said methods comprising:
As used herein, “cell morphology” refers to any feature (or combination of features) of a cell (or cell population) which can be detected (by any available means), such as, for example, cell size, cell shape, cell volume, quantity and/or distribution of subcellular components (e.g., organelles, macromolecular (hetero and homo) complexes, single molecules, and the like), morphologies in the cell population as a whole (e.g., islands, spheres, spheroids, clumps, striations, waves, and the like), behaviors of the cell population as a whole (e.g., adherence, or lack of adherence to the substrate, migration of single or groups of cells, repulsive or attractive properties between a single cell or groups of cells, and the like), the stage of the cell cycle (e.g., G0, G1, G2, S or M phases), the degree of cell differentiation, and the like.
In accordance with another specific embodiment of the present invention, there are provided methods for identifying a gene that modulates subcellular localization of a protein, said methods comprising:
Stated more generally, the above-described embodiment of present invention provides methods for identifying a gene that modulates cell morphology, said methods comprising:
In accordance with an alternative embodiment of the present invention, there are provided methods for identifying a gene that modulates subcellular localization of a protein, said methods comprising:
Stated more generally, the above-described embodiment of the present invention provides methods for identifying a gene that modulates cell morphology, said methods comprising:
In accordance with yet another alternative embodiment of the present invention, there are provided methods for identifying a gene that modulates subcellular localization of a protein, said methods comprising:
Stated more generally, the above-described embodiment of the present invention provides methods for identifying a gene that modulates cell morphology, said methods comprising:
In accordance with still another alternative embodiment of the present invention, there are provided methods for identifying a gene that modulates subcellular localization of a protein, said methods comprising identifying, in those modified haploid or hypodiploid cells which reveal a change in the subcellular localization of the protein upon introduction of an insertional mutagen thereto, the gene into which the insertional mutagen is inserted, thereby identifying the gene that modulates subcellular localization of the protein.
In accordance with another aspect of the present invention, there are provided methods for identifying a gene that modulates cell morphology, said methods comprising identifying, in those modified haploid or hypodiploid cells which reveal a change in the cell morphology upon introduction of an insertional mutagen thereto, the gene into which the insertional mutagen is inserted, thereby identifying the gene that modulates cell morphology.
In accordance with yet another aspect of the present invention, there are provided methods for determining the enzymatic cascade of genes responsible for a protein modification of interest, said methods comprising:
In accordance with an alternate embodiment of the present invention, there are provided methods for determining the enzymatic cascade of genes responsible for a protein modification of interest, said methods comprising:
In accordance with another alternative embodiment of the present invention, there are provided methods for determining the enzymatic cascade of genes responsible for a protein modification of interest, said methods comprising identifying a change in phenotype in modified haploid or hypodiploid cells prepared by randomly mutagenizing a haploid or hypodiploid cell, optionally containing one or more stably expressed marker gene(s), with a mutagenic element, thereby identifying a gene in the enzymatic cascade of interest.
In accordance with still another aspect of the present invention, there are provided stable, haploid or hypodiploid lines expressing a detectable marker, operably associated with a substrate for a reaction of interest. As readily recognized by those of skill in the art, a variety of detectable markers can be employed herein, such as, for example, a fluorophore, a chromophore, a chromogenic substrate, and the like.
Presently preferred detectable markers contemplated for use herein are fluorophores, such as Green Fluorescent Protein (GFP).
As readily recognized by those of skill in the art, invention cell lines can be employed to study a wide variety of reactions of interest, such as, for example, palmitoylation, prenylation, acylation, regulation of transcription, the action of steroids and steroid-like compounds, neuroregeneration and spinal cord repair, cell cycle regulation, stem cells and neuronal stem cells, cell migration, filopodia, GPCR-related signaling, phosphorylation, contact-inhibition of cell growth, tumorigenesis, identification of the molecular targets of drugs with unknown mechanism(s) of action, and any signaling pathway or cellular process in which a unique morphological metric can be tied (either directly or indirectly) to that process, and the like.
For example, when the reaction of interest is palmitoylation, the substrate employed herein will typically be a short peptide, such as, for example, an S-palmitoylation substrate.
Recent advances in machine visions have resulted in an explosion of the application of imaging to cell biology. Several companies including Q3DM (San Diego, Calif.) have developed microscopic platforms and machine vision algorithms having such a degree of sophistication that they are being implemented by major pharmaceutical companies for mainstream drug discovery and by academic labs with a need for extremely quantitative analyses of morphological events in cell biology. The combination of functional genomics like gene trapping and HCM has tremendous untapped potential.
An example shows how useful the combination of a haploid cell line and HCM is (see also
Determining the genes into which the mutagenic element was introduced, resulting in the redistribution of GFP from the membrane to the cytoplasm, is readily accomplished, employing standard procedures. The task of screening tens of thousands of individual mutated cell lines for a gross change in the distribution of a fluorescent reporter like GFP is readily accomplished employing an HCM system like the EIDAQ100 from Q3DM.
Existing algorithms allow for discriminating morphological details such as the redistribution of a fluorescent reporter from the membrane to the cytoplasm. Such algorithms are also capable of discriminating interesting mutants from mutants benign to the morphology of interest with enough speed to completely assay the entire genome three times per day (assuming that the genome consists of roughly 30,000 unique genes). In contrast, this process would take many months of expensive labor if scoring such changes by eye.
The process described above can readily be applied to any case where there is a change in physiology or morphology and an algorithm exists that can describe and quantitate the event. A large library of metrics currently exists, making it relatively simple to tailor a system to ones needs. The combination of HCM with the use of haploid or hypodiploid cells is completely unique and has the potential to rapidly elucidate all steps of signaling cascades that regulate processes that can be detected using fluorescence microscopy.
Many proteins are concentrated on the plasma membrane (PM), trapped in specialized subcellular regions, like synapses and caveolae, by virtue of their lipid modifications. Thio-acylation or S-palmitoylation, a common form of lipid modification, is unique in that it is reversible and dynamic, suggesting a modulatory role in signal transduction similar to phosphorylation. Recent data indicate that proper, dynamic regulation of the palmitoylation of PSD-95, an abundant scaffolding protein in the synapse, is critical for synaptic organization and function, linking palmitoylation to complex processes such as learning, memory and disease. Consistent with this understanding, it is known that mutations in one gene regulating S-palmitoylation result in a severe neurodegenerative disorder, infantile neuronal ceroid lipofuscinosis or ICNL. Additionally, a candidate gene for the regulation of S-palmitoylation is linked to schizophrenia. These examples highlight the importance of properly regulated S-palmitoylation to human health and disease.
Biochemical characterization of the enzymes responsible for S-palmitoylation (palmitoyl thio-acyl transferases, S-PATs) has been difficult and controversial; recent data from experiments in yeast add substantial weight to the view that such enzymes exist. To date, functional genomics discovery programs in vertebrate systems similar to those in yeast have been expensive and time consuming. The present invention addresses this problem by combining a novel form of gene-trapping in vertebrate cell cultures, with a fully automated readout in a high-throughput microscopy (HTM) format. The invention assay system enables one to test directly and functionally for the presence of S-PATs in vertebrates. The present invention further enables one to elucidate the entire enzymatic pathway for protein S-palmitoylation by quantitatively analyzing millions of cells from tens of thousands of “trapped” cell lines.
The system described herein, using S-palmitoylation as an exemplary pathway, provides critical information about the regulation of S-palmitoylation. The invention system also provides an invaluable experimental tool that can be extended to screens for other genes that regulate the subcellular distribution and concentration of proteins, enabling numerous applications in basic and therapeutic research.
Many proteins are concentrated on the PM by virtue of their lipid modifications. Recent data show that lipid modifications of proteins may well be the primary physical determinant for targeting to and retention of some proteins to membrane lipid microdomains such as synapses and caveolae (El-Husseini et al., 2000; El-Husseini Ael et al., 2002; Kanaani et al., 2002; Loranger & Linder, 2002; Topinka & Bredt, 1998; Zacharias et al., 2002). Fusion of Green Fluorescent Protein (GFP) to small-peptide substrates for lipid modification (e.g. the N-terminal 18 residues of GAP-43) has been shown to be sufficient to localize the fusion proteins to the PM in the absence of any other targeting signal (Zacharias et al., 2002). Similarly, mutagenic substitution of modifiable residues for ones which cannot be modified results in gross mislocalization and/or loss of function of the expressed proteins (Craven et al., 1999; Hiol et al., 2003; Osterhout et al., 2003; Wiedmer et al., 2003). It has also been shown that different lipid moieties induce partitioning into different lipid environments or lipid microdomains of cells (Melkonian et al., 1999; Moffett et al., 2000; Zacharias et al., 2002). Specific associations of proteins within such microdomains, whether mediated by attractive, protein-protein interactions, forced proximity or both, are critical components in the architecture of cellular communication. Lipid modifications undoubtedly play an important role in the creation and modulation of such protein associations. This point is amply demonstrated by the fact that the regulation of protein lipidation is known to be involved in several forms of cancer (Dinsmore & Bell, 2003; Ghobrial & Adjei, 2002), a severe neurodegenerative disorder, ICNL (Vesa et al., 1995) and possibly for schizophrenia (Liu et al., 2002). New optical methods and fluorescent sensors for characterizing the state of protein lipidation, protein localization and protein interactions (like those described herein) are consistently at the technological forefront, driving new discoveries in these fields and providing insight into the intricate, structured workings within cells and in networks of cells (Lippincott-Schwartz & Patterson, 2003; Weijer, 2003; Zacharias et al., 2000).
Prenyl and Acyl groups are the most common forms of protein lipid modifications (see
In contrast to prenylation, a detailed knowledge about the process of protein acylation is still lacking. Among the types of acylation, enzymatic processes regulating myristoylation have been characterized best and are reviewed in (Farazi et al., 2001; Rajala et al., 2000; Resh, 1999). Briefly, proteins that will become myristoylated begin with a consensus sequence Met-Gly-X-X-X-Ser/Thr (SEQ ID NO:4). The start Met is co-translationally, proteolytically removed and the myristate is added to the exposed N-terminal glycine via a stable amide bond. The formation of this bond is catalyzed by N-myristoyl transferase with a high degree of selectivity for 14-carbon myristate (Farazi et al., 2001; Rajala et al., 2000). N-terminal myristoylation often exists in combination with palmitoylation which can take at least two forms: N-palmitoylation (apparently rare) and S-palmitoylation (the most common). N-palmitoylation, first described for the protein sonic hedgehog (Pepinsky et al., 1998), is the addition of palmitic acid to the a-amide of Cys-24, which is proteolytically exposed to become the N-terminal residue of the functional protein. Addition of a palmitoyl group by an amide bond to the N-terminal glycine was recently shown to occur on the heterotrimeric G-protein, Gas (Kleuss & Krause, 2003). In contrast, S-palmitoylation is a reversible modification that occurs via a labile thioester bond with the thiol side chains of reactive cysteine residues on the cytoplasmic portions of a protein. Proteins that are palmitoylated are relatively abundant and diverse (Melkonian et al., 1999; Moffett et al., 2000) and the functional consequences of the modification vary depending on the protein. In general, palmitoylation increases the hydrophobicity of a protein, thereby affecting the degree of membrane association as well as sublocalization within a membrane. Once associated with the membrane, the palmitoyl group partitions primarily into cholesterol- and sphingolipid-rich lipid rafts (Moffett et al., 2000; Zacharias et al., 2002). The additional membrane avidity increases the likelihood that the palmitoylated protein will interact (forced proximity) with other membrane-bound or membrane-associated proteins, a phenomenon that is exemplified by the synaptic scaffolding protein, PSD-95 (Craven et al., 1999; El-Husseini et al., 2000; Perez & Bredt, 1998; Topinka & Bredt, 1998). The resulting complexes of interacting proteins assembled at sites of membrane specialization like synapses are critical nodes containing signaling proteins that are involved in every conceivable signaling pathway. Furthermore, modulation of the associative properties of individual proteins in these networks by reversibly palmitoylating some members, is a very attractive and plausible mechanism to regulate the participation of proteins in signaling events (el-Husseini Ael & Bredt, 2002; El-Husseini Ael et al., 2002; Hess et al., 1993; Qanbar & Bouvier, 2003).
The finding that the residence half life of the palmitoyl group on proteins is considerably shorter than the half life of the proteins to which it is attached (Lane & Liu, 1997; Wolven et al., 1997) suggests that enzymes for palmitate removal, protein palmitoylthioesterases or PPTases, could exist. PPT1 (Camp & Hofmann, 1993; Camp et al., 1994) is a lysosomal hydrolase that participates in the degradation of palmitoylated proteins by deacylating cysteine thioesters; acyl protein thioesterase 1 (APT1), a cytoplasmic protein first biochemically characterized as an acyl thioesterase by Duncan and Gilman (1998), is a member of the serine hydrolase, α/β fold family of lysophospholipases that has several additional substrate and lipid specificities (Duncan & Gilman, 1998). Regulated removal of the palmitoyl group from proteins is critical in human health; defects in PPT1 result in a severe neurodegenerative disorder known as infantile neuronal ceroid lipofuscinosis (ICNL) (Vesa et al., 1995). The role of PPT1 in ICNL was confirmed by targeted disruption of the gene in a mouse resulting in a model of the human disorder (Gupta et al., 2001).
In spite of the fact that the mechanism for depalmitoylation is reasonably well understood, elucidation of the mechanism for regulation of protein S-palmitoylation has lagged significantly behind. In one aspect of the present invention, elucidation of the enzymatic pathway that leads to protein palmitoylation is provided. There is currently no known amino acid consensus sequence for palmitoylation, suggesting either:
Human orthologs of yeast proteins have been identified from database queries. It remains to be seen, however, whether all DHHC-CRD proteins exhibit S-PAT activity, and, if so, whether they act alone, as hetero/homo-oligomers, or in concert with cofactors or other proteins. The first successful identification of these proteins took advantage of the genetic tractability of yeast. Similar genetic manipulations in vertebrate systems such as mice are far more labor intensive but often times required, for example when the mutant phenotype affects the development of the organism. However, when a mutation is expected to cause a visually identifiable change in the concentration or distribution of a protein, it becomes possible to reduce the experimental model to one where the gene of interest is mutated in a cultured cell line and the phenotype analyzed using microscopy. Specifically, using gene- and/or promoter-trap mutagenesis and examining the resulting mutant cell lines using HTM, would enable cell-based experimental analysis of gene function, on a genome-wide scale. This nascent technology can be employed to rapidly (on the order of hours) screen tens of thousands of mutated, stable cell lines covering millions of cells, for one or more mutants having a phenotype that is indicative of a gene that is directly, or indirectly, involved in regulating protein distribution or concentration. Proteins contemplated for use in the practice of the present invention would be those involved in the addition of palmitates to a protein(s) or peptide responsible for the biosynthesis of palmitate, palmitate-CoA. The morphological metric of interest in this case will be the subcellular localization of the reporter, Green Fluorescent Protein (GFP) fused to an S-palmitoylation substrate (SPS); GFP:SPS. Briefly, under normal circumstances GFP:SPS would be localized at the PM. However, if the reporter construct mutates a gene in the signaling pathway for S-palmitoylation, GFP:SPS will relocalize from the PM to the cytoplasm.
Gene- and promoter-trapping are forms of insertional mutagenesis whereby reporter genes and/or selectable markers are randomly and likely stochastically (Chowdhury et al., 1997; Evans, 1998) inserted into the genome of mouse ES cells (reviewed in: Cecconi & Meyer, 2000; Cecconi & Gruss, 2002; Stanford et al., 2001). Traditionally functional genomics studies employing gene-trap have relied on “trapping” the genes in ES cells, generating lines of mice with the mutated ES cell lines, then analyzing the phenotype resulting from the mutation in the whole animal (Evans et al., 1997; Stanford et al., 2001). This approach, while very informative and necessary in many cases (such as the role of the gene in the development of the animal), is labor intensive and expensive. While the methods of the present invention do not seek to circumvent or replace the process of screening mutant phenotypes in mice when it is necessary, instead the invention methods provide:
In accordance with the present invention, HTM (also referred to herein as high-content screening or HCS) is adapted to the functional genomics format, thereby enabling the screening of thousands of cell lines to determine the subcellular localization of the GFP:SPS reporter. HTM is a technology just now coming of age in drug discovery research (Milligan, 2003; Price et al., 2002; Woollacott & Simpson, 2001) in pharmaceutical companies. There is a vast and untapped potential for this technology stemming primarily from the fact that this platform provides the most quantitative mechanism for doing cell biological experiments of virtually any type (Price et al., 2002). Traditionally, the method for determining subcellular localization of proteins has been visual inspection and interpretation of relatively low numbers of microscopic images of cells expressing a GFP-tagged protein or fluorescent-labeled antibodies with specificity to a protein (Giuliano & Taylor, 1998; Zacharias et al., 2000). However, such subjective interpretations may be unconsciously or consciously influenced by investigator bias. Data obtained this way are not always easily confirmed in independent investigations, are not usually amenable to statistical analysis and are labor intensive. A primary motivating factor in the development of HTM has been the perceived need of pharmaceutical companies (Taylor et al., 2001) to investigate rapidly the effects of millions of small molecule compounds on a target molecule within the complex environment of a living cell; an environment where entire, endogenous intracellular signaling pathways are hopefully intact and functional. It has been the hope of the developers of HTM and the pharmaceutical industry that the increased information density that can be generated using multiplexed fluorescent readouts of drug responses would increase the quality of “hits” in a primary (>a million compounds) screen as well as decrease the time required to determine if lead compounds were functioning via the desired or predicted signaling routes. This type of screening method will prove very useful to the pharmaceutical industry, as well as in basic, academic research (Yarrow et al., 2003). HTM will add a badly needed quantitative aspect to cell biology.
Morphometric Analysis with HTM
Existing HTM algorithms are capable of discriminating minute changes in cell morphology or subcellular protein distribution (Conway et al., 2001; Ghosh et al., 2000; Minguez et al., 2002). There is essentially no limitation on the types of morphology that can be used, alone or combinatorially, as the criteria for a unique marker (Boland & Murphy, 2001; Price et al., 1996; Price et al., 2002; Roques & Murphy, 2002). Multiple criteria ranging from the location or concentration of a fluorophore in a cell (Boland et al., 1998; Boland & Murphy, 1999 a,b;2001; Markey et al., 1999; Murphy et al., 2000; Price et al., 2002; Roques & Murphy, 2002) to a physical change in the shape of a cell or a redistribution of cellular contents such as chromosomes, transcription factors (Ding et al., 1998) microtubules (Minguez et al., 2002) membrane protrusions (e.g. neurites, ruffles etc)((Price et al., 1996; Roques & Murphy, 2002)) can be used individually or combined to enhance the sensitivity and accuracy (Boland & Murphy, 2001; Price et al., 1996; Roques & Murphy, 2002). These algorithms provide a flexible and unbiased determination of the existence and degree of interesting, visible change(s) in cells allowing for high-resolution determinations of pharmacological efficacy tied to such a change (Conway et al., 2001; Ding et al., 1998; Ghosh et al., 2000). It is the exquisite sensitivity for subtle change and the objectivity of HTM that make possible the use of cell lines for a wider variety of gene trap screens.
In accordance with the present invention, there are provided cell-based, high-throughput, functional genomics assays that identify genes/enzymes comprising the pathway leading to S-palmitoylation of multiple S-palmitoylation substrates. Based on the above-described assay, one of skill in the art would readily recognize that the use of HTM in cell-based, functional genomics can be expanded in that it will be the basis upon which the development of other changes in cell morphology or fluorescent probe (re)distribution can be applied to gene-trap and functional genomics.
In accordance with the present invention, there is also provided a conceptual and technical basis for developing a drug discovery screen for small-molecule modulators of protein S-palmitoylation.
Insertional mutagenesis, or gene trap, is normally performed in embryonic stem (ES) cells which are subsequently used to create mutant mouse lines in which the resulting phenotypes are analyzed. In accordance with the present invention, there is provided a novel, complementary format for screening interesting mutations in cell lines using fluorescent reporters of protein localization. Also provided are gene trap vectors that enable the discovery of genes involved in regulating protein S-palmitoylation, as well as other signaling pathways of interest. Still further provided in accordance with the present invention is methodology for the functional characterization of mutant cell lines which are characterized employing invention high-throughput methodology.
This invention is further illustrated by the following examples which should not be construed as limiting. The contents of all references, patents and published patent applications cited throughout this application, as well as the sequence listing and the figures, are incorporated herein by reference.
To demonstrate the feasibility of finding S-PATs using a fluorescent reporter of palmitoylation, in combination with gene trapping and HTM, it is useful to establish that palmitoylation is a necessary and sufficient condition to cause the Green Fluorescent Protein: S-Palmitoylation Substrate (GFP:SPS) reporter to be localized to the plasma membrane (PM). Having this ability allows one to separate the role of palmitoylation from the influence of other signals, like protein-protein interactions, that might alter subcellular targeting. Though an example where S-palmitoylation is the exclusive physical property directing subcellular localization may not exist naturally, creating it with the reporters described herein allows one to look specifically and exclusively for modulators of S-palmitoylation of this specific, exemplary substrate.
In order to further demonstrate the feasibility of invention methods, it is also useful to demonstrate that the signaling cascade leading to palmitoylation is intact in haploid Rana ICR-2A cells (a stable cell line created from an androgenetic haploid embryo of the frog Rana pipien ATCC #CCL-145), that a pantropic retrovirus can infect ICR-2A cells and that it is possible to quantitatively analyze images of millions of cells and to determine the subcellular distribution of specific reporters, thereby raising the proposed screen to a high- throughput level that is sufficient to saturate the genome quickly and easily.
There are many cases in which multiple targeting signals within a full-length protein work in concert to determine its final, subcellular location. Often, protein-protein interaction domains such as PDZ domains are found in close, linear and/or structural proximity to palmitoylatable cysteines of the same protein. A prime example of this is PSD-95 (Craven et al., 1999; El-Husseini et al., 2000; Topinka & Bredt, 1998). Isolating the portion of such a protein that is acting as the SPS is an ideal way to create a sensor specific for palmitoylation rather than for the function of the original, full-length protein from which the peptide was borrowed.
The N-terminus of GAP-43 is doubly palmitoylated on two adjacent cysteine residues. When an 18-residue S-palmitoylation substrate peptide from the N-terminus GAP43 is fused to the N-terminus of GFP, this peptide alone, by virtue of its palmitoylation, retains GFP on the PM (see
The fundamental ability to quantitatively recognize GFP localized to the PM has already been achieved by the engineers at Q3DM using methodology described herein. Computer-generated demarcation of the PM and the nucleus are demonstrated in
The palmitoylation sensor, GAP43:GFP, is seen in
ICR-2A cells expressing GAP43:GFP, introduced by pantropic retroviral infection, are illustrated in
The ability to infect ICR-2A cells with retrovirus demonstrates at the most fundamental level that invention methods are applicable to mutagenesis. Likewise, expression of GAP43:GFP has been achieved by electroporating the cells with a CMV-driven expression plasmid containing the CDNA for GAP43:GFP.
This algorithm is able to make a simple binary decision of whether GAP43:GFP is on the PM or in the cytoplasm (see
The second level of sophistication, the fractional localization of the reporter to the PM, provides an unprecedented degree of precision at high throughput that makes it possible to determine, when there has been only a partial loss of function of a gene, as may be the case when one of two alleles of a gene has been mutated in a diploid cell line. Additionally, replication of previous determinations of the residence half-life of palmitate on a protein (see
A schematic of the vectors to be used and their resulting protein fusion products is shown in
The two most common methods used to introduce the mutagenic DNA to the genome are electroporation of plasmid DNA (Chowdhury et al., 1997; Wurst & Joyner, 1993) and by virus-mediated (most commonly retrovirus) infection (Friedrich & Soriano, 1991; Zambrowicz et al., 1998). Each method has advantages and disadvantages but a general consensus is developing that a combination of these two methods is required for complete coverage of the genome (Stanford et al., 2001). The plasmid DNA used for electroporation is based in the promoterless pBluescript (Stratagene) vector and introduced into cells using the BioRad GenePulser. For retroviral infection, the Pantropic Retroviral Expression System (BD Biosciences Clontech), which efficiently infects mammalian and nonmammalian hosts including amphibians (Rana and Xenopus) is used as starting material and modified as illustrated in
The trapping vectors are designed to serve the essential basic purposes required herein:
The first set of experiments utilize gene-trap vectors that include a SA site fused to the 5′ end of the reporter construct (see
It is possible that the cDNA encoding the reporters could be physically fractured during the integration event (Voss et al., 1998) giving rise to the possibility that one will be integrated independently from the others. For example it could be the case that GFP becomes separated from the SPS resulting in a completely cytoplasmic pattern for fluorescence localization, or in other words, a false-positive result indicating integration into a gene necessary for S-palmitoylation. For this reason, all clonal lines reporting a positive result are further surveyed for expression of an un-fragmented reporter, GFP:SPS. Reverse transcription-PCR and/or PCR of genomic DNA using primers to the 5′ end of GFP and the 3′ end of the SPS are efficient ways to determine the integrity of the integrated reporter.
S-palmitoylation substrates are cloned from whole mouse brain mRNA by RT-PCR. Fusions of the SPSs to GFP can be done by PCR and confirmed by DNA sequencing. Analysis of and selection for the appropriate expression pattern as well as the transfection/infection efficiencies of various permutations of the constructs are typically made in small scale transfection experiments prior to running a full-scale screen.
The choice of which SPSs to use, and how they are fused to GFP, is based on the tolerance of both GFP and SPSs for N- and C-terminal fusions. The fluorescence properties of GFP are unlikely to be perturbed by fusions to either terminus. In the case of the trapping vectors to be used (
A significant fraction GFP:Yck2p fluorescence is localized to the PM as is the case for GAP-43 (see
In the event that a synthetic substrate for S-palmitoylation derived from a full-length protein does not localize GFP strongly enough to the PM, additional residues from the parent protein can be included in the fusion in an attempt to restore a preferred environment or context for full S-palmitoylation. Alternatively SPS peptides borrowed from the N- or C-termini of other proteins (e.g. GRK6 (Stoffel et al., 1994) without the PDZ-binding domain, V2R(Sadeghi et al., 1997), bovine rhodopsin (Ovchinnikov Yu et al., 1988) etc.) can be fused to GFP, evaluated for subcellular expression pattern and substituted when appropriate.
S-palmitoylation is a highly conserved function, so in that respect, most every cell type and cell line could be used.
Cell lines generated by insertional mutagenesis are screened using HTM, for 1) fluorescence, indicating that the vector has integrated into the genome and is expressed and for 2) subcellular localization. If GFP:SPS is expressed solely on the PM then the disrupted gene was presumably not essential for palmitoylation of that specific SPS. Potential outcomes for such experiments are depicted in
One of the primary advantages of virus-mediated gene infection is that it integrates only a single copy per cell genome. This advantage, while making it much easier to identify the mutated gene, virtually ensures that only one of two potential alleles of a gene will be hit in an “ideal diploid” cell line. It is possible that eliminating one of two S-PAT alleles in an ideal diploid cell will be insufficient to cause total redistribution of the GFP:SPS from the PM to the cytoplasm. However using the exquisite sensitivity provided by HTM will increase the likelihood of detecting any small changes should they occur. Similarly, electroporation of reporter plasmid DNA into cells can be controlled to reduce the likelihood of multiple integrations, but this also reduces the already-slim chances of randomly hitting both alleles of a gene within the same cell line. Most cell lines have variable numbers of chromosomes, often not resembling the normal diploid state of the organism from which it was derived. Additional copies of chromosomes increases the potential copy number of particular genes of interest thereby potentially decreasing the likelihood of generating a recognizable mutant phenotype (
Genes that when mutagenized will inhibit the process of S-palmitoylation are identified by the invention methods. It is also likely that many rounds of mutagenesis will be necessary to find candidate clonal lines as well as to understand the degree to which the genome has been saturated. 5′- and 3′-RACE methods (Frohman et al., 1988; Zambrowicz et al., 1997) can be used to identify trapped genes in cell lines that have the morphological hallmark of a mutagenized gene critical for S-palmitoylation. The vectors have been carefully designed so that unique sequence in SA, SD and GFP:SPS can be used in combination with universal primers and adaptors and protocols that are standard in the lab to rapidly and efficiently identify the locus of integration of the mutagenic reporters. Identification of the mutated gene by sequence analysis allows one to predict, in most cases, a possible function for the gene. However, further analysis may be desirable to understand the role of the identified genes in the pathway leading to S-palmitoylation. It is likely that genes involved in the synthesis of required precursors, as well as S-PATs, will be trapped. The background, false-positive clones are expected to outnumber the clones in which S-PATS are trapped with a significant number of the false positives occurring due to fragmentation of the reporter construct. As mentioned above, it is possible that the reporters could be physically fractured during the integration event (Voss et al., 1998) giving rise to independent, fractional integrations. Therefore, all clonal lines reporting a positive result are surveyed for expression of an un-fragmented reporter, GFP:SPS. Reverse transcription-PCR and/or PCR of genomic DNA using primers to the 5′ end of GFP and the 3′ end of the SPS are efficient ways to determine the integrity of the integrated reporter. This type of prescreening quickly reduces the potential positive clones to a workable number. The number of critical genes identified is expected to be in the tens, not hundreds or thousands. Among these, the most interesting will be singled out for more extensive characterization.
The polyA-trap-style of gene trap vector does not trap genes without introns. However, this smaller genomic space can be explored using promoter-trap vectors either as a supplement to the information that we gathered using polyA traps or as a backup in case we don't find critical genes searching with the polyA traps. While this genomic space is smaller, it is possible that the family(s) of genes responsible for S-palmitoylation could all fall into the intronless category.
After identifying a candidate, “critical”, trapped gene, a “rescue” of the mutant phenotype (i.e., GFP:SPS in the cytoplasm) is carried out by re-expressing the wild-type version of the mutated gene in the mutant cell line. This requires cloning the full-length cDNA of the mutated gene, putting it into a suitable expression vector such as pcDNA3 (Invitrogen) and reintroducing the gene into the mutant cell line. Successful rescue of the mutant phenotype, as observed by the GFP:SPS relocalizing back onto the PM, provides additional functional information supporting the identity of a gene involved in S-palmitoylation of the substrate used in the initial screen. An inability to rescue the mutant phenotype suggests that the full extent of mutagenic integrations was not properly characterized, at which point different mutagenized clones would be sought.
Cross-Substrate Specificity Tests
It is possible that there are many S-PATs, each with unique substrate specificity. This hypothesis can be tested by expressing different, contextually unique S-palmitoylation substrates fused to another color of fluorescent protein (e.g., monomeric red fluorescent protein (mRFP) (Campbell et al., 2002)) in clonal cell lines containing putative or known S-PAT mutations. The red fluorescence of mRFP is easy to separate spectrally from GFP and gives simultaneous readouts of the subcellular localization of the two SPSs. Determining the degree of overlap among the other SPSs, and the original GFP:SPS provides important information aiding the prediction of the numbers of PATs and their specificity. Representatives from three of the four classes of SPSs as described in (Linder & Deschenes, 2003) can also be tested. Specifically, the C-terminal tail of CD151 (residues 236-end)(Yang et al., 2002); the c-terminus of Rho (Zacharias et al., 2002); The N-terminus of GAP-43 (Zacharias et al., 2002). The fourth class is mitochondrial proteins. The experiments described herein are not designed to determine the palmitoylation state of proteins localized to the mitochondria.
Increasing the Morphological Homogeneity of the Reporter Cell Line
A potentially useful alternative approach that can be explored if necessary is to create a stable cell line constitutively expressing the GFP:SPS of choice and then performing gene-trapping on this cell line using G-418 resistance as the only marker for selection of mutagenic integrations. The phenotypic marker for integration of the construct into a gene relevant to S-palmitoylation would remain redistribution of the fluorescent marker from the PM to the cytoplasm. While this approach does not utilize the convenience of fluorescence as a marker for integration, preselecting a line stably, efficiently and correctly expressing the GFP:SPS could increase the ability to distinguish mutagenic events that are truly disruptive of proper S-palmitoylation.
Identification of the trapped genes by sequence analysis and database homology searches, combined with the direct tests of functionality described above allows one to reconstruct, at least in part, the genes that are required to induce S-palmitoylation of a protein. The identities of the genes make it possible to place them in a logical sequence that leads to S-palmitoylation.
In sum the preliminary data presented herein demonstrate:
The invention has been described in detail with reference to preferred embodiments thereof However, it will be appreciated that those skilled in the art, upon consideration of this disclosure, may make modification and improvements within the spirit and scope of the invention as set forth in the following claims.
This application claims benefit of provisional patent application no. 60/548,509, filed Feb. 6, 2004, the entire contents of which are hereby incorporated by reference herein.
| Filing Document | Filing Date | Country | Kind | 371c Date |
|---|---|---|---|---|
| PCT/US05/06309 | 2/25/2005 | WO | 00 | 7/6/2007 |
| Number | Date | Country | |
|---|---|---|---|
| 60548509 | Feb 2004 | US |