The present invention relates to a method for generating a network of direct and indirect interaction partners of a disease-related (poly)peptide comprising the steps of (a) contacting a selection of (poly)peptides suspected to contain one or several of said direct or indirect interaction partners with said disease-related (poly)peptides and optionally with known direct or indirect interaction partners of said disease-related (poly)peptide under conditions that allow the interaction between interaction partners to occur; (b) detecting (poly)peptides that interact with said disease-related (poly)peptide or with said known direct or indirect interaction partners of said disease-related (poly)peptide; (c) contacting (poly)peptides detected in step (b) with a selection of (poly)peptides suspected to contain one or several (poly)peptides interacting with said (poly)peptides detected in step (b) under conditions that allow the interaction between interaction partners to occur; (d) detecting proteins that interact with said (poly)peptides detected in step (b); (e) contacting said disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide, said (poly)peptides detected in steps (b) and (d) and a selection of proteins suspected to contain one or several (poly)peptides interacting with any of the afore mentioned (poly)peptides under conditions that allow the interaction between interaction partners to occur; (f) detecting (poly)peptides that interact with said disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide or with said (poly)peptides identified in step (b) or (d); and (g) generating a (poly)peptide—(poly)peptide interaction network of said disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide and said (poly)peptides identified in steps (b), (d) and (f). Moreover, the present invention relates to a protein complex comprising at least two proteins and to methods for identifying compounds interfering with an interaction of said proteins. Finally, the present invention relates to a pharmaceutical composition and to the use of compounds identified by the present invention for the preparation of a pharmaceutical composition for the treatment of Huntington's disease.
Several documents are cited throughout the text of this specification. The disclosure content of the documents cited herein (including any manufacture's specifications, instructions, etc.) is herewith incorporated by reference. The present invention is based on scientific experiments which have been performed on biological specimen derived from diseased patients. Patients have given their consent to use the specimen for the study which is disclosed in the present invention. In case of deceased patients, the consent has been given by a relative.
With the identification of >35.000 genes in the human genome the challenge arises to assign biological function to all proteins and to link these proteins to physiological pathways and disease processes. Since protein-protein interactions play a role in most events in a cell, clues to the function of an unknown protein can be obtained by investigating its interaction with other proteins whose function are already known. Thus, if the function of one protein is known, the function of the binding parners can be infered (deduced). This allows the researcher to assign a biological function to uncharacterized proteins by identifying protein-protein interactions. For example, several so far uncharacterized proteins in Caenorhabditis elegans were identified in a yeast two-hybrid screen for eukaryotic 26S proteasome interacting proteins and thereby could be linked to the ubiquitin-proteasome proteolytic pathway (Vidal et al., 2001). Elucidation of protein-protein interactions is particularly desired when it comes to the generation of new drugs. For many diseases, the available drug portfolio is insufficient or inappropriate to provide a cure or to prevent onset of the disease. One such disease is Huntington's disease.
Huntington's disease (HD) is a neurodegenerative disorder caused by an expanded polyglutamine (polyQ) tract in the multidomain protein huntingtin (htt). The elongated polyQ sequence is believed to confer a toxic gain of function to htt. It leads to htt aggregation primarily in neurons of the striatum and cortex and subsequently to the appearance of the disease phenotype. However, there is experimental evidence that loss of htt function may also contribute to HD pathogenesis. Since huntingtin aggregation correlates with disease progression, it is crucial to develop methods for identifying factors that promote or inhibit aggregation of huntingtin.
Previously, a number of single interaction partners of huntingtin had been reported. In light of these reports, it is tempting to speculate that huntingtin is bound into a larger network of interacting partners, many of which might be capable of modulating huntingtin's activity and function by direct or indirect interaction. It is likely that an aberrant interaction of huntingtin with some of the members of said network will impair huntingtin's normal function. Moreover, this interaction might also be relevant for the conformation of huntingtin or for its solubility or state of aggregation. Interfering with the direct or indirect interactions of the protein-protein interaction network will provide an excellent basis for therapeutic intervention as it will allow to modulate huntingtin's activity or state of aggregation or both. The state of the art so far did not provide compounds capable of reducing or suppressing huntingtin aggregation since the factors promoting or suppressing huntingtin aggregation were not known.
Thus, the technical problem underlying the present invention was to provide novel approaches for identifying direct or indirect interaction partners of disease-related proteins, which must be seen as new targets for drug development. The solution to this technical problem is achieved by providing the embodiments characterized in the claims.
Accordingly, the present invention relates to a method for generating a network of direct and indirect interaction partners of a disease-related (poly)peptide comprising the steps of (a) contacting a selection of (poly)peptides suspected to contain one or several of said direct or indirect interaction partners with said disease-related (poly)peptides and optionally with known direct or indirect interaction partners of said disease-related (poly)peptide under conditions that allow the interaction between interaction partners to occur; (b) detecting (poly)peptides that interact with said disease-related (poly)peptide or with said known direct or indirect interaction partners of said disease-related (poly)peptide; (c) contacting (poly)peptides detected in step (b) with a selection of (poly)peptides suspected to contain one or several (poly)peptides interacting with said (poly)peptides detected in step (b) under conditions that allow the interaction between interaction partners to occur; (d) detecting proteins that interact with said (poly)peptides detected in step (b); (e) contacting said disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide, said (poly)peptides detected in steps (b) and (d) and a selection of proteins suspected to contain one or several (poly)peptides interacting with any of the afore mentioned (poly)peptides under conditions that allow the interaction between interaction partners to occur; (f) detecting (poly)peptides that interact with said disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide or with said (poly)peptides identified in step (b) or (d); and (g) generating a (poly)peptide-(poly)peptide interaction network of said disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide and said (poly)peptides identified in steps (b), (d) and (f).
In accordance with the present invention, the term “direct and indirect interaction partners” relates to (poly)peptides that either directly interact with the disease-related (poly)peptide (direct interaction) or that interact via a protein binding to/interacting with said disease-related (poly)peptide. In the letter case, there is no direct contact between the direct interaction partner and the disease-related protein. Rather, a further protein forms a “bridge” between these two proteins.
The term “known direct or indirect interaction partners” refers to the fact that for certain disease-related (poly)peptides, such interaction partners are known in the art. If such interaction partners are known in the art, it is advantageous to include them into the method of the invention. If no such interactions partners are known in the art, then the network may be generated starting solely from the known disease-related (poly)peptide.
The term “conditions that allow the interaction between interaction partners to occur” relates to conditions that would, as a rule, resemble physiological conditions. Conditions that allow protein actions are well known in the art and, can be taken, for example from Golemis, E. A. Ed., Protein-Protein Interactions, Cold Spring Harbor Laboratory Press, 2002.
The term “suspected to contain one or more of said direct or indirect interaction partners” relates to the fact that normally, a selection of (poly)peptides would be employed where the person skilled in the art would expect that interaction partners are present. Examples of such selections of (poly)peptides are libraries of human origin such as cDNA libraries or genomic libraries.
The term “detecting proteins” refers to the fact that the (poly)peptides interacting with the “bait” (poly)peptides are identified within the selection of (poly)peptides. A further characterization or isolation of the “prey” (poly)peptides at this stage may be advantageous but is not necessary. The term “detecting (poly)peptides” preferably also comprises characterizing said (poly)peptides or the nucleic acid molecules encoding said (poly)peptides. The skilled person knows that this can be done by a number of techniques, some of which are described for example in Sambrook et al., “Molecular Cloning, A Laboratory Manual”; CSH Press, Cold Spring Harbor, 1989 or Higgins and Hames (eds.). For example, the nucleotide sequence may be determined by DNA Sequencing, including PCR-Sequencing (see for example Mullis K, Faloona F, Scharf S, Saiki R, Horn G, Erlich H., Cold Spring Harb Symp Quant Biol. 1986; 51 Pt 1:263-73). Alternatively, the amino acid sequence of said (poly)peptide may be determined. The skilled artesian knows various methods for sequencing proteins which include the method of Edman degradation, which is a preferred method of the present invention of determining the amino acid sequence of a protein. However, the amino acid sequence of a protein or (poly)peptide can also be reliably determined by methods such as for example Maldi-Tof, optionally in combination with the method of Edman degradation. The interaction partner may be identified either as fusion with a DNA binding domain or as fusion with an activation domain. Preferably, if an interaction partner has been identified as a fusion molecule comprising a DNA binding domain, the interaction partner is cloned into a vector allowing the expression of the interaction partner as a fusion with an activation domain. Consequently, protein interaction can be tested in the context the DNA activation or the DNA binding domain.
In accordance with the present invention, the first round of detecting (poly)peptides that interact with the “bait” (poly)peptides recited in step (a) wherein the detected (poly)peptides be considered as “prey” (poly)peptides is followed by the second round of detecting further interacting (poly)peptides wherein the former “prey” (poly)peptides are now used as “bait” (poly)peptides. In certain preferred embodiments of the present invention such as in a two-hybrid detection system, a re-cloning of the former “prey” (poly)peptides into vectors that are suitable for expressing “bait” (poly)peptides may be desired.
Accordingly, the invention describes a novel strategy to identify protein-protein interaction networks for human disease proteins. This strategy was applied to detect pair-wise protein-protein interactions for Huntington's disease and is useful for other hereditary diseases as well. Several human hereditary diseases are summarized in table 5.
A crucial step of the method of the invention is step (e). Here, the disease-related (poly)peptide and optionally said known direct or indirect interaction partners of said disease-related (poly)peptide are contacted under appropriate conditions, preferably at the same time, with both the (poly)peptides identified in steps (b) and (d) and further with a selection of (poly)peptides suspected to contain further interaction partners. Alternatively, the various baits, preys and further selection partners are added one after another, so that the final pool contains all baits and preys so far identified, in addition to the further selection partners. In other terms, in this step of the method of the invention, all “baits” and all “preys” are pooled and, additionally, further potential interaction partners are added. In this way, surprisingly the number of directed or indirect interactions partners of the previously identified “baits” and “preys” could significantly be enhanced. It is to be understood that various preys identified in one detection step may interact with each other and not only with the baits that were employed for the identification. For example, if a collection of baits detects prays “a” and “b”, the invention does not exclude that “a” also interacts with “b”. The same holds true mutatis mutandis for the baits used in accordance with the present invention. Wherever possible, baits and preys are exchangeable in the sense that bait (poly)peptides may be used as preys and vice versa. In a given case, however, the skilled person has to determine whether or not this exchange is possible on the basis of unfavourable site effects and limitations of the applied scientific approach. This can be done by the skilled person without undue burden by applying standard techniques known in the art.
It is further preferred in accordance with the present invention that the interaction of proteins is a specific interaction, such as a specific binding. This means that the (poly)peptide being an interaction partner with a further (poly)peptide only or essentially only interacts with the interaction site(s) involved with this interaction partner. This does not exclude, of course, that further interaction sites of said (poly)peptide interact with further interaction partners, wherein in the corresponding interaction is preferably also specific. The concept also embraces that, if a (poly)peptide has several identical interaction sites, which in nature bind to different interaction partners, these different interaction partners are also bound by the (poly)peptide in the method of the present invention.
In other terms, at least in the case of huntingtin, the number of interaction partners found in step (e) was enhanced in an exponential rather than in a linear fashion.
The term “(poly)peptide” refers alternatively to peptide or to (poly)peptides. Peptides conventionally are covalently linked amino acids of up to 30 residues, whereas polypeptides (also referred to as “proteins”) comprise 31 and more amino acid residues.
The term “huntingtin” refers to a protein with the data bank accession number P42858 which is referenced for the purpose of the present invention as “wild-type huntingtin protein”. However, the term “huntingtin” also comprises proteins encoded by the nucleic acid sequence deposited under accession number L12392 or to proteins encoded by nucleic acid molecules which hybridize to the nucleic acid molecule of L12392 under stringent conditions of hybridization. The present invention relates to all variants of the huntingtin protein. In particular, relevant for the present invention are those variants of huntingtin which comprise a polyglutamine tract (polyQ tract) or an elongated polyQ tract. A polyQ tract consists of two or more glutamines within the huntingtin protein. The insertion of additional glutamine codons will result in huntingtin proteins with, for example 2, 51, 75 or 100 added glutamines in comparison to the sequence deposited under accession number P42858. In fact, the person skilled in the art knows that the huntingtin protein may have a glutamine tract with any random number of glutamines in the range of 1 to 200 added glutamines. All these proteins are comprised by the present invention.
The term “hybridizes under stringent conditions”, as used in the description of the present invention, is well known to the skilled artisian and corresponds to conditions of high stringency. Appropriate stringent hybridization conditions for each sequence may be established by a person skilled in the art on well-known parameters such as temperature, composition of the nucleic acid molecules, salt conditions etc.; see, for example, Sambrook et al., “Molecular Cloning, A Laboratory Manual”; CSH Press, Cold Spring Harbor, 1989 or Higgins and Hames (eds.), “Nucleic acid hybridization, a practical approach”, IRL Press, Oxford 1985, see in particular the chapter “Hybridization Strategy” by Britten & Davidson, 3 to 15. Stringent hybridization conditions are, for example, conditions comprising overnight incubation at 42° C. in a solution comprising: 50% formamide, 5×SSC (750 mM NaCl, 75 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5× Denhardt's solution, 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1×SSC at about 650. Other stringent hybridization conditions are for example 0.2×SSC (0.03 M NaCl, 0.003M Natriumcitrat, pH 7) bei 65° C. In addition, to achieve even lower stringency, washes performed following stringent hybridization can be done at higher salt concentrations (e.g. 5×SSC). Note that variations in the above conditions may be accomplished through the inclusion and/or substitution of alternate blocking reagents used to suppress background in hybridization experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and commercially available proprietary formulations. The inclusion of specific blocking reagents may require modification of the hybridization conditions described above, due to problems with compatibility.
The skilled person knows that the presence of additional codons in the nucleic acid sequence of huntingtin might significantly reduce the capability of this nucleic acid molecule to hybridize to the nucleic acid molecule deposited under L12392 and referenced as wild-type huntingtin protein. Nevertheless, such proteins shall still be comprised by the present invention. In fact, computer programs such as the computer program Bestfit (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711) or blast, capable of calculating homologies between two nucleic acid sequences, efficiently recognize nucleotide insertions and allow for an adjustment of gaps created by these insertions. The term “huntingtin” as used in the present invention, also includes those molecules of huntingtin, which have a homology of more than 95% to wild-type huntingtin when analyzed with a program like bestfit under conditions not weighing gaps created by polyQ tracts (gap penalty=0).
The term “contacting” means bringing into contact so that two or more proteins or (poly)peptides can interact with each other, preferably under physiological conditions. The terms “interacting” or “binding” refer to a transient or permanent contact between two proteins or (poly)peptides. Preferably, the (poly)peptide or protein is provided by expression from a nucleic acid molecule, more preferably from a cDNA molecule within a cDNA library. Alternatively, said nucleic acid molecule is a genomic nucleic acid molecule of a genomic DNA library, or a nucleic acid molecule from a synthetic DNA or RNA library. Preferably, the nucleic acid molecule encoding the disease-related protein or its interaction partner is obtainable from nerve cells, brain tissue human adrenal gland, human bladder, human bone, human brain, human colon, human dorsal root ganglion, human heart, human HeLa cells, human kidney, human liver, human lung, human mammary gland, human ovary, human pancreas, human placenta, human prostate, human retina, human salivary gland, human sceletal muscle, human small intestine, human smooth muscle, human spinal cord, human spleen, human stomach, human testis, human thymus, human thyroid, human tonsil, human trachea, human uterus, human cell line HEP G2, human cell line MDA 435, human fetal brain, human fetal heart, human fetal kidney, human fetal liver, human fetal spleen, human fetal thymus, human breast tumor, human cervix tumor, human colon tumor, human kidney tumor, human lung tumor, human ovary tumor, human stomach tumor, human brain tumor and/or human uterus tumor.
The term “disease-related protein” refers to a protein known to be the causative agent of a disease or known to be involved in onset or progression of a disease. Preferably, said disease is CHOREA HUNTINGTON or the disease-related protein is huntingtin. More preferably, the disease-related protein is selected from table 6 and/or 7. The term “conditions that allow the interaction between interaction partners” means conditions that are similar to physiological conditions. Preferably, said conditions are physiological conditions.
The term “selection of (poly)peptides” refers to a library of (poly)peptides, which comprises the above-mentioned libraries, but also includes libraries such as phage display libraries. Preferably, the (poly)peptide is provided by expression from a nucleic acid molecule. Preferably, the protein or (poly)peptide expressed by said nucleic acid molecule is a (poly)peptide comprising a DNA binding domain (DBD) (in this case the fusion protein is termed “bait”) or (b) a (poly)peptide comprising an activation domain capable of interacting with a transcription factor or an RNA polymerase and capable of activating transcription of a reporter or indicator gene (in this case the fusion protein is called “prey”). As used here, the terms “reporter gene” and “indicator gene” are to be understood as synonyms. It is important to note that one of the interaction partners will always comprise the amino acid sequence of a protein or (poly)peptide translated from said nucleic acid molecule while the other interaction partner will comprise the amino acid sequence of a protein or protein fragment. Preferably, a bait used for a method of the present invention is selected from the proteins listed in table 6 and/or 7. If, for example, the proteins encoded by the nucleic acid molecules contain a DNA binding domain fused in frame, the fusion protein can bind to the DNA recognition sequence of the DNA binding domain. Interaction of said fusion protein with a second fusion protein containing an activation domain can induce transcription of a nearby indicator gene. The indicator gene may encode a selection marker such as a protein that confers resistance to an antibiotic including ampicillin, kanamycin, chloramphenicol, tetracyclin, hygromycin, neomycin or methotrexate. Further examples of antibiotics are Penicillins: Ampicillin HCl, Ampicillin Na, Amoxycillin Na, Carbenicillin disodium, Penicillin G, Cephalosporins, Cefotaxim Na, Cefalexin HCl, Vancomycin, Cycloserine. Other examples include Bacteriostatic Inhibitors such as: Chloramphenicol, Erythromycin, Lincomycin, Tetracyclin, Spectinomycin sulfate, Clindamycin HCl, Chlortetracycline HCl. Additional examples are proteins that allow selection with Bacteriosidal inhibitors such as those affecting protein synthesis irreversibly causing cell death. Aminoglycosides can be inactivated by enzymes such as NPT II which phosphorylates 3′-OH present on kanamycin, thus inactivating this antibiotic. Some aminoglycoside modifying enzymes acetylate the compound and block their entry in to the cell. Gentamycin, Hygromycin B, Kanamycin, Neomycin, Streptomycin, G418, Tobramycin Nucleic Acid Metabolism Inhibitors, Rifampicin, Mitomycin C, Nalidixic acid, Doxorubicin HCl, 5-Flurouracil, 6-Mercaptopurine, Antimetabolites, Miconazole, Trimethoprim, Methotrexate, Metronidazole, Sulfametoxazole. Alternatively, said indicator gene may encode a protein such as lacZ, GFP or luciferase, the expression of which can be monitored by detection of a specific color. Other proteins commonly used as indicator proteins are beta-galactosidase, beta-glucuronidase, green fluorescent protein (GFP), autofluorescent proteins, including blue fluorescent protein (BFP), glutathione-5-transferase (GST), luciferase, horseradish peroxidase (HRP), and chloramphenicol acetyltransferase (CAT). In general, however, the selection in the yeast two hybrid-system is based on a deficiency of the yeast strain to produce specific amino acids. The skilled person knows that any amino acid deficiency can be used for this selection strategy.
Preferably said preys and baits are expressed from two separate expression vectors contained in one host cell. The nucleic acid molecule encoding the preys and baits can be introduced into the host cell, for example, by transformation, transfection, transduction or microinjection which are common techniques known to the person skilled in the art and which require no additional explanation. In addition, the nucleic acid molecule contains a chromosomal or episomal nucleic acid sequence encoding the above-mentioned indicator protein. The expression of said indicator protein is under control of a recognition sequence which serves as a binding site for the bait protein. The nucleic acid molecule may be fused either to a DNA binding domain or to an activation domain. Co-expression of only those bait- and prey fusion proteins which are capable of interacting will induce the expression of one of the above-identified indicator proteins and thus allow the identification a nucleic acid molecule encoding a protein capable of interacting with huntingtin or an interaction or binding partner of huntingtin. The skilled person knows this system as the yeast two hybrid system. The yeast two hybrid system, which uses a bait protein-prey protein combination to induce transcription of the reporter gene, is a preferred method to identify proteins capable of interacting with huntingtin or with a direct or indirect interaction or binding partner of huntingtin. See for example Fields and Song, Nature 340:245 (1989) or Uetz et al., 2000 Nature 403(6770): 623-7. This is a useful way of determining protein-protein interactions. Another preferred method uses the yeast three hybrid system, as described in U.S. Pat. No. 5,928,868. Preferably, steps (a) to (d) of the method for generating a network of direct and indirect interaction partners comprise the yeast two hybrid system. Preferably, steps (e) and (f) of the method for generating a network of direct and indirect interaction partners comprise yeast interaction mating. Preferably, said “interaction mating” comprises the interaction of all interaction partners identified in steps (a) to (d). Also preferred is that the interaction partners identified in steps (a) to (d) interact as prey and bait proteins, so that all prey proteins are contacted with all bait proteins. Using the array mating system, each bait is tested individually for interaction with every prey in the array. Alternatively, steps (e) and (f) of the method for generating a network of direct and indirect interaction partners comprise testing all interaction partners identified in steps (a) to (d) in interaction assays such as biacore or coimmunoprecipitation. When performing such an assay, it is preferred that the interaction partners are tested as prey and/or bait fusion proteins or contain no fused (poly)peptides. Preferably, all interaction partners are contacted in the biacore or coimmunoprecipitation assay by themselves and by all other remaining interaction partners identified in steps (a) to (d).
The method for generating a network of direct and indirect interaction partners of a disease related protein or (poly)peptide has proven to be an effective tool for unveiling the protein-protein interactions (PPI) of preferably monogenic diseases. This is exemplified by the analysis of the disease related protein of Chorea Huntington, the analysis of which has demonstrated that the method of the present invention will be useful in an approach to identify potential drugs in the treatment of CHOREA HUNTINGTON. Moreover, this method will also be effective in unveiling the protein-protein interactions of other disease related proteins and in identifying novel targets for treatment of these diseases. Using a preferred combination of library and matrix yeast two-hybrid screens, based on the methods of the present invention, a highly connected network was generated among 70 proteins involved in 117 protein-protein interactions, 99 of which had not been described previously. As progression of Huntington's disease (HD) appears to be linked to huntingtin aggregation, a set of network proteins was tested for their potential to modulate this process. By using the methods of the present invention, it was discovered that the GTPase activating protein GIT1 strongly promotes huntingtin aggregation in vivo. GIT1 also localises to huntingtin aggregates in brains of transgenic mice and HD patients. Therefore, a combination of the methods of the present invention has proven to provide effective means for the identification of potential targets for therapeutic intervention. GIT1 is a selected example of a modulator interaction partner of huntingtin. The other proteins in the network of interaction partners disclosed by the present invention are further modulator interaction partners of huntingtin.
Preferably, the interaction mating comprises using an array maiting system. In general, for this screen, MATα yeast cultures are transformed with plasmids encoding prey proteins and arrayed on a microtiter plate for interaction mating with individual MATa strains expressing bait proteins. Using this test system, each bait can be tested individually for interaction with every prey in the array. Diploid yeast clones, formed by maiting on YPD plates and expressing both, bait and prey proteins, are selected on agar SDII plates, and further transferred for example by a spotting robot on SDIV plates to select for protein-protein interactions. In a more preferred embodiment of the method, plasmids encoding bait and prey proteins are transformed into strains L40 ccua and L40 cca, respectively. L40 cca clones are arrayed on microtitre plates and mixed with a single L40 ccua clone for interaction mating. These cells are transferred, preferably by a robot onto YPD medium plates and, after incubation for 20 h to 28 h at approximately 30° C., for selection of the cells, were transferred onto SDII medium plates, where mating takes place, for additional 60 h to 80 h at approximately 30° C. For two-hybrid selection diploid cells are transferred onto SDIV medium plates with and without nylon or nitrocellulose membranes and incubated for approximately 5 days at about 30° C. The nylon or nitrocellulose membranes are subjected to the β-GAL assay. Positive clones can be verified by cotransformation assays using plasmids encoding respective bait and prey proteins. Other preferred methods for studying protein-protein interactions according to the present invention are colocalization, coimmunoprecipitation, screening of protein or (poly)peptide arrays, library screens, in vivo and in vitro binding experiments using different tags such as HIS6, TAP or FLAG.
In a preferred embodiment of the present invention's method for generating a network of direct and indirect interaction partners of a disease related protein or (poly)peptide, plasmids encoding bait proteins are transformed into a strain such as L40 ccua, tested for the absence of reporter gene activity and co-transformed with a human fetal brain cDNA library. Independent transformants are plated onto minimal medium lacking tryptophan, leucine, histidine and uracil (SDIV medium) and incubated at about 30° C. for 5 to 10 days. Clones are transferred into microtitre plates, optionally using a picking robot, and grown over night in liquid minimal medium lacking tryptophan and leucine (SDII medium). Subsequently, the clones are spotted onto nylon or nitrocellulose membranes placed on SDIV medium plates. After incubation for about 4 days membranes are subjected to a β-galactosidase (β-GAL) assay. Plasmids are prepared from positive clones and characterised, for example by restriction analyses and sequencing. For retransformation assays plasmids encoding bait and prey proteins are cotransformed in the yeast strain L40 ccua and plated onto SDIV medium.
The term “generating a protein-protein interaction (PPI) network” means listing the interactions of all proteins interacting or binding directly or indirectly interacting the disease related (poly)peptide or protein. Preferably, this can be done by displaying the information in a matrix or a network representation. In a more preferred embodiment of the present invention's method, the protein-protein interaction network is generated by using Pivot 1.0 (Prof. Ron Shamir, Prof. Yossi Shilo, Nir Orlev; Tel Aviv University (TAU); Dep. of computer science; Ramat Aviv; Tel Aviv 69978; Israel).
In a preferred embodiment of the invention, interactions are detected by using the yeast two-hybrid system, MALDI-TOF MS or electro spray MS. Preferably, yeast strains such as strains L40 ccua and L40 cca, are transformed with an expression selected from the group consisting of pBTM116, pBTM117, pBTM117c, pACT2, pAS2-1, pGADIO, pGAD424, pGAD425, pGAD426, pGAD427, pGAD428.
In another preferred embodiment of the present invention's method for generating a network of direct and indirect interaction partners of a disease-related polypeptide, the method contains after step (d) the additional steps of isolating a nucleic acid molecule with homology to said nucleic acid molecule expressing the encoded protein and testing it for its activity as a modulator of huntingtin, wherein said nucleic acid molecule is DNA, RNA, cDNA, or genomic DNA. Said testing can be done in several different assays. Preferably, the testing is performed in a co-immunoprecipitation assay or an affinity chromatography-based technique. Generally, co-immunoprecipitation is performed by purifying an interacting protein complex with a single antibody specific for one protein in the protein complex and by detecting the proteins in the protein complex. The step of detection can involve the use of additional antibodies directed against proteins suspected of being trapped in the purified protein complex. Alternatively, at least one protein in the protein complex is fused to a tag sequence with affinity to a compound fixed to a solid matrix. By contacting the solid matrix with said tagged protein, further proteins binding to said protein can be purified and binding can be detected. GST or HA are preferred tags in accordance with the present invention.
In a preferred embodiment of the present invention's method, said contacting step (e) is effected in an interaction mating two hybrid approach.
In another preferred embodiment of the present invention's method, said method comprises after step (d) and before step (e) the steps of: (d′) contacting (poly)peptides detected in step (d) with a selection of (poly)peptides suspected to contain one or several (poly)peptides interacting with said (poly)peptides detected in step (d) under conditions that allow the interaction between interaction partners to occur; and (d″) detecting proteins that interact with said (poly)peptides detected in step (d′).
This preferred embodiment of the invention, an additional step of identifying further interaction partners is carried out prior to the contacting of all “baits” and “preys” in one pool (step (e)). Optionally, further steps of selecting interaction partners in analogy to steps (d′) and (d″) may be infected prior to the pooling/interaction step.
Diseases of particular interest for which interrelationships of disease-related proteins may be analyzed in accordance with the invention are provided in Table 5.
In yet another preferred embodiment of the present invention's method, said disease related protein is a protein suspected of being a causative agent of a hereditary (see Table 5), such as a monogenic disease.
In another preferred embodiment of the present invention's method, said disease related protein is huntingtin and said interaction partners are the interaction partners as shown in table 6,7 and/or 9
In another preferred embodiment of the present invention's method, said method comprises the step of determining the nucleotide sequence of a nucleic acid molecule encoding a direct or indirect interaction partner of the disease related protein.
In another preferred embodiment of the present invention's method, said selections of proteins are translated from a nucleic acid library.
In, another preferred embodiment of the present invention's method, said selection of proteins in step (a) and/or (c) and/or (d′) and/or (e) is the same selection or a selection from the same source. In another preferred embodiment of the present invention's method, said selection of proteins in step (a) and/or (c) and/or (d′) and/or (e) is a different selection or a selection from a different source.
Preferably, said source is selected from nerve cells, brain tissue, human adrenal gland, human bladder, human bone, human brain, human colon, human dorsal root ganglion, human heart, human HeLa cells, human kidney, human liver, human lung, human mammary gland, human ovary, human pancreas, human placenta, human prostate, human retina, human salivary gland, human sceletal muscle, human small intestine, human smooth muscle, human spinal cord, human spleen, human stomach, human testis, human thymus, human thyroid, human tonsil, human trachea, human uterus, human cell line HEP G2, human cell line MDA 435, human fetal brain, human fetal heart, human fetal kidney, human fetal liver, human fetal spleen, human fetal thymus, human breast tumor, human cervix tumor, human colon tumor, human kidney tumor, human lung tumor, human ovary tumor, human stomach tumor, human brain tumor and/or human uterus tumor.
In another preferred embodiment of the present invention's method, said method is performed by contacting the proteins on an array. Preferably, said array is an array allowing to detect protein-protein interaction by the principle of a biacore detector.
In another preferred embodiment of the present invention's method, said interactions are detected by using the yeast two-hybrid system. Preferably, said inteactions detected by using MALDI-TOF, MS, electro spray MS or biacore.
In another preferred embodiment of the present invention's method, said method contains after step of (b), (d), (d″) or (f) the additional steps of isolating a nucleic acid molecule with homology to said cDNA expressing the encoded protein and testing it for its activity as a modulator of huntingtin, wherein said nucleic acid molecule is DNA, or RNA, and preferably cDNA, or genomic or synthetic DNA, or mRNA.
By using the methods disclosed herein, a rate of success or fidelity of at least 70% validatable protein-protein interactions (PPI) (of proteins within the protein interaction network of huntingtin) can be achieved. This level of consistency is well above the level described in the art. In order to increase the rate of success or fidelity, the skilled person can, when carrying out the methods of the present invention, combine the methods of the present invention with additional steps of testing. For example, a step of co-immunoprecipitation and/or an in vitro binding assay may be carried out, in cases when initially the interaction was determined by using the yeast-two-hybrid system (or vice versa). Such additional steps may be carried out at any stage of the methods of the present invention. For example, after but also prior to step (f) of the method of the present invention, PPIs may be verified using in-vitro binding and/or immunoprecipatation assays in order to increase the stringency of the method. By performing these additional steps of testing, the skilled person can increase the rate of success or fidelity to at least 50%, more preferably to at least 60%. For the additional validation, any method may be employed that is available to the skilled artisan for testing the protein interaction. For example, the skilled artisan may simply repeat the step(s) initially carried out, optionally by (slightly) altering the reaction conditions, preferably to more stringent reaction conditions, i.e. conditions that could be expected to further reduce the number of false positive interactions. Alternatively, a different method may be carried out in the validation process. For example, if the method of the invention employed two hybrid systems, the validation might be carried out by precipitation steps as outlined elsewhere in the specification. Whereas the method of the invention provides valid results without the additional validation step(s), the inclusion of such additional validation steps may be advantageous for certain purposes, e.g. drug target identification. In the case that a first validation step does not confirm that the protein in question is a member of the interaction network, further steps in this regard should be carried out. For example, it should be excluded that the validation step(s) do/does not catch weak protein interactions that nevertheless are part of the network. The present invention also relates to a nucleic acid molecule encoding a modulator of huntingtin, wherein said modulator is a protein selected from table 8.
The term “modulator protein”, as used in the present invention, refers to a protein capable of modulating the function or physical state of a second protein and comprises proteins that enhance or reduce (inhibit) the function or activity of huntingtin. Preferably, the modulator protein is a protein having an activity selected from the group consisting of oxidoreductase activity (acting on the CH—OH group of donors, acting on the aldehyde or oxo group of donors, acting on the CH—CH group of donors, acting on the CH—NH(2) group of donors, acting on the CH—NH group of donors, acting on NADH or NADPH, acting on other nitrogenous compounds as donors, acting on a sulfur group of donors, acting on a heme group of donors, acting on diphenols and related substances as donors, acting on a peroxide as acceptor, acting on hydrogen as donor, acting on single donors with incorporation of molecular oxygen, acting on the CH—OH group of donors, acting on superoxide as acceptor, oxidizing metal ions, acting on —CH(2) groups, acting on iron-sulfur proteins as donors, acting on reduced flavodoxin as donor, acting on phosphorus or arsenic in donors, acting on x-H and y-H to form an x-y bond, other oxidoreductases), transferase activity (transferring one-carbon groups, transferring aldehyde or ketone residues, acyltransferases, glycosyltransferases, transferring alkyl or aryl groups, other than methyl groups, transferring nitrogenous groups, transferring phosphorous-containing groups, transferring sulfur-containing groups, transferring selenium-containing groups), hydrolase activity (glycosylase activity, acting on ether bonds, acting on peptide bonds, acting on carbon-nitrogen bonds (other than peptide bonds), acting on acid anhydrides, acting on carbon-carbon bonds, acting on halide bonds, acting on phosphorus-nitrogen bonds, acting on sulfur-nitrogen bonds, acting on carbon-phosphorus bonds, acting on sulfur-nitrogen bonds, acting on carbon-phosphorus bonds, acting on sulfur-sulfur bonds, acting on carbon-sulfur bonds, lyases (carbon-carbon lyases, carbon-oxygen lyases, carbon-nitrogen lyases, carbon-sulfur lyases, carbon-halide lyases, phosphorus-oxygen lyases, other lyases), isomerases (racemases and epimerases, cis-trans-isomerases, intramolecular oxidoreductases, intramolecular transferases, intramolecular lyases, other isomerases), ligases activity (forming carbon-oxygen bonds, forming carbon-sulfur bonds, forming carbon-nitrogen bonds, forming carbon-carbon bonds, forming phosphoric ester bonds), transcription factor activity, filament protein, membrane protein and structural protein.
In a preferred embodiment, the present invention's nucleic acid molecule is DNA, or RNA, and preferably cDNA, or genomic DNA or synthetic DNA or mRNA
In another preferred embodiment of the invention, the nucleic acid molecule is double stranded or single stranded.
In another preferred embodiment of the invention, the nucleic acid molecule is of vertebrate, nematode, insect, bakterium or yeast. Preferably, the nematode is Caenorhabditis elegans. In another more preferred embodiment of the present invention, the insect is drosophila, preferably drosiphila melanogaster. In another more preferred embodiment of the present invention, the vertebrate is human, mouse rat, Xenopus laevis, zebrafish.
In yet another preferred embodiment of the present invention, the nucleic acid molecule is fused to a heterologous nucleic acid molecule. In a further preferred embodiment of the present invention, the heterologous (poly)peptide encoded by said heterlogous nucleic acid molecule is an immunoglobulin Fc domain.
In another preferred embodiment of the present invention the nucleic acid molecule is labeled. Labeled nucleic acid molecules may be useful for purification or detection. Suitable labels include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2′,7′-dimethoxy-4′,5′-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine(ROX), 6-carboxy-2′,4′,7′,4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA), radioactive labels, e.g. 32P, 35S, 3H; etc. The label may also be a two stage system, where the DNA is conjugated to biotin, haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner is conjugated to a detectable label. In the case of amplification the label may be conjugated to one or both of the primers. The pool of nucleotides used in the amplification may also be labeled, so as to incorporate the label into the amplification product. Alternatively, the double strand formed after hybridization can be detected by anti-double strand DNA specific antibodies or aptamers etc.
In a more preferred embodiment said heterologous nucleic acid molecule encodes a heterologous polypeptide. Preferably said heterologous (poly)peptide, fused to the (poly)peptide encoded by the nucleic acid molecule of the present invention, is a DNA binding protein selected from the group consisting of GAL4 (DBP) and LexA (DBP). Also preferred in accordance with the present invention are activation domains selected from the group consisting of GAL4(AD) and VP16(AD). Also, preferred are (poly)peptides selected from the group consisting of GST, His Tag, Flag Tag, Tap Tag, HA Tag and Protein A Tag.
Thus, the sequence encoding the (poly)peptide may be fused to a marker sequence, such as a sequence encoding a peptide which facilitates purification of the fused (poly)peptide. In certain preferred embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, Calif., 91311), among others, many of which are commercially available. As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides for convenient purification of the fusion protein. The “HA” tag is another peptide useful for purification which corresponds to an epitope derived from the influenza hemagglutinin protein, which has been described by Wilson et al., Cell 37: 767 (1984).
The (poly)peptide may be expressed in a modified form, such as a fusion protein, and may include not only secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the (poly)peptide to improve stability and persistence in the host cell, during purification, or during subsequent handling and storage. Also, peptide moieties may be added to the (poly)peptide to facilitate purification. Such regions may be removed prior to final preparation of the (poly)peptide. The addition of peptide moieties to (poly)peptides to engender secretion or excretion, to improve stability and to facilitate purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a heterologous region from immunoglobulin that is useful to stabilize and purify proteins.
The present invention also relates to a method of producing a vector comprising the nucleic acid molecule the present invention. Furthermore, the present invention relates to a vector produced said method.
The present invention also relates to a vector comprising the nucleic acid molecule of the present invention. Preferably said vector is a transfer or expression vector selected from the group consisting of pACT2; pAS2-1; pBTM116; pBTM117; pcDNA3.1; pcDNAI; pECFP; pECFP-C1; pECFP-N1; pECFP-N2; pECFP-N3; pEYFP-C1; pFLAG-CMV-5 a, b, c; pGAD10; pGAD424; pGAD425; pGAD427; pGAD428; pGBT9; pGEX-3×1; pGEX-5×1; pGEX-6P1; pGFP; pQE30; pQE30N; pQE30-NST; pQE31; pQE31 N; pQE32; pQE32N; pQE60; pSE111; pSG5; pTET-CMV-AS; pTET-CMV-F.°-AS; pTET-CMV-F.°-S; pTET-CMV-MCS; pTET-CMV-S; pTK-Hyg; pTL1; pTL10; pTL-HA0; pTL-HA1; pTL-HA2; pTL-HA3; pBTM118c; pGEX-6P3; pACGHLT-C; pACGHLT-A; pACGHLT-B; pUP; pcDNA3.1-V5His; pMalc2x. Said expression vectors may particularly be plasmids, cosmids, viruses or bacteriophages used conventionally in genetic engineering plasmids, cosmids, viruses and bacteriophages used conventionally in genetic engineering that comprise the aforementioned nucleic acid. Preferably, said vector is a gene transfer or targeting vector. Expression vectors derived from viruses such as retroviruses, vaccinia virus, adeno-associated virus, herpes viruses, or bovine papilloma virus, may be used for delivery of the nucleic acid into targeted cell population. Methods which are well known to those skilled in the art can be used to construct recombinant viral vectors; see, for example, the techniques described in Sambrook et al., Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. and Ausubel et al., Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. (1989).
In yet a further preferred embodiment of the invention the vector contains an additional expression cassette for a reporter protein, selected from the group consisting of β-galactosidase, luciferase, green fluorescent protein and variants thereof.
Preferably, said vector comprises regulatory elements for expression of said nucleic acid molecule. Consequently, the nucleic acid of the invention may be operatively linked to expression control sequences allowing expression in eukaryotic cells. Expression of said nucleic acid molecule comprises transcription of the sequence nucleic acid molecule into a translatable mRNA. Regulatory elements ensuring expression in eukaryotic cells, preferably mammalian cells, are well known to those skilled in the art. They usually comprise regulatory sequences ensuring initiation of transcription and, optionally, a poly-A signal ensuring termination of transcription and stabilization of the transcript, and/or an intron further enhancing expression of said nucleic acid. Additional regulatory elements may include transcriptional as well as translational enhancers, and/or naturally-associated or heterologous promoter regions. Possible regulatory elements permitting expression in eukaryotic host cells are the AOX1 or GAL1 promoter in yeast or the CMV-, SV40-, RSV-promoter (Rous sarcoma virus), CMV-enhancer, SV40-enhancer or a globin intron in mammalian and other animal cells. Beside elements which are responsible for the initiation of transcription such regulatory elements may also comprise transcription termination signals, such as the SV40-poly-A site or the tk-poly-A site, downstream of the nucleic acid molecule. Furthermore, depending on the expression system used leader sequences capable of directing the (poly)peptide to a cellular compartment or secreting it into the medium may be added to the coding sequence of the aforementioned nucleic acid and are well known in the art. The leader sequence(s) is (are) assembled in appropriate phase with translation, initiation and termination sequences, and preferably, a leader sequence capable of directing secretion of translated protein, or a portion thereof, into the periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a fusion protein including an C- or N-terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant product. In this context, suitable expression vectors are known in the art such as Okayama-Berg cDNA expression vector pcDVI (Pharmacia), pCDM8, pRc/CMV, pcDNA1, pcDNA3, the Echo™ Cloning System (Invitrogen), pSPORT1 (GIBCO BRL) or pRevTet-On/pRevTet-Off or pCI (Promega).
The present invention also relates to a method of producing a host cell comprising genetically engineering cells with the nucleic acid molecule or the vector of the present invention. The present invention also relates to a host cell produced said method. Furthermore, the present invention relates to a host cell comprising the vector of the present invention. Preferably, said host cell contains an endogenous nucleic acid molecule which is operably associated with a heterologous regulatory control sequence, including the regulatory elements contained in the vector of the present invention.
The present invention also relates to a method of producing a (poly)peptide, comprising culturing the host cell of the present invention under conditions such that the (poly)peptide encoded by said polynucleotide is expressed and recovering said (poly)peptide.
The present invention also relates to a (poly)peptide comprising an amino acid sequence encoded by a nucleic acid molecule of the present invention, or which is chemically synthesized, or is obtainable from the host cell of the present invention, or which is obtainable by a method of the present invention or which is obtainable from an in vitro translation system by expressing the nucleic acid molecule of the present invention or the vector of the present invention.
In another preferred embodiment of the invention, the (poly)peptide or protein is of vertebrate, nematode, insect, bakterium or yeast. Preferably, the nematode is Caenorhabditis elegans. In another more preferred embodiment of the present invention, the insect is Drosophila, preferably Drosophila melanogaster. In another more preferred embodiment of the present invention, the vertebrate is human, mouse rat, Xenopus laevis, zebrafish.
In another preferred embodiment, the (poly)peptide of the present invention is fused to a heterologous (poly)peptide. Such a fusion protein may include not only secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the (poly)peptide to improve stability and persistence in the host cell, during purification, or during subsequent handling and storage. Also, peptide moieties may be added to the (poly)peptide to facilitate purification. Such regions may be removed prior to final preparation of the (poly)peptide. The addition of peptide moieties to (poly)peptides to engender secretion or excretion, to improve stability and to facilitate purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a heterologous region from immunoglobulin that is useful to stabilize and purify proteins.
In a preferred embodiment of the present invention, the (poly)peptide of the present invention is fused to a heterologous (poly)peptide which is an immunoglobulin Fc domain or Protein A domain. In another preferred embodiment of the present invention, the (poly)peptide the (poly)peptide is labelled. Preferably, the label is selected from the group consisting of fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2′,7′-dimethoxy-4′,5′-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine(ROX), 6-carboxy-2′,4′,7′,4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA), radioactive labels, e.g. 32P, 35S, 3H; etc. The label may also be a two stage system, where the protein or (poly)peptide is conjugated to biotin, haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner is conjugated to a detectable label. In another preferred embodiment of the present invention the label is a toxin, radioisotope, or fluorescent label.
In another preferred embodiment of the present invention, the (poly)peptide contains or lacks an N-terminal methionine. it is well known in the art that the N-terminal methionine encoded by the translation initiation codon generally is removed with high efficiency from any protein after translation in all eukaryotic cells. While the N-terminal methionine on most proteins also is efficiently removed in most prokaryotes, for some proteins this prokaryotic removal process is inefficient, depending on the nature of the amino acid to which the N-terminal methionine is covalently linked.
The present invention also relates to a protein complex comprising at least two proteins, wherein said at least two proteins are selected from the group of interaction partners listed in table 9. The term “protein complex” refers to a compound stably comprising at least two proteins. Preferably, said stability allows to purify said protein complex. In a preferred embodiment of the present invention, the protein complex comprises GIT1 and huntingtin.
The present invention also relates to the protein network of huntingtin, preferably the physical protein entities forming this network, which is described herein. In one embodiment, said protein network is formed by the interaction partners shown in table 6. Preferable, the protein network of the present invention is a validated protein network as described herein.
The present invention also relates to an antibody specifically recognizing the (poly)peptide of the present invention or specifically reacting with the protein complex of the present invention. This antibody is characterized in not recognizing the individual components of the protein complex but rather the complex itself. As such, said antibody recognizes a combined epitope, composed of amino acids of two different proteins within the protein complex. Dissociation of the complex will be detrimental to antibody recognition. Therefore, antibody binding depends on the integrity of the protein complex. In a preferred embodiment of the present invention, the antibody is specific for a protein complex comprising GIT1 and huntingtin.
In a preferred embodiment, the antibody of the present invention is polyclonal, monoclonal, chimeric, single chain, single chain Fv, human antibody, humanized antibody, or Fab fragment
In a more preferred embodiment of the present invention the antibody is labeled. Preferably, the label is selected from the group consisting of fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2′,7′-dimethoxy-4′,5′-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine(ROX), 6-carboxy-2′,4′,7′,4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA), radioactive labels, e.g. 32P, 35S, 3H; etc. The label may also be a two stage system, where the antibody is conjugated to biotin, haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner is conjugated to a detectable label. In another preferred embodiment of the present invention the label is a toxin, radioisotope, or fluorescent label.
In a preferred embodiment of the present invention, the antibody is immobilized to a solid support. Preferably, the solid support may be the surface of a cell, a microtiter plate, beads or the surface of a sensor capable of detecting binding of the antibody or to the antibody.
The present invention also relates to a method of identifying whether a protein promotes huntingtin aggregation, comprising (a) transfecting a first cell with a nucleic acid molecule encoding a variant of the huntingtin protein or a fragment thereof capable of forming huntingtin aggregates; (b) co-transfecting a second cell with (i) a nucleic acid molecule encoding a variant of the huntingtin protein or a fragment thereof capable of forming huntingtin aggregates; and (ii) a nucleic acid molecule encoding a candidate modulator protein identified by the methods of the present invention or a nucleic acid molecule encoding a modulator protein selected from table 6 or table 7 (c) expressing the proteins encoded by the transfected nucleic acid molecule of (a) and (b); (d) isolating insoluble aggregates of huntingtin from the transfected cell of (a) and (b); and (e) determining the amount of insoluble huntingtin aggregates from the transfected cell of (a) and (b), wherein an increased amount of huntingtin aggregates isolated from the transfected cells of (b) in comparison with the amount of huntingtin aggregates isolated from the transfected cells of (a) is indicative of a protein's activity as an enhancer of huntingtin aggregation. Preferably, the huntingtin protein or protein fragment of step (a) is HD169Q68 or HD510Q68.
The present invention also relates to a method of identifying whether a protein inhibits huntingtin aggregation, comprising (a) transfecting a first cell with a nucleic acid molecule encoding a variant of the huntingtin protein or a fragment thereof capable of forming huntingtin aggregates; (b) co-transfecting a second cell with (i) a nucleic acid molecule encoding a variant of the huntingtin protein or a fragment thereof capable of forming huntingtin aggregates; and (ii) a nucleic acid molecule encoding a candidate modulator protein identified by the methods of the present invention or a nucleic acid molecule encoding a modulator protein selected from table 6 or table 7 (c) expressing the proteins encoded by the transfected nucleic acid molecule of (a) and (b); (d) isolating insoluble aggregates of huntingtin from the transfected cell of (a) and (b); and (e) determining the amount of insoluble huntingtin aggregates from the transfected cell of (a) and (b), wherein a reduced amount of huntingtin aggregates isolated from the transfected cells of (b) in comparison with the amount of huntingtin aggregates isolated from the transfected cells of (a) is indicative of a protein's activity as an inhibitor of huntingtin aggregation. Preferably, the huntingtin protein or protein fragment of step (a) is HD169Q68 or HD510Q68 or HdexQ51.
The term “promotes” means increasing the amount of huntingtin aggregation.
Preferably said huntingtin protein or the fragments thereof is selected from the proteins listed in table 6 and/or 7. Preferably said insoluble aggregates are isolated by using a filter retardation method comprising lysing cells and boiling in 2% SDS for 5 min in the presence of 100 mM DDT followed by a filtration step. The presence of aggregates is detected by using specific antibodies.
In a preferred embodiment of the present invention, determining the amount of insoluble huntingtin is performed by using light scattering or size exclusion chromatography. In another preferred embodiment of the present invention prior to step (d) the cells are treated with an ionic detergent. In yet another preferred embodiment of the methods of the present invention, the huntingtin aggregates are filtered or transferred onto a membrane.
The present invention also relates to a method for identifying compounds affecting, e.g. interfering or enhancing the interaction of huntingtin or of a direct or indirect interaction partner of huntingtin comprising (a) contacting interacting proteins selected from the group of interacting proteins listed in table 6 in the presence or absence of a potential modulator of interaction; and (b) identifying compounds capable of modulating said interaction. The contacting is performed under conditions that permit the interaction of the two proteins. Sometimes more than two interacting proteins might be present in a single reaction as additional interaction partners of those listed under table 6, can be tested. However, the compound may also be a small molecule. Preferably said compounds are antibodies directed to huntingtin or to said interaction partner listed in table 6, wherein these antibodies are capable of interfering with the interaction with huntingtin. Alternatively, said compound is a peptide fragment of 10 to 25 amino acid residues of an interaction partner listed in table 7, wherein said peptide fragment is capable of interfering with the interaction with huntingtin. In a more preferred embodiment of the present invention, said antibody is an antibody directed to GIT1. In another more preferred embodiment of the invention, said peptide fragment is a peptide fragment of GIT1 of 10 to 25 capable of interfering with the interaction of GIT1 with huntingtin. Said interfering peptide may contain additional modifications in order to increase cellular uptake, solubility or to increase stability. Such modifications are known to the person skilled in the art and need not be listed here in detail. In a preferred embodiment of the present invention, the methods for identifying a compound further comprise the steps of modeling said compound by peptidomentics and chemically synthesizing the modeled compound.
In another preferred embodiment of the present invention, the methods for identifying a compound further comprise producing said compound. In yet another preferred embodiment of the present invention, the method for identifying said compound further comprise modifiying to achieve (i) modified site of action, spectrum of activity, organ specificity, and/or (ii) improved potency, and/or (iii) decreased toxicity (improved therapeutic index), and/or (iv) decreased side effects, and/or (v) modified onset of therapeutic action, duration of effect, and/or (vi) modified pharmakinetic parameters (resorption, distribution, metabolism and excretion), and/or (vii) modified physico-chemical parameters (solubility, hygroscopicity, color, taste, odor, stability, state), and/or (viii) improved general specificity, organ/tissue specificity, and/or (ix) optimized application form and route by (i) esterification of carboxyl groups, or (ii) esterification of hydroxyl groups with carbon acids, or (iii) esterification of hydroxyl groups to, e.g. phosphates, pyrophosphates or sulfates or hemi succinates, or (iv) formation of pharmaceutically acceptable salts, or (v) formation of pharmaceutically acceptable complexes, or (vi) synthesis of pharmacologically active polymers, or (vii) introduction of hydrophilic moieties, or (viii) introduction/exchange of substituents on aromates or side chains, change of substituent pattern, or (ix) modification by introduction of isosteric or bioisosteric moieties, or (x) synthesis of homologous compounds, or (xi) introduction of branched side chains, or (xii) conversion of alkyl substituents to cyclic analogues, or (xiii) derivatisation of hydroxyl group to ketales, acetates, or (xiv) N-acetylation to amides, phenylcarbamates, or (xv) synthesis of Mannich bases, imines, or transformation of ketones or aldehydes to Schiff's bases, oximes, acetates, ketales, enolesters, oxazolidines, thiozolidines or combinations thereof.
The present invention also relates to a method of diagnosing Huntington's disease in a biological sample comprising the steps of (a) contacting the sample with an antibody specific for a protein of table 6 or 7 or an antibody specific for the protein complex of the present invention; and (b) detecting binding of the antibody to a protein complex, wherein the detection of binding is indicative of Huntington's disease or of a predisposition to develop Huntington's disease. Preferably, binding is detected by measuring the presence of a fluorescent label bound to the protein complex.
In a preferred embodiment of the present invention's method protein complex contains (a) GIT1 or (b) said antibody is specific for a protein complex containing GIT1.
In a preferred embodiment of the present invention, said protein complex contains (a) at least one protein selected from htt, HIP15 or HP28 or (b) said antibody is specific for a protein complex containing at least one protein selected from htt, HIP15 or HP28.
The present invention also relates to a diagnostic agent/composition comprising the nucleic acid molecule of the present invention, the (poly)peptide of the present invention including/or the (poly)peptide mentioned in table 6 or 7, the antibody of the present invention, an antibody specifically reacting with a protein selected from table 7 and/or a protein selected from table 7.
Moreover, the present invention also relates to a pharmaceutical composition comprising the nucleic acid molecule of the present invention, the (poly)peptide of the present invention, the interfering compound identified with a method of the present invention, the antibody of the present invention, an antibody specifically reacting with a protein selected from table 7 and/or a protein selected from table 7.
The pharmaceutical composition will be formulated and dosed in a fashion consistent with good medical practice, taking into account the clinical condition of the individual patient, the site of delivery of the pharmaceutical composition, the method of administration, the scheduling of administration, and other factors known to practitioners. The “effective amount” of the pharmaceutical composition for purposes herein is thus determined by such considerations.
As a general proposition, the total pharmaceutically effective amount of pharmaceutical composition administered parenterally per dose will be in the range of about 1 μg protein/kg/day to 10 mg protein/kg/day of patient body weight, although, as noted above, this will be subject to therapeutic discretion. More preferably, this dose is at least 0.01 mg protein/kg/day, and most preferably for humans between about 0.01 and 1 mg protein/kg/day for the peptide. If given continuously, the pharmaceutical composition is typically administered at a dose rate of about 1 μg/kg/hour to about 50 μg/kg/hour, either by 1-4 injections per day or by continuous subcutaneous infusions, for example, using a mini-pump. An intravenous bag solution may also be employed. The length of treatment needed to observe changes and the interval following treatment for responses to occur appears to vary depending on the desired effect.
Pharmaceutical compositions of the invention may be administered orally, rectally, parenterally, intracistemally, intravaginally, intraperitoneally, topically (as by powders, ointments, drops or transdermal patch), bucally, or as an oral or nasal spray. By “pharmaceutically acceptable carrier” is meant a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type. The term “parenteral” as used herein refers to modes of administration which include intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and intraarticular injection and infusion.
The pharmaceutical composition is also suitably administered by sustained-release systems. Suitable examples of sustained-release compositions include semi-permeable polymer matrices in the form of shaped articles, e.g., films, or mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., Biopolymers 22:547-556 (1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al., Id.) or poly-D-(−)-3-hydroxybutyric acid (EP 133,988). Sustained-release pharmaceutical composition also include liposomally entrapped protein, antibody, (poly)peptide, peptide or nucleic acid. Liposomes containing the pharmaceutical composition are prepared by methods known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. (USA) 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. (USA) 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content is greater than about 30 mol. percent cholesterol, the selected proportion being adjusted for the optimal therapy.
For parenteral administration, in one embodiment, the pharmaceutical composition is formulated generally by mixing it at the desired degree of purity, in a unit dosage injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations employed and is compatible with other ingredients of the formulation. For example, the formulation preferably does not include oxidizing agents and other compounds that are known to be deleterious to (poly)peptides.
Generally, the formulations are prepared by contacting the components of the pharmaceutical composition uniformly and intimately with liquid carriers or finely divided solid carriers or both. Then, if necessary, the product is shaped into the desired formulation. Preferably the carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood of the recipient. Examples of such carrier vehicles include water, saline, Ringer's solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well as liposomes. The carrier suitably contains minor amounts of additives such as substances that enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at the dosages and concentrations employed, and include buffers such as phosphate, citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic acid; low molecular weight (less than about ten residues) (poly)peptides, e.g., polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates including cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates, poloxamers, or PEG. The proteinacous components of the pharmaceutical composition are typically formulated in such vehicles at a concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be understood that the use of certain of the foregoing excipients, carriers, or stabilizers will result in the formation protein or (poly)peptide salts.
The components of the pharmaceutical composition to be used for therapeutic administration must be sterile. Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron membranes). Therapeutic components of the pharmaceutical composition (poly)peptide compositions generally are placed into a container having a sterile access port, for example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle.
The components of the pharmaceutical composition ordinarily will be stored in unit or multi-dose containers, for example, sealed ampoules or vials, as an aqueous solution or as a lyophilized formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous protein solution, and the resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the lyophilized protein using bacteriostatic Water-for-Injection.
The invention also provides a pharmaceutical/diagnostic pack or kit comprising one or more containers filled with one or more of the ingredients of the pharmaceutical/diagnostic compositions of the invention. Associated with such container(s) can be a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, which notice reflects approval by the agency of manufacture, use or sale for human administration. In addition, the (poly)peptides of the components of the pharmaceutical composition invention may be employed in conjunction with other therapeutic compounds.
Finally, the present invention relates to the use of the nucleic acid molecule of the present invention, the interfering compound identified with a method of the present invention, the (poly)peptide of the present invention including/or the (poly)peptide mentioned in table 6 or 7, the antibody of the present invention, an antibody specifically reacting with a protein selected from table 7 and/or a protein selected from table 7 for the preparation of a pharmaceutical composition for the treatment of Huntington's disease. Tables:
Abbreviations:
aa, amino acids;
IDEN, Identity;
LOC, localisation;
AD, activation domain;
DBD, DNA binding domain;
AJ, adherens junctions;
C, cytosol;
CN, cytoskeleton;
EC, extracellular space;
EE, early endosomes;
ER, endoplasmic reticulum;
IF, intermediate filaments;
GN, Golgi network;
Mit, mitochondria;
MT, microtubules;
N, nucleus;
PM, plasma membrane;
PN, perinuclear;
UN, unknown;
V, vesicles;
[ ], database annotation
Abbreviations:
aa, amino acids;
IDEN, identity;
LOC, localisation;
AD, activation domain;
DBD, DNA binding domain;
AJ, adherens junctions;
C, cytosol;
CN, cytoskeleton;
EC, extracellular space;
EE, early endosomes;
ER, endoplasmic reticulum;
IF, intermediate filaments;
GN, Golgi network;
Mit, mitochondria;
MT, microtubules;
N, nucleus;
PM, plasma membrane;
PN, perinuclear;
UN, unknown;
V, vesicles;
[ ], database annotation
Abbreviations:
aa, amino acids;
IDEN, identity;
LOC, localization;
LOCUS ID, NCBI LocusLink Identity, activation domain;
DBD, DNA binding domain;
DBD1, DBD fusion proteins yielding no interactions;
DBD2, autoactive DBD fusion proteins;
AJ, adherens junctions;
C, cytosol;
CN, cytoskeleton;
EC, extracellular space;
EE, early endosomes;
ER, endoplasmic reticulum;
IF, intermediate filaments;
GN, Golgi network;
Mit, mitochondria;
MT, microtubules;
N, nucleus;
PM, plasma membrane;
pN, perinuclear;
UN, unknown;
V, vesicles;
[ ], database annotation.
Abbreviations:
aa, amino acids;
IDEN, identity;
LOC, localisation;
AD, activation domain;
DBD, DNA binding domain;
AJ, adherens junctions;
C, cytosol;
CN, cytoskeleton;
EC, extracellular space;
EE, early endosomes;
ER, endoplasmic reticulum;
IF, intermediate filaments;
GN, Golgi network;
Mit, mitochondria;
MT, microtubules;
N, nucleus;
PM, plasma membrane;
PN, perinuclear;
UN, unknown;
V, vesicles;
[ ], database annotation
Abbreviations:
aa, amino acids;
DBD, DNA binding domain;
PPIs, protein-protein interactions;
AA, autoactivation of reporter gene.
Abbreviations:
DBD, DNA binding domain;
PPIs, protein-protein interactions;
AA, autoactivation of reporter gene.
Abbreviations:
ID, interacting protein gene symbol;
LOCUS ID, NCBI LocusLink Identity;
Pubmed ID, NCBI PubMed publication index;
Reported htt interactors are presented according to databases: MINT, HPRD, BIND; Li & Ll, Trends Genet. (2004), 20, 146-152 and Harjes & Wanker, Trends. Biochem. Sci. (2003), 28, 425-433.
Abbreviations:
ID, interacting protein gene symbol;
LOCUS ID, NCBI LocusLink Identity.
The presented list of protein-protein interactions is computed from databases: MINT, HPRD, BIND; Li & Li, Trends Genet. (2004), 20, 146-152 and Harjes & Wanker, Trends. Biochem. Sci. (2003), 28, 425-433.
The figures show:
The examples illustrate the invention:
Part I: Establishing the Protein-Interaction Network of Huntingtin
Antibodies, Strains and Plasmids
A polyclonal antibody (pAb) against GIT1 was generated by injection of affinity purified His6-tagged GIT1 (residues 368-587) into a rabbit. The htt-specific pAb CAG53b and HD1 were described13,14. Commercially available antibodies were anti-GST pAb (Amersham Pharmacia), anti-GIT1 pAb (Santa Cruz Biotechnology), anti-HA monoclonal antibody 12CA5 (mAb) (Roche Diagnostics), anti-htt pAb EM4847, anti-htt mAb 2B448 and anti-htt mAb 4C8 (Chemicon). As secondary antibodies for immunofluorescence microscopy Cy3- and FITC-conjugated IgGs (Jackson ImmunoResearch) were used. The yeast strains used as two-hybrid reporters were L40 ccua [MATa his3Δ200 trp1-901 leu2-3,112 LYS2::(lexAop)4-HIS3 ura3::(lexAop)8-lacZ ADE2::(lexAop)8-URA3 GAL4 gal80can1 cyh2] and L40 ccα [MATα his3Δ200 trp1-910 leu2-3,112 ade2 LYS2::(lexAop)4-HIS3 URA3::(lexAop)8-lacZ GAL4 gal80 can1 cyh2]. Both strains are derivatives of L40c17. Plasmids pHD510Q17 and pHD510Q68 were generated by insertion of fragments coding for HD510Q17 and HD510Q68 into pcDNA-I (Invitrogen). pHD169Q68 was derived from pHD510Q68 by deletion of the XhoI-XhoI fragment encoding aa 170-510 of human htt.
Library Screening
Plasmids encoding bait proteins were transformed into the strain L40 ccua, tested for the absence of reporter gene activity and cotransformed with a human fetal brain cDNA library (Clontech). For each transformation 1×106 independent transformants were plated onto minimal medium lacking tryptophan, leucine, histidine and uracil (SDIV medium) and incubated at 30° C. for 5 to 10 days. Clones were picked into microtitre plates using a picking robot and grown over night in liquid minimal medium lacking tryptophan and leucine (SDII medium). Then, they were spotted onto nylon or nitrocellulose membranes placed on SDIV medium plates. After incubation for 4 days membranes were subjected to a β-galactosidase (β-GAL) assay. Plasmids were prepared from positive clones and characterised by restriction analyses and sequencing. For retransformation assays plasmids encoding bait and prey proteins were cotransformed in the yeast strain L40 ccua and plated onto SDIV medium.
Array Mating Screen
Plasmids encoding bait and prey proteins were transformed into strains L40 ccua and L40 ccα, respectively. L40 ccα clones were arrayed in 96-well microtitre plates and mixed with a single L40 ccua clone for interaction mating. Diploid cells were transferred by a robot (Beckman, Biomek® 2000) onto YPD medium plates and, after incubation for 24 h at 30° C., onto SDII medium plates for additional 72 h at 36° C. For two-hybrid selection diploid cells were transferred onto SDIV medium plates with and without nylon or nitrocellulose membranes and incubated for 5 days at 30° C. The nylon or nitrocellulose membranes were subjected to the β-GAL assay. Positive clones were verified by cotransformation assays using plasmids encoding respective bait and prey proteins.
Protein Expression and Verification Assays
For verification experiments cDNA fragments encoding baits and preys were subcloned into pGEX derivatives (Stratagene) or pTL-HA18. GST fusion proteins were expressed in E. coli BL21-codon Plus™ RP (Stratagene) and affinity purified on glutathione agarose beads (Sigma) using standard protocols17. COS1 cells were transfected with mammalian expression plasmids and lysed as described18. For in vitro binding assays, 30 μg of GST or GST fusion protein were immobilized on glutathione agarose beads and incubated with 500 μg protein extract prepared from COS1 cells expressing a HA-tagged fusion protein for 2 h at 4° C. in binding buffer [50 mM HEPES pH 7.4, 150 mM NaCl, 10% glycerol, 1% NP-40, 1 mM EDTA, 20 mM NaF, 1 mM DTT, 0.1% Triton X-100, protease inhibitors (Roche Diagnostics)]. After centrifugation and extensive washing of the beads bound proteins were eluted and analysed by SDS-PAGE and Western blotting. Coimmunoprecipitation experiments were performed as described by Sittler et al.,18. For immunofluorescence microscopy COS1 cells were grown on cover slips and cotransfected with pcDNA-HD510Q68 and pTL-HA-GIT1. 40 h post transfection cells were fixed with 2% paraformaldehyde. Standard protocols for staining with appropriate primary and secondary antibodies were used18.
Filter Retardation Assay
HEK293 cells coexpressing HD169Q68 and GIT1, PIASy, HIP5, HP28, PFN2, FEZ1 or BARD1 were harvested 48 h post transfection. Cells were lysed as described 18 and boiled in 2% SDS, 100 mM DTT for 5 min. Aliquots containing 50, 25 and 12.5 μg of total protein were used for filtration on a cellulose acetate membrane 1 SDS-resistant aggregates were detected using anti-CAG53b or anti-GIT1 antibodies.
Immunocytochemistry
Mice were deeply anaesthetised and perfused through the left cardiac ventricle with 4% paraformaldehyde in 0.1 M phosphate buffer. Brains were removed and postfixed overnight in 4% paraformaldehyde. Sections were processed for immunocytochemistry as described47. pAb EM48 (1:1000) and affinity purified anti-GIT1 pAb (1:100) were used as primary antibodies.
Six human HD and 5 control brains were used in this study. Two HD cases were classified as grade 3 and four cases as grade 4 of neuropathological severity. For immunolabelling standard protocols were used48. 2B4 mAb (1:200) and affinity purified GIT1 pAb (1:50) were used as primary antibodies.
To generate a PPI network for HD we used a combination of library and matrix yeast two-hybrid screens (
In the second round of screens, 12 cDNA fragments encoding preys identified in the first screen were subcloned into a DNA binding domain vector. The resulting baits were tested for autoactivation and 10 were screened against a human fetal brain cDNA library. Four of the 10 proteins revealed additional 13 PPIs.
Finally, an array mating screen was performed to connect all baits and preys identified in the transformation screens. For this assay, MATα yeast cultures were transformed with plasmids encoding prey proteins and arrayed in 96-well microtitre plates for interaction mating with individual MATa strains expressing bait proteins. Using this strategy each bait was individually tested for interaction with every prey in the array. Diploid yeast clones, formed by mating on YPD plates, were selected on agar SDII plates, and further transferred by a spotting robot on SDIV plates to select for Y2H interactions (
Thus, the combination of cDNA library and array mating screens proved powerful in establishing a highly connected PPI network linked to htt.
Sequence analysis of the cDNAs encoding bait and prey proteins revealed ORFs ranging from 82 to 728 amino acids in size (Table 2). In a systematic Blast search 60 out of the 67 proteins identified were identical to a SwissProt or TrEMBL protein entry (http://us.expasy.org/sprot/). The remaining 7 proteins showed 75-99% identity to its best fit and either contained single amino acid substitutions, variable polyQ lengths or small regions of sequence variation. Uncharacterised proteins were named according to their interaction partners. Each ORF was further examined for consensus protein domains using the FprintScan, HMMPfam, HMMSmart, ProfileScan, and BlastProDom programs providing useful hints to protein function. For example, the protein BAIP1 (BARD1 interacting protein 1) possesses a Zn-finger-like PHD finger that is believed to be important for chromatin-mediated transcriptional regulation. Similarly, domain searches for BAIP2 (BARD1 interacting protein 2) revealed a BTB/POZ domain, a motif found in developmentally regulated zinc finger proteins of the Kelch family of actin-associated proteins. Thus, BAIP2 could potentially mediate the association of BARD1 with the actin cytoskeleton.
Our two-hybrid screens identified a total of 117 PPIs between 70 protein fragments. As a result of the iterative two-hybrid strategy all interactions could be depicted in a single large network. The number of interactions identified for each bait varied from 1 to 18, with each protein having 1.6 interaction partners on average. In order to display the PPI data, both matrix and network representations were used (
We assigned a subcellular localisation to each protein by examining various sources of literature and based on available experimental data we grouped the proteins into six broad functional categories (
Eighteen proteins in the HD network are involved in transcriptional regulation or DNA maintenance (
For the analysis of htt PPIs, as much as 40 out of 117 interactions (34,2%) included a htt fragment (
The current hypothesis that htt has a function in transcriptional regulation is inferred from, its interactions with transcriptional activators, coactivators or repressors21 In agreement with previous reports, binding of htt to CA15022 and HYPA23 has been detected in our screens. In addition, new connections to nuclear proteins such as SETBD1, PLIP and HBO1 were found. These multidomain proteins act on histones and are known modulators of chromatin structure and gene expression. Similarly, the zinc finger bromo domain containing proteins BARD1, NAG4, HZFH, ZHX1, ZNF33B play a role in transcriptional control. The protein IKAP directly interacts with htt and was recently shown to be part of a complex regulating RNA polymerase II activity24. Htt also interacts with PIASy, which inhibits transcription factor STAT-mediated gene activation25. PIASy functions as SUMO E3 ligase for the Wnt-responsive transcription factor LEF1, inhibiting its activity via sumoylation26. This suggests that PIASy catalysed sumoylation of transcription factors could represent a general mechanism in repression of gene expression. The binding of PIASy to htt indicates that htt may itself be a substrate for sumoylation. Alternatively, it could influence the sumoylation of other transcription factors. Thus, our data extend the nuclear role of htt and provide additional leads for its involvement in transcriptional regulation.
Another large group of htt interactors identified here are proteins that function in cellular organization and vesicle transport. We report a new interaction between htt and dynein light chain (HP28), a component of the dynein/dynactin motor protein complex. Interestingly, the p150Glued subunit of dynactin is linked to the htt-associated proteinHAP116,27. Our observation that htt directly binds to HP28 underscores the potential scaffolding role of htt/HAP1 in dynein/dynactin driven retrograde vesicle transport along microtubules in axons.
The htt interacting protein HIP1 anchors clathrin-coated vesicles to the cytoskeleton via its actin-binding domain, a link crucial for synaptic vesicle endocytosis28. Here, a new PPI between htt and profilin II (PFN2)29 was detected. PFN2, a protein enriched in neurons, modulates actin polymerization in vitro and is involved in endocytosis via association with scaffolding proteins29. The htt-PFN2 connection adds support to a potential role of htt in modulation of both actin polymerization and vesicle transport processes.
Currently, for the function of 6 htt interactors, including HIP5, no genetic or biochemical evidence is available (Table 2). We found that HIP5 binds to htt as well as to karyopherin a (KPNA2). KPNA2 serves as an adapter for karyopherin β (KPNB1), which transports NLS-tagged proteins into the nucleus30. Thus, HIP5 might take this route to the nucleus. Interestingly, HEAT or armadillo (ARM) repeats, forming α-helical structures in KPNA2 and KPNB1 are also present in htt31 Therefore, the complexes between KPNA2 and HIP5 as well as between htt and HIP5 could be similar in terms of protein structure. It is tempting to further speculate that htt participates in nucleocytoplasmic transport.
Comparison with literature-cited interactions revealed that more than 80% of the two-hybrid interactions identified here are novel. For all network bait and prey proteins only 24 PPIs have been reported previously using two-hybrid methods, coimmunoprecipitations or affinity chromatography-based techniques; 18 of these were confirmed in our Y2H assays (
Beside false negatives, the two-hybrid assay is also prone to create false positive results9. Addressing this issue we performed a series of pull-down and overlay assays and thereby confirmed several of the two-hybrid PPIs independently. Proteins were expressed as GST-fusions in E coli and as HA-fusions in COS1 cells. After immobilization of the GST-fusion protein to beads or nitrocellulose membranes the respective partner was affinity-purified from a COS1 cell extract and binding was detected by immunoblotting. Using these assays, 22 physical interactions, central to the HD network, were verified (
The formation of insoluble polyQ-containing protein aggregates is a pathological hallmark of HD. Several lines of evidence link htt aggregation to disease progression and the development of motor symptoms. We screened network proteins for their potential to enhance htt aggregation in a cell-based aggregation assay14. In this assay, formation of SDS-insoluble htt aggregates in mammalian cells, that have been cotransfected with constructs encoding an N-terminal htt fragment with 68 glutamines (HD169Q68) and a network protein of interest, is monitored by filter retardation14 HD169Q68 per se has only a low propensity to form insoluble aggregates in HEK293 cells. However, as shown in
Furthermore, probing of the insoluble HD169Q68 aggregates with an anti-GIT1 antibody revealed that GIT1 does not only stimulate aggregation but is also an integral part of the insoluble aggregates (
The interaction between GIT1 and htt was confirmed by coimmunoprecipitation from COS1 cells transfected with constructs encoding HD510Q68 and HA-GIT1. Forty hours post transfection cell extracts were prepared and treated with antiserum against GIT1. HD510Q68 and HA-GIT1 were detected in the immunoprecipitate on Western blots with anti-hft antibody 4C8 and anti-HA antibody 12CA5, respectively (
The GIT1-htt interaction was also detected in human brain. Protein extracts prepared from human cortex were treated with the anti-htt antibodies CAG53b and HD1, and the precipitate was probed for the presence of GIT1 (
Finally, we performed colocalisation studies of htt and GIT1 in COS1 cells using immunofluorescence microscopy. In cells expressing HD510Q68 or GIT1 alone a diffuse cytoplasmic staining was observed for each protein (
The finding of colocalisation of htt and GIT1 within aggregates in transfected COS1 cells suggests that GIT1 might also be a component of htt aggregates in vivo. To investigate this possibility we first assessed the distribution of GIT1 in brains of R6/1 transgenic mice expressing a human htt exon 1 protein with 150 glutamines34. In wildtype mice, GIT1 immunoreaction product was found diffuse in the cytoplasm and nuclei of neurons throughout the brain. In R6/1 brains, in addition to the diffuse staining, GIT1 immunoreactivity was also present in large nuclear and cytoplasmic puncta similar to htt aggregates (
Part II: Verification and Further Results
Antibodies
A polyclonal antibody (pAb) against GIT1 was generated by injection of purified His6-tagged GIT1 (aa 368-587) into a rabbit. The resulting GIT1 pAb (C-GIT1) was affinity purified using immobilized GIT1 protein. The pAb NT-GIT1 recognizes the first 100 aa of GIT1 (Santa Cruz Biotechnology), the monoclonal antibody (mAb) CT-GIT1 (Transduction Laboratories) is specific for the last 106 amino acids of GIT1. For all three Abs, no cross-reaction with GIT2 was observed (
Strains and Plasmids
The yeast strains used for two-hybrid analysis were L40 ccua [MATa his3D200 trp1-901 leu2-3,112 LYS2::(lexAop)4-HIS3 ura3::(lexAop)8-lacZ ADE2::(lexAop)8-URA3 GAL4 gal80 can1 cyh2] and L40 cca [MATa his3D200 trp1-910 leu2-3,112 ade2 LYS2::(lexAop)4-HIS3 URA3::(lexAop)8-lacZ GAL4 gal80 can1 cyh2].
Plasmids pHD510Q17 and pHD510Q68 were generated by insertion of fragments coding for HD510Q17 and HD510Q68 into pcDNA-1 (Invitrogen). pHD169Q68 was derived from pHD510Q68 by deletion of the XhoI-XhoI fragment encoding aa 170-510 of human htt. pV5-HD169Q68 was generated by inserting the EcoRI-XhoI fragment from pHD510Q68 into pcDNA3.1/5-HIS (Invitrogen). Full-length GIT1 (aa 1-770) was amplified by PCR from the cDNA clone IMAGp958H111245Q2 (RZPD, Germany) using the primers GIT1-F/GIT1-R and subcloned into the EcoRI-BglII site of pTL-HA (HA-GIT1). The GIT2 cDNA (aa 1-759) was PCR amplified with the primers GIT2-F/GIT2-R and subcloned into the XhoI-NotI site of pTL-HA (HA-GIT2). The primer sequences were as follows: GIT1-F (5′-CGGMTTCATGTCCCGAAAGGGGCCGCG-3′), GIT1-R (5′-GGMGATCT GGTCACTGCTTCTTCTCTCGGG-3′), GIT2-F (5′-ACGCGTCGACCATGTCGAAA CGGCTCCG-3′) and GIT2-R (5′-ATAAGAATGCGGCCGCGCCCTGCCCTTGCTA GTTG-3′).
Library Screening
Plasmids encoding baits were transformed into L40 ccua, tested for the absence of reporter gene activity and cotransformed with a human fetal brain cDNA library (Clontech). For each transformation, 1×106 independent transformants were plated onto minimal medium lacking tryptophan, leucine, histidine and uracil (SDIV medium) and incubated at 30° C. for 5 to 10 days. Clones were picked into microtitre plates and grown overnight in liquid minimal medium lacking tryptophan and leucine (SDII medium). Then, they were spotted onto nylon membranes placed on SDIV agar plates. After incubation for 4 days, the membranes were subjected to a b-galactosidase (b-GAL) assay. Plasmids were prepared from positive clones and characterized by sequencing. For retransformation assays, plasmids encoding baits and preys were cotransformed into L40 ccua and plated onto SDIV medium.
Array Mating Screen
Plasmids encoding baits and preys were transformed into strains L40 ccua and L40 cca, respectively. L40 cca clones were arrayed in 96-well microtitre plates and mixed with a single L40 ccua clone for interaction mating. Diploid cells were transferred onto YPD medium plates and, after incubation for 24 h at 30° C., onto SDII medium plates for additional 72 h at 30° C. For two-hybrid selection, diploid cells were transferred onto SDIV medium plates with and without nylon membranes and incubated for 5 days at 30° C. The nylon membranes were subjected to the b-GAL assay. Positive clones were verified by cotransformation assays.
Protein Expression and Verification Assays
For verification experiments, cDNA fragments encoding baits and preys were subcloned into pGEX derivatives (Stratagene) or pTL-HA (Sittler et al., 1998). GST-fusion proteins were expressed in E. coli BL21-codon Plus™ RP (Stratagene) and affinity purified on glutathione agarose beads (Sigma) (Wanker et al., 1997). COS-1 cells were transfected with mammalian expression plasmids and lysed as described (Sittler et al., 1998). For in vitro binding assays, 30 μg of GST or GST fusion protein were immobilized on glutathione agarose beads and incubated with 500 μg COS-1 cell extract containing HA-tagged fusion protein for 2 h at 4° C., in binding buffer [50 mM HEPES-KOH pH 7.4, 150 mM NaCl, 10% glycerol, 1% NP-40, 1 mM EDTA, 20 mM NaF, 1 mM DTT, 0.1% Triton X-100, protease inhibitors (Roche Diagnostics)]. After centrifugation and extensive washing, bound proteins were eluted and analyzed by SDS-PAGE and Western blotting.
Coimmunoprecipitation experiments were performed as previously described (Sittler et al., 1998). For immunofluorescence microscopy, COS-1 cells were grown on cover slips and cotransfected with plasmids encoding N-terminal htt V5-HD169Q68 and/or C-terminal HA-GIT1-CT. 40 h post-transfection, cells were treated with 2% paraformaldehyde. Immunolabeling was performed with anti-C-GIT1 (1:500) and with anti-V5 (1:300) Abs. Nuclei were counterstained with Hoechst. For subcellular localization of endogenous GIT1 and htt, differentiated PC12 and SH-SY5Y cells were used. PC12 cells were treated with 50 ng/ml NGF and grown on cover slips for 6 d. SH-SY5Y cells were serum starved for 24 h and then differentiated with 10 nM IGF-I for 30 min. Cells were labeled with C-GIT1 (1:20) and 4C8 (1:20) Abs and viewed with a confocal microscope LSM510 (Zeiss).
Filter Retardation Assay
HEK293 cells coexpressing HD169Q68 and selected network proteins were harvested 48 h post-transfection. Cell lysates were boiled in 2% SDS, 50 mM DTT for 5 min. Aliquots containing 12.5, 25 or 50 μg of total protein were used for filtration on cellulose acetate membranes (Scherzinger et al., 1997). SDS-resistant aggregates were detected using anti-CAG53b or anti C-GIT1 pAbs.
Inhibition of GIT1 Expression by siRNA
For silencing of endogenous GIT1 expression, HEK293 cells were transfected with the siRNA duplex siRNA-GIT1 (5′-AAGCCTGGATGGAGACCTA GA-3′) using TransMessenger (Qiagen) or Lipofectamin 2000 (Invitrogen) transfection reagents. 48 h post transfection, cell lysates were analyzed for GIT1 expression by immunoblotting using C-GIT1 Ab. To examine the effect of endogenous GIT1 silencing on htt aggregation, HEK293 cells were cotransfected with pHD169Q68 and siRNA-GIT1 and subjected to filtration after 72 h.
Detection of GIT1 in R6/1 Mouse and Human HD Brains
For immunocytochemistry, mice were deeply anaesthetized and perfused through the left cardiac ventricle with 4% paraformaldehyde in 0.1 M phosphate buffer. Brains were removed and postfixed overnight in 4% paraformaldehyde. Sections were processed for immunocytochemistry as described (Gutekunst et al., 1999). EM48 (1:1000) and C-GIT1 (1:100) pAbs were used.
Tissues from 8 human HD and 7 control brains were used in this study. Two HD cases were classified as grade 3 of neuropathological severity, six cases as grade 4. Standard protocols were used (Lunkes et al., 2002) for immunolabeling with 2B4 mAb (1:200) and C-GIT1 pAb (1:50). For Western analysis of total protein lysates from frontal cortex, the C-GIT1 pAb (1:300) was used.
To generate a PPI network for HD, we used a combination of library and matrix yeast two-hybrid screens (
Forty of these baits did not activate the reporters by themselves and were used individually for cotransformation screening of a human fetal brain cDNA library expressing GAL4 activation domain (AD) hybrids as preys. In each screen, 1×106 auxotrophic transformants were tested on selective plates, and 1-50 positive colonies were typically obtained. Restriction analysis and sequencing revealed that about 12% of all positive clones expressed preys with correct in-frame sequences, while 88% of the clones contained plasmids with out-of-frame sequences or sequences from non-protein-encoding regions, which were discarded. 27 preys were identified only once, while the other 11 were found up to four times as independent AD fusions. Plasmids with the longest coding regions were used for subsequent studies. The preys identified by the library two-hybrid screens were tested together with their respective baits for activation of reporter gene expression in cotransformation assays. Only prey/bait combinations that activated the reporter gene expression in two independent cotransformation assays were selected for further two-hybrid studies and in vitro pull-down assays (
For a second round of two-hybrid screens, cDNAs encoding 12 prey proteins were selected from literature verified interactions and from interactions confirmed by in vitro binding experiments (Suppl. Table 2), and subcloned into a DNA binding domain vector. The resulting baits were tested for autoactivation, and 10 were screened against a human fetal brain cDNA library as described above. We identified another 14 PPIs among 5 bait and 13 prey proteins. Nine preys were found once and 4 were discovered multiple times as independent AD fusions. All interactions were confirmed by cotransformation assays.
Finally, an array-mating screen was performed to connect bait and prey proteins identified in the cDNA library transformation screens (
Sequence analysis of the cDNAs revealed ORFs ranging from 82 to 728 amino acids in size (Table 7). In a systematic Blast search, 77 of the 86 bait and prey protein fragments were identical to a SwissProt or TrEMBL protein entry (http://us.expasy.org/sprott/). Nine proteins showed 75-99% identity to their best respective database hit and either contained single amino acid substitutions, variable polyQ lengths or small regions of sequence variation. Uncharacterized proteins were named according to their interaction partners.
This chapter describes the whole yeast two hybrid screening procedure and obtained fundamental data. A full description of our final datasets are shown in tables 6 to 9. Table 6 contains a compilation of all found protein-protein interactions in the Huntington's disease protein network. Some of these interactions are already known and literature-cited. A dataset which describes only new identified interactions will be found in Table 9. Table 7 characterizes all proteins involved in the protein network. Most of these proteins are known from different databases but some proteins are still unknown (Table 8). Nucleic acid and amino acid sequence data for all network-proteins are available from
To chart two-hybrid interactions identified in this study, previously reported, or verified by independent methods, a matrix representation was used (
For htt, 19 different interacting partners from various functional groups were identified, of which HIP1, CA150, SH3GL3 and HYPA had been described previously (Harjes and Wanker, 2003). 6 of the htt partners are involved in transcriptional regulation and DNA maintenance, 4 function in cellular organization and transport and 3 in cellular signaling, supporting the hypothesis that htt is involved in these processes. Moreover, we have detected 6 novel htt interacting proteins of unknown function termed HIP5, HIP11, HIP13, HIP15, HIP16, and CGI-125.
Using 5 different N-terminal htt fragments as baits, the potential htt-binding sites of 13 interaction partners were mapped (
To generate a more comprehensive HD interaction map, we supplemented bur two-hybrid network (red diamonds) with all 38 known direct htt interaction partners (Suppl. Table 4 and
Comparison with literature-cited interactions revealed that more than 89% of the two-hybrid interactions identified are unknown. 30 PPIs have been reported previously using two-hybrid methods, coimmunoprecipitations or affinity chromatography-based techniques; 21 of these were detected in our Y2H assays (
Failure to detect interactions may also result from the high stringency of our two-hybrid assay, which can be attributed to low protein expression levels and the simultaneous use of three reporters. Our system is particularly designed to minimize false positives, which are known to occur frequently in two-hybrid assays (von Mering et al., 2002). To determine the rate of false positives in our system, we directly assessed 54 interactions from the two-hybrid network by in vitro pull-down experiments, mainly focusing on htt and its immediate interaction partners. Proteins were expressed as GST-fusions in E. coli, and their interacting partners as HA-fusions in COS-1 cells. After immobilization of GST-fusion proteins to beads, the potential interaction partners were pulled down from COS-1 cell extracts. Binding was detected by SDS-PAGE and immunoblotting. Using this assay, 35 interactions representing 32 different protein pairs were verified successfully (
Several lines of evidence indicate that aggregation of mutant htt is linked to disease progression and the development of motor symptoms (Davies et al., 1997; Sanchez et al., 2003). Therefore, cellular proteins that influence aggregate formation are potential modulators of disease pathogenesis. In order to identify such proteins, we screened all 19 direct htt interaction partners (
It has been described previously that GIT1 and its homologue p95-APP1 are able to form homo- and heterodimers in vitro and in vivo (Kim et al., 2003; Paris et al., 2003). Therefore, we wondered whether GIT1-CT by itself is able to form SDS-insoluble protein aggregates in mammalian cells. As shown in
Next, we tested whether full-length GIT1 is able to accelerate htt aggregation in mammalian cells. Analysis by filter retardation assay revealed that full-length GIT1 enhances htt aggregation in a dose dependent manner (
As previous studies have shown that the expression of C-terminal GIT1/p95-APP1 fragments induces the formation of large vesicular structures in mammalian cells (Di Cesare et al., 2000; Matafora et al., 2001), we analyzed the effect of GIT1-CT on HD169Q68 aggregation by indirect immunofluorescence microscopy. We found that expression of GIT1-CT alone induced the accumulation of large vesicular structures in the perinuclear region (
Next, we investigated whether endogenous GIT1 promotes htt aggregation in mammalian cells. In order to reduce endogenous GIT1 levels in HEK293 cells, we employed the short-interfering RNA (siRNA) technology (Elbashir et al., 2001). Cells were cotransfected with HD169Q68 and GIT1-specific siRNA, and silencing of endogenous GIT1 was monitored 48 h post transfection by Western blot analysis (
The interaction between GIT1-CT and htt was confirmed by coimmunoprecipitation from COS-1 cells transfected with constructs encoding the first 510 amino acids of htt with 68 glutamines (HD510Q68) and an N-terminally truncated hemagglutinin (HA) tagged HA-GIT1-CT (aa 249-770) protein. 40 h post-transfection, cell extracts were prepared and treated with GIT1 antiserum. HD510Q68 and HA-GIT1-CT were detected in the immunoprecipitates on Western blots with anti-htt antibody 4C8 and anti-HA antibody 12CA5, respectively (
The GIT1-htt interaction was also detected in healthy human brain. Protein extracts prepared from cortex were treated with the anti-htt antibodies CAG53b and HD1, and the precipitate was probed for the presence of GIT1 (
Next, we examined the colocalization of endogenous htt and GIT1 in differentiated PC12 cells by confocal immunofluorescence microscopy. Both proteins were mainly detected in the cytoplasm, but were also present in the neurite-like extensions (
Our findings suggest that GIT1 might also be a component of neuronal inclusions containing htt aggregates in brain of HD patients and transgenic animals (Davies et al., 1997; DiFiglia et al., 1997). To investigate this possibility, we first assessed the distribution of GIT1 in brain slices of R6/1 transgenic mice expressing a human hft exon 1 protein with 150 glutamines (Mangiarini et al., 1996). In wild type mice, GIT1 specific immunoreactivity was diffused in the cytoplasm and nuclei of neurons throughout the brain. In R6/1 brain, however, in addition to a diffuse staining, GIT1 immunoreactivity was also present in large nuclear and cytoplasmic puncta containing htt aggregates (
The presence of GIT1 in protein extracts from HD affected and unaffected cortex was also analyzed by SDS-PAGE and immunoblotting. As shown in
Number | Date | Country | Kind |
---|---|---|---|
03013957.0 | Jun 2003 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP04/06617 | 6/18/2004 | WO | 12/20/2005 |