The present invention relates to methods for screening compounds for treating and/or preventing an Hepatitis C Virus (HCV) infection.
Hepatitis C Virus (HCV) infection is characterized by a high rate of chronicity and concerns 170 millions of individuals worldwide. Chronically-infected patients present liver injury essentially mediated by immune mechanisms and metabolic disorders associated with hepatic steatosis, fibrogenesis and insulin resistance to various extent (1, 2). Long-term infected patients have a high risk of developing cirrhosis and hepatocarcinoma but despite considerable efforts, molecular basis of HCV pathology remains poorly understood. HCV genome is a positive strand RNA of 9.6 kb encoding a polyprotein that is post-translationally processed into structural (CORE, E1, E2 and p7) and non structural (NS2, NS3, NS4A, NS4B, NS5A and NS5B) proteins (3).
Current therapy consists in the association of pegylated interferon (IFN) alpha and ribavirin (1-β-D-ribofuranosyl-1,2,4-triazole-3-carboxamide). However, the outcome of hepatitis C virus (HCV) infection varies among individuals and the likelihood of sustained response to antiviral treatment depends on viral and host characteristics. Naturally occurring variants of HCV are classified into 6 major genotypes. Viral genotype is one of the main factors associated to therapy response. Indeed, sustained virological response (SVR) is achieved in only 45% of the genotype 1 infected patients, whereas up to 80% of the genotypes 2 or 3 infected patients reach a SVR (Feld J J. et al. 2005).
Therefore, there is a need for other treatments of HCV infections, and there is an incentive to focus on the interactions between HCV proteins and host (human) proteins. The rapidly growing knowledge of cellular protein network and now of viral-cellular interactome indeed begins to provide network-based models for disease. In a network approach, a viral infection can be viewed as a perturbation of the cellular interactome. Viral pathogenesis appears as the expression of new constraints on the protein network imposed by the virus when connecting to the cellular interactome. Identification of topological and functional properties that are lost, dysregulated or that emerge in the “infected network” becomes a major challenge for the complex systems analysis of an infection. However, the interactions between human and viral proteins have not yet fully documented.
The present invention relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The farnesoid X receptor (FXR) is a nuclear receptor that is activated by supraphysiological levels of farnesol (Forman et al., Cell, 1995, 81, 687-693). FXR, is also known as NR1H4, retinoid X receptor-interacting protein 14 (RIP14) and bile acid receptor (BAR).
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The present invention also relates to a method for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
The term “hepatitis C virus” or “HCV” is used herein to define a viral species of which pathogenic strains cause hepatitis C, also known as non-A, non-B hepatitis. All the human and HCV genes and proteins are defined in the table 1:
The invention relates to methods for screening compounds for treating and/or preventing an HCV infection comprising the steps of:
a) determining the ability of a candidate compound to inhibit the interaction between a viral HCV protein and a human protein as described above, and
b) selecting the candidate compound that inhibits said interaction between said viral protein and said human protein.
In one embodiment the step b) consists in generating physical values which illustrate or not the ability of said candidate compound to inhibit the interaction between said HCV protein and said human protein and comparing said values with standard physical values obtained in the same assay performed in the absence of the said candidate compound. The “physical values” that are referred to above may be of various kinds depending of the binding assay that is performed, but notably encompass light absorbance values, radioactive signals and intensity value of fluorescence signal. If after the comparison of the physical values with the standard physical values, it is determined that the said candidate compound inhibits the binding between said HCV protein and said human protein, then the candidate is positively selected at step b).
The compounds that inhibit the interaction between the HCV protein and human protein encompass those compounds that bind either to HCV protein or to human protein, provided that the binding of the said compounds of interest then prevents the interaction between HCV protein and human protein.
In one embodiment, any protein of the invention is labelled with a detectable molecule.
According to the invention, said detectable molecule may consist of any compound or substance that is detectable by spectroscopic, photochemical, biochemical, immunochemical or chemical means. For example, useful detectable molecules include radioactive substance (including those comprising 32P, 25S, 3H, or 125I), fluorescent dyes (including 5-bromodesosyrudin, fluorescein, acetylaminofluorene or digoxigenin), fluorescent proteins (including GFPs and YFPs), or detectable proteins or peptides (including biotin, polyhistidine tails or other antigen tags like the HA antigen, the FLAG antigen, the c-myc antigen and the DNP antigen).
According to the invention, the detectable molecule is located at, or bound to, an amino acid residue located outside the said amino acid sequence of interest, in order to minimise or prevent any artefact for the binding between said polypeptides or between the candidate compound and or any of said polypeptides.
In another particular embodiment, the polypeptides of the invention are fused with a GST tag (Glutathione S-transferase). In this embodiment, the GST moiety of the said fusion protein may be used as detectable molecule. In the said fusion protein, the GST may be located either at the N-terminal end or at the C-terminal end. The GST detectable molecule may be detected when it is subsequently brought into contact with an anti-GST antibody, including with a labelled anti-GST antibody. Anti-GST antibodies labelled with various detectable molecules are easily commercially available.
In another particular embodiment, proteins of the invention are fused with a poly-histidine tag. Said poly-histidine tag usually comprises at least four consecutive hisitidine residues and generally at least six consecutive histidine residues. Such a polypeptide tag may also comprise up to 20 consecutive histidine residues. Said poly-histidine tag may be located either at the N-terminal end or at the C-terminal end In this embodiment, the poly-histidine tag may be detected when it is subsequently brought into contact with an anti-poly-histidine antibody, including with a labelled anti-poly-histidine antibody. Anti-poly-histidine antibodies labelled with various detectable molecules are easily commercially available.
In a further embodiment, the proteins of the invention are fused with a protein moiety consisting of either the DNA binding domain or the activator domain of a transcription factor. Said protein moiety domain of transcription may be located either at the N-terminal end or at the C-terminal end. Such a DNA binding domain may consist of the well-known DNA binding domain of LexA protein originating form E. Coli. Moreover said activator domain of a transcription factor may consist of the activator domain of the well-known Gal4 protein originating from yeast.
In one embodiment of the screening method according to the invention, the proteins of the invention comprise a portion of a transcription factor. In said assay, the binding together of the first and second portions generates a functional transcription factor that binds to a specific regulatory DNA sequence, which in turn induces expression of a reporter DNA sequence, said expression being further detected and/or measured. A positive detection of the expression of said reporter DNA sequence means that an active transcription factor is formed, due to the binding together of said first HCV protein and second human protein.
Usually, in a two-hybrid assay, the first and second portion of a transcription factor consist respectively of (i) the DNA binding domain of a transcription factor and (ii) the activator domain of a transcription factor. In some embodiments, the DNA binding domain and the activator domain both originate from the same naturally occurring transcription factor. In some embodiments, the DNA binding domain and the activator domain originate from distinct naturally occurring factors, while, when bound together, these two portions form an active transcription factor. The term “portion” when used herein for transcription factor, encompass complete proteins involved in multi protein transcription factors, as well as specific functional protein domains of a complete transcription factor protein.
Therefore in one embodiment of the invention, step a) of the screening method of the invention comprises the following steps:
The expression level of said DNA reporter sequence that is determined at step (3) above is compared with the expression of said DNA reporter sequence when step (2) is omitted. A lower expression level of said DNA reporter sequence in the presence of the candidate compound means that the said candidate compound effectively inhibits the binding between HCV protein and human protein and that said candidate compound may be positively selected a step b) of the screening method.
Suitable host cells include, without limitation, prokaryotic cells (such as bacteria) and eukaryotic cells (such as yeast cells, mammalian cells, insect cells, plant cells, etc.). However preferred host cell are yeast cells and more preferably a Saccharomyces cerevisiae cell or a Schizosaccharomyces pombe cell.
Similar systems of two-hybrid assays are well know in the art and therefore can be used to perform the screening method according to the invention (see. Fields et al. 1989; Vasavada et al. 1991; Fearon et al. 1992; Dang et al., 1991, Chien et al. 1991, U.S. Pat. No. 5,283,173, U.S. Pat. No. 5,667,973, U.S. Pat. No. 5,468,614, U.S. Pat. No. 5,525,490 and U.S. Pat. No. 5,637,463). For instance, as described in these documents, the Gal4 activator domain can be used for performing the screening method according to the invention. Gal4 consists of two physically discrete modular domains, one acting as the DNA binding domain, the other one functioning as the transcription-activation domain. The yeast expression system described in the foregoing documents takes advantage of this property. The expression of a Gal1-LacZ reporter gene under the control of a Gal4-activated promoter depends on the reconstitution of Gal4 activity via protein-protein interaction. Colonies containing interacting polypeptides are detected with a chromogenic substrate for β-galactosidase. A compete kit (MATCHMAKER™) for identifying protein-protein interactions is commercially available from Clontech. So in one embodiment, a first HCV protein as above defined is fused to the DNA binding domain of Gal4 and the second human protein as above defined is fused to the activation domain of Gal4.
The expression of said detectable marker gene may be assessed by quantifying the amount of the corresponding specific mRNA produced. However, usually the detectable marker gene sequence encodes for detectable protein, so that the expression level of the said detectable marker gene is assessed by quantifying the amount of the corresponding protein produced. Techniques for quantifying the amount of mRNA or protein are well known in the art. For example, the detectable marker gene placed under the control of regulatory sequence may consist of the β-galactosidase as above described.
In another one embodiment, step a) comprises a step of subjecting to a gel migration assay the mixture of the first HCV protein and the second human protein as above defined, with or without the candidate compound to be tested and then measuring the binding of the said polypeptides altogether by performing a detection of the complexes formed between said polypeptides. The gel migration assay can be carried out as known by the one skilled in the art.
Therefore in one embodiment of the invention, step a) of the screening method of the invention comprises the following steps:
The presence or the amount of the complexes formed between the proteins are then compared with the results obtained when the assay is performed in the absence of the candidate compound to be tested. Therefore, when no complexes between the proteins is detected or, alternatively when those complexes are present in a lower amount compared to the amount obtained in the absence of the candidate compound, then the candidate compound may be positively selected at step b) of the screening method.
The detection of the complexes formed between the said two proteins may be easily performed by staining the migration gel with a suitable dye and then determining the protein bands corresponding to the protein analysed since the complexes formed between the first and the second proteins possess a specific apparent molecular weight. Staining of proteins in gels may be done using the standard Coomassie brilliant blue (or PAGE blue), Amido Black, or silver stain reagents of different kinds. Suitable gels are well known in the art such as sodium dodecyl (lauryl) sulfate-polyacrylamide gel. In a general manner, western blotting assays are well known in the art and have been widely described (Rybicki et al., 1982; Towbin et al. 1979; Kurien et al. 2006).
In a particular embodiment, the protein bands corresponding to the proteins submitted to the gel migration assay can be detected by specific antibodies. It may used both antibodies directed against the HCV proteins and antibodies specifically directed against the human proteins.
In another embodiment, the said two proteins are labelled with a detectable antigen as above described. Therefore, the proteins bands can be detected by specific antibodies directed against said detectable antigen. Preferably, the detectable antigen conjugates to the HCV protein is different from the antigen conjugated to the human protein. For instance, the first HCV protein can be fused to a GST detectable antigen and the second human protein can be fused with the HA antigen. Then the protein complexes formed between the two proteins may be quantified and determined with antibodies directed against the GST and HA antigens respectively.
In another embodiment, step a) included the use of an optical biosensor such as described by Edwards et al. (1997) or also by Szabo et al. (1995). This technique allows the detection of interactions between molecules in real time, without the need of labelled molecules. This technique is indeed bases on the surface plasmon resonance (SPR) phenomenon. Briefly, a first protein partner is attached to a surface (such as a carboxymethyl dextran matrix). Then the second protein partner is incubated with the previously immobilised first partner, in the presence or absence of the candidate compound to be tested. Then the binding including the binding level or the absence of binding between said protein partners is detected. For this purpose, a light beam is directed towards the side of the surface area of the substrate that does not contain the sample to be tested and is reflected by said surface. The SPR phenomenon causes a decrease in the intensity of the reflected light with a combination of angle and wavelength. The binding of the first and second protein partner causes a change in the refraction index on the substrate surface, which change is detected as a change in the SPR signal.
In another one embodiment of the invention, the screening method includes the use of affinity chromatography.
Candidate compounds for use in the screening method above can also be selected by any immunoaffinity chromatography technique using any chromatographic substrate onto which (i) the first HCV protein or (ii) the second human protein as above defined, has previously been immobilised, according to techniques well known from the one skilled in the art. Briefly, the HCV protein or the human protein as above defined, may be attached to a column using conventional techniques including chemical coupling to a suitable column matrix such as agarose, Affi Gel®, or other matrices familiar to those of skill in the art. In some embodiment of this method, the affinity column contains chimeric proteins in which the HCV protein or human protein as above defined, is fused to glutathion—s-transferase (GST). Then a candidate compound is brought into contact with the chromatographic substrate of the affinity column previously, simultaneously or subsequently to the other protein among the said first and second protein. The after washing, the chromatography substrate is eluted and the collected elution liquid is analysed by detection and/or quantification of extent, the candidate compound has impaired the binding between (i) first HCV protein and (ii) the second human protein.
In another one embodiment of the screening method according to the invention, the first HCV protein and the second human protein as above defined are labelled with a fluorescent molecule or substrate. Therefore, the potential alteration effect of the candidate compound to be tested on the binding between the first HCV protein and the second human protein as above defined is determined by fluorescence quantification.
For example, the first HCV protein and the second human protein as above defined may be fused with auto-fluorescent polypeptides, as GFP or YFPs as above described. The first HCV protein and the second human protein as above defined may also be labelled with fluorescent molecules that are suitable for performing fluorescence detection and/or quantification for the binding between said proteins using fluorescence energy transfer (FRET) assay. The first HCV protein and the second human protein as above defined may be directly labelled with fluorescent molecules, by covalent chemical linkage with the fluorescent molecule as GFP or YFP. The first HCV protein and the second human protein as above defined may also be indirectly labelled with fluorescent molecules, for example, by non covalent linkage between said polypeptides and said fluorescent molecule. Actually, said first HCV protein and second human protein as above defined may be fused with a receptor or ligand and said fluorescent molecule may be fused with the corresponding ligand or receptor, so that the fluorescent molecule can non-covalently bind to said first HCV protein and second human protein. A suitable receptor/ligand couple may be the biotin/streptavidin paired member or may be selected among an antigen/antibody paired member. For example, a protein according to the invention may be fused to a poly-histidine tail and the fluorescent molecule may be fused with an antibody directed against the poly-histidine tail.
As already specified, step a) of the screening method according to the invention encompasses determination of the ability of the candidate compound to inhibit the interaction between the HCV protein and the human protein as above defined by fluorescence assays using FRET. Thus, in a particular embodiment, the first HCV protein as above defined is labelled with a first fluorophore substance and the second human protein is labelled with a second fluorophore substance. The first fluorophore substance may have a wavelength value that is substantially equal to the excitation wavelength value of the second fluorophore, whereby the bind of said first and second proteins is detected by measuring the fluorescence signal intensity emitted at the emission wavelength of the second fluorophore substance. Alternatively, the second fluorophore substance may also have an emission wavelength value of the first fluorophore, whereby the binding of said and second proteins is detected by measuring the fluorescence signal intensity emitted at the wavelength of the first fluorophore substance.
The fluorophores used may be of various suitable kinds, such as the well-known lanthanide chelates. These chelates have been described as having chemical stability, long-lived fluorescence (greater than 0.1 ms lifetime) after bioconjugation and significant energy-transfer in specificity bioaffinity assay. Document U.S. Pat. No. 5,162,508 discloses bipyridine cryptates. Polycarboxylate chelators with TEKES type photosensitizers (EP0203047A1) and terpyridine type photosensitizers (EP0649020A1) are known. Document WO96/00901 discloses diethylenetriaminepentaacetic acid (DPTA) chelates which used carbostyril as sensitizer. Additional DPT chelates with other sensitizer and other tracer metal are known for diagnostic or imaging uses (e.g., EP0450742A1).
In a preferred embodiment, the fluorescence assay performed at step a) of the screening method consists of a Homogeneous Time Resolved Fluorescence (HTRF) assay, such as described in document WO 00/01663 or U.S. Pat. No. 6,740,756, the entire content of both documents being herein incorporated by reference. HTRF is a TR-FRET based technology that uses the principles of both TRF (time-resolved fluorescence) and FRET. More specifically, the one skilled in the art may use a HTRF assay based on the time-resolved amplified cryptate emission (TRACE) technology as described in Leblanc et al. (2002). The HTRF donor fluorophore is Europium Cryptate, which has the long-lived emissions of lanthanides coupled with the stability of cryptate encapsulation. XL665, a modified allophycocyanin purified from red algae, is the HTRF primary acceptor fluorophore. When these two fluorophores are brought together by a biomolecular interaction, a portion of the energy captured by the Cryptate during excitation is released through fluorescence emission at 620 nm, while the remaining energy is transferred to XL665. This energy is then released by XL665 as specific fluorescence at 665 nm. Light at 665 nm is emitted only through FRET with Europium. Because Europium Cryptate is always present in the assay, light at 620 nm is detected even when the biomolecular interaction does not bring XL665 within close proximity.
Therefore in one embodiment, step a) of the screening method may therefore comprises the steps of:
If at step (5) as above described, the intensity value of the fluorescence signal is lower than the intensity value of the fluorescence signal found when pre assay sample of step (1) is prepared in the absence of the candidate compound to be tested, then the candidate compound may be positively selected at step b) of the screening method.
Antibodies labelled with a European Cryptate or labelled with XL665 can be directed against different antigens of interest including GST, poly-histidine tail, DNP, c-myx, HA antigen and FLAG which include. Such antibodies encompass those which are commercially available from CisBio (Bedfors, Mass., USA), and notably those referred to as 61 GSTKLA or 61 HISKLB respectively.
According to a one embodiment of the invention, the candidate compound of the invention may be selected from a library of compounds previously synthesised, or a library of compounds for which the structure is determined in a database, or from a library of compounds that have been synthesised de novo.
The candidate compound may be selected from the group of (a) proteins or peptides, (b) nucleic acids and (c) organic or chemical compounds. Illustratively, libraries of pre-selected candidate nucleic acids may be obtained by performing the SELEX method as described in documents U.S. Pat. No. 5,475,096 and U.S. Pat. No. 5,270,163. Further illustratively, the candidate compound may be selected from the group of antibodies directed against said HCV protein and said human proteins as above described.
The candidate compounds that have been positively selected at the end of any one of the embodiments of the in vitro screening which has been described previously in the present specification may be subjected to further selection steps in view of further assaying its anti-HCV biological properties.
Proteins of the invention may be produced by any technique known per se in the art, such as without limitation, any chemical, biological, genetic or enzymatic technique, either alone or in combination(s).
Knowing the amino acid sequence of the desired sequence, one skilled in the art can readily produce said proteins, by standard techniques. For instance, they can be synthesized using well-known solid phase method, preferably using a commercially available peptide synthesis apparatus (such as that made by Applied Biosystems, Foster City, Calif.) and following the manufacturer's instructions.
Alternatively, the proteins of the invention can be synthesized by recombinant DNA techniques as is now well-known in the art. For example, these fragments can be obtained as DNA expression products after incorporation of DNA sequences encoding the desired proteins into expression vectors and introduction of such vectors into suitable eukaryotic or prokaryotic hosts that will express the desired proteins, from which they can be later isolated using well-known techniques.
A wide variety of host/expression vector combinations are employed in expressing the nucleic acids encoding for the polypeptides of the present invention. Useful expression vectors that can be used include, for example, segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable vectors include, but are not limited to, derivatives of SV40 and pcDNA and known bacterial plasmids such as col EI, pCR1, pBR322, pMal-C2, pET, pGEX, pMB9 and derivatives thereof, plasmids such as RP4, phage DNAs such as the numerous derivatives of phage I such as NM989, as well as other phage DNA such as M13 and filamentous single stranded phage DNA; yeast plasmids such as the 2 microns plasmid or derivatives of the 2 microns plasmid, as well as centomeric and integrative yeast shuttle vectors; vectors useful in eukaryotic cells such as vectors useful in insect or mammalian cells; vectors derived from combinations of plasmids and phage DNAs, such as plasmids that have been modified to employ phage DNA or the expression control sequences; and the like.
Consequently, mammalian and typically human cells, as well as bacterial, yeast, fungi, insect, nematode and plant cells an used in the present invention and may be transfected by the nucleic acid or recombinant vector as defined herein. Examples of suitable cells include, but are not limited to, VERO cells, HELA cells such as ATCC No. CCL2, CHO cell lines such as ATCC No. CCL61, COS cells such as COS-7 cells and ATCC No. CRL 1650 cells, W138, BHK, HepG2, 3T3 such as ATCC No. CRL6361, A549, PC12, K562 cells, 293T cells, Sf9 cells such as ATCC No. CRL1711 and Cv1 cells such as ATCC No. CCL70. Other suitable cells that can be used in the present invention include, but are not limited to, prokaryotic host cells strains such as Escherichia coli, (e.g., strain DH5-[alpha]), Bacillus subtilis, Salmonella typhimurium, or strains of the genera of Pseudomonas, Streptomyces and Staphylococcus. Further suitable cells that can be used in the present invention include yeast cells such as those of Saccharomyces such as Saccharomyces cerevisiae.
In a further aspect, the invention provides a method for treating an HCV infection or preventing an HCV infection comprising administering a subject in need thereof with a therapeutically effective amount of a compound that inhibits the interaction between the HCV and human proteins as described above. Said compound may be identified by the screening methods of the invention.
In the context of the invention, the term “treating” or “treatment”, as used herein, means reversing, alleviating, inhibiting the progress of, or preventing the disorder or condition to which such term applies, or one or more symptoms of such disorder or condition such as liver injury, metabolic disorders associated with hepatic steatosis, fibrogenesis and insulin resistance.
According to the invention, the term “patient” or “subject in need thereof”, is intended for a human or non-human mammal affected or likely to be affected with an HCV infection.
By a “therapeutically effective amount” of the compound of the invention is meant a sufficient amount of compound to treat an infection with HCV, at a reasonable benefit/risk ratio applicable to any medical treatment. It will be understood, however, that the total daily usage of the compound of the invention and compositions of the present invention will be decided by the attending physician within the scope of sound medical judgment. The specific therapeutically effective dose level for any particular patient will depend upon a variety of factors including the disorder being treated and the severity of the disorder; activity of the specific compound employed; the specific composition employed, the age, body weight, general health, sex and diet of the patient; the time of administration, route of administration, and rate of excretion of the specific compound employed; the duration of the treatment; drugs used in combination or coincidental with the specific compound employed; and like factors well known in the medical arts. For example, it is well within the skill of the art to start doses of the compound at levels lower than those required to achieve the desired therapeutic effect and to gradually increase the dosage until the desired effect is achieved.
In another one embodiment the invention relates to the use of at least one compound that inhibits the interaction between the HCV and human proteins as described above for the manufacture of a medicament intended for treating an HCV infection or preventing an HCV infection.
The compound that inhibits the interaction between the HCV and human proteins as described above may be combined with pharmaceutically acceptable excipients. “Pharmaceutically” or “pharmaceutically acceptable” refers to molecular entities and compositions that do not produce an adverse, allergic or other untoward reaction when administered to a mammal, especially a human, as appropriate. A pharmaceutically acceptable carrier or excipient refers to a non-toxic solid, semi-solid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type.
The invention will further be illustrated in view of the following figures and examples.
A. Nomenclature.
V: viral protein (black node). HHCV: human protein interacting with HCV proteins (red node). HNOT-HCV: human protein not interacting with HCV proteins (blue node). V-HHCV: HCV-human protein interaction (red edge). HHCV-HHCV: interaction between HCV-interacting human proteins (blue edge). H-H: human-human protein interaction (blue edges). V-HHCV represents the interactions between HCV and human proteins (black box). HHCV-HHCV is composed of human proteins interacting with viral proteins (red box). H-H network represents interactions between human proteins (blue box).
B. Number of proteins and interactions in HCV-human interaction network. Number of human proteins interacting with HCV proteins (HHCV) and corresponding number of protein-protein interactions (V-HHCV PPI). Data are given for our yeast two-hybrid screens (IMAP Y2H) and for literature curated interactions (IMAP LCI).
C. Validation of Y2H interactions by co-affinity purification assay. Nine co-AP positive assays are shown, representing: NS5A-SORBS2 (1), NS3-CALCOCO2 (2), NS5A-BIN1 (3), NS5A-MOBK1B (4), NS5A-EFEMP1 (5), NS3-PSMB9 and NS5A-PSMB9 (6), NS5A-PPPIRI3L (7), NS3-RASAL2 (8). After pull-down with GST tagged viral baits (+) or with negative control GST alone (−), cellular preys are identified with anti-Flag antibody. Anti-GST antibody identifies either GST alone or GST-tagged viral baits. Expression of cellular preys in cell lysate is controlled by anti-Flag (bottom panel).
A. Graphical representation of H-H network. Each node represents a protein and each edge an interaction. Red and blue nodes are respectively HHCV and HNOT-HCV.
B. Graphical representation of V-HHCV interaction network. Black node: viral protein; Red node: human protein; Red edge: interaction between human and viral proteins (V-HHCV); Blue edge: interaction between human proteins (HHCV-HHCV). The largest component containing 196 proteins is represented in the middle of the network. Names of cellular proteins belonging to the three other connected components are also represented.
A. Topological analysis of H and HHCV in H-H network. Degree (k), betweenness (b) and shortest path (l) were computed for all human proteins and for HHCV from the IMAP Y2H dataset.
B. Degree and Betweenness Distribution of H and HHCV Proteins in H-H Network. Normalised log degree (left) and log betweenness (right) distribution of H (blue) and
HHCV proteins (red). Solid line represents linear regression fit. Vertical dashed lines give mean degree and betweenness values. Each class is represented with conventional standard error.
C. Degree and betweenness correlation of H in H-H network. Normalised log degree (x axis) and log betweenness (y axis) of H proteins into H-H network. Black solid line represents the linear regression fit (R2=0.56). Horizontal and vertical dashed lines give respectively the mean degree and betweenness values. Low degree (LD) and high degree (HD) classes were defined by using the average degree cut-off.
D. Mean degree and betweenness of HNOT-HCV and HHCV for low and high degree proteins. Top: mean betweenness (log scale) of HNOT-HCV (blue) and HHCV (red) is given for LD and HD classes. Bottom: mean degree of HNOT-HCV (blue) and HHCV (red) is given for LD and HD classes. The conventional standard error threshold and the U test p-value are represented (***: p-value<10−10, NS: not significant).
A. Graphical representation of IJT network. Proteins (nodes) members of insulin (blue), Jak/STAT (red) and TGFβ (green) pathways according to KEGG annotation, and their interactions (edges) are shown (proteins interacting with HCV proteins are named). Proteins shared by two pathways are shown in secondary colours (pink, yellow and cyan). Grey and black nodes are neighbours that connect the KEGG pathways and that interact with HCV proteins (grey: protein from the IMAP Y2H dataset, black: protein from IMAP LCI dataset). Neighbours interacting with HCV but not connecting the KEGG pathways are not represented. Discussed protein examples PLSCR1 and YY1 are in box. Interactive visualization tools are provided in supplementary files (Network visualization).
B and C. Relative contribution of each viral protein in V-HHCV and IJT network. Percentage of the three most interacting viral proteins is given. 51.3% of CORE interactions are concentrated in the IJT network.
A. Schematic representation of focal adhesion adapted from KEGG (ID: Hs04510). HHCV are represented by orange boxes and HNOT-HCV by blue boxes.
B and C. Functional validation of focal adhesion perturbation by NS3 and NS5A.
96-well plates were coated with fibronectin (B) or poly-L-lysine (C) at various concentrations. 293T cells expressing NS2, NS3, NS3/4A or NS5A were plated on the matrix for 30 min. Adherent cells were stained with crystal violet. FA50 is the matrix concentration necessary for half maximum adhesion. Values represent mean of three independent experiments with their standard deviation.
Schematic representation of the HCV genome and definition of ORFs designed for Y2H screens. The HCV positive strand genome (purple) encodes a polyprotein (orange) which is co-translationally processed in 10 proteins. Red: full length protein; blue: domain; yellow+green: NS4A+NS3 chimeric fusion; pink: NS5A membrane anchor. Genome and polyprotein coordinates of each construction are given in the table.
log degree (left) and log betweenness (right) distributions of H proteins (blue), HHCV (red) and HEBV (green). Solid lines represent linear regression fits. Vertical dashed lines give mean degree and betweenness values.
HCV proteins are referenced according to their NCBI mature peptide product name (column 1). Human proteins are referenced with their cognate NCBI gene name and gene ID (columns 2 and 3). The number of IST for IMAP Y2H (IMAP1 and IMAP2, according to the method of screening) is given in columns 4 and 5. IMAP LCI (Literature Curated Interactions) from text-mining and BIND database associated PubMed IDs are given in columns 6 and 7. Co-affinity purification (CoAP) or Y2H pairwise matrices validations are indicated in columns 8 and 9 (+: IMAP validation, −: not validated, NA: non assayable due to default of protein expression or to cellular protein directly interacting with GST).
Human proteins are referenced with their cognate NCBI gene name (column 1). HCV proteins are referenced according to their NCBI mature peptide product name (column 2). Origin of the dataset (IMAP Y2H, IMAP LCI, column 3).
The size of the largest component and the number of connected components of V-HHCV sub-network were computed (IMAP dataset, column 2). In order to test the significance of observed values, we computed the mean of the largest component size and the mean number of connected components obtained after 1000 simulations of random sub-networks (IMAP Sim column 3). The differences between the observed and the simulated values were highly significant (***: p-value<10−10).
Full interactome (A), high-confidence interactome (B, containing only PPIs with at least two PMIDs or validated by two different methods). The number of proteins and PPIs that can be integrated into the human interactome are given for HHCV and HEBV. Percentage of HHCV and HEBV that are present in the human interactome are given according to the origin of the dataset. Average degree (k), betweenness (b) and shortest path (l) were computed for HHCV and HEBV in both full and high-confidence interactomes (25).
Over-represented KEGG pathways were identified as significant after multiple testing adjustments (adjusted p-value<5.10−2) and are listed by viral protein. For each pathway, number of HHCV is given, with the relative contribution of IMAP Y2H dataset between brackets. Black boxes highlight discussed pathways.
A. HHCV enrichment in IJT network for each viral protein. Number of HHCV is given in V-HHCV and IJT networks. Enrichment of HHCV in IJT network was tested with exact Fisher test for each viral protein. Associated odd ratios and p-values are given.
B. HHCV enrichment in Jak/STAT, TGFβ and Insulin pathways for each viral protein. Number of HHCV is given in V-HHCV network and Jak/STAT, TGFβ and Insulin pathways (as defined in KEGG database). Enrichment of HHCV in Jak/STAT, TGFβ and Insulin pathways was tested with exact Fisher test for each viral protein. Associated odd ratios and p-values are given.
C. HHCV enrichment in Jak/STAT, TGFβ and Insulin inter-pathways for each viral protein. Inter-pathways are defined as the HHCV connecting two or three KEGG pathways. Number of HHCV is given in V-HHCV network and Jak/STAT, TGFβ and Insulin inter-pathways. Enrichment of HHCV in Jak/STAT, TGFβ and Insulin inter-pathways was tested with exact Fisher test for each viral protein. Associated odd ratios and p-values are given.
A proteome-wide mapping of interactions between hepatitis C virus and human proteins was performed to provide a comprehensive view of the cellular infection. A total of 314 protein-protein interactions between HCV and human proteins was identified by yeast two-hybrid and 170 by literature mining. Integration of this dataset into a reconstructed human interactome showed that cellular proteins interacting with HCV are enriched in highly central and interconnected proteins. A global analysis based on functional annotation highlighted the enrichment of cellular pathways targeted by HCV. A network of proteins associated with frequent clinical disorders of chronically infected patients was constructed by connecting the insulin, Jak/STAT and TGFβ pathways with cellular proteins targeted by HCV. CORE protein appeared as a major perturbator of this network. Focal adhesion was identified as a new function affected by HCV, mainly by NS3 and NS5A proteins.
HCV genome is a positive strand RNA molecule, encoding one polyprotein which is cleaved by cellular and viral proteases in structural proteins (CORE, E1, E2 and p7), and non-structural proteins (NS2, NS3, NS4A, NS4B, NS5A and NS5B) (1). All proteins were cloned in full length and domains except for NS4B for which no domain has been designed, using the euHCVdb facilities (http://www.euhcvdb.ibcp.fr (2)) (
HCV ORFs were transferred from pDONR223 into bait vector (pPC97) to be expressed as Gal4-DB fusions in yeast. Two different screening methods were used (IMAP1 and IMAP2). Both for IMAP1 and IMAP2 strategies and because bait constructs sometimes self-transactivate reporter genes, SD-L-H culture medium were supplemented with 3-aminotriazole (3-AT). Appropriate concentrations of this drug were determined by growing bait strains on SD-L-H medium supplemented with increasing concentrations of 3-AT. Self-transactivation by NS5A without its membrane anchor was too high to be titrated with 3-AT and was of further tested. For IMAP1, bait vectors were introduced in MAV203 yeast strain and both human spleen and foetal brain AD-cDNA libraries (Invitrogen) were screened by transformation as described (7). All primary positive clones (selected on SD-W-L-H+3-AT) were tested by further phenotypic assay using two additional reporter genes: LacZ (X-Gal colorimetric assay) and URA3 (growth assay on 5-FOA supplemented medium). Positive clones that displayed at least 2 out of 3 positive phenotypes were retested into fresh yeasts. Clones that did not retest were discarded. AD-cDNA were PCR-amplified and inserts were sequenced to identify interactors. IMAP2 screens were performed by yeast mating, using AH109 and Y187 yeast strains (Clontech (8)). Bait vectors were transformed into AH109 (bait strain) and human spleen and foetal brain AD-cDNA libraries (Invitrogen) were transformed into Y187 (prey strain). Single bait strains were mated with prey strains then diploids were plated on SD-W-L-H+3-AT medium. Positive clones were maintained onto this selective medium for 15 days to eliminate any contaminant AD-cDNA plasmid (9). AD-cDNAs were PCR amplified and inserts were sequenced.
We have developed a bioinformatic pipeline that assigns each IST to its native human genome transcript. First, ISTs were filtered by using PHRED (10, 11) at a quality score superior to the conventional 20 threshold value (less than 1% sequence errors). Gal4 motif was searched (last 87 bases of GAL4-AD), sequences downstream of this motif were translated into peptides and aligned using BLASTP against the REFSEQ (http://www.ncbi.nlm.nih.gov/RefSeq/) human protein sequence database (release 04/2007). Low-confidence alignments (E value>10−10, identity <80%) and premature STOP codon containing sequences were eliminated. Only in-frame proteins and high quality sequences were further considered
Only physical and direct binary protein-protein interactions were retrieved from BIND (12), BioGRID (13), DIP (14), GeneRIF (15), HPRD (16), IntAct (17), MINT (18), and Reactome (19). NCBI official gene names were used to unify protein ACC, protein ID, gene name, symbol or alias defined in different genome reference databases (i.e ENSEMBL, UNIPROT, NCBI, INTACT, HPRD . . . ) and to eliminate interaction redundancy due to the existence of different protein isoforms for a single gene. Thus, the gene name was used in the text to identify the proteins. Finally, only non-redundant protein-protein interactions were retained for building the human interactome dataset, i.e if A interacts with B and B with A, only A with B interaction was selected.
Literature curated interactions (LCI), describing binary interactions between cellular and HCV proteins, were extracted from BIND database and PubMed (publications before August 2007) by using an automatic text-mining pipeline completed by expert curation process. For the text-mining approach, all abstracts related to “HCV” and “protein interactions” keywords were retrieved, subjected to a sentencizer (sentence partition) and a part-of-speech tagger for gene name (based on NCBI gene name and aliases) and interaction verbs (interact, bind, attach . . . ) (20). Sentences presenting co-occurrences of at least one human gene name, one viral gene name and one interaction term, were prioritized to curation by human expert.
A random pool of 59 IMAP Y2H interactions was chosen for CoAP assays. Cellular ORFs (interacting domains found in Y2H screens) were cloned by recombinational cloning from a pool of human cDNA library or the MGC cDNA plasmids using KOD polymerase (Toyobo) into pDONR207 (Invitrogen). After validation by sequencing, these ORFs were transferred to pCi-neo-3xFLAG gateway-converted, and HCV ORFs were transferred into pDEST27 (GST fusion in N-term). HEK-293T cells were then co-transfected (JetPei, Polyplus) by each pair of plasmid encoding interacting proteins. Controls are GST alone against 3xFLAG-tagged preys. Two days after transfection, cells were harvested and lysed (0.5% NP-40, 20 mM Tris-HCl (pH 8.0), 180 mM NaCl, 1 mM EDTA, and complete protease inhibitor cocktail). Cell lysates were cleared by centrifugation for 20 min at 13,000 rpm at 4° C. and soluble protein complexes were purified using Glutathione Sepharose 4B beads (GE Healthcare). Beads were then washed extensively four times with lysis buffer and proteins were separated on SDS-PAGE and transferred to nitrocellulose membrane. GST-tagged viral proteins and 3xFLAG-tagged cellular proteins were detected using standard immunoblotting techniques using anti-GST (Covance) and anti-FLAG M2 (Sigma) monoclonal antibodies (21).
Large Graph Layout (http://bioinformatics.icmb.utexas.edu/lgl/) was applied to visualize the H-H network in
The R (http://www.r-project.org/) statistical environment was used to perform statistical analysis and the igraph R package (http://cneurocvs.rmki.kfki.hu/igraph/) to compute network connected components, centrality (degree, betweenness) and shortest path measures.
In an undirected network, a connected component is a maximal connected sub-network. Two nodes are in the same connected component if and only if there exist a path between them. We also included in connected components proteins that are not connected to any other protein, according to igraph R package.
The degree of a node (k) is the number of edges incident to the node. The mean degree of human proteins was computed and was compared to the mean degree of all HHCV.
The shortest path problem is the finding of a path between two nodes such that the sum of the weights of its constituent edges is minimized. The shortest paths (l, also called geodesics) are calculated here by using breath-first search in the graph. Edge weights are not used here, i.e every edge weight is one. The mean shortest path between any two pairs of human proteins was computed and was compared to the mean shortest path between any two pairs HHCV.
The node betweenness (b) are roughly defined by the number of shortest paths going through a node. The mean betweenness of all human proteins was computed and compared to the mean shortest path between any two pairs of HHCV.
The Wilcoxon Mann-Withney rank sum test (the U test) was chosen to statistically challenge observed differences. The U test is a non-parametric alternative to the paired Student's t-test for the case of two related samples or repeated measurements on a single sample. The generalized linear model and ANOVA analysis was used to respectively model and test the separate and additive effects of degree and betweenness on the probability that HCV proteins interact with human proteins.
Cellular pathway data were retrieved from KEGG (22), the Kyoto Encyclopedia of Genes and Genomes (http://www.genome.jp/kegg/) and were used to annotate NCBI gene functions. For each viral-host protein interactors, the enrichment of specific KEGG pathway was tested by using an exact Fisher test (pvalue<5 10−2) followed by the Benjamini and Hochberg multiple test correction (23) in order to control false discovery rate.
Serial dilutions (from 10 to 0.04 μg/ml) of fibronectin or poly-L-lysine in PBS were coated on 96-well microtiter plates overnight at 4° C. Non-specific binding sites were saturated at room temperature with PBS 1% BSA for 1 h. HEK 293T cells were transfected with pCineo3xFlag NS2, NS3, NS3/4A or NS5A (JetPei, Polyplus), collected 2 days later with 2 mM EDTA in PBS, spread in triplicate at 1.105 cell/well in serum-free medium with 0.1% BSA, and incubated for 30 min at 37° C. Non-adherent cells were washed away and adherent cells were fixed with 3.7% paraformaldehyde. Cells were stained with 0.5% crystal violet in 20% methanol for 20 min at room temperature and washed 5 times in H2O. Staining was extracted 50% ethanol in 50 mM sodium citrate, pH4.5, and the absorbance was read at 590 nm on an ELISA reader (MRX microplate reader, Dynatech Laboratories). Values were normalized to 100% adhesion at 10 μg/ml. The percentage of adhesion was determined for each cell type at each matrix concentration. 50% of maximum adhesions (FA50) were calculated from the curves (Adapted from Miao et al (24)).
A comprehensive interactome map between HCV and cellular proteins was generated by Y2H screens. Twenty seven constructs encoding full-length HCV mature proteins or discrete domains were cloned using a recombination-based cloning system (10) (
A human PPI network (H-H network,
To assess how HCV proteins interplay with the cellular protein network, we next focused on the centrality measures of HHCV proteins integrated into the H-H interactome. Local (degree) and global (shortest path and betweenness) centrality measures were calculated. Briefly, the degree (k) of a protein in a network corresponds to its number of direct partners and is therefore a measure of local centrality. Betweenness (b) is a global measure of centrality as it measures the number of shortest paths (the minimum distance between two proteins in the network, l) that pass through a given protein. The average degree, betweenness and shortest path of the H-H network are respectively 9.3, 1.6 10−4 and 4.04, which is in good agreement with previous reports (18) (
In order to determine which of the degree or the betweenness most influences the probability of interaction between viral and cellular proteins, we used a generalized linear model to test the separate and additive effects of both measures (Supplementary Methods). This analysis revealed that betweenness better explain the probability of interaction between viral and human proteins (ANOVA p<10−3).
In order to better understand biological functions targeted by HCV, we next tested the enrichment of specific pathways for all interactors of a given viral protein. This was done by analyzing the HHCV proteins in regards to the KEGG functional annotation pathways (Table S4, Supplementary Methods). Although this approach is not totally unbiased because functions have not yet been attributed to all proteins, it remains a powerful way of incorporating conventional biology in system-level datasets. This analysis showed an enrichment for three pathways associated with HCV clinical syndromes (insulin, TGFβ and Jak/STAT pathways) and identified focal adhesion as a novel pathway affected by HCV.
Chronic infection by HCV is associated with an increased risk for metabolic disorders with development of steatosis. Insulin resistance is a common feature of this process. It also contributes to liver fibrosis and is a predictor of a poor response to interferon-α (IFN-α) anti-viral therapy (25, 26). Conversely, IFN-α can prevent fibrosis progression (27). TGFβ plays a crucial role in maintaining cell growth and differentiation in the liver. It is a strong profibrogenic cytokine whose production is frequently enhanced during infection. Impaired TGFβ response is also observed during HCV infection (28). Although insulin, TGFβ and Jak/STAT pathways have been suspected to be involved in these clinical features (29), their closely related perturbation during HCV infection remain largely unexplained. We thus used a network approach to identify cellular proteins targeted by HCV and localized at the interface of these pathways. The resulting interaction map was constructed to form the IJT network (Insulin-Jak/STAT-TGFβ,
Another issue that became apparent in the IJT network is that CORE protein mediates proportionally more interactions than the other HCV proteins (
Focal adhesion was over-represented as a new function targeted by NS3 and NS5A proteins, with a major contribution of data generated by IMAP Y2H screens (Table S4). Integrin-linked focal adhesion complexes control cell adhesion to extracellular matrix (ECM) and association of these complexes with actin-cytoskeleton plays a major role in cell migration. Upon binding to the ECM, both α and β integrin subunits recruit proteins establishing a physical link between the actin-cytoskeleton and signal transduction pathways. When deregulated, this functional process can lead to perturbation of cell mobility, detachment from the ECM and tumour initiation and progression.
Hepatitis C virus (HCV) infected patients with high serum levels of bile acids (BAs) usually fail to respond to antiviral therapy. The role of BAs on HCV RNA replication was thus assessed. BAs, especially chenodeoxycholate and deoxycholate, up-regulated HCV RNA replication by more than tenfold. Only free but not conjugated BAs were active, suggesting that their effect was mediated by a nuclear receptor. Only farnesoid X receptor (FXR) ligands stimulated HCV replication while FXR silencing and FXR antagonism by guggulsterone blocked the up-regulation induced by BAs. Furthermore, guggulsterone alone inhibited basal level of HCV replication by tenfold. Modulation of HCV replication by FXR ligands occurred in the same proportion in presence or absence of type I interferon, suggesting a mechanism of action independent of this control of viral replication. Thus, exposure to BAs increases HCV replication by a novel mechanism involving activation of the nuclear receptor FXR.
This raises the possibility that the virus could directly interfere with FXR to favour its replication. To complete the proteome-wide screening of HCV proteins cellular interactors, we performed a specific screening focused on FXR to identify all potential interactions between this receptor and any viral proteins.
Escherichia coli competent bacteria (OneShot® Top10, Invitrogen) (F-mcrA Δ (mrr-hsdRMS-mcrBC) φ80lacZΔM15 ΔlacX74 recA1 araD139 Δ (ara-leu), 7697 galU galK rpsL (StrR) endA1 nupG).
We used the following strains: AH109 et Y187 (Clontech) Saccharomyces cerevisiae with the following genotype:
AH109: MATa, trp1-901, leu2-3, 112, ura3-52, his3-200, gal4Δ, gal80Δ, LYS2::GAL1UAS-GAL1TATA-HIS3, GAL2UAS-GAL2TATA-ADE2, ura3::MEL1UAS-MEL1TATA-lacZ.
Y187: MATa, ura3-52, his3-200, ade2-101, trp1-901, leu2-3, 112, met-, gal4Δ, gal80A, MEL1, URA3::GAL1UAS-GAL1TATA-lacZ.
Hek-293T: human embryonic kidney cells expressing large T antigen.
Identification of HCV Proteins Interacting with FXR Using Yeast Two Hybrid Matrix
Our previous work indicated that HCV replication can be under the control of FXR activity. In order to identify viral components that could interfere with FXR activity, we search for interaction between viral proteins and FXR. This was done by pairwise interaction screening with the yeast-two-hybrid method. All 10 viral proteins have been tested and data are summarized in the following table:
The data have been confirmed by GST-pull down in mammal cells. Viral proteins in fusion with GST were co-transfected with FXR tagged with 3×Flag in Hek-293T cells. 48 h latter, cells were lysed, precipitation was performed with glutathion sepharose and subjected to electrophoresis and western blot with anti-Flag (to reveal FXR) or anti-GST antibodies coupled to peroxidase. Co-precipitations of FXR with NS3 and NS5A were positive confirming direct interaction of these viral proteins with FXR. Co-precipitations of FXR with all other viral proteins were negative confirming that these proteins do not interact with FXR.
Throughout this application, various references describe the state of the art to which this invention pertains. The disclosures of these references are hereby incorporated by reference into the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
PCT/IB08/53132 | Apr 2008 | IB | international |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP09/54536 | 4/16/2009 | WO | 00 | 12/9/2010 |