The references cited in the present application are not admitted to be prior art to the claimed invention.
After the Human Immunodeficiency Virus (HIV) integrates into the host genome, a gap remains between the integrated viral DNA and the host chromosome. Because HIV integrase is incapable of repairing the gap, the damage has long been assumed to be repaired by host DNA repair factors. Although it has been possible to model the enzymatic steps involved in repairing HIV integration-induced lesions in DNA in vitro (e.g., Yoder & Bushman, 2000), host factors that might be necessary for HIV transduction remain to be conclusively identified. A number of DNA repair-associated proteins have been linked to retroviral transduction as it is known that host DNA repair pathways are required to complete the process of retroviral integration (Kilzer, et al., 2003; Daniel, et al., 2004; Parissi, et al., 2003; Mulder et al., 2002). This indicates that such host cellular factors may be potential targets for antiviral therapy.
Past drug discovery programs for HIV have largely targeted viral enzymes, including reverse transcriptase, protease, and integrase. Compounds targeting these enzymes have become the standard treatment for HIV infection. Although anti-retroviral therapy successfully suppresses viral replication, the existence of latent viral reservoirs coupled with the poor fidelity of HIV reverse transcriptase often leads to the emergence of resistance. Because the pharmacological targeting of required host factors may slow or prevent viral resistance, the identification of novel host factors as targets for HIV therapy represents a significant advance for the field of HIV therapeutics.
Thus, there is an unmet need to identify novel targets for the treatment of HIV infection, which might include host cellular factors.
A set of genes have been identified by siRNA screening as being essential for HIV infection. Knockdown of expression of these genes using siRNA decreases HIV transduction of P4/R5 HeLa cells in a single cycle HIV infectivity assay. The identified genes and proteins encoded thereby provide targets for inhibiting HIV infection and for evaluating the ability of compounds to inhibit HIV infection, which might include both compounds targeting the nucleic acids encoding the proteins identified and those targeting the proteins themselves.
Thus, in one embodiment of the present invention there is described a method of identifying a host cell factor involved in HIV infection using a siRNA library. A “library” contains a collection of different siRNAs screened as part of an experiment. The experimental results are obtained at about the same time or over a limited time period. In different embodiments, the limited time period is within about a week or within about a day. Preferably, the members of the library are tested at the same time. Reference to the library comprising a certain number of siRNA different host cell factors indicates that at least the indicated number of different siRNA are used. siRNA methods and compositions are set forth in references such as WO2005042708 and WO2005018534, the disclosures of which are incorporated herein by reference.
The method of identifying a host cell factor involved in HIV infection comprises the step of measuring the ability of a siRNA library targeting different host cell factors to inhibit HIV infection, wherein measuring the ability of a siRNA library to inhibit HIV infection further comprises: transfecting human cells with the siRNA library targeting different cell factors; infecting the transfected cells with HIV; and assaying for viral infection to determine whether siRNA-mediated downregulation of host cell factors inhibits HIV infection. More particularly, the siRNA library may comprise at least 244 different siRNA's targeting a different host cellular protein not previously associated with HIV infection. Additionally, the host cellular proteins may be one or more components of a DNA repair pathway.
In another embodiment of the invention there is provided isolated host cellular proteins involved in HIV infection selected from the group consisting of: post-meiotic segregation increased 2-like 1 (PMS2L1); excision repair cross-complementing rodent repair deficiency, complementation group 3 (ERCC3); DNA polymerase iota (POLI); transition protein 1 (TNP1); DNA polymerase lambda (POLL); centromere protein F (CENPF); MutS homolog 6 (MSH6); Nei-like 2 (NEIL2); B-cell translocation gene (BTG) family, member 2 (BTG2); damage-specific DNA binding protein 2 (DDB2); DNA cross-link repair 1B (DCLRE1b); regulator of telomere elongation helicase 1 (RTEL1); RAD51 homolog C (RAD51C); DNA polymerase epsilon (POLE); structural maintenance of chromosomes 6-like 1 (SMC6L1); AP endonuclease class 1 (APEX1); TATA box binding protein-associated factor, RNA polymerase II, (TAF2); 8-oxoguanine DNA glycosylase (OGG1); RuvB-like 2 (RUVBL2); RecQ protein-like 4 (RECQL4); topoisomerase (DNA) II alpha (TOP2A); Excision repair cross-complementing rodent repair deficiency, complementation group 3 (ERCC3); Replication protein A2 (RPA2); High mobility group (nonhistone chromosomal) protein 4-like (HMG4L); Retinoblastoma binding protein 8 (RBBP8); MutL homolog 1 (MLH1); MUS81 endonuclease homolog (MUS81); MutS homolog 4 (MSH4); Insulin-like growth factor 1 receptor (IGF1R); RAD23 homolog B (RAD23B); Ankyrin repeat domain 17 (ANKRD17); Nth endonuclease III-like 1 (NTHL1); DNA polymerase eta (POLH); WD repeat domain 33 (WDR33); DNA cross-link repair 1A (DCLRE1A), and Postmeiotic segregation increased 1 (PMS1), or a protein substantially similar to the target protein and homologs.
“Substantially similar” is defined as a sequence identity of at least 95% to the target protein. Nucleic acid and protein substantially similar to a particular identified sequence provide sequences with a small number of changes to the particular identified sequence. Substantially similar sequences include sequences containing one or more naturally occurring polymorphisms or changes that are artificially produced. A substantially similar protein sequence is at least 95% identical to a reference sequence. The substantially similar protein sequence should also not have significantly less activity than the reference sequence. In different embodiments, the substantially similar protein sequence differs from the reference sequence by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 amino acid alterations. Each amino acid alteration is independently an addition, deletion or substitution. Preferred substantially similar sequences are naturally occurring variants. A substantially similar nucleic acid is at least 95% identical to a reference sequence. The substantially similar nucleic acid sequence should encode a protein that does not have significantly less activity than the protein encoded by the reference sequence. In different embodiments, the substantially similar nucleic acid sequence differs from the reference sequence by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotide alterations. Each nucleic acid alteration is independently an addition, deletion or substitution. Preferred substantially similar sequences are naturally occurring variants.
In yet another embodiment of the invention, there is provided an assay for identifying a compound as an HIV inhibitor comprising the steps of: identifying a compound that downregulates or otherwise inhibits the activity or expression of a target protein that is a component of a DNA repair pathway of a human cell; and determining the ability of said compound to inhibit HIV. Said assay may be more particularly characterized in that the target protein is either or a protein having a sequence identity with one or more members selected from the group consisting of: PMS2L1; ERCC3; POLI; TNP1; POLL; CENPF; MSH6; NEIL2; BTG2; DDB2; DCLRE1b; RTEL1; RAD51C; POLE; SMC6L1; APEX1; TAF2; OGG1; RUVBL2; RECQL4; TOP2A; RPA2; HMG4L; RBBP8; MLH1; MUS81; MSH4; IGF1R; RAD23B; ANKRD17; NTHL1; POLH; WDR33; DCLRE1A, and PMS1 and homologs.
In another embodiment of the invention there is provided a method of identifying a biological pathway involved in HIV infection comprising the steps of: identifying genes targeted by siRNA analysis of host cellular genes whose downregulation inhibits HIV infection; inputting those genes into a database; and identifying what pathway they map to.
In yet another embodiment of the present invention there is provided a method of screening for a compound which down-regulates the expression of one or more components of a DNA repair pathway of a human cell, thereby decreasing HIV infection, comprising the steps of: contacting the one or more components of a DNA repair pathway of a human cell with a noncircularized HIV DNA in the presence of a test compound; contacting the or more components of a DNA repair pathway of a human cell with a noncircularized HIV DNA in the absence of a test compound; and determining the effect of the test compound on HIV integration as measured by the amount of circularization. More particularly, the one or more components of a DNA repair pathway of a human cell may be a nucleic acid molecule encoding a polypeptide selected from the group consisting of: PMS2L1; ERCC3; POLI; TNP1; POLL; CENPF; MSH6; NEIL2; BTG2; DDB2; DCLRE1b; RTEL1; RAD51C; POLE; SMC6L1; APEX1; TAF2; OGG1; RUVBL2; RECQL4; TOP2A; RPA2; HMG4L; RBBP8; MLH1; MUS81; MSH4; IGF1R; RAD23B; ANKRD17; NTHL1; POLH; WDR33; DCLRE1A, and PMS1 and homologs thereof.
Novel host cell protein targets for inhibiting HIV infection have been identified. Such targets may prove useful not only for inhibiting HIV infection, but also for assessing the ability of compounds to inhibit HIV infection.
A library of siRNAs targeting genes involved in DNA repair was transfected into HeLa P4/R5 cells. P4/R5 is a cell line which stably expresses exogenous CD4, CCR5 and LTR-β-GAL. Twenty-four hours following siRNA transfection, the cells were infected with HIV. Forty-eight hours after infection, the cells were assayed for expression of the β-GAL reporter gene, as an indication that the virus had successfully integrated into the host genome and was producing sufficient quantities of the viral Tat protein to induce expression through the LTR (Joyce et al., 2002). siRNAs that blocked or reduced the expression of β-GAL were then examined in more detail.
Cells were transfected with a pool of three siRNAs targeting each gene at 50 nM final concentration. siRNAs targeting 242 genes with Gene Ontology annotations indicating an involvement in DNA repair were assayed in duplicate in both the presence and absence of an HIV integrase inhibitor. Transfections of siRNAs targeting cyclin T1 and CDK9 were included as positive controls for each transfection plate. Mock transfections and transfections of a non-silencing siRNA directed against luciferase were included as negative controls for each transfection plate. Two days after infection, the cells were lysed and β-GAL activity was assayed. A “hit” was defined as any siRNA pool that decreased β-galactosidase activity by more than 40% relative to controls, or that showed enhanced effects on HIV infection in the presence of EC50 concentrations of an integrase inhibitor. All of these siRNA pools were chosen for further analysis. siRNAs from each original pool of three siRNAs were assayed individually for their effect on HIV infection. If two out of the three siRNAs in the pool were effective inhibitors, the hit was considered to be confirmed.
Inhibiting HIV infection has implications for both research and for antiviral therapy. Research applications of the present invention include providing methods to screen for compounds which inhibit HIV infection. Therapeutic applications include using identified compounds to treat or inhibit HIV infection.
Examples are provided below further illustrating different features of the present invention and illustrate useful embodiments for practicing the invention. Theses embodiments should be viewed as exemplary of the present invention rather than in any way limiting its scope.
The procedure was performed as follows:
Day 1: Plate HeLa (P4/R5) cells at 2000 cells per well in 4×96-well plates.
Day 2: Transfect HeLa (P4/R5) cells with siRNA pools as follows:
Incubate the plate at room temperature for 15 minutes.
Data was analyzed in the following manner. Readings for each plate were normalized to the reading for the luciferase negative control and expressed as “percent of Luciferase Control”. Hits were considered to be those siRNA pools that suppressed beta-galactosidase activity by 40% or more, or those that showed 30% or greater inhibition of beta-galactosidase activity in the presence of IC50 levels of integrase inhibitor compared to the absence of compound treatment.
The total number of inhibitory hits from the primary screen was 41, and included the following genes: SF3B3, PMS2L1, POLL, TNP1, POLL, CENPF, MSH6, NEIL2, SUPT3H, BTG2, DDB2, DCLRE1B, RAD51C, POLE, SMC6L1, APEX1, TAF2, OGG1, POLR2G, RUVBL2, RECQL4, TOP2A, ERCC3, RPA2, RRM2, HMG4L, RBBP8, MLH1, MUS81, MSH4, IGF1R, RAD23B, ANKRD17, NTHL1, POLH, WDR33, and DCLRE1A. An additional three genes were of interest because siRNAs targeting these genes appeared to enhance HIV infectivity. These genes were also considered to be hits: PMS1, HMGB2, XAB2.
Genes targeted by siRNAs that hit in the assay were evaluated further with respect to tissue distribution and which specific DNA repair pathways they represented. In addition, the siRNA hits were electronically counterscreened to assess whether they were toxic to HeLa cells in a viability-output screen. Additionally, the efficacy of the siRNA used in knocking down RNA or protein levels of the targeted gene was confirmed for ERCC3, MUS81, POL1, and RUVBL2 by testing mRNA levels with and without siRNA treatment, and for APEX1 and LIG3 by testing protein levels with and without siRNA treatment.
A siRNA screen was run in HeLa cells in which the cells were transfected with siRNAs and cell viability was assessed by Alamar Blue staining 72 h post-transfection. Thus, siRNAs that were toxic to HeLa cells in this assay may appear to hit in the infectivity screen simply due to cytotoxicity. For this reason, the siRNA hits from the HIV infection assay were examined for cytotoxic effects in the HeLa cytotoxicity assay.
Analysis of the HeLa cytotoxicity data led to the elimination of six hits. The remaining hits of interest are: PMS2L1, RAD52, POLI, TNP1, POLL, CENPF, MSH6, NEIL2, BTG2, DDB2, DCLRE1B, C20orf41 (RTEL), ADPRT (PARP1), RAD51C, POLE, SMC6L1, APEX1, TAF2, OGG1, RUVBL2, RECQL4, TOP2A, ERCC3, RPA2, HMG4L, RBBP8, MLH1, MUS81, MSH4, IGF1R, XRCC4, RAD23B, ANKRD17, NTHL1, POLH, WDR33, DCLRE1A, and PMS1.
Genes targeted by siRNAs that hit in the HIV infection screen were analyzed using software that searches a database of gene ontology definitions and reports on gene ontology functions and pathways that are held in common by the query set. Excluding “DNA repair” as a definition, since this was the criteria used to define the gene set in the library, the following definitions were the top ten selections (displayed in Table 1):
After general functions, such as “DNA recombination”, “DNA-dependent DNA replication”, “DNA (apurinic or apyrimidinic site) lyase activity”, and “damaged DNA binding”, the highest ranked repair pathway was “base excision repair” or BER. Further analysis of the hits revealed that most of the hits mapped to the short patch repair pathway of BER, but some genes representing important BER functions did not hit in the screen. These include LIG3, MUTYH, POLB, and XRCC1. Additional siRNAs for these genes were tested for knockdown of HIV infection. Six individual siRNAs were tested for LIG3, MUTYH, POLB, and XRCC1. Of the six LIG3, MUTYH, and POLB siRNAs tested, three were capable of inhibiting HIV infection by 40% or more, confirming that these genes in the BER pathway are associated with HI V infection. Of the six XRCC1 siRNAs tested, two were capable of inhibiting HIV infection by 40% or more. Thus, the BER DNA repair pathway appears essential for HIV infection.
siRNAs chosen for further analysis were examined for expression in cells infected by HIV or tissues that harbor the virus, including CD4+ T-lymphocytes, macrophage, lymph node and thymus using a previously generated Body Atlas, which contains data from microarray experiments carried out with many different tissues compared against a species-specific reference pool. Expression of all of the hits was examined in CD4+ T-lymphocytes, macrophage, lymph node and thymus.
All of the genes had some expression in the cell types of interest, but some had higher expression levels in those tissues than others. The potential targets for HIV were grouped according to their tissue distribution, with high to moderate levels of expression in the tissues of interest being preferred. The ranking of targets proceeded as follows:
The siRNAs in Example 1 were then re-assayed as individual siRNAs to guard against off-target activity arising from any one of the individual siRNAs present in the initial pool. Each of the individual siRNAs was tested for inhibition of HIV infection using the methodology described in Example 1. The hit was considered to be confirmed if a minimum of two out of the three siRNAs inhibited β-galactosidase activity by a minimum of 40% relative to the luciferase siRNA negative control.
After compiling the data from screening pools and individual siRNAs for effects of knockdown on HIV infection, followed by electronic counterscreening, pathway mapping, tissue distribution, and the potential for druggable domains, the siRNA hits were ranked and then prioritized as follows:
As indicated above, some of the genes identified in the screen have a published link to HIV, including PARP1 (Kameoka et al., 2004), XRCC4 (Daniel et al., J Virol. 78:8573, 2004), and RAD52 (Lau et al., 2004). Identification of the same genes through siRNA screening demonstrates that this screening method can effectively isolate genes with a known interaction with HIV. The remaining genes have no published link to HIV and represent truly novel targets for treatment of HIV infection.
Preferred genes identified by the screening methodology of the present invention include the following:
PMS2L1: (Postmeiotic segregation increased 2-like 1) which is a member of a family of proteins related to predicted DNA mismatch repair protein PMS2. The protein sequence and encoding cDNA sequence are provided in
ERCC3: (Excision repair cross-complementing rodent repair deficiency (complementation group 3)) which is a DNA helicase involved in DNA repair and a member of the TFIIH transcriptional complex. The protein sequence and encoding cDNA sequence are provided in
APEX1: (apurinic:apyrimidinic endonuclease I), which is a multifunctional DNA repair enzyme involved in the oxidative stress response. The protein sequence and encoding cDNA sequence are provided in
POLI: a low fidelity DNA polymerase and 5′-deoxyribose phosphate lyase that functions in translesion DNA replication and base excision DNA repair. The protein sequence and encoding cDNA sequence are provided in
MUS81: MUS81 endonuclease is an endonuclease that cleaves Holliday junctions and it may be involved in the resolution of Holliday junctions formed during DNA replication responses to damage.
RUVBL2: (RUVB (E. coli)-like 2), which is a single-stranded DNA-stimulated ATPase and ATP-dependent DNA helicase. It is predicted to function in processes involved in DNA metabolism.
OGG1: (8-oxoguanine DNA glycosylase 1), which is a nuclear and mitochondrial base excision repair DNA enzyme that also has DNA-AP lyase activity. The protein sequence and encoding cDNA sequence are provided in
DCLRE1b: a protein containing a DNA repair metallo-beta-lactamase domain. It has a region of low similarity to a region of DNA cross-link repair protein (mouse Dclre1a), which is involved in repair of interstrand DNA cross-links. The protein sequence and encoding DNA sequence are provided in
RTEL1: Protein with high similarity to regulator of telomere length (mouse Rtel1), which is a DNA helicase-like protein that regulates telomere length and chromosome stability. The protein sequence and encoding cDNA sequence are provided in
IGFR1: (insulin-like growth factor 1 receptor), which mediates IGF-1 stimulated cell proliferation and inhibits apoptosis. The protein sequence and encoding DNA sequence are provided in
siRNAs targeting APEX1, DDB2, PMS2L1, POLE and POLI were tested for efficacy in preventing production of infectious viral particles. Briefly, HeLa P4/R5 cells were transfected with siRNAs targeting the above genes. The following day, cells were infected with HXB2 HIV. Four days after infection, a time point at which the virus has had an opportunity to infect cells and generate progeny virus which are released to the media, the viral supernatants were collected and used to infect freshly plated HeLa P4/R5 cells. Two days following infection, these cells were assessed for β-galactosidase expression as described above. A decrease in β-galactosidase activity in this assay signals that the levels of infectious HIV particles produced in cells treated with a particular siRNA are reduced, thus verifying that the decreased in HIV infection observed with the virus in Example 1 is owing to a direct effect on the viral life cycle and not to an effect on transcription of the β-galactosidase reporter gene or an indirect effect on cell metabolism.
PMS2L1 siRNAs strongly inhibited production of infectious HIV, giving a greater than 80% reduction in the viral reinfection assay. POLE siRNAs resulted in more than 40% reduction in viral particle formation. APEX1 and DDB2 resulted in more than 30% reduction in viral particle formation. POLI resulted in 28% reduction in viral particle formation.
Other embodiments are within the scope of the following claims. All of the compositions and methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this invention have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the compositions and methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. All such variations apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US06/30856 | 8/8/2006 | WO | 00 | 2/8/2008 |
Number | Date | Country | |
---|---|---|---|
60707012 | Aug 2005 | US | |
60708738 | Aug 2005 | US |