1. Field of Disclosure
The present disclosure relates generally to the information provided by the three-dimensional structure of the C-terminal domain of APOBEC3G (Apo3G-CD2) and other structure models of any APOBEC proteins obtained by computer modeling that bears similarity with a root-mean-square deviation (RMSD) of 2.0 with the Apo3G-CD2 monomer. Additionally, the present disclosure relates to the uses of the three-dimensional structure of Apo3G-CD2 and models of APOBEC proteins particularly for structure-based drug design of compounds, peptides or mutant APOBEC proteins designed to treat Hyper-IgM-2 Syndrome, B cell lymphomas and lentivirus infections, particularly the human immunodeficiency virus (HIV) infection.
2. General Background
The present disclosure relates to the APOBEC family members, which are involved in diverse biological functions. APOBEC-3G (Apo3G) restricts the replication of Human Immunodeficiency Virus (HIV) and Hepatitis B virus (HBV) via cytidine deamination on ssDNA or RNA binding. The present disclosure also related to the high-resolution crystal structure of an enzymatically active APOBEC protein, the C-terminal deaminase domain of Apo3G (Apo3G-CD2). The Apo3G-CD2 structure closely resembles the Apo2 structure and a detailed comparison suggests that differences in the loops near the active center influence substrate binding and activity. The Apo3G-CD2 structure differs significantly from a recently reported NMR structure of the A3G-CD2 mutant. The NMR structure lacks features, including the absence of a helical region (helix 1) and an intact β strand (β2), which may significantly contribute to the active center conformation and oligomer formation. The loops in the X-ray structure of Apo3G-CD2 are in open conformations around the active site and form a continuous “substrate groove” that can accommodate a ssDNA substrate. We have introduced mutations around the groove that identify critical residues involved in substrate specificity, ssDNA binding, and deaminase activity. The structure permits the modeling of the full-length Apo3G and provides insights into key residues and structural features that are important for HIV viral incorporation and viral restriction.
The apolipoprotein B mRNA-editing enzyme catalytic polypeptide (APOBEC)-3G (Apo3G, previously named CEM15) was discovered in a subtractive hybridization screen as the cellular factor that blocks the replication of a human immunodeficiency virus type-1 (HIV-1) strain that is deficient for its viral infectivity factor (Vif) protein (Chiu and Greene, 2007; Conticello et al., 2007b; Holmes et al., 2007). The HIV-1 expresses its Vif protein to overcome the Apo3G imposed replication block primarily by binding to Apo3G and targeting it for polyubiquitylation and proteasomal degradation (Chiu and Greene, 2007; Conticello et al., 2007b; Holmes et al., 2007). In the absence of Vif, Apo3G multimers associated with viral RNA are packaged into budding HIV-1 virions (Burnett and Spearman, 2007). When these virions enter new target cells, Apo3G introduces multiple cytidine deaminations on the HIV-1 minus strand cDNA to inactivate the provirus and block infection (Suspene et al., 2004; Yu et al., 2004). Apo3G can also disrupt the HIV-1 reverse transcription (RT) process (Guo et al., 2007; Iwatani et al., 2007; Xiao-Yu et al., 2007) and impair the integration of the provirus (Luo et al., 2007; Mbisa et al., 2007). Beyond HIV-1, Apo3G can inhibit other retroviruses, retrotransposons and the Hepatitis B Virus (HBV) (Chiu and Greene, 2007; Conticello et al., 2007b; Holmes et al., 2007). Although non-catalytic properties of Apo3G are significant (Chiu and Greene, 2007), recent reports show that the catalytic activity of Apo3G is necessary for efficient restriction of HIV-1 and retrotransposition when Apo3G is expressed at endogenous levels (Miyagi et al., 2007; Schumacher et al., 2008).
Apo3G belongs to the APOBEC family of polynucleotide cytidine deaminase enzymes including: APOBEC-1 (Apo1), APOBEC-2 (Apo2), APOBEC-3A-APOBEC-3H (Apo3A-Apo3H), APOBEC-4 (Apo4) and activation induced cytidine deaminase (AID). These enzymes have one or two conserved cytidine deaminase motifs defined as H-X-E-X2328-P-C-X24-C (X=any amino acid) and achieve remarkably diverse functions by binding or deaminating single-stranded (ss) DNA and RNA (Chiu and Greene, 2007; Conticello et al., 2007b; Holmes et al., 2007). The first discovered APOBEC protein, Apo-1, deaminates the 6666 cytidine in the apolipoprotein B mRNA thereby creating a premature stop codon leading to the formation of two protein isoforms with distinct roles in lipid metabolism (Conticello et al., 2007b). Cytidine deamination catalyzed by AID on the immunoglobulin gene during somatic hypermutation and class switch recombination is required for antibody affinity maturation (Bransteitter et al., 2006; Conticello et al., 2007b; Peled et al., 2007). The APOBEC-3 proteins inhibit retroviruses, various retrotransposons and some DNA viruses, such as the hepatitis B virus (HBV) and the adeno-associated virus (AAV) (Chiu and Greene, 2007; Conticello et al., 2007b; Holmes et al., 2007).
Attempts to understand the biochemical mechanisms of the APOBEC proteins from a structural perspective have involved comparative modeling with other related zinc coordinating deaminases that deaminate free cytidine nucleotide bases (Jarmuz et al., 2002; Navaratnam et al., 1998; Wedekind et al., 2003; Xie et al., 2004). Originally, a homology model of Apo-1 was created based on the square-shaped dimer structure of the Escherichia coli cytidine deaminase (ECDA) (Betts et al., 1994; Navaratnam et al., 1998). The active centers of an ECDA dimer, which consist of residues from different monomers, are buried and accessible only to small free nucleotide substrates. Apo1 was modeled to have the same structural organization as ECDA, with one catalytic active site region, a linker region and a pseudoactive site region. Sequence alignments of the newly discovered APOBEC proteins with Apo1 led to the same domain organization classification and oligomerization mode (Jarmuz et al., 2002; Navaratnam et al., 1998; Wedekind et al., 2003). Later, similar homology modeling of AID and Apo3G were attempted based on the Saccharomyces cerevisiae CDD1 cytidine deaminase (ScCDD1) structure that forms a square-shaped tetramer (Wedekind et al., 2003; Xie et al., 2004). Yet, similar to the ECDA, the active sites of the ScCDD1 square-like tetramer are buried and only accessible to free nucleotides, which is the known substrate En vivo. However, ScCDD1 is reported to deaminate the apoB mRNA in a yeast cell based assay (Dance et al., 2001). Upon removal of two neighboring molecules within the ScCDD1 tetramer structure, the active sites of the resulting ScCDD1 dimer are more accessible to larger nucleic acid substrates, which may provide an explanation as to how ScCDD1 can deaminate the apoB mRNA substrate in vitro.
Previously, we solved the first high-resolution crystal structure of an APOBEC protein, Apo2 (Prochnow et al., 2007). Many of the structural features of Apo2 are highly conserved among all of the Zn-deaminase superfamily members. However, in striking contrast to the square-shaped oligomers of the ECDA and ScCDD1, Apo2 forms a rod-shaped tetramer. Unique structural features of Apo2 prevent the square-shaped oligomerization and facilitate the formation of the elongated oligomer (Prochnow et al., 2007). Small-x ray scattering (SAXS) data of Apo3G dimers provides supporting evidence that other APOBECs have a similar elongated oligomerization (Chelico and Goodman, 2008; Wedekind et al., 2006). Although deamination activity of Apo2 has not yet been observed, the structure shows how the APOBEC active sites are accessible to DNA or RNA. To better understand how the APOBEC proteins act on their substrates, it is important to obtain additional structures of APOBEC proteins that are enzymatically characterized. Here, we report the high resolution crystal structure of a truncated Apo3G protein that consists of the enzymatically active CD2 domain. The surface representation of the Apo3G structure reveals a substrate binding “groove”. With structure-based mutagenesis, we identify residues within and near the groove that are important for substrate interactions and deaminase activity. The combination of structural and biochemical results provide a foundation for understanding how APOBEC family proteins bind nucleic acids, recognize substrates, and form oligomers.
APOBEC-2 (Apo2) belongs to the Apolioprotein B (APOB) mRNA-editing enzyme catalytic polypeptide (APOBEC) family of cytidine deaminases found exclusively in vertebrates (6). APOBEC nucleic acid deaminases modify genes by deaminating cytosines in mRNA coding sequences and in single-stranded DNA (6). Additionally, these enzymes can inhibit the replication of retroviruses, such as the human immunodeficiency virus (HIV) and hepatitis B virus (HBV), and retrotransposons. (4,5,6,7).
The APOBEC family is composed of APOBEC-1 (Apo1), APOBEC-2, Activation Induced Cytidine Deaminase (AID), APOBEC-3 (3A, 3B, 3C, 3DE, 3F, 3G, and 3H) and APOBEC-4 (2). Apo1, the first member to be characterized, deaminates C6666→U in the APOB mRNA thereby creating a premature stop codon, which results in a truncated APOB100 protein (APOB48) with a different function. Of the APOBEC3 subgroup of enzymes, APOBEC-3B (A3B), APOBEC-3F (A3F) and APOBEC-3G (A3G) have two cytidine deaminase domains (CDAs) and inhibit HIV-1 replication in the absence of the HIV viral infectivity factor protein (Vif) (4,5,6,7). In this setting, the APOBEC enzymes are incorporated into HIV virions and introduce multiple dC→dU deaminations on the minus strand of HIV viral cDNA formed during reverse transcription. Additionally, APOBEC enzymes inhibit HIV replication by a less characterized mechanism that is independent of deamination activity. APOBEC3 proteins also shield the human genome from the deleterious action of endogenous retrotransposons: A3A, A3B, A3C and A3F inhibit LINE 1 and Alu retrotransposition.
AID and Apo2 have a single CDA homology domain and are phylogenetically the most ancient members of the APOBEC family (2). AID induces somatic hypermutation (SHM) and class switch recombination (CSR) in activated germinal center B cells (3). Specific point mutations in AID are responsible for an immunodeficiency disease, Hyper-IgM-2 (HIGM-2) syndrome, which is characterized by a deficiency in isotype-switched and high affinity antibody formation (14,15). Additionally, aberrant expression of AID can induce B cell lymphomas (1,29).
Apo2, also known as ARCD-1, is ubiquitously expressed at low levels in both human and mouse and highly expressed in cardiac and skeletal muscle (16). Apo2 can form heterodimers with Apo1 and inhibit APOB mRNA deamination by Apo1 (16). Apo2 is encapsulated into HIV-1 virions when co-expressed with Δvif HIV-1 DNA in 293T cells (21). However, studies fail to show that Apo2 inhibits HIV-1 viral replication (21).
The APOBEC proteins use the same deamination activity and RNA binding properties to achieve diverse human biological functions. A comprehension of the molecular mechanisms of the APOBEC enzymes has been limited by the lack of 3-dimensional structures. Therefore, there is a need in the art for solving a 3-dimensional structure of Apo3G-CD2 and creating 3-dimensional models of other APOBEC enzymes derived from the Apo3G-CD2 structure.
Patients diagnosed with Hyper-IgM-2 Syndrome suffer from severe and recurrent infections throughout their lifetime. Currently, the only cure for Hyper-IgM-2 Syndrome is a bone marrow transplant if it is possible. The only treatment available is lifelong immunoglobulin replacement therapy. Given that mutations in the gene encoding the APOBEC protein, AID, cause Hyper-IgM-2 Syndrome, there is a need in the art for using information provided by the 3-dimensional structure of an APOBEC protein (such as Apo3G-CD2) to design drugs or mutant AID enzymes to serve as a cure or treatment for this chronic disease.
There is a need in the art for using the information provided by the 3-dimensional structure of an APOBEC protein (such as Apo3G-CD2) to design drugs that can affect the deamination activity of APOBEC proteins. The aberrant expression and deamination activity of AID has been shown to result in B cell lymphoma (1,29). Drugs that can restore the proper function of APOBEC deaminases and the timing of their function could prevent or treat B cell lymphomas.
HIV is a human retrovirus which leads to the depletion of CD4+ T lymphocytes resulting in the acquired immunodeficiency syndrome (AIDS). AIDS is characterized by various pathological conditions, including immune incompetence, opportunistic infections, neurological dysfunctions, and neoplastic growth. HIV-1 relies on Vif (virion infectivity factor), a protein encoded by HIV-1 and many related primate lentiviruses, to evade the potent innate antiviral function of APOBEC3G (also known as CEM15) and APOBEC3F in vivo. Most of the APOBEC-3 proteins are DNA cytidine deaminases that are incorporated into virions and produce extensive hypermutation in newly synthesized viral DNA formed during reverse transcription. These proteins can also inhibit HIV replication by a less characterized mechanism that is independent of deamination activity but that involves RNA binding.
Despite the availability of a number of drugs to combat HIV infections, there is a need in the art for additional drugs that inhibit HIV replication, and which are suitable for treating HIV and other lentiviral infections. The present invention addresses this need by providing structure based methods for identifying agents that target APOBEC enzymes and prevent Vif mediated degradation of APOBEC3G, APOBEC3F or other APOBEC enzymes that can restrict HIV replication under certain conditions.
There is a need in the art for using the information provided by the 3-dimensional structure of an APOBEC protein (such as Apo3G-CD2) to design drugs that can affect the oligomerization of the APOBEC protein. It has been demonstrated that oligomerization of APOBEC proteins occurs in vivo and in vitro. Information provided by the Apo3G-CD2 structure suggests this oligomerization is important for the biological functions of these enzymes. Drugs designed to affect oligomerization of APOBEC enzymes may enhance or restrict their biological functions, such as, deamination activity, RNA binding properties and viral restriction.
There is a need in the art for designing or identifying compounds that mimic, enhance, disrupt or compete with the interactions of APOBEC proteins with their substrates and other cellular or viral proteins, such as HIV Vif. Knowledge of the three dimensional structure of the protein enables a skilled artisan to design a compound that has a specific and appropriate conformation to achieve such an objective. Information from the three dimensional structure of the protein also enables a skilled artisan strategically select such a compound from available libraries of compounds. For example, knowledge of the three dimensional structure of Apo3G-CD2 enables one of skill in the art to design a compound that binds to Apo3G-CD2 or other APOBEC proteins that can inhibition interactions with the HIV Vif protein and restore the ability of APOBEC proteins to restrict HIV viral replication.
One embodiment of the present disclosure provides structural information derived from the Apo3G-CD2 crystal structure and models of related APOBEC proteins obtained by computer modeling that bears similarity with a root-mean-square deviation (RMSD) of 2.0 with the Apo3G-CD2 monomer. Additionally, other embodiments of the present disclosure provide methods for using this structural information to design drugs to treat chronic diseases, such as Hyper-IgM-2 Syndrome, B cell lymphomas, and infectious lentiviral infections, such as HIV. Yet other embodiments of the present disclosure drugs and related methods to affect the DNA or RNA binding properties, zinc coordination and/or oligomerization of APOBEC proteins. Additionally, yet other embodiments of the present disclosure include drugs and related methods to inhibit interactions with other cellular or viral proteins, including but not limited to, HIV Vif. The present disclosure provides these and other additional advantages described herein.
Definitions
According to the present disclosure, the C-terminus of APOBEC3G (Apo3G-CD2) can be defined as a protein that is characterized by the amino acid sequence including amino acids 197-380. Additionally, Apo3G-CD2 can be defined as a protein including amino acids 197-380 filed in the NCBI Genbank data base(NP—068594; GI: 13399304). According to the present disclosure, general reference to the Apo3G-CD2 protein is a protein that, at a minimum, includes an Apo3G-CD2 monomer and may include other biologically active fragments of APOBEC proteins.
A “homologue” of an APOBEC protein, or “homologous” APOBEC protein, includes proteins which differ from a naturally occurring APOBEC protein in that at least one or a few, but not limited to one or a few, amino acids have been deleted (e.g., a truncated version of the protein, such as a peptide or fragment), inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosylphosphatidyl inositol). Preferably, an APOBEC homologue has a buried amino acid sequence that is at least 70% similar in chemical nature (such as polar or hydrophobic), if not identical, to the amino acid sequence of a naturally occurring APOBEC protein, and more preferably, at least about 75%, and more preferably, at least about 80%, and more preferably, at least about 85%, and more preferably, at least about 90%, and more preferably, at least about 95% identical to the amino acid sequence of a naturally occurring APOBEC protein. Preferred three-dimensional structural homologues of an APOBEC protein are described in detail below.
According to the present disclosure, an APOBEC “homologue”, or a “homologous” APOBEC protein, preferably has, at a minimum, one or two cytidine deamination motifs that consists of H-X-E-X23-28-P-C-X2-4-C (H=Histidine; X=any amino acid; E=Glutamic Acid; P=Proline; and C=Cysteine).
In general, the biological activity or biological action of a protein refers to any function(s) exhibited or performed by the protein that is ascribed to the naturally occurring form of the protein as measured or observed in vivo (i.e., in the natural physiological environment of the protein) or in vitro (i.e., under laboratory conditions). Modifications of a protein, such as in a homologue or mimetic (discussed below), may result in proteins having the same biological activity as the naturally occurring protein, or in proteins having decreased or increased biological activity as compared to the naturally occurring protein. Modifications which result in a decrease in protein expression or a decrease in the activity of the protein, can be referred to as inactivation (complete or partial), down-regulation, or decreased action of a protein. Similarly, modifications which result in an increase in protein expression or an increase in the activity of the protein can be referred to as amplification, overproduction, activation, enhancement, up-regulation or increased action of a protein. As used herein, a protein that has “biological activity” refers to a protein that has an activity that can include any one, and preferably more than one, of the following characteristics: (a) binds to the following APOBEC substrates: DNA, RNA or zinc; (b) deaminates cytosines to uracils in single-stranded DNA or RNA.
An isolated protein, according to the present disclosure, is a protein that has been removed from its natural milieu (i.e., that has been subject to human manipulation) and can include purified proteins, partially purified proteins, recombinantly produced proteins, and synthetically produced proteins, for example. As such, “isolated” does not reflect the extent to which the protein has been purified. Preferably, an isolated protein, and particularly, an isolated APOBEC protein, is produced recombinantly.
Proteins of the present disclosure are preferably retrieved, obtained, and/or used in “substantially pure” form. As used herein, “substantially pure” refers to a purity that allows for the effective use of the protein in vitro, ex vivo or in vivo according to the present disclosure. For a protein to be useful in an in vitro, ex vivo or in vivo method according to the present disclosure, it is substantially free of contaminants, other proteins and/or chemicals that might interfere or that would interfere with its use in a method disclosed by the present disclosure, or that at least would be undesirable for inclusion with the protein when it is used in a method disclosed by the present disclosure. Preferably, a “substantially pure” protein, as referenced herein, is a protein that can be produced by any method (i.e., by direct purification from a natural source, recombinantly, or synthetically), and that has been purified from other protein components such that the protein comprises at least about 80% weight/weight of the total protein in a given composition (i.e., the protein is about 80% of the protein in a solution/composition/buffer), and more preferably, at least about 85%, and more preferably at least about 90%, and more preferably at least about 91%, and more preferably at least about 92%, and more preferably at least about 93%, and more preferably at least about 94%, and more preferably at least about 95%, and more preferably at least about 96%, and more preferably at least about 97%, and more preferably at least about 98%, and more preferably at least about 99%, weight/weight of the total protein in a given composition.
As used herein, a “structure” of a protein refers to the components and the manner of arrangement of the components to constitute the protein. The “three dimensional structure” or “tertiary structure” of the protein refers to the arrangement of the components of the protein in three dimensions. Such term is well known to those of skill in the art. It is also to be noted that the terms “tertiary” and “three dimensional” can be used interchangeably.
As used herein, the terms “crystalline Apo3G-CD2”, “Apo3G-CD2 crystal”, “APOBEC crystal” refer to crystallized Apo3G-CD2 or APOBEC protein and are intended to be used interchangeably. Preferably, a crystalline APOBEC is produced using the crystal formation method described herein, in particular according to the method disclosed in Example 1. An Apo3G-CD2 crystal of the present disclosure can comprise any crystal structure and preferably crystallizes as an orthorhombic crystal lattice. A suitable crystalline Apo3G-CD2 of the present disclosure includes a monomer of Apo3G-CD2 protein. One preferred crystalline Apo3G-CD2 comprises one Apo3G-CD2 protein in an asymmetric unit. Preferably, a composition of the present disclosure includes Apo3G-CD2 protein molecules arranged in a crystalline manner in a space group C2 so as to form a unit cell of dimensions a=83.464 Å, b=57.329 Å, c=40.5787 Å and α=90°, β=96.46°, γ=90°. A preferred crystal of the present disclosure provides X-ray diffraction data for determination of atomic coordinates of the Apo3G-CD2 protein to a resolution of about 4.0 Å, and preferably to about 3.0 Å, and more preferably to about 2.0 Å.
As used herein, the term “model” refers to a representation in a tangible medium of the three dimensional structure of a protein, polypeptide or peptide. For example, a model can be a representation of the three dimensional structure in an electronic file, on a computer screen, on a piece of paper (i.e., on a two dimensional medium), and/or as a ball-and-stick figure. Physical three-dimensional models are tangible and include, but are not limited to, stick models and space-filling models. The phrase “imaging the model on a computer screen” refers to the ability to express (or represent) and manipulate the model on a computer screen using appropriate computer hardware and software technology known to those skilled in the art. Such technology is available from a variety of sources including, for example, Evans and Sutherland, Salt Lake City, Utah, and Biosym Technologies, San Diego, Calif. The phrase “providing a picture of the model” refers to the ability to generate a “hard copy” of the model. Hard copies include both motion and still pictures. Computer screen images and pictures of the model can be visualized in a number of formats including space-filling representations, a carbon traces, ribbon diagrams and electron density maps.
As used herein, the phrase “common amino acid side chains” refers to amino acid side chains that are common to both the structural homologue and to the structure that is actually represented by such atomic coordinates.
According to the present disclosure, the phrase “providing a three dimensional structure of APOBEC protein” is defined as any means of providing, supplying, accessing, displaying, retrieving, or otherwise making available the three dimensional structure of Apo3G-CD2 or a three dimensional computer generated structure model of an APOBEC protein. For example, the step of providing can include, but is not limited to, accessing the atomic coordinates for the structure from a database; importing the atomic coordinates for the structure into a computer or other database; displaying the atomic coordinates and/or a model of the structure in any manner, such as on a computer, on paper, etc.; and determining the three dimensional structure of Apo3G-CD2 de novo using the guidance provided herein.
As used herein, structure based drug design refers to the prediction of a conformation of a peptide, polypeptide, protein, or conformational of an interaction between a peptide or polypeptide, and a compound, using the three dimensional structure of the peptide, polypeptide or protein. Typically, structure based drug design is performed with a computer. For example, generally, for a protein to effectively interact with (or bind to) a compound, it is necessary that the three dimensional structure of the compound assume a compatible conformation that allows the compound to bind to the protein in such a manner that a desired result is obtained upon binding.
The above-mentioned features and objects of the present disclosure will become more apparent with reference to the following description taken in conjunction with the accompanying drawings wherein like reference numerals denote like elements and in which:
Another embodiment of the present disclosure relates to the information provided by the three-dimensional crystal structure of a human APOBEC protein, Apo3G-CD2, and other structure models of APOBEC proteins obtained by computer modeling that bear similarity with an Apo3G-CD2 monomer and have a root-mean-square deviation (RMSD) of 2.0. Additionally, yet another embodiment of the present disclosure relates to how the information provided by the three-dimensional Apo3G-CD2 crystal structure and models of other homologous APOBECS can be used for drug discovery. Since Apo3G-CD2 shares sufficient sequence and structural similarities to all the other homologues included in the APOBEC protein family, it can be used for homology modeling to obtain computer models of other APOBEC proteins. For example, Apo3G-CD2 shares a sequence homology of 43% and buried residue homology of 83% with the N-terminal catalytic domain of APOBEC-2. With the C-terminal catalytic domain of APOBEC-3G, APOBEC-2 shares a sequence homology of 46% and buried residue homology of 83%. The extent of homology between the two proteins indicates that the proteins are folded in a similar manner. Therefore, information provided by the Apo3G-CD2 crystal structure can be used to model the single domain APOBEC proteins (AID, APOBEC-1, APOBEC-3A, APOBEC-3C, APOBEC3H, APOBEC-4) and the double-domain APOBEC proteins (APOBEC3B, APOBEC-3DE, APOBEC3G and APOBEC3F).
Yet another embodiment of the present disclosure relates to the structural information pertaining to the unique features of an APOBEC active site, which is provided by the three-dimensional crystal structure of Apo3G-CD2 and other structure models of APOBEC proteins obtained by computer modeling that bear similarity with an Apo3G-CD2 monomer and have a root-mean-square deviation (RMSD) of 2.0.
Yet another embodiment of the present disclosure relates to the structural information pertaining to unique features of APOBEC oligomerization, which is provided by the three-dimensional crystal structure of Apo3G-CD2 and other structure models of APOBEC proteins obtained by computer modeling that bear similarity with an Apo3G-CD2 monomer and have a root-mean-square deviation (RMSD) of 2.0.
Yet another embodiment of the present disclosure relates to the structural information pertaining to the APOBEC residues which reside on the surface of APOBEC proteins, which is provided by the three-dimensional crystal structure of Apo3G-CD2 and other structure models of APOBEC proteins obtained by computer modeling that bear similarity with an Apo3G-CD2 monomer and have a root-mean-square deviation (RMSD) of 2.0.
Yet another embodiment of the present disclosure relates to a method for the identification of compounds which inhibit APOBEC DNA or RNA binding and Zinc coordination within the APOBEC active site. Such compounds could be used to prevent or treat aberrant cytidine deamination activity of APOBEC enzymes causing chronic diseases, such as B cell lymphomas. Additionally, such compounds could enhance the anti-viral action of APOBEC enzymes. It has been demonstrated that APOBEC3G and APOBEC3F are associated with inhibitory RNA molecules and/or inhibitory ribonucleoprotein complexes in cells that are targets for HIV infection (4). Releasing APOBEC3G or APOBEC3F from these RNA complexes with a drug that inhibits RNA binding, while DNA binding remains intact, could restore their post entry HIV viral restriction properties. In this case, APOBEC3G or APOBEC3F would be able to inactivate the HIV provirus by introducing extensive cytidine deaminations onto the viral cDNA.
Yet another embodiment of the present disclosure includes a method including one or more steps of: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein; and, (2) identifying a candidate compound that can affect DNA or RNA binding or zinc coordination within the APOBEC active sites via structure based drug design utilizing structural information provided in (1). The three dimensional structure of Apo3G-CD2 or a model(s) of homologous APOBEC proteins includes structures: (a) defined by atomic coordinates of a three dimensional structure of a crystalline Apo3G-CD2 protein with the atomic coordinates represented in table 1 (monomer); (b) defined by atomic coordinates wherein at least 50% of the structure has an average root-mean-square deviation (RMSD) from backbone atoms in the secondary structure elements represented by the atomic coordinates of (a) of equal to or less than about 2.5 Å for main chain Ca carbon backbone; and (c) a structure defined by atomic coordinates derived from Apo3G-CD2 molecules arranged in a crystalline manner in a space group C2 so as to form a unit cell of dimensions: a=83.464 Å, b=57.329 Å, c=40.5787 Å and α=90°, β=96.46°, γ=90°.
In another aspect of this embodiment, the methods described above further includes the step (3) of screening lead compounds identified in step (2) that inhibit the binding of an APOBEC protein to DNA, RNA or zinc. The step (3) of screening can include: (a) contacting the candidate compound identified in step (2) with an APOBEC protein or a fragment thereof or with the APOBEC substrates (DNA, RNA or zinc) under conditions in which the APOBEC protein can bind its substrate in the absence of the candidate compound; and (b) measuring the binding affinity of the APOBEC protein or fragment thereof to its substrates (DNA, RNA or zinc); wherein a candidate inhibitor compound is selected as a compound that inhibits the binding of the APOBEC protein to its substrate when there is a decrease in the binding affinity of the APOBEC protein or fragment thereof to its substrate (DNA,RNA or zinc), as compared to in the absence of the candidate inhibitor compound.
Another embodiment of the present disclosure relates to a method for the identification of compounds which enhance the ability of the APOBEC protein to bind DNA or RNA. Such compounds could potentially restore the function of AID in patients diagnosed with Hyper-IgM-2 syndrome. A subset of these patients has mutations in the gene encoding for AID that may impair DNA binding. Compounds that enhance the DNA binding capabilities of AID could potentially correct this defect. Additionally, these compounds may enhance the anti-viral properties of the APOBEC enzymes. This method includes the steps of: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein as described in detail above; and, (2) identifying a candidate compound that can enhance DNA or RNA binding via structure based drug design utilizing structural information provided in (1). The step (3) of screening can include: (a) contacting the candidate compound identified in step (2) with an APOBEC protein or a fragment thereof or with the APOBEC substrates, DNA or RNA, under conditions in which the APOBEC protein can bind its substrate in the absence of the candidate compound; and (b) measuring the binding affinity of the APOBEC protein or fragment thereof to its substrates (DNA or RNA); wherein a lead compound is selected as a compound that enhances the binding of the APOBEC protein to its substrate (DNA or RNA) when there is an increase in the binding affinity of the APOBEC protein or fragment thereof to its substrate (DNA or RNA), as compared to in the absence of the lead compound.
Yet another embodiment of the present disclosure relates to a method for the identification of compounds which disrupt APOBEC protein oligomerization. Such compounds could be used to prevent or treat aberrant cytidine deamination activity of APOBEC enzymes causing chronic diseases, such as B cell lymphomas. Experimental evidence has been reported which suggests that APOBEC oligomerization can alter its deamination activity. Yet another embodiment related to a method including one or more of the steps of: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein as described in detail above; and, (2) identifying a candidate compound that can disrupt oligomerization (for example, dimerization or tetramerization) via structure based drug design utilizing structural information provided in (1). The step (3) of screening can include: (a) contacting the candidate compound identified in step (2) with an APOBEC protein or a fragment thereof under conditions in which the APOBEC protein can oligomerize in the absence of the candidate compound; and (b) measuring the oligomerization of the APOBEC protein or fragment thereof; wherein a candidate inhibitor compound is selected as a compound that inhibits the oligomerization of the APOBEC protein when there is a decrease in the oligomerization of the APOBEC protein or fragment thereof, as compared to in the absence of the candidate inhibitor compound. APOBEC oligomerization can be measured by many techniques including, but not limited to: gel filtration, dynamic light scattering, native gel analysis, protein cross linking, immunoprecipitation, FRET analysis or BIACore.
Yet another embodiment of the present disclosure relates to a method for the identification of compounds which enhance APOBEC protein oligomerization. Such compounds could be used to enhance the anti-viral activity of the APOBEC enzymes by increasing DNA deamination activity and RNA binding to the viral RNA. Further, such compounds could be used to repair the effects of mutations in the AID protein which disrupt AID oligomerization and cause Hyper-IgM-2 syndrome. In one aspect of the present disclosure, this method includes the steps of: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein as described in detail above; and, (2) identifying a candidate compound that can enhance oligomerization (for example, dimerization or tetramerization) via structure based drug design utilizing structural information provided in (1). The step (3) of screening can include: (a) contacting the candidate compound identified in step (2) with an APOBEC protein or a fragment thereof under conditions in which the APOBEC protein can oligomerize in the absence of the candidate compound; and (b) measuring the oligomerization of the APOBEC protein or fragment thereof; wherein a lead compound is selected as a compound that enhances the oligomerization of the APOBEC protein when there is an increase in the oligomerization of the APOBEC protein or fragment thereof, as compared to in the absence of the lead compound. APOBEC oligomerization can be measured by many techniques including but not limited to: gel filtration, dynamic light scattering, native gel analysis, protein cross linking, immunoprecipitation, FRET analysis or BIACore.
Yet another embodiment of the present disclosure relates to a method for the identification of compounds which inhibit HIV viral infectivity factor (Vif) protein from binding to an APOBEC protein. The HIV Vif protein can bind to most all of the APOBEC enzymes regardless of their ability to restrict HIV replication. For example, Vif can bind to AID and inhibit its deamination activity. In cells that are targets for HIV infection, Vif binds to APOBEC3G and APOBEC3F and targets it for ubiquitylation and proteasome mediated degradation. Compounds that can disrupt Vif and APOBEC protein interactions may serve as very effective anti-viral drugs.
In one aspect of the method described above, the steps include one or more of the following: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein as described in detail above; and, (2) identifying a candidate compound that can disrupt Vif and APOBEC binding interactions via structure based drug design utilizing structural information provided in (1). The step (3) of screening can include: (a) contacting the candidate compound identified in step (2) with an APOBEC protein or a fragment thereof, or with Vif or a fragment thereof, under conditions in which the APOBEC protein and Vif can interact in the absence of the candidate compound; and (b) measuring the binding interactions of the APOBEC protein or fragment thereof with Vif or a fragment thereof; wherein a lead inhibitory compound is selected when there is a decrease in the binding interactions of the APOBEC protein or fragment thereof with Vif or a fragment thereof, as compared to in the absence of the lead compound.
Yet another embodiment of the present disclosure relates to a method for the identification of compounds which inhibit APOBEC ubiquitylation and proteasomal mediated degradation. In cells that are targets for HIV infection, Vif binds to APOBEC3G and APOBEC3F and targets it for ubiquitylation and proteasomal mediated degradation. Compounds that can disrupt APOBEC ubiquitlyation may serve as very effective anti-viral drugs. In one aspect of the methods described above, the method includes one or more of the steps of: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein as described in detail above; and, (2) identifying a candidate compound that can disrupt Vif and APOBEC binding interactions via structure based drug design utilizing structural information provided in (1). The step (3) of screening can include: (a) contacting the candidate compound identified in step (2) with an APOBEC protein or a fragment thereof under conditions in which the APOBEC protein or a fragment thereof becomes ubiquitylated in the absence of the candidate compound; and (b) measuring the ubiquitlyation of the APOBEC protein of fragment thereof; wherein a lead inhibitory compound is selected when there is a decrease in ubiquitylation of the APOBEC protein or fragment thereof, as compared to in the absence of the lead compound. Ubiquitlyation can be measured by many techniques including, but not limited to: immunoprecipitation and western blot analysis with an antibody specific for ubiquitin and the APOBEC protein.
In yet another aspect of various embodiments of the present disclosure, the step (2) of identifying a compound in the method described above in this present disclosure can include any suitable method of drug design, drug screening or identification, including, but not limited to: directed drug design, random drug design, grid-based drug design, and/or computational screening of one or more databases of chemical compounds.
Yet another embodiment of the present disclosure relates to a method for preparing APOBEC proteins having modified biological activity. In one embodiment, the method includes the steps of: (1) providing a three dimensional structure of an APOBEC protein or a model of a homologous APOBEC protein as described in detail above; (2) utilizing the structural information provided by (1) to identify at least one or more sites in the structure contributing to the biological activity of an APOBEC protein; and (3) modifying at least one or more sites in an APOBEC protein to alter its biological activity. The mutant APOBEC protein comprises an amino acid sequence that differs from the wildtype sequence via amino acid substitutions. The APOBEC mutant protein includes mutations that can inhibit, reduce or enhance oligomerization, zinc coordination, binding to DNA or RNA substrates, binding to cellular co-factors or viral proteins including but not limited to HIV Vif, as compared to the wild-type APOBEC protein.
Yet another embodiment of the present disclosure includes a method for producing crystals of APOBEC-2. Native and selenium-methionine labeled protein is concentrated to 15 mg per ml in a buffer containing 25 mM Hepes, pH 7.0, 50 mM NaCl and 10 mM dithiothreitol. Crystals are grown at 18° C. by hanging-drop vapor diffusion from a reservoir solution of 85 mM Na-citrate, pH 5.6, 160 mM LiSO4, 24% (weight/volume) polyethylene glycol monomethyl ether and 15% glycerol.
Yet another embodiment of the present disclosure includes a representation, or model, of the three dimensional structure of an APOBEC protein, such as a computer model. A computer model of the present disclosure can be produced using any suitable software program, including, but not limited to, MOLSCRIPT 2.0 (Avatar Software AB, Heleneborgsgatan 21C, SE-11731 Stockholm, Sweden), the graphical display program 0 (Jones et. al., Acta Crystallography, vol. A47, p. 110, 1991), the graphical display program GRASP, or the graphical display program INSIGHT. Suitable computer hardware useful for producing an image of the present disclosure is known to those of skill in the art (e.g., a Silicon Graphics Workstation).
A representation, or model, of the three dimensional structure of the Apo3G-CD2or any other APOBEC protein for which a crystal has been produced can also be determined using techniques which include molecular replacement or SIR/MIR (single/multiple isomorphous replacement). Methods of molecular replacement are generally known by those of skill in the art (generally described in Brunger, Meth. Enzym., vol. 276, pp. 558-580, 1997; Navaza and Saludjian, Meth. Enzym., vol. 276, pp. 581-594, 1997; Tong and Rossmann, Meth. Enzym., vol. 276, pp. 594-611, 1997; and Bentley, Meth. Enzym., vol. 276, pp. 611-619, 1997, each of which are incorporated by this reference herein in their entirety) and are performed in a software program including, for example, AmoRe (CCP4, Acta Cryst. D50, 760-763 (1994) or XPLOR. Briefly, X-ray diffraction data is collected from the crystal of a crystallized target structure.
The X-ray diffraction data is transformed to calculate a Patterson function. The Patterson function of the crystallized target structure is compared with a Patterson function calculated from a known structure (referred to herein as a search structure). The Patterson function of the crystallized target structure is rotated on the search structure Patterson function to determine the correct orientation of the crystallized target structure in the crystal. The translation function is then calculated to determine the location of the target structure with respect to the crystal axes. Once the crystallized target structure has been correctly positioned in the unit cell, initial phases for the experimental data can be calculated. These phases are necessary for calculation of an electron density map from which structural differences can be observed and for refinement of the structure. Preferably, the structural features (e.g., amino acid sequence, conserved di-sulphide bonds, and β-strands or β-sheets) of the search molecule are related to the crystallized target structure.
In yet another embodiment of the present disclosure, a three dimensional structure of an Apo3G-CD2 homologue protein includes a structure represented by atomic coordinates, wherein at least 50% of the structure has an average root-mean-square deviation (RMSD) from backbone atoms in secondary structure elements the three dimensional structure represented by the atomic coordinates of Table 1 of equal to or less than about 1.0 Å. Such a structure can be referred to as a structural homologue of the APOBEC structures defined by Table 1. Preferably, at least 50% of the structure has an RMSD from backbone atoms in secondary structure elements in the three dimensional structure represented by the atomic coordinates of Table 1 of equal to or less than about 0.7 Å, equal to or less than about 0.5 Å, and most preferably, equal to or less than about 0.3 Å. In another embodiment, a three dimensional structure of an Apo3G-CD2 protein provided by the present disclosure includes a structure defined by atomic coordinates that define a three dimensional structure, wherein at least about 75% of such structure has the recited average RMSD value, and more preferably, at least about 90% of such structure has the recited average RMSD value, and most preferably, about 100% of such structure has the recited average RMSD value.
In yet another embodiment of the present disclosure, the RMSD of a structural homologue of Apo3G-CD2 can be extended to include atoms of amino acid side chains. As used herein, the phrase “common amino acid side chains” refers to amino acid side chains that are common to both the structural homologue and to the structure that is actually represented by such atomic coordinates. Preferably, at least 50% of the structure has an average RMSD from common amino acid side chains in the three dimensional structure represented by the atomic coordinates of Table 1 of equal to or less than about 1.0 Å equal to or less than about 0.7 Å, equal to or less than about 0.5 Å, and most preferably, equal to or less than about 0.3 Å. In a more preferred embodiment, a three dimensional structure of an Apo3G-CD2 protein provided by the present disclosure includes a structure defined by atomic coordinates that define a three dimensional structure, wherein at least about 75% of such structure has the recited average RMSD value, and more preferably, at least about 90% of such structure has the recited average RMSD value, and most preferably, about 100% of such structure has the recited average RMSD value.
Suitable structures and models useful for structure based drug design are disclosed herein. Preferred target structures to use in a method of structure based drug design include any representations of structures produced by any modeling method disclosed herein, including molecular replacement and fold recognition related methods.
According to the present disclosure, the step of designing a compound for testing in a method of structure based identification of the present disclosure can include creating a new chemical compound or searching databases of libraries of known compounds (e.g., a compound listed in a computational screening database containing three dimensional structures of known compounds). Designing can also be performed by simulating chemical compounds having substitute moieties at certain structural features. The step of designing can include selecting a chemical compound based on a known function of the compound. A preferred step of designing comprises computational screening of one or more databases of compounds in which the three dimensional structure of the compound is known and is interacted (e.g., docked, aligned, matched, interfaced) with the three dimensional structure of an APOBEC protein by computer (e.g. as described by Humblet and Dunbar, Animal Reports in Medicinal Chemistry, vol. 28, pp. 275-283, 1993, M Venuti, ed., Academic Press). Methods to synthesize suitable chemical compounds are known to those of skill in the art and depend upon the structure of the chemical being synthesized. Methods to evaluate the bioactivity of the synthesized compound depend upon the bioactivity of the compound (e.g., inhibitory or stimulatory) and are disclosed herein.
Various other methods of structure-based drug design are disclosed in Maulik et al., 1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., which is incorporated herein by reference in its entirety. Maulik et al. disclose, for example, methods of directed design, in which the user directs the process of creating novel molecules from a fragment library of appropriately selected fragments; random design, in which the user uses a genetic or other algorithm to randomly mutate fragments and their combinations while simultaneously applying a selection criterion to evaluate the fitness of candidate ligands; and a grid-based approach in which the user calculates the interaction energy between three dimensional receptor structures and small fragment probes, followed by linking together of favorable probe sites.
In a molecular diversity strategy, large compound libraries are synthesized, for example, from peptides, oligonucleotides, carbohydrates and/or synthetic organic molecules, using biological, enzymatic and/or chemical approaches. The critical parameters in developing a molecular diversity strategy include subunit diversity, molecular size, and library diversity. The general goal of screening such libraries is to utilize sequential application of combinatorial selection to obtain high-affinity ligands for a desired target, and then to optimize the lead molecules by either random or directed design strategies. Methods of molecular diversity are described in detail in Maulik, et al., ibid.
Maulik et al. also disclose, for example, methods of directed design, in which the user directs the process of creating novel molecules from a fragment library of appropriately selected fragments; random design, in which the user uses a genetic or other algorithm to randomly mutate fragments and their combinations while simultaneously applying a selection criterion to evaluate the fitness of candidate ligands; and a grid-based approach in which the user calculates the interaction energy between three dimensional receptor structures and small fragment probes, followed by linking together of favorable probe sites.
In the present method of structure based drug design, it is not necessary to align a candidate chemical compound (i.e., a chemical compound being analyzed in, for example, a computational screening method of the present disclosure) to each residue in a target site (target sites will be discussed in detail below). Suitable candidate chemical compounds can align to a subset of residues described for a target site. Preferably, a candidate chemical compound comprises a conformation that promotes the formation of covalent or noncovalent crosslinking between the target site and the candidate chemical compound. Preferably, a candidate chemical compound binds to a surface adjacent to a target site to provide an additional site of interaction in a complex. When designing an antagonist (i.e., a chemical compound that inhibits the binding of a substrate for an APOBEC protein by blocking a binding site or interface), the antagonist should bind with sufficient affinity to the binding site or to substantially prohibit a substrate (i.e., a molecule that specifically binds to the target site) from binding to a target area. It will be appreciated by one of skill in the art that it is not necessary that the complementarity between a candidate chemical compound and a target site extend over all residues specified here in order to inhibit or promote binding of a ligand.
In general, the design of a chemical compound possessing stereochemical complementarity can be accomplished by techniques that optimize, chemically or geometrically, the “fit” between a chemical compound and a target site. Such techniques are disclosed by, for example, Sheridan and Venkataraghavan, Acc. Chem Res., vol. 20, p. 322, 1987: Goodford, J Med. Chem., vol. 27, p. 557, 1984; Beddell, Chem. Soc Reviews, vol. 279, 1985; Hol, Angew. Chem., vol. 25, p. 767, 1986; and Verlinde and Hol, Structure, vol. 2, p. 577, 1994, each of which are incorporated by this reference herein in their entirety.
One embodiment of the present disclosure for structure based drug design comprises identifying a chemical compound that complements the shape of an APOBEC protein, or a portion thereof. Such method is referred to herein as a “geometric approach”. In a geometric approach, the number of internal degrees of freedom (and the corresponding local minima in the molecular conformation space) is reduced by considering only the geometric (hard-sphere) interactions of two rigid bodies, where one body (the active site) contains pockets” or “grooves” that form binding sites for the second body (the complementing molecule, such as a ligand).
The geometric approach is described by Kuntz et al., J Mol. Biol., vol. 161, p. 269, 1982, which is incorporated by this reference herein in its entirety. The algorithm for chemical compound design can be implemented using the software program DOCK Package, Version 1.0 (available from the Regents of the University of California). Pursuant to the Kuntz algorithm, the shape of the cavity or groove on the surface of a structure (e.g., Apo3G-CD2) at a binding site or interface is defined as a series of overlapping spheres of different radii. One or more extant databases of crystallographic data (e.g., the Cambridge Structural Database System maintained by University Chemical Laboratory, Cambridge University, Lensfield Road, Cambridge CB2 1EW, U.K.) or the Protein Data Bank maintained by Brookhaven National Laboratory, is then searched for chemical compounds that approximate the shape thus defined. Chemical compounds identified by the geometric approach can be modified to satisfy criteria associated with chemical complementarity, such as hydrogen bonding, ionic interactions or Van der Waals interactions.
Yet another embodiment of the present disclosure for structure based identification of compounds comprises determining the interaction of chemical groups (“probes”) with an active site at sample positions within and around a binding site or interface, resulting in an array of energy values from which three dimensional contour surfaces at selected energy levels can be generated. This method is referred to herein as a “chemical-probe approach.” The chemical-probe approach to the design of a chemical compound of the present disclosure is described by, for example, Goodford, J Med Chem., vol. 28, p. 849, 1985, which is incorporated by this reference herein in its entirety, and is implemented using an appropriate software package, including for example, GRID (available from Molecular Discovery Ltd., Oxford 0X2 9LL, U.K.). The chemical prerequisites for a site-complementing molecule can be identified at the outset, by probing the active site of an APOBEC protein, with different chemical probes, e.g., water, a methyl group, an amine nitrogen, a carboxyl oxygen and/or a hydroxyl. Preferred sites for interaction between an active site and a probe are determined. Putative complementary chemical compounds can be generated using the resulting three dimensional pattern of such sites
According to the present disclosure, suitable candidate compounds to test using the method of the present disclosure include proteins, peptides or other organic molecules, and inorganic molecules. Suitable organic molecules include small organic molecules. Peptides refer to small molecular weight compounds yielding two or more amino acids upon hydrolysis. A polypeptide is comprised of two or more peptides. As used herein, a protein is comprised of one or more polypeptides. Preferred therapeutic compounds to design include peptides composed of “L” and/or “D” amino acids that are configured as normal or retroinverso peptides, peptidomimetic compounds, small organic molecules, or homo- or hetero-polymers thereof, in linear or branched configurations.
Preferably, a compound that is identified by the method of the present disclosure originates from a compound having chemical and/or stereochemical complementarity with an APOBEC protein. Such complementarity is characteristic of a compound that matches the surface of the protein either in shape or in distribution of chemical groups and binds to the APOBEC protein to promote or inhibit APOBEC ligand binding in a cell expressing an APOBEC protein upon the binding of the compound to the APOBEC protein. More preferably, a compound that binds to a ligand binding site of an APOBEC protein associates with an affinity of at least about 10-6 M, and more preferably with an affinity of at least about 10-7 M, and more preferably with an affinity of at least about 10-8 M.
Preferably, four general sites on an APOBEC protein are targets for structure based drug design (i.e., target sites), although other sites may become apparent to those of skill in the art. The four preferred sites include: (1) the interfaces between APOBEC monomers, dimers and tetramers; (2) the active site where zinc is coordinated and where cytosine to uracil deamination activity occurs on DNA or RNA substrates (3) the D128 residue on APOBEC3G or D118 on AID (4) and DNA or RNA binding sites. Combinations of any of these general sites are also suitable target sites.
The following discussion provides specific detail on compound identification (i.e., drug design) using target sites of APOBEC proteins based on the Apo3G-CD2 three-dimensional structure. It is to be understood, however, that one of skill in the art, using the description of the Apo3G-CD2 structure provided herein, will be able to identify compounds that are potential candidates for inhibiting, stimulating or enhancing the interaction of APOBEC proteins with their other substrates, cellular co-factors and other viral accessory proteins.
A candidate compound for binding to an APOBEC protein, including one of the preferred target sites described above, is identified by one or more of the methods of structure-based identification discussed above. As used herein, a “candidate compound” or “lead compound” refers to a compound that is selected by a method of structure-based identification described herein as having a potential for binding to an APOBEC protein (or its substrate) on the basis of a predicted conformational interaction between the candidate compound and the target site of the APOBEC protein. The ability of the candidate compound to actually bind to an APOBEC protein can be determined using techniques known in the art, as discussed in some detail below. A “putative compound” is a compound with an unknown regulatory activity, at least with respect to the ability of such a compound to bind to and/or regulate an APOBEC protein as described herein. Therefore, a library of putative compounds can be screened using structure based identification methods as discussed herein, and from the putative compounds, one or more candidate compounds for binding to an APOBEC protein can be identified. Alternatively, a candidate compound for binding to an APOBEC protein can be designed de novo using structure based drug design, also as discussed above. Candidate compounds can be selected based on their predicted ability to inhibit the binding of an APOBEC protein to its substrate, cellular co-factor or a viral accessory protein, such as HIV Vif and to disrupt or enhance the oligomerization of APOBEC monomers or dimers.
In accordance with the present disclosure, a cell-based assay is conducted under conditions which are effective to screen for candidate compounds useful in the method of the present disclosure. Effective conditions include, but are not limited to, appropriate media, temperature, pH and oxygen conditions that permit the growth of the cell that expresses the receptor. An appropriate, or effective, medium refers to any medium in which a cell that naturally or recombinantly expresses an APOBEC protein, when cultured, is capable of cell growth and expression of the APOBEC protein. Such a medium is typically a solid or liquid medium comprising growth factors and assimilable carbon, nitrogen and phosphate sources, as well as appropriate salts, minerals, metals and other nutrients, such as vitamins. Culturing is carried out at a temperature, pH and oxygen content appropriate for the cell. Such culturing conditions are within the expertise of one of ordinary skill in the art.
Cells that are useful in the cell-based assays of the present disclosure include any cell that expresses an APOBEC protein and particularly, other proteins that are associated with that APOBEC protein. Such cells include bacterial cells. Additionally, certain cells may be induced to express an APOBEC protein recombinantly. Therefore, cells that express an APOBEC protein can include cells that naturally express an APOBEC protein, recombinantly express an APOBEC protein, or which can be induced to express an APOBEC protein. Cells useful in some embodiments can also include cells that can express the HIV Vif protein, such as Hela or 293T cells.
The assay of the present disclosure can also be a non-cell based assay. In this embodiment, the candidate compound can be directly contacted with an isolated APOBEC protein or fragment of that APOBEC protein, and the ability of the candidate compound to bind to the APOBEC protein can be evaluated by a binding assay. The assay can, if desired, additionally include the step of further analyzing whether candidate compounds which bind to a portion of the APOBEC protein are capable of increasing or decreasing the activity of the APOBEC protein or disrupting its interactions with the HIV Vif protein. Such further steps can be performed by cell-based assay, as described above, or by non-cell-based assay.
Alternatively, soluble APOBEC protein may be recombinantly expressed and utilized in non-cell based assays to identify compounds that bind to APOBEC proteins. Recombinantly expressed APOBEC polypeptides or fusion proteins containing one or more extracellular domains of an APOBEC protein can be used in the non-cell based screening assays. In non-cell based assays the recombinantly expressed APOBEC protein is attached to a solid substrate by means well known to those in the art. For example, APOBEC3G and/or cell lysates containing such proteins can be immobilized on a substrate such as: artificial membranes, organic supports, biopolymer supports and inorganic supports. The protein can be immobilized on the solid support by a variety of methods including adsorption, cross-linking (including covalent bonding), and entrapment. Adsorption can be through van del Waal's forces, hydrogen bonding, ionic bonding, or hydrophobic binding. Exemplary solid supports for adsorption immobilization include polymeric adsorbents and ion-exchange resins. Solid supports can be in any suitable form, including in a bead form, plate form, or well form. The test compounds are then assayed for their ability to bind to an APOBEC protein and disrupt interactions with their substrates, cellular co-factors or viral accessory proteins such as HIV Vif.
Yet another embodiment of the present disclosure relates to a therapeutic composition that, when administered to an animal, inhibits or prevents the degradation of an APOBEC protein by proteasome mediated degradation. The therapeutic composition comprises a compound that inhibits the binding of HIV Vif protein to APOBEC3G or APOBEC3F. The method comprises: (a) providing a three dimensional structure or structure model of an APOBEC protein as previously described herein; (b) identifying a candidate compound for binding to the APOBEC protein by performing structure based drug design with the structure of (a) to identify a compound structure that binds to the three dimensional structure of the APOBEC protein; (c) synthesizing the candidate compound; and (d) selecting candidate compounds that inhibit HIV Vif binding to the APOBEC protein in the presence of the candidate compounds. Preferably, the compounds inhibit the formation of a complex between the APOBEC protein and HIV Vif.
Another embodiment of the present disclosure relates to a therapeutic composition that, when administered to an animal, inhibits or prevents the deamination activity of an APOBEC protein. One embodiment of the method comprises one or more of the following: (a) providing a three dimensional structure or structure model of an APOBEC protein as previously described herein; (b) identifying a candidate compound for binding to the APOBEC protein by performing structure based drug design with the structure of (a) to identify a compound structure that binds to the three dimensional structure of the APOBEC protein; (c) synthesizing the candidate compound; and (d) selecting candidate compounds that inhibit deamination activity of the APOBEC protein in the presence of the candidate compounds. Preferably, the compounds prevent or inhibit the formation of B cell lymphomas.
Methods of identifying candidate compounds and selecting compounds that bind to and activate or inhibit an APOBEC protein have been previously described herein. Candidate compounds can be synthesized using techniques known in the art, and depending on the type of compound. Synthesis techniques for the production of non-protein compounds, including organic and inorganic compounds are well known in the art.
For smaller peptides, chemical synthesis methods are preferred. For example, such methods include well known chemical procedures, such as solution or solid-phase peptide synthesis, or semi-synthesis in solution beginning with protein fragments coupled through conventional solution methods. Such methods are well known in the art and may be found in general texts and articles in the area such as: Merrifield, 1997, Methods Enzymol. 289:3-13; Wade et al., 1993, Australas Biotechnol. 3(6):332-336; Wong et al., 1991, Experientia 47(11-12):1123-1129; Carey et al., 1991, Ciba Found Symp. 158:187-203; Plaue et al., 1990, Biologicals 18(3): 147-157; Bodanszky, 1985, Int. J. Pept. Protein Res. 25(5):449-474; H. Dugas and C. Penney, BIOORGANIC CHEMISTRY, (1981) at pages 54-92, all of which are incorporated herein by reference in their entirety. For example, peptides may be synthesized by solid-phase methodology utilizing a commercially available peptide synthesizer and synthesis cycles supplied by the manufacturer. One skilled in the art recognizes that the solid phase synthesis could also be accomplished using the FMOC strategy and a TFA/scavenger cleavage mixture.
If larger quantities of a protein are desired, or if the protein is a larger polypeptide, the protein can be produced using recombinant DNA technology. A protein can be produced recombinantly by culturing a cell capable of expressing the protein (i.e., by expressing a recombinant nucleic acid molecule encoding the protein) under conditions effective to produce the protein, and recovering the protein. Effective culture conditions include, but are not limited to, effective media, bioreactor, temperature, pH and oxygen conditions that permit protein production. An effective medium refers to any medium in which a cell is cultured to produce the protein. Such medium typically comprises an aqueous medium having assimilable carbon, nitrogen and phosphate sources, and appropriate salts, minerals, metals and other nutrients, such as vitamins. Recombinant cells (i.e., cells expressing a nucleic acid molecule encoding the desired protein) can be cultured in conventional fermentation bioreactors, shake flasks, test tubes, microtiter dishes, and petri plates. Culturing can be carried out at a temperature, pH and oxygen content appropriate for a recombinant cell. Such culturing conditions are within the expertise of one of ordinary skill in the art. Such techniques are well known in the art and are described, for example, in Sambrook et al., 1988, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. or Current Protocols in Molecular Biology (1989) and supplements.
As discussed above, a composition, and particularly a therapeutic composition, of the present disclosure generally includes the therapeutic compound (e.g., the compound identified by the structure based identification method) and a carrier, and preferably, a pharmaceutically acceptable carrier. Pharmaceutically acceptable carriers and preferred methods of administration of therapeutic compositions of the present disclosure have been described in detail above with regard to the administration of an inhibitor compound to a patient. Such carriers and administration protocols are applicable to this embodiment.
Another embodiment of the present disclosure relates to a computer for producing a three-dimensional model of a molecule or molecular structure, wherein the molecule or molecular structure comprises a three dimensional structure defined by atomic coordinates of Apo3G-CD2, or a three-dimensional model of a homologue of the molecule or molecular structure, wherein the homologue comprises a three dimensional structure that has an average root-mean-square deviation (RMSD) of equal to or less than about 2.0 Å for the backbone atoms in secondary structure elements in the Apo3G-CD2 protein, wherein the computer comprises: a) a computer-readable medium encoded with the atomic coordinates of the Apo3G-CD2 protein to create an electronic file; b) a working memory for storing a graphical display software program for processing the electronic file; c) a processor coupled to the working memory and to the computer-readable medium which is capable of representing the electronic file as the three dimensional model; and, d) a display coupled to the processor for visualizing the three dimensional model; wherein the three dimensional structure of the APOBEC protein is displayed on the computer.
The following example is put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the present invention, and are not intended to limit the scope of what the inventors regard as their invention nor are they intended to represent that the experiments below are all or the only experiments performed. Efforts have been made to ensure accuracy with respect to numbers used (e. g. amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight average molecular weight, temperature is in degrees Celsius, and pressure is at or near atmospheric. Standard abbreviations may be used, e.g., bp, base pair(s); kb, kilobase(s); pl, picoliter(s); s or sec., second(s); min, minute (s); h or hr, hour(s); and the like.
Deamination Activity of the Apo3G-CD2
We have purified the human wild-type (wt) C-terminal cytidine deaminase domain of Apo3G (Apo3G-CD2, residues 197-380) expressed in E. coli, which is highly soluble and deaminates cytidine to uracil on ssDNA (
Apo3G-CD2 Structure and Comparison to Other Cytidine Deaminases
The Apo3G-CD2 structure was solved through the multi-wavelength anomalous dispersion (MAD) phasing method using Se-Met diffraction data. The 2.3 Å resolution X-ray structure of the Apo3G-CD2 reveals a core β-sheet that is composed of five β-strands surrounded by six α-helices (
The Apo3G-CD2 structure shows similar core structural features as other cytidine deaminases within the superfamily of zinc-coordinating deaminases (Conticello et al., 2007b). All high resolution structures of cytidine deaminases have a typical core β-sheet consisting of five β-strands (
What differentiates the APOBEC structures from other known Zn-deaminase structures are the number and positions of the surrounding helices. The X-ray structures of A3G-CD2 and Apo2 have six surrounding helices that have the same spatial arrangement (
An analysis of the Zn-deaminase structures reveals that helices surrounding the β-sheet core dictate oligomerization and substrate access to the active site. The active forms of ECDA and ScCDDi are square-shaped dimers and tetramers with active sites that are buried between monomers and are only accessible to free base substrates (
Comparison of the Apo3G and Apo2 Structures
A superposition of the core structures of Apo3G-CD2 and Apo2 monomers exhibits substantial overlap for all six helices and for all five β-strands that are present in all Zn-deaminases (
The AC-Loop 1, which connects h1 with β-strand 1, is located further away from the active site in Apo3G than in Apo2 (
The Apo3G AC-loop 3, which connects the β2 strand with h2, is also located further away from the active site Zn than that of Apo2 (
Comparison of the Apo3G-CD2 X-Ray Structure with the Apo3G-2K3A NMR Structure
A recently reported NMR structure of an Apo3G CD2 mutant (called Apo3G-2K3A) resembles the X-ray structure of the wt Apo3G-CD2 (Chen et al., 2008). However, the structural superposition of the two structures reveals some significant differences (
It should be noted that the NMR CD2 fragment (residue 198-384) carries five point mutations created to solve the protein solubility problem for the NMR study (Chen et al., 2008), whereas the A3G-CD2 protein (residue 197-380) reported here contains no mutations because this fragment is highly soluble as the wt sequence. Two of the five mutations in the NMR CD2 structure are located on both ends of the β2 strand (
The Active Site of Apo3G-CD2
The deamination activity of Apo3G-CD2 involves a canonical type of zinc coordination where the active center Zn atom is coordinated by three residues His257, Cys288 and Cys291 and a water molecule located at a hydrogen bond distance from the Zn atom (
Surprisingly, the Apo3G AC-loop 3 and the two residues (N244 and H257) on this loop display a remarkable structural conservation with many distantly related Zn-deaminases, specifically TadA and hCDA (Chung et al., 2005; Losey et al., 2006) (
In all three enzymes, the conserved asparagine residue (Apo3G N244, TadA N42, hCDA N54) is located at the beginning of AC-loop 3 and immediately follows the last residue of the β2 strand (C243 in Apo3G) (
Structural Features Important for ssDNA Binding
In addition to the AC-loop 3 conformation and the residues N244 and H257 on the loop mentioned above, the Apo3G X-ray structure reveals other structural features for binding ssDNA substrate around the active site. First, a pocket generated by the open loop conformation around the active site has ample space to accommodate ssDNA (
Apo3G Mutations Affecting DNA Binding and Deamination Activities
To correlate the structure and function of Apo3G, mutations of the residues predicted to be involved with binding DNA were constructed in the context of full-length Apo3G. The impact of these mutations on ssDNA binding and deamination activity was examined (
The structure displays the hydrophobic residues W285 and Y315 on the floor of an open pocket and the F289 on the edge of the same open pocket. These residues could stack with the bases of ssDNA and position the DNA into the active site. The Apo3G mutants, W285A and Y315A, have no detectable deamination activity (
The structure reveals that some of the positively charged arginines (R256, R215 and R313) around the active site establish elaborate bonding networks and should play an important structural role by maintaining the proper conformation of the active center for DNA binding and deamination. Therefore it is not likely that these residues directly bind DNA. As discussed earlier, R256 plays a role in stabilizing the AC-loop 3 open conformation for substrate access through interactions with D264 and F252 (
DNA Binding Groove of Apo3G
A surface representation of the Apo3G-CD2 X-ray structure reveals a spacious groove running across the active center pocket (
Molecular surface representation of the Apo3G-CD2 structure shows a small exposed area of the zinc atom from the pocket side (below the Zn), where the activating water molecule is located (
This DNA-binding groove model differs from the recently proposed ‘brim-domain” model based on the A3G-2K3A NMR structure (Chen et al., 2008). For ease of comparison, we maintained the same orientation previously used to describe the brim-domain model to present both the X-ray structure A3G-CD2 (
Comparing the surface features of the X-ray structure (
Models of Full-Length Apo3G and Oligomerization
A full-length Apo3G structure containing both CD1 and CD2 domains can be modeled based on the close similarity of the Apo3G-CD2 structure with Apo2 (
We have described the high-resolution structural features of Apo3G-CD2. The structure reveals that Apo3G-CD2 has the same core fold as Apo2 and other cytidine deaminases, all of which contain a β-sheet core composed of five β-strands. However, what differentiates the APOBEC structures from those of other zinc coordinating deaminases is the positioning of the surrounding helices and loops, which may account for some of the differences in assembly, substrate specificity, and regulation by other co-factors. The helices in Apo3G and Apo2 determine how the deaminase can oligomerize, which in turn influences how accessible the active site is to larger polynucleotide substrates. Both structures have a similar h4, h6 and a long β2 strand, of which the former two can prevent the canonical square-shaped oligomerization but facilitate an elongated oligomer formation. Furthermore, the X-ray structure of Apo3-CD2 reveals a deep groove across the active center, and mutagenesis has identified residues around this “substrate-groove” that play critical roles in substrate specificity, in ssDNA substrate binding, and in deaminase activity. The results of the Apo3G-CD2 structure and its analysis reported here will provide a basis to pursue further structural and functional studies of Apo3G and other APOBEC proteins that will facilitate our understanding of their important biological functions, such as how they interact with nucleic acid substrates for deamination, how their activity is regulated, and how they restrict HIV and other viral pathogens.
Protein Purification and Crystallization
Apo3G-CD2 was expressed and purified as a recombinant GST-fusion protein in Escherichia coli. Purified GST-fusion protein was digested by PreScission Protease. Further purification of the Apo3G-CD2 protein was completed with Superdex-75 gel filtration chromatography in 50 mM Hepes pH 7.0, 250 mM NaCl and 1 mM DTT. Native and selenium-methionine labeled protein were concentrated to 25 mg mL−1. Crystals were grown at 18° C. by hanging-drop vapor diffusion from a reservoir solution of 100 mM MES pH 6.5, 40% PEG 200.
Structure Determination and Refinement
Selenium substituted methionine protein crystals were used for collecting Se-MAD data using the ALS synchrotron beam source. Data were processed with HKL3000 (Otwinowski and Minor, i997). A total of 3 selenium and 1 zinc sites were located by the SHELXD (Schneider and Sheldrick, 2002) program using MAD data between 50-3.0 Å resolution range. The SHARP program was used to calculate the experimental and model-combined phases using the MAD data in the resolution range of 50-2.3 Å as well as for density modification. The model was built with O using the high quality electron density map obtained, and was refined with CNS to 2.3 Å resolution with excellent statistics. The final refinement statistics and geometry as defined by Procheck were in good agreement and are summarized in Table 1. Structure figures were designed using PyMOL (DeLano, 2002).
Construction of Apo3G Mutants
Mutant Apo3G proteins (D316R1D3 17R, R3 13E/R320D, and R374E/R376D) were constructed by site-directed mutagenesis using the pAcG2T-Apo3G vector as the template. The following primers and their complementary strands were used: 5′ctt cac tgc ccg cat cta tag aag aca agg aag atg tca gga g 3′ (D3 16R/D3 17R), 5′ctg tgc atc ftc act gcc gag atc tat gat gat caa gga gat tgt cag gag ggg ctg cgc 3′ (R313E/R320D), and 5′gag cac agc caa gac ctg agt ggg gag ctg gac gcc aft ctc cag aat cag g 3′ (R374E/R376D). The entire coding region of Apo3G mutant constructs was verified by DNA sequencing. The mutant plasmids were then cotransfected, according to the manufacturer's protocol, with linearized baculovirus DNA (BD Biosciences) to generate recombinant mutant Apo3G baculovirus. Wild-type and mutant Apo3G expression in Sf9 insect cells and purification was carried out as described previously (Chelico et al., 2008). Mutant E. coli GST-Apo3G proteins (R213E, R215E, K249E, R256E, W285A, F289A, Y315A) were constructed by site directed mutagenesis using the pGEX-6P1-GST-Apo3G vector as the template. The following primers and their complementary strands were used: 5′ aat gaa cct tgg gil gaa ggt cgt cac gag act tac 3′ (R213E), 5′ gaa ccttgg gil cgt ggt gaa cac gag acttac ctg 3′ (R215E), 5′ tgt aac cag gcc ccg cac gag cac ggt ttt ctg gaa 3′ (K249E), 5′ g cac ggt ttt ctg gaa ggt gaa cac gcc gaa ctg tg 3′ (R256E), 5′ gil acc tgc ttt acc tct gcg tcc ccg tgc ttt tcc 3′ (W285A), 5′ acc tct tgg tcc ccg tgc get tcc tgc gca caa gaa 3′ (F289A), 5′ atc ftc act gca cgt aft gcc gac gac cag ggc cgt 3′ (Y315A). The entire coding region of Apo3G mutant constructs was verified by DNA sequencing. Plasmids were expressed in XA-90 E. coli cells and were lysed by French press. Further purification was carried out as described previously (Chelico et al., 2008).
DNA Binding
Apo3G-DNA binding were monitored by changes in steady state fluorescence depolarization (rotational anisotropy). Reaction mixtures (70 μl), containing an F-labeled DNA (SO nM) in buffer (50 mM HEPES, pH 7.3, 1 mM DTT and 5 mM MgCl2) and varying concentration of 0 to 500 nM Apo3G, were incubated at 37° C. The sequence of the ssDNA is: tta gat gag tgt aa(FdT) gtg ata tat gtg tat. Rotational anisotropy was measured as described previously (Chelico et al., 2006). The fraction of DNA bound to protein was determined as described previously (Bertram et al., 2004).
Deamination Activity
Apo3G (0.024-μM) was allowed to react with 500 nM FdT incorporated ssDNA for 10 or 15 mm and subsequently treated with UDG and resolved on 16% UREA PAGE for analysis as described previously10. Specific activity, measured as fmoles substrate deaminated per pg enzyme per minute, was calculated from the percent deamination of an ssDNA substrate over a range of enzyme concentrations. For experiments measuring processivity and directionality the ssDNA substrate sequence is: 5′ aaa gag aaa gtg ata ccc aaa gag taa agt (FdT) aga tag aga gtg ata ccc aaa gag taa agt tag taa gat gtg taa gta tgt taa 3′. For specific activity measurements the ssDNA substrate sequence is: gg (FdT) agt tta gtg gtt tgt ata gaa tta ata ccc aaa gaa gtg tat gta att gtt atg ata aga ttg aaa.
Bertram, J. G., Bloom, L. B., ODonnell, M., and Goodman, M. F. (2004). Increased dNTP binding affinity reveals a nonprocessive role for Escherichia coli beta clamp with DNA polymerase IV. Journal of Biological Chemistry 279, 33047-33050.
Betts, L., Xiang, S., Short, S. A., Wolfenden, R., and Carter, C. W. (1994). Cytidine deaminase. The 2.3 A crystal structure of an enzyme: transition-state analog complex. Current Biology 235, 635-656.
Bogerd, H. P., Doehle, B. P., Wiegand, H. L., and Cullen, B. R. (2004). A single amino acid difference in the host APBEC3G protein controls the primate species specificity of HIV type 1 virion infectivity factor. Proc Natl Acad Sci 101, 3770-3774.
Bransteitter, Sneeden, J. L., Allen, S., Pham, P., and Goodman, M. F. (2006). First AID (Activation-induced Cytidine Deaminase) Is Needed to Produce High Affinity Isotype-switched Antibodies. Journal of Biological Chemistry 281, 16833-16836.
Burnett, A., and Spearman, P. (2007). APOBEC3G Multimers Are Recruited to the Plasma Membrane for Packaging into Human Immunodeficiency Virus Type 1 Virus-Like Particles in an RNA-Dependent Process Requiring the NC Basic Linker. Journal of Virology 81, 5000-50 13.
Chelico, L., Sachio, E J., Erie, D A., and Goodman, M. F. (2008). A model for oligomeric regulation of APOBEC3G cytosine deaminase-dependent restriction of HIV. Journal of Biological Chemistry printed online, 10.1074/jbc.M801004200.
Chelico, L., Pham, P., Calabrese, P., and Goodman, M. F. (2006). APOBEC3G DNA deaminase acts processively 3 to S on single-stranded DNA. Nature Structural and Molecular Biology 13, 392-399.
Chen, K. M., Harjes, E., Gross, P. J., Fahmy, A., Lu, Y., Shindo, K., Harris, R. S., and Matsuo, H. (2008). Structure of the DNA deaminase domain of the HIV-1 restriction factor APOBEC3G. Nature 452, 116-119.
Chen, K. M., Martemyanova, N., Lu, Y., Shindo, K., Matsuo, H., and Harris, R. 5. (2007). Extensive mutagenesis experiments corroborate a structural model for the DNA deaminase domain of APOBEC3G. Federation of European Biochemical Societies 581, 4761-4766.
Chiu, Y. L., and Greene, W. C. (2007). The APOBEC3 Cytidine Deaminases: An Innate Defensive Network Opposing Exogenous Retroviruses and Retroelements. Annu Rev Immunol 26, 317-353.
Chung, S. J., Fromme, J. C., and Verdine, G. L. (2005). Structure of human cytidine deaminase bound to a potent inhibitor. Journal of Medicinal Chemistry 48, 658-660.
Conticello, S. G., Langlois, M. A., and Neuberger, M. S. (2007a). Insights into DNA deaminases. Nature Structural and Molecular Biology 14, 7-9.
Conticello, S. G., Langlois, M. A., Yang, Z., and Neuberger, M. S. (2007b). DNA Deamination in Immunity: AID in the Context of Its APOBEC Relatives. Advances in Immunology 94, 37-73.
Dance, G. S., Beemiller, P., Yang, Y., Mater, D. V., Mian, I. S., and Smith, H. S. (2001). Identification of the yeast cytidine deaminase CDD1 and an orphan C→U RNA editase. Nucleic Acids Research 29, 1772-1780.
DeLano, W. L. (2002). The PyMOL Molecular Graphics System, DeLano Scientific).
Guo, F., Cen, S., Niu, M., yANG, y., Gorelick, R., and Kleiman, L. (2007). The Interaction of APOBEC3G with Human Immunodeficiency Virus Type 1 Nucleocapsid Inhibits tRNA3Lys Annealing to Viral RNA. Journal of Virology 81, 11322-11331.
Holmes, R. K., Malim, M. H., and Bishop, K. N. (2007). APOBEC-mediated viral restriction: not simply editing? Trends in Biochemical Sciences 32, 118-128.
Huthoff, H., and Malim, M. H. (2007). Identification of amino acid residues in APOBEC3G required for regulation by human immunodeficiency virus type 1 Vif and Virion encapsidation. Journal of Virology 81,3807-3815.
Iwatani, Y., Chan, D. S. B., Wang, F., Maynard, K. S., Sugiura, W., Gronenborn, A. M., Rouzina, I., Williams, M. C., Musier-Forsyth, K., and Levin, J. G. (2007). Deaminase-independent inhibition of HIV-1 reverse transcription by APOBEC3G. Nucleic Acids Research 35, 7096-7108.
Jarmuz, A., Clhester, A., Bayliss, J., Gisbourne, J., Dunham, I., Scott, J., and Navaratnam, N. (2002). An anthropoid-specific locus of orphan C to U RNA-editing enzymes on chromosome 22. Genomics 79, 285-296.
Losey, H., Ruthenburg, A. J., and Verdine, G. L. (2006). Crystal structure of Staphylococcus aureus tRNA adenosine deaminase TadA in complex with RNA. Nature Structural and Molecular Biology 13, 153-159.
Luo, K., Wang, T., Liu, B., Tian, C., Xiao, Z., Kappes, J., and Yu, X. (2007). Cytidine Deaminases APOBEC3G and APOBEC3F Interact with Human Immunodeficiency Virus Type 1 Integrase and Inhibit Proviral DNA Formation. Journal of Virology 81, 7238-7248.
Mangeat, B., Turelli, P., Liao, S., and Trono, D. (2004). A Single Amino Acid Determinant Governs the Species-specific Sensitivity of APOBEC3G to Vif Action. Journal of Biological Chemistry 15, 14481-14483.
Mariani, R., Chen, D., Schrofelbauer, B., Navarro, F., Konig, R., Bollman, B., Munk, C., Nymark-McMahon, H., and Landau, N. R. (2003). Species-specific exclusion of APOBEC3G from HIV-1 virions by Vif. Cell 114, 21-31.
Mbisa, J. L., Barr, R., Thomas, J. A., Vandegraaff, N., Dorweiler, I. J., Svarovskaia, E. S., Brown, W. L., Mansky, L. M., Gorelick, R. J., Harris, R. S., et al. (2007). Human Immunodeficiency Virus Type 1 cDNAs Produced in the Presence of APOBEC3G Exhibit Efects in Plus-Strand DNA Transfer and Integration. Journal of Virology 81, 7099-7110.
Miyagi, E., Opi, S., Takeuchi, H., Khan, M., Goila-Gaur, R., Kao, S., and Strebel, K. (2007). Enzymatically Active APOBEC3G Is Required for Efficient Inhibition of Human Immunodeficiency Virus Type 1. Journal of Virology 81, 13346-13353.
Navaratnam, N., Fujino, T., Bayliss, J., Jarmuz, A., How, A., Richardson, N., Somasekaram, A., Bhattacharya, S., Carter, C., and Scott, J. (1998). Escherichia coli cytidine deaminase provides a molecular model for ApoB RNA editing and a mechanism for RNA substrate recognition. Journal of Molecular Biology 275, 695-714.
Otwinowski, Z., and Minor, W. (1997). Processing of X-ray Diffraction Data Collected in Oscillation Mode. Methods in Enzymology 276, 307-326.
Peled, J. U., Kuang, F. L., Iglesias-Ussel, M. D., Roa, S., Kalis, S. L., Goodman, M. F., and Scharff, M. D. (2007). The Biochemistry of Somatic Hypermutation. Annu Rev Immunol 26, 481-511.
Prochnow, C., Bransteitter, R., Klein, M. G., Goodman, M. F., and Chen, X. 5. (2007). The APOBEC-2 crystal structure and functional implications for the deaminase AID. Nature 445, 447-451.
Schneider, T. R., and Sheldrick, G. M. (2002). Substructure solution with SHELXD. Acta Crystallographica 58, 1772-1779.
Schumacher, A. J., Hache, G., Macduff, D. A., Brown, W. L., and Harris, R. 5. (2008). The DNA deaminase activity of human APOBEC3G is required for Tyi, MusD, and human immunodeficiency virus type 1 restriction. Journal of Virology 82, 2652-2660.
Suspene, R., Sommer, P., Henry, M., Ferris, S., Guetard, D., Pochet, S., Chester, A., Navaratnam, N., Wain-Hobson, S., and Vartanian, J. P. (2004). APOBEC3G is a single-stranded DNA cytidine deaminase and functions independently of HIV reverse transcriptase. Nucleic Acids Research 32, 242 1-2429.
Wang, W., and Malcolm, B. A. (1999). Two-stage PCR protocol allowing introduction of multiple mutations, deletions and insertions using QuikChange site-directed mutatgenesis. BioTechniques 26, 680-682.
Wedekind, J. E., Dance, G. S., Sowden, M. P., and Smith, H. C. (2003). Messenger RNA editing in mammals: new members of the APOBEC family seeking roles in the family business. Trends in Genetics 19, 207-216.
Wedekind, J. E., Gillilan, R., Janda, A., Krucinska, J., Salter, J. D., Bennett, R. P., Raina, J., and Smith, H. C. (2006). Nanostructures of APOBEC3G support a hierarchical assembly model of high molecular ribonucleoprotein particles from dimeric subunits. Journal of Biological Chemistry 281, 38122-38126.
Xiao-Yu, L., Guo, F., Zhang, L., and Kleiman, L. (2007). APOBEC3G Inhibits DNA Strand Transfer during HIV-1 Reverse Transcription. Journal of Biological Chemistry 282, 32065-32074.
Xie, K., Sowden, M. P., Dance, G. S., Torelli, A. T., Smith, H. C., and Wedekind, J. E. (2004). The structure of a yeast RNA-editing deaminase provides insight into the fold and function of activation-induced deaminase and APOBEC-1. Proc Natl Acad Sci 101, 8114-8119.
Xu, H., Svarovskaia, E. S., Barr, R., Zhang, Y., Khan, M. A., Strebel, K., and Pathak, V. K. (2004). A single amino acid substitution in human APOBEC3G antiretroviral enzyme confers resistance to HIV-1 virion infectivity factor-induced depletion. Proc Natl Acad Sci 101, 5652-5657.
Yu, Q., Konig, R., Pillai, S., Chiles, K., Kearney, M., Palmer, S., Richman, D., Coffin, J. M., and Landau, N. R. (2004). Single-strand specificity of APOBEC3G accounts for minus-strand deamination of the HIV genome. Nature Structural and Molecular Biology 11, 435-442.
Zhang, K., Mangeat, B., Ortiz, M., Zoete, V., Trono, D., Telenti, A., and Michielin, 0. (2007). Model Structure of Human APOBEC3G. Public Library of Science Biology 2, 1-7.
While the apparatus and method have been described in terms of what are presently considered to be the most practical and preferred embodiments, it is to be understood that the disclosure need not be limited to the disclosed embodiments. It is intended to cover various modifications and similar arrangements included within the spirit and scope of the claims, the scope of which should be accorded the broadest interpretation so as to encompass all such modifications and similar structures. The present disclosure includes any and all embodiments of the following claims.
This application claims priority to U.S. Provisional Application No. 61/089,141, filed Aug. 15, 2008, the entire contents of which are incorporated herein. This application is related to U.S. Application No. 61/016,172, filed on Dec. 21, 2007.
This invention was made with government support under Contract No. R01 AI050096 awarded by the National Institutes of Health. The government has certain rights in this invention.
Number | Date | Country | |
---|---|---|---|
61089141 | Aug 2008 | US |