Throughout this application, various publications are referenced. Full bibliographic citations for these publications are found at the end of the specification immediately preceding the claims. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art known to those skilled therein as of the date of the invention described and claimed herein.
Human Immunodeficiency Virus (HIV) is the primary cause of Acquired Immunodeficiency Syndrome (AIDS) (Barre-Sinoussi, Chermann et al. 1983; Gallo, Salahuddin et al. 1984). Today, twenty antiretroviral drugs have been approved by FDA for clinical treatment of AIDS (De Clercq 2005). Most of them target either the reverse transcriptase or the protease of HIV with one exception: enfuvirtide that targets virus fusion. Although the mortality of the HIV-infected patients has been largely decreased by HAART (highly active antiretroviral therapy) (Richman 2001), emergency of drug-resistant virus and drug toxicity problems demand the search for novel antiretroviral drugs.
As the first step of HIV life cycle that precedes cellular infection, the elements of virus entry are attractive antiviral targets. The entry of the virus is mediated by the specific interactions between viral envelope glycoproteins and host cell surface receptors. The virus envelope glycoprotein complex is a trimer (Chan, Fass et al. 1997; Tan, Liu et al. 1997; Weissenhorn, Dessen et al. 1997) consisting of three pairs of gp41 and gp120, both derived by cleavage of precursor gp160 (Allan, Coligan et al. 1985; Robey, Safai et al. 1985). gp41 is a membrane protein, and gp120 attaches to the virion through non-covalent interaction with gp41 (Helseth, Olshevsky et al. 1991). Sequence analysis of gp120s from HIV-1, HIV-2 and SIVs identifies five conserved regions (C1 to C5) and five variable regions (V1 to V5) (Starcich, Hahn et al. 1986; Modrow, Hahn et al. 1987).
HIV first attaches to host cell surface through gp120's recognition of CD4, a glycoprotein on the surface of the host cell (Dalgleish, Beverley et al. 1984; Klatzmann, Champagne et al. 1984). The molecular details of this interaction have been revealed by X-ray crystal structures of various core gp120 proteins (Kwong, Wyatt et al. 1998; Kwong, Wyatt et al. 2000; Huang, Tang et al. 2005) from three different HIV strains in complex with D1D2 (the first two immunoglobulin-like domains of sCD4) and a Fab fragment of antibody, 17B or X5. In these complexes, D1 domain of CD4 binds into a depression on the core gp120 formed by all three domains of gp120 including inner domain, outer domain, and a bridging β-sheet structure that appears to require the interaction of CD4 for its integrity. A separate thermodynamic analysis also shows a unusually large structural rearrangement of both gp120 and the core gp120 upon CD4 binding (Myszka, Sweet et al. 2000). In contrast, the structure of D1D2 (Ryu, Kwong et al. 1990; Wang, Yan et al. 1990; Ryu, Truneh et al. 1994; Wu, Kwong et al. 1997) is essentially unchanged in the presence of gp120. The primary binding site of CD4 is located in the second complementarity-determining region (CDR2) of D1 domain. Although twenty-two residues of CD4 are involved in gp120 binding, 63% of all contacts come from residue 40-48 of CD4. Among them, Phe43 alone contributes 23% of the total interactions. Another CD4 determinant at the interface is residue Arg59, contributing two hydrogen-bonds with gp120 (Kwong, Wyatt et al. 1998). At the interface of gp120-CD4 in all structures, a deep hydrophobic cavity enclosed by conserved gp120 residues has been identified and Phe43 is the only CD4 residue that contacts it (
CD4 binding induces extensive structural rearrangements in gp120, resulting the exposure of binding surface for a second host cell chemokine receptor, CCR5 or CXCR4 (Trkola, Dragic et al. 1996; Wu, Gerard et al. 1996). The following engagement of gp120 with the chemokine receptor triggers further conformational changes in gp120-associated gp41, which then releases its “fusion peptide” (Kowalski, Potz et al. 1987) for insertion into target cell membranes and ultimately mediates virus-cell membrane fusion (Lu, Blacklow et al. 1995; Chan, Fass et al. 1997; Weissenhorn, Dessen et al. 1997).
Entry inhibitors target one of the following steps in the virus entry: viral attachment by gp120-CD4 interaction, coreceptor binding, and fusion between virus and host cell (Kilby and Eron 2003). Enfuvirtide, a small peptide derived from gp41, is the only available entry drug targeting viral fusion step by inhibiting the formation of the “six-helix bundle” during fusion (Furuta, Wild et al. 1998). There are also many drug candidates under clinical development, which target the other two steps of viral entry. Most of the inhibitors of coreceptor binding are small molecules that bind either CCR5 or CXCR4 by mimicking the natural ligands of the receptors; on the contrary, most of potent gp120-CD4 inhibitors identified to data are proteins or peptides (Vermeire and Schols 2005).
A large fraction of gp120-CD4 inhibitors are gp120-directed while some of them, such as PRO 2000, a naphthalene polyanion that binds CD4, CD3, and CD8 (Rusconi, Moonis et al. 1996, Milligan, Chu et al. 2004) are CD4-directed. Three strategies have been used to develop gp120-directed inhibitors: rational design of CD4 mimics, peptide phage display, and high-throughput screening. CD4-based gp120-targeting inhibitors range from gp120 antibody IgG1 b12 (Burton, Pyati et al. 1994), fusion protein of CD4 with IgG2 (PRO 542) (Allaway, Davis-Bruno et al. 1995) to CD4 miniproteins, which are scorpion toxin-based mimetics that have CDR2 loop of CD4 transplanted into toxin scaffold. The most successful inhibitor of the latter kind is CD4M33, a 27-amino acid mimetic that inhibit the interaction of gp120 and CD4 at nanomolar concentration (Martin, Stricher et al. 2003). This mimetic uses a bi-phenyl group instead of phenyl at the position corresponding to Phe43 of CD4 and structure of CD4M33 in complex with gp120:17b reveals the binding site of the additional phenyl as the Phe43 cavity (Huang, Stricher et al. 2005). There is also sCD4-17b, a single-chain chimeric protein of D1D2 and 17b, capable of targeting both CD4 and co-receptor sites on gp120 (Dey, Del Castillo et al. 2003). Random peptide libraries screening based on phage display has led to the discovery of a peptide 12p1 that blocks gp120's interaction with both CD4 and 17b with micromolar IC50 (Ferrer and Harrison 1999). Screening of extracts from cultured cyanobacteria identified cyanovirin-N (Boyd, Gustafson et al. 1997), an 11-kDa protein, which inhibits both CD4 and coreceptor by interacting with high-mannose glycans on gp120. Screening of small compound library, however, has yet to identify any potent candidate. BMS-378806, a small molecule with high anti-entry activity, was initially identified by a viral-infection-based screen and had been shown to block CD4-gp120 interaction by binding gp120 (Guo, Ho et al. 2003; Lin, Blair et al. 2003; Wang, Zhang et al. 2003). New evidence, however, indicated that it exerts its inhibitory function on entry through blocking the CD4 induction of fusion-driving conformation in gp41 (Si, Madani et al. 2004). Study on BMS-378806 escape mutants of gp120 suggests a possible binding site of the compound near Phe43 cavity (Madani, Perdigoto et al. 2004).
The difficulty in identifying a small molecule inhibiting gp120-CD4 interaction with sub-micromolar IC50 is not surprising. Protein-protein interaction has long known to be attractive but not straightforward drug target due to rather flat features of protein-protein interface (Cochran 2000). Interfacial hydrophobic pocket like Phe43 cavity in gp120, however, could be binding site for small molecules that block protein-protein binding either by direct steric effect or through allosteric mechanism. A good example can be found in the case of rhinoviruse, where compounds targeting the viral protein 1 (VP1) bind into the hydrophobic pocket just beneath the canyon floor, which is important in cellular receptor binding (Chapman, Minor et al. 1991; Zhang, Nanni et al. 1993).
Conventional high-throughput screening is only strong in identifying medium-affinity (low μM to nM) compounds, but relatively small size of Phe43 cavity (152 Å3) (Kwong, Wyatt et al. 1998) as well as large unfavorable entropic change involved in forming this cavity (Myszka, Sweet et al. 2000), have made identification of small molecules targeting this site with medium-high affinity extremely difficult.
This invention provides a soluble polypeptide consisting of a portion of CD4 comprising all HIV gp120-binding epitopes present on intact CD4, wherein the polypeptide has a cysteine substitution at a residue which, in intact CD4, interfaces with HIV gp120.
This invention provides a soluble polypeptide comprising (i) a portion of CD4 comprising all HIV gp120-binding epitopes present on intact CD4, wherein the polypeptide has a cysteine substitution at a residue which, in intact CD4, interfaces with HIV gp120, and (ii) a chemical moiety bound to the CD4 portion at the cysteine substitution via a thiol bond.
This invention provides a method for making a derivatized soluble polypeptide comprising contacting, under suitable conditions, (a) a thiol-reactive reagent with (b) a portion of CD4 comprising all HIV gp120-binding epitopes present on intact CD4, wherein the polypeptide has a cysteine substitution at a residue which, in intact CD4, interfaces with HIV gp120.
This invention provides a method for obtaining a structural model useful in the design of an agent for inhibiting CD4 binding to HIV gp120 comprising (a) identifying a soluble polypeptide of claim 5 which binds to HIV gp120 with an affinity comparable to or greater than the affinity with which intact CD4 binds to HIV gp120; and (b) obtaining a three-dimensional structure of the identified polypeptide while it is bound to HIV gp120, thereby obtaining a structural model useful in the design of an agent for inhibiting CD4 binding to HIV gp120.
This figure shows a design of modified D1D2F43C for targeting the gp120 Phe43 cavity.
This figure shows the modification of F43C of D1D2 by haloacetamides, halopropanones or 5-nitro-2-pyridinesulfenyl reagents.
This figure shows representative curves for the inhibition of gp120-CD4 binding by D1D2F43C derivatives. Legend: ▪=D1D2 control; □=D1D2F43C-Iodoacetamide; =D1D2F43C-10; ◯=D1D2F43C; ♦=D1D2F43C-19; ⋄=D1D2F43C-DN52.
This figure shows distribution of the IC50 values of the D1D2F43C derivatives derived from both libraries.
These figures show comparisons of IC50 values of D1D2F43C derivatives on binding of D1D2 to YU2 FL gp120 to that on the binding of D1D2 to YU2 375S/W & 257T/S gp120.
This figure shows the correlation between the sizes of the compounds and the folds of IC50 values of their derivatives increased from wild type gp120 to S375W/T257S gp120
This figure shows probing of the Phe-43 pocket: binding of chemically modified CD4 to HIV gp120. Phe-43 of CD4 replaced by Cys-43. Chemical modification of Cys-43 by S-alkylation with bromoacetamides. Effects of different substituents at position 43 on gp120 binding. X-ray structures of derivatized CD4-gp120 complexes. Over 100 bromoacetamides have been prepared in the Smith laboratory.
This figure shows probing of the Phe-43 pocket: binding of chemically modified CD4 to HIV gp120. The synthesis of bromoacetamides-1-7 steps from commercially available starting material.
This figure shows binding of chemically modified CD4 to HIV gp120.
This figure shows binding of chemically modified CD4 to HIV gp120: structure-affinity relationship. Branching at P5 (except cyclohexane and aromatic group) disfavors binding.
This figure shows binding of chemically modified CD4 to HIV gp120: structure-affinity relationship. Electronic effect and substitution pattern.
This figure shows binding of chemically modified CD4 to HIV gp120: structure-affinity relationship. Other aromatic groups shown. With few exceptions, most chemically modified CD4 have similar binding affinities as native CD4 (A “flat” SAR). Phe43 cavity is able to accommodate changes in substituents.
These figures show binding of chemically modified CD4 to HIV gp120: X-ray structures of derivatized CD4-gp120 complexes.
These figures show binding of chemically modified CD4 to HIV gp120: X-ray structures of derivatized CD4-gp120 complexes. Internal plasticity of the cavity: volume of Phe43 pocket expands to accommodate structural changes.
This figure shows current design and synthetic efforts such as introduction of additional non-covalent interactions: newly discovered H2O sites; extension into water channels; and crystallization and structural determination of additional complexes.
This figure shows X-ray diffraction-derived structural data for complexes of derivatized CD4 fragments and gp120, namely HX-SNS-10.
This figure shows X-ray diffraction-derived structural data for complexes of derivatized CD4 fragments and gp120, namely HX-SNS-14.
This figure shows X-ray diffraction-derived structural data for complexes of derivatized CD4 fragments and gp120, namely HX-SNS-40.
This figure shows X-ray diffraction-derived structural data for complexes of derivatized CD4 fragments and gp120, namely HX-DN-234.
This figure shows a summary of HXBc2 core gp120:17b:CD4-derivative complexes (abbreviated as HX-compound) in comparison with the wild type gp120:17b:CD4 complex (HX-WT) (Kwong et al. 2000). Ribbon diagram of gp120 bound by a chemically derivatized CD4-D1D2 protein. The chemical group attached to Cα of residue 43 of CD4 is represented as “R”, which is positioned right in the Phe43 cavity of gp120. Fab fragment of 17b is removed from the figure for clarity.
This figure shows Fo-Fc electron densities (2.5σ, blue) of modified Cys43 in comparison with the Phe43 cavity surface (red) in HX-WT complex. The structures of differently modified Cys43 from CD4 for all four HX-compound complexes and the structure of Phe43 in HX-WT complex (PDB-ID: 1RZJ) are shown as sticks, whereas gp120 (gold) and CD4 (hot pink) are shown as ribbons. In the stick models, carbon, nitrogen, oxygen, and sulfur atoms are colored green, blue, red, and yellow respectively. The electron densities for the four HX-compound complexes are obtained from simulated-annealing (10K) omit maps calculated by removing all chemical entities linked to position 43 starting from sulfur of the cysteines. The orientations of all figures are the same as that in
These figures show two different binding modes for interaction of CD4 derivatives and CD4M33 with Phe43 cavity.
These figures show the extensive interactions between the Phe43 cavity and the derivatized D1D2, enlarged Phe43 cavity and expanded water channel in gp120 complexed with D1D2 derivatives.
These figures show the surface complementarity between the Phe43 cavity in gp120 and derivatized CD4 from different HX complexes. For clarity, the molecular surface of gp120 are shown in both transparent and mesh representations in cyan. The side chains of (derivatized) residue 43 are depicted as solid surfaces in magenta. The Cα traces of D1D2/D1D2 derivatives are also in magenta. The viewing angle of
This figure shows the superimpositions of Cα traces of gp120 bound to D1D2 or its derivatives. Only gp120 regions close to the Phe43 cavity are shown and they are colored in white, blue, orange, green, and pink for gp120D1D2, gp120SNS-10, gp120SNS-14, gp120SNS-40, and gp120DN-234. The superimpositions are based on the Cα atoms of invariant regions of gp120 identified by ESCET. For simplicity, only the side chain of modified Cys43 of D1D2F43C-DN-234 is shown in sticks (magenta).
This figure shows distance-sorted error-scaled difference-distance matrices for selected pairs of different gp120 structures extracted from their complexes with D1D2 or its derivatives. WT, SNS-10, SNS-14, SNS-40, and DN-234 stand for gp120D1D2, gp120SNS-10, gp120SNS-14, gp120SNS-40, and gp120DN-234 respectively. gp120 residues were first sorted in an ascending order by the distances between their Cα atoms and the center of Phe43 cavity (defined by the C4 atom of the phenyl ring of residue 43 in D1D2F43C-SNS-10). The Cα atoms of all 273 residues (85-126, 196-297, 330-392, 415-459, and 471-491), which do not belong to the variable regions V1-V5 (Modrow et al. 1987; Leonard et al. 1990), were used for calculation of error-scaled difference-distance matrices. Because these matrices are symmetrical, only half of them (either the upper right or the low left half) are shown. For each matrix between a pair of gp120 structure “a” and “b”, the matrix element Eijab was calculated by the equation Eijab=Δijab/σ(Δijab)=(|ria−rjb|)/σ(Δijab), where Δijab stands for the difference distance of a pair of atom i and j between model “a” and “b”; ria denotes the Cartesian coordinate vector of atom i in model “a”; and σ(Δijab) is the estimated standard derivation for the matrix elements derived from the quality of the diffraction data and atomic B factors (Schneider 2000). Matrix elements are colored according to the bar at the bottom of figure: elements with absolute value less than 1.3σ(Δijab) are colored grey; elements between 1.3σ(Δijab) and 4σ(Δijab) are colored by the color gradients—blue for negative changes (expansion of distance between atom i and j in model “b” with respect to “a”) and red for positive changes (contraction); elements larger than 4Δijab (negative or position) are shown as full blue or red respectively.
These figures show the flexible regions in gp120.
This figure shows the gp120-CD4 interface in HX-DN-234 complex.
This figure shows the gp120-17b interface. gp120 is shown as ribbon and colored similarly as in
This figure shows the thermodynamic cycles of binding of 17b and D1D2 (black)/D1D2F43C-SNS-10 (blue)/D1D2F43C-DN-234 (red) to YU2 gp120.
This figure shows the pathway for motion propagation of gp120 residues in binding D1D2F43C-DN-234. Selected gp120 residues that display plasticity in binding D1D2F43C-DN-234 are same as shown in
These figures show the preparation of ternary complex of gp120 with D1D2 derivatives.
This figure shows the crystals of four ternary complexes composed of HXBc2 gp120, 17b and different derivatized D1D2. HXBc2 gp120:17b:CD4-derivative complexes are abbreviated as HX-compound correspondingly.
These figures show the crystals of two ternary complexes composed of YU2 gp120, 17b and D1D2 or derivatized D1D2.
This figure shows future directions for the design of gp120-CD4 antagonist. Two possible directions are depicted starting from the identified cavity-targeting chemical modules: 1) further optimization of the cavity-binding ligands by using a weak CD4 mimetics; 2) screening and assembly of small molecules that recognize not only the Phe43 cavity but also the vestibule to the cavity and Arg59 site.
This figure shows the ratio of IC50 of D1D2F43C:R59A derivatives to gp120:D1D2 binding compared with that of corresponding D1D2F43C derivatives modified from same compounds. The compound name for deriving both derivatives in IC50 comparison is listed under corresponding column. A dash line parallel to X-axis is shown with a Y-axis intersection of 5.6, the value for the ratio of IC50 of D1D2F43C:R59A to D1D2F43C.
This figure shows the fragments proposed for the assembly of cysteine-modification compounds for D1D2F43A:R59C scaffold.
This invention provides soluble CD4-based polypeptides and compositions comprising same. The first soluble polypeptide consists of a portion of CD4 comprising all HIV gp120-binding epitopes present on intact CD4, wherein the polypeptide has a cysteine substitution at a residue which, in intact CD4, interfaces with HIV gp120.
The second soluble polypeptide comprises (i) a portion of CD4 comprising all HIV gp120-binding epitopes present on intact CD4, wherein the polypeptide has a cysteine substitution at a residue which, in intact CD4, interfaces with HIV gp120, and (ii) a chemical moiety bound to the CD4 portion at the cysteine substitution via a thiol bond.
The third soluble polypeptide comprises intact CD4, wherein the intact soluble CD4 has a cysteine substitution at a residue which interfaces with HIV gp120.
The fourth soluble polypeptide comprises (i) intact CD4, wherein the soluble CD4 has a cysteine substitution at a residue which interfaces with HIV gp120, and (ii) a chemical moiety bound to the intact soluble CD4 at the cysteine substitution via a thiol bond.
The fifth soluble polypeptide comprises (i) intact soluble CD4 or a portion of intact soluble CD4 covalently bound to (ii) a polypeptide moiety (e.g. an Ig polypeptide), wherein the intact soluble CD4 or portion thereof has a cysteine substitution at a residue which interfaces with HIV gp120.
The sixth soluble polypeptide comprises (i) intact soluble CD4 or a portion of intact soluble CD4 covalently bound to (ii) a polypeptide moiety (e.g. an Ig polypeptide), wherein the intact soluble CD4 or portion thereof has a cysteine substitution at a residue which interfaces with HIV gp120 and (iii) a chemical moiety bound to the intact soluble CD4 at the cysteine substitution via a thiol bond.
Herein, the first through sixth soluble polypeptides are referred to individually and collectively as CD4-based polypeptides.
In one embodiment, the portion of CD4 is the portion designated D1D2. In a second embodiment, the cysteine substitution is an F43C or R59C substitution. In another embodiment, the HIV gp120 is HIV-1 gp120.
In a further embodiment, the chemical moiety is bound to the intact soluble CD4 or CD4 portion via reaction with a haloacetamide, a halopropanone or a 5-nitro-2-pyridinesulfenyl reagent. In another embodiment, the chemical moiety is bound to the intact soluble CD4 or CD4 portion via reaction with 2-Bromo-N-(4-nitro-phenyl)-acetamide. In another embodiment, the polypeptide (e.g. second, fourth or sixth) binds to HIV gp120 with an IC50 of ≦10 nM. In a final embodiment, the polypeptide (e.g. second, fourth or sixth) binds to HIV gp120 with an IC50 of ≦5 nM.
This invention also provides two methods. The first is a method for making a derivatized soluble polypeptide comprising contacting, under suitable conditions, (a) a thiol-reactive reagent with (b) the first, third or fifth soluble polypeptide, wherein the polypeptide has a cysteine substitution at a residue which, in intact CD4, interfaces with HIV gp120.
The second is a method for obtaining a structural model useful in the design of an agent for inhibiting CD4 binding to HIV gp120 comprising (a) identifying a second, fourth or sixth soluble polypeptide which binds to HIV gp120 with an affinity comparable to or greater than the affinity with which intact CD4 binds to HIV gp120; and (b) obtaining a three-dimensional structure of the identified polypeptide while it is bound to HIV gp120, thereby obtaining a structural model useful in the design of an agent for inhibiting CD4 binding to HIV gp120.
In one embodiment of both methods, the portion of CD4 is the portion designated D1D2. In a second embodiment, the cysteine substitution is an F43C or R59C substitution. In another embodiment, the HIV gp120 is HIV-1 gp120.
In a further embodiment of the first method, the thiol reactive agent is a haloacetamide, a halopropanone or a 5-nitro-2-pyridinesulfenyl reagent.
In a further embodiment of the second method, the chemical moiety of the polypeptide is bound to the CD4-based polypeptide via reaction with a haloacetamide, a halopropanone or a 5-nitro-2-pyridinesulfenyl reagent. In another embodiment of the second method, the polypeptide binds to HIV gp120 with an IC50 of ≦10 nM. In a final embodiment of the second method, the polypeptide binds to HIV gp120 with an IC50 of ≦5 nM.
This invention further provides methods for identifying and for designing candidate inhibitors of CD4/gp120 binding comprising identifying desired chemical features for such compounds based on the structural information herein regarding the gp120/CD4 interface (e.g. Figures, Example IV and Example V).
This instant invention is illustrated in the Experimental Details section that follows. This section is set forth to aid in an understanding of the instant invention but is not intended to, and should not be construed to, limit in any way the invention as set forth in the claims which follow thereafter.
Crystal structures of complexes between HIV gp120 envelope glycoproteins and the cellular receptor CD4 defined their high-affinity (nM level) interaction at an atomic level. This includes a cavity in the interface near CD4 residue phenylalanine 43 (Phe43) at the center of the interface. Although HIV proteins mutate readily to escape the immune system, determinants of the unique and specific interaction with human CD4 are preserved. Subsequent thermodynamic and spectroscopic studies showed that large conformational changes occur in gp120 upon CD4 binding, suggesting that epitopes for CD4 binding are hidden from the immune system in apo gp120. It would be desirable to develop inhibitors that compete effectively with HIV sites for CD4 binding, but the exceptional flexibility of gp120 complicates lead identification. High-throughput screens have had little success in this system.
We have devised a method for identifying chemical leads for inhibition of the gp120-CD4 interaction. We use a D1D2 construct of CD4 that includes all of the gp120 binding epitopes, which we mutate to introduce a site for chemical derivatization. Most of the work has been done with the F43C variant (CD4 Phe43 Cys43), but other variants including R59C have also been used. CD4 thus mutated is reacted with chemicals, such as bromoacetamides or 5-nitro-2-pyridinesulfenyl reagents, that will react with the free thiol that has been introduced. Binding affinity of the derivatized CD4 (D1D2) is tested in an ELISA assay for binding to full-length gp120 molecules. F43C CD4 is impaired in gp120 affinity relative to wild type, but high affinity is restored with certain of the derivatives. Complexes of chemically derivatized CD4 with core gp120 molecules can be isolated and purified, and four of these have been crystallized and subjected to structure analysis by x-ray crystallography. Since the chemical modification is buried into the interface between two proteins, the exterior surface remains the same and these complexes crystallize isomorphously with the wild type structures. The structures show in detail how the chemical additions bind into the F43 cavity, and they motivate the design of new chemical derivatives aimed at higher potency and the exploration of a water channel beyond the water cavity.
The CD4 derivatives identified by this method will serve as leads for further development. Ultimately they will be attached to other, non-CD4 scaffolds for further elaboration. An intermediate step will be to use smaller peptide mimetics of CD4 that also contain a cysteine residue for derivatization. Ultimately, we expect to keep the chemical portions found to bind optimally into the Phe 43 cavity and to elaborate chemical replacements for all of the CD4 protein. Such compounds will be legitimate leads for drug development as HIV entry inhibitors.
Recognition of the HIV envelope protein gp120 by the host cell receptor CD4 is the first step in HIV infection. An interfacial “Phe43 cavity” in gp120, close to where the CD4 residue Phe43 is bound, has been suggested as a potential target for therapeutic intervention. Because this cavity is unique to CD4-bound gp120, we designed and prepared a two domain CD4 template with Phe43 mutated to the chemoselective cysteine residue for site-specific coupling of chemically diverse compounds for screening against the Phe43 cavity. A library of haloacetamides and 5-nitro-2-pyridyldisulfides were selected and synthesized for modification of the reactive cysteine on CD4. Among them, 2-Bromo-N-(4-nitro-phenyl)-acetamide (compound DN-052) produced a CD4 derivative with highest affinity in binding gp120 (IC50=4.14 nM). The structure-activity relationship (SAR) study of derivatized CD4 binding to gp120 revealed a significant plasticity of the Phe43 cavity in binding different compounds and a narrow entrance to the cavity. The primary contacts for compound recognition by the cavity were found to be van der Waals interactions, while hydrophilic interactions were detected at the narrow entrance region. This first SAR on ligand binding to an interior cavity of gp120 may provide a starting point for structure-based assembly of small molecules targeting gp120-CD4 interaction.
A novel method of identifying low-affinity cavity binder needs to be developed. Also, special consideration is needed to ensure the stabilization of Phe43 cavity, which is absent in free gp120 (Kwong, Wyatt et al. 1998; Kwong, Wyatt et al. 2000; Chen, Vogan et al. 2005). Once identified, the low-affinity small molecules that targets Phe43 cavity could be used in combination with other fragments that recognize Phe43 site (the entrance of the pocket) or Arg59 site on gp120 by fragment-assembly approaches (Rees, Congreve et al. 2004).
We addressed this question using a protein-small molecule hybrid approach. The protein template in the hybrid retained enough activity to structure Phe43 cavity in gp120, while providing chemically active site for site-specific coupling of chemically diverse compounds for screening against the Phe43 cavity (Smith, Savinov et al. 2002). Here, we report the construction of 81 protein-small molecule hybrids derived from a library of various small compounds, which would otherwise bind weakly to gp120 without the template, and the first structure-activity relationships for the interactions of Phe43 cavity targeting probes with gp120.
Iodoacetamide, 5,5′-Dithiobis(2-nitrobenzoic acid) (DTNB) and N-Ethylmaleimide (NEM) were purchased from Sigma-Aldrich. All other cysteine-modification compounds were synthesized as described in Chemistry section below. The gp120 antibody 17b was produced in ascites and purified by Strategic BioSolutions. Purified full length YU2 gp120 from S2 cells was provided by Dr. R. Wyatt. Purified full length YU2 S375W/T257S and wild type YU2 produced in HEK293 cells were also obtained from Dr. R. Wyatt.
Recombinant two domain CD4 (D1D2, residue 1-183) was cloned into NcoI and XhoI sites of vector pET24d (Novagen). D1D2 was expressed as inclusion bodies in Rosetta (DE3) cells (Novagen) through leaky-expression of T7lac promoter without IPTG induction in SuperBroth medium (BIO 101, Inc.) at 37° C. for 24 hours. The inclusion bodies were isolated from cells by sonication and centrifugation, and then washed three time by 2% Triton-X 100 (Sigma-Aldrich), 2 M urea (Fisher Scientific), 5 mM EDTA and 5 mM DTT in Tris.HCl buffer, pH 7.5. Pure inclusion bodies were further solubilized by 6 M guanidine-HCl, 5 mM EDTA, 20 mM Tris.HCl, pH 7.5, 10 mM DTT, 0.5 mM PMSF. Solubilized D1D2 were refolded at 0.5 mg/ml in a refolding solution optimized based on the #10 condition of Foldit kit (Hampton Research): 50 mM Tris.HCl, pH 8, 10 mM NaCl, 1 mM KCl, 1 mM EDTA, 440 mM Sucrose, 2 mM Cysteine and 2 mM cystine. The impurity and oligomeric D1D2 were removed by passing refolded D1D2 proteins through Q and SP Sepharose Fast Flow resins (Amersham) in batch-mode at pH 10.5 and pH 6.2 respectively. As a last step, a size exclusion column Superdex 200 26/60 (Amersham) was used to separate any residual misfolded oligomeric D1D2 from monomeric D1D2. Purified soluble D1D2 contains residue 1-183 of CD4 and an additional glycine from the cloning vector at the N-terminus. CD4 mutants D1D2F43C and D1D2F43Y were created by site-directed mutagenesis and prepared similarly as wild type D1D2.
Iodoacetamide, DTNB and NEM were dissolved and store as 20 mM solution in 0.5 M Na/K-phosphate, pH 7.4. All other compounds including both bromo-compounds and mix-disulfide compounds were dissolved in DMF, DMSO or ethanol as 20-60 mM stock. D1D2F43C proteins at 1-3 mg/ml were first reduced by 2 mM Dithiothreitol (DTT) (BioVectra). Excessive DTT was then removed using PD-10 columns (Amersham) and at the same time the proteins were exchanged into proper reaction buffer: 0.1 M phosphate buffer, pH 7.4, 0.1 M NaCl, 1 mM EDTA and 0.1 mM DTT for all Bromo-compounds, Iodoacetamide and DTNB; 0.1 M phosphate buffer, pH 7.0, 0.1 M NaCl, 1 mM EDTA for 5-nitro-2-pyridinesulfenyl compounds and 0.1 M phosphate buffer, pH 7.4, 0.1 M NaCl, 1 mM EDTA and 0.1 mM DTT for NEM. The thiol-reactive compounds were then diluted to protein solutions to reach final concentration of 2 mM, which is more than 10 folds of the free thiols's concentration in reaction. The reactions were allowed to continue at 25° C. for 2 hours in the dark. The final products, namely derivatized D1D2F43C, were separated from small molecules through desalting columns (PD-10 columns, Amersham) and solvent exchange in Amicon Ultra-4 5K concentrator (Millipore). The completeness of the modifications was examined using protein mass spectrometry (MALDI-TOF) as well as peptide mass spectrometry of trypsin-digested proteins. This latter step was also used to confirm the correct site-directed incorporation of the compounds. A control D1D2 was prepared by using compound 40 to “mock-modify” D1D2 in the same way described above. The product of this reaction was thus name D1D2-40.
Two independent methods were used. A) Amino acid analysis (Keck Biotechnology Resource Laboratory, Yale University) was used for determinations of the concentrations of a selected group of D1D2F43C derivatives. Only the results of residue Ala, Leu and Phe were used in the calculation of protein concentrations. B) The “nominal concentrations” of proteins were calculated from 280 nm absorbance of protein samples in PBS using theoretic extinction coefficient of D1D2F43C. The concentrations of D1D2F43C variants derived from the halo-compounds were then corrected by the corresponding correction factors: Correction factor (%) OD280D1D2F43C/(OD280D1D2F43C+OD280protein-conjugated compound). OD280protein-conjugated compound was estimated, by the experimentally determined absorbance of the free halo-compounds in PBS.
The concentrations of selected derivatives determined by method A were in good agreement with the results obtained by method B. Thus method B was then used for concentration determinations of all halocompound-derivatized proteins. The “nominal concentration” was used for derivatized D1D3F43C from all other compounds.
The abilities of derivatized D1D2F43C proteins to bind gp120 and inhibit gp120-CD4 interaction were evaluated using a competition ELISA (Enzyme-Linked Immunosorbent Assay). Briefly, Immuno 2HB plates (Thermo LabSystem) were coated by with 100 μl 4 μg/ml recombinant D1D2 in PBS overnight at 4° C. The plates were then blocked by 3% bovine serum albumin (BSA) (CalBiochem) in PBS for 2 hrs at 25° C. Fifty nanogram YU2 gp120 from S2 cells, YU2 gp120 from HEK293 cells or YU2 S375W/T257S from HEK293 cells in total volume of 100 μl in 3% BSA-PBS was added to the plates in the presence of either D1D2 or D1D2F43C derivatives at various ranges of concentrations and incubated for 90 minutes at 25° C. After removal of unbound gp120 by washing plates four times with PBST (0.05% Tween-20 in PBS), the bound gp120 was detected by a gp120 antibody 17b (100 μl, 1 μg/ml), which was further probed with a peroxidase-conjugated donkey anti-human antibody (Jackson ImmunoResearch, 1:20,000 dilution, 100 μl). 3,3′,5,5′-tetramethylbenzidine (Sigma) was used as the substrate for peroxidase and the optical density (OD) was read at 450 nm. The binding of the residual gp120 to plate-bound D1D2 was calculated by using the following formula:
Binding (%)=100×(ODgp120-Competitor−ODbackground)/(ODgp120-ODbackground).
IC50 values were obtained by nonlinear regression fitting of binding results by GraphPad Prism 4 (GraphPad Software) using the formula of one site competition shown below:
Binding (%)=BOTTOM+(TOP−BOTTOM)/[1+10̂(log(Concentration)−log(IC50))].
We used two-domain CD4 (D1D2) scaffold with Phe43 replaced by cysteine (D1D2F43C) as the template for site-specific delivery of diverse probes to the cavity. This template was chosen based on following considerations. First, D1D2 contains all the essential elements of CD4 in binding gp120. The D1D2's ability to co-crystallize with gp120 (Kwong, Wyatt et al. 1998; Kwong, Wyatt et al. 2000; Huang, Tang et al. 2005) opened a door for structural characterization of binding between gp120 and the derivatized CD4 proteins. Second, the exact location of residue Phe43 of CD4 at the entrance of the cavity made it the best position for attaching and specifically delivering chemical groups into cavity. The chemical modification of cysteine at position 43 would be straightforward given the high solvent accessibility of Phe43 revealed by the crystal structures of free CD4 (Ryu, Kwong et al. 1990; Wang, Yan et al. 1990; Ryu, Truneh et al. 1994; Wu, Kwong et al. 1997). Third, cysteine was chosen to replace Phe43 because it, among all natural amino acids, enables chemoselective conjugation best. Also all endogenous cysteines of D1D2 are disulfide-bonded (Ryu, Kwong et al. 1990; Wang, Yan et al. 1990; Ryu, Truneh et al. 1994; Wu, Kwong et al. 1997) and will not interfere with selective modification of F43C. Furthermore, D1D2F43C has much lower affinity to gp120 than D1D2 (
The binding affinities of the derivatized D1D2F43C to gp120 were evaluated by competitive ELISA assays, in which the potencies (IC50) of these derivatives in inhibition of gp120-D1D2 binding were measured (
In order to identify the best thiol-reacting module for linking diverse compounds to Cys43, one representative compound from each of three commonly-used classes of sulfhydryl reagents was tested for its performance in modification of F43C residue of D1D2 and the affinity of the derived D1D2F43C to gp120 was determined. The compounds, namely iodoacetamide, 5,5′-dithiobis(2-nitrobenzoic acid) (DTNB), and N-ethylmaleimide (NEM), all successfully modified D1D2F43C with satisfactory completeness (>98%, 80%, and >98% respectively) (Supplement Table 1). But iodoacetamide was the only reagent that produced a CD4 derivative with gp120-binding affinity higher (6 folds) than the template D1D2F43C (reflected by lower IC50) (
Thus haloacetamide was picked as the primary starting module for construction of the thiol-reactive compound library. Some 5-nitro-2-pyridyldisulfides were also included. The initial library included 41 bromoacetamides (compounds 1-41) and 7 5-nitro-2-pyridyldisulfides (compounds 42-48), designed through a computer-assisted molecular complementarity search using GrowMol (Bohacek and McMartin 1994; Ripka, Satyshur et al. 2001). Based on the results from the first library, a second library of 32 bromoacetamides, 1 bromopropanone and 3 5-nitro-2-pyridyldisulfides (DN010-DN271) were further designed to complete the analysis of structure-activity relationships of cavity-filling probes.
All D1D2F43C derivatives were generated through the nucleophilic reactions between thiolate anions of D1D2F43C and various electrophiles from the two compound libraries at pH 7-7.5 (
All 81 D1D2F43C derivatives were tested in competition ELISA for their abilities of inhibiting gp120-CD4 interaction. Although more than one third of the derivatives from initial library bound gp120 worse than the starting template D1D2F43C, all of the derivatives from the second library bound gp120 better than D1D2F43C (Table 1). Furthermore, majority of second-library (32 out of 36) derived protein had IC50 lower than D1D2F43C-Iodoacetamide. The distribution of the IC50 values of the D1D2F43C derivatives derived from both libraries was summarized in
In order to carry out any valid SAR study on them, we first confirmed the presence of these D1D2F43C-attached compounds in the Phe43 cavity when they were bound by gp120 by measuring affinities of derivatives to a Phe43 cavity-filling mutant of gp120 in the competition ELISA. This gp120 mutant, namely gp120 S375W/T257S, has the cavity-lining residue serine 375 mutated to tryptophan, which fills the cavity (Xiang, Kwong et al. 2002). (The other mutation, T257S, is incorporated to stabilize gp120 S375W.) This cavity-filling mutant of gp120 bound D1D2 and D1D2F43C equally well as wild type gp120 did, as indicted by their identical IC50 values in competition assay (
SAR of CD4-Attached Compounds on gp120
The introduction of the small alkyl group, isopropyl (1), at the nitrogen of acetamide group, was not well tolerated by the cavity reflected by the increase of IC50 by 5 folds from that of D1D2F43C-Iodoacetamide where only hydrogen was linked to acetamide nitrogen (Table 2.1). When bigger alkyl groups such as isobutyl (2) and 6-hydroxy-hexyl (9) were used, however, IC50 values of the correspondingly modified D1D2F43C were back to the level comparable to, but not much better than that of D1D2F43C-Iodoacetamide. Even bulkier cyclic group, such as cyclohexyl group (3), increased the binding affinity more than 2 folds with an IC50 of 13 nM, which was further reduced by half when an extra methylene linker was used between acetamide nitrogen and cyclohexyl group, i.e. cyclohexylmethyl (4). Presence of even bigger groups (6, DN-040 and 8) in cavity knocked down the binding affinity to micromolar level, suggesting the size limit of the cavity was probably reached.
Unlike alkyl groups, aryl groups were much more favored when attached to acetamide nitrogen. The highest binding affinity in this group (Table 2.2) was observed with D1D2F43C-10 (IC50=7.76 nM) that has a phenyl group at acetamide nitrogen. Insertion of up to 3 methylene linkers between nitrogen and phenyl group (28, 40 and Dn-022) decreased binding slightly with ethyl being the best linker among the three. Interestingly, additional methylene branch to benzyl group at the first carbon atom linked to acetamide nitrogen resulted in a huge (5-10 folds) lost in affinity (29 and 30 compared to 28). Similarly to phenyl group, introduction of naphthalene group (14) at acetamide nitrogen was also favorable, but the additional linkers (32, Dn-149, 31, and Dn-152) between naphthalene and nitrogen were much less tolerated by the cavity. In light of the preference of phenyl and naphthyl like groups in the cavity, we also synthesized compounds containing 5-membered aromatic rings linked to acetamide. The results (Table 2.3) were very similar to the cases when phenyl was used except when acetamide nitrogen was linked to position 2 of thiophen (DN-155). Unlike D1D2F43C-DN-242, D1D2F43C-DN-155 had an IC50 3 times higher than that of its counterpart when phenyl was used (10). This was probably due to the different position of sulfur in thiophen, suggesting the unfavorable interaction between sulfur in DN-242 and the Phe43 cavity.
Extensive substitutions on phenyl group of compound 10 were employed to screen for higher-affinity binders compared to parent compound 10 and to characterize the chemical preferences of Phe43 cavity.
Eighteen compounds were synthesized with various substitutions at para position (Table 2.4A). In general, small substitutions ranging from methyl (DN-183) to ethoxy (DN-185) groups had limited effect on the activities of the derivatized CD4, but larger group such as isopropoxy (22) and benzyloxy (26) groups reduced the affinities significantly. The highest binding affinities of this study were observed when compound 10 was substituted at para position by nitro (DN-052, 4.14 nM), isopropyl (12), or ethyl (DN-189) group. Interestingly, derivatives with hydroxy-containing substituents at para position always showed IC50 values higher than that of similar derivatives with hydroxy group replaced by methyl group (DN-060 & DN-183, DN-199 & DN-189). This indicated that the electron-donating groups such as hydroxy are probably not favored at para position of the phenyl ring. An isopropanol was found in the Phe43 cavity in the refined 2.2 Å structure of HxBc2 gp120:D1D2:17B Fab (Kwong, Wyatt et al. 2000), suggesting a possible starting point for the design of cavity-binding compound. We tested this idea by adding an isopropanol group on the para position of phenyl group of compound 10. Surprisingly, the resulting compound DN-271 showed lower affinity to gp120 than the parent compound 10, indicating the disfavor of isopropanol group at this location of cavity.
A smaller collection of eight various groups were tested at meta position of phenyl ring of compound 10. A much higher tolerance for bulkier groups was noticed compared to the para position (Table 2.4B). Also small groups increased the affinity of the derivatives from the parent derivative (D1D2F43C-10) significantly less at meta position than at para position.
None of seven different substitutions at ortho position on phenyl group was able to enhance the affinity (Table 2.4C). Interestingly, the smallest group tested, methyl (DN-209), was the second to the least favored at this position with an IC50 of 24.5 nM, yet a much bulkier group benzyloxy (DN-234) performed best at this position with an IC50 of 8.87 nM, comparable to that of the parent compound.
D1D2F43C-Iodoacetamide, the simplest D1D2F43C derivatives containing only acetamide module, displayed much higher affinity to gp120 than D1D2F43C did (
We were only able to study a very small selection of disulfide compounds due to their limited reactivity towards D1D2F43C. All of the derivatives contained aryl groups that were linked to cysteine 43 through disulfide bond (Table 2.6). The incorporation of most of these aryl groups significantly increased activities of derivatives in gp120 binding when compared with D1D2F43C. Similar to the SAR on the substituted N-phenyl-acetamide derived D1D2F43C, addition of isopropyl but not other bulkier group at para position of benzyl group in compound 42 slightly enhanced affinity (43).
The methods of identification of lead compounds or partial lead compounds (fragments) as antagonists of protein-protein interactions have expanded recently from the traditional high-throughput screening to more modern approaches such as NMR techniques, rational discovery and tethering (Gadek 2003). Among them, tethering (Erlanson, Braisted et al. 2000) also utilizes the chemical property of free thiol in cysteine, as we used in this study, to identify potential chemical fragments for specific binding pocket. In tethering, selected residues of the target protein around binding site are mutated to cysteines and used for screening a library of fragment molecules containing a disulfide moiety. The screening is done under partial reducing conditions so that only fragments really complimentary to the binding site can react with the free thiols. Compared to the method we used here, tethering can screen a large library of small fragment faster and have higher hit-finding rate by using more than one binding-site residues as tethering points. Tethering, however, is not practical in the case of identifying fragments targeting gp120 Phe43 cavity for two reasons. First, highly glycosylated gp120 does not permit the usage of mass spectrometry for identifying suitable fragments reacted with cysteines; second, weak-binding fragments may not be able to recognize and stabilize gp120 in the conformation that exhibit the phe43 cavity, resulting few or no hits. In this study, both difficulties were overcome by using D1D2F43C as tethering points for compounds. As the natural ligand for gp120, it stabilized Phe43 cavity in gp120; having no glycosylation sites, its derivatives were suitable for mass spectrometry study. In addition, usage of a gp120 ligand as the template in our method enables the quantitative study of structure-activity relations on the fragments, which would otherwise bind gp120 with weak affinity not suitable for accurate affinity determination using traditional binding assays.
When D1D2F43C template was used for tethering compounds, two potential binding sites on gp120 were available for small molecules. They were phe43 cavity and the binding site for the phenyl group of original phe43 residue. Chemical groups that separately targeted each site could work additively when combined properly. A good example comes from a recent study in rational design of CD4 mimetic CD4M33 (Martin, Stricher et al. 2003) where a biphenyl group instead of phenyl group at position corresponding to Phe43 of CD4 increases gp120-binding affinity of the mimetic by 6 folds. Crystal structure of core gp120 in complex with CD4M33 and 17b antibody shows that the addition phenyl binds inside the cavity where the bottom phenyl group superimposes well with the original phenyl group from phe43 residues (Huang, Stricher et al. 2005).
In this study, the usage of cysteine residue as the handle for chemoselective modification and screening of small molecules made it impossible to re-construct a phenyl group for occupying the original Phe43 binding position. Instead, the cysteine-reacting module (e.g. acetamide group for haloacetamides) of each type of cysteine modification reagents interacted with this site. Among all three kinds of electrophiles examined in this study (See Results), derivative of iodoacetamides had highest affinity to gp120 indicating the good complementarity of acetamide moiety at Phe43 binding site possible due to its relative small size as well as rigid and no-branching shape. Judging from its linear shape, it probably protruded out CDR2 loop of D1 domain and reached out to entrance of Phe43 cavity as much as the original phenyl group of original Phe43 residue. Its rather small “width” compared to phenyl ring, however, probably limited its capability of making extensive interactions with gp120 around cavity opening as seen with Phe43. This agreed with the observations that IC50 of D1D2F43C-Iodoacetamide was 6 folds less than that of D1D2F43C but still 5 folds more than that of wild type D1D2 (
On the other hand, modification by NEM decreased CD4's binding to gp120 significantly indicating that maleimide group was probably too bulky to be fitted at this site. In the case of disulfide compounds, i.e. 5-nitro-2-pyridyldisulfides, lack of proper control compound could not give a direct answer as to how well the phenyl site was occupied but the flexible nature of disulfide bond and the sulfur-carbon bond in disulfanly-benzyl moiety probably rendered it less favored by Phe43 binding site than acetamide moiety. Another disulfides-derived CD4, D1D2F43C-DTNB did not bind gp120 well not because of the disulfide linkage but probably because of the large size and extremely hydrophilic nature of 3-Carboxy-4-nitrophenyl group. This was supported by the high affinities of 5-nitro-2-pyridyldisulfides derived D1D2F43C (Table 2.6).
Most efforts in this study had been taken to characterize the chemical preference of the compounds in binding the Phe43 cavity by studying the structure-activity relationships of altogether 81 D1D2-compound hybrids. Despite of dramatic differences in their sizes, shapes, chemical properties, and affinities to gp120, all compounds presented by D1D2F43C template resided Phe43 cavity (
In the SAR study of N-alkyl-acetamide derived D1D2F43C (Table 2.1), isobutyl group was found to be much more favored than a smaller isopropyl group, which branched at the first carbon linked to acetamide nitrogen. This suggested that Phe43 cavity may had a relatively narrow opening near the binding site of acetamide nitrogen. The side groups that branched out from acetamide nitrogen may have repulsive steric interaction with the residues around the cavity neck region while non-branching groups may be accommodated well. This also agreed with the observation that insertion of a methylene linker between nitrogen and a bulky group just as cyclohexyl or adamantan group enhanced the affinity.
Crystal structures of CD4-bound core gp120 also show the existence of a cavity entrance narrower than the body of cavity (Kwong, Wyatt et al. 1998; Kwong, Wyatt et al. 2000), suggesting this narrow entrance of the cavity is preserved from full length gp120 to core gp120. Modeling of cysteine-acetamide on the D1D2 scaffold positioned acetamide nitrogen close to the cavity neck (data not shown), consistent with the finding from the SAR data.
The introduction of isobutyl group at acetamide nitrogen, however, only had marginally positive effect on affinity probably due to limited van der Waals interactions. On the contrary, cyclohexyl group, although had bulky branches, affected affinity positively possibly by providing more van der Waals contacts.
Aromatic groups such as phenyl and thiophen (Tables 2.2 and 2.3) were shown to be the most favorable groups when linked to acetamide nitrogen at the cavity entrance probably because they not only had capability of engaging a lot of van der Waals interactions like cyclohexyl, but also had rigid and planar shapes that could be accommodated much better at the cavity neck than cyclohexyl group. Additional methylene linkers between nitrogen and phenyl or thiophen group, unlike in the case of alkyl group, affected binding adversely. One plausible explanation was that phenyl group fitted the narrow cavity entrance well but could no longer maintain all the favorable van der Waals contacts once in the broader space inside the cavity due to the insertion of methylene linkers. Again, in agreement with the presence of a narrow cavity neck, additional methyl branch of benzyl group at the first carbon atom linked to acetamide nitrogen resulted in a dramatic affinity lost (Compound 29 and 30, Table 2.2).
Substitutions on the para position of phenyl group of compound 10 (Table 2.4) were found to be most effective in affinity enhancement. Best substituents were identified as nitro (DN-052) and isopropyl (12) groups with IC50 of 4.14 and 4.84 nM respectively. These two groups' different-electronic properties but similar molecular shape and similar effect in binding suggested that they may enhance binding solely through hydrophobic interaction with cavity residues. Similarly to phenyl group, insertion of methylene bridges for these substituted phenyl groups weakened binding (Supplemental Table 1: 12 & 25; DN-052 & 36) possibly by breaking the perfect shape complementation of these substituted phenyl groups to Phe43 cavity. These evidences, together with the observation on the disfavor of hydroxy group on the phenyl ring and sulfur on thiophen group (Table 2.3 and 2.4), indicated that van der Waals interactions may dominate the recognition of the compounds within cavity. This is consistent with the crystal structure of core gp120 in which the Phe cavity is primarily lined by hydrophobic residues (Kwong, Wyatt et al. 1998; Kwong, Wyatt et al. 2000). Although main chain atoms of the cavity-enclosing residues of gp120, such amide nitrogens in residue 376 and 377 or carbonyl oxygen of residue 375, could in principle participate in hydrophilic interactions, based on this study this appeared not to be the case. The finding of an isopropanol occupying the cavity in the crystal structure of core HxBc2 gp120 bound with D1D2 (Kwong, Wyatt et al. 2000) was also not in contradictory to the observed preference of hydrophobic compounds in the cavity since the hydroxy group of isopropanol is hydrogen-bonded to two water molecules but not any residue from the gp120 in the crystal structure.
Substitutions on meta and ortho positions on compound 10's phenyl ring did not work as well as that on para position. But much larger groups (Table 2.4) such as groups seen in compound DN-234 were able to fit in the cavity without affecting binding affinity at all, suggesting they may bind in the cavity in different modes from the para-substituted phenyl groups and their binding in cavity may involve inducing gp120's structural rearrangement and expansion of cavity region.
In summary, the structure-activity relationship (SAR) study of derivatized D1D2F43C binding to gp120 revealed that the narrow entrance of cavity seen in the structure of core gp120 bound by CD4 was also preserved in full length gp120 bound by derivatized CD4 but the cavity itself displayed significant plasticity in binding different compounds. Van der Waals interactions were found to be primarily responsible for the recognition of compounds by cavity while hydrophilic interactions may play a role around cavity entrance as seen with nitrogen of acetamide group.
Plasticity at protein-protein interface has been observed and studied (DeLano, Ultsch et al. 2000; Ma, Shatsky et al. 2002). Significant adaptability of Phe43 cavity, however, was not expected because of the presence of D1D2 scaffold, which pre-fixed gp120 at a CD4-bound substrate. This may explain the relative small gain in affinity by addition of isopropyl/nitro groups at the para position of phenyl group of compound 10 (Table 2.4) as well as the IC50 plateau of 4 nM reached after optimization: addition of favorable compounds in cavity may still result in the redistribution of gp120 population to a substate that is not permitted or favored by D102F43C template. This penalty from occupying the cavity that is intrinsic flexible but stabilized by D1D2 scaffold may prevent the further increase in affinity of derivatized D1D2F43C. A smaller CD4 scaffold such as mimetic may be a reasonable template for further optimization of cavity-binding fragment because it may structure gp120 less. D1D2 template, however, still has its irreplaceable values in that derivatized D1D2F43C can be readily used for structural study while most of CD4 mimetic can not except for CD4M33, of which the structure was recently solved in complex with gp120 (Huang, Stricher et al. 2005).
From the therapeutic aspect, it is interesting to know what kind of conformation gp120 adopts when Phe43 cavity is occupied by derivatized D1D2F43C. Are derivatized D1D2F43C CD4 agonists or CD4 antagonists? It is obvious that CD4 antagonists can be very useful as entry inhibitor by preventing the virus attachment, CD4 agonists have also been shown to be antiretroviral in vivo (Vermeire and Schols 2005), possibly by either blocking coreceptor binding sterically or by fixing gp120 in the CD4-bound conformation recognizable by immune system. They also can be particularly useful tools in the development of antibodies and vaccines against gp120 (Kang, Hariharan et al. 1994).
CD4 mimics, such as CD4-IgG, are CD4 agonists that stabilize gp120 in CD4-bound co-receptor-ready state. Derivatized D1D2F43C only differed from the CD4 mimics by occupying the Phe43 cavity, which probably only exits in gp120 upon CD4 binding. Does that necessarily determine gp120 in a CD4-bound state? Although biochemical data from a cavity-filling mutant of gp120 (S375W) suggested that gp120 adopts a conformation more resembling the co-receptor ready state (Xiang, Kwong et al. 2002), situation could be different when the unit that plugs in the cavity does not come from gp120 itself because it may inhibit the transduction of potential conformational change by locking gp120 in a particular conformation. Indirect evidences can be found from studies on CD4M33—a CD4 mimetic with an additional phenyl group attached to Phe33 position (equivalent to Phe43 of CD4) in CD4F23 mimetic. It has been shown by thermodynamic study (Huang, Stricher et al. 2005) that CD4M33 introduces much less structural rearrangement in gp120 reflected by entropic changes than CD4F23 and D1D2. Ability of promoting the binding of gp120 to CCR5 by CD4M33 is also 6 folds less than that of CD4 (Martin, Stricher et al. 2003). Both evidences strongly support the notion that gp120 with occupied Phe43 pocket may adopt a conformation different from either free or CD4-bound state.
In conclusion, we had constructed a library of D1D2-compound hybrids and presented here the structure-activity relationships of the integrated compound binding to the Phe43 cavity in gp120. The collection of structure-activity relationships of these cavity probes provided us insight on the detailed shape, conformational adaptability and chemical preference of Phe43 pocket, which could benefit the further transformation of the identified compounds into efficient gp120-CD4 inhibitors by incorporating them onto either smaller scaffold, such as miniproteins, or small molecules by fragment assembly. The unique utilization of a reactive ligand template (CD4) in screening compounds for targeting a receptor (gp120) site at the ligand-receptor interface may be applied to other system when the stabilization of targeting site on the receptor requires the presence of the ligand.
indicates data missing or illegible when filed
The method that we have described for identifying chemical leads for inhibition of the gp120-CD4 interaction has already produced several derivatized CD4 analogs that bind to HIV gp120 as well or better than natural human CD4. Various constructs based on human CD4 have been shown to be efficacious (e.g. CD4-IgG2 and dodecameric CD4-Ig), even in clinical trials, and it can be expected that such constructs modified to incorporate the very same modifications that we have introduced by reacting D1D2F43C with certain of our bromoacetamides or 5-nitro-2-pyridyldisulfides (e.g. SNS-10, SNS-12, SNS-14, DN-52 and DN-234) might themselves be expected to have better therapeutic efficacy than the parent therapeutic CD4 in treating neonates of HIV-infected mothers and newly HIV-infected medical workers (needle pricks).
The incorporation of our derivatives into an already developed CD4 product would require the following additional steps: 1. The recombinant cDNA used to produce the therapeutic CD4 product would need to be modified to encode the F43C mutation. 2. The alternative therapeutic CD4F43C would need to be reacted according to the same procedures used with D1D2F43C for modification with our derivatizing reagent to produce the product analogous to our D1D2F43C-X derivative.
The Phe43 interaction is the focal point of all CD4-gp120 interactions, and we expect that properties observed for derivatized D1D2F43C will transfer faithfully to any CD4-based therapeutic. Based on our analysis of derivatives of D1D2F43C, derivatized analogues of current therapeutic CD4s can be expected to have two advantages over the current therapeutics. First, binding affinities have been found that are higher than that of wild-type CD4 (e.g. D1D2F43C-DN-52 has a measured affinity 77% greater than that of natural D1D2). Structure-based design efforts that are in progress may lead to further improvements. Increased affinity is advantageous since it increased the potency of the drug. Second, larger derivatives are found to cause structural perturbations in gp120 and to lead to decreased affinity for the 17b antibody, a surrogate for the chemokine receptor, and to an observed decrease in binding of YU2 gp120 to the CCR5 chemokine receptor. Decreased affinity for the chemokine receptor is advantageous since it reduces the risk of adventious viral virion after association with the therapeutic CD4. A risk to be contemplated for these derivatives is that of a potential immune response directed against the new chemical moiety on CD4.
The surfaces of HIV viruses contain noncovalent trimeric association of the envelope glycoproteins gp41 and gp120 (Clapham et al. 2002). The gp120 proteins mediate the initial attachment step in HIV viral entry into host cells by sequentially interacting with host cell receptor CD4 and a chemokine receptor, CCR5 or CXCR4. In addition, it helps the virus to escape the neutralization of host immune system by 1) heavy glycosylation, 2) shielding of the conserved epitopes by highly variable loops and 3) protection of its active conformations by imposing large unfavorable entropy penalty for their transition from the free form (Kwong et al. 1998; Wyatt et al. 1998; Kwong et al. 2002). Thus, gp120 is a prominent target for therapeutic intervention either by blocking its binding to CD4 or the co-receptor or by eliciting gp120-directed neutralizing antibodies. Precise and comprehensive information on the structures of gp120 and their stability and flexibility is indispensable to advancing the progress of either approach.
Unfortunately, trimeric gp120 and full length monomeric gp120 have so far eluded crystallographic study, possibly due to the specific immune system-eluding structural characteristics mentioned above. Most of the known structural information on gp120 has come from a few X-ray crystal structures of core gp120 protein in complex with D1D2 and a Fab fragment of a gp120 antibody (17B or X5) (Kwong et al. 1998; Kwong et al. 2000; Huang et al. 2005) as well as a relative low-resolution structure of an unliganded SIV core gp120 (Chen et al. 2005). The structures of core gp120 bound by both D1D2 and an antibody are believed to reflect the true character of CD4-bound gp120 because the following reasons: 1) the gp120-CD4 interaction revealed by the crystal structures is consistent with the critical residues identified in both components by the mutational analysis (Kwong et al. 1998); and 2) core gp120 has been shown to resemble full length gp120 both structurally and functionally (Binley et al. 1998; Rizzuto et al. 1998; Myszka et al. 2000).
On the other hand, the crystal structure of free SIV core gp120 may still have non-trivial differences from the real free conformation of HIV gp120, due to the following concerns. Because of the flexible and partially-unfolded nature of free gp120 without its binding partners, a crystal structure can only provide a snapshot of one of its quasistates, which can be stabilized by crystal contacts. Oligomeric organization of gp120 may also provide additional constraints on the conformations of free gp120. Furthermore, the epitope for an antibody b12 that recognizes free gp120 with very small entropy change (Kwong et al. 2002) and neutralizes HIV-1 viruses broadly was mapped to a surface that is continuous in the structure of CD4-bound gp120 (Pantophlet et al. 2003) but not in the recently solved structure of the free SIV gp120 (Chen et al. 2005). Taken together, these evidences suggest that the SIV structure may resemble one of many conformations that free gp120 can adopt and most likely it is different from the gp120 structure in trimeric form. The predominant conformations of free gp120 may resemble CD4-bound gp120 structures more than the SIV core gp120 structure, but with substantial variation. Thus structural understanding of the flexibility of free gp120 and gp120 bound by different ligands are necessary for designing therapeutic agents targeting gp120.
In Experiments I and II, we have described the construction of a library of derivatized CD4 proteins for screening their binding affinities to the Phe43 cavity and presented a full structure-activity analysis of binding of these diverse chemical entities to the Phe43 cavity. Significant plasticity of gp120 in binding the derivatized CD4 has been also suggested by the SAR study. Here, we are interested in gaining structural understanding on the binding of gp120 to CD4 derivatives in order to aid further structure-based improvement of CD4-gp120 interaction inhibitors and also lend more insight into the plasticity and adaptability of gp120 in binding its ligands. Specifically, we solved crystal structures of core gp120 in complex with four differently derivatized CD4, each having high affinity in binding gp120. We describe here in detail how gp120 interacts with these CD4 derivatives and how these structures compare to a recently solved structure of gp120 binding to a CD4 mimetic, CD4M33 (Huang et al. 2005). Surprisingly large plasticity of gp120 in binding different derivatives was observed despite the fact CD4 has already constrained gp120 in a CD4-bound conformation. In addition, the thermodynamics of full-length gp120 binding to derivatized CD4 and 17b antibody was studied using isothermal titration calorimetric experiments. Reduced negative entropy and heat capacity changes in binding of gp120 and derivatized CD4 suggested for a less structured gp120 with Phe43-cavity occupied, when compared to CD4-bound gp120. This intermediate conformational state of gp120 may have reduced affinity for chemokine receptor, as suggested by an in vitro CCR5-binding experiment. A mechanism is proposed based on the crystal structures for the transduction of the conformational changes from the gp120 structure around the filled Phe43 cavity to farther regions in gp120 including chemokine-receptor binding site.
Overview of Structures of Four Derivatized CD4 in Complex with gp120
To study the structural bases for recognition of different D1D2F43C derivatives by gp120, we crystallized and solved structures of four D1D2F43C derivatives in complexes with HXBc2 core gp120 and Fab fragment of 17b (
All four derivatized HX complexes were crystallized isomorphously with the original ternary complex composed of gp120, 17b Fab and D1D2 (HX-WT) (Kwong et al. 1998; Kwong et al. 2000) in the same space group P2221 and with very similar unit cell dimensions (Table 3.5). The crystallization solutions for all four complexes were similar to that for HX-WT complex but seeding technique was indispensable for obtaining any crystal with decent diffraction quality (see Materials and Methods). Diffraction data were collected for HX-SNS-10 (
Except for the addition of the extra compounds in the Phe43 cavity introduced through modification of Cys43 of D1D2, the final models of all four structures contain essentially the same residues as those in HX-WT structure. Due to resolution limits, a few disordered residues, mostly in loop regions of gp120 lacking interpretable electron densities, were not built in the four new structures. Similarly, fewer solvent molecules were built in the derivatized complexes compared to HX-WT. As expected, the overall domain structures of all four derivatized HX complexes are similar to that of HX-WT. The introduction of the compounds to the Phe43 cavity in the derivatized HX complexes, however, does result in noticeable conformational changes in gp120 (
#PDB-ID: 1RZJ
$ IC50 in blocking gp120-CD4 binding in a competition ELISA (See Example II).
Interaction of the Phe43 Cavity with Derivatized D1D2
Unlike Phe43 in the HX-WT complex, all extensions from F43C residue in the derivatized D1D2F43C protrude into Phe43 cavity and make extensive interactions with cavity-lining residues (
In addition to the same placement of the acetamide groups in the cavity for the CD4 derivatives in the same binding mode, the sulfur atoms of Cys43 and the chemical groups that are linked to the acetamide groups and extend further into the cavity (e.g. phenyl group for D1D2F43C-SNS-10) are also positioned similarly in the cavity (
The CD4 mimetic protein, CD4M33, interacts with the Phe43 cavity in gp120 in a mode highly similar to that of D1D2F43C-SNS-10. The upper phenyl ring of residue 33 in CD4M33 superimposes very well with the phenyl ring in D1D2F43C-SNS-10. Although CD4M33 contains no acetamide group for hydrogen-bonding with gp120, its lower phenyl group binds to similar position on gp120 as the acetamide groups for D1D2F43C-SNS-10 and D1D2F43C-SNS-40 (
Besides the hydrogen bonds mentioned above, the carbonyl oxygen of DN-234 (the compound name is used for referring to the corresponding chemical group that is attached to Cys43 through modification of Cys43 by this compound) also makes a hydrogen bond with a water molecule, HOH47 in the HX-DN-234 structure. HOH47 is also coordinated with Gly473 of gp120 and Cys43 of D1D2F43C, at the same time (
With Phe43 residue replaced by the modified cysteines, all seven gp120 residues that interacts with side chain of Phe43 in HX-WT complex are now in contact (defined by non-hydrogen interatomic distance less than 4 Å) with the modified cysteines from four derivatized CD4 (Table 3.1 and
As the result of extensive interactions between the cavity and different CD4 derivatives, enormous changes in the shape and volume of the Phe43 cavity take place. Except for gp120 bound to D1D2F43C-SNS-10 (abbreviated as gp120SNS-10, the Phe43 cavities in gp120SNS-14, gp120SNS-40, and gp120DN-234 are enlarged substantially compared to that in gp120 bound to wild type D1D2 (gp120D1D2). A positive linear correlation is be found between the size of the Phe43 cavities and the molecule weight of the chemical entities residing in the cavities for all four complexes of gp120 and derivatized D1D2 (
Among the cavities in these gp120 proteins bound to the derivatized CD4, the cavity in gp120DN-234 is the largest, showing an increase of 50% in cavity volume (calculated by removing all compounds in the cavity; see Materials and Methods) over that of gp120D1D2 (
The shape of the Phe43 cavity bound to the derivatives is also modified and displays high shape complementarity to the compound that binds into the cavity (
Plasticity of gp120 and Identification of Highly Flexible Regions of gp120
The plasticity of gp120 displayed in its adapting the size and the shape of the Phe43 cavity to different compounds motivated us for further characterization of the critical residues for gp120's plasticity and identification of all flexible regions in gp120 that may not be restricted to the immediate vicinity of the cavity. Superimposition of gp120 bound to D1D2 and D1D2 derivatives revealed mostly main chain movements (along with corresponding side chain movements) but not rotamer change in the side chains in multiple regions in gp120 that are not necessarily close to the cavity (
The main chain movements between gp120 molecules bound to differently derivatized D1D2, although obvious by visual inspection, are not large (0.5-2 Å). Some of the gp120 regions that have the largest movements are actually located in intrinsically flexible regions of gp120, such as variable loops. Structures of HX-WT complex and derivatized HX complexes also have different coordinate accuracy, which should be taken into account when comparing these structures. Therefore, we chose to use an objective method, namely error-scaled difference-distance matrices (Schneider 2000; Schneider 2002; Schneider 2004), for accurate structure comparison of these HX complexes.
The difference-distance matrices calculate the difference between the distance between the Cα atoms of one pair of residues in a structure and that of the corresponding pair in another structure. This distance difference is independent of the alignment of the structures of the interest. In the error-scaled difference-distance matrices, the elements of the matrices are further normalized based on the estimated errors (σ) of the coordinate precision for each structure and individual atom, allowing unbiased study of structural similarity and difference between related structures (Schneider 2000).
In our analysis, the program ESCET was used for calculation of the error-scaled distance-difference matrices of gp120 models, extracted from differently liganded-tertiary complexes, using a 1.3σ cutoff (
In the calculation of difference matrices of different gp120 models, the gp120 residues were also distance-sorted based on the ascending distance of the Cα of each residue to the center of the Phe43 cavity defined by the position equivalent to that of atom C4 of the phenyl ring of residue 43 in D1D2F43C-SNS-10. Inspection of all pairwise comparisons between different gp120 models revealed that gp120D1D2 was not considered to be identical to any of the gp120 models bound to derivatized D1D2, whereas all four gp120 structures complexed with derivatized D1D2 were considered to be the same (Table 3.2). Consistent with the larger sizes of Phe43 cavities in gp120SNS-14 and gp120DN-234, more structural differences were noted in difference matrices between gp120D1D2 and these two gp120 models compared to the difference between gp120D1D2 and gp120SNS-10/gp120SNS-40. Based on the distance-sorted error-scaled difference-distance matrices (
All D1D2 derivatives bind gp120 in similar ways and introduce similar conformational changes in gp120 (
These two gp120 models were subjected to rigid-body analysis in the ESCET program with the following parameters: nhyp=20, wp=20.0, ε1=1.3, εh=4, rmut=5.0%. Out of 293 gp120 residues common to both gp120D1D2 and gp120DN-234, 255 residues (85-105, 118-208, 214-248, 254-375, 378-397, 412-420, 431-443, and 446-491) were identified as invariant region. The remaining 38 residues of gp120 (106-117, 209-213, 249-253, 376-377, 410-411, 421-430, and 444-445) were identified as flexible. These flexible residues of gp120 are located to α1, loop between β3 and β4, β8, β16, V4 loop, β20, and β22 respectively (
Except for residue 410-411, which belong to intrinsically variable regions (V4 loop), the remaining 36 gp120 residues that show significant structural rearrangement upon binding D1D2F43C-DN-234 are located at conserved regions of gp120 and are mapped primarily to the inner domain and bridging sheet. With respect to gp120D1D2, residues 106-117, 209-213, 376-377, and 444-445 in gp120DN-234 move away from the Phe43 cavity whereas residues 249-253 and 421-430 move closer to the Phe43 cavity. Most of these 36 residues, however, do not interact directly with D1D2F43C-DN-234 at all (
CD4-gp120 Interface Other than the Phe43 Cavity
Overall features of the interface between gp120 and D1D2 derivatives other than the Phe43 cavity are the same as that of gp120-CD4. In the HX-DN-234 complex, all gp120 residues contacted by CD4 interact with D1D2F43C-DN-234, except for a residue in the V5 loop, Asn460, which is disordered and not built in gp120DN-234. New interfacial gp120 residues that contact D1D2F43C-DN-234 are located mostly around the cavity region (
Although most of the interactions are conserved between D1D2 and gp120, some of them especially the hydrophilic interactions between gp120 and residue 42 to 45 in CDR2 loops of D1D2 derivatives are weakened primarily due to main chain movements of CDR2 loops in D1D2 derivatives and to less extent caused by that of β15 in gp120. In the HX-DN-234 complex, these main chain movements of both D1D2F43C-DN-234 (average 0.48 Å Cα movements in residue 42-45) and gp120 away from the location of Phe43 cavity result in an average increase of 0.14 Å in the length of 7 hydrogen bonds (
gp120-17b Interactions in the Presence of Derivatized CD4
Although some residues at the gp120-binding site were identified as structurally variant regions by ESCET when comparing structures of gp120-bound 17b in the presence of either D1D2 or D1D2F43C-DN-234, most of gp120-17b interactions are unchanged. Interestingly, many 17b-interacting gp120 residues are located either close to or belong to the regions that show substantial movement upon D1D2F43C-DN-234 binding (
Thermodynamic Analysis of the Binding of Derivatized D1D2 and 17b to gp120
The crystal structures of gp120 complexed with derivatized D1D2 revealed that core gp120 undergoes structural rearrangement upon binding to derivatized D1D2. To extend our understanding of the biological relevance of these conformational changes in gp120, we further studied the thermodynamics of the binding of gp120 to derivatized D1D2 and the binding of gp120 to 17b antibody in the presence of saturating concentrations of derivatized D1D2 using isothermal titration calorimetric experiments. Wild type D1D2 and two D1D2 derivatives, D1D2F43C-SNS-10 and D1D2F43C-DN-234, which binding the Phe43 cavity of gp120 in different modes, were used in the thermodynamic study.
Direct Binding of Wild-Type and Derivatized D1D2 to gp120
As reported earlier (Myszka et al. 2000; Leavitt et al. 2004), D1D2 binds gp120 with an unusually large and favorable enthalpy change, ΔH, and a large unfavorable entropy term, −TΔS (Table 3.4). In comparison, both D1D2 derivatives, especially D1D2F43C-DN-234, bind gp120 with smaller values for the favorable ΔH and the unfavorable −T□S compared to that of wild-type D1D2. The measured Kd values for the binding of D1D2 or D1D2 derivatives to gp120 agree well with IC50 values reported in Example II: D1D2F43C-SNS-10 binds gp120 with similar affinity as that for D1D2 whereas D1D2F43C-DN-234 binds gp120 with less affinity.
The temperature dependence of the enthalpy change, i.e. the change in heat capacity ΔCp, for direct binding to gp120 is significantly different for the wild-type D1D2 compared to the two derivatized forms, D1D2F43C-SNS-10 and D1D2F43C-DN-234. Binding of D1D2 to gp120 is associated with an extremely large negative change in heat capacity of −1800 cal/(K×mol), a value similar to that obtained for protein folding. ΔCp for binding of gp120 to D1D2F43C-SNS-10 and D1D2F43C-DN-234, however, are 22% and 33% less than that for binding of gp120 to D1D2, valued at −1400 and −1200 cal/(K×mol) respectively (Table 3.4).
A binding reaction associated with large favorable enthalpy and large unfavorable entropy changes together with a large negative change in heat capacity is characteristic of a process that involves large conformational changes. These changes in entropy and heat capacity can be analyzed as the equivalent number of unfolded residues that become conformationally restricted upon complexation (Luque et al. 1998). Such an analysis of the values presented here shows that binding of wild-type D1D2 to gp120 generates an ordering equivalent of about 120 residues whereas D1D2F43C-SNS-10 structures about 90 and D1D2F43C-DN-234 only 80 residues.
Enhancement of gp120 Binding to the Co-Receptor Site
Binding of CD4 to gp120 leads to the formation of chemokine-receptor binding site on gp120 (Trkola et al. 1996; Wu et al. 1996). The binding of derivatized D1D2-bound gp120 to 17b, a gp120 antibody that recognizes the chemokine-receptor site, is used to assess the formation of co-receptor site on gp120 upon derivative binding. It has been observed that the binding of 17b to gp120, especially to core gp120, can be greatly enhanced in the presence of CD4 (Kwong et al. 2002; Huang et al. 2005). Here, we found that in the presence of D1D2, the binding affinity of 17b to YU2 gp120 is enhanced by 4 fold, characterized by a 0.8 kcal/mol. increase in AG value (
Structural characterization of the Phe43 cavity as a binding site for diverse ligands, in the context of SAR study (Example II), provides us with a comprehensive understanding on the molecular details of how gp120 recognizes the cavity-targeting ligands. The four D1D2 derivatives, whose structures are presented here, bind the Phe43 cavity through two distinct modes (
Mode II binding is found in the binding of bulkier groups, namely naphthalene and benzyloxy-phenyl groups (D1D2F43C-SNS-14 and D1D2F43C-DN-234), into the cavity, featuring i) weakening of the hydrogen bond seen in mode I and ii) the enlargement of the cavity entrance by an alternative Met475 rotamer configuration.
Different locations of the ligands in these two binding modes suggest that there are more than one optimal binding configurations in the Phe43 cavity. Mode II binding, however, is probably less favored than mode I binding not only because of the weakening of the hydrogen bond but also because the new rotameric conformation of Met375 results in its slightly repulsive interaction (3.2 Å) with Trp479. The penalty for the widening of the cavity entrance is probably responsible for the low affinity binding between gp120 and aliphatic ligands with branches that crowds the entrance (SAR study, Example II). D1D2F43C-SNS-14 and D1D2F43C-DN-234, on the other hand, may compensate the lost of affinity by engaging extensive favorable interactions within the cavity, as evidenced by the nearly perfect shape complementation of DN-234 and the cavity (
D1D2F43C-DN-234, a mode II binder, has largest aryl group attached to the acetamide moiety among all high affinity D1D2 derivatives identified in Example II with IC50 less than 10 nM and has the largest potential for hydrophobic interactions with the cavity. D1D2 derivatives with smaller aryl group, if binding gp120 in mode II, should have affinity to gp120 no greater than that for D1D2F43C-DN-234. Thus the best gp120-binding derivatives, e.g. D1D2F43C-SNS-12 and D1D2F43C-DN-52, whose affinities to gp120 double that for D1D2F43C-DN-234, most likely do not recognize gp120 in mode II. Furthermore, D1D2F43C-SNS-12 and D1D2F43C-DN-52 are essentially derivatized D1D2F43C-SNS-10 with the para position of the phenyl group substituted with isopropyl and nitro group respectively. Although alternative binding modes other than mode I and II may exist, carefully inspection of the interface of gp120 and D1D2F43C SNS-10 (mode I binding) suggests that the addition of either isopropyl or nitro group at the para position to SNS-10 should fit perfectly in the unoccupied space in the “two corners” of the hearted-shaped cavity (
In summary, the structural information on the binding of core gp120 and the derivatized D1D2 proteins strongly support the results of SAR study using full length gp120 and indicates that characteristics of the Phe43 cavity are conserved from core gp120 to full length gp120. These findings should also help the design of next-generation cavity ligands, in a structure-based fashion.
gp120 Plasticity
The flexibleness of the Phe43 cavity to different ligands suggests that the Phe43 cavity, like rest of D1D2 interface on gp120, binds its ligand in an induced-fit mechanism. The adaptability of gp120 arises mostly from main chain but not side chain movements. Met475 and Phe382 are the only two interface residue in gp120 that adopt different rotameric conformation in binding D1D2F43C-SNS-14 and D1D2F43C-DN-234 from that in binding D1D2.
All gp120-(derivatized D1D2) interfacial residues that belong to outer domain (altogether 23 residues), except for residues 376-377 of β16, show no significant main-chain positional adjustment compared to gp120D1D2 (
Interestingly, main chain atoms of residues 472-475, which are located on tip of α5 (at junction between outer and inner domains) and the loop connecting α5 to β24 (outer domain), are found to be structurally rigid despite the fact the side chain of Met475 is flipped in the reconstruction of the Phe43 cavity upon binding its ligands. Although Met475 only contacts derivatized D1D2, residues 472-474 interact with wild-type D1D2 and its derivatives. In conclusion, gp120-(CD4 derivative) interfacial residues in gp120 outer domain except for those from β16 are structurally rigid, whereas most interfacial residues that are located in the inner domain or bridging sheet except for α5 have high degree of flexibility in binding cavity-filling ligands.
In addition to the above 9 D1D2-contacting gp120 residues that are flexible (“hotspot” residues), other 27 gp120 residues that do not directly contact D1D2F43C-DN234 were also identified to move significantly upon binding D1D2F43C-DN234 (
These results are consistent with the conservation of the outer domains between structures of free SIV gp120 and CD4-bound gp120, except for CD4-binding strand β15 (Chen et al. 2005) and its vicinity strands including β16, whose flexibility was also observed in the current study. Helix α5, especially its N-terminus, identified as rigid here, is also the only secondary structural element in the inner domain and the bridging sheet that displays very little dislocation in comparing structures of free SIV gp120 and liganded-gp120.
Pathway of Motion Propagation in gp120 Bound by Derivatized D1D2
The 27 gp120 residues that do not contact derivatized D1D2 yet display structural rearrangement, are located in 6 different segments including α1, β8, β16, β20, β22, and the loop between β3 to β4. Their motions are most likely transuded from the movements observed for derivatized D1D2-contacting residues, which include the 9 “hotspot” residues and residues Met475 and Phe382, for which the rotamer change is the primary mechanism used for binding derivatized D1D2. However, for the extremely high structural variance observed for residue 209-213 in the loop connecting β3 and β4, we could not rule out the possible contribution of the intrinsic flexibility, since this region is also identified as flexible when comparing gp120 from different HIV-1 strains (data not shown) and is completely disordered in the structure of free SIV gp120 (Chen et al. 2005).
By analyzing the inter-residue contacts between the non-CD4 contacting residues and the CD4 contacting residues, we propose here a possible pathway for the motion propagation from the Phe43 cavity to more remote areas in gp120 (
Co-Receptor Binding and Conformational State of gp120 Bound by Derivatized D1D2
It is surprising to observe conformational changes in multiple segments in gp120derivatized-D1D2 compared to gp120D1D2. CD4-restrained gp120 appears to still have considerable flexibility even in the regions that are directly bound by wild type D1D2. Even more surprisingly, the conformational changes can be propagated to the chemokine-receptor/17b binding sites despite the fact that 17b is also present in the determined gp120 complexes and should further rigidify gp120. The indispensable usage of the 17b antibody in X-ray crystallographic study of gp120 bound by the derivatized CD4 limits our capability to study the real conformation of gp120 bound only by the derivatized CD4, although this conformation of gp120 should not differ too much from that bound by both 17b and the derivatized CD4, based on relatively small entropy changes in the binding of derivatized CD4-bound gp120 to 17b (
The bridging sheet has been found to be the binding sites for 17b (Kwong et al. 1998; Kwong et al. 2000) and presumably for the chemokine receptors. A strand in the bridging sheet, β20, moves noticeably in binding of D1D2 derivatives to gp120. In addition, conformational changes have also been located to other residues involved in chemokine-receptor binding that were identified by mutagenesis study (Rizzuto et al. 1998) including Lys177 in α1, Asn377 in β16, and Arg444 in β22. Although no significant difference was found when comparing the gp120-17b interface in the presence of D1D2 to that in the presence of D1D2 derivatives, probably due to the inclusion of 17b in the complexes for structural study, thermodynamic study revealed reduced affinity of 17b and gp120 pre-bound with derivatized D1D2 instead of D1D2 (
This finding suggests that the real conformations of gp120 in solution with the Phe43 cavity bound to the derivatized D1D2 are indeed different than that of D1D2-bound gp120. The structural difference between gp120 bound to D1D2 and its derivatives in the absence of 17b may also be greater than what is observed in the crystal structures of gp120D1D2 and gp120derivatized-D1D2 solved in the presence of 17b. Smaller unfavorable entropy changes, less □Cp and fewer structured residues in gp120 binding to the derivatized D1D2 further support the notion that D1D2-bound gp120 adopts a much less structured state when Phe43 cavity is filled by proper ligands. This intermediate state between free and D1D2-bound gp120 may involve reorganization of identified flexible secondary structural elements in our study, especially, α1 and β20/21, implicated by the relatively easy conversion of free SIVgp120 structure to a near CD4-bound conformation through rearrangement of α1 and β20/21 (Pan et al. 2005). The degree of the reorganization in solution, however, must be greater that what is shown in the crystal structures, which are highly constrained by both lattice and 17b binding. Other possible mechanism in achieving the intermediate state may involve stabilization of the inner domain in a conformation between free and pre-bound conformation by the specific interactions between the inner domain residues (Trp112 and Met475) and derivatized D1D2, which are absent in gp120 and wild type D1D2 binding.
Earlier study has suggested that the stabilization of the Phe43 cavity in gp120 by a S375W mutation drives gp120 into a conformation similar to CD4-bound state (Xiang et al. 2002) presumably by rigidifying gp120 in providing a hydrophobic core at the nexus of inner domain, outer domain, and bridging sheet. Our study on derivatized CD4, however, indicates that filling the Phe43 cavity and CD4-binding do not work additively but rather counteractively in structuring gp120 in the conformation for co-receptor binding. This theory helps in explaining the greatly reduced capability (10% of WT) of gp120 S375W in supporting HIV-1 infection (Xiang et al. 2002). It is possible that the cavity, in addition to its presence in CD4-bound gp120, also exists in other intermediate state of gp120 between free and bound form, in which the rigidification (filling) of the cavity is preferred more than in the CD4-bound state. Potential small-molecule drugs for the cavity should introduce even less structure reorganization in free gp120 than the derivatized D1D2 and therefore may have better chance in viral neutralization, as seen with gp120 antibodies (Kwong et al. 2002).
Our lack of success in the identification of D1D2 derivatives with sub-nanomolar affinity to gp120 had been puzzling (Example II) and now becomes clearer with the help of the structural and thermodynamic studies. The usage of D1D2 scaffold, while stabilizing the cavity for targeting, reduces gp120's plasticity in binding ligands in the cavity and imposes penalty for the conversion of gp120 from the D1D2-bound conformation to a less structured state stabilized by the filling of the cavity. On the other hand, the binding of D1D2-scaffold to gp120 is also reduced, which is evidenced by the weakened hydrogen bonds at gp120-D1D2 interface. These reasons lead to overall reduction and masking of the favorable interaction of the Phe43 cavity and the ligands. Further optimization of cavity-binding ligands could benefit from exploitation of smaller CD4-like scaffold that structures gp120 less.
In summary, the structural studies of binding of gp120 with the derivatized D1D2 provides molecular basis for our previous SAR study and reveals high plasticity of gp120 in binding the cavity-targeting ligands. With help of the thermodynamic study, we conclude that cavity-filled gp120 adopts an intermediate conformation between the CD4-bound and free states even in the presence of the D1D2 scaffold. This study should benefit the future design of new cavity ligands especially with the help of smaller scaffold.
Derivatized D1D2 proteins were prepared as described in Example II. Recombinant endoglycosidase D (Endo D) (Muramatsu et al. 2001) was produced in E. coli using a periplasmic expression vector pBAD/gIII (a gift from Takashi Muramatsu) and was purified by ammonium sulfate precipitation and size exclusion chromatography. The preparation of the other reagents for forming gp120-containing complexes were similar to that described in previous studies of gp120 complexes (Kwong et al. 1998; Kwong et al. 1999; Kwong et al. 2000). Brief descriptions of the procedures are listed below.
The human monoclonal antibodies of gp120, 17b and F105, were produced with both in-house cell culture and ascites (Strategic BioSolutions) (both hybridoma cell lines are provided by R. Wyatt) and then purified by protein-A affinity chromatography. Fab fragment of 17b were generated by papain digestion. Briefly, 17b was first reduced by 50 mM DTT for 1 h at 37° C. then dialyzed into 100 volumes of 20 mM HEPES, pH7.8, 350 mM NaCl at 4° C. for 1 h to decrease DTT concentration to 0.5 mM. Alkylation of 17b by iodoacetamide was achieved by further dialyzing the antibody into 100 volumes of 20 mM HEPES, pH7.8, 350 mM NaCl and 4 mM iodoacetamide for 24 hr at 4° C. An additional dialysis with same alkylating buffer that is devoid of iodoacetamide (overnight, 4° C.) was used for removal of extra iodoacetamide. Alkylated 17b was then concentrated and digested using ImmunoPure Fab Preparation Kit (PIERCE). The product of digestion was further purified by size exclusion chromatography on S-200 column (Pharmacia).
Recombinant gp120 core (Δ82 deltaV1/V2*ΔV3ΔC5) (83-127 GAG 195-297 GAG 330-492) (Kwong et al. 1999) from laboratory-adapted HXBc2 strain and primary isolate YU2 strain were produced in Drosophila Schneider 2 (S2) cells (obtained from R. Wyatt) under the control of an inducible metallothionein promoter as described previously (Wu et al. 1996). Briefly, the S2 cells in suspension culture were grown in protein-free medium (Insect express media from BioWhittaker), 5% fetal bovine serum and 300 μg/ml hygromycin B (Roche Diagnostic) and expression of core gp120 proteins was induced by addition of 750 mM CuSO4 for 7 days at 25° C. Affinity chromatography was used for purification of gp120 proteins by passing cell supernatants over a F105-sepharose column.
The affinity column was then extensively washed with PBS/0.5 M NaCl. gp120 proteins were then eluted with 100 mM glycine.HCl, pH 2.8, followed by immediate neutralization with 1M Tris, pH 11. The core gp120 proteins were concentrated to 2 mg/ml (determined by 280 nM absorbance), treated with protease inhibitor cocktail (Roche) and stored in −80° C.
The preparation and crystallization of the ternary complexes composed of HXBc2 core gp120, 17b Fab and derivatized D1D2F43C were similar as described previously (Kwong et al. 1998; Kwong et al. 1999) except D1D2 was substituted with each of the four derivatized D1D2F43C proteins, including D1D2F43C-SNS-10, D1D2F43C-SNS-14, D1D2F43C-SNS-40 and D1D2F43C-DN-234. Although gp120 and 17b Fab were produced and purified almost identically as described previously, derivatized D1D2F43C proteins were produced from different resource and have different N- and C-termini from D1D2 proteins used for previous studies (Kwong et al. 1998; Kwong et al. 1999). As discussed in Example II, derivatized D1D2F43C proteins were expressed in E. coli and refolded, whereas previous studies used D1D2 expressed as soluble protein from CHO cells. Derivatized D1D2F43C proteins also have one additional Gly at its N-terminal compared to D1D2 used before and lack the two non-CD4 C-terminal residues that D1D2 has. A flow chart of the preparation is shown in
Briefly, gp120 was first deglycosylated by endoglycosidase D (50 ug recombinant Endo D per 1 mg gp120) and endoglycosidase Hf (Endo Hf) (New England BioLabs) (45 unit per 1 ug gp120) at 20° C. for 6-12 hours at pH 6.0. One of the derivatized D1D2F43C proteins was then added to the solution at a molar ratio of 120:100 for stabilizing the deglycosylated gp120. Concanavalin-A (Con A) column (Sigma) was then used for removing any glycosylated gp120 from the deglycosylated ones. These complex of gp120 and derivatized D1D2F43C was then purified by size exclusion chromatography on a Superdex 200 16/60 (Pharmacia), by which Endo D and free D1D2F43C derivative were separated from the complex of gp120 and D1D2F43C-derivative. Extra Fab fragment of 17b and more derivatized D1D2F43C were further added to the binary complex and the protein mixture was again purified on a Superdex 200 16/60 to isolate free 17b Fab, D1D2F43C derivative and Endo Hf from the final ternary complex, which was then concentrated by Amicon Ultra-15 30K concentrator (Millipore) to a final concentration of 6-15 mg/ml and either stored at −80° C. or used for crystallization. The whole purification processes of the complexes were monitored by SDS-PAGE (
Crystallization of the ternary complexes of HXBc2 gp120:17b:D1D2F43C-derivatives was carried out using vapor-diffusion in hanging-drop method as described previously for the complex of HXBc2 gp120, 17b and wild type D1D2 (Kwong et al. 1998; Kwong et al. 1999). Briefly, a droplet containing 0.5 μl of protein and 0.5 μl precipitation solution was composed on glass coverslip and suspended over 0.5 ml reservoir solution in a sealed well over time at 20° C., rendering increased concentrations of both protein and precipitant in the droplet and ultimately the formation of the crystals (McPherson 1999). Only crystal showers were obtained by this approach and subsequently microseeding technique (McPherson 1999) was used for producing crystals suitable for X-ray analysis.
The largest crystals obtained for all four complexes in this study were needles with cross-section about 25 μm by 25 μm (
YU2 core gp120 instead of HXBc2 core gp120 was also tried out in forming similar ternary complexes as described above. The preparation of the complexes was similar as described previously (Kwong et al. 2000). Three complexes were formed: YU-WT (YU2 gp120:17b Fab:D1D2), YU-SNS-10 (YU2 gp120:17b Fab:D1D2F43C-SNS-10) and YU-SNS-40 (YU2 gp120:17b Fab:D1D2F43C-SNS-40). Unfortunately, none of these complexes was able to crystallize isomorphously with the originally reported YU-WT (Kwong et al. 2000). A different crystal of YU-WT with much smaller unit cell was able to form under the similar crystallization condition published for the original YU-WT complex (
Crystals of the ternary complexes of HXBc2 gp120 with 17b and derivatized D1D2 proteins were crosslinked, stabilized and flash-frozen at 100 K similarly as reported previously (Kwong et al. 1998; Kwong et al. 1999; Kwong et al. 2000). Briefly, the crystals were crosslinked by vapor diffusion using 25 μl of 1% glutaraldehyde (Sigma) in a crystallization bridge (Hampton Research) placed in the reservoir containing 500 μl of reservoir solution (see above) for 1 h at room temperature. They were then washed by reservoir solution and transferred to stabilizing solution (10% ethylene glycol, 10% 1,6-hexanediol, 10% PEG 4k, 100 mM Na3Citrate, pH 5.6, 2.5% 2R, 3R butanediol and 2.5% sucrose). Right before data collection, paratone-N (Hampton Research) was used for replacing the external liquid surrounding the crystals, which were then immediately mounted in cryoloop of 20 μm diameter with loop diameter of 0.05 mm (Hampton Research) and flash-frozen in liquid nitrogen.
X-ray diffraction data were collected either at beamline X4A of the National Synchrotron Light Source, Brookhaven National Laboratory (HX-14 and HX-DN234 complexes) or at beamline 191D of the Advanced Photon Source (APS), Argonne National Laboratory (HX-10 and HX-40 complexes). 100-180 degrees of oscillation data were collected for each complex with half degree oscillation per image to avoid overlapping spots due to high mosaicity (0.8°-1.5°) and large unit cell dimension. The HKL-2000 program package (Minor 1997) was used for data processing and reduction.
All four structures were solved by “rigid body” replacement using the HX-wt (HXBc2 gp120:17b Fab:D1D2) complex (PDB-ID: 1G9M) as a starting model in which the Phe43 of D1D2 was mutated to Cys43, i.e. the mutated (F43C) starting model was rigid-body refined against the data of the new complexes. Torsional angle-simulated-annealing protocol, carried out by CNS (Brunger et al. 1998), was then used for further refinement of the models and then the initial Fo-Fc maps were generated and used for manual building of the compounds introduced by the derivatization of D1D2F43C, O software (Jones et al. 1997) and Coot (Emsley et al. 2004) were used for building/rebuilding of all models throughout. The new models were then further refined using CNS (simulated annealing, positional refinement, individual isotropic B value refinement, and automatic water pick and deletion) (Brunger et al. 1998) ARP-wARP (Perrakis et al. 1997) and Refmac5 (Murshudov et al. 1997) of the CCP4 program suite (CCP4 1994) combined with manual rebuilding until the Rfree value converged. At the later stage of the refinement, the sequences of 17b antibody were also corrected based on a newer structure of HX-WT (PDB-ID: 1RZJ) and all four structures were refined. Data collection and refinement statistics are given in Table 3.5. The final models of four complexes all contains residues 2-181 of D1D2F43C plus corresponding modifications on Cys43 and residues 1-214 for the light chain of Fab fragment of 17b. A few sections in gp120 and 17b heavy chains were not built in the final models due to lack of the electron density. Among residues 1-228 in the heavy chain of 17b, residues 143-147, 142-147, 143-147 and 142-148 are missing in HX-SNS-10, HX-SNS-14, HX-SNS-40 and HX-DN-234 respectively. The residues of gp120 built in these four complexes are 84-126:196-299:329-397:410-491, 85-126:196-299:329-397:410-459:463-491, 84-126:196-299:329-397:410-460:463-491, and 85-126:196-299:329-397:410-459:464-491 respectively. The modification group on the Cys43 of each complex is named PAM (N-phenyl-acetamide) for HX-SNS-10, NYA (N-naphthalen-1-yl-acetamide) for HX-SNS-14, PEM (N-phenethyl-acetamide) for HX-SNS-40, and BPS (2-(Benzyloxy-phenyl)-acetamide) for HX-DN-234.
#Numbers in parentheses represent the statistics for the data in the outer shell (10%).
§R = Σhkl||Fobs| − k|Fcalc||/Σhkl|Fobs|, RFree was calculated using 5% of all reflections that were never used in the refinement.
&http://kinemage.biochem.duke.edu/molprobity/
Calculation of error-scaled difference-distance matrices and identification of flexible regions in gp120 were carried out with ESCET (Schneider 2000; Schneider 2002). gp120 residues used in ESCET were sorted based on the distance of the Cα of each residue to the center of the Phe43 cavity defined by atom C4 of the phenyl ring of residue 43 in D1D2F43C-SNS-10. Superimpositions of different gp120 proteins were calculated with LSQKAB of CCP4 (CCP4 1994) suite using the invariable regions identified by ESCET. RMSDs of superimposed structures were calculated using LSQMAN (Kleywegt et al. 1997).
Volumes of the Phe43 cavity in different models were calculated using the MS program (Connolly 1993). Only gp120 and D1D2 were included for the calculation. All compounds, solvent molecules or other entities in the cavity were removed for this calculation. Phe43, water molecules #43 and #218 from the complex of gp120 and wild-type D1D2 (PDB-ID 1RZJ), onto which all other models were superimposed, were included in the coordinates for all other models for cavity calculation to ensure an unbiased volume calculation by blocking all three openings of the cavity. All structural figures were prepared using PyMOL (DeLano 2002).
Isothermal titration calorimetric experiments were performed using a high-precision VP-ITC titration calorimetric system from MicroCal Inc. (Northampton, Mass.). Direct binding to gp120 was studied in experiments where the calorimetric cell, containing 3 μM gp120, was titrated with a solution of 30 μM D1D2, D1D2F43C-SNS-10, or D1D2F43C-DN-234. All reagents were dissolved in PBS (Roche Diagnostics GmbH), pH 7.4. The binding of D1D2 or D1D2 derivatives was studied at different temperatures in the range of 15-37° C. Binding of MAb 17b to gp120 was studied by stepwise additions of 15 μM (30 μM of Fab-sites) to the calorimetric cell containing 3 μM YU2 gp120 by itself or equilibrated with 5 μM of wild-type or derivatized D1D2. The effect of 17b on the binding of derivatized D1D2 to gp120 was studied by stepwise addition of D1D2 or any of the derivatives to a mixture of gp120 and 17b. All titrations were performed by adding the titrant in steps of 10 μL. All solutions were properly degassed to avoid any formation of bubbles in the calorimeter during stirring. The heat evolved upon each injection of inhibitor was obtained from the integral of the calorimetric signal. The heat associated with binding to gp120 in the cell was obtained by subtracting the heat of dilution from the heat of reaction. The individual heats were plotted against the molar ratio and the enthalpy change, □H, and association constant, Ka=1/Kd, were obtained by non-linear regression of the data.
The CD4 scaffold approach described in previous Examples has been proved to be powerful in characterizing the Phe43 cavity and related gp120 plasticity. This approach has also significantly benefited our interactive process of screening and structure-based design of gp120-CD4 inhibitors that specifically target the Phe43 cavity. Although we still have some distance from the identification of a high-affinity small-molecule drug lead that functions as gp120-CD4 antagonist, this approach, with improvement, will continue to aid the SAR and structure-based optimization of the Phe43 cavity-targeting ligands. Eventually, our goal is to identify a compound that has high enough affinity for gp120 and will become active while scaffold-free.
Based on our understanding acquired from this study, the choice of scaffolds that rigidifies gp120 less well may be more beneficial especially for screening ligands with higher affinity to the Phe43 cavity. Also, proper identification and incorporation of fragments mimicking Phe43 and Arg59 into the ligands that bind inside the Phe43 cavity should lead to a new generation of inhibitors that targets multiple sites on gp120 with higher affinity and should advance the progress of lead discovery. In the following section, I will present the proposal and the preliminary results on both approaches (
Structural-activity analysis and especially the structural information on the binding of gp120 to different D1D2 derivatives have provided ample information on promising directions of ligand optimization. For examples, identification of unoccupied space in the top right corner (
One of the alternative scaffolds for the cavity-targeting ligand attachment is the D1D2 mutant with even weaker affinity to gp120 than D1D2F43C. Arg59 of CD4, together with Phe43, are two major determinants at CD4-gp120 interface (Kwong et al. 1998). Mutation of Arg59 to Ala or Gln reduces affinity of CD4 to gp120 by 8.8 and 2.9 fold respectively (Moebius et al. 1992; Brand et al. 1995). Thus D1D2F43C:R59A could be a better scaffold than D1D2F43C in screening of cavity ligands by restraining gp120 less.
We have expressed and purified this D1D2 mutant as described in Example II with a lower yield about 40% of that for D1D2F43C. Selected D1D2F43C:R59A derivatives have also been made using a group of compounds that have been shown to render D1D2F43C derivatives with high affinities (IC50<35 nM) to gp120. IC50 values for the D1D2F43C:R59A derivatives have been measured using same procedures for D1D2F43C derivatives described in Example II and were compared with the IC50 values of corresponding D1D2F43C derivatives (Table 4.1 and
The preliminary results showed that for the modification similarly favored on D1D2F43C scaffold, more distinct difference now is seen when they are attached to D1D2F43C:R59A, evidenced by IC50 rations ranging from 2 to 50. Encouragingly, some of the best modifications identified in D1D2F43C context, such as modification by SNS-10, SNS-12 and DN-189, were found to perform even better in D1D2F43C:R59A scaffold. In conclusion, D1D2F43C:R59A scaffold with micromolar affinity to gp120 allows better distinguishment of cavity-binding ligands and should be a good candidate for scaffolds of next generation.
Small peptide mimic for CD4 is another possibility for new modification scaffold. The synthesis and modification of the small peptide, however, are more difficult in general compared to that for the proteins. We had two peptides commercially synthesized (Alpha Digonostic International) to test their potentials as a scaffold (Table 4.2). G1C is designed based on peptide G1-6 (named G1F here) (Choi et al. 2001), which has been shown to inhibit gp120-CD4 binding with micromolar IC50. C14Cn was designed by us to mimic the CDR2 loop of D1 domain in CD4 by using a cyclic peptide that is prone to adopt β-hairpin configuration.
We have successfully obtained the derivatives of these two peptides by compound SNS-10. Unfortunately, none of the unmodified or modified peptides displayed any inhibitory effects on gp120-CD4 binding at the concentrations of 1 mM (data not shown). In addition, G1 peptide, as the positive control peptide for G1C peptide, did not inhibit gp120-CD4 binding at 1 mM either, in contrast to the previous report (Choi et al. 2001). We conclude that the neither G1C nor C14Cn is suitable as a new modification scaffold due to little appreciable interaction with gp120. Another possible candidate of the new peptide scaffolds is CD4F33 with its Phe33 replaced by the reactive cysteine. The introduction of an additional cysteine in this mimetic that contains 3 pairs of endogenous disulfides, however, may cause problems in peptide folding.
In addition to the Phe43 cavity, the vestibule to the cavity (binding site for Phe43 of wild type D1D2) and the binding sites for Arg59 of wild type CD4 are also good target sites for inhibitor design. Ligands that bind more than one site mentioned above should be advantageous than the ligands for only the Phe43 cavity. Usage of F43C site of D1 for ligand attachment eliminates the possibility of full screening for the chemical groups suitable for the sites that wild type Phe43 and Arg59 bind to. Using Arg59 as tethering point for ligand screening against all three sites is a plausible idea. Consequently, we have produced D1D2F43A:R59C (F43G should be better choice). As expected, our preliminary results of using compound library designed for only targeting the Phe43 cavity did not yield derivatives with high affinity to gp120. Surprisingly, D102F43A:R59C-Iodoacetamide has an IC50 value of 180 nM, which is much lower than that for unmodified D1D2F43A:R59A (601 nM) and is also comparable to that for D1D2F43C (206 nM). It suggests that the double hydrogen bonds that between Arg59 of CD4 and Asp368 of gp120 are at least partially restored by the acetamide group in D1D2F43A:R59C-Iodoacetamide. Based on preliminary modeling results on D1D2F43A:R59C, we have designed potential modification compounds composed of fragments that target either Arg59 site, the vestibule to the cavity or the Phe43 Cavity (
Another more direct and possibly more risky approach for identification of multi-site targeting compounds is to screen compounds derived from the favorable ligands identified by our SAR study. The derivation could be done by attaching a chemical reactive module (e.g. sulfur, azide groups or the original bromine atom) to a cavity-favored compound (such as DN-52 or SNS-12) and then by reacting the active cavity-targeting compound with a compound library. The generated new library of compounds can be screened either for their inhibition for viral entry or for their affinities to gp120. Click chemistry could also be tried by reacting alkynes against azide-bearing cavity-targeting compounds in situ [Lewis, 2002 #4641].
It is also worth in exploring the possibility of direct attachment of our best cavity-binding group (e.g. DN-52 or SNS-12) to the identified small molecules mimicking either Phe43 or Arg59 of CD4 with the help of computational modeling. DN2149 is a small molecule designed by D. Ng and M. Head to mimic both Phe43 and Arg59 (unpublished data). It binds gp120 with nanomolar affinity. Although its binding site on gp120 has not been experimentally proved, it would be interesting to find out if it could work synergistically with the cavity-targeting groups.
In conclusion, the door to the structure-based design of HIV entry inhibitors has been opened and the search will continue.
The invention disclosed herein was made with United States government support under grant number GM56550 from the National Institutes of Health. Accordingly, the United States government has certain rights in this invention.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US06/47907 | 12/14/2006 | WO | 00 | 12/17/2008 |
Number | Date | Country | |
---|---|---|---|
60750464 | Dec 2005 | US | |
60789467 | Apr 2006 | US |