The present invention relates to methods for building an atomic model of a protein molecule using Microsomal Glutathione Transferase 1 (MGST1) and methods for determining a drug candidate compound that interacts with proteins of the Membrane Associated Proteins in Eicosanoid and Glutathione Metabolism (MAPEG) superfamily, e.g., MGST1.
The present invention provides methods for building an atomic model of a protein molecule. The methods comprise: (a) identifying a protein molecule with at least 20% sequence identity with Microsomal Glutathione Transferase 1 (MGST1) and (b) utilizing the atomic coordinates of MGST1 to obtain an atomic model of the identified protein molecule.
The present invention also provides methods for determining drug candidate compound. The methods comprise: A method for determining a drug candidate compound that interacts with Microsomal Glutathione Transferase 1 (MGST1) comprising: (a) identifying a drug candidate compound that interacts with MGST1 and (b) analyzing the interaction of the drug candidate compound with MGST1 or other members of the MAPEG protein superfamily.
The following Detailed Description maybe more fully understood in view of the drawings, in which:
Oxidative stress and exposure to toxic compounds are constant threats to living organisms. Efficient protection systems involving specific enzymes have emerged throughout evolution. Glutathione transferases, GSTs, are playing a crucial role in cellular biotransformation of electrophilic compounds through binding and positioning the tri-peptide glutathione (GSH), γ-L-glutamyl-L-cysteinyl-glycine, for nucleophilic attack. Distinct, ancient glutathione transferase protein families of cytoplasmic, microsomal, mitochondrial and bacterial origin have been identified based on sequence and structural similarity. In addition to the detoxification role played by GSTs, homologous members within the families may carry out distinctly different functions. Soluble GSTs from mammals, plants, bacteria and insects have been well characterized structurally and subdivided into several classes. The canonical cytosolic enzymes are dimers and related by evolution to glutaredoxin having the unique thioredoxin βαβαββα fold.
The inventors have determined the first detailed structure of a GST from the microsomal family, MGST1 (Microsomal Glutathione Transferase 1). Like most soluble GSTs, MGST1 catalyses conjugation of GSH to a number of electrophilic compounds and is therefore playing an important role in phase 2 biotransformation. In addition, MGST1 protects biological membranes from degradation through GSH dependent reduction of unspecifically peroxidised phospholipids. The microsomal GSTs, more recently termed MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione Metabolism) also contains members that are crucial for synthesis of mediators of fever, pain and inflammation. These pathophysiological responses are regulated by GSH dependent transformations of specific oxidised lipid intermediates, prostaglandin H24 epoxide, to prostaglandin E2 and leukotriene C4 respectively. Thus, variations of a hitherto unknown common protein structure have emerged to selectively catalyse distinct activities with strong physiological and pathophysiological significance.
Upon reconstitution into lipid bilayers at low lipid/protein ratio MGST1 forms two-dimensional crystals of two different two-sided plane groups, p22121 and p6, both suitable for analysis by electron crystallography. The inventors have now solved the structure of the rat enzyme to 3.2 Å resolution using data from both crystal forms (
The present structure of MGST1 demonstrates distinct roles played by conserved amino acid residues among MAPEG members. Polar intrasubunit interactions facing the core of the monomer are important for TM2/TM3 stabilisation (
In order to explore similarities to other proteins, noted at lower resolution, the inventors performed structural comparison using DALI with both the MGST1 monomer and trimer as search models. A striking correspondence is found to subunit I of ba3-cytochrome c oxidase (
Two-dimensional crystallisation of MGST1 required a lower lipid to protein ratio as compared to other similarly grown 2D crystals of membrane proteins resulting in atomic models. Thus, it is expected that interaction between the hydrophobic transmembrane exposed belts of the protein would be tight. In fact, the locations of trimers in the two-dimensional crystals form topologically equivalent dimers of trimers (
In the present study MGST1 is crystallised together with a saturating concentration of the thiol donor substrate (1 mM GSH). In accordance with recent nanospray mass spectrometry studies demonstrating three bound GSH molecules per trimer, the inventors observed a non-protein density close to the inner face of TM1, TM2 and domain E interpreted as the substrate molecule (
A fundamental aspect of glutathione transferase catalysis involves the stabilisation of the reactive nucleophilic thiolate anion form of GSH. The presence of the GSH thiolate has been demonstrated by spectroscopy in both soluble glutathione transferases and MGST1. In soluble glutathione transferases a tyrosine or serine hydroxyl has been shown to hydrogen bond to the GSH thiolate and thereby lower its pKa by several units to ≈6. In MGST1 GSH is surrounded by several residues that are potential hydrogen bond donors (H75, Y120, E80, R37/72/73). Since replacement of H75 and Y120 does not affect the activity to the extent predicted (and indeed observed) for a residue that is responsible for lowering the pKa of the GSH thiol, it may be concluded that MGST1 thiolate stabilisation operates by a novel interaction. While not wishing to be bound by theory, the inventors suggest that an arginine, most likely R72, fulfils this role in MGST1 and that the resulting unique charge compensation underlies the enzyme's ability to preferentially conjugate extremely hydrophobic substrates such as reactive chlorofluorocarbons.
The inventors' structure including three GSH molecules bound to the MGST1 homotrimer supports a catalytic mechanism having third of the sites reactivity. The structural basis for active site communication is most likely the interlinked GSH coordination by residues in TM2 where each helix contributes both inter- and intrasubunit coordination (
Most of the relatively few membrane protein structures known are responsible for selective, active or passive translocation across membranes of cargo reaching in size from ions to proteins. In comparison, the primary function of MGST1 is to catalyse chemical reactions with implications regarding approach of the hydrophilic GSH molecule and hydrophobic second substrates. Like oligomeric transporting membrane proteins MGST 1 has a central cavity, here facing the cytosol, but it rapidly narrows in the centre of the trimer making it impermeable to water or any larger molecule (
Projection maps of two other MAPEG members, human MPGES1 and LTC4S, show striking similarity to MGST1. Thus it is likely that the structural principles for catalysis as displayed by MGST1, being completely different from that of functionally similar soluble proteins and thus representing a novel structural solution, will be representative for the MAPEG superfamily. In view of the 37% sequence identity between MGST1 and MPGES1 a homology model for these two proteins is expected to be particularly revealing. Thus the principal elements contributing to the difference in mechanism and substrate specificity including the broad substrate acceptance of MGST1 versus the selectivity of MPGES1 for PGH2 may now be identified through complementary investigations.
In summary, while not wishing to be bound by theory, the inventors propose a novel catalytic mechanism involving subunit interactions that takes advantage of the membrane location of MGST1 allowing entry of hydrophobic substrates to the active site both from the cytosol and the phospholid bilayer. The latter entry point is catering for extremely hydrophobic molecules that hardly leave the membrane. In addition, by utilising charge compensation for thiolate anion stabilisation the enzyme allows small extremely hydrophobic substrates to approach the thiolate much more efficiently.
By determining the atomic structure of a key enzyme or the MAPEG superfamily, MGST1 (Microsomal Glutathione Transferase 1), the inventors are able to determine the specificity and function of these proteins that are potential targets for treatment of common diseases such as rheumatoid arthritis and asthma, the results should impact on drug development.
Accordingly, the inventors have invented methods for building an atomic model of a protein molecule comprising: (a) identifying a protein molecule with at least 20% sequence identity with Microsomal Glutathione Transferase 1 (MGST1) and (b) utilizing the atomic coordinates of MGST1 to obtain an atomic model of the identified protein molecule. In one embodiment, the protein molecule is a MAPEG protein molecule. In another embodiment, the protein molecule has at least 30% sequence identity with MGST1. One skilled in the art will appreciate the various models and candidates that may be produced by knowing the atomic coordinates of one protein. In one embodiment, the atomic model comprises a homology model, which is obtained using a modelling software program. In another embodiment, the atomic model comprises an experimental model, which is obtained with molecular replacement.
Once a protein molecule has been identified, a drug candidate compound that interacts with the identified protein molecule or with MGST1 may be determined. In one embodiment, a drug candidate compound is identified by using the atomic structure of the identified protein molecule to design a drug candidate compound. In another embodiment, the drug candidate compound is identified by (a) contacting the drug candidate compound with the identified protein molecule; and (b) measuring for a change in the expression or activity of the identified protein molecule.
One skilled in the art will appreciate the various means in which a drug candidate compound interacts with a protein molecule. In one embodiment, a drug candidate compound that decreases the expression or activity of the protein molecule indicates that the compound is an inhibitor of the protein molecule. hi another embodiment, a compound that increases the expression or activity of the protein molecule indicates that the compound is a promoter of the protein molecule. One skilled in the art will appreciate the various methods for contacting the drug candidate compound with a protein molecule, any of which may be employed herein. One skilled in the art will also appreciate the various methods for measuring a change in the expression or activity of the protein, any of which may be employed herein.
The interaction of the drug candidate compound with the identified protein molecule may be analyzed. In one embodiment, the interaction of the drug candidate compound with the active site of the identified protein molecule is analyzed with a docking-program. In addition, the structure of the interaction of the candidate compound with the protein molecule may be obtained using molecular replacement.
Furthermore, one skilled in the art will appreciate that in order to comprehensively determine a suitable drug candidate compound that interacts with the protein molecule at least one catalytic position of the protein molecule may be mutated prior to contacting the drug candidate compound with the protein molecule. In one embodiment, the mutation comprises a substitution of at least one amino acid. In another embodiment, the mutation comprises a deletion of at least one amino acid.
Specimen preparation. MGST1 is purified from rat liver microsomes and two-dimensional crystals with p6 and p22121 two-dimensional plane group symmetry are prepared by reconstitution into phospholipid bilayers at molar lipid to protein ratios varying between 3 and 5 as previously described. The quality of the reconstituted crystals are evaluated by electron microscopy of negatively stained specimens. Excellent crystal preparations are selected for cryo-electron microscopy. These specimens are prepared on carbon coated molybdenum grids and embedded in 3-7% (w/v) trehalose using the back injection technique. Specimens selected for high-tilt data collection are prepared by the carbon sandwich technique.
Data collection. Specimens of both crystal forms are imaged at tilt angles ranging from 0° to 62.9° at 4K specimen temperature on Kodak SO-163 film using a JEOL3000SFF electron microscope equipped with a liquid He cooled stage and a field-emission gun operated at an acceleration voltage of 300 kV. The images are subsequently developed for 12 minutes in full-strength D19 developer. Electron diffraction patterns are recorded using a FEI CM120 electron microscope equipped with a TVIPS 1k×1k, and the JEOL3000SFF initially fitted with a Gatan 2k×2k and later with a Gatan 4k×4k slow-scan CCD camera respectively.
Data processing. Image negatives are evaluated by optical diffraction and scanned using a Zeiss SCAI scanner with a pixel size of 7 μm. Several rounds of computational unbending and correction for the contrast transfer function (CTF) are performed by the MRC program system (6) to produce initial 3D data sets for the different crystal forms. Tilt-angles and CTFs accurately determined during the first processing round are used for reprocessing all good images in a second round of unbending using the MAKETRAN program procedure. Final image amplitudes and phase values are extracted following tilted CTF and beam tilt correction. Diffraction amplitudes collected on the three different CCD cameras are processed using the MRC electron diffraction software. Image phases and electron diffraction amplitudes are merged using the LATLINE program from the MRC suite.
Map calculation. Maps are calculated from reflection files containing data to 3.04 Å (p6) and 3.09 Å (p22121) using CCP4 software. To evaluate the degree of non-crystallographic symmetry (ncs) in the p22121 crystal form, monomers from the unsymmetrised map are correlated. The correlation coefficients at 120° and 240° rotations are 0.82 demonstrating a high similarity between the subunits and thus the ncs in the centre of the trimer are applied. To quantify correspondence between the maps from the hexagonal and orthorhombic crystal symmetries, trimers from each map are aligned and correlated resulting in a maximum overall correlation coefficient of 0.84. The high degree of correspondence allowed calculation of a cross crystal averaged trimer containing data from both crystal forms.
Model building. Model building is performed in O initially using a combination of the three calculated maps that are skeletonised and fitted with secondary structure poly-alanine templates. Subsequently, mutations to the appropriate amino acids are performed using identifiable side chains. Residues N9 to F43 and L62 to L147 making up 78.6% of the polypeptide, could be built corresponding to all of the trans-membrane parts of the helices and to a large extent also extra-membranous regions.
Refinement and rebuilding. Given its higher symmetry and completeness the p6 crystal data set are used for refinement of the model while the p22121 data are used as an independent test and quality control data set. The initial monomer model are translated and rotated into the p6 map. Symmetry related monomers are generated in the p6 map in O rendering a trimer that are transferred and rebuilt into the appropriate position in the p22121 map. Numerous cycles of rigid body and restrained refinement using very tight geometry restraints of the model are performed against the p6 electron diffraction data set truncated at 3.2 Å using REFMAC5 including geometry idealisation and manual rebuilding in O. At the present resolution the difference between using electron- or X-ray form factors are negligible, thus X-ray values are used throughout the refinement. 2Fo-Fc and Fo-Fc maps are continuously generated to evaluate the accuracy and quality of the refined model although the experimental map are used for all rebuilding steps in order to avoid model bias. Reliable Rfree values are obtained by using a final fraction of 9.4% of the observed structure factor amplitudes. In addition to the conventional and free R-factors the refinement process are monitored by comparing the p22121 Fo- and Fc-values. This residual denoted R are calculated as the conventional R-value between observed p22121 amplitudes and corresponding calculated amplitudes from the model following translation and alignment of the trimer to the orthorhombic unit cell without any further refinement. The initial Rort are close to 60% before refinement. PROCHECK are used to examine the molecular geometry.
Structural alignment. For structural comparison the monomer and trimer are submitted to the DALI/FSSP server. The top seven hits, all having a Z-score above 5.0 are selected for structural alignment using CE.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB07/00824 | 3/22/2007 | WO | 00 | 12/18/2008 |
Number | Date | Country | |
---|---|---|---|
60785288 | Mar 2006 | US |