The invention relates to new substances, which bind to the PDZ-domain of proteins, the uses of substances, which bind to the PDZ-domain of proteins, and means for identification of compounds, which bind to the PDZ-domain of a protein.
Originally, the PDZ-domain was identified as a common element in three structurally related proteins, namely PSD-95/SAP90, DLG and ZO-1 (Garner et al., TRENDS IN CELL BIOL., 6:429-433 (1996); Craven et al., CELL, 93:495-498 (1998); Hutter et al., NEURO SCI. Res., 32:1-7 (1998)). The PDZ-domain is also called DLG homology region (DHR) or GLGF Repeat. The latter is based on the presence of a Gly-Leu-Gly-Phe sequence motive. The PDZ-domain comprises approx. 90 amino acids, and crystallographic investigations of PSD-95, SAP97 and CASK show that it is formed of two alpha helices and six beta sheets (Daniels et al., NAT. STRUCT. BIOL. 5:317-325 (1998); Doyle et al., CELL, 85:1067-1076 (1996)).
PDZ-domains, which can be found 785 times in 436 different human genes, belong to one of the most important protein classes in the human genome (Kay et al., Chemistry & Biology, 11:423-424 (2004)). PDZ-domains control the localization, the clustering, the recycling and the cell membrane expression of many receptor, transport and ion channel proteins (Dev, K.K., Nat. Rev. Drug Discov. 3:1047-1056 (2004)).
By recruiting downstream proteins in signalization pathways, PDZ-domains mediate the generation of specific multi-protein complexes. Proteins, which contain PDZ-domains, play an important role in many key pathways, including conservation of the polarity and morphology of epithelial cells, organization of the postsynaptic density in neural cells, and regulation of the activity and transport of membrane proteins. The consequence of this is that substances, which bind to the PDZ-domain, can specifically modulate such proteins or protein complexes and have therefore a particular therapeutic potential.
This principle of modulation of PDZ-domains could be shown by blocking peptide ligands in different cell culture and animal models (Dev, K.K., Nat. Rev. Drug Discov. 3:1047-1056 (2004)). Focal brain damages by ischemia in a rat model could be reduced by peptide ligands, which block the interaction between the PDZ-domain of the protein PSD-95 and N-methyl-D-aspartate receptors (NMDARs), which is a new approach for the therapy of strokes (Aarts et al., Science 298:846-850 (2002)). Furthermore, it could be shown that the blocking of the second PDZ-domain of the protein MAGI3 by irreversible, synthetic small-molecule inhibitors leads to a three times higher activity of a cancer-relevant enzyme in a cell culture model [Fujii et al., J. Am. Chem. Soc. 125:12074-12075 (2003)).
It is therefore the technical object of the invention to indicate compounds, which are capable to bind to the PDZ-domain of a protein, where in absence of a modulator the interaction with natural protein ligands takes place. Further, it is the technical object of the invention to indicate means for identification of such compounds.
For achieving this technical object, the invention teaches the use of a compound according to Formula I
wherein R1 and R2 are ═O or ═S and are identical or different, wherein R2 may alternatively be two —H, wherein R3 is ═CHR4, —CH2R4 or —CHR5R4, wherein R4 is phenyl; phenyl simply, doubly or triply substituted with -Hal or —C(Hal)n (n=1, 2, or 3); 2-, 3-, 4-, or 5-thienyl; 2-, 3-, 4-, or 5-thienyl simply, doubly or triply substituted with -Hal or —C(Hal)n (n=1, 2, or 3): 2-, 3-, 4-, or 5-furyl; 2-, 3-, 4-, or 5-furyl simply, doubly or triply substituted with -Hal or —C(Hal)n (n=1, 2, or 3); or C1-C5 alkyl, linear or branched, wherein -Hal is —F, —Cl, —Br, or —I, wherein R5 is C1-C5 alkyl, linear or branched, or —CH2—CO—N(R6)2 with R6=C1-C5 alkyl, linear or branched, wherein free valences of the ring are bound with hydrogen, wherein the ring —S— may be replaced by —O—, —CH2— or —CO—, or a physiologically well-tolerated salt of such a compound for preparing a pharmaceutical composition for modulation of a protein containing a PDZ-domain.
Generally, the term C1 to C5 includes alkyl methyl, ethyl, propyl, isopropyl, butyl, isobutyl, tert-butyl, pentyl, isopentyl, tert-pentyl and neopentyl.
Preferred substances are characterized by that R1 is ═S, wherein R2 is ═O, wherein R3 is ═CH—R4, and/or that R4 is 3-thienyl, phenyl, or phenyl substituted in para with -Hal, preferably —Br, and/or that R1 is ═S, wherein R2 is ═O, wherein R3 is —CH2R4 or —CHR5R4, and R4 is 3-furyl or phenyl substituted in para with —CHal3, preferably —CF3, wherein R5 is methyl or —CH2—CO—N(CH3)2, and/or that R1 is ═O, wherein R2 is ═O, wherein R3 is ═CH—R4, and/or R4 is isopropyl or phenyl substituted in meta or para with —CHal3, preferably —CF3, and/or R1 is ═O, wherein R2 is ═O, wherein R3 is —CH2—R4, and wherein R4 is phenyl substituted in para with -Hal, in particular —Br, or with —CHal3, in particular —CF3.
Furthermore, the invention teaches the use of a compound according to Formula II
wherein R1 is -Hal or C1-C5 alkyl, linear or branched, wherein R2 is —H, and wherein R3 is —NO2, -Hal, C1-C5 alkyl, linear or branched, wherein instead of R1 and R2 a ring with —CH═CHal-CH═CH— may be formed, or wherein instead of R2 and R3 a ring with —CH═CH—CH═CH— may be formed, wherein -Hal is —F, —Cl, —Br, or -J, wherein free valences of the ring are bound with hydrogen, or a physiologically well-tolerated salt of such a compound for preparing a pharmaceutical composition for modulation of a protein containing a PDZ-domain.
Then it is preferred that R3 is —Br or tertiary-butyl, and/or R1 is —Cl, —Br, or tertiary-butyl, and/or that R1 is —Br, wherein instead of R2 and R3 a ring with —CH═CH—CH═CH— is formed, and/or that R3 is —Br, wherein instead of R1 and R2 a ring with —CH═CBr—CH═CH— is formed.
Furthermore, the invention teaches the use of a compound according to Formula III
wherein R1 is —NO2 or -Hal, wherein R2 and R3 are identical or different and are —H or -Hal, wherein -Hal is —F, —Cl, —Br, or -J, wherein free valences of the rings are bound with hydrogen, or a physiologically well-tolerated salt of such a compound for preparing a pharmaceutical composition for modulation of a protein containing a PDZ-domain.
Then it is preferred that R1, R2 and R3 are -Hal, in particular —Cl, and/or that R1 is —NO2, wherein R2 and R3 are —H.
A pharmaceutical composition according to the invention is prepared by mixing the compound in a physiologically effective dose with at least one auxiliary or carrier substance.
This galenic preparation of a pharmaceutical composition according to the invention may be performed in a conventional manner. As counter ions for ionic compounds may for instance be used Na+, K+, Li+ or cyclohexylammonium. Suitable solid or liquid galenic forms of preparation are for instance granulates, powders, dragees, tablets, (micro) capsules, suppositories, syrups, juices, suspensions, emulsions, drops or injectable solutions (i.v., i.p., i.m., s.c.) or fine dispersions (aerosols), transdermal systems, and preparations with protracted release of active substance, for the production of which usual means are used, such as carrier substances, explosives, binding, coating, swelling, sliding or lubricating agents, tasting agents, sweeteners and solution mediators. As auxiliary substances are named here magnesium carbonate, titanium dioxide, lactose, mannite and other sugars, talcum, milk protein, gelatin, starch, cellulose and derivatives, animal and vegetable oils such as cod-liver oil, sunflower, peanut or sesame oil, polyethylene glycols and solvents, such as sterile water and mono or multi-valent alcohols, for instance glycerin.
As indications are mentioned here: cancer, schizophrenia, depressions and states of anxiety, Parkinson's disease, Huntington's disease, Alzheimer's disease, epilepsy, chronic and neuropathic pain, aberrations of the hormone-regulated food intake. PDZ-domains play an important role for the modulation of proteins, which are associated with these diseases (Dev, K.K., Nat. Rev. Drug Discov. 3:1047-1056 (2004)).
A plurality of the compounds mentioned above and falling under Formula I are not known from prior art. Therefore, the invention also teaches a compound according to Formula I
wherein R1 and R2 are ═O or ═S and are identical or different, wherein R2 may alternatively be two —H, wherein R3 is ═CHR4, —CH2R4 or —CHR5R4, wherein R4 is phenyl; phenyl simply, doubly or triply substituted with -Hal or —C(Hal)n (n=1, 2, or 3); 2-, 3-, 4-, or 5-thienyl; 2-, 3-, 4-, or 5-thienyl simply, doubly or triply substituted with -Hal or —C(Hal)n (n=1, 2, or 3); 2-, 3-, 4-, or 5-furyl; 2-, 3-, 4-, or 5-furyl simply, doubly or triply substituted with -Hal or —C(Hal)n (n=1, 2, or 3); or C1-C8 alkyl, linear or branched, wherein -Hal is —F, —Cl, —Br, or -J, wherein R5 is C1-C5 alkyl, linear or branched, or —CH2—CO—N(R6)2 with R6=C1-C5 alkyl, linear or branched, wherein free valences of the ring are bound with hydrogen, wherein R4 is not 2-furyl or 2-thienyl, if R1 is ═O or ═S, and if R3 is ═CH—R4, wherein R4 is not phenyl or phenyl substituted in meta or para with —CF3 or —Br, if R1 is ═O or ═S, if R2 is ═O, and if R3 is ═CH—R4 or —CH2—R4, wherein R4 is not phenyl, if R1 is ═O or ═S, if R2 is ═O, and if R3 is ═CH—R4, wherein R4 is not phenyl substituted in para with —Br, if R1 and R2 are ═O, and if R3 is —CH2—R4, wherein R4 is not isopropyl, if R1 is ═S, if R2 is ═O, and if R3 is ═CH—R4, wherein the ring —S— may be replaced by —O—, —CH2— or —CO—, or a physiologically well-tolerated salt of such a compound.
Herein it is preferred that R1 is ═S, wherein R2 is ═O, wherein R3 is ═CH—R4, and/or that R4 is 3-thienyl, phenyl, or phenyl substituted in para with -Hal, preferably —Br, and/or that R1 is ═S, wherein R2 is ═O, wherein R3 is —CH2—R4, and R4 is 3-furyl or phenyl substituted in para with —CHal3, preferably —CF3, and/or that R1 is ═O, wherein R2 is ═O, wherein R3 is ═CH—R4, and/or that R4 is isopropyl or phenyl substituted in meta or para with —CHal3, preferably —CF3, and/or that R1 is ═O, wherein R2 is ═O, wherein R3 is —CH2—R4, and wherein R4 is phenyl substituted in para with -Hal, in particular —Br, or with —CHal3, in particular —CF3, and wherein R5 is methyl or —CH2—CO—N(CH3)2, and/or that R1 is ═S, wherein R2 is ═O, wherein R3 is —CHR5R4, and wherein R4 is phenyl substituted in para with -Hal, in particular —Br, or with —CHal3, in particular —CF3, and wherein R5 is methyl or —CH2—CO—N(CH3)2.
The above compounds bind to the PDZ-domain of proteins and are thus suitable to modulate the respective proteins or the complexes formed by these proteins in the cell, i.e. to activate or inhibit them. Thus, it would for instance be recommendable to inhibit such proteins, which turned out to be differentially expressed at the event of a disease, for instance a cancer disease, i.e. regulated up or down in correlation with the disease.
Subject matter of the invention is further a method for identification of a modulator of a protein containing a PDZ-domain, wherein a structural model of a modulator candidate optionally is first compared to a structural model of a reference compound, which binds to the PDZ-domain, and is pre-selected in case of an overlap of bioisosteric atoms, and wherein the structural model of the if applicable pre-selected modulator candidate is compared with a structural model of the protein or a structural model of a complex of protein and modulator candidate is investigated and it is determined, whether the modulator candidate binds to the PDZ-domain, wherein the structural model of the protein or of the complex is derived i) from structural coordinates of the complex with the reference compound, ii) from a fragment of the complex containing a PDZ-domain with the reference compound, or of a homolog to i) or ii). Basically, this is a screening method based on a structural model, wherein prospective compounds, in particular compounds with a molecular weight under 5,000 down to 1,000 or less, are so to speak “tried on” with the structural model of the PDZ-domain or of the protein. If by the comparison a binding is detected, the prospective compound is selected and may then be submitted to further tests for the development of pharmaceutical compositions.
A preferred variant of the method according to the invention is that the structural model of the protein or of the complex is obtained with a reference compound, which is compound 1 (Formula I, with R1=═S, R2=═O, R3=—CH2—R4 and R4=phenyl substituted in para with CF3). Herein, the protein in the complex assumes a particular conformation, which does not occur in any of the previously published structural models of PDZ-domains and comprises a new hydrophobic binding pocket, which can offer a decisive contribution to the binding strength of the modulator candidate. Therefore, the structural coordinates according to the invention of the protein in the complex with compound 1 present the protein in a particular small-molecule binding conformation according to the invention, which is advantageous for the success of the method according to the invention.
The comparison consequently can take the structural model of the complex of the protein with compound 1 as a model for matching new, unknown test compounds with the bound compound 1. This method is called ligand-based virtual screening. If herein bioisosteric atoms of a modulator candidate overlap with the atoms of compound 1, without repulsive overlappings with the protein taking place, the modulator candidate can be identified as a new modulator. If a modulator candidate in addition to the above conditions fills up one or several further binding pockets of the protein, which are not covered by compound 1, the modulator candidate can be identified as a new modulator with a higher affinity and/or a higher specificity.
The comparison can consequently comprise the combination of the structural model of the modulator candidate, and that after a pre-comparison of the modulator candidate with compound 1, with the structural model of the protein, and that based on the structural coordinates according to the invention of the complex, wherein optionally the free binding energy of the binding between modulator candidate and protein is determined, and wherein with low free binding energy a high binding probability is detected. The free binding energy can be calculated: a) by addition of the free energies of interatomic contacts between the structural model of the modulator candidate and the structural model of the protein, and/or b) by determination of the free binding energy between the force field of the modulator candidate and the force field of the protein.
The invention consequently also relates to the use of the structural coordinates in the small-molecule binding conformation with compound 1 i) of the protein AF6, ii) of a fragment of AF6 containing a PDZ-domain, or iii) of a homolog to i) or ii) for identification of a modulator of a protein containing a PDZ-domain, preferably in a screening method according to the invention. The invention further comprises a machine-readable storage medium containing machine-readable data, which after reading-out and processing by means of a data processing system with a suitable software provide a representation of the structural model according to the invention of a protein or of a complex, a computer software with a software code for carrying-out a method according to the invention or a use according to the invention, and a data processing system comprising a computer software according to the invention and a machine-readable storage medium according to the invention.
In the following, the invention is explained in more detail with reference to embodiments.
In
An “S” marks new substances, which were synthesized. “p”, “P” and “R” are prior art substances, wherein commercially available substances are marked with “p”. Substances marked with “P” have been published—with synthesis pathways. Strong-binding substances are in addition framed with a continuous line. Weak-binding substances are in addition framed with a broken line. Non-binding substances do not have an additional frame.
When the non-binding compounds 31 and 32 in
A comparison between the compounds in
Compounds with R3=—CH2—R4 and R4 being isopropyl (compound 21), furanyl (compounds 4), thiophenyl (compound 5) and six-membered aromatic rings with small substituents of one or no C atom at all show a binding (compound 1, 3), whereas six-membered aromatic rings with larger substituents of more than two C atoms (compounds 29, 30) do not show any binding. Small substituents at the six-membered aromatic ring of R4 may in principle be at the ortho (compound 15), meta (compound 14) or also para (compound 1) positions, however the substitution position leading to a binding depends on the type of the substituent as well as on variations in other parts of the compound.
A particularly strong activity, i.e. binding to the PDZ-domain, show the following compounds: 1, 7, 9, 6 and 2.
A mixture of 2,4-thioxothiazolidine-one (20 mmol), aryl or alkyl aldehyde (20 mmol), piperidine (16 mmol) and EtOH is cooked for 18-24 h under reflux, then poured into H2O and acidified with acetic acid. The solid reaction product is recrystallized from methanol, methano/water or ethanol/water solvency.
The compounds in
These compounds can be prepared according to Example 3.1, followed by the following steps: A 2.0-molar solution of lithium borohydride (2.2 equivalents) in tetrahydrofuran (THF) is dropped under stirring into a solution of 5-arylidene-4-oxo-2-thiazolidinethione in pyridine and THF under nitrogen atmosphere at room temperature. The mixture is heated under reflux, until the reduction reaction is accomplished (approx. 3 to 5 h). The mixture is then cooled down, carefully added into a diluted hydrochloric acid solution in distilled water at 5° C. and multiply extracted with ethyl acetate. The ethyl acetate extracts are combined, washed with water, dried over MgSO4 and concentrated at the rotation vaporizer. The raw product is purified by means of silica gel flash chromatography, wherefrom the final product results.
The compounds in
Compound 33 in
Compounds 31 and 32 in
Compound 16 in
Compound 6 in
Compounds 35-37 in
The compounds 39-40 in
A complex of the human protein AF6 with compound 1 of
The protein backbone band model of the average structure of the 20 lowest-energy structures in the complex with compound 1 (
The surface representation of the PDZ-domain in the complex with compound 1 (
Many atoms of this residue are nearly or completely in van-der-Waals distance to atoms of the protein, with the consequence that at these positions no substituents with more than two C atoms are allowed. This result is in accordance with the non-binding compounds 29 and 30 in
The ubiquitarily 15N marked PDZ-domains of the proteins AF6 [Boisguerin, P. et al., Chem. Biol. 11:449-459 (2004)] and Syntrophin [Schultz, J. et al., Nat Struct Biol., 5(1):19-24 (1998)] were prepared as described in the publications. For the NMR assay 50 μM 15N marked protein are mixed in a buffer of 20 mM Na phosphate pH 6.5, 50 mM NaCl and 10% d6-DMSO with a 8 to 10-fold molar excess of test ligand and a 15N-HSQC NMR spectrum is recorded at a temperature of 300 K on a Bruker DRX 600 MHz NMR spectrometer. Secondly, an identical comparison spectrum of protein in absence of test ligand is recorded. The NMR spectra are processed with the aid of the software XWIN-NMR and compared with each other with the aid of the software Sparky. When the comparison shows chemical shifts of more than one resonance and less than about half of all resonances, then it is a binding ligand. Exemplarily is shown such a comparison of spectra for the PDZ-domain of AF6 in
For the determination of equilibrium dissociation constants (in short: Kd values), a test ligand is added step-wise in increasing concentrations to 50 μM protein solution, and for each concentration step a 15N-HSQC NMR spectrum is recorded under the same conditions as described above. The average chemical shift of a resonance, d, is calculated according to the following equation from the NMR raw data:
d=((ΔH)2+(ΔN/5)2)0.5
wherein ΔH is the shift in ppm units along the 1H resonance axis, and ΔN is the shift in ppm units along the 15N resonance axis. The average shift, d, is drawn as a function of the concentration of the test ligand, and a binding curve results, to which is adapted the following mathematical equation:
y(Lo)=a−(a2−Lo/Po)0.5
with
a=(Po+Lo+Kd)/(2*Po)
wherein Lo is the total concentration of ligands, Po is the total concentration of protein and Kd is the Kd value. This equation describes the law of mass action of a reaction “protein+ligand=complex”. The binding curve for the binding of compound 1 to the PDZ-domain of AF6 is shown in
With the aid of a software such as e.g. “Origin 5.0”, the adaptation is optimized in a non-linear manner, thereby providing for each investigated resonance a microscopic Kd value. From the mean value of all relevant microscopic Kd values, a macroscopic Kd value is calculated. In Table 2, the macroscopic Kd values including the statistical standard deviation for the binding of some test ligands to the PDZ-domains of AF6 and of Syntrophin are shown.
The basis for the complex structure-based method for finding new modulators is a correlation between chemical variations of compound 1, the binding affinity and the binding site occupation of the protein.
Furthermore, from a comparison of the shown amino acid residues between two compounds, information about the differential occupation of partial binding sites can be obtained. When e.g. compound 1 is compared with compound 21, the amino acid residues L25, S26 and V46 occur in both compounds, however the residues S75 to T91 are missing in compound 21. Considering the position of these residues S75 to T91 in the 3D structural model of the protein compound complex of Table 1, a localization is found in the environment of the trifluoromethylphenyl residue of compound 1. Since this trifluoromethylphenyl residue in compound 21 is reduced to a substantially smaller isopropyl residue, the missing of the amino acid residues S75 to T91 in compound 21 can be correlated with a missing occupation of the partial binding site of the trifluoromethylphenyl residue. In an analogous manner, it follows from a comparison of the compounds 46 and 1 that compound 46 does not or only insufficiently occupy the binding sites L25 to V28 and S75 to T91 of compound 1.
For a selection of 3 compounds, the Kd values for the PDZ-domains of AF6 and Syntrophin are shown in Table 2. Considering the fact that the two PDZ-domains are proteins with only a low small-molecule binding capacity (low drugability), the Kd value of 101 μM for compound 1 (enantiomers mixture with regard to atom position 5) indicates an unexpectedly strong binding. In comparison therewith, a natural hexa-peptide ligand shows a Kd value of only 137 μM [Wiedemann, U. et al., J. Mol. Biol. 343(3):703-18 (2004)], although the hexa-peptide has a 2.3 times higher molecular weight than compound 1.
The number behind the ± symbol indicates the statistical standard deviation of a mathematical curve adaptation to the chemical binding equation “protein+ligand=complex”.
Number | Date | Country | Kind |
---|---|---|---|
10 2005 022 182.3 | May 2005 | DE | national |
This is a national stage application based on PCT/DE2006/000779, filed Apr. 28, 2006, which claims priority from DE 10 2005 022 182.3, filed May 9, 2005.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/DE2006/000779 | 4/28/2006 | WO | 00 | 3/5/2009 |