The present invention relates to a novel three-dimensional homology model (structure) of Glycogen Synthase Kinase3-Alpha (GSK3-Alpha) and uses thereof. More specifically, the invention relates to a method of screening modulators which modulate the activity of GSK3-Alpha using a novel homology three-dimensional model of GSK3-Alpha.
The Glycogen Synthase Kinase3 (GSK3), a serine/threonine protein kinase family, comprises of Alpha and Beta isoforms, with integral roles in many aspects of cellular metabolism, proliferation, differentiation and development by phosphorylating and modulating the activity of a number of regulatory proteins in certain cell signaling pathways. Hence, GSK3 has been implicated in various diseases like diabetes, Alzheimer's disease, CNS disorders such as manic depressive disorder and neurodegenerative diseases, and cardiomyocete hypertrophy [WO 99/65897; WO 00/38675; and Haq et al., J. Cell Biol. (2000) 151, 117]. The regulatory proteins include the glycogen synthase which is the rate limiting enzyme necessary for glycogen synthesis, the microtubule associated protein Tau, the gene transcription factor β-catenin, the translation initiation factor e1F2B, as well as ATP citrate lyase, axin, heat shock factor-1, c-Jun, c-Myc, c-Myb, CREB, and CEPBα. It can thus be seen that these enzymes represent interesting pharmaceutical targets since complications arising from the expression of these enzymes could be treated by kinase inhibitors specific for the GSK3. Small molecules that inhibit GSK3 have recently been reported [WO 1999/65897 and WO 2000/38675].
The approach of homology modeling has been undertaken in cases of other proteins. There are few homology built models reported in the prior art such as U.S. Pat. No. 6,355,678 which discloses the homology model for EGF-R Kinase domain and the docking procedure for the same which is useful to rationally design compounds predicted to bind favorably to EGF-R Kinase & inhibit EGF-R TK activity. United States Patent 2005/4766 discloses the methods for obtaining and sequencing the proteins that are product of the genes exhibiting polymorphisms and determining the 3-D protein structural variant models for the proteins exhibiting the polymorphism and building a database. PCT application 2000/52050 discloses the homology model of a glucocorticoid receptor. PCT application 2005/47526 relates to the homology model of PLK and to the use thereof in assays for the identification of small molecules PLK modulators. PCT application 2005/2147 discloses the homology model for CDK4 and use thereof in assays for the identification of small molecule CDK 4 modulators. PCT application 1998/56812 discloses the Homology model for ER β receptor. A number of compounds have been described in literature that show preferential binding to GSK3-Beta. However, to date there has been no significant progress in determining the structural features required for selectivity towards GSK3-α. A major reason for this lack of insight has been the paucity of structural information for this isoform. In contrast, there is a plethora of structural data for GSK3-Beta as it has been crystallised in many forms including in complex with natural ligands as inhibitor complexes with the monomeric and activated forms. There is therefore tremendous scope for further development of new modulators of GSK3-Alpha family.
Systems and methods for building a three-dimensional homology model/structure of Glycogen Synthase Kinase 3-Alpha (GSK-3 Alpha) or its variants are described. The 3-D structure thus constructed is defined by at least a portion of the structure co-ordinates shown in Table 1. Methods to screen modulators for modulating the activity of GSK-3 Alpha using the structure coordinates of the model are also described. To screen a modulator of GSK3-α or its variant, the modulator is identified by using at least a portion of the structure co-ordinates shown in Table 1. The identified modulator is contacted with GSK3-α or its variant and the modulating activity of the identified modulator is determined.
The present invention relates to a three-dimensional model/structure of Glycogen Synthase Kinase 3-Alpha (GSK3-Alpha) or a variant thereof for screening a modulator that modulates the activity of GSK3-Alpha.
The Glycogen Synthase Kinase 3-Alpha is herein after referred to as “GSK3-Alpha”. The GSK3-Alpha described is intended to include any polypeptide, which has the activity of the naturally occurring GSK3-Alpha and includes all vertebrate and mammalian forms. Such terms also include polypeptides that differ from naturally occurring forms of GSK3-Alpha by having amino acid deletions, substitutions, and additions, but which retain the activity of GSK3-Alpha.
The terms “model” and “structure” are used interchangeable hereinafter.
The amino acid sequence of GSK3-Alpha is shown in SEQ ID NO: 1. The amino acid sequence of GSK3-Beta is shown in SEQ ID NO: 2.
A fragment, a homologue, a variant, a mutant, or a derivative of GSK3-Alpha or a naturally occurring polypeptide or nucleotide sequences which differs from a wild-type or a native sequence is herein after referred to as “variant” of GSK3-Alpha. The term variant also encompasses amino acid sequence comprising at least 50%, more preferably at least 65%, more preferably at least 80%, most preferably at least 90% of the wild-type sequence.
The term “fragment” indicates that a polypeptide or nucleotide sequence comprises a fraction of a wild-type or a native sequence. It may comprise one or more large contiguous sections of sequence or a plurality of small sections.
The term “homologue” refers to an entity that shares a certain homology with amino acid sequences or nucleotide sequences of GSK3-Alpha. Here, the term “homology” can be equated with “identity”. In the present context, a homologous sequence is taken to include an amino acid sequence which may be at least 70, 80 or 90% identical, to the subject sequence. Although homology can also be considered in terms of similarity (i.e. amino acid residues having similar chemical properties/functions), it is preferred to express homology in terms of sequence identity. An homologous sequence is taken to include a nucleotide sequence which may be at least 70, 80 or 90% identical, to the subject sequence. Homology comparisons can be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs can calculate % homology between two or more sequences. % homology may be calculated over contiguous sequences, i.e. one sequence is aligned with the other sequence and each amino acid in one sequence is directly compared with the corresponding amino acid in the other sequence, one residue at a time. This is called an “ungapped” alignment. Typically, such ungapped alignments are performed only over a relatively short number of residues.
The term “derivative” or “derivatised” herein after includes chemical modification of an entity, such as candidate compound or a GSK3-Alpha modulator.
The compounds that are can affect the activity of GSK-3 Alpha by preventing, suppressing, inhibiting, alleviating, restorating, elevating, increasing or otherwise is herein after referred to as modulator(s). The GSK3-Alpha modulator may be an antagonist or an agonist of GSK3-Alpha.
The term “agonist” means any entity, which is capable of interacting (eg. binding) with GSK3-Alpha and which is capable of increasing a proportion of the GSK-3 Alpha that is in an active form, resulting in an increased biological response.
The term “antagonist” means any entity, which is capable of interacting (eg. binding) with GSK3-Alpha and which is capable of decreasing (eg. inhibiting) a proportion of the GSK3-Alpha that is in an active form, resulting in a decreased biological response.
The term “amino acid sequence” as herein after is synonymous with the term “polypeptide” and/or the term “protein”. In some instances, the term “amino acid sequence” is synonymous with the term “peptide”.
An embodiment of the present invention relates to said three-dimensional model structure of GSK3-α or a variant thereof for screening a modulator that modulates the activity of GSK3-Alpha by using at least a portion of structure co-ordinates as shown in Table 1.
Another embodiment of the present invention relates to said three-dimensional model structure of GSK3-Alpha or a variant thereof wherein GSK3-Alpha comprises the amino acid sequence set forth in SEQ ID NO: 1.
Another embodiment of the present invention relates to said three-dimensional model structure, wherein said structure co-ordinate comprises a ligand binding domain defined by amino acid residue selected from VAL 49, ASP 106, GLU 39, GLN 91, ASN 92, LYS 89 and combinations thereof.
Another embodiment of the present invention relates to a computer assisted system for producing a three-dimensional structure of GSK3-Alpha or a variant thereof which comprises: a computer readable data storage medium comprising a data storage material encoded with computer readable data, wherein said data comprises at least a portion of the structure co-ordinates of Table 1; a working memory for storing instructions for processing said computer readable data; a central processing unit coupled to said computer readable data storage medium and said working memory for processing said computer readable data into said three-dimensional structure; and a display coupled to said central processing unit for displaying said three-dimensional structure.
Another embodiment of the present invention relates to a method of generating said three dimensional structure of GSK3-Alpha or a variant thereof by employing said computer assisted system.
Another embodiment of the present invention relates to a method of designing a modulator that modulates the activity of GSK3-Alpha or a variant thereof, wherein said method comprises: providing at least a portion of the structure co-ordinates of Table 1; employing said structure co-ordinates to design a modulator of GSK3-Alpha or a variant thereof; contacting said modulator with GSK3-α or variant to determine the modulating activity of said modulator.
Another embodiment of the present invention relates to said method of designing a modulator, wherein said structure co-ordinate comprises a ligand binding domain defined by amino acid residue selected from VAL 49, ASP 106, GLU 39, GLN 91, ASN 92, LYS 89 and combinations thereof.
Another embodiment of the present invention relates to a method of screening a modulator for modulating the activity of GSK3-Alpha or a variant thereof comprising the following: providing at least a portion of the structure co-ordinates of Table 1; employing said structure co-ordinates to identify a modulator of GSK3-Alpha or variant; contacting said modulator with GSK3-α or variant, and determining the modulating activity of said modulator. GSK3-α comprises the amino acid sequence set forth in SEQ ID NO: 1.
Another embodiment of the present invention relates to a method of screening a modulator for modulating the activity of GSK3-Alpha or a variant thereof, wherein said structure co-ordinate comprises a ligand binding domain defined by amino acid residue selected from VAL 49, ASP 106, GLU 39, GLN 91, ASN 92, LYS 89 and combinations thereof.
Another embodiment of the present invention relates to a method of screening a modulator for modulating the activity of GSK3-Alpha or a variant thereof, wherein the modulator inhibits GSK3-α activity. The steps prior to screening of the modulator that modulates the activity of GSK3-Alpha includes designing, selecting and synthesizing of the modulators
Another embodiment of the present invention relates to a method of screening a modulator for modulating the activity of GSK3-Alpha or a variant thereof, wherein the modulator is associated with preventing and/or treating GSK3-Alpha related diseases selected from HIV/AIDS, angiogenesis, breast cancer, colon cancer, type II diabetes, glioblastoma, pancreatic cancer, xaelanoma, prostrate cancer, and schizophrenia.
Another embodiment of the present invention relates to the use of the three-dimensional structure for designing, selecting and synthesizing a modulator that modulates the activity of GSK3-Alpha.
Another embodiment of the present invention relates to the use of a computer assisted system for producing a three-dimensional structure of GSK3-Alpha or a variant thereof for screening a modulator that modulates the activity of GSK3-Alpha.
Another embodiment of the present invention relates to the use of a computer assisted system for producing a three-dimensional structure of GSK3-Alpha or a variant thereof for designing, selecting and synthesizing a modulator that modulates the activity of GSK3-Alpha.
Another embodiment of the present invention relates to a compound for modulating the activity of GSK3-Alpha or a variant thereof by using the method of screening of modulators of GSK3-Alpha or a variant thereof.
Another embodiment of the present invention relates to the use of the three-dimensional structure of GSK3-Alpha or a variant thereof to determine the structure of the crystalline form of a protein with significant amino acid sequence homology to any functional domain of GSK3 Alpha.
Another embodiment of the present invention relates to the use of the three-dimensional model structure of GSK3-Alpha or a variant thereof, wherein the structure of the crystalline form of a protein with significant amino acid sequence homology to any functional domain of GSK3 Alpha is determined using molecular replacement.
Further, the present invention relates to a three-dimensional model structure that is used in conjunction with automated flexible docking methods to generate a set of binding hypotheses for the ligand-GSK3-Alpha complexes of a number of modulators. Based on analysis of the sequence and residues lining the ATP binding pocket, the specific residues and side chains which differ in the GSK3-Alpha structure were determined and the interactions that could be exploited in the design of selective compounds were identified. This information could be used in two ways, both in the enhancement of GSK3-Alpha selectivity and in the engineering of modulators with greater binding specificity for GSK3-Alpha.
In one particularly preferred embodiment, the GSK3-Alpha modulator or candidate compound of the invention has an ionizable functional group capable of forming associations with the amino acid residues corresponding to GSK3-Alpha amino acid residues V41, D106, E39, Q91, N92, K89 of GSK3-Alpha. More preferably still, the ionizable functional group is a group which is positively charged at physiological pH. Even more preferably, the ionizable functional group is an NH group which is positively charged at physiological pH.
In one preferred embodiment, the GSK3-Alpha modulator or candidate compound of the invention comprises a functional group capable of forming a hydrogen bond with the amino acid residue corresponding to V41, D106, E39, Q91, N92, K89.
In one preferred embodiment, the GSK3-Alpha modulator or candidate compound of the invention comprises a group which is capable of steric interaction with the amino acid residues corresponding to V41, D106, E39, Q91, N92, K89 of the ATP binding site of GSK3-Beta.
In a preferred embodiment, the GSK3-Alpha modulator or candidate compound of the invention is an irreversible antagonist.
By way of example, the structure co-ordinates of Table 1 may be used to design compounds that bind to GSK3-Alpha and may alter the physical properties of the compounds (eg. solubility) or GSK3-Alpha itself. This invention may be used to design compounds that act as modulators, such as competitive inhibitors-of GSK3-Alpha by binding to all or a portion of the active site of GSK3-Alpha. Compounds may also be designed that act as non-competitive inhibitors of GSK3-Alpha. These non-competitive inhibitors may bind to all or a portion of GSK3-Alpha already bound to its substrate and may be more potent and specific than known GSK3-Alpha inhibitors that compete only for the GSK3-Alpha active site. Similarly, non-competitive inhibitors that bind to and inhibit GSK3-Alpha whether or not it is bound to another chemical entity may be designed using the structure co-ordinates of GSK3-Alpha described herein.
The present invention also allows the development of compounds or its isomers that can act as reaction intermediates in a chemical reaction of a substrate that binds to GSK3-Alpha. Thus, the time-dependent analysis of structural changes in GSK-3 Alpha during its interaction with other molecules may be performed. The reaction intermediates of GSK3-Alpha may also be deduced from the reaction product in co-complex with GSK3-Alpha. Such information is especially useful to design improved analogues of known GSK3-Alpha modulators or to design new GSK3-Alpha modulators based on the reaction intermediates of the GSK3-Alpha enzyme and GSK3-Alpha-modulator complex. This provides a new route for designing GSK3-Alpha modulators with high specificity and stability. Preferably, this provides a new route for designing GSK3-Alpha modulators with high specificity, high stability and low toxicity.
Small molecule databases for modulators or candidate compounds may be screened for chemical entities that can bind in whole, or; in part, to GSK3-Alpha. Thus, in a preferred embodiment, the GSK3-Alpha modulators form a library of compounds or a database. In this screening, the quality of fit of such entities or compounds to the binding site may be judged by various methods-such as shape complementarity or estimated interaction energy (Meng, E. C. et al., J. Comp. Chem., 13, pp. 505-524 (1992)).
The structure co-ordinates of Table 1, or portions thereof, may also be useful in solving the structure of crystal forms of GSK3-Alpha. They may also be used to solve the structure of variants of GSK3-Alpha or GSK3-Alpha complexes.
Preferably, the structure co-ordinates of Table 1 are used to solve the structure of the crystalline form of proteins having significant amino acid sequence homology to any functional domain of GSK3-Alpha.
The structure co-ordinates of the homology model of the present invention may also facilitate the identification of related proteins or enzymes analogous to GSK3-Alpha in function, structure or both, thereby further leading to novel therapeutic modes for treating or preventing GSK3-Alpha related diseases.
The design of compounds that bind to or modulate GSK3-Alpha according to the present invention involves consideration of two factors. Firstly, the compound must be capable of physically and structurally associating with GSK3-Alpha. Non-covalent molecular interactions important in the association of GSK3-Alpha with its substrate may include hydrogen bonding, van der Waals and hydrophobic interactions. Secondly, the compound must be able to assume a conformation that allows it to associate with GSK-3Alpha. Although certain portions of the compound may not directly participate in the association with GSK3-Alpha, those portions may still influence the overall conformation of the molecule. This will have a significant impact on the activity. Such conformational requirements include the overall three-dimensional structure and orientation of the chemical entity or compound in relation to all or a portion of a binding site of GSK-3Alpha, or the spacing between functional groups of a compound comprising several chemical entities that directly interact with GSK3-Alpha.
The potential modulating or binding effect of a chemical compound on GSK3-Alpha may be analyzed prior to its actual synthesis and testing by the use of computer modeling techniques. If the theoretical structure of the given compound suggests insufficient interaction and association with GSK3-Alpha, then synthesis and testing of the compound may be obviated. However, if computer modeling indicates a strong interaction, the molecule may be synthesized and tested for its ability to bind to GSK3-Alpha and modulate the activity of GSK3-Alpha (e.g. inhibit) using the fluorescent substrate assay of Thornberry et al. (2000) Methods.
The modulator of GSK3-Alpha may be an organic compound or a biochemical compound or any other chemical compound. The modulator of GSK3-Alpha may be a compound, which is obtainable from or produced by any suitable source, whether natural or artificial. The modulator of GSK3-Alpha may be an amino acid molecule, a polypeptide, or a chemical derivative thereof, or a combination thereof. The modulator of GSK3-Alpha may even be a polynucleotide molecule, which can be a sense or an anti-sense molecule. The modulator of GSK3-Alpha may even be an antibody.
The modulator of GSK3-Alpha may be designed or obtained from a library of compounds, which may comprise peptides, as well as other compounds, such as small organic molecules.
By way of example, the modulator of GSK3-Alpha may be a natural substance, a biological macromolecule, or an extract made from biological materials such as bacteria, fungi, or animal (particularly mammalian) cells or tissues, an organic or an inorganic molecule, a synthetic agent, a semi-synthetic agent, a structural or functional mimetic, a peptide, a peptidomimetic, a derivatised agent, a peptide cleaved from a whole protein, or a peptide synthesized synthetically (such as, by way of example, either using a peptide synthesizer or by recombinant techniques or combinations thereof, a recombinant agent, an antibody, a natural or a non-natural agent, a fusion protein or equivalent thereof and mutants, derivatives or combinations thereof).
In case the modulator of GSK3-Alpha is an organic compound, the organic compound will comprise two or more hydrocarbyl groups. Here, the term “hydrocarbyl group” means a group comprising at least C and H and may optionally comprise one or more other suitable substituents. Examples of such substituents may include halo-, alkoxy-, nitro-, an alkyl group, a cyclic group etc. In addition to the possibility of the substituents being a cyclic group, a combination of substituents may form a cyclic group. If the hydrocarbyl group comprises more than one C, then those carbons need not necessarily be linked to each other. For example, at least two of the carbons may be linked via a suitable element or group. Thus, the hydrocarbyl group may contain hetero atoms. Some of the examples of suitable hetero atoms are sulphur, nitrogen and oxygen. For some applications, preferably the modulator of GSK3-Alpha comprises at least one cyclic group. The cyclic group may be a polycyclic group, such as a non-fused polycyclic group. For some applications, the modulator of GSK3-Alpha comprises at least one of said cyclic groups linked to another hydrocarbyl group.
The modulator of GSK3-Alpha which modulates the activity of GSK3-Alpha can be agonist or antagonist in nature.
The term “agonist” means any entity, which is capable of interacting (eg. binding) with GSK3-Alpha and which is capable of increasing a proportion of the GSK-3 Alpha that is in an active form, resulting in an increased biological response.
The term “antagonist” means any entity, which is capable of interacting (eg. binding) with GSK3-Alpha and which is capable of decreasing (eg. inhibiting) a proportion of the GSK3-Alpha that is in an active form, resulting in a decreased biological response.
The modulator of GSK3-Alpha may contain halo groups, for example, fluoro, chloro, bromo or iodo groups, or one or more of alkyl, alkoxy, alkenyl, alkylene and alkenylene groups, each of which may be branched or unbranched.
The modulator of GSK3-Alpha is a structurally novel modulator of GSK3-Alpha, or an analogue of a known modulator of GSK3-Alpha.
The modulator of GSK3-Alpha may be a mimetic, or may be chemically modified. The modulator of GSK3-Alpha may be capable of displaying various therapeutic properties.
The modulator of GSK3-Alpha may be used in combination with one or more other pharmaceutically active agents. If combinations of active agents are administered, then they may be administered simultaneously, separately or sequentially.
The % homology calculation of a protein is a very simple and consistent method, but it fails to take into consideration factors such as in an otherwise identical pair, of sequences, one insertion or deletion that will cause the following amino acid residues to be put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed.
Consequently, most sequence comparison methods are designed to produce optimal alignments that take into consideration possible insertions and deletions without penalizing unduly the overall homology score. This is achieved by inserting “gaps” in the sequence alignment to try to maximise local homology.
However, these more complex methods assign “gap penalties” to each gap that occurs in the alignment so that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible-reflecting higher relatedness between the two compared sequences-will achieve a higher score than one with many gaps; “Affine gap costs” are typically used when relatively high cost is charged for the existence of a gap and a smaller penalty for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties will of course produce optimised alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is preferred to use the default values when using such software for sequence comparisons. For example, when using the Clustal W package the default gap penalty for amino acid sequences is 10 for a gap and 0.2 for each extension and End Gap as −1 with GONNET matrix.
Calculation of maximum % homology therefore firstly requires the production of an optimal alignment, taking into consideration gap penalties. A suitable computer program for carrying out such an alignment is the Clustal W (Ref: www.ebi.ac.uk/clustalw) package. Examples of other software than can perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 ibid-Chapter18), FASTA (Altschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). A new tool, called BLAST 2 Sequences is also available for comparing protein and nucleotide sequence (see FEMS Microbiol Lett 1999 174 (2): 247-50; FEMS Microbiol Lett 1999 177(1): 187-8).
Although the final % homology can be measured in terms of identity, the alignment process itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that assigns scores to each pair wise comparison based on chemical similarity or evolutionary distance. An example of such a matrix commonly used is the GONNET matrix—the default matrix for the Clustal W suite of programs. Clustal W programs generally use either the public default values or a custom symbol comparison table if supplied (see user manual for further details). For some applications, it is preferred to use the public default values for the Clustal W, or in the case of other software, the default matrix, such as GONNET.
Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.
The amino acid sequence of GSK3-Alpha may be isolated from a suitable source, or it may be made synthetically or it may be prepared by use of recombinant DNA techniques. The amino acid sequence of GSK3-Alpha is shown in SEQ ID NO: 1. The three dimensional structure of GSK3-Alpha or variant thereof is constructed using at least a portion of the structure co-ordinates provided in Table 1. The variant of GSK3-Alpha encompasses of a fragment, a homologue, a variant, a mutant, or a derivative of GSK3-Alpha or a naturally occurring polypeptide or nucleotide sequences which differ from a wild-type or a native sequence of GSK3-Alpha as shown in SEQ ID NO: 1. The term variant also encompasses amino acid sequence comprising at least 50%, more preferably at least 65%, more preferably at least 80%, most preferably at least 90% of the wild-type sequence.
The term “fragment” indicates that a polypeptide or nucleotide sequence comprises a fraction of a wild-type or a native sequence. It may comprise one or more large contiguous sections of sequence or a plurality of small sections.
The term “homologue” refers to an entity that shares a certain homology with amino acid sequences or nucleotide sequences of GSK3-Alpha. Here, the term “homology” can be equated with “identity”. In the present context, a homologous sequence is taken to include an amino acid sequence which may be at least 70, 80 or 90% identical, to the subject sequence. Although homology can also be considered in terms of similarity (i.e. amino acid residues having similar chemical properties/functions), it is preferred to express homology in terms of sequence identity.
An homologous sequence is taken to include a nucleotide sequence which may be at least 70, 80 or 90% identical, to the subject sequence.
The term “derivative” or “derivatised” herein after includes chemical modification of an entity, such as candidate compound or a GSK3-Alpha modulator.
The amino acid sequences may also have deletions, insertions or substitutions of amino acid residues, which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the secondary binding activity of the substance is retained. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine.
Conservative substitutions may be made, for example according to the Table 2 given below. Amino acids in the same block in the second column and preferably in the same line in the third column may be substituted for each other.
Homologous substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue, with an alternative residue) may occur i.e. like-for-like substitution such as basic for basic, acidic for acidic, polar for polar etc. Non-homologous substitution may also occur i.e. from one class of residue to another or alternatively involving the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), pyriylalanine, thienylalanine, naphthylalanine and phenylglycine.
Replacements may also be made by unnatural, amino acids include; alpha* and alpha-disubstituted* amino acids, N-alkyl amino acids*, lactic acid*, halide derivatives of natural amino acids such as trifluorotyrosine*, p-Cl-phenylalanine*, p-Br-phenylalanine*, p-I-phenylalanine*, L-allyl-glycine*, 3-alanine*, L-α-amino butyric acid*, L-γ-aminobutyric acid*, L-α-amino isobutyric acid*, L-s-amino caproic acid #, 7-amino heptanoic acid*, L-methionine sulfone#, L-norleucine*, L-norvaline*, p-nitro-L-phenylalanine*, L-hydroxyproline#, L-thioproline*, methyl derivatives of phenylalanine (Phe) such as 4-methyl-Phe*, pentamethyl-Phe*, L-Phe (4-amino)#, L-Tyr (methyl)*, L-Phe (4-isopropyl)*, L-Tic (1,2,3,4-tetrahydroisoquinoline-3-carboxyl acid)*, L-diaminopropionic acid* and L-Phe (4-benzyl)*. The notation * has been used for the purpose of the discussion above (relating to homologous or non-homologous substitution), to indicate the hydrophobic nature of the derivative. The notation # has been used to indicate the hydrophilic nature of the derivative and #* indicates amphipathic characteristics.
Illustrative of chemical modifications/derivatives would be replacement of hydrogen by a halo group, an alkyl group, an acyl group or an amino group. Variant amino acid sequences may include suitable spacer groups that may be inserted between any two amino acid residues of the sequence including alkyl groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or β-alanine residues. A further form of variation, involves the presence of one or more amino acid residues in peptoid form.
The term “peptoid form” is used to refer to variant amino acid residues wherein the a-carbon substituent group is on the residue's nitrogen atom rather than the a-carbon. Processes for preparing peptides in the peptoid form are known in the art, for example Simon R J et al., PNAS (1992)89 (20), 9367-9371 and Horwell D C, Trends Biotechnol. (1995) 13 (4), 132-134.
The homology model for GSK3-Alpha was generated using the program module Prime within the Schrodinger Discovery Suite (Schrodinger Inc, NY, USA). The flow chart for homology modeling approach is shown in
At block 105, the GSK3-Alpha sequence (SEQ ID No. 1) was given as the input sequence. At block 110, a sequence homology search was conducted to identify templates having sequence homology greater than 40%. A template having greatest sequence homology amongst the identified templates was selected for model construction. For the present model construction, the GSK-3 Beta structure was used as template. At block 115, the GSK3-Alpha was subjected to modeling based on the selected template (GSK-3 Beta) using the program module Prime. At block 120, sequence alignment of GSK3-Alpha with the selected template was performed using the PAM 120 multiple scoring matrix with a dimension block of 0.6 and a significance p value of 0.0001. The strategy involved using the selected template to define the structurally conserved regions (SCR's) from which the coordinates were subsequently transferred to build the structure at block 125. This was then followed by loop construction at block 135, where the non-SCR's were generated from the selected template and additionally by de-novo building and subsequent evaluation of the most realistic coordinates (in terms of energetic of the loop itself and the exclusion of loops leading to overlapping atoms). After loop building was completed for missing coordinates, the raw coordinates were refined using successive rounds of end repair and energy minimization (500 steps of OPLS 2005) at block 140. The model was then completed through using a further minimization. 1 ps molecular dynamics was used to fully explore the conformational space of the loop regions. At block 145, the final model structure was checked against databases of protein structures for bond length and dihedral angle violations using Ramachandran plot. If the Ramachandran plot was not found to be within acceptable ranges, then further refinement was carried out at block 150. Finally, at block 155, a model was built using the acceptable structure. The output of the final modeled structure was obtained in PDB format.
In case any sequence of homology greater than 40% is not identified from the sequence homology search at block 110, an alternate method of modeling by threading methodology is used at block 130 to build a model. Here the different parts of the proteins are taken from the fold libraries and the model is built. This model is also further refined by energy minimization and using Ramachandran plot as before.
For the final model structure for GSK3-Alpha built based on the selected template, the results indicated that bond length and dihedral angle violations as a whole were within acceptable limits with greater than 80% of residues having phi-psi plots with the allowed region in Ramachandran space. The final model structure that was built had the structure coordinates given in table 1. The final structure is shown in
The structure coordinates of the ternary complex of GSK3-Beta were obtained from the RCSB (accession code 1J1B). GSK-3Alpha inhibitors were then docked into the ATP site of the GSK-3 Beta or the GSK-3 Alpha model structure using the Glide program (Schrodinger Inc, NY, USA). The flow chart of the docking protocol is described in
The GSK3 Alpha was given as an input molecule at 305 and the binding site was defined as an 8A radius from the centre of a ligand binding mode. The calculation was performed using the OPLS force field in a two-step process using an implicitly derived solvation model and geometric H-bond restraints. The refinement phase involved conformational sampling using molecular dynamics calculated over 5 ps in 100 fs stages, where the temperature was at 300 K. The Ligand was prepared at 310 by the Ligprep module of the Schrodinger discovery suite where around five conformations were generated. The modeled protein is subjected to protein preparation under protein refinement method at 320. These conformations were subjected to XP docking protocol of Glide without any constraints at 315. The docked pose was then analyzed at 325.
This molecular docking routine incorporates a full molecular mechanics approach and allows for flexibility both in the ligand and in the side chains and backbone of the receptor (the positions of the alpha carbon atoms were fixed during the simulation). The specificity analysis can be evident in the docking studies that were carried out using Glide module of Schrodinger Discovery suite and are discussed below.
For example, the highly active compound 4-(7-Butyl-5H-pyrrolo[2,3-b]pyrazin-6-yl)-phenol was docked to analyze the interaction. It was identified that L35, V36.L37 residues form a hydrophobic surface in the structure of GSK3Alpha which is depicted in
The coordinate of the model given in PDB format enables the visualization of the structure in relation to its 3D space. The various pockets/cavities were analyzed and the active site was identified by superimposition of the GSK3-Alpha specific ligand to the structure. The interacting residues were identified and the neighboring residues were analyzed for specificity details. The model built for GSK3-Alpha was superimposed onto the GSK-3 Beta structure and the differences of the pockets were analyzed.
It was found that the alpha isoform active site consists of the residues Val 41, Asp 106, Glu 39, Gln 91, Asn 92 and Lys 89. For validating the results the GSK3-Alpha active molecule was docked to the Alpha structure. Glide docking protocol was used to dock the active molecules as shown in
The model in its 3D structure exhibits many pockets which can be identified visual inspection and the active site was identified as described earlier. The active site residues are identified and analyzed. Using this information various method can be employed to design modulators to inhibit or modulate the activity of GSK3-Alpha. For example, when the residues comprising the ATP cleft of both GSK3-Beta and GSK3-Alpha are compared, a key difference is found in the residue, E39. However, as this residue provides contacts through the backbone, they would not be expected to impact inhibitor binding significantly. E39 has a strong hydrogen bonding interaction with the phenolic hydroxyl group of the inhibitor in the predicted binding mode. E39 is a specific conserved residue in GSK3-Alpha and a strong hydrogen bonding with this residue would potentially help in enhancing the specificity towards GSK3-Alpha.
Further, the residue GLU39 in place of ASP in the same place in the Beta isoform is the key residue for specificity. Moreover, the distance differences between the residues can be used to design pharmacophore based hypothesis. Various known Alpha isoform specific modulators can be docked and analyzed and from the gathered information new scaffolds can be designed as modulators for GSK3-Alpha. Based on the designed scaffolds various focused combinatorial libraries can be designed and generated for synthesis and analyzed. In the process new hit compounds can be identified as potential modulators for GSK3-Alpha.
The GSK-3 Alpha model in its 3D structure exhibits many pockets which can be identified by visual inspection and the active site can be identified as described earlier. The active site residues are thus identified and analyzed. Using this information various method can be employed to design modulators to inhibit GSK3-Alpha. The residue GLU39 in place of ASP in the same place in its beta isoform is the key residue for specificity. Moreover, the distance differences between the residues can be used to design pharmacophore based hypothesis. Various known alpha isoform specific inhibitor can be docked and analyzed and from the gathered information new scaffolds can be designed as modulators for GSK3-Alpha. Based on the designed scaffolds various focused combinatorial libraries can be designed and generated for synthesis and analyzed. In the process new hit compounds can be identified as potential modulators for GSK3-Alpha.
Number | Date | Country | Kind |
---|---|---|---|
1145/CHE/2005 | Aug 2005 | IN | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IN2006/000290 | 8/14/2006 | WO | 00 | 6/10/2008 |