The present invention relates generally to structural studies of insulin binding to the insulin receptor (“IR”). In particular, the present invention relates to the crystal structure of insulin in complex with N- and C-terminal regions of the IR α-chain, and to methods of using the crystal structure and related structural information to identify, design and/or screen for agonists and antagonists that interact with and/or modulate the function of the IR.
The insulin receptor (“IR”), and its homologue the type 1 insulin-like growth factor receptor (“IGF-1R”), are closely related members of the tyrosine kinase receptor family, and are large, transmembrane, glycoprotein dimers consisting of several structural domains.
Insulin, a hormone produced by the pancreas, interacts with the IR, and via the IR signals into a complex network that ultimately controls the fundamental cellular processes of cell growth and differentiation, protein synthesis, as well as glucose uptake, synthesis and homeostasis (Cohen, 2006; Taniguchi et al., 2006). Insulin signalling also plays a role in neuronal development (Chiu and Cline, 2010). Insulin's role in the treatment of diabetes has underpinned decades of research into the biochemical and biophysical properties of the hormone (Ward and Lawrence, 2011). More recently, aberrant insulin receptor signalling has become implicated in cancer, in part due to its overlap with the closely-related insulin-like growth factor (“IGF”) signalling system and to an increased number of insulin receptors on cancer cells providing a mechanism for increased glucose uptake (Pollak, 2012). Brain insulin resistance is increasingly being seen as a potential causal factor in Alzheimer's disease (Talbot et al., 2012).
Despite these fundamental biological roles of insulin and the role of insulin signalling system in major diseases, there is to date no three-dimensional description of the way in which insulin interacts with the IR.
IR exists as two splice variant isoforms, IR-A and IR-B, which respectively lack or contain the 12 amino acids coded by exon 11. The longer variant, IR-B, is the isoform responsible for signalling metabolic responses. In contrast, IR-A signals predominantly mitogenic responses and is the preferentially expressed isoform in several cancers (Denley et al., 2003). Also, IR-A is capable of binding insulin-like growth factor 2 (“IGF-II”) with high affinity (Frasca et al., 1999; Denley et al., 2004).
The sequence of IR is highly homologous to the sequence of IGF-1R, indicating that their three-dimensional structures are highly likely to be similar. The mature human IR and IGF-1R molecules are each homodimers comprising two α-chains and two β-chains, the α- and β-chains arising from the post-translational cleavage at the furin cleavage site at residues 720-723 (IR-A numbering with the mature N-terminal residue numbered 1) or 707-710 (IGF-1R).
IR is a heavily-glycosylated, disulphide-linked (αβ)2 homodimer of ˜380 kDa molecular weight. Each IR or IGF-1R monomer comprises, from its N-terminus, a leucine-rich repeat domain (“L1”), a cysteine-rich region (“CR”), a leucine-rich repeat domain (“L2”), three fibronectin Type III domains (“FnIII-1”; “FnIII-2”; and “FnIII-3”), trans- and juxtamembrane segments (“TM” and “JM”), a tyrosine kinase domain (“TK”) and a C-terminal tail (McKern et al., 2006). The FnIII-2 domain contains, within its CC′ loop, an insert domain (“ID”) of ˜120 mostly-disordered residues within which lies the α/β cleavage site. Near the C-terminal region of the IR α-chain lies an amphipathic helical segment (“αCT”) that in apo-IR spans residues 693-710 and packs against the central β-sheet (L1-β2) of the L1 domain of the alternate IR monomer (Smith et al., 2010). The domain organisation of IR and the structure of apo-IR are presented in
Intracellularly, the TK domain is flanked by two regulatory regions that contain the phosphotyrosine binding sites for signalling molecules. Each α-chain is linked to its β-chain via a disulphide bond between residues Cys647 and Cys860 (Sparrow et al., 1997) in the case of IR and/or Cys633-Cys849 in the case of IGF-1R. The α-chains of both IR and IGF-1R are cross-linked by disulphide bonds in two places. The first is at Cys524 (IR) or Cys514 (IGF-1R) in the FnIII-1 domain, cross-linked to its counterpart in the opposite monomer, and the second involves one or more of the residues Cys682, Cys683 and Cys685 (1R) or Cys669, Cys670 and Cys672 (IGF-1R) in the insert region of each FnIII-2 domain, cross-linked to their counterparts in the opposite monomer (Sparrow et al., 1997).
The domains of IR and IGF-1R exhibit high (47-67%) amino acid sequence identity indicative of high conservation of three-dimensional structure. The crystal structure of the first three domains of IGF-1R (L1-CR-L2) has been determined (Garrett et al., 1998) and revealed that the L domains consist of a single-stranded right-handed β-helix (a helical arrangement of β-strands), while the cysteine-rich region is composed of eight related disulphide-bonded modules. The crystal structure of the first three domains of IR (L1-CR-L2) has also been determined (WO 07/147,213, Lou et al., 2006) and as anticipated is closely similar to that of its IGF-1R counterpart. Other evidence for the close structural similarity of IR and IGF-1R arises from: (i) electron microscopic analyses (Tulloch et al., 1999), (ii) the fact that hybrid receptors (heterodimers of one IR monomer disulphide-bonded to one of IGF-1R monomer) exist naturally and are commonly found in tissues expressing both receptors (Bailyes et al., 1997), and (iii) the fact that receptor chimeras can be constructed which have whole domains or smaller segments of polypeptide from one receptor replaced by the corresponding domain or sequence from the other (reviewed in Adams et al., 2000). Further to (ii) and (iii) above, physiologically relevant IR/IGF-1R receptor chimeras have been shown to form on cell surfaces (Belfiore et al., 2009).
The current model for insulin binding proposes that, in the basal state, the IR homodimer contains two identical pairs of binding sites (referred to as “Site 1” and “Site 2”) on each monomer (De Meyts and Whittaker, 2002; Schäffer, 1994; De Meyts, 1994; De Meyts, 2004; Kiselyov et al., 2009). Binding of insulin to a low affinity site (“Site 1”) on one α-subunit is followed by a second binding event between the bound insulin and a different region of the second IR α-subunit (“Site 2”). This ligand-mediated bridging between the two α-subunits generates the high affinity state that results in signal transduction. In contrast, soluble IR ectodomain, which is not tethered at its C-terminus, cannot generate the high affinity receptor-ligand complex. The soluble IR ectodomain can bind two molecules of insulin simultaneously at its two Site 1s, but only with nanomolar affinity (Adams et al., 2000).
The model for IGF-I or IGF-II binding to IGF-1R is the same as that just described for insulin binding to IR and involves IGF-I (or IGF-II) binding to an initial low affinity site (“Site 1”) and subsequent cross-linking to a second site (“Site 2”) on the opposite monomer to form the high affinity state, as described for the IR. However, the values of the kinetic parameters describing these events are somewhat different in the two systems (Surinya et al., 2008; Kiselyov et al., 2009).
In addition to the above models of ligand binding, IGF (IGF-I and IGF-II) and insulin have been shown to bind to IR and the IGF-1R, respectively, albeit at a lower affinity than their cognate receptors (Kristensen et al., 1999; Nakae et al., 2001; Kjeldsen et al., 1991). However, as previously mentioned, the IR-A isoform variant of the IR is a high-affinity receptor for IGF-II (Frasca et al., 1999).
While similar in structure, IGF-1R and IR serve different physiological functions. IGF-1R is expressed in almost all normal adult tissue except for liver, which is itself the major site of IGF-I production (Buttel et al., 1999). A variety of signalling pathways are activated following binding of IGF-I or IGF-II to IGF-1R, including Src and Ras, as well as downstream pathways, such as the MAP kinase cascade and the P13K/AKT axis (Chow et al., 1998). IR is primarily involved in metabolic functions whereas IGF-1R mediates growth and differentiation. Consistent with this, ablation of IGF-I (i.e., in IGF-I knock-out mice) results in embryonic growth deficiency, impaired postnatal growth, and infertility. In addition, IGF-1R knock-out mice were only 45% of normal size and died of respiratory failure at birth (Liu et al., 1993). However, both insulin and IGF-I can induce both mitogenic and metabolic effects.
The insulin monomer comprises a 30-residue B-chain and 21-residue A-chain, together containing three α-helices, A1-A8, A12-A18 and B9-B19, connected by three disulphide bonds (Adams et al., 1969). In its pancreatic (storage) form insulin forms a hexamer (a trimer of dimers). The structure of the insulin monomer is presented in
Two distinct surfaces of insulin are understood to interact with IR to initiate signalling (Schäffer, 1994). The first (“classical”) surface consists predominantly of residues involved in hormone dimerization, and the second mostly of residues involved in hexamerization (De Meyts, 2004).
The tandem L1-β2/αCT element from the IR is understood to interact with the classical binding surface of insulin and forms part of Site 1 (Ward and Lawrence, 2009). The second binding surface of insulin is understood to interact with residues at the junction of the FnIII-1 and FnIII-2 domains of the opposite α-chain to that contributing the L1 domain to Site 1 (McKern et al., 2006; Whittaker et al., 2008), referred to as Site 2 (Ward and Lawrence, 2009). IR site 1 is the primary binding hormone site; its dissociation constant for insulin is estimated to be ˜6.4 nM vs ˜400 nM for Site 2 (Kiselyov et al., 2009).
Insulin and the IR ectodomain are proposed to undergo a conformational change upon hormone binding. A key proposed change in insulin is the detachment of the C-terminal B-chain residues B20-B30 (Hua et al., 1991) from the helical core. In the absence of structural characterization of the hormone-receptor complex, however, details of this conformational change have remained speculative. The proposed change in insulin is presented in
Insulin remains the key therapeutic for the treatment of both Type 1 and Type 2 diabetes. Its use as a therapeutic has, however, two major complications. The first concerns the method of delivery and the second concerns its profile of action.
In terms of the method of delivery, insulin is not orally bioavailable, and its delivery has to thus be via other means (injection, pumps, pens, nasal sprays, etc.). Formulation of insulin appropriate to each of these means of delivery remains an area of active research.
In terms of profile of action, none of the current modes of delivery can currently deliver insulin in such a way that the respective prandial, post-prandial and sleep concentrations of insulin can be accurately mimicked through a 24-hour cycle.
The above concerns have led to a new class of therapeutics, namely insulin analogues (or “designer” insulins). Examples of designer insulins include:
However, the above designer insulins have been developed without the benefit of knowledge of the three-dimensional atomic details of the insulin/IR interaction. The absence of structural data on the insulin/IR complex has hindered a detailed understanding of ligand/IR interactions.
As mentioned above, structural information to date has been available only for insulin and parts of the IR ectodomain. Thus, there has been no structural information detailing the interactions between insulin and the IR, particularly the conformational changes posited to occur on insulin binding to the IR.
Accordingly, there is a need to determine the structure of insulin in complex with IR in order to better understand the nature of the insulin/IR interaction and its role in IR signalling in order to rationally design or redesign agonists and antagonists, particularly alternative insulin analogues, useful inter alia in the treatment of aberrant IR signalling diseases and/or disorders.
The present invention is predicated in part by the determination of the crystal structure of insulin in complex with the N- and C-terminal regions of the IR α-chain, which allows visualisation, for the first time, of the long posited conformational change in insulin upon IR binding. The structure also reveals, for the first time, insulin bound in a stabilised form on the L1-β2 surface of the IR by the αCT recognition helix and shows the accompanied remodelling of the Site 1 L1-β2/αCT tandem element of the IR upon insulin binding (Smith et al., 2010). The atomic coordinates for the structure are presented in Appendix I.
The crystal structure reveals much needed structural information about insulin residues PheB24 and PheB25 during receptor binding. The critical importance of these insulin residues B24 and B25 for receptor binding is well known (Mayer et al., 2007). Clinical mutations at these sites include PheB24Ser (insulin Los Angeles) and PheB25Leu (insulin Chicago), these having 16% and 1-5% of the activity of normal insulin respectively (Shoelson et al., 1983). Studies have shown that PheB24Gly leads to reduced receptor affinity, while D-Ala and D-Phe substitutions at 1324 results in a 150% and 140-180% increase in affinity relative to native insulin. Modifications to the phenyl ring of B24 mostly results in deleterious effects, while B25 accommodates sizable changes in side-chain structure, but is highly demanding in that the phenyl ring must be positioned at the β-carbon (Quan et al., 2006). A TyrB25 analogue demonstrates full in vitro activity. The significance of the size and orientation of the aromatic side-chain is revealed in the binding affinity for the 1- and 2-naphthylalanine analogues (24% and 50%, respectively). In Contrast, non-aromatic substitution at B25 invariably produces low potency. SerB25 and AlaB25 analogues exhibit respective binding potencies of ˜1% and 7% of native insulin. Distance of the phenyl ring from the backbone appears critical to activity, as the homo-phenylalanine and phenylglycine derivatives are as impotent as the nonaromatic analogues.
Thus, derivatization at insulin residues B24 and B25 appears to modulate the potency of insulin. Accordingly, the present inventors consider that the determination of the crystal structure of human insulin in complex with Site 1 of the human IR α-chain, including detailed structural information regarding the interaction between insulin residues B24 and B25 and IR, will assist in the rational design of insulin B24 and B25 analogues. Similarly, it is considered that the structural information provided concerning the nature of the interactions between insulin B24 and B25 residues and IR, will assist in the use of molecular modelling and related techniques to design insulin analogues that have altered binding affinity and or pharmacological profiles due to their alterations at insulin B24 and/or insulin B25.
With the foregoing in view, one aspect of the present invention provides an insulin/IR α-chain crystalline complex comprising the N- and C-terminal regions of the IR α-chain having the atomic coordinates as set forth in Appendix I. Generally, the crystalline complex comprising insulin and the N- and C-terminal regions of the IR α-chain or their derivatives or components thereof are in essentially pure native form. Derivatives and homologues, higher order complexes and soluble forms of the crystalline complex or its components are also contemplated by the present invention.
Another aspect of the present invention is directed to a data set of atomic coordinates defining the Site 1 interaction between IR including the N- and C-terminal regions of the IR α-chain and insulin.
Yet another aspect of the present invention is directed to conformational mimetics of the Site 1 binding surface of insulin useful as antagonists or agonists of insulin signalling via the IR. The mimetics may interfere or interact directly with Site 1 or may interact elsewhere causing conformational changes to Site 1 or may influence the form or level of the insulin/IR complex.
In particular, a Site 1 antagonist is proposed to, for example, reduce aberrant IR signalling, which as previously mentioned has become implicated in cancer, to reduce the increased glucose uptake attributable to the increased number of IRs on cancer cells.
In particular, a Site 1 antagonist is proposed, for example, in the treatment of cancer by reducing aberrant IR signalling attributable to increased glucose uptake in cancer cells due to an increased number of IRs located on cancer cells.
Likewise, a Site 1 agonist is proposed, for example, in the treatment of both Type 1 and Type 2 diabetes.
More particularly, a Site 1 mimetic molecule is proposed that binds to the low affinity insulin binding site of IR with a high degree of specificity and imparts a selective agonistic or antagonistic activity.
With the foregoing in view, one aspect of the present invention provides an insulin/IR α-chain crystalline complex comprising the N- and C-terminal regions of the IR α-chain having the atomic coordinates as set forth in Appendix I. Generally, the crystalline complex comprising insulin and the N- and C-terminal regions of the IR α-chain or their derivatives or components thereof are in essentially pure native form.
Accordingly, the present invention in one form resides in an insulin/IR complex in crystalline form or a derivative or homologue, higher order complex or soluble form thereof.
In one embodiment the complex comprises the N- and C-terminal regions of the IR α-chain in complex with insulin.
In a preferred embodiment, the complex comprises a leucine-rich repeat domain (“L1”) and an adjacent cysteine-rich region (“CR”) from the N-terminus of the IR α-chain and a portion of an amphipathic helical segment (“αCT”) from the C-terminus of the IR α-chain in complex with insulin.
In a more preferred embodiment, the complex comprises portions of the central fi-shed of the L1 domain and the αCT of the IR α-chain in complex with a portion of insulin.
In a most preferred embodiment, the complex comprises the components of the structure defined by the atomic coordinates shown in Appendix I or a subset thereof.
The present invention in another form provides a method of identifying, designing or screening for a compound that can potentially interact with IR, including performing structure-based identification, design, or screening of a compound based on the compound's interactions with an IR structure defined by the atomic coordinates of Appendix I or a subset thereof.
In another form, the present invention provides a method of identifying, designing or screening for a compound that can potentially mimic insulin interacting with IR, including performing structure-based identification, design, or screening of a compound based on (i) the compound's interaction with an IR structure and/or (ii) the compound's similarity with an insulin structure in complex with an IR structure defined by the atomic coordinates of Appendix 1 or a subset thereof.
In one embodiment, the method includes identifying, designing or screening for a compound which interacts with the three-dimensional structure of the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in Appendix 1, wherein interaction of the compound with the structure is favoured energetically.
In another embodiment, the method includes identifying, designing or screening for a compound based upon the three-dimensional structure of insulin in complex with components of the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in Appendix 1, wherein interaction of the compound with the structure is favoured energetically.
In another embodiment, the method further includes synthesising or obtaining an identified or designed candidate compound and determining the ability of the candidate compound to interact with IR and/or mimic insulin in complex with IR.
The present invention in another form provides a method for identifying an agonist or an antagonist compound comprising an entity selected from the group consisting of an antibody, a peptide, a non-peptide molecule and a chemical compound, wherein said compound is capable of enhancing, eliciting or blocking biological activity resulting from an interaction with insulin and/or the IR, wherein said process includes:
introducing into a suitable computer program parameters defining an interacting surface based on the conformation of insulin and/or IR corresponding to the atomic coordinates of Appendix I or a subset thereof, wherein said program displays a three-dimensional model of the interacting surface;
creating a three-dimensional structure of a test compound in said computer program;
displaying a superimposing model of said test compound on the three-dimensional model of the interacting surface;
assessing whether said test compound model fits spatially and optionally energetically into a binding site;
optionally incorporating said test compound in a biological activity assay; and
optionally determining whether said test compound inhibits or enhances the biological activity of insulin or IR signalling or signalling by a derivative of insulin or IR.
In one embodiment, the method includes identifying an agonist or an antagonist compound capable of interacting with the nanomolar affinity insulin binding site (“Site 1”) of the IR as defined by the atomic coordinates shown in Appendix I.
In a further embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define one or more regions of insulin in complex with N- and C-terminal regions of the IR α-chain.
In a preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define insulin in complex with a L1 domain and an adjacent CR domain from the N-terminus of the IR α-chain and a portion of the αCT from the C-terminus of the IR α-chain.
In another preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define portions of the molecular surface of the central n-sheet of the L1 domain and the αCT of the IR α-chain, which interact with a portion of the molecular surface of insulin.
In a particularly preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define the molecular surface of a B-helix of insulin interacting with the molecular surface of the C-terminal edge of L1-β2 of the IR α-chain, the atomic coordinates or a subset thereof defining the B-helix of insulin as lying parallel with and adjacent to a helical portion of the αCT of the IR α-chain.
In another particularly preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define the αCT of the IR α-chain occupying a volume that would otherwise be occupied by the insulin B-chain if insulin was not bound to components of Site 1 of IR. In a more particularly preferred embodiment, a volume that would otherwise be occupied by the C-terminal portion of the B-chain of unbound insulin. In a most particularly preferred embodiment, a volume that would otherwise be occupied by amino acids 26 to 30 from the B-chain of unbound insulin.
In another embodiment, the atomic coordinates define one or more amino acids selected from 1 to 310 and 704 to 719 of the IR α-chain, 1 to 30 from the insulin B-chain and 1 to 21 from the insulin A-chain.
In another preferred embodiment, the one or more amino acids selected from 1 to 310 and 704 to 719 of the IR α-chain include one or more amino acids selected from the group consisting of Asp12, Arg14, Leu36, Leu37, Phe39, Lys40, Leu62, Phe64, Arg65, Phe88, Phe89, Tyr91, Val94, Phe96, Arg118, Glu120, His144, Phe705, Tyr708, Leu709, His710, Asn711, Val712, Val713, Phe714 and Val715.
In another preferred embodiment, the one or more amino acids selected from 1 to 30 from the insulin B-chain include one or more amino acids selected from the group consisting of Gly8, Ser9, Leu11, Val12, Leu15, Tyr16, Phe24, Phe25 and Tyr26.
In another preferred embodiment, the one or more amino acids selected from 1 to 21 from the insulin A-chain include one or more amino acids selected from the group consisting of Gly1, Ile2, Val3, Glu4 and Tyr19.
In another embodiment, IGF-I or IGF-II may be chemically modified to modify the binding affinity of IGF-I or IGF-II to bind to IR as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
In another form, the present invention includes use of the atomic coordinates or a subset thereof as shown in Appendix I at least representing:
in identifying, designing or screening for a compound that can potentially mimic insulin interacting with IR, including performing structure-based identification, design, or screening of a compound based on (a) the compound's interactions with an IR structure and/or (b) the compound's similarity with an insulin structure in complex with an IR defined by the atomic coordinates or a subset thereof.
In another form, the present invention includes use of the atomic coordinates or a subset thereof as shown in Appendix I at least representing:
in identifying, designing or screening for a compound that can potentially interact with IR, including performing structure-based identification, design, or screening of a compound based on the compound's interactions with an IR structure defined by the atomic coordinates or a subset thereof.
In another form, the present invention includes a set of atomic coordinates as shown in Appendix I, or a subset of thereof, at least representing:
In another form, the present invention includes an agonist or antagonist of a site comprising one or more amino acids selected from 1 to 310 and 704 to 719 of the IR α-chain including one or more amino acids selected from the group consisting of Asp12, Arg14, Leu36, Leu37, Phe39, Lys40, Leu62, Phe64, Arg65, Phe88, Phe89, Tyr91, Val94, Phe96, Arg118, Glu120, His144, Phe705, Tyr708, Leu709, His710, Asn711, Val712, Val713, Phe714 and Val715.
In another form, the present invention includes an agonist or antagonist that can potentially mimic insulin interacting with IR, said agonist or antagonist comprising one or more amino acids selected from 1 to 30 from the insulin B-chain including one or more amino acids selected from the group consisting of Gly8, Ser9, Leu11, Val12, Leu15, Tyr16, Phe24, Phe25 and Tyr26.
In another form, the present invention includes an agonist or antagonist that can potentially mimic insulin interacting with IR, said agonist or antagonist comprising one or more amino acids selected from the insulin A-chain including one or more amino acids selected from the group consisting of Gly1, Ile2, Val3, Glu4 and Tyr19.
The present invention has enabled the identification of molecular surface interactions between Site 1 on the IR α-chain and the Site 1 binding surface on insulin. Additionally, the present invention has enabled, for the first time, determination of conformational changes in insulin upon binding to the IR α-chain. In particular, the present invention has enabled the determination of key amino acid residues involved in the binding of insulin to the Site 1 binding surface on the IR α-chain. It will be evident to the skilled person that these findings can be transposed onto related receptors such as the IGF-1R, to which IGF-1 or IGF-II are modelled to bind in a similar fashion as insulin to IR.
The present invention is therefore also useful in the identification, screening and/or design of candidate compounds that bind to Site 1 of IGF-1R.
In one embodiment, candidate compounds for interacting with IR and/or IGF-1R may be chemically modified as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
In another embodiment, the chemical modification is designed to either:
In another embodiment, insulin may be chemically modified to modify the binding affinity of insulin to bind to IGF-1R as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
The present invention is also useful in the identification, screening and/or design of candidate compounds that bind to IR/IGF-1R receptor chimeras. In particular, the present invention is useful in the identification, screening and/or design of candidate compounds that bind to the Site 1 binding site of an IR/IGF-1R receptor chimera. The Site 1 binding site of the IR/IGF-1R receptor chimera may include the L1 domain from one monomer of one receptor type and the αCT peptide from another monomer of the other receptor type.
In one embodiment, candidate compounds for interacting with an IR/IGF-1R receptor or part thereof may be chemically modified as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
In a particular embodiment, candidate compounds for interacting with a Site 1 binding site of an IR/IGF-1R receptor or part thereof may be chemically modified as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
In another embodiment, a ligand selected from the group consisting of insulin, IGF-I and IGF-II may be chemically modified to modify the binding affinity for the ligand to bind to an IR/IGF-1R receptor or part thereof as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
Candidate compounds and compounds identified or designed using a method or process of the present invention may be any suitable compound, including naturally occurring compounds, de novo designed compounds, library generated compounds (chemically or recombinantly generated), mimetics etc., and may include organic compounds, new chemical entities, antibodies, binding proteins other than antibody-based molecules (non-immunoglobulin proteins) including, for example, protein scaffolds such as lipocalins, designed ankyrin repeat proteins (DARPins, Stumpp et al., 2007) and protein A domains (reviewed in Binz et al, 2005), avimers (Silverman et al., 2005), and other new biological entities such as nucleic acid aptamers (reviewed in Ulrich, 2006).
The present invention is also useful for improving the properties of known ligands for the Site 1 binding site of IR and/or IGF-1R. For example, existing IR or IGF-1R ligands may be screened against the 3D structure of the insulin binding site of IR or a region of the insulin binding site of IR as defined by the atomic coordinates of Appendix I or a subset thereof, and an assessment made of the potential to energetically interact with the insulin binding site of IR.
Similarly, existing IR or IGF-1R Site 1 ligands can be screened against the 3D structure of the binding surface of insulin bound to IR as defined by the atomic coordinates of Appendix I or a subset thereof, and an assessment made of the potential to energetically interact with the insulin binding site of IR.
Thus, the present invention also provides a method of redesigning a compound which is known to bind to IR and/or IGF-1R comprising performing structure-based evaluation of the compound based on the compound's interactions with an IR structure defined by the atomic coordinates of Appendix I or a subset thereof and redesigning or chemically modifying the compound as a result of the evaluation.
In another form, the present invention provides a method of redesigning a compound which is known to bind to IR and/or IGF-IR comprising performing structure-based evaluation of the compound's similarity with an insulin structure in complex with an IR defined by the atomic coordinates of Appendix I or, a subset thereof and redesigning or chemically modifying the compound as a result of the evaluation.
In one embodiment, the compound which is known to bind to IR and/or IGF-1R is redesigned or chemically modified to (i) improve affinity for binding to IR, and/or (ii) lower affinity for binding to IGF-1R.
In another embodiment, the compound which is known to bind to IR and/or IGF-1R is redesigned or chemically modified to (i) improve affinity for binding to IGF-1R, and/or (ii) lower affinity for binding to IR.
In a further embodiment the compound is redesigned or modified so as to lower the affinity to IR or IGF-1R by virtue of the structural information provided by the structure of insulin in complex with the IR α-chain, as defined by the atomic coordinates shown in Appendix I, or a subset thereof.
The present invention also provides a computer system for identifying one or more compounds that can potentially interact with IR and/or IGF-1R, the system containing data representing the structure of: (i) the Site 1 binding site of IR, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (ii) the Site 1 binding site on insulin, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (iii) a combination thereof, the structure being defined by the atomic coordinates shown in Appendix I.
In another aspect, the present invention provides a computer-readable medium having recorded data thereon representing a model and/or the atomic coordinates as shown in Appendix I, or a subset of atomic coordinates thereof, the model and/or the atomic coordinates at least representing:
Also provided are a set of atomic coordinates as shown in Appendix I, or a subset of thereof, at least representing:
The three-dimensional structure of the N- and/or C-terminal regions of the IR α-chain and/or insulin and/or the one or more regions of insulin in complex with the N- and C-terminal regions of the IR α-chain may be used to develop models useful for drug design and/or in silico screening of candidate compounds that interact with and/or modulate IR. Other physicochemical characteristics may also be used in developing the model, e.g. bonding, electrostatics, etc.
Generally, the term “in silico” refers to the creation in a computer memory, i.e., on a silicon or other like chip. Stated otherwise “in silico” means “virtual”. When used herein the term “in silico” is intended to refer to screening methods based on the use of computer models rather than in vitro or in vivo experiments.
Accordingly, the present invention also provides a computer-assisted method of identifying a compound that potentially interacts with IR and/or IGF-1R, which method comprises fitting the structure of: (i) the Site 1 binding site of IR, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (ii) portions of the N- and C-terminal regions of the IR α-chain, which are in complex with insulin, the structure being defined by a subset of the atomic coordinates shown in Appendix I, to the structure of a candidate compound.
Also provided by the present invention is a computer-assisted method for identifying a molecule able to interact with IR and/or IGF-1R using a programmed computer comprising a processor, which method comprises the steps of: (a) generating, using computer methods, a set of atomic coordinates of a structure that possesses energetically favourable interactions with the atomic coordinates of (i) the Site 1 binding site of IR, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (ii) portions of the N- and C-terminal regions of the IR α-chain, which are in complex with insulin, the structure being defined by a subset of the atomic coordinates shown in Appendix I, which coordinates are entered into the computer thereby generating a criteria data set; (b) comparing, using the processor, the criteria data set to a computer database of chemical structures; (c) selecting from the database, using computer methods, chemical structures which are complementary or similar to a region of the criteria data set; and optionally, (d) outputting, to an output device, the selected chemical structures which are complementary to or similar to a region of the criteria data set.
The present invention further provides a computer-assisted method for identifying potential mimetics of IR, insulin and/or IGF-1R using a programmed computer comprising a processor, the method comprising the steps of: (a) generating a criteria data set from a set of atomic coordinates of: (i) the Site 1 binding site of IR, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (ii) portions of the N- and C-terminal regions of the IR α-chain, which are in complex with insulin, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (iii) insulin, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (iv) the Site 1 binding site of insulin or portions thereof, the structure being defined by a subset of the atomic coordinates shown in Appendix I, which coordinates are entered into the computer; (b) (i) comparing, using the processor, the criteria data set to a computer database of chemical structures stored in a computer data storage system and selecting from the database, using computer methods, chemical structures having a region that is structurally similar to the criteria data set; or (ii) constructing, using computer methods, a model of a chemical structure having a region that is structurally similar to the criteria data set; and, optionally, (c) outputting to an output device: (i) the selected chemical structures from step (b)(i) having a region similar to the criteria data set; or (ii) the constructed model from step (b)(ii).
The present invention further provides a method for evaluating the ability of a compound to interact with IR and/or IGF-1R, the method comprising the steps of: (a) employing computational means to perform: (i) a fitting operation between the compound and the binding surface of a computer model of the Site 1 binding site for insulin on IR; and/or (ii) a superimposing operation between the compound and insulin, the Site 1 binding site of insulin, or a portion thereof, using atomic coordinates wherein the root mean square deviation between the atomic coordinates and a subset of atomic coordinates of Appendix I or a subset of atomic coordinates of one or more thereof at least representing the N-terminal region of the IR α-chain, the C-terminal region of IR α-chain, insulin, the Site 1 binding site of insulin, or a portion of the Site 1 binding site of insulin, is not more than 1.5 Å; and (b) analysing the results of the fitting operation and/or superimposing operation to quantify the association between the compound and the binding surface model.
The present invention also provides a method of using molecular replacement to obtain structural information about a molecule or a molecular complex of an unknown structure, comprising the steps of (i) generating an X-ray diffraction pattern of the crystallized molecule or molecular complex; and (ii) applying the atomic coordinates of Appendix I, or a subset of atomic coordinates thereof at least representing the N-terminal region of the IR α-chain, the C-terminal region of IR α-chain, insulin, mimetics thereof, derivatives thereof, or portions thereof, to the X-ray diffraction pattern to generate a three-dimensional electron density map of at least a region of the molecule or molecular complex whose structure, is unknown.
The present invention provides compounds that bind to IR and/or IGF-1R designed, redesigned or modified using the methods or processes of the present invention. Preferably, such compounds have an affinity (Kd) for IR and/or IGF-1R of less than 10−5 M. In a particularly preferred embodiment, the compounds binds to the Site 1 binding site of IR and/or to the Site 1 binding site of IGF-1R.
In view of the high sequence homology between the ligands insulin, IGF-I and IGF-II (Adams et al. 2000), the present invention also encompasses compounds that bind to insulin, IGF-I or IGF-II, said compounds being designed, redesigned or modified using the methods or processes of the present invention.
The present invention also provides a composition comprising a compound of the invention, a peptide or mimetic of the invention, and optionally an acceptable carrier or diluent, more preferably a pharmaceutically acceptable carrier or diluent.
The present invention further provides a method for preventing or treating a disease associated with aberrant IR and/or IGF-1R functioning and/or signalling, the method comprising administering to a subject in need thereof a composition, compound, peptide or mimetic of the invention.
Also provided by the present invention is use of a composition, a compound, a peptide or mimetic of the invention in the manufacture of a medicament for treating a disease or disorder in a subject associated with aberrant IR and/or IGF-1R functioning and/or signalling.
Examples of diseases associated with aberrant IR and/or IGF-1R functioning and/or signalling include, but are not limited to, obesity, type I and type II diabetes, cardiovascular disease, osteoporosis, dementia and cancer.
It is also intended that embodiments of the present invention include manufacturing steps such as incorporating the compound, such as a peptide, into a pharmaceutical composition in the manufacture of a medicament.
Throughout this specification, preferred aspects and embodiments apply, as appropriate, separately, or in combination, to other aspects and embodiments, mutatis mutandis, whether or not explicitly stated as such.
The present invention will now be described further with reference to the following examples, which are illustrative only and non-limiting.
Various embodiments of the invention will be described with reference to the following drawings. Some of the figures contain colour representations or entities. Coloured versions of the figures are available from the applicant upon request or from an appropriate patent office.
SEQ ID NO: 1—Amino acid sequence of mature human insulin receptor (isoform A).
SEQ ID NO: 2—Amino acid sequence of mature human insulin receptor (isoform B).
SEQ ID NO: 3—Amino acid sequence of mature Type 1 insulin-like growth factor receptor.
SEQ ID NO: 4—Amino acid sequence of insulin A-chain.
SEQ ID NO: 5—Amino acid sequence of insulin B-chain.
SEQ ID NO: 6—Amino acid sequence of mature insulin-like growth factor I (“IGF-I”).
SEQ ID NO: 7—Amino acid sequence of mature insulin-like growth factor II (“IGF-II”).
SEQ ID NO: 8—Amino acid sequence of IR310.T.
SEQ ID NO: 9—Amino acid sequence of the IR α-chain C-terminal peptide from the IR-A isoform (“αCT”).
SEQ ID NO: 10—Amino acid sequence of the IR α-chain C-terminal peptide from the IR-B isoform.
SEQ ID NO: 11—Amino acid sequence of MAb 83-7 IgG1 heavy chain.
SEQ ID NO: 12—Amino acid sequence of MAb 83-7 kappa light chain.
SEQ ID NO: 13—Amino acid sequence of Fab 83-7 kappa light chain.
SEQ ID NO: 14—Amino acid sequence of Fab 83-7 IgG1 heavy chain.
SEQ ID NO: 15—Amino acid sequence of cIR485.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art (e.g. in molecular biology, biochemistry, structural biology, and/or computational biology). Standard techniques are used for molecular and biochemical methods (see generally, Sambrook et al., 2001, and Ausubel et al., 1999, which are incorporated herein by reference) and chemical methods.
In the present specification and claims, the word ‘comprising’ and its derivatives including ‘comprises’ and ‘comprise’ include each of the stated integers but does not exclude the inclusion of one or more further integers.
Reference throughout this specification to ‘one embodiment’ or ‘an embodiment’ means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearance of the phrases ‘in one embodiment’ or ‘in an embodiment’ in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more combinations.
As used herein the term “homologue” means a protein having at least 30% amino acid sequence identity with IR, insulin, IGF-1R, and/or portions thereof. Preferably, the percentage identity is 40 or 50%, more preferably 60 or 70% and most preferably 80 or 90%. A 95% or above identity is most particularly preferred such as 95%, 96%, 97%, 98%, 99% or 100%.
As used herein the term “higher order complex” is used to describe a multimer of the ternary complex of insulin in complex with the IR and the αCT peptide. A higher order complex also includes a co-complex of one or more components of the insulin/IR/αCT complex and another molecule. Various higher order complexes may have different signalling properties.
As used herein the term “derivatives” means IR/insulin/αCT peptide polypeptide complexes that display the biological activity of the wild-type IR/insulin/αCT peptide interactions, characterised by the replacement of at least one amino acid from the wild-type sequence or the modification of one or more of the naturally-occurring amino acids.
Crystals and the Crystal Structure of Insulin in Complex with the N- and C-Terminal Regions of the IR α-Chain
The present invention provides a crystal comprising an N-terminal region of the IR α-chain based on cIR485 (a “cleavable” construct consisting of the first 3 N-terminal domains of the IR α-chain; SEQ ID NO: 15) in complex with Fab 83-7 (SEQ ID NOs 13 and 14), a C-terminal region of the IR α-chain based on the IR classical αCT peptide (“αCT704-719”; residues 704-719; Genscript; SEQ ID NO: 9) and human insulin (Sigma-Aldrich; SEQ ID NOs 4 and 5; see Examples).
The crystal structure of human insulin complexed IR310.T/83-7/αCT704-719 is presented in
In the human insulin-complexed IR310.T structure, weak electron density extended beyond insulin GluB21 in a direction anti-parallel to the first strand of L1-β2, suggestive of partial ordering of a folded-out B-chain C-terminal strand.
The interface between insulin, the αCT segment and the L1-β2 surface involves well-defined elements of secondary structure, allowing ready discernment in the electron density maps of the overall direction of the constituent residue side chains. The refined model thus allows key residue-specific interactions at the interface to be discerned with confidence, even though a determination of detailed side-chain conformation is precluded. The major interaction between αCT and insulin is seen to be mediated by the side chains of (a) His710, which is directed into a pocket formed by insulin residues ValA3, GlyB8, SerB9 and ValB12, and (b) Phe714, which is directed into the now exposed hydrophobic crevice of insulin lined by residues GlyA1, IleA2, TyrA19, LeuB11, ValB12 and LeuB15 (
Initially, no interpretable density was discerned for insulin residues B20-B26. Final resolution of residues B20-B26 required careful reprocessing of the X-ray data and re-refinement of the model. Ultimately, B20-B24 could be built in a near native-like conformation, with the side chain of residue B24 directed into a hydrophobic pocket formed by the side chains of residues Phe714, Phe39, Leu37 and LeuB15 (
As used herein, the term “crystal” means a structure (such as a three-dimensional (3D) solid aggregate) in which the plane faces intersect at definite angles and in which there is a regular structure (such as an internal structure) of the constituent chemical species. The term “crystal” refers in particular to a solid physical crystal form such as an experimentally prepared crystal.
Crystals according to the invention may be prepared using any IR ectodomain, i.e., the IR polypeptide containing the extracellular domain, including the N- and C-terminal regions, and lacking the transmembrane domain and the intracellular tyrosine kinase domain. Typically, the extracellular domain comprises residues 1 to 917 (mature receptor numbering) of human IR, or the equivalent thereof together with any post-translational modifications of these residues such as N- or O-linked glycosylation.
In a preferred embodiment the IR polypeptide is human IR (SEQ ID NOs: 1 or 2). However, the IR polypeptide may also be obtained from other species, such as other mammalian, vertebrate or invertebrate species.
Crystals may be constructed with wild-type IR polypeptide ectodomain sequences or variants thereof, including allelic variants and naturally occurring mutations as well as genetically engineered variants. Typically, variants have at least 95 or 98% sequence identity with a corresponding wild-type IR ectodomain polypeptide.
Crystals according to the invention may be prepared using any insulin, i.e., insulin polypeptides containing the insulin A-chain and insulin B-chain. Typically, the insulin will be in a monomeric form.
In a preferred embodiment the insulin polypeptides encode human insulin (SEQ ID NOs: 4 and 5).
Crystals may be constructed with wild-type insulin polypeptide sequences or variants thereof, including allelic variants and naturally occurring mutations as well as genetically engineered variants. Typically, variants have at least 95 or 98% sequence identity with a corresponding wild-type insulin polypeptide.
Optionally, the crystal of the IR ectodomain in complex with insulin may comprise one or more compounds which bind to the ectodomain and/or insulin, or otherwise are soaked into the crystal or co-crystallised with the IR ectodomain and/or insulin. Such compounds include ligands or small molecules, which may be candidate pharmaceutical agents intended to modulate the interaction between IR and insulin or IR and biological targets. The crystal of the IR ectodomain in complex with insulin may also be a molecular complex with other receptors of the IGF receptor family such as IGF-1R (SEQ ID NO: 3). The complex may also comprise additional molecules such as the ligands to these receptors, e.g., IGF-I (SEQ ID NO: 6), IGF-II (SEQ ID NO: 7), etc.
The production of IR ectodomain crystals is described below.
In a preferred embodiment, a crystal of the IR ectodomain in complex with insulin of the invention comprises the N- and C-terminal regions of the IR α-chain in complex with insulin having the atomic coordinates set forth in Appendix I. As used herein, the term “atomic coordinates” or “set of coordinates” refers to a set of values which define the position of one or more atoms with reference to a system of axes. It will be understood by those skilled in the art that the atomic coordinates may be varied, without affecting significantly the accuracy of models derived therefrom. Thus, although the invention provides a very precise definition of a preferred atomic structure, it will be understood that minor variations are envisaged and the claims are intended to encompass such variations.
It will be understood that any reference herein to the atomic coordinates or subset of the atomic coordinates shown in Appendix I shall include, unless specified otherwise, atomic coordinates having a root mean square deviation of backbone atoms of not more than 1.5 Å, preferably not more than 1 Å, when superimposed on the corresponding backbone atoms described by the atomic coordinates shown in Appendix I.
The following defines what is intended by the term “root mean square deviation (‘RMSD’)” between two data sets. For each element in the first data set, its deviation from the corresponding item in the second data set is computed. The squared deviation is the square of that deviation, and the mean squared deviation is the mean of all these squared deviations. The root mean square deviation is the square root of the mean squared deviation.
Preferred variants are those in which the RMSD of the x, y and z coordinates for all backbone atoms other than hydrogen is less than 1.5 Å (preferably less than 1 Å, 0.7 Å or less than 0.3 Å) compared with the coordinates given in Appendix I. It will be readily appreciated by those skilled in the art that a 3D rigid body rotation and/or translation of the atomic coordinates does not alter the structure of the molecule concerned.
In a highly preferred embodiment, the crystal has the atomic coordinates as shown in Appendix I.
The present invention also provides a crystal structure of the Site 1 binding site of IR polypeptide comprising the N- and C-terminal regions of the IR α-chain, or regions thereof.
The atomic coordinates obtained experimentally for: amino acids 5 to 310 (the “N-terminal region of the IR α-chain”) and amino acids 705 to 714 (the “C-terminal region of the IR α-chain”) of human IR-A (mature receptor numbering; SEQ ID NO: 1); amino acids 1 to 20 of the human insulin A-chain; and amino acids 7 to 26 of the human insulin B-chain (SEQ ID NOs: 4 and 5) are shown in Appendix I. However, a person skilled in the art will appreciate that a set of atomic coordinates determined by X-ray crystallography is not without standard error. Accordingly, any set of structure coordinates for an IR polypeptide comprising the N- and C-terminal regions of the IR α-chain in complex with human insulin that has a root mean square deviation of protein backbone atoms of less than 0.75 Å when superimposed (using backbone atoms) on the atomic coordinates listed in Appendix I shall be considered identical.
The present invention also comprises the atomic coordinates of the N- and C-terminal regions of the IR α-chain in complex with insulin that substantially conforms to the atomic coordinates listed in Appendix I.
A structure that “substantially conforms” to a given set of atomic coordinates is a structure wherein at least about 50% of such structure has an RMSD of less than about 1.5 Å for the backbone atoms in secondary structure elements in each domain, and more preferably, less than about 1.3 Å for the backbone atoms in secondary structure elements in each domain, and, in increasing preference, less than about 1.0 Å, less than about 0.7 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å for the backbone atoms in secondary structure elements in each domain.
In a more preferred embodiment, a structure that substantially conforms to a given set of atomic coordinates is a structure wherein at least about 75% of such structure has the recited RMSD value, and more preferably, at least about 90% of such structure has the recited RMSD value, and most preferably, about 100% of such structure has the recited RMSD value.
In an even more preferred embodiment, the above definition of “substantially conforms” can be extended to include atoms of amino acid side chains. As used herein, the phrase “common amino acid side chains” refers to amino acid side chains that are common to both the structure which substantially conforms to a given set of atomic coordinates and the structure that is actually represented by such atomic coordinates.
As used herein, the term “IR ectodomain” refers to the extracellular domain of IR lacking the transmembrane domain and the intracellular tyrosine kinase domain of IR, typically comprising residues 1 to 917 (mature IR-A receptor numbering), of human IR, or the equivalent thereof, together with any post-translational modifications of these residues such as N- or O-linked glycosylation.
As used herein, the term “low affinity binding site” for IR means the regions of IR involved in forming the nanomolar affinity binding site (also known as “Site 1”) of IR for insulin, comprising the N- and C-terminal regions of the IR α-chain including one or both of the L1 domain of IR and the CR domain of IR. Insulin binding to the low affinity binding site of IR induces formation of the high affinity insulin binding site of IR and subsequent signal transduction.
As used herein, the term “classical α-chain C-terminal peptide”, or “αCT”, refers in IR to a region of the C-terminal α-chain of IR previously described in the literature as being important for insulin binding (Kurose et al., 1994; Kristensen et al, 2002), and comprising amino acids 704-719 (mature IR-A receptor numbering) as given in SEQ ID NO: 9.
As used herein, the term “leucine-rich repeat domain 1” or “L1 domain” refers in IR to a leucine-rich domain comprising amino acids 1-156 of mature human IR (SEQ ID NO: 1). The L1 domain of IR comprises a central β-sheet, which comprises amino acids selected from 10-15, 32-37, 60-65, 88-97, 116-121 and 142-147 of mature human IR (SEQ ID NO: 1).
As used herein, the term “leucine-rich repeat domain 2” or “L2” domain refers in IR to a leucine-rich domain comprising amino acids 310-469 of mature human IR (SEQ ID NO: 1).
As used herein, the term “loop in the fourth leucine-rich repeat (LRR) rung of the L1 domain”, or variations thereof, refers in IR to a leucine-rich domain comprising amino acids 85-91 of mature human IR (SEQ ID NO: 1).
As used herein, the term “cysteine-rich domain” or “CR” domain refers in IR to a cysteine-rich domain comprising amino acids 157-309 of mature human IR (SEQ ID NO: 1). The CR domain contains many different modules. As used herein, the term “module 6 of the CR domain” refers in IR to amino acids 256-286 of mature human IR (SEQ ID NO: 1).
It will be appreciated that a set of atomic coordinates for one or more polypeptides is a relative set of points that define a shape in three dimensions. Thus, it is possible that an entirely different set of coordinates could define a similar or identical shape. Moreover, slight variations in the individual coordinates will have little effect on overall shape.
The variations in coordinates may be generated due to mathematical manipulations of the atomic coordinates. For example, the atomic coordinates set forth in Appendix I could be manipulated by crystallographic permutations of the atomic coordinates, fractionalisation of the atomic coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the atomic coordinates, or any combination thereof.
Alternatively, modification in the crystal structure due to mutations, additions, substitutions, and/or deletions of amino acids, or other changes in any of the components that make up the crystal could also account for variations in atomic coordinates.
Various computational analyses are used to determine whether a molecular complex or a portion thereof is sufficiently similar to all or parts of the structure of the extracellular domain of IR in complex with insulin described above. Such analyses may be carried out in current software applications, such as the Sequoia program (Bruns et al., 1999).
The Molecular Similarity program permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure.
Comparisons typically involve calculation of the optimum translations and rotations required such that the root mean square deviation of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number is given in Angstroms (“Å”).
Accordingly, atomic coordinates of an IR ectodomain comprising the Site 1 binding site in complex with insulin of the present invention include atomic coordinates related to the atomic coordinates listed in Appendix I by whole body translations and/or rotations. Accordingly, RMSD values listed above assume that at least the backbone atoms of the structures are optimally superimposed which may require translation and/or rotation to achieve the required optimal fit from which to calculate the RMSD value.
A three dimensional structure of an IR ectodomain polypeptide or a region thereof and/or a three dimensional structure of an insulin polypeptide or a region thereof which substantially conforms to a specified set of atomic coordinates can be modelled by a suitable modelling computer program such as MODELLER (Sali & Blundell, 1993), using information, for example, derived from the following data: (1) the amino acid sequence of the human IR ectodomain polypeptide and/or the amino acid sequence of the human insulin polypeptide; (2) the amino acid sequence of the related portion(s) of the protein represented by the specified set of atomic coordinates having a three dimensional configuration; and (3) the atomic coordinates of the specified three dimensional configuration. A three dimensional structure of an IR ectodomain polypeptide and/or a three dimensional structure of an insulin polypeptide which substantially conforms to a specified set of atomic coordinates can also be calculated by a method such as molecular replacement, which is described in detail below.
Atomic coordinates are typically loaded onto a machine-readable medium for subsequent computational manipulation. Thus models and/or atomic coordinates are advantageously stored on machine-readable media, such as magnetic or optical media and random-access or read-only memory, including tapes, diskettes, hard disks, CD-ROMs and DVDs, flash memory cards or chips, servers and the interne. The machine is typically a computer.
The atomic coordinates may be used in a computer to generate a representation, e.g. an image of the three-dimensional structure of the IR ectodomain in complex with insulin which can be displayed by the computer and/or represented in an electronic file.
The atomic coordinates and models derived therefrom may also be used for a variety of purposes such as drug discovery, biological reagent (binding protein) selection and X-ray crystallographic analysis of other protein crystals.
The structure coordinates of IR comprising the N- and C-terminal regions of the α-chain in complex with insulin, such as those set forth in Appendix I, or a subset thereof, can also be used for determining the three-dimensional structure of a molecular complex which contains at least the N- and/or C-terminal regions of the α-chain of IR. In particular, structural information about another crystallised molecular complex may be obtained. This may be achieved by any of a number of well-known techniques, including molecular replacement.
Methods of molecular replacement are generally known by those of skill in the art (generally described in Brunger, 1997; Navaza & Saludjian, 1997; Tong & Rossmann, 1997; Bentley, 1997; Lattman, 1985; Rossmann, 1972; McCoy, 2007).
Generally, molecular replacement involves the following steps. X-ray diffraction data are collected from the crystal of a crystallised target structure. The X-ray diffraction data are transformed to calculate a Patterson function. The Patterson function of the crystallised target structure is compared with a Patterson function calculated from a known structure (referred to herein as a search structure). The Patterson function of the search structure is rotated on the target structure Patterson function to determine the correct orientation of the search structure in the crystal. A translation function is then calculated to determine the location of the search structure with respect to the crystal axes. Once the search structure has been correctly positioned in the unit cell, initial phases for the experimental data can be calculated. These phases are necessary for calculation of an electron density map from which structural differences can be observed and for refinement of the structure. Preferably, the structural features (e.g., amino acid sequence, conserved di-sulphide bonds, and beta-strands or beta-sheets) of the search molecule are related to the crystallised target structure.
The electron density map can, in turn, be subjected to any well-known model building and structure refinement techniques to provide a final, accurate structure of the unknown (i.e., target) crystallised molecular complex (e.g. see Jones et al., 1991; Brünger et al., 1998).
Obtaining accurate values for the phases, by methods other than molecular replacement, is a time-consuming process that involves iterative cycles of approximations and refinements and greatly hinders the solution of crystal structures. However, when the crystal structure of a protein containing at least a homologous portion has been solved, the phases from the known structure provide a satisfactory starting estimate of the phases for the unknown structure.
By using molecular replacement, all or part of the structure coordinates of IR comprising the N- and C-terminal regions of the IR α-chain in complex with insulin provided herein (and set forth in Appendix I) can be used to determine the structure of a crystallised molecular complex whose structure is unknown more rapidly and more efficiently than attempting to determine such information ab initio. This method is especially useful in determining the structure of IR.
The structure of any portion of any crystallised molecular complex that is sufficiently homologous to any portion of the extracellular domain of IR and/or IGF-1R can be solved by this method.
Such structure coordinates are also particularly useful to solve the structure of crystals of IR co-complexed with a variety of molecules, such as chemical entities. For example, this approach enables the determination of the optimal sites for the interaction between chemical entities, and the interaction of candidate IR and/or IGF-1R agonists or antagonists.
All of the complexes referred to above may be studied using well-known X-ray diffraction techniques and may be refined against 1.5-3.5 Å resolution X-ray data to an R value of about 0.25 or less using computer software, such as X-PLOR (Yale University, distributed by Molecular Simulations, Inc.; see Brünger, 1996). This information may thus be used to optimize known IR agonist/antagonists, such as anti-IR antibodies, and more importantly, to design new or improved IR agonists/antagonists.
The three-dimensional structure of the Site 1 binding site of IR in complex with insulin provided by the present invention (Appendix I) can be used to identify potential target binding sites in the Site 1 binding site of IR and/or IGF-1R (i.e., to identify those regions of the Site 1 binding site of IR and/or IGF-1R involved in and important to the binding of insulin, IGF-I and/or IGF-II and subsequent signal transduction) as well as in methods for identifying or designing compounds which interact with the low affinity binding site of IR and/or IGF-1R, e.g., potential modulators of IR and/or IGF-1R.
The three-dimensional structure of IR in complex with insulin provided by the present invention (Appendix I) can be used to identify potential target binding sites in the L1 domain of IR and/or IGF-1R important for binding to insulin and in the C-terminal region of the IR and/or IGF-1R α-chain important for binding to insulin as well as in methods for identifying or designing compounds which interact with the L1 domain of IR and/or IGF-1R and/or with the C-terminal region of the IR and/or IGF-1R α-chain in a manner similar to insulin in complex with the L1 domain and the C-terminal region of the IR α-chain, e.g., potential modulators of IR and/or IGF-1R.
The low affinity binding site of IR is a region of IR ectodomain involved in insulin docking to the receptor. The preferred low affinity target binding site may comprise the C-terminal region of the IR α-chain and one or more regions from the L1 domain of the IR ectodomain. With regards to the L1 domain, the target binding site preferably comprises portions of the molecular surface of the central β-sheet of L1, preferably containing Asp12, Arg14, Leu 36, Leu 37, Leu62, Phe64, Val94, Phe96, Arg118, Glu120 and His144 and portions of the molecular surface of the second leucine-rich repeat (LRR), preferably containing Phe88, Phe89 and Tyr 91. Most preferably, the low affinity binding site contains both portions of the molecular surface of the central β-sheet of L1 and portions of the molecular surface of the second LRR as defined above.
Alternatively, the low affinity target binding site in IR may comprise one or more amino acids from amino acids 704 to 719 (encompassing the C-terminal region of the IR α-chain) plus one or more of the following amino acid sequences: (i) amino acids 1-156 and (ii) amino acids 157-310.
With regards to amino acids 1-156, the target binding site preferably comprises one or more amino acids selected from Asp12, Arg14, Asn15, Gln34, Leu36, Leu37, Leu62, Phe39, Pro43, Phe46, Leu62, Phe64, Leu87, Phe88, Phe89, Asn90, Val94, Phe96, Glu97, Arg118, Gle120 and His144.
With regards to amino acids 157-310, the target binding site preferably comprises one or more amino acids from the amino acid sequence 192-310, more preferably one or more amino acids from the sequence 227-303, yet more preferably one or more amino acids selected from the sequence 259-284.
In a preferred embodiment, van der Waals and/or hydrophobic interactions account for the major portion of the binding energy between a compound and a low affinity insulin binding site of IR.
The three-dimensional structure of the N- and C-terminal regions of the IR α-chain provided by the present invention can also be used to identify or more clearly elucidate potential target binding sites on IGF-1R ectodomain (i.e., to identify those regions, or at least more accurately elucidate those regions of IGF-1R ectodomain involved in and important to the binding of IGF and signal transduction) as well as in methods used for identifying or designing compounds which interact with potential target binding sites of IGF-IR ectodomain, e.g. potential modulators of IGF-1R.
Preferred target binding sites are those governing specificity, i.e., those regions of IGF-1R ectodomain involved in the initial nanomolar affinity binding of IGF (i.e., the initial binding of IGF to IGF-1R).
The low affinity binding site of IGF-IR is a region of IGF-1R ectodomain involved in IGF-I binding to the receptor. Preferred low affinity target binding sites comprise the C-terminal region of IGF-1R α-chain and one or more regions from the L1 domain and/or the CR domain of IGF-1R ectodomain. With regards to the L1 domain, the target binding site preferably comprises the central β-sheet of the L1 domain, and/or that part of the second LRR containing Ser35, and/or the loop in the fourth LRR rung of the L1 domain, or preferably all of these, as defined above. With regards the CR domain, the target binding site preferably comprises module 6 of the CR domain, as defined above.
In a preferred embodiment, van der Waals and/or hydrophobic interactions account for the major portion of the binding energy between a compound and a low affinity binding site of IGF-1R.
Additional preferred binding sites in the case of both IR and IGF-1R, particularly for biological macromolecules such as proteins or aptamers, are those that are devoid of glycosylation or devoid of steric hindrance from glycan covalently attached to the polypeptide at sites in the spatial vicinity.
Design, Selection, Fitting and Assessment of Chemical Entities that Bind IR and/or IGF-1R
Using a variety of known modelling techniques, the crystal structure of the present invention can be used to produce a model for the low affinity binding site of IR and/or IGF-1R.
As used herein, the term “modelling” includes the quantitative and qualitative analysis of molecular structure and/or function based on atomic structural information and interaction models. The term “modelling” includes conventional numeric-based molecular dynamic and energy minimisation models, interactive computer graphic models, modified molecular mechanics models, distance geometry and other structure-based constraint models.
Molecular modelling techniques can be applied to the atomic coordinates of the low affinity binding site of IR in complex with insulin, or at least part of the C-terminal region of the α-chain of IR or insulin, or regions thereof to derive a range of 3D models and to investigate the structure of binding sites, such as the binding sites of monoclonal antibodies, nonimmunoglobulin binding proteins and inhibitory peptides.
These techniques may also be used to screen for or design small and large chemical entities which are capable of binding IR and modulating the ability of IR to interact with extracellular biological targets, such as insulin or members of the IGF receptor family e.g. which modulate the ability of IR to heterodimerise. The screen may employ a solid 3D screening system or a computational screening system.
Such modelling methods are to design or select chemical entities that possess stereochemical complementary to the low affininty binding site of IR and/or IGF-1R, to the regions of the L1 domain of IR and/or IGF-1R with which the C-terminal region of the α-chain of IR and/or IGF-1R interact, or to the regions of the L1 domain of IR and/or IGF-1R with which insulin or IGF interact. By “stereochemical complementarity” we mean that the compound or a portion thereof makes a sufficient number of energetically favourable contacts with the receptor as to have a net reduction of free energy on binding to the receptor.
Modelling method may also be used to design or select chemical entities that possess stereochemical similarity to the low affinity binding site surface of insulin, to the regions of the A-chain and B-chain of insulin that interact with the C-terminal region of the α-chain of IR or to the regions of the A-chain and B-chain of insulin that interact with the L1 domain of IR. By “stereochemical similarity” we mean that the compound or portion thereof makes about the same number of energetically favourable contacts with the receptor as insulin makes as determined by the crystal structure of insulin in complex with the IR α-chain set out by the coordinates shown in Appendix I.
Stereochemical complementarity is characteristic of a molecule that matches intra-site surface residues lining the groove of the receptor site as enumerated by the coordinates set out in Appendix I or a subset thereof. By “match” we mean that the identified portions interact with the surface residues, for example, via hydrogen bonding or by non-covalent Van der Waals and Coulomb interactions (with surface or residue) which promote desolvation of the molecule within the site, in such a way that retention of the molecule at the binding site is favoured energetically.
It is preferred that the stereochemical complementarity is such that the compound has a Kd for the receptor site of less than 10−4M, more preferably less than 10−5M and more preferably 10−6M. In a most preferred embodiment, the Kd value is less than 10−8M and more preferably less than 10−9M.
Chemical entities which are complementary to the shape and electrostatics or chemistry of the receptor site characterised by amino acids positioned at atomic coordinates set out in Appendix I will be able to bind to the receptor, and when the binding is sufficiently strong, substantially prohibit the interaction of the IR and/or IGF-1R ectodomain with biological target molecules such as insulin or IGF.
It will be appreciated that it is not necessary that the complementarity between chemical entities and the receptor site or similarity between the chemical entities and biological receptor target molecules such as insulin or IGF need extend over all residues of the receptor site or target molecule in order to inhibit or mimic binding of a molecule or complex that naturally interacts with IR and/or IGF-1R ectodomain.
A number of methods may be used to identify chemical entities possessing stereochemical complementarity to the low affinitybinding site of IR and/or IGF-1R, to the regions of the L1 domain of IR and/or IGF-1R with which the C-terminal region of the α-chain of IR and/or IGF-1R interact, or to the regions of the L1 domain of IR and/or IGF-1R with which insulin or IGF interact. For instance, the process may begin by visual inspection of the entire low affinity insulin binding site comprising the N- and C-terminal regions of the α-chain of IR, or the equivalent region in IGF-1R, on the computer screen based on the coordinates in Appendix I generated from the machine-readable storage medium. Alternatively, selected fragments or chemical entities may then be positioned in a variety of orientations, or docked, within the low affinity binding site of IR and/or IGF-1R, or within the L1 domain of IR and/or IGF-1R in a manner similar to insulin and/or the C-terminal region of the α-chain of IR, as defined above. Similar methods could be used to identify chemical entities or compounds that may interact with the L1 domain of IR and/or IGF-1R in a manner similar to that of insulin and/or the C-terminal region of the α-chain of IR.
Modelling software that is well known and available in the art may be used (Guida, 1994). These include. Discovery Studio (Accelrys Software Inc., San Diego), SYBYL (Tripos Associates, Inc., St. Louis, Mo., 1992), Maestro (Schrödinger LLC, Portland), MOE (Chemical Computing Group Inc., Montreal, Canada). This modelling step may be followed by energy minimization with standard molecular mechanics force fields such as AMBER (Weiner et al., 1984), OPLS (Jorgensen and Tirado-Rives, 1988) and CHARMM (Brooks et al., 1983). In addition, there are a number of more specialized computer programs to assist in the process of selecting the binding moieties of this invention.
Specialised computer programs may also assist in the process of selecting fragments or chemical entities. These include, inter alia:
Once suitable chemical entities or fragments have been selected, they can be assembled into a single compound. In one embodiment, assembly may proceed by visual inspection of the relationship of the fragments to each other on the three-dimensional image displayed on a computer screen in relation to the structure coordinates of the low affinity site of IR, or the L1 domain to which insulin and/or the C-terminal region of the α-chain of IR binds. This is followed by manual model building using software such as Discovery Studio, Maestro, MOE or Sybyl. Alternatively, fragments may be joined to additional atoms using standard chemical geometry.
The above-described evaluation process for chemical entities may be performed in a similar fashion for chemical compounds.
Useful programs to aid one of skilled in the art in connecting the individual chemical entities or fragments include:
Other molecular modelling techniques may also be employed in accordance with this invention, see, e.g., Cohen et al. (1990) and Navia & Murcko (1992).
There are two preferred approaches to designing a molecule according to the present invention that complement the stereochemistry of the low affinity binding site of IR and/or IGF-1R, or the L1 domain to which insulin and/or the C-terminal region of the α-chain of IR binds. The first approach is to in silky) directly dock molecules from a three-dimensional structural database, to the target binding site, using mostly, but not exclusively, geometric criteria to assess the goodness-of-fit of a particular molecule to the site. In this approach, the number of internal degrees of freedom (and the corresponding local minima in the molecular conformation space) is reduced by considering only the geometric (hard-sphere) interactions of two rigid bodies, where one body (the active site) contains “pockets” or “grooves” that form binding sites for the second body (the complementing molecule).
Flexibility of the receptor, IR or IGF-1R, can be incorporated into the in silico screening by the application of multiple conformations of the receptor (Totrov and Abagyan, 2008). The multiple conformations of the IR receptor can be generated from the coordinates listed in Appendix I computationally by use of molecular dynamics simulation or similar approaches.
This approach is illustrated by Kuntz et al. (1982) and Ewing et al. (2001), the contents of which are hereby incorporated by reference, whose algorithm for ligand design is implemented in a commercial software package, DOCK version 4.0, distributed by the Regents of the University of California and further described in a document, provided by the distributor, which is entitled. “Overview of the DOCK program suite” the contents of which are hereby incorporated by reference. Pursuant to the Kuntz algorithm, the shape of the cavity in which the C-terminal region of the α-chain of IR fits and/or insulin fits is defined as a series of overlapping spheres of different radii. One or more extant databases of crystallographic data, such as the Cambridge Structural Database System (The Cambridge Crystallographic Data Centre, Cambridge, U.K.), the Protein Data Bank, maintained by the Research Collaboratory for Structural Bioinformatics (Rutgers University, N.J., U.S.A.), LeadQuest (Tripos Associates, Inc., St. Louis, Mo.), Available Chemicals Directory (Symyx Technologies Inc.), and the NCI database (National Cancer Institute, U.S.A) is then searched for molecules which approximate the shape thus defined.
Molecules identified on the basis of geometric parameters, can then be modified to satisfy criteria associated with chemical complementarity, such as hydrogen bonding, ionic interactions and van der Waals interactions. Different scoring functions can be employed to rank and select the best molecule from a database. See for example Bohm & Stahl (1999). The software package FlexX, marketed by Tripos Associates, Inc. (St. Louis, Mo.) is another program that can be used in this direct docking approach (see Rarey et al., 1996).
The second preferred approach entails an assessment of the interaction of respective chemical groups (“probes”) with the active site at sample positions within and around the site, resulting in an array of energy values from which three-dimensional contour surfaces at selected energy levels can be generated. The chemical-probe approach to ligand design is described, for example, by Goodford, (1985), the contents of which are hereby incorporated by reference, and is implemented in several commercial software packages, such as GRID (product of Molecular Discovery Ltd., Italy).
Pursuant to this approach, the chemical prerequisites for a site-complementing molecule are identified at the outset, by probing the active site with different chemical probes, e.g., water, a methyl group, an amine nitrogen, a carboxyl oxygen, or a hydroxyl. Favoured sites for interaction between the active site and each probe are thus determined, and from the resulting three-dimensional pattern of such sites a putative complementary molecule can be generated. This may be done either by programs that can search three-dimensional databases to identify molecules incorporating desired pharmacophore patterns or by programs which use the favoured sites and probes as input to perform de novo design. Suitable programs for determining and designing pharmacophores include CATALYST (Accelrys Software, Inc), and CERIUS2, DISCO (Abbott Laboratories, Abbott Park, Ill.; distributed by Tripos Associates Inc.).
The pharmacophore can be used to screen in silico compound libraries/three-dimensional databases, using a program such as CATALYST (Accelrys Software, Inc) and Sybyl/3DB Unity (Tripos Associates, Inc., St. Louis, Mo.).
Databases of chemical structures are available from a number of sources including Cambridge Crystallographic Data Centre (Cambridge, U.K.), Molecular Design, Ltd., (San Leandro, Calif.), Tripos Associates, Inc. (St. Louis, Mo.), Chemical Abstracts Service (Columbus, Ohio), the Available Chemical Directory (Symyx Technologies, Inc.), the Derwent World Drug Index (WDI), BioByteMasterFile, the National Cancer Institute database (NCI), Medchem Database (BioByte Corp.), and the Maybridge catalogue.
De novo design programs include LUDI (Accelrys Software Inc., San Diego, Calif.), Leapfrog (Tripos Associates, Inc.), and LigBuilder (Peking University, China).
Once an entity or compound has been designed or selected by the above methods, the efficiency with which that entity or compound may bind to IR and/or IGF-1R can be tested and optimised by computational evaluation. For example, a compound that has been designed or selected to function as an IR binding compound must also preferably traverse a volume not overlapping that occupied by the binding site when it is bound to the native IR. An effective IR binding compound must preferably demonstrate a relatively small difference in energy between its bound and free states (i.e., a small deformation energy of binding). Thus, the most efficient IR binding compound should preferably be designed with a deformation energy of binding of not greater than about 10 kcal/mole, preferably, not greater than 7 kcal/mole. IR and/or IGF-1R binding compounds may interact with IR and/or IGF-1R in more than one conformation that are similar in overall binding energy. In those cases, the deformation energy of binding is taken to be the difference between the energy of the free compound and the average energy of the conformations observed when the compound binds to the protein.
A compound designed or selected as binding to IR and/or IGF-1R may be further computationally optimised so that in its bound state it would preferably lack repulsive electrostatic interaction with the target protein.
Such non-complementary (e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole and charge-dipole interactions. Specifically, the sum of all electrostatic interactions between the compound and the protein when the compound is bound to IR and/or IGF-IR, preferably make a neutral or favourable contribution to the enthalpy of binding.
Once an IR-binding compound and/or IGF-1R-binding compound has been optimally selected or designed, as described above, substitutions may then be made in some of its atoms or side groups to improve or modify its binding properties. Generally, initial substitutions are conservative, i.e., the replacement group will have approximately the same size, shape, hydrophobicity and charge as the original group. It should, of course, be understood that components known in the art to alter conformation should be avoided. Such substituted chemical compounds may then be analysed for efficiency of fit to IR and/or IGF-IR by the same computer methods described in detail above.
Specific computer software is available in the art to evaluate compound deformation energy and electrostatic interaction. Examples of programs designed for such uses include: Gaussian 03, (Frisch, Gaussian, Inc., Pittsburgh, Pa.); GAMESS (Gordon et al., Iowa State University); Jaguar (Schrödinger LLC, Portland); AMBER, version 9.0 (Case et al, University of California at San Francisco); CHARMM (Accelrys Software, Inc., San Diego, Calif.); and GROMACS version 4.0 (van der Spoel et al.).
The screening/design methods may be implemented in hardware or software, or a combination of both. However, preferably, the methods are implemented in computer programs executing on programmable computers each comprising a processor, a data storage system (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. Program code is applied to input data to perform the functions described above and generate output information. The output information is applied to one or more output devices, in known fashion. The computer may be, for example, a personal computer, microcomputer, or workstation of conventional design.
Each program is preferably implemented in a high level procedural or object-oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be compiled or interpreted language.
Each such computer program is preferably stored on a storage medium or device (e.g., ROM or magnetic diskette) readable by a general or special purpose programmable computer, for configuring and operating the computer when the storage media or device is read by the computer to perform the procedures described herein. The system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner to perform the functions described herein.
Compounds of the present invention include both those designed or identified using a screening method of the invention and those which are capable of recognising and binding to the low affinity binding site of IR and/or IGF-1R as defined above. Also encompassed by the present invention are compounds that bind to the L1 domain of IR in a manner similar to that of insulin and/or the C-terminal region of the α-chain of IR, i.e., compounds which mimic insulin and/or the C-terminal region of the α-chain of IR.
Compounds capable of recognising and binding to the low affinity binding site of IR and/or IGF-1R may be produced using (i) a screening method based on use of the atomic coordinates corresponding to the 3D structure of the low affinity binding site of IR in complex with insulin; or (ii) a screening method based on the use of the atomic coordinates corresponding to the 3D structure of insulin in complex with IR. Alternatively, compounds may be identified by screening against a specific target molecule which is indicative of the capacity to bind to the low affinity binding site of IR.
Compounds capable of recognising and binding to the L1 domain of IR and/or IGF-1R in a manner similar to that of insulin and/or the C-terminal region of the α-chain of IR to the L1 domain of IR (i.e. compounds which mimic insulin and/or the C-terminal region of the α-chain of IR) may be produced using a screening method based on use of the atomic coordinates corresponding to the 3D structure of the insulin and/or the C-terminal region of the α-chain of IR in isolation or as it associates with IR, or alternatively may be identified by screening against a specific target molecule which is indicative of the capacity to bind to the low affinity binding site of IR.
The candidate compounds and/or compounds identified or designed using a method of the present invention may be any suitable compound, synthetic or naturally occurring, preferably synthetic. In one embodiment, a synthetic compound selected or designed by the methods of the invention preferably has a molecular weight equal to or less than about 5000, 4000, 3000, 2000, 1000 or 500 daltons. A compound of the present invention is preferably soluble under physiological conditions.
The compounds may encompass numerous chemical classes, though typically they are organic molecules, preferably small organic compounds having a molecular weight of more than 50 and less than about 2,500 daltons, preferably less than 1,500, more preferably less than 1,000 and yet more preferably less than 500. Such compounds can comprise functional groups necessary for structural interaction with proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The compound may comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or more of the above functional groups. Compounds can also comprise biomolecules including peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogues, or combinations thereof.
Compounds may include, for example: (1) peptides such as soluble peptides, including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et al., 1991; Houghten et al., 1991) and combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration amino acids; (2) phosphopeptides (e.g., members of random and partially degenerate, directed phosphopeptide libraries, see, e.g., Songyang et al., 1993); (3) antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, and single chain antibodies as well as Fab, (Fab)2′, Fab expression library and epitope-binding fragments of antibodies); (4) nonimmunoglobulin binding proteins such as but not restricted to avimers, DARPins and lipocalins; (5) nucleic acid-based aptamers; and (6) small organic and inorganic molecules.
Ligands can be obtained from a wide variety of sources including libraries of synthetic or natural compounds. Synthetic compound libraries are commercially available from, for example, Maybridge Chemical Co. (Tintagel, Cornwall, UK), AMRI (Budapest, Hungary) and ChemDiv (San Diego, Calif.), Specs (Delft, The Netherlands).
Natural compound libraries comprising bacterial, fungal, plant or animal extracts are available from, for example, Pan Laboratories (Bothell, Wash.), TimTec (Newark, Del.). In addition, numerous means are available for random and directed synthesis of a wide variety of organic compounds and biomolecules, including expression of randomized oligonucleotides.
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts can be readily produced. Methods for the synthesis of molecular libraries are readily available (see, e.g., DeWitt et al., 1993; Erb et al., 1994; Zuckermann et al., 1994; Cho et al., 1993; Carell et al., 1994a; Carell et al., 1994b; and Gallop et al., 1994). In addition, natural or synthetic compound libraries and compounds can be readily modified through conventional chemical, physical and biochemical means (see, e.g., Blondelle and Houghton, 1996), and may be used to produce combinatorial libraries. In another approach, previously identified pharmacological agents can be subjected to directed or random chemical modifications, such as acylation, alkylation, esterification, amidification, and the analogues can be screened for IR and/or IGF-1R-modulating activity.
Numerous methods for producing combinatorial libraries are known in the art, including those involving biological libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the “one-bead one-compound” library method; and synthetic library methods using affinity chromatography selection. The biological library approach is limited to polypeptide or peptide libraries, while the other four approaches are applicable to polypeptide, peptide, nonpeptide oligomer, or small molecule libraries of compounds (Lam, 1997).
Compounds also include those that may be synthesized from leads generated by fragment-based drug design, wherein the binding of such chemical fragments is assessed by soaking or co-crystallizing such screen fragments into crystals provided by the invention and then subjecting these to an X-ray beam and obtaining diffraction data. Difference Fourier techniques are readily applied by those skilled in the art to determine the location within the IR ectodomain structure at which these fragments bind, and such fragments can then be assembled by synthetic chemistry into larger compounds with increased affinity for the receptor.
Compounds identified or designed using the methods of the invention can be a peptide or a mimetic thereof.
The isolated peptides or mimetics of the invention may be conformationally constrained molecules or alternatively molecules which are not conformationally constrained such as, for example, non-constrained peptide sequences. The term “conformationally constrained molecules” means conformationally constrained peptides and conformationally constrained peptide analogues and derivatives.
The term “analogues” refers to molecules having a chemically analogous structure to naturally occurring α-amino acids. Examples include molecules containing gem-diaminoalkyl groups or alklylmalonyl groups.
The term “derivatives” includes α-amino acids wherein one or more side groups found in the naturally occurring α-amino acids have been modified. Thus, for example the amino acids may be replaced with a variety of uncoded or modified amino acids such as the corresponding D-amino acid or N-methyl amino acid. Other modifications include substitution of hydroxyl, thiol, amino and carboxyl functional groups with chemically similar groups.
With regard to peptides and mimetics thereof, other examples of other unnatural amino acids or chemical amino acid analogues/derivatives which can be introduced as a substitution or addition include, but are not, limited to, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid analogues in general.
The mimetic may be a peptidomimetic. A “peptidomimetic” is a molecule that mimics the biological activity of a peptide but is no longer peptidic in chemical nature. By strict definition, a peptidomimetic is a molecule that no longer contains any peptide bonds (that is, amide bonds between amino acids). However, the term peptide mimetic is sometimes used to describe molecules that are no longer completely peptidic in nature, such as pseudo-peptides, semi-peptides and peptoids. Whether completely or partially non-peptide, peptidomimetics for use in the methods of the invention, and/or of the invention, provide a spatial arrangement of reactive chemical moieties that closely resembles the three-dimensional arrangement of active groups in the peptide on which the peptidomimetic is based. As a result of this similar active-site geometry, the peptidomimetic has effects on biological systems which are similar to the biological activity of the peptide.
There are sometimes advantages for using a mimetic of a given peptide rather than the peptide itself, because peptides commonly exhibit two undesirable properties: (1) poor bioavailability; and (2) short duration of action. Peptide mimetics offer an obvious route around these two major obstacles, since the molecules concerned are small enough to be both orally active and have a long duration of action. There are also considerable cost savings and improved patient compliance associated with peptide mimetics, since they can be administered orally compared with parenteral administration for peptides. Furthermore, peptide mimetics are generally cheaper to produce than peptides.
Suitable peptidomimetics based on insulin, a fragment thereof, and/or the C-terminal region of the α-chain of IR and/or IGR-1R can be developed using readily available techniques. Thus, for example, peptide bonds can be replaced by non-peptide bonds that allow the peptidomimetic to adopt a similar structure, and therefore biological activity, to the original peptide. Further modifications can also be made by replacing chemical groups of the amino acids with other chemical groups of similar structure. The development of peptidomimetics derived from peptides of insulin, a fragment thereof, and/or the C-terminal region of the IR and/or IGF-1R α-chain can be aided by reference to the three dimensional structure of these residues as provided in Appendix I. This structural information can be used to search three-dimensional databases to identify molecules having a similar structure, using programs such as Sybyl/3DB Unity (Tripos Associates, St. Louis, Mo.).
Those skilled in the art will recognize that the design of a peptidomimetic may require slight structural alteration or adjustment of a chemical structure designed or identified using the methods of the invention. In general, chemical compounds identified or designed using the methods of the invention can be synthesized chemically and then tested for ability to modulate IR and/or IGF-1R activity using any of the methods described herein. The methods of the invention are particularly useful because they can be used to greatly decrease the number potential mimetics which must be screened for their ability to modulate IR and/or IGF-1R activity.
The peptides or peptidomimetics of the present invention can be used in assays for screening for candidate compounds which bind to regions of IR and/or IGF-1R and potentially interfere with the binding of insulin to IR and/or signal transduction and/or the binding of IGF to IGF-1R and/or signal transduction. Peptides or peptidomimetics which mimic target binding sites are particularly useful as specific target molecules for identifying potentially useful ligands for IR and/or IGF-IR.
Standard solid-phase ELISA assay formats are particularly useful for identifying compounds that bind to the receptor. In accordance with this embodiment, the peptide or peptidomimetic immobilized on a solid matrix, such as, for example an array of polymeric pins or a glass support. Conveniently, the immobilized peptide or peptidomimetic is a fusion polypeptide comprising Glutathione-5-transferase (GST; e.g. a CAP-ERK fusion), wherein the GST moiety facilitates immobilization of the protein to the solid phase support. This assay format can then be used to screen for candidate compounds that bind to the immobilised peptide or peptidomimetic and/or interfere with binding of a natural binding partner of IR and/or IGF-IR to the immobilised peptide or peptidomimetic.
As used herein a “fragment” is a portion of a peptide of the invention which maintains a defined activity of the “full-length” peptide, namely the ability to bind to the low affinity binding site of IR and/or IGF-1R, or to bind to the L1 domain of IR and/or IGF-1R. Fragments can be any size as long as they maintain the defined activity. Preferably, the fragment maintains at least 50%, more preferably at least 75%, of the activity of the full length polypeptide.
The % identity of a peptide is determined by GAP (Needleman and Wunsch, 1970) analysis (GCG program) with a gap creation penalty=5, and a gap extension penalty=0.3. The query sequence is at least 10 amino acids in length, and the GAP analysis aligns the two sequences over a region of at least 10 amino acids. More preferably, the GAP analysis aligns two sequences over their entire length.
With regard to a defined peptide, it will be appreciated that % identity figures higher than those provided above will encompass preferred embodiments. Thus, where applicable, in light of the minimum % identity figures, it is preferred that the peptide comprises an amino acid sequence which is at least 50%, more preferably at least 55%, more preferably at least 60%, more preferably at least 65%, more preferably at least 70%, more preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at least 91%, more preferably at least 92%, more preferably at least 93%, more preferably at least 94%, more preferably at least 95%, more preferably at least 96%, more preferably at least 97%, more preferably at least 98%, more preferably at least 99%, more preferably at least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more preferably at least 99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% identical to the relevant nominated SEQ ID NO.
Amino acid sequence mutants of the peptides identified or designed using the methods of the invention, and/or of the present invention, can be prepared by introducing appropriate nucleotide changes into a nucleic acid of the present invention, or by in vitro synthesis of the desired peptide. Such mutants include, for example, deletions, insertions or substitutions of residues within the amino acid sequence. A combination of deletion, insertion and substitution can be made to arrive at the final construct, provided that the final peptide product possesses the desired characteristics.
In designing amino acid sequence mutants, the location of the mutation site and the nature of the mutation will depend on characteristic(s) to be modified. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conservative amino acid choices and then with more radical selections depending upon the results achieved, (2) deleting the target residue, or (3) inserting other residues adjacent to the located site.
Substitution mutants have at least one amino acid residue in the peptide removed and a different residue inserted in its place. Sites of interest are those in which particular residues obtained from various strains or species are identical. These sites, especially those falling within a sequence of at least three other identically conserved sites, are preferably substituted in a relatively conservative manner. Such conservative substitutions are shown in Table 1 under the heading of “exemplary substitutions”.
In a preferred embodiment a mutant/variant peptide has one or two or three or four conservative amino acid changes when compared to a peptide defined herein. Details of conservative amino acid changes are provided in Table 1.
Also included within the scope of the invention are peptides which are differentially modified during or after synthesis, e.g., by biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. These modifications may serve to increase the stability and/or bioactivity of the peptide.
The residues that form the αCT segment of the IR α-chain, namely residues 704 to 719, can be grouped into four classes: Class A—those whose side chains are completely buried in the interface with insulin (His710, Asn711 and Phe714); Class B—those whose sides chains are completely buried in the interface with L1 (Phe703, Phe705 Tyr708, Leu709, Val712, Val713); Class C—those whose side chains lie at the periphery of the interface with insulin and/or the L1 domain; and Class D—those whose side chains appear to have no interaction with insulin or L1. In terms of using the αCT segment itself as a scaffold for mimetics that might compete with the naturally-occurring αCT segment in its binding to insulin and/or the L1 domain, design might focus in the first instance on substitution of those residues belonging to Class C, given that the residues lying in Classes A and B are relatively optimally packed within the interfaces and already have few degrees of freedom. Higher affinity binding might be achieved by substitution of one or more of the Class C residues with either naturally-occurring amino acids or non-natural amino acids. For example, the substitution of one or more of the charged residues with residues that have reduced rotameric degrees of freedom (i.e., reduced entropy) may lead to higher affinity binding or altered physicochemical properties of the compound. Such modification for example may include substitution by the naturally-occurring amino acids Phe, Tyr or Trp. As a further example, it may be possible to substitute residues with more bulky non-natural amino acids that retain the terminal cationic character, for example substitution by the basic phenyl propanoic acid derivatives App (L-2-amino-3-(4-aminophenyl)propanoic acid) and/or Gpp, (L-2-amino-3-(4-guanidinophenyl)propanoic acid) (Svenson et al., 2009). A further strategy for design might involve substitutions to improve the overall stability of the helical structure (for example helix stapling—see Danial et al., 2008). Such substitutions would likely be made within Class D residues. A yet further strategy to improve affinity or physicochemical properties might involve truncation of the helical segment and/or attaching an N- or C-terminal group also designed to improve affinity. Similar principles of design may be applied to generate modified peptides based on the corresponding native IGF-1R peptide as outlined above for the IR peptide.
Likewise, the residues that form the Site 1 binding surface of insulin can be grouped into five classes: Class A—those whose side chains are complete buried in the interface with the L1 domain (TyrB16, PheB24); Class B—those whose sides chains are completely buried in the interface with the αCT segment (ValA3, GlyB8, SerB9, GlyA1, IleA2, TyrA19, LeuB11, LeuB15, GluA4 and PheB25); Class C—those whose side chains mediate the interface with both the L1 domain and the αCT segment (ValB12 and TyrB26); Class D—those whose side chains lie at the periphery of the interface with the L1 domain and/or the αCT segment; and Class E—those whose side chains appear to have no interaction with the L1 domain or the αCT segment. In terms of using the insulin monomer as defined in complex with IR itself as a scaffold for mimetics that might compete with the naturally-occurring insulin in its binding to the L1 domain and/or the αCT segment, design might focus in the first instance on substitution of those residues belonging to Class D, given that the residues lying in Classes A and B are relatively optimally packed within the interfaces and already have few degrees of freedom. Higher affinity binding might be achieved by substitution of one or more of the Class D residues with either naturally-occurring amino acids or non-natural amino acids. For example, the substitution of one or more of the charged residues with residues that have reduced rotameric degrees of freedom (i.e., reduced entropy) may lead to higher affinity binding or altered physicochemical properties of the compound. Such modification for example may include substitution by the naturally-occurring amino acids Phe, Tyr or Trp. As a further example, it may be possible to substitute residues with more bulky non-natural amino acids that retain the terminal cationic character, for example substitution by the basic phenyl propanoic acid derivatives App (L-2-amino-3-(4-aminophenyl)propanoic acid) and/or Gpp, (L-2-amino-3-(4-guanidinophenyl)propanoic acid) (Swenson et, al., 2009). A further strategy for design might involve substitutions to improve the overall stability of the conformational change in insulin upon binding to the IR. Such substitutions would likely be made within Class E residues. A yet further strategy to improve affinity or physicochemical properties might involve truncation of the C-terminus residues B26 to B30 and/or attaching an N- or C-terminal group also designed to improve affinity. Similar principles of design may be applied to generate modified peptides based on the corresponding native IGF-1R peptide as outlined above for the IR peptide.
The design of synthetic non-peptide mimetics of α-helices is an established art (see for example Davis et al., 2006). In particular, methods of mimicry of i, i+4, i+7 motifs (such as those identified within the αCT segment of the α-chain of IR and IGF-1R and which interact the respective L1 domains are known. For example, these motifs may be mimicked by terphenyl, oligophenyl, chalcone or 1,4-benzodiazepine-2,5-dione scaffolds (Davis et al., 2006) or by benzoylurea scaffolds (US 2008153802). Non-peptide mimetics of α-helices have been investigated as therapeutics in a number of disease contexts, for example HIV1 infection (disruption of the assembly of the hexameric helical bundle (Ernst et al., 2001)) and cancer (disruption of the assembly of the HDM2-p53 complex (Yin et al., 2005); inhibitors of Bcl-2 family heterodimerisation (Degterev et al., 2001).
With regard to redesigning compounds using a method of the invention, in an embodiment the compound is redesigned to be more structurally similar to the native αCT segment of the α-chain of IR and/or the αCT segment of the α-chain of IGF-1R. Examples of peptides which could be redesigned in this manner include, but are not limited to, those described by Schäffer et al. (2003) and/or U.S. Pat. No. 7,173,005.
In an alternative embodiment, the compound is redesigned to be more structurally similar to native insulin in complex with the IR. Preferably, the compound is redesigned to mimic the conformational structural changes in insulin upon binding to the IR. More preferably, the compound is redesigned to mimic the conformational structural changes in the insulin B-chain upon binding to the IR.
Interaction of Compounds with IR and/or IGF-1R
A compound may interact with the low affinity binding site of IR and/or IGF-1R by binding either directly or indirectly to that region. A compound which binds directly, binds to the specified region. A compound which binds indirectly, binds to a region in close proximity to or adjacent to the low affinity binding site of IR and/or IGF-1R with the result that it interferes with the ability of IR to bind to insulin, or IGF-1R to bind IGF, either antagonistically or agonistically. Such interference may be steric, electrostatic, or allosteric. Preferably, a compound interacts with the low affinity binding site of IR and/or IGF-1R by binding directly to the specified region. In the case of compounds that bind to specific target molecules, such compounds bind directly to the specific target molecule.
A compound may alternatively interact with the L1 domain of IR and/or IGF-1R in a manner similar to that of the αCT segment of the α-chain of IR by binding either directly or indirectly to that region. A compound which binds directly, binds to the specified region. A compound which binds indirectly, binds to a region in close proximity to or adjacent to the L1 domain of IR and/or IGF-1R in a manner similar to that of the αCT segment of the α-chain of IR with the result that it interferes with the ability of IR to bind to insulin, or IGF-1R to bind IGF, either antagonistically or agonistically. Such interference may be steric, electrostatic, or allosteric. Preferably, a compound interacts with the L1 domain of IR and/or IGF-1R in a manner similar to that of the αCT segment of the α-chain of IR by binding directly to the specified region. In the case of compounds that bind to specific target molecules, such compounds bind directly to the specific target molecule.
Binding can be either by covalent or non-covalent interactions, or both. Examples of non-covalent interactions include electrostatic interactions, van der Waals interactions, hydrophobic interactions and hydrophilic interactions.
When a compound of the invention interacts with IR and/or IGF-1R, it preferably “modulates” IR or IGF-1R, respectively. By “modulate” we mean that the compound changes an activity of IR or IGF-1R by at least 10%. Suitably, a compound modulates IR or IGF-1R by increasing or decreasing signal transduction via IR or IGF-1R, respectively. The phrase “decreases signal transduction” is intended to encompass partial or complete inhibition of signal transduction via IR or IGF-1R. The ability of a candidate compound to increase or decrease signal transduction via IR or IGF-1R can be assessed by any one of the IR or IGF-1R cell-based assays described herein.
Compounds may act as antagonists or agonists for insulin binding to IR or as antagonists or agonists for IGF binding to IGF-1R.
Compounds of the present invention preferably have an affinity for IR or IGF-1R sufficient to provide adequate binding for the intended purpose. Suitably, such compounds and compounds which bind to specific target molecules of IR or IGF-1R have an affinity (Kd) of from 10−5 to 10−15 M. For use as a therapeutic, the compound suitably has an affinity (Kd) of from 10−7 to 10−15 M, preferably from 10−8 to 10−12 M and more preferably from 10−10 to 10−12 M. Where a compound is to be used as a reagent in a competitive assay to identify other ligands, the compound suitably has an affinity (Kd) of from 10−5 to 10−12 M.
As will be evident to the skilled person, the crystal structure presented herein has enabled, for the first time, direct visualisation of the regions binding insulin in the IR. The structure has enabled the identification of the αCT segment of the α-chain of IR, critical for the initial binding of insulin, and in the subsequent formation of the high affinity insulin-IR complex that leads to signal transduction.
In one preferred embodiment, a compound has a high specificity for IR and/or a specific target molecule of IR but not for IGF-1R, i.e., a compound selectively binds to IR or has enhanced selectivity for IR over IGF-1R. In this respect, a compound suitably has an affinity (IQ) for IR and/or a specific target molecule of IR of no more than 10−5 M, preferably no more than 10−7 M, and an affinity for IGF-1R of at least 10−5 M, preferably at least 10−3 M. Such compounds are desirable as, for example, IR agonists where the propensity to interact with IGF-1R and thus, for example, promote undesirable cell proliferation, is reduced.
In a preferred embodiment, the (IR or specific target molecule of IR)/IGF-1R binding affinity ratio for a compound is at least 10 and preferably at least 100.
In another preferred embodiment, a compound has a high specificity for IGF-1R and/or a specific target molecule of IGF-1R but not for IR, i.e., a compound selectively binds to IGF-1R or has enhanced selectivity for IGF-1R over IR. In this respect, a compound suitably has an affinity (IQ) for IGF-1R and/or a specific target molecule of IGF-1R of no more than 10−5 M, preferably no more than 10−7 M, and an affinity for IR of at least 10−5 M, preferably at least 10−3 M. Such compounds are desirable as, for example, IGF-1R agonists where there propensity to interact with IR and thus, for example, promote glucose uptake and metabolism, is reduced.
In a preferred embodiment, the (IGF-1R or specific target molecule of IGF-1R)/(IR) binding affinity ratio for a compound is at least 10 and preferably at least 100.
Compounds of the invention may be subjected to further confirmation of binding to IR and/or IGF-1R by co-crystallization of the compound with IR and/or IGF-1R and structural determination, as described herein.
Compounds designed or selected according to the methods of the present invention are preferably assessed by a number of in vitro and in vivo assays of IR and/or IGF-1R function to confirm their ability to interact with and modulate IR and/or IGF-1R activity. For example, compounds may be tested for their ability to bind to IR and/or IGF-1R and/or for their ability to modulate e.g. disrupt, IR and/or IGF-1R signal transduction.
Libraries may be screened in solution by methods generally known in the art for determining whether ligands competitively bind at a common binding site. Such methods may include screening libraries in solution (e.g., Houghten, 1992), or on beads (Lam, 1991), chips (Fodor, 1993), bacteria or spores (U.S. Pat. No. 5,223,409), plasmids (Cull et al., 1992), or on phage (Scott & Smith, 1990; Devlin, 1990; Cwirla et al., 1990; Felici, 1991; U.S. Pat. No. 5,223,409).
Where the screening assay is a binding assay, IR or IGF-1R may be joined to a label, where the label can directly or indirectly provide a detectable signal. Various labels include radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes, specific binding molecules, particles, e.g., magnetic particles, and the like. Specific binding molecules include pairs, such as biotin and streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the complementary member would normally be labelled with a molecule that provides for detection, in accordance with known procedures.
A variety of other reagents may be included in the screening assay. These include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which are used to facilitate optimal protein-protein binding and/or reduce non-specific or background interactions. Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, antimicrobial agents, etc., may be used. The components are added in any order that produces the requisite binding. Incubations are performed at any temperature that facilitates optimal activity, typically between 4 and 40° C.
Direct binding of compounds to IR or IGF-1R can also be done by Surface Plasmon Resonance (BIAcore) (reviewed in Morton & Myszka, 1998). Here the receptor is immobilized on a CM5 or other sensor chip by either direct chemical coupling using amine or thiol-disulphide exchange coupling (Nice & Catimel, 1999) or by capturing the receptor ectodomain as an Fe fusion protein to an appropriately derivatised sensor surface (Morten & Myszka, 1998). The potential binding molecule (called an analyte) is passed over the sensor surface at an appropriate flow rate and a range of concentrations. The classical method of analysis is to collect responses for a wide range of analyte concentrations. A range of concentrations provides sufficient information about the reaction, and by using a fitting algorithm such as CLAMP (see Morton & Myszka, 1998), rate constants can be determined (Morton & Myszka, 1998; Nice & Catimel, 1999). Normally, the ligand surface is regenerated at the end of each analyte binding cycle. Surface regeneration ensures that the same number of ligand binding sites is accessible to the analyte at the beginning of each cycle.
Incubation periods are selected for optimum activity, but may also be optimized to facilitate rapid high-throughput screening. Normally, between 0.1 and 1 hour will be sufficient. In general, a plurality of assay mixtures is run in parallel with different test agent concentrations to obtain a differential response to these concentrations. Typically, one of these concentrations serves as a negative control, i.e. at zero concentration or below the level of detection.
The basic format of an in vitro competitive receptor binding assay as the basis of a heterogeneous screen for small organic molecular replacements for insulin may be as follows: occupation of the low affinity binding site of IR and/or IGF-1R is quantified by time-resolved fluorometric detection (TRFD) as described by Denley et al., 2004. R−IR-A, R−IR-B and P6 cells are used as sources of IR-A, IR-B and IGF-1R respectively. Cells are lysed with lysis buffer (20 mM HEPES, 150 mM NaCl, 1.5 mM MgCl2, 10% (v/v) glycerol, 1% (v/v) Triton X-100, 1 mM EGTA pH 7.5) for 1 hour at 4° C. Lysates are centrifuged for 10 minutes at 3500 rpm and then 100 μl is added per well to a white Greiner Lumitrac 600 plate previously coated with anti-insulin receptor antibody 83-7 or anti-IGF-IR antibody 24-31. Neither capture antibody interferes with receptor binding by insulin, IGF-I or IGF-II. Approximately 100,000 fluorescent counts of europium-labelled insulin or europium-labelled IGF-I are added to each well along with various amounts of unlabelled competitor and incubated for 16 hours at 4° C. Wells are washed with 20 mM Tris, 150 mM NaCl, 0.05% (v/v) Tween 20 (TBST) and DELFIA enhancement solution (100 μl/well) is added. Time-resolved fluorescence is measured using 340 nm excitation and 612 nm emission filters with a BMG Lab Technologies Polarstar™ Fluorimeter or a Wallac Victor II (EG & G Wallac, Inc.).
Examples of other suitable assays which may be employed to assess the binding and biological activity of compounds to and on IR are well known in the art. For example, suitable assays may be found in PCT International Publication Number WO 03/027246. Examples of suitable assays include the following:
(i) Receptor autophosphorylation (as described by Denley et al., 2004). R−IR-A, R−IR-B cells or P6 cells are plated in a Falcon 96 well flat bottom plate at 2.5×104 cells/well and grown overnight at 37° C., 5% CO2. Cells are washed for 4 hours in serum-free medium before treating with one of either insulin, IGF-I or IGF-II in 100 μl DMEM with 1% BSA for 10 minutes at 37° C., 5% CO2. Lysis buffer containing 2 mM Na3VO4 and 1 mg/ml NaF is added to cells and receptors from lysates are captured on 96 well plates precoated with antibody 83-7 or 24-31 and blocked with 1×TBST/0.5% BSA. After overnight incubation at 4° C., the plates are washed with 1×TBST. Phosphorylated receptor is detected with europium-labelled antiphosphotyrosine antibody PY20 (130 ng/well, room temperature, 2 hours). DELFIA enhancement solution (100 μl/well) is added and time resolved fluorescence detected as described above.
(ii) Glucose uptake using 2-deoxy-[U-14C] glucose (as described by Olefsky, 1978). Adipocytes between days 8-12 post-differentiation in 24-well plates are washed twice in Krebs-Ringer Bicarbonate Buffer (25 mM HEPES, pH 7.4 containing 130 mM NaCl, 5 mM KCl, KH2PO4, 1.3 mM MgSO4.7H2O, 25 mM NaHCO3 and 1.15 mM CaCl2) supplemented with 1% (w/v) RIA-grade BSA and 2 mM sodium pyruvate. Adipocytes are equilibrated for 90 min at 37° C. prior to insulin addition, or for 30 min prior to agonist or antagonist addition. Insulin (Actrapid, Novogen) is added over a concentration range of 0.7 to 70 nM for 30 min at 37° C. Agonist or antagonist (0 to 500 mM) is added to adipocytes for 90 min followed by the addition of insulin in the case of antagonists. Uptake of 50 mM 2-deoxy glucose and 0.5 mCi-2-deoxy-[U-14C] glucose (NEN, PerkinElmer Life Sciences) per well is measured over the final 10 min of agonist stimulation by scintillation counting.
(iii) Glucose transporter GLUT4 translocation using plasma membrane lawns (as described by Robinson & James (1992) and Marsh et al. (1995)).
(iv) GLUT4 translocation using plasma membrane lawns (as described by Marsh et al., 1995). 3T3-L1 fibroblasts are grown on glass coverslips in 6-well plates and differentiated into adipocytes. After 8-12 days post-differentiation, adipocytes are serum-starved for 18 hrs in DMEM containing 0.5% FBS. Cells are washed twice in Krebs-Ringer Bicarbonate Buffer, pH 7.4 and equilibrated for 90 min at 37° C. prior to insulin (100 nM) addition, or for 30 min prior to compound (100 μM) addition. After treatments, adipocytes are washed in 0.5 mg/ml poly-L-lysine in PBS, shocked hypotonically by three washes in 1:3 (v/v) membrane buffer (30 mM HEPES, pH 7.2 containing 70 mM KCl, 5 mM MgCl2, 3 mM EGTA and freshly added 1 mM DTT and 2 mM PMSF) on ice. The washed cells are then sonicated using a probe sonicator (Microson) at setting 0 in 1:1 (v/v) membrane buffer on ice, to generate a lawn of plasma membrane fragments that remain attached to the coverslip. The fragments are fixed in 2% (w/v) paraformaldehyde in membrane buffer for 20 min at 22° C. and the fixative quenched by 100 mM glycine in PBS. The plasma membrane fragments are then blocked in 1% (w/v) Blotto in membrane buffer for 60 min at 22° C. and immunolabelled with an in-house rabbit affinity purified anti-GLUT4 polyclonal antibody (clone R10, generated against a peptide encompassing the C-terminal 19 amino acids of GLUT4) and Alexa 488 goat anti-rabbit secondary antibody (Molecular Probes; 1:200). Coverslips are mounted onto slides using FluoroSave reagent (Calbiochem), and imaged using an OptiScan confocal laser scanning immunofluoroscence microscope (Optiscan, VIC., Australia). Data are analysed using ImageJ (NIH) imaging software. At least six fields are examined within each experiment for each condition, and the confocal microscope gain settings over the period of experiments are maintained to minimise between-experiment variability.
Insulin agonist activity may be determined using an adipocyte assay. Insulin increases uptake of 3H glucose into adipocytes and its conversion into lipid. Incorporation of 3H into a lipid phase is determined by partitioning of lipid phase into a scintillant mixture, which excludes water-soluble 3H products. The effect of compounds on the incorporation of 3H glucose at a sub-maximal insulin dose is determined, and the results expressed as increase relative to full insulin response. The method is adapted from Moody et al., (1974). Mouse epididymal fat pads are dissected out, minced into digestion buffer (Krebs-Ringer 25 mM HEPES, 4% HSA, 1.1 mM glucose, 0.4 mg/ml Collagenase Type 1, pH 7.4), and digested for up to 1.5 hours at 36.5 C. After filtration, washing (Krebs-Ringer HEPES, 1% HSA) and resuspension in assay buffer (Krebs-Ringer HEPES, 1% HSA), free fat cells are pipetted into 96-well Picoplates containing test solution and approximately an ED20 insulin.
The assay is started by addition of 3H glucose (e.g. ex. Amersham TRK 239), in a final concentration of 0.45 mM glucose. The assay is incubated for 2 hours at 36.5° C., in a Labshaker incubation tower, 400 rpm, then terminated by the addition of Permablend/Toluene scintillant (or equivalent), and the plates sealed before standing for at least 1 hour and detection in a Packard Top Counter or equivalent. A full insulin standard curve (8 dose) is run as control on each plate.
Data are presented graphically, as the effect of the compound on an (approximate) ED20 insulin response, with data normalized to a full insulin response. The assay can also be run at basal or maximal insulin concentration.
To test the in vivo activity of a compound, an intravenous blood glucose test may be carried out on Wistar rats as follows. Male Mol:Wistar rats, weighing about 300 g, are divided into two groups. A 10 μl sample of blood is taken from the tail vein for determination of blood glucose concentration. The rats are then anaesthetized (e.g. with Hypnorm/Dormicum) at t=30 min and blood glucose measured again at t=−20 min and at t=0 min. After the t=0 sample is taken, the rats are injected into the tail vein with vehicle or test substance in an isotonic aqueous buffer at a concentration corresponding to a 1 ml/kg volume of injection. Blood glucose is measured at times 10, 20, 30, 40, 60, 80, 120 and 180 min. The anaesthetic administration is repeated at 20 min intervals.
Additional assays to determine the effect of binding molecules on IGF-1R activity are as follows:
(i) Cell Viability Assay on HT29 cells with induction of Apoptosis: The ability of compounds to inhibit IGF-mediated rescue from apoptosis is measured using the colorectal cell line HT29 cells (ATCC: HTB 38) after induction with Na Butyrate. The HT29 cells are plated out onto white Fluoronunc 96 well plates (Nunc) at 12,000 cells/ml and incubated at 37° C., 5% CO2 for 48 hours. Media is aspirated and 100 μl/well of serum free DMEM/F12 is added for 2 hours to serum starve cells. IGF (100 μl/well 0.05-50 nM dilutions) in the presence and the absence of inhibitory compound is added in 0.1% BSA solution (Sigma) in DMEM/F12 (Gibco) in triplicate. A final concentration of 5 mM Butyrate (Sigma) is added to each well. Plates are incubated at 37° C., 5% CO2 for a further 48 hours. Plates are brought to room temperature and developed (as per instructions for CTG Assay (Promega)). Luminescence signal is measured on the Polarstar plate reader and data is evaluated using table curve to obtain the specific ED50.
(ii) Cell Migration Assay: The migration assays are performed in the modified 96-well Boyden chamber (Neuroprobe, Bethesda, Mass.). An 8 μM polycarbonate filter, which is pre-soaked in 25 μg/ml of collagen in 10 mM acetic acid overnight at 4° C., is placed so as to divide the chamber into an upper & lower compartment. Varying concentrations of the IGF-I analogues (25 μl of 0-100 nM) diluted in RPMI (Gibco) with 0.5% BSA (Sigma) tested for their migration including ability, are placed in the lower compartment in quadruplicates. The wells of the upper chamber are seeded with 50 μl/well of 2×105 SW480 (ATCC:CCL 228) pre-incubated for 30 mins/37° C. with 1.1 μl of 2 μM Calcein (Molecular Probes). Cells migrate for 8 hours at 37° C., 5% CO2. Unmigrated cells are removed by wiping the filter. The filter is then analysed in the Polarstar for fluorescence at excitation wavelength of 485 nm and emission wavelength of 520 nm. Data is evaluated using table curve to obtain the specific ED50 value.
(iii) Mouse Xenograft studies for anti-IGF-1R antibodies: In vivo studies are performed in 56-week-old female athymic BALBc nude mice, homozygous for the nunu allele. Mice are maintained in autoclaved micro-isolator cages housed in a positive pressure containment rack (Thoren Caging Systems Inc., Hazelton, Pa., USA. To establish xenografts, mice are injected subcutaneously into the left inguinal mammary line with 3×106 or 5×106 cells in 100 μl of PBS. Tumour volume (TV) is calculated by the formula (length×width2)/2 (Clarke et al., 2000), where length is the longest axis and width the measurement at right angles to length.
Initial biodistribution of potential binding molecules are ascertained by injecting 40 BALBc nude mice with established xenografts with radiolabelled 111In- or 125I-anti-IGFR antibody (3 μg, 10 μCi) intravenously via the tail vein (total volume=0.1 ml). At designated time points after injection of the radioconjugates (t=4 h, days 1, 2, 3, 5 and 7), groups of mice (n=35) are killed by Ethrane anaesthesia. Mice are then exsanguinated by cardiac puncture, and tumours and organs (liver, spleen, kidney, muscle, skin, bone (femur), lungs, heart, stomach, brain, small bowel, tail and colon) are resected immediately. All samples are counted in a dual gamma scintillation counter (Packard Instruments). Triplicate standards prepared from the injected material are counted at each time point with tissue and tumour samples enabling calculations to be corrected for the physical decay of the isotopes. The tissue distribution data are calculated as the mean±s.d. percent injected dose per gram tissue (% ID g−1) for the candidate molecule per time point.
Pharmacokinetics for the candidate compounds are ascertained as follows: Serum obtained from mice bearing xenografts, following infusion of radiolabelled-binding molecule as described above, is aliquoted in duplicate and counted in a gamma scintillation counter (Packard Instruments, Melbourne, Australia). Triplicate standards prepared from the injected material are counted at each time point with serum samples to enable calculations to be corrected for the isotope physical decay. The results of the serum are expressed as % injected dose per litre (% ID 1−1). Pharmacokinetic calculations are performed of serum data using a curve fitting program (WinNonlin, Pharsight Co., Mountain View, Calif., USA). A two-compartment model is used to calculate serum pharmacokinetic parameters of AUC (area under the serum concentration curve extrapolated to infinite time), CL (total serum clearance), T12α and T12β (half-lives of the initial and terminal phases of disposition) for 125I- and 111In-labelled molecule
(iv) Therapeutic in vivo studies: Tumour cells (3×106) in 100 μl of media are inoculated subcutaneously into both flanks of 46-week-old female nude mice (n=5 group−1). Candidate molecule treatment commences day 7 post-tumour cell inoculations (mean±s.e. tumour volume=60×15 mm3) and consists of six intraperitoneal injections over 2 weeks of appropriate amounts of the candidate molecule or vehicle control. Tumour volume in mm3 is determined as described previously. Data is expressed as mean tumour volume for each treatment group. Differences in tumour size between control and test groups are tested for statistical significance (P<0.05) by t-test.
Compounds/chemical entities designed or selected by the methods of the invention described above may be used to modulate IR and/or IGF-1R activity in cells, i.e., activate or inhibit IR and/or IGF-1R activity. Such compounds may interact with the low affinity binding sites of IR as defined herein, mimic the αCT segment of the α-chain of IR, or mimic insulin in complex with the IR as defined herein. They may also be used to modulate homodimerisation of IR and/or IGF-1R.
Modulation of homodimerisation of IR and/or IGF-1R may be achieved by direct binding of the chemical entity to a homodimerisation surface of IR and/or IGF-1R, and/or by an allosteric interaction elsewhere in the IR and/or IGF-1R extracellular domain.
Given that aberrant IR and/or IGF-1R activity is implicated in a range of disorders, the compounds described above may also be used to treat, ameliorate or prevent disorders characterised by abnormal IR and/or IGF-1R signalling. Examples of such disorders include malignant conditions including tumours of the brain, head and neck, prostate, ovary, breast, cervix, lung, pancreas and colon; and melanoma, rhabdomyosarcoma, mesothelioma, squamous carcinomas of the skin and glioblastoma.
The compounds designed to interact or identified as interacting with the extracellular domain of IR and/or IGF-1R, and in particular to interact with the target binding sites, are useful as agonists or antagonists against the action of insulin on IR and/or IGF on IGF-1R. The compounds are useful as assay reagents for identifying other useful ligands by, for example, competition assays, as research tools for further analysis of IR and/or IGF-1R and as potential therapeutics in pharmaceutical compositions.
Compounds provided by this invention are also useful as lead compounds for identifying other more potent or selective compounds. The mimetic compounds of the present invention are also potentially useful as inhibitors of the action of insulin and in the design of assay kits directed at identifying compounds capable of binding to the low affinity binding site for insulin on IR. The mimetic compounds of the present invention are also potentially useful as inhibitors of the action of IGF and in the design of assay kits directed at identifying compounds capable of binding to the low affinity binding site for IGF on IGF-1R. In particular, it is envisaged that compounds of the present invention will prove particularly useful in selecting/designing ligands which are specific for IR or IGF-1R.
In one embodiment, one or more of the compounds can be provided as components in a kit for identifying other ligands (e.g., small, organic molecules) that bind to IR or IGF-IR. Such kits may also comprise IR or IGF-IR, or functional fragments thereof. The compound and receptor components of the kit may be labelled (e.g., by radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes or other labels), or may be unlabeled and labelling reagents may be provided. The kits may also contain peripheral reagents such as buffers, stabilizers, etc. Instructions for use can also be provided.
IR and IGF-1R agonists and antagonists, and in particular antagonists, provided by this invention are potentially useful as therapeutics. For example, compounds are potentially useful as treatments for cancers, including, but not limited to, breast, prostate, colorectal, and ovarian cancers. Human and breast cancers are responsible for over 40,000 deaths per year, as present treatments such as surgery, chemotherapy, radiation therapy, and immunotherapy show limited success. Recent reports have shown that a previously identified IGF-1R antagonist can suppress retinal neovascularization, which causes diabetic retinopathy (Smith et al., 1999). IGF-1R agonist compounds (i.e. existing IGF-1R compounds which have been modified employing methods of the present invention) are useful for development as treatments for neurological disorders, including stroke and diabetic neuropathy. Reports of several different groups implicate IGF-1R in the reduction of global brain ischemia, and support the use of IGF-I for the treatment of diabetic neuropathy (reviewed in Auer et al., 1998; Apfel, 1999). A number of therapeutics directed against IGF-1R are currently undergoing clinical trial as anti-cancer agents (Hewish et al., 2009).
The IGF-1R agonist peptides of the invention may be useful for enhancing the survival of cells and/or blocking apoptosis in cells.
Compounds of the invention, i.e., ligands of the invention or modulators of IR and/or IGF-1R identified or identifiable by the screening methods of the invention, may preferably be combined with various components to produce compositions of the invention. Preferably the compositions are combined with a pharmaceutically acceptable carrier or diluent to produce a pharmaceutical composition (which may be for human or animal use).
The formulation will depend upon the nature of the compound and the route of administration but typically they can be formulated for, topical, parenteral, intramuscular, oral, intravenous, intra-peritoneal, intranasal inhalation, lung inhalation, intradermal or intra-articular administration. The compound may be used in an injectable form. It may therefore be mixed with any vehicle which is pharmaceutically acceptable for an injectable formulation, preferably for a direct injection at the site to be treated, although it may be administered systemically.
The pharmaceutically acceptable carrier or diluent may be, for example, sterile isotonic saline solutions, or other isotonic solutions such as phosphate-buffered saline. The compounds of the present invention may be admixed with any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). It is also preferred to formulate the compound in an orally active form.
In general, a therapeutically effective daily oral or intravenous dose of the compounds of the invention, including compounds of the invention and their salts, is likely to range from 0.01 to 50 mg/kg body weight of the subject to be treated, preferably 0.1 to 20 mg/kg. The compounds of the invention and their salts may also be administered by intravenous infusion, at a dose which is likely to range from 0.001-10 mg/kg/hr.
Tablets or capsules of the compounds may be administered singly or two or more at a time, as appropriate. It is also possible to administer the compounds in sustained release formulations.
Typically, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient. The above dosages are exemplary of the average case. There can, of course, be individual instances where higher or lower dosage ranges are merited, and such are within the scope of this invention.
For some applications, preferably the compositions are administered orally in the form of tablets containing excipients such as starch or lactose, or in capsules or ovules either alone or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing flavouring or colouring agents.
The compositions (as well as the compounds alone) can also be injected parenterally, for example intravenously, intramuscularly or subcutaneously. In this case, the compositions will comprise a suitable carrier or diluent.
For parenteral administration, the compositions are best used in the form of a sterile aqueous solution which may contain other substances, for example enough salts or monosaccharides to make the solution isotonic with blood.
For buccal or sublingual administration the compositions may be administered in the form of tablets or lozenges which can be formulated in a conventional manner.
For oral, parenteral, buccal and sublingual administration to subjects (such as patients), the daily dosage level of the compounds of the present invention and their pharmaceutically acceptable salts and solvates may typically be from 10 to 500 mg (in single or divided doses). Thus, and by way of example, tablets or capsules may contain from 5 to 100 mg of active compound for administration singly, or two or more at a time, as appropriate. As indicated above, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient.
The routes of administration and dosages described are intended only as a guide since a skilled practitioner will be able to determine readily the optimum route of administration and dosage for any particular patient depending on, for example, the age, weight and condition of the patient.
A 1741-base fragment of IR was synthesized (DNA2.0) and inserted into the HindIII/XbaI sites of the mammalian expression plasmid vector pEE14 (Bebbington & Hentschel, 1987). The fragment encoded a protein cIR485 (“cleavable” IR485; SEQ ID NO: 15) which consists, in order, of (i) the 27-residue IR native signal sequence, (ii) residues 1-310 of the IR α-chain (the L1-CR module), (iii) a thrombin cleavage site SSSLVPRGSSS, (iv) residues 311-485 of the IR α-chain (the L2 domain), (v) an enterokinase cleavage site DDDDK and (vi) a c-myc purification tag EQKLISEEDLN (Hoogenboom et al., 1991). A 106-base non-coding fragment GTCGACGGTACCCCGGGGAATTAATTCCGGGGGCCGCCTCGGAGCATGACCCCC GCGGGCCAGCGCCGCGCGCTCTGATCCGAGGAGACCCCGCGCTCCCGCAGCC was included between the Hind III site and the start codon; bases 29-106 of this fragment corresponding to those immediately upstream of the human IR coding region.
The expression plasmid pEE14/cIR485 was transfected into Lec8 mutant Chinese hamster ovary (CHO) cells (CRL-1737; ATCC) (Stanley, 1989) using LipofectAMINE 2000 (Life Technologies). The Lec8 cells were maintained in Dulbecco's Modified Eagle Medium: Nutrient Mixture F-12 (DMEM/F12; Life Technologies) containing 10% fetal bovine serum (FBS; Life Technologies). Transfected Lec8 cells were maintained in DMEM (High Glucose, Pyruvate, no Glutamine) plus 1× glutamine synthetase supplements (Sigma-Aldrich) and 10% dialyzed FBS (dFBS) containing 25 mM methionine sulphoximine (MSX, Sigma-Aldrich).
Transfectants from 96-well plates were screened for protein expression by Western blot with monoclonal antibodies 83-7 (Soos et al., 1986) and 9E10 (Evan et al., 1985). A high-yielding cell line was selected by screening a high number of transfectants.
The Lec8 host cell line containing the cIR485 construct was sub-cultured in tissue culture flasks initially and then in roller bottles (Greiner) for two weeks to prepare inocula for the spinner flasks. DMEM/F12 media supplemented with 10% dFBS and 25 μM MSX to maintain selection pressure was used during passaging and production work.
Cells from ten fully-confluent roller bottles were transferred to four spinner flasks (New Brunswick Scientific) containing Fibra-Cel disks (New Brunswick Scientific) to provide a matrix for cell attachment. Approximately 35×107 viable cells/spinner flask were used to initiate production. The spinners were incubated in 5% CO2 at 37° C. with 100 rpm stirring and ˜50 ml/min airflow. Spinner flasks were supplemented with 0.1% FoamAway (Life Technologies). Glucose and lactic acid levels were monitored daily, with spent media replaced to maintain residual glucose levels of ˜1.6 g/L.
The production phase lasted three weeks and ˜50 L of harvest collected, with the media being supplemented with 0.8 mM butyric acid (Sigma-Aldrich) during the last week.
Daily harvests from the spinners were kept at 4° C. after addition of 0.02% NaN3 and 0.1 mM phenylmethanesulfonylfluoride (PMSF; Sigma-Aldrich). Pooled harvest batches were filtered with a 1.2/0.2μ filter to remover cell debris and concentrated approximately 10× using a 30 kDa cut-off membrane. All of the large-scale mammalian cell culture was performed under contract by CSIRO (Parkville, Australia).
Initial purification of cIR485 was by 9E10 antibody affinity chromatography and size-exclusion chromatography (SEC) using procedures effectively identical to those described for purification of IR485 (Lou et al., 2006).
Purified cIR485 in Tris-buffered saline (TBS; 24.8 mM Tris-HCl pH 8.0, 137 mM NaCl, 2.7 mM KCl) plus 0.02% sodium azide (TBSA) was then incubated overnight at 37° C. with 0.25 units human thrombin (Roche) per mg of cIR485 in the presence of 10 mM CaCl2.
Completion of proteolysis was assessed by SDS-PAGE, which revealed bands corresponding to the estimated molecular weight of the IR310.T fragment and the IR L2 domain (51 kDa and 30 kDa, respectively;
Western blotting with Mab 83-7 (Soos et al., 1986) confirmed that the CR domain was contained in the upper band alone. The sample was diluted 8-fold in buffer A (10 mM ethanolamine-HCl, pH 9.6+0.02% sodium azide), centrifuged for 5 min at 17,000 g to remove particulates and then loaded onto MonoQ 5/50 GL column (GE Healthcare). The sample was eluted with a 60 column-volume gradient of buffer A to buffer B (10 mM ethanolamine-HCl, 400 mM NaCl, pH 9.6+0.02% sodium azide;
Fractions containing IR310.T (SEQ ID NO: 8) were pooled, concentrated and rerun in TBSA on a Superdex 200 10/300 column (GE Healthcare). The chromatogram exhibited three overlapping peaks, likely arising from multimerization (
SDS-PAGE of fractions revealed the presence of three closely-spaced bands, which we, attributed to varying glycosylation (
A hybridoma cell line expressing Mab 83-7 (a gift from Prof. Ken Siddle, University of Cambridge, UK; SEQ ID NOs: 11 and 12) was then grown in large scale using Gibco Hybridoma Serum-free medium (H-SFM). Cells lines originally required 7-10% serum for optimal growth and 30-60 mg/l final yield.
A brief adaption experiment to lower serum levels was unsuccessful, at lower than 5% serum levels both viability and yield dropped.
The cells were then grown in 950 cm2 roller bottles at 37° C., 5% CO2 balanced with air and 10 rpm using approximately 300 ml H-SFM media supplemented with 5% fetal calf serum. Typically, 2-3 litres of cell culture at the viable cell density in the range of 2-3*106 cells/ml was achieved. The cell culture was then pelleted (5 mins, 350 g) and re-suspended in serum-free medium. When cell viability dropped below 30% (after 5-7 days), the culture was harvested and the Mab 83-7 captured using a Protein-A column (Millipore). Under these conditions final yield was between 5-12 mg/l of cell culture medium.
Production and purification of Fab 83-7 (SEQ ID NOs: 13 and 14) from Mab 83-7 was based on protocols described previously (McKern et al., 2006). Briefly, Mab 83-7 was digested with dithiothreitol-activated papain (Sigma-Aldrich) at 37° C. The digestion was stopped by adding iodoacetamide (IAA; Sigma-Aldrich), and the reaction mixture passed down a Superdex 200 26/60 column (GE Healthcare). Fractions containing 83-7 F(ab′)2 were isolated and reduced with mercaptoethylamine (Sigma-Aldrich) and then alkylated with IAA, followed by further SEC and anion-exchange chromatography on Mono S (GE Healthcare).
IR310.T was mixed with a slight molar excess of Fab 83-7 and the complex purified by SEC using a S200 10/300 column (GE Healthcare) in TBSA buffer. Fractions containing the complex of IR310.T/Fab 83-7 were then pooled, concentrated and exchanged into 10 mM HEPES-NaOH (pH 7.5)+0.02% NaN3.
Samples of the IR310.T/Fab 83-7 complex were prepared for crystallization by combining with a 1.5× molar ratio of αCT704-719 (Genscript; SEQ ID NO: 9) and 3× molar ratio of human insulin (Sigma-Aldrich) to a final concentration of 3.5 mg/ml in 10 mM HEPES, pH 7.0. [D-ProB26]-DTI-NH2 was prepared as described previously (Zakova et al., 2008).
A single crystallization condition (1.0 M trisodium citrate, 0.1 M imidazole HCl, pH 8.0+0.02% NaN3) was detected for the IR310.T/Fab83-7/[D-ProB26]-DTI-NH2/αCT704-719 complex in a large (792-condition) sparse matrix sitting-drop screen set up in SBS-format sitting-drop plates, performed under contract by the CSIRO Collaborative Crystallization Centre (Parkville, Australia). This condition was refined manually for both [D-ProB26]-DTI-NH2 and human insulin complexes within a range of 0.9-1.1 M trisodium citrate, 0.1 M imidazole HCl, pH 8.0+0.02% NaN3 in hanging-drop plates. Crystals grew to a maximum size of 200-250 μm.
Single crystals of the human insulin IR310.T quaternary complex were mounted in cryo-loops in paraffin oil HR403 (Hampton Research) and flash frozen in liquid nitrogen. Data Set 1 for IR310.T human insulin quaternary complex was collected (McPhillips et al., 2002) at the MX2 beamline at the Australian Synchrotron (Melbourne, Australia) using a Quantum 315 detector (ADSC). Data Set 2 was collected in similar fashion from a further crystal at beamline 124 at, the Diamond Light Source (Oxford, UK) using a Pilatus 6M detector (Dectris).
Crystals were transferred to reservoir solutions supplemented with 20% glycerol, flash frozen in liquid nitrogen and mounted directly on the MX2 beamline at the Australian Synchrotron. A number of crystals of 50-200 μm size were obtained, but only one diffracted to 4.4 Å, with the remainder diffracting to ˜6.0 Å Diffraction data were collected (McPhillips et al., 2002) from multiple volumes within the crystal in an endeavour to reduce the effects of radiation damage; however, only 88% completeness was achieved.
All diffraction data were processed using the XDS suite (Kabsch, 2010) and TRUNCATE within the CCP4 suite. Precision-indicating merging R factors (Rpim) were computed using SCALA within the CCP4 suite. Statistics for all data sets are provided in Table 2.
1Numbers in parentheses refer to the statistic in the highest resolution shell.
2Rmerge = ΣhklΣi|Ii(hkl) − <I(hkl)>|/ΣhklΣiIi(hkl)
3CC(1/2) is the Pearson correlation coefficient between independently merged halves of the data set. Values for the highest resolution shell are significant at p = 0.001 and justify the use of data to the stated resolution (Karplus, 2012).
4Rwork and Rfree are computed using R = <|Fhxpct − Fhobs|>/<|Fhobs|> where Fhxpct is the expectation value of the model structure amplitude; Blanc et al., 2004).
Molecular replacement employed Data Set 1 (Table 2). A single copy of an L1-CR search fragment (residues 4-310, from PDB entry 3LOH) was located within the asymmetric unit using PHASER (TFZ=26.7) (McCoy, 2007). The variable module of Fab 83-7 (Fv, from PDB entry 3LOH) was then located in the presence of the L1-CR module (TFZ=37.6). Attempts to locate the Fab 83-7 constant module failed and it proved disordered. TLS parameters and individual isotropic B-factors for the L1-CR/Fv complex were then refined using autoBUSTER (Bricogne et al., 2011), followed by a subsequent atomic coordinate-only refinement, yielding Rxpctwork/Rxpctfree=0.368/0.363, where “xpct” denotes the statistic's expectation value (Blanc et al., 2004). The resultant difference density maps revealed tubular helix-like features (the 1st, 3rd, 4th, 5th and 7th highest peaks within the map) in the immediate vicinity of the L1-β2 surface, into which PHASER unambiguously positioned a rigid model comprising the three insulin helices (TFZ=11.4). The correctness of this solution was confirmed by an exhaustive 6-dimensional real-space search using ESSENS (Kleywegt and Jones, 1997) to locate the core fragment within the Fo-Fc map. The ESSENS search, refined on a (1°, 0.33 Å) grid, yielded a single solution, with a Z-score of 12.6 and an rmsd of 0.7 Å from that obtained by PHASER. The 2nd highest difference electron density feature corresponded to glycan attached to Asn111 (
The remaining helix-like feature in the above difference map (encompassing the 1st and 5th highest peaks) revealed clear side chain protrusions upon B-factor map sharpening, spaced consistent with an underlying α-helix (
Register assignment of the αCT704-719 sequence to the decameric polyalanine helix was achieved using a procedure that assessed both compatibility of individual residue side chains with difference electron density and compatibility of individual residue side chains with their surrounding protein environment. (a) Fit to difference density employed the following method that was designed not only to assess the fit of atoms within the density, but also to penalize the existence of volumes of positive difference density into which no atoms had been placed. Briefly, the map feature assigned to the αCT segment was excised from the B-factor sharpened map using CHIMERA (Pettersen et al., 2004) (cut-off level=0.16 e−/A3) and placed within a rectangular grid volume large enough to allow a >8 Å buffer zone around the αCT feature. All grid points outside of the feature were set to 0 e/Å3, with the resultant map being termed Mobs. Coordinates for residues 1, 2 and 10 of the fitted polyalanine helix were then deleted from the model, as the density associated with them displayed somewhat poorer α-helical geometry. Ten comparisons corresponding to all possible alignments of heptameric sequences from αCT704-719 with the heptameric polyalanine structure were considered. Rotamers for each residue within each of these ten models were then selected manually using COOT (Emsley and Cowtan, 2004) based on visual inspection of the fit of the trial rotamer to the density at the corresponding site in Mobs. The ten individual models were then “real-space-refined” within COOT to achieve optimal fit to Mobs, maintaining tight helical restraints. An electron density map Mcalc, on the same grid as Mobs was then generated (using SFALL within CCP4) for each heptamer model in isolation, with the B-factors of all main-chain atoms being set to 10 Å2 and of all side-chain atoms set to 20 Å2 to allow for subsequent comparison with a sharpened map. All density within Mcalc lower than 0.45 e−/A3 was set to 0 e−/A3. Correlation coefficients CC=(<xy>−<x><y>)/√[(<x2>−<x>2) (<y2>−<y>2)] between Mobs and each Mcalc were then calculated using MAPMAN (Jones and Thirup, 1986). Each CC was then “normalized” by dividing its value by the CC value calculated for the underlying polyalanine heptamer, and the quotient termed the trial sequence's “density score”. Trial register 705-711 was seen to have the highest density score (Table 3). (b) Compatibility of residue side-chain environment within the putative L1/αCT/insulin interface was assessed as follows. For each of the ten trial heptamer (real-space refined) models described above, an environment score was generated using VERIFY3D (Luthy et al., 1992) in order to assess compatibility with the surrounding L1-β2 and insulin surfaces. Trial register 705-711 again scored highest (Table 3). A “combined score” was then computed as the product (density score)×(environment score) in order to assist with assessing lower ranked trial registers (Table 3). The next highest combined score was for trial register 709-715, which is related trial register 705-711 by a one-turn translation along the helix, effectively maintaining hydrophobic-to-hydrophobic docking with the L1 surface. However, register 709-715 was judged to be most unlikely, as it would bring Pro716 and Pro718 into the remaining C-terminal part of the overall helix. We thus concluded that register assignment 705-711 was correct, given that it was the highest ranking on all criteria. The assignment was subsequently confirmed by the structure determination of the insulin-complexed IR593.αCT construct, which has the segment directly and covalently attached to the C-terminus of FnIII-1.
705-711
1.15
4
18.5
21.3
1Density score = (Correlation of the computed electron density for the trial IR αCT heptamer with the difference electron density feature shown in FIG. 6B)/(Correlation of the computed electron density for the initial polyalanine heptamer with the difference electron density feature shown in FIG. 6B).
2Environment fit score computed using VERIFY3D (Luthy et al., 1992).
3Combined score = (density fit score) × (environment fit score).
4Highest scores within each column are underlined.
The 705-711 peptide model was then included in the insulin/IR310.T/Fab83-7 atomic model and autoBUSTER refinement continued against Data Set 2, which became available in the interim (Table 2, assigned identical free-R set to Data Set 1). Residues 712-715 were built as a further α-helical turn beyond residue 711 and a total of nine sugars added at sites 16, 25, 111, 225 and 255 (Lou et al., 2006; Sparrow et al., 2008), followed by iterative rounds of refinement and model rebuilding (COOT). LSSR restraints to higher-resolution structures (Smart et al., 2012) were included for IR310.T (restrained to PDB entry 2HR7-A) and the Fab83-7 light chain (to PDB entry 11L1-B) and heavy chain (to PDB entry 1FNS-C). Final refinement statistics are detailed in Table 2. Inclusion of data to 3.9 Å was justified a posteriori—the average figure of merit for free set reflections remained above 0.70 at this limit.
Data sets 1 and 2 for the human insulin complexed IR310.T/83-7/αCT704-719 crystal were then re-processed to extended resolution of 3.5 Å, based on a recent report that additional information could be obtained by inclusion of weak data at the very limit of diffraction. The integrated reflection data from the two sets were then scaled and merged together, the combined data set having CC(1/2)=0.998, significant at the p=0.001 level of confidence.
The prior model of human insulin complexed IR310.T/83-7/αCT704-719 refined at 3.9 Å resolution against Data set 2 was then re-refined against this extended resolution data set using autoBUSTER.
Examination of the resultant difference density maps showed that insulin residue A21 was misplaced in the earlier refinement. This residue was then removed from model and the model re-refined. The resultant difference maps revealed that residues B21-B24 could be built in a near native-like conformation, with the side chain of residue B24 directed into a hydrophobic pocket formed by the side chains of residues Phe714, Phe39, Leu37 and LeuB15. We then observed that residues PheB25 and TyrB26 could be built into difference electron density, in a non-native like fashion with the side chain of residue TyrB25 directed towards the αCT peptide in the vicinity of residue Val715 and the side chain of residue TyrB26 directed towards L1 domain residues Asn12 and Arg19.
Val715 was then removed from the model to avoid steric clash with TyrB25. We noted that Val715 is replaced by glutamate in some insulin receptors, incompatible with it being directed towards the L1 surface in the fashion prescribed by the lower resolution model and hence it like A21 had been built incorrectly in the lower resolution model.
A number of rounds of model building then followed using autoBUSTER, with the loop B20-B23 restrained to a native-like conformation. The resultant model showed a good fit to the electron density and a reduced R/Rfree with respect to the starting structure.
In order to assess insulin binding to IR310.T in the presence of Fab 83.7 isothermal titration calorimetry (“ITC”) was used. ITC allowed a direct assay of the interactions between the construct IR310.T (described above) complexed with Fab 83.7 and zinc free human insulin and between the construct IR310.T (not complexed with Fab 83.7) and zinc free human insulin. All interactions were assayed in presence of a 10× molar ratio of αCT704-719.
Protocols and instrumentation for ITC determination of the thermodynamic parameters of binding of (a) zinc-free porcine insulin to a pre-formed complex of IR310.T and αCT704-719, (b) zinc-free porcine insulin to a pre-formed complex of IR310.T/Fab83-7 and αCT704-719, (c) alanine-substituted αCT peptide to the L1-CR-L2 construct IR485 and (d) zinc-free porcine insulin to a pre-formed complex of alanine-substituted αCT704-719 and IR485 were identical to those described previously (Menting et al., 2009).
Briefly, all experiments were conducted using a VP-ITC isothermal titration calorimeter (MicroCal) with the calorimeter cell held at 25° C. All samples were degassed prior to injection or placement into the cell, and the instrument was temperature-equilibrated prior to the start of the injections. In all experiments, the volume of the sample placed in the cell was 1.4 ml and the titrant was injected in 7 μl volumes over 14 s at 3 min intervals, with the total number of injections being 40. The sample contents were stirred at a speed of 310 rpm over the duration of the titration. For peptide titrations, the titrant was prepared at 80 μM in TBSA, with the sample cell holding IR485 at 10 μM concentration. For insulin and insulin analogue titrations, the titrant was prepared at 32 to 48 μM concentration in TBSA with the sample cell containing IR485 at 4 to 6 μM concentration in TBSA and pre-incubated with a 10× molar ratio of the respective Ala-substituted αCT peptide. Data were analysed using the instrument's software and in all cases fitted as a single-site interaction. All measurements were performed in triplicate.
ITC analysis showed that zinc-free human insulin bound IR310.T with Kd=30 nM in the presence of a 10× molar ratio of αCT704-719 (
The disclosure of all publications referred to in this application are incorporated herein by reference.
In compliance with the statute, the invention has been described in language more or less specific to structural or methodical features. It is to be understood that the invention is not limited to specific features shown or described since the means herein described comprises preferred forms of putting the invention into effect. The invention is, therefore, claimed in any of its forms or modifications within the proper scope of the appended claims (if any) appropriately interpreted by those skilled in the art.
McCoy, A. J. (2007). Solving structures of protein complexes by molecular replacement with Phaser. Acta Crystallogr D Biol Crystallogr 63, 32-41.
Schäffer et al. (2003) Proc. Natl. Acad. Sci. USA, 100, 4435-4439.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/AU2013/001068 | 9/18/2013 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61705199 | Sep 2012 | US |