Cytokine receptor

Abstract
A crystalline composition comprising a crystal of the IL-6 receptor I chain is provided. Also provided are methods of using the crystal and related structural information to screen for and design compounds that interact with IL-6R, or variants thereof. Also provided arc methods of modulating an IL-6 receptor comprising contacting the IL-6 receptor with a compound identified by the screening method of the invention.
Description
FIELD OF THE INVENTION

The present invention relates generally to structural studies of the interleukin-6 (IL-6) receptor. More particularly, the present invention relates to the crystal structure of the IL-6 receptor α chain (IL-6R). Even more particularly, the instant invention relates to the crystal structure of an extracellular portion of IL-6R and to methods of using the crystal and related structural information to screen for and design compounds that interact with IL-6R, or variants of thereof.


BACKGROUND OF THE INVENTION

Interleukin-6 (IL-6) is a multifunctional cytokine that plays a central role in host defense due to its wide range of immune and haematopoietic activities and its potent ability to induce acute phase response (for review see Simpson et al. 1997, Protein Sci 6, 929-55). It appears to represent an important frontline component of the body's armory against infection or tissue damage (IL-6 knockout mice have impaired immune and acute phase responses. IL-6 was originally referred to by such diverse names as interferon-β2, 26K factor, B-cell stimulating factor 2, hybridoma growth factor, plasmacytoma growth factor, hepatocyte-stimulatory factor, a haematopoietic factor and cytotoxic T-cell differentiation factor—each name reflecting a different biological activity controlled by the same protein. Over the last ten years it has become clear that the functional pleiotropy of Interleukin-6 has implicated this cytokine in the pathology of many human diseases such as multiple myeloma, rheumatoid arthritis, Castleman's disease, AIDS, mesangial proliferative glomerulonephritis, Kaposi's sarcoma, sepsis, osteoporosis and psoriasis. Given the association of abnormal IL-6 production and clinical disorders there is great interest in the development of functional agonists and antagonists as potential therapeutic agents in the treatment of IL-6 associated diseases.


The biological activities of IL-6 are mediated by the IL-6 receptor complex, which consists of two membrane proteins, the ligand binding α-chain receptor (IL-6R, gp80) and the signal transducing β-chain, gp130, which also forms part of the receptor complexes of LIF, OSM, CNTF, IL-11 as well as CT-1. Co-expression of IL-6R and gp130 results in both low- and high-affinity binding sites for IL-6, the relative amounts of the two chains dictating the ratio between the two affinity states. High affinity binding is likely due to the ability of IL-6 to interact simultaneously with sites on the IL6-R and gp130. Typically the difference in affinity between low- and high-affinity binding is ˜100-fold; e.g., on human myeloma U266 cells, IL-6 first binds IL-6R with an affinity of ˜1 nM, and the IL-6/IL-6R complex then binds gp130 with a resulting affinity of ˜10 pM. Binding of IL-6 has been demonstrated on a variety of human cells. Although human IL-6R shows broad distribution (e.g., activated B-cells, resting T-cells, B lymphoblastoid cell types, hepatoma lines, myeloma and monocyte cell lines), some cells lack this type of receptor. Normal cells express between 102 and 103 receptors, while human myeloma U266 and EB virus-transformed CESS cells have up to 104 receptors.


The cDNA of the human IL-6R encodes a protein of 468 amino acids, including a signal peptide of 19 amino acids, an extracellular region of 339 amino acids, a transmembrane domain of 28 amino acids, and a short cytoplasmic domain of 82 amino acids (Yamasaki et al., 1988, Science 241: 825-828). There are six potential N-linked glycosylation sites on the extracellular domain, the mature 80 kDa IL-6R is a glycosylated form of the predicted 50 kDa precursor. Secondary structural predictions indicate that the extracellular region is highly modular, consisting of three domains (D1, D2 and D3) of approximately 100 residues. These domains consist of an N-terminal domain D1 characteristic of the immunoglobulin (Ig) superfamily and a cytokine binding domain (CBD), which consists of two fibronectin type III-like (FN III) domains (a subclass of the Ig-fold). The CBD of IL-6R is characteristic of the class I cytokine receptors. In common with other members of this receptor family, the N-terminal FIII domain (D2) of the CBD has four conserved cysteines, while the C-terminus FIII domain (D3) has a conserved sequence motif, the “WSXWS” motif. The Ig domain D1 has been deleted without major effect on the binding of IL-6 or signal transduction, suggesting that the CBD mediates binding to IL-6 and gp130. The transmembrane and cytoplasmic domains of the IL-6R are not necessary for signal transduction, as shown by the fact that the complex of IL-6 and extracellular “soluble” domain of IL-6R induces signal transduction on cells expressing gp130.


Notwithstanding the known biology of the IL-6R complex, the design of IL-6R complex agonists or antagonists is impeded greatly by the lack of three-dimensional structural information available for this complex. Accordingly, knowledge of the three-dimensional structure coordinates of the IL-6R complex would be useful in facilitating the design of potential selective agonists/antagonists which, in turn, are expected to have therapeutic utility.


SUMMARY OF THE INVENTION

The present inventors have now obtained three-dimensional structural information concerning IL-6 receptor. The information presented in the present application can be used to develop compounds which interact with the IL-6 receptor for use in therapeutic applications.


Accordingly, in a first aspect the present invention provides a method of selecting or designing a compound that interacts with the IL-6 receptor and modulates an activity mediated by the receptor, the method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of the receptor, wherein the receptor is characterised by


(i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or


(ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations.


In a preferred embodiment of the first aspect, the structural coordinates have a root mean square deviation from the backbone atoms of said amino acids of not more than 1.0 Å and more preferably not more than 0.7 Å.


By “stereochemical complementarity” we mean that the compound or a portion thereof makes a sufficient number of energetically favourable contacts with the receptor, or topographical region thereof, as to have a net reduction of free energy on binding to the receptor, or topographical region thereof.


In a second aspect the present invention provides computer-assisted method for identifying potential compounds able to interact with the IL-6 receptor and thereby modulate an activity mediated by the receptor, using a programmed computer comprising a processor, an input device, and an output device, comprising the steps of:

    • (a) inputting into the programmed computer, through the input device, data comprising the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;
    • (b) generating, using computer methods, a set of atomic coordinates of a structure that possesses stereochemical complementarity to the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations, thereby generating a criteria data set;
    • (c) comparing, using the processor, the criteria data set to a computer database of chemical structures;
    • (d) selecting from the database, using computer methods, chemical structures which are similar to a portion of said criteria data set; and
    • (e) outputting, to the output device, the selected chemical structures which are complementary to or similar to a portion of the criteria data set.


In a third aspect the present invention provides a computer for producing a three-dimensional representation of a molecule or molecular complex, wherein the computer comprises:

    • (a) a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the machine readable data comprise the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;
    • (b) a working memory for storing instructions for processing the machine-readable data;
    • (c) a central-processing unit coupled to the working memory and to the machine-readable data storage medium, for processing the machine-readable data into the three dimensional representation; and
    • (d) an output hardware coupled to the central processing unit, for receiving the three-dimensional representation.


In a fourth aspect the present invention provides a compound able to bind to the IL-6 receptor and to modulate an activity mediated by the receptor, the compound being obtained by a method according to the present invention.


In a fifth aspect the present invention provides a compound which possesses stereochemical complementarity to a topographic region of the IL-6 receptor and which modulates an activity mediated by the receptor, wherein the receptor is characterised by

    • (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;


with the proviso that the compound is not a naturally occurring ligand of a molecule of the IL-6 receptor or a mutant thereof.


By “mutant” we mean a ligand which has been modified by one or more point mutations, insertions of amino acids or deletions of amino acids.


In a sixth aspect, the present invention provides a pharmaceutical composition comprising a compound according to the present invention together with a pharmaceutically acceptable carrier or diluent. The present invention also provides the use of a compound or pharmaceutical composition of the invention in a method of preventing or treating a disease associated with signalling by the IL-6 receptor.


In a related aspect the present invention provides a method of preventing or treating a disease associated with signalling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound according to the present invention.


In yet another aspect, the present invention provides a method for evaluating the ability of a chemical entity to interact with the IL-6 receptor, said method comprising the steps of:

    • (a) creating a computer model of at least one region of the IL-6 receptor using structure coordinates wherein the root mean square deviation between said structure coordinates and the structure coordinates of amino acids 1-299 of IL-6 receptor as set forth in Appendix I is not more than about 1.5 Å;
    • (b) employing computational means to perform a fitting operation between the chemical entity and said computer model of the binding surface; and
    • (c) analysing the results of said fitting operation to quantify the association between the chemical entity and the binding surface model.


In a further aspect the present invention provides a method of selecting or designing a compound that interferes with the formation of the IL-6, IL-6R, gp130 hexameric complex, the method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of the complex, wherein the complex is characterised by


(i) the amino acids of IL-6, IL-6 receptor and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or


(ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations.


In another aspect the present invention provides computer-assisted method for identifying compounds that interfere with the formation of the IL-6, IL-6R, gp130 hexameric complex, using a programmed computer comprising a processor, an input device, and an output device, comprising the steps of:

    • (a) inputting into the programmed computer, through the input device, data comprising the atomic coordinates of amino acids IL-6, IL-6R and gp130 as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations;
    • (b) generating, using computer methods, a set of atomic coordinates of a structure that possesses stereochemical complementarity to the atomic coordinates the IL-6, IL-6R, gp130 hexameric complex as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations, thereby generating a criteria data set;
    • (c) comparing, using the processor, the criteria data set to a computer database of chemical structures;
    • (d) selecting from the database, using computer methods, chemical structures which are similar to a portion of said criteria data set; and
    • (e) outputting, to the output device, the selected chemical structures which are complementary to or similar to a portion of the criteria data set.


In another aspect the present invention provides a computer for producing a three-dimensional representation of a molecule or molecular complex, wherein the computer comprises:

    • (a) a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the machine readable data comprise the atomic coordinates of the IL-6, IL-6R, gp130 hexameric complex as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of the coordinates shown in Appendix II by whole body translations and/or rotations;
    • (b) a working memory for storing instructions for processing the machine-readable data;
    • (c) a central-processing unit coupled to the working memory and to the machine-readable data storage medium, for processing the machine-readable data into the three dimensional representation; and
    • (d) an output hardware coupled to the central processing unit, for receiving the three-dimensional representation.


In another aspect the present invention provides a compound which possesses stereochemical complementarity to a topographic region of the IL-6, IL-6R, gp130 hexameric complex and which modulates an activity mediated by the complex, wherein the complex is characterised by

    • (i) the amino acids of IL-6, IL-6R and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations.


In yet another aspect, the present invention provides a method for evaluating the ability of a chemical entity to interact with the IL-6, IL-6R, gp130 hexameric complex, said method comprising the steps of:

    • (a) creating a computer model of at least one region of the IL-6, IL-6R, gp130 hexameric complex using structure coordinates wherein the root mean square deviation between said structure coordinates and the structure coordinates set forth in Appendix I is not more than about 1.5 Å;
    • (b) employing computational means to perform a fitting operation between the chemical entity and said computer model; and
    • (c) analysing the results of said fitting operation to quantify the association between the chemical entity and the model.


In a still further aspect the present invention consists in a crystalline composition comprising an IL-6 receptor or portion thereof or variant of these.


As will be readily understood by persons skilled in this field the methods of the present invention provide a rational method for designing and selecting compounds which interact with the IL-6 receptor. In many cases these compounds can be developed further to increase activity. Such further development is routine in this field and will be assisted by the structural information provided in this application. It is intended that in particular embodiments the methods of the present invention includes such further developmental steps.


In another aspect the present invention consists in a method of assessing the interaction between a compound and the IL-6 receptor, the method comprising exposing a crystalline composition comprising IL-6 receptor or portion thereof or variant of these to the compound and measuring the leveling of binding of the compound to the crystal.


In yet a further aspect, the invention provides a method of using molecular replacement to obtain structural information about a molecule or a molecular complex of unknown structure, comprising the steps of:

    • (i) crystallising said molecule or molecular complex;
    • (ii) generating an X-ray diffraction pattern from said crystallized molecule or molecular complex;
    • (iii) applying at least a portion of the structure coordinates set forth in Appendix I to the X-ray diffraction pattern to generate a three-dimensional electron density map of at least a portion of the molecule or molecular complex whose structure is unknown.


The term “molecular replacement” refers to a method that involves generating a preliminary model of an IL-6 extracellular domain crystal whose structure coordinates are unknown, by orienting and positioning a molecule whose structure coordinates are known (e.g., IL-6 receptor extracellular coordinates from Appendix I) within the unit cell of the unknown crystal so as best to account for the observed diffraction pattern of the unknown crystal. Phases can then be calculated from this model and combined with the observed amplitudes to give an approximate Fourier synthesis of the structure whose coordinates are unknown. This, in turn, can be subject to any of the several forms of refinement to provide a final, accurate structure of the unknown crystal (Lattman, 1985, Methods in Enzymology 115: 55-77; M. G. Rossmann, ed., “The Molecular Replacement Method”, Int. Sci. Rev. Ser., No. 13, Gordon & Breach, New York, 1972).


In yet a further aspect the present invention provides a method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound identified by a method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of the receptor, wherein the receptor comprises:

    • (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations.


In yet a further aspect the present invention provides a method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound identified by a method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of an IL-6, IL-6R, gp130 hexameric complex, wherein the hexameric complex comprises:

    • (i) the amino acids of IL-6, IL-6 receptor and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations.


Throughout this specification, the word “comprise”, or variations such as “comprises” or “comprising”, will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, but not the exclusion of any other element, integer or step, or group of elements, integers or steps.


DETAILED DESCRIPTION

The present inventors have determined the three-dimensional structure of the extracellular region of the IL-6R. The X-ray structure of IL-6R reveals a structure consisting of the N-terminal Ig domain (D1) linked to a classical CBD (D2, D3). The first 5 residues of the N-terminus are poorly defined and amino acids past residue 299 are not visible in the X-ray structure. The D2 and D3 are connected at about 90° to each other and D1 is connected at about 45° to D2. The three domains lie on a similar plane making the receptor a long ‘flat’ structure. Carbohydrate is observed at 4 sites (five hexoses on the Asn226 site), all only on one of the ‘flat’ faces of the molecule, indicating that this face would have very restricted involvement in the signalling complex. This is consistent with evidence of expression of functional sIL-6R in Escherichia coli. The N-terminus Cys6 is disulphide bonded to Cys174 in D2, and this anchoring results in an unusual β-sheet arrangement of D1. D1 has an s-type Ig topology (0.7 Å rmsd with the ligand binding domain of fibroblast growth factor (Stauber et al., 2000, PNAS 97: 49-54), but the first ‘a’ strand of the 3-stranded β-sheet has moved to form a fifth ‘h’ strand of the 4-stranded β-sheet. This could be a result of crystal packing and represent a degree of flexibility of this domain, acting as a conformational switch, allowing D1 to rotate about 20° by β-strand shifting. This could also explain the higher disorder in this domain resulting in lack of isomorphism between different crystals of IL-6R. As in other cytokine FnIII domains, the domains are separated by a double proline motif (Pro94-Pro95, Pro199-Pro200 and Pro302-Pro303 at the end of D1, D2 and D3 respectively).


The other characteristic of the CBD of IL-6R is the long tryptophane-arginine ladder (from the C-terminus end of D3 is Arg239, Phe246, Arg237, Trp287, Arg274, Trp284, Gln276) found in other class 1 CBD structures (Carr et al., 2001). This ladder incorporates the conserved WSXWS motif (Residues 284 to 287 in IL-6R) located in the COOH-terminal of CBDs. The polypeptide backbone of this motif has an unusual left-handed 310 helix similar to a poly-proline helix, stabilised by the large tryptophane and arginine stacking and not by main chain H-bonds. This results in a long surface stripe of positive charge (from the guanidinium and tryptophane nitrogens) running along the side of the inside elbow portion of D3, and a groove formed by the 310 helix running parallel to it. The function of this structure is unknown, but the similarities with other CBDs would suggest that it could be involved in a general receptor transport system and not in IL-6 binding.


Following the fibroblast growth factor paradigm (Stauber et al., 2000), it would be expected that IL-6 would bind in the region of the outer elbow formed at the junction of D2 and D3, characterised by 4 loops (L1 to L4) from D2 and 3 loops (L5 to L7) from D3. IL-6 would engage residues in these loops, and mutation in these residues would affect IL-6 binding. Mutational analyses indicate that most of the mutations, when mapped onto the crystal structure, have altered binding due to alterations in the structural integrity of the molecule, in particular mutations in the tryptophane/arginine ladder. Discarding these ‘structural’ mutations, there are clusters of mutations in loops L3, L5 and L7 that reduce binding to IL-6. This site is at the juncture of D2 and D3 domains, and can be inferred to be primarily responsible for IL-6 binding.


In the crystal lattice two molecules of IL-6R related by a crystallographic 2-fold axis are closely associated along the length of each molecule. The association is primarily a hydrophobic contact around the 2-fold axis involving Phe134 and Phe168 of domain D2, the salt-link Glu97-Arg274, and H-bonding of Glu283 with the main chain nitrogens of Thr186. The buried accessible surface area of each molecule is about 1230 Å2 (shape and electrostatic complementarity of 0.722 and 0.728), and is what could be expected for a protein-protein interaction in solution. Furthermore the buried surface of a protein complex does not have to be as extensive for membrane bound receptors to interact, compared to complexes formed where one or both components are in solution. So it is consistent with IL6-R forming dimers on the cell surface of the type observed in the crystal.


The orientation of the dimer is such that the interaction is along the carbohydrate-free ‘flat’ face of the molecule. In this model, domain D3 comes up away from the membrane at 45° and D2 comes down towards the membrane at 30°, with finally D1 running parallel to the membrane. The dimer thus appears as a bridge structure like the common β-receptor of IL-3, with a 50 Å tunnel of triangular cross-section (base ˜40 Å; height ˜22 Å). Non-structural mutations in IL-6R that affect gp130 signalling are found on the loops L1, L3, L5 and L6, and also in a patch of residues at the dimer interface consisting of residues 140 to 142, 167 to 171 (reduce signalling (and 182 to 186 (increase signalling) of the N-terminal side of the d, f and g strands of D2 respectively. These is also mutations in a patch of residues (209 to 213 and 261 to 263) on the membrane distal side of domain D2 that reduce binding.


The recent structure of the Ig and CBD domain gp130 and viral IL-6 complex (Chow et al., 2001, Science 291: 2150-5) reveals a possible dimeric relationship between the IL-6 binding domains of gp130. Thus incorporating the dimeric relationship between all the IL-6 binding receptor domains it is possible to construct a model for the signalling complex that is substantially different from current proposed models and consistent with the available mutational data. Here two IL-6 molecules bind to the IL-6R dimer, followed by two gp130 each binding to both IL-6 molecules in a similar way to the viral IL-6/gp130 fragment complex. gp130 would bind with its second CBD pointing away from the membrane. The remaining three FnIII domains of gp130 would head down towards the membrane, and the signalling activated by dimerisation of the membrane proximal FnIII domains under the bridge of the IL-6R dimer likely through disulphide crosslinking.


Clearly the information provided in this application will enable rational design/selection of compounds which will interact with IL-6R.


Accordingly, in a first aspect the present invention provides a method of selecting or designing a compound that interacts with an IL-6 receptor and modulates an activity mediated by the receptor, the method comprising


(a) assessing the stereochemical complementarity between a compound and a topographic region of the receptor, wherein the receptor comprises:

    • (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;


(b) obtaining a compound which possesses stereochemical complementarity to a topographic region of the receptor; and


(c) testing the compound for its ability to modulate an activity associated with the receptor.


In a preferred embodiment of the first aspect, the subset of amino acids is selected from the group consisting of the subset of amino acids representing the D1 domain (residues 1 to 93), the subset of amino acids representing the D2 domain (residues 94 to 194) and the subset of amino acids representing the D3 domain (residues 195 to 299).


As discussed above, following the fibroblast growth factor receptor paradigm it is expected that IL-6 will bind to IL-6R in the region of the outer elbow formed at the junction of D2 and D3, characterised by 4 loops (L1 to L4) from D2 and 3 loops (L5 to L7) from D3. The ligand binding surfaces of IL-6R are therefore defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281.


Accordingly, in a preferred embodiment the compound is selected or designed to interact with the IL-6 receptor in a manner such as to interfere with the binding of natural ligand to one or more of the residues of IL-6R selected from the group consisting of 106-110, 133-138, 160-168, 190-193, 227-233, 250-256, 276-281 and combinations thereof.


The present inventors have determined that the IL-6R homodimer involves interaction along the carbohydrate free flat faces of the two IL-6 receptors. This dimer interface is defined by residues 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290.


Accordingly, in a preferred embodiment the compound is selected or designed to interact with the IL-6 receptor in a manner such as to interfere with IL-6R homodimer formation.


The compound may interfere with ligand binding or dimer formation in a number of ways. For example the compound may bind or interact with the receptor at or near one or more of the specified residues or corresponding regions and by steric overlap and/or electrostatic repulsion prevent natural ligand binding or dimer formation. Alternatively the compound may bind to the receptor so as to interfere allosterically with natural ligand binding or dimer or formation.


It is presently preferred, however, that the compound binds or interacts with the receptor at or near one or more of the specified residues.


The present inventors have also identified two other regions of the IL-6 receptor that are of particular interest in the development of compounds which affect IL-6 receptor activity. The first is an apparent allosteric switch in D1. It was observed that the first ‘a’ strand of the 3-stranded β-sheet of D1 has moved to form a fifth ‘h’ strand of the 4-stranded β-sheet. This could be a result of crystal packing and represent a degree of flexibility of this domain, acting as a conformational switch, allowing D1 to rotate about 20° by b-strand shifting. This proposed allosteric movement can be blocked by designing a molecule to bind to this switch region and block the putative movement. This region is defined by the residues 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178.


The second is the long tryptophane-arginine ladder which incorporates the conserved WSXWS motif. The polypeptide backbone of this motif has an unusual left-handed 310 helix similar to a poly-proline helix, stabilised by the large tryptophane and arginine stacking and not by main chain H-bonds. This results in a long surface stripe of positive charge (from the guanidinium and tryptophane nitrogens) running along the side of the inside elbow portion of D3, and a groove formed by the 3/10 helix running parallel to it. The function of this structure is unknown, but the similarities with other CBDs would suggest that it could be involved in a general receptor transport system and not in IL-6 binding. Blocking this region could be useful to regulate transport of receptor to the cell surface. This region is defined by the following residues 233-239, 244-248 and 270-290. The WSXWS motif is residues 284-287.


Accordingly in a preferred embodiment of the first aspect of the present invention, the method comprises selecting or designing a compound which has portions that match residues positioned on;

    • (i) the ligand binding surface of IL-6R defined by amino acids 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281; or
    • (ii) the IL-6R dimer interface defined by amino acids 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290; or
    • (iii) amino acids 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178 of IL-6R; or
    • (iv) amino acids residues 233-239, 244-248 and 270-290 of IL-6R.


By “match” we mean that the identified portions interact with the surface residues, for example, via hydrogen bonding or by enthalpy-reducing Van der Waals interactions which promote desolvation of the biologically active compound with the receptor, in such a way that retention of the compound by the receptor is favoured energetically.


In a further preferred embodiment of the present invention, the stereochemical complementarity is such that the compound has a Kd for the receptor site of less than 10−6M. More preferably, the Kd value is less than 10−8M and more preferably less than 10−9M.


In preferred embodiments of the first aspect of the present invention, the compound is selected or modified from a known compound identified from a data base.


For example this specification provides information regarding the portions of IL-6 which are involved in receptor binding. With this information IL-6 variants may be designed in which specific residues are modified or altered such that the variant is able to bind to the receptor but not initiate signalling. It would be expected that such a variant would compete with the natural ligand for binding to the receptor. Such a variant would therefore be an antagonist. In a similar manner variants which would act as agonists could be designed. In this case the modifications or alterations would be selected such as to increase the strength of interaction between the receptor and the variant so as to lead to increased signalling.


As will be understood by those skilled in this field knowledge of the structure of a protein complex is of assistance in the development of mutants of one of the proteins with enhanced affinity for its protein partner. Structural information can be used to select residues on one or more of the protein interfaces in the complex for alteration by methods such as site-directed mutagenesis or phage display. For example, amino acid positions in growth hormone which were allowed to vary were chosen in part from the crystal structure of the complex of growth hormone bound to two molecules of the human growth hormone extracellular region (Lowman and Wells, 1993, J. Mol. Biol. 234: 564-578.). Using a model of the granulocyte colony-stimulating factor (G-CSF) receptor ligand binding domain, residues of the receptor were chosen for mutagenesis by analogy with the structure of human growth hormone bound to its receptors (Layton et al., 1997, J Biol. Chem. 272: 29735-29741.). Some of the mutant G-CSF receptors were found to bind G-CSF with slightly enhanced affinity. The structure of the complex could also be used to design mutations which would potentially increase the binding affinity, for example by increasing the amount of hydrogen bonds and/or van der Waals interactions across the interface.


The modification of protein residues to enhance protein binding affinity is not restricted to those residues in the relevant protein-protein interfaces. Modification of residues outside of an interface may lead to alterations due to changes in the long-range electrostatic interactions between the two interacting proteins which changes the rate of association and subsequently the equilibrium binding constant (Seizer and Schreiber, 1999, J Mol Biol. 287: 409-419.; Seizer et al, 2000, Nat Struct Biol. 7: 537-541.). The contribution of mutations to the association rate can be calculated and has been used to increase the association rate (without greatly changing the dissociation rate) and the affinity of beta-lactamase inhibitory protein to TEM1 beta-lactamase by a factor of 250 (Seizer et al., 2000).


By “variant” we mean that the natural sequence of IL-6 has been modified by one or more point mutations, insertions of amino acids, deletions of amino acids or replacement of amino acids, in particular using non-natural amino acids such as D-isomers of natural amino acids, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, beta-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, β-naphthalimo amino acids and amino acid analogues in general.


In a second aspect the present invention provides computer-assisted method for identifying potential compounds able to interact with the IL-6 receptor and thereby modulate an activity mediated by the receptor, using a programmed computer comprising a processor, an input device, and an output device, comprising the steps of:

    • (a) inputting into the programmed computer, through the input device, data comprising the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;
    • (b) generating, using computer methods, a set of atomic coordinates of a structure that possesses stereochemical complementarity to the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations, thereby generating a criteria data set;
    • (c) comparing, using the processor, the criteria data set to a computer database of chemical structures;
    • (d) selecting from the database, using computer methods, chemical structures which are similar to a portion of said criteria data set; and
    • (e) outputting, to the output device, the selected chemical structures which are complementary to or similar to a portion of the criteria data set.


In a preferred embodiment the subset of amino acids are the amino acids defining the ligand binding surface(s) or dimer interface or allosteric switch or the tryptophane-arginine ladder.


In a preferred embodiment of the second aspect, the method is used to identify potential compounds which have the ability to decrease an activity mediated by the receptor.


In a further preferred embodiment of the second aspect, the method further comprises the step of selecting one or more chemical structures from step (e) which interact with the IL-6 receptor in a manner such as to:

    • (i) interfere with the binding of natural ligand to one or more of the residues of IL-6R selected from the group consisting of 106-110, 133-138, 160-168, 190-193, 227-233, 250-256, 276-281 and combinations thereof; or
    • (ii) interfere with IL-6R homodimer formation.


In a still further preferred embodiment of the second aspect, the method further comprises the step of selecting one or more chemical structures from step (e) which interact with:—

    • (i) one or more of the residues of IL-6R selected from the group consisting of 106-110, 133-138, 160-168, 190-193, 227-233, 250-256, 276-281 and combinations thereof; or
    • (ii) the IL-6R dimer interface defined by amino acids 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290; or
    • (iii) one or more of the residues of IL-6R selected from the group consisting of 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124, 178 and combinations thereof; or
    • (iv) one or more of the residues of IL-6R selected from the group consisting of 233-239, 244-248, 270-290 and combinations thereof.


In a further preferred embodiment of the second aspect, the method further comprises the step of obtaining a compound with a chemical structure selected in steps (d) and (e), and testing the compound for the ability to decrease an activity mediated by the receptor.


In a further preferred embodiment of the second aspect, the method further comprises the step of obtaining a molecule with a chemical structure selected in steps (d) and (e), and testing the compound for the ability to increase an activity mediated by the receptor molecule.


The present invention also provides a method of screening of a putative compound having the ability to modulate the activity of the IL-6 receptor, comprising the steps of identifying a putative compound by a method according to the first or second aspects, and testing the compound for the ability to increase or decrease an activity mediated by the molecule. In one embodiment, the test is carried out in vitro. Preferably, the in vitro test is a high throughput assay. In another embodiment, the test is carried out in vivo.


In a third aspect the present invention provides a computer for producing a three-dimensional representation of a molecule or molecular complex, wherein the computer comprises:

    • (a) a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the machine readable data comprise the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;
    • (b) a working memory for storing instructions for processing the machine-readable data;
    • (c) a central-processing unit coupled to the working memory and to the machine-readable data storage medium, for processing the machine-readable data into the three dimensional representation; and
    • (d) an output hardware coupled to the central processing unit, for receiving the three-dimensional representation.


In a preferred embodiment the subset of amino acids are the amino acids defining at least one ligand binding surface(s) or dimer or allosteric switch or the tryptophane-arginine ladder.


In a fourth aspect the present invention provides a compound able to bind to the IL-6 receptor and to modulate an activity mediated by the molecule, the compound being obtained by a method according to the present invention.


In a preferred embodiment of the fourth aspect, the compound is a mutant of the natural ligand of the IL-6 receptor, where at least one mutation occurs in the region of the natural ligand which interacts with the receptor.


In a fifth aspect the present invention provides a compound which possesses stereochemical complementarity to a topographic region of the IL-6 receptor and which modulates an activity mediated by the receptor, wherein the receptor is characterised by

    • (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations.


with the proviso that the compound is not a naturally occurring ligand of the IL-6 receptor or a mutant thereof.


By “mutant” we mean a ligand which has been modified by one or more point mutations, insertions of amino acids or deletions of amino acids.


In a preferred embodiment of the fifth aspect, the topographic region of the receptor is defined by amino acids 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281, or the IL-6R dimer interface defined by amino acids 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290, or amino acids 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178, or amino acids 233-239, 244-248 and 270-290 and combinations thereof.


In preferred embodiments of the fourth and fifth aspects, the stereochemical complementarity between the compound and the receptor site is such that the compound has a Kd for the receptor site of less than 10−6M, more preferably less than 10−8M.


In other embodiments of the fourth and fifth aspects, the compound decreases an activity mediated by the IL-6 receptor.


In a sixth aspect; the present invention provides a pharmaceutical composition for preventing or treating a disease associated with signalling by the IL-6 receptor which comprises a compound according to the fourth or fifth aspects of the present invention and a pharmaceutically acceptable carrier or diluent.


In another aspect the present invention provides a method of preventing or treating a disease associated with signalling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound according to the fourth or fifth aspects of the present invention. Preferably, the disease is selected from multiple myeloma, lymphoma, inflammation, rheumatoid arthritis, prostate cancer, Castleman's disease, AIDS, mesangial proliferative glomerulonephritis, Kaposi's sarcoma, sepsis, osteoporosis and psoriasis.


In yet another aspect, the present invention provides a method for evaluating the ability of a chemical entity to interact with the IL-6 receptor, said method comprising the steps of:

    • (a) creating a computer model of at least one region of the IL-6 receptor using structure coordinates wherein the root mean square deviation between said structure coordinates and the structure coordinates of amino acids 1-299 of IL-6 receptor as set forth in Appendix I is not more than about 1.5 Å;
    • (b) employing computational means to perform a fitting operation between the chemical entity and said computer model of the binding surface; and
    • (c) analysing the results of said fitting operation to quantify the association between the chemical entity and the binding surface model.


In a preferred embodiment of this aspect of the invention the region of IL-6R is selected from the group consisting of the ligand binding surface defined by amino acids 106-110, 133-138, 160-168, 190-193, 227-233, 250-256, 276-281 and combinations thereof, or the IL-6R dimer interface defined by amino acids 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290, or amino acids 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178, or amino acids 233-239, 244-248 and 270-290 and combinations thereof 97, 134, 140-142, 167-171, 209-213, 261-263 and 274.


As discussed above the present inventors have developed a model of the IL-6, IL-6R, gp130 hexameric complex. The coordinates for this complex are set out in Appendix II where the six molecules are identified as follows:—

AAAAgp130BBBBgp130CCCCIL-6RDDDDIL-6REEEEIL-6FFFFIL-6


The interfaces in this complex are as follows:—


IL-6R/gp130 Interface:

    • on IL-6R: residues 135-138; 161-169
    • on gp130: residues 140-149


IL-6/IL-6R Interface (Site I)

    • on IL-6: residues 41; 45; 51; 52; 56; 60; 62; 69-71; 75-79; 166; 169; 172-173; 176-177; 179-180; 183-184
    • on IL-6R: residues 1; 6; 107-108; 137-139; 162-168; 190; 193; 226-231; 277-281


IL-6/gp130 Interface (Site II)

    • on IL-6: residues 20; 24-25; 27; 29; 31-32; 35-36; 39-40; 42; 110-115; 117-119; 121-123; 125-126; 129; 179; 183;
    • on gp130: residues 114; 116; 142; 144-145; 147-148; 163-172; 193-196; 226-227; 229; 231-232; 281-283; 285;


IL-6/gp130 Interface (Site III)

    • on IL-6: residues 63-64; 93-94; 97-98; 128; 130-147; 151-152; 155; 159; 162-163;
    • on gp130: residues 1-5; 8-15; 49; 75-78; 132-137; 148-154; 163; 177-184.


This information enables the selection or design of compounds that interfere with this hexamer formation.


Accordingly, in a further aspect the present invention provides a method of selecting or designing a compound that interferes with the formation of the IL-6, IL-6R, gp130 hexameric complex, the method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of the complex, wherein the complex is characterised by


(i) the amino acids of IL-6, IL-6 receptor and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or


(ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations.


In another aspect the present invention provides computer-assisted method for identifying compounds that interfere with the formation of the IL-6, IL-6R, gp130 hexameric complex, using a programmed computer comprising a processor, an input device, and an output device, comprising the steps of:

    • (a) inputting into the programmed computer, through the input device, data comprising the atomic coordinates of amino acids IL-6, IL-6R and gp130 as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations;
    • (b) generating, using computer methods, a set of atomic coordinates of a structure that possesses stereochemical complementarity to the atomic coordinates the IL-6, IL-6R, gp130 hexameric complex as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations, thereby generating a criteria data set;
    • (c) comparing, using the processor, the criteria data set to a computer database of chemical structures;
    • (d) selecting from the database, using computer methods, chemical structures which are similar to a portion of said criteria data set; and
    • (e) outputting, to the output device, the selected chemical structures which are complementary to or similar to a portion of the criteria data set.


In another aspect the present invention provides a computer for producing a three-dimensional representation of a molecule or molecular complex, wherein the computer comprises:

    • (a) a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the machine readable data comprise the atomic coordinates of the IL-6, IL-6R, gp130 hexameric complex as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5A, or one or more subsets of said amino acids, or one or more subsets of the coordinates shown in Appendix II by whole body translations and/or rotations;
    • (b) a working memory for storing instructions for processing the machine-readable data;
    • (c) a central-processing unit coupled to the working memory and to the machine-readable data storage medium, for processing the machine-readable data into the three dimensional representation; and
    • (d) an output hardware coupled to the central processing unit, for receiving the three-dimensional representation.


In another aspect the present invention provides a compound which possesses stereochemical complementarity to a topographic region of the IL-6, IL-6R, gp130 hexameric complex and which modulates an activity mediated by the complex, wherein the complex is characterised by

    • (i) the amino acids of IL-6, IL-6R and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations.


In yet another aspect, the present invention provides a method for evaluating the ability of a chemical entity to interact with the IL-6, IL-6R, gp130 hexameric complex, said method comprising the steps of:

    • (a) creating a computer model of at least one region of the IL-6, IL-6R, gp130 hexameric complex using structure coordinates wherein the root mean square deviation between said structure coordinates and the structure coordinates set forth in Appendix I is not more than about 1.5 Å;
    • (b) employing computational means to perform a fitting operation between the chemical entity and said computer model; and
    • (c) analysing the results of said fitting operation to quantify the association between the chemical entity and the model.


The compound may interfere with hexamer formation in a number of ways. For example the compound may bind or interact with one of the molecules at or near one or more of the specified residues or corresponding regions and by steric overlap and/or electrostatic repulsion prevent natural ligand binding or dimer formation. Alternatively the compound may bind to one of the molecules so as to interfere allosterically with hexamer formation.


It is presently preferred, however, that the compound binds or interacts with at least one of the molecules positioned at the interfaces.


Accordingly in a preferred embodiment of the present invention, the method comprises selecting or designing a compound which has portions that match at least one region of the molecules that make up the hexamer, in which the region is selected from the group consisting of;

    • (i) amino acids 20, 24, 25, 27, 29, 31, 32, 35, 36, 39, 40-42, 45, 51, 52, 56, 60, 62-64, 69-71, 75-79, 93, 94, 110-115, 117-119, 121-123, 125, 126, 128-147, 151, 152, 155, 159, 161-169, 172, 173, 176, 177, 179, 180, 183 and 184 of IL-6 and combinations thereof; and
    • (ii) amino acids 1-5, 8-15, 49, 75-78, 114, 116, 132-137, 140-154, 163-172, 177-184, 193-196, 226, 227, 229, 231, 232, 281-283 and 285 of gp130 and combinations thereof; and
    • (iii) amino acids 1, 6, 107, 108, 135-139, 161-169, 190, 193, 226-231 and 277-281 of IL-6R and combinations thereof.


In a still further aspect the present invention consists in a crystalline composition comprising IL-6 receptor or portion thereof or variant of these.


As will be readily understood by persons skilled in this field the methods of the present invention provide a rational method for designing and selecting compounds which interact with the IL-6 receptor. In the majority of cases these compounds will require further development in order to increase activity. Such further development is routine in this field and will be assisted by the structural information provided in this application. It is intended that in particular embodiments the methods of the present invention includes such further developmental steps.


In another aspect the present invention consists in a method of assessing the interaction between a compound and the IL-6 receptor, the method comprising exposing a crystalline composition comprising IL-6 receptor or portion thereof or variant of these to the compound and measuring the leveling of binding of the compound to the crystal.


Accordingly, in another aspect the present invention consists in a method of designing or selecting a compound which modulates IL-6R signalling, the method comprising subjecting a compound obtained by a method according to any one of the previous aspects of the present invention to biological screens and assessing the ability of the compound to modulate IL-6R signalling.


In yet a further aspect, the invention provides a method of utilizing molecular replacement to obtain structural information about a molecule or a molecular complex of unknown structure, comprising the steps of:

    • (i) crystallising said molecule or molecular complex;
    • (ii) generating an X-ray diffraction pattern from said crystallized molecule or molecular complex;
    • (iii) applying at least a portion of the structure coordinates set forth in Appendix I to the X-ray diffraction pattern to generate a three-dimensional electron density map of at least a portion of the molecule or molecular complex whose structure is unknown.


The structure of the IL-6 receptor can be used to design mutant IL-6 receptor molecules or fragments thereof for use as therapeutics.


Accordingly, the present invention also provides a compound comprising an extracellular fragment of IL-6R, wherein the extracellular fragment is modified at one or more amino acids of IL-6R selected from the group consisting of:

    • (i) amino acids 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281; or
    • (ii) amino acids 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290; or
    • (iii) amino acids 11, 45, 46, 55, 62-6, 69-72, 75, 81, 55, 90-93, 122-124 and 178; or
    • (iv) amino acids 233-239, 244-248 and 270-290; or
    • (v) amino acids 1, 6, 107, 108, 135-139, 161-169, 190, 193, 226-231 and 277-281.


In a preferred embodiment of this aspect of the invention, the modification is made to an amino acid which is not listed in Table 2.


In a further preferred embodiment, the IL-6R fragment comprises residues 1 to 325 of SEQ ID NO:1.


In one embodiment of this aspect, the modification increases the affinity of the IL-6R fragment for one or more of its natural ligands when compared to the unmodified fragment.


It will be appreciated by those skilled in the art that these compounds may be formulated into pharmaceutical compositions and used to treat diseases associated with signaling of the IL-6R.


The present inventors have now obtained three dimensional structural information about the IL-6 receptor which enables a more accurate understanding of how the binding of ligand leads to signal transduction. Such information provides a rational basis for the development of ligands for specific therapeutic applications, something that heretofore could not have been predicted de nova from available sequence data.


Those of skill in the art will understand that a set of structural coordinates determined by X-ray crystallography is not without standard error. For the purposes of this invention, any set of structure coordinates for IL-6R or an IL-6R complex that have a root mean square derivation of protein backbone atoms of less than 1.5 Å when superimposed (using backbone atoms) on the structure coordinates listed in Appendix I or II shall be considered identical.


The precise mechanisms underlying the binding of agonists and antagonists to the IL-6 receptor are not fully clarified. However, the binding of ligands to the receptor site, preferably with an affinity in the order of 10−8M or higher, is understood to arise from enhanced stereochemical complementarity relative to naturally occurring IL-6 receptor ligands.


Such stereochemical complementarity, pursuant to the present invention, is characteristic of a molecule that matches intra-site surface residues lining the groove of the receptor site as enumerated by the coordinates set out in Appendix I. By “match” we mean that the identified portions interact with the surface residues, for example, via hydrogen bonding or by non-covalent Van der Waals and Coulomb interactions which promote desolvation of the biologically active compound within the site, in such a way that retention of the biologically active compound within the groove is favoured energetically.


Substances which are complementary to the shape and electrostatics or chemistry of the receptor site characterised by amino acids positioned at atomic coordinates set out in Appendix I will be able to bind to the receptor, and when the binding is sufficiently strong, substantially prohibit binding of the naturally occurring ligands to the site.


It will be appreciated that it is not necessary that the complementarity between ligands and the receptor site extend over all residues lining the groove in order to inhibit binding of the natural ligand.


In general, the design of a molecule possessing stereochemical complementarity can be accomplished by means of techniques that optimize, chemically and/or geometrically, the “fit” between a molecule and a target receptor. Known techniques of this sort are reviewed by Sheridan and Venkataraghavan, 1987, Acc. Chem Res. 20: 322; Goodford, 1984, J. Med. Chem. 27: 557; Beddell, 1985, Chem. Soc. Reviews 279; Hol, 1986, Angew. Chem. 25: 767, Verlinde and Hol, 1984, Structure 2: 577, Walters et al., 1998, Drug Discovery Today 3: 160; Langer and Hoffmann, 2001, Current Pharmaceutical Design 7: 509; Good, 2001, Current Opinion in Drug Disc. Devel. 5, 301; Gane and Dean, 2000, Curr. Opinion Struct. Biol. 10: 401. the respective contents of which are hereby incorporated by reference. See also Blundell et al., 1987, Nature 326: 347 (drug development based on information regarding receptor structure) and Loughney et al., 1999, Med. Chem. Res. 9: 579 (database mining application on the growth hormone receptor).


There are two preferred approaches to designing a molecule, according to the present invention, that complements the stereochemistry of the IL-6 receptor. The first approach is to in silico directly dock molecules from a three-dimensional structural database, to the receptor site, using mostly, but not exclusively, geometric criteria to assess the goodness-of-fit of a particular molecule to the site. In this approach, the number of internal degrees of freedom (and the corresponding local minima in the molecular conformation space) is reduced by considering only the geometric (hard-sphere) interactions of two rigid bodies, where one body (the active site) contains “pockets” or “grooves” that form binding sites for the second body (the complementing molecule, as ligand).


This approach is illustrated by Kuntz et al., 1982, J. Mol. Biol. 161: 269, and Ewing et al., 2001, J. Comput-Aid. Mol. Design 15: 411, the contents of which are hereby incorporated by reference, whose algorithm for ligand design is implemented in a commercial software package, DOCK version 4.0, distributed by the Regents of the University of California and further described in a document, provided by the distributor, which is entitled “Overview of the DOCK program suite” the contents of which are hereby incorporated by reference. Pursuant to the Kuntz algorithm, the shape of the cavity represented by the IL-6 receptor site is defined as a series of overlapping spheres of different radii. One or more extant databases of crystallographic data, such as the Cambridge Structural Database System maintained by Cambridge University (University Chemical Laboratory, Lensfield Road, Cambridge CB2 1EW, U.K.), the Protein Data Bank maintained by the Research Collaboratory for Structural Bioinformatics (Rutgers University, N.J., U.S.A.), LeadQuest (Tripos Associates, Inc., St. Louis, Mo.), Available Chemicals Directory (Molecular Design Ltd., San Leandro, Calif.), and the NCI database (National Cancer Institute, U.S.A) is then searched for molecules which approximate the shape thus defined.


Molecules identified in this way, on the basis of geometric parameters, can then be modified to satisfy criteria associated with chemical complementarity, such as hydrogen bonding, ionic interactions and Van der Waals interactions. Different scoring functions can be employed to rank and select the best molecule from a database. See for example Bohm and Stahl, 1999, M. Med. Chem. Res. 9: 445. The software package FlexX, marketed by Tripos Associates, Inc. (St. Louis, Mo.) is another program that can be used in this direct docking approach (see Rarey, M. et al., J. Mol. Biol. 1996, 261: 470).


The second preferred approach entails an assessment of the interaction of respective chemical groups (“probes”) with the active site at sample positions within and around the site, resulting in an array of energy values from which three-dimensional contour surfaces at selected energy levels can be generated. The chemical-probe approach to ligand design is described, for example, by Goodford, 1985, J. Med. Chem. 28:849, the contents of which are hereby incorporated by reference, and is implemented in several commercial software packages, such as GRID (product of Molecular Discovery Ltd., West Way House, Elms Parade, Oxford OX2 9LL, U.K.). Pursuant to this approach, the chemical prerequisites for a site-complementing molecule are identified at the outset, by probing the active site with different chemical probes, e.g., water, a methyl group, an amine nitrogen, a carboxyl oxygen, and a hydroxyl. Favoured sites for interaction between the active site and each probe are thus determined, and from the resulting three-dimensional pattern of such sites a putative complementary molecule can be generated. This may be done either by programs that can search three-dimensional databases to identify molecules incorporating desired pharmacophore patterns or by programs which using the favoured sites and probes as input perform de novo design.


Programs suitable for searching three-dimensional databases to identify molecules bearing a desired pharmacophore include: MACCS-3D and ISIS/3D (Molecular Design Ltd., San Leandro, Calif.), ChemDBS-3D (Chemical Design Ltd., Oxford, U.K.), and Sybyl/3 DB Unity (Tripos Associates, Inc., St. Louis, Mo.).


Programs suitable for pharmacophore selection and design include: DISCO (Abbott Laboratories, Abbott Park, Ill.), Catalyst (Accelrys, San. Diego, Calif.), and ChemDBS-3D (Chemical Design Ltd., Oxford, U.K.).


Databases of chemical structures are available from a number of sources including Cambridge Crystallographic Data Centre (Cambridge, U.K.), Molecular Design, Ltd., (San Leandro, Calif.), Tripos Associates, Inc. (St. Louis, Mo.), and Chemical Abstracts Service (Columbus, Ohio).


De novo design programs include Ludi (Biosym Technologies Inc., San Diego, Calif.), Leapfrog (Tripos Associates, Inc.), Aladdin (Daylight Chemical Information Systems, Irvine, Calif.), and LigBuilder (Peking University, China).


Those skilled in the art will recognize that the design of a mimetic may require slight structural alteration or adjustment of a chemical structure designed or identified using the methods of the invention.


The invention may be implemented in hardware or software, or a combination of both. However, preferably, the invention is implemented in computer programs executing on programmable computers each comprising a processor, a data storage system (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. Program code is applied to input data to perform the functions described above and generate output information. The output information is applied to one or more output devices, in known fashion. The computer may be, for example, a personal computer, microcomputer, or workstation of conventional design.


Each program is preferably implemented in a high level procedural or object-oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be compiled or interpreted language.


Each such computer program is preferably stored on a storage medium or device (e.g., ROM or magnetic diskette) readable by a general or special purpose programmable computer, for configuring and operating the computer when the storage media or device is read by the computer to perform the procedures described herein. The inventive system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner to perform the functions described herein.


Modulators of II-6 Receptors


The present invention provides the atomic coordinates of the crystal structure of the soluble extracellular portion of an IL-6 receptor alpha subunit and the atomic coordinates of the crystal structure of a hexameric complex consisting of both IL-6 receptor subunits (sIL-6α and gp130) together with the ligand IL-6. As described above, these coordinates can be used in methods of the invention to screen in silico, for potential modulators of IL-6R activity.


Accordingly, the present invention provides a compound which modulates the activity of an IL-6R, said compound identified/identifiable by a method of the invention. Preferably said compound is specific for IL-6R, for example said compounds exhibits at least 10 times, preferably at least 50 times more activity towards IL-6R than other cytokine receptors, such as an IGR receptor.


In one embodiment, said method comprises the step of assessing the stereochemical complementarity between the compound and a topographic region of the receptor, wherein the receptor comprises:

    • (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or
    • (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations.


Preferably the topographic region of the IL-6 receptor is selected from:

    • (i) a ligand binding surface defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 or 276-281 and combinations thereof;
    • (ii) the homodimer interface defined by resides 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 or 282-290 and combinations thereof;
    • (iii) residues 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178; and
    • (iv) residues 233-239, 244-248 and 270-290.


Compounds designed according to the methods of the present invention may be assessed by a number of in vitro and in vivo assays of hormone function. For example, the identification of IL-6 receptor antagonists of may be undertaken using a solid-phase receptor binding assay. Potential antagonists may be screened for their ability to inhibit the binding of europium-labelled IL-6 receptor ligands to soluble, recombinant IL-6 receptor in a microplate-based format. Europium is a lanthanide fluorophore, the presence of which can be measured using time-resolved fluorometry. The sensitivity of this assay matches that achieved by radioisotopes, measurement is rapid and is performed in a microplate format to allow high-sample throughput, and the approach is gaining wide acceptance as the method of choice in the development of screens for receptor agonists/antagonists (see Apell et al., J. Biomolec. Screening 3:19-27, 1998: Inglese et al., Biochemistry 37:2372-2377, 1998).


Binding affinity and inhibitor potency may be measured for candidate inhibitors using biosensor technology.


Biological assays to measure the activity of IL-6R are well known in the field. In particular various assay systems are described in Hammacher et al., 1998, J. Biol. Chem. 273, 22701-22707; and Hammacher et al, 1994, Protein Science, 3, 2280-2293, the disclosures of which are incorporated by reference.


The methods of the invention have been used to identify a number of compounds that potentially are agonist or antagonists of IL-6 activity. These compounds have subsequently been tested in vitro to determine biological activity and specificity. Compounds showing specific IL-6R agonist or antagonist activity have been modeled using the crystal structure of the invention to determine the site of binding and the amino acid residues of IL-6Rα that are contacted by the compounds. These results have enabled the generation of general structural criteria for both a class of antagonists and a class of agonists that occupy all or part of the IL-6 ligand binding surface defined by loops L1 to L7 as described above.


Antagonists


In a preferred embodiment, an antagonist of an IL-6 receptor is a compound of formula A-B-C, wherein


A consists of three fused 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted;


or two non-fused 5-, 6- or 7-membered saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted;


C consists of three fused 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; or two non-fused 5-, 6- or 7-membered saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; and


B is an aliphatic linker having a length substantially equivalent to an ethylene moiety, for example a linker of similar length and geometry to a peptide bond i.e. with a length of about 3.5 Å;


wherein said compound has stereocomplementarity to a ligand binding topographic region of:


(i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or


(ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;


said ligand binding topographic region being defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281 of said IL-6 receptor, or combinations thereof.


Ring substituents should be selected so as not to disrupt the stereocomplementarity with the IL-6 ligand binding surface. Consequently, substituents are generally selected from C1-C4 alkyl, halogen, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl. Moreover, the rings of A are preferably joined such that A is in a substantially “straight” configuration in relation to the linker B. Equally, the rings of A are preferably joined such that C is in a substantially “straight” configuration in relation to the linker B.


Preferably A has the following formula:
embedded image

wherein


Z is a bond; or Z, R4 and R10 taken together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, preferably 5 or 6 members, optionally containing one or more heteroatoms selected from O, N and S; or Z, R3 and R6 taken together form an aryl ring or a heteroaryl, cycloalkyl, cycloalkenyl or heterocyclyl ring having 5, 6 or 7 members, preferably 5 or 6 members, wherein said aryl ring or said heteroaryl, cycloalkyl, cycloalkenyl, heterocyclyl ring are optionally substituted;


R7, R8 or R9 are bonded to linker B;


R1, R2 and R5 are each independently, hydrogen, C1-C4 alkyl, halogen, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl;


R3 and R6 unless bonded together with Z are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl;


R4 and R10 unless bonded together with Z, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl;


R7, R8 and R9 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl;


Preferably the linker B has the following formula:
embedded image

wherein Y and Y1 are each independently C, O, S or N, provided that Y and Y1 are not both O, N or S;


R11 and R12 are each independently hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl;


Preferably at least Y and R11 together, or Y1 and R12 together are CO. Preferably Y or Y1 are NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl, preferably hydrogen.


Preferably C has the following formula:
embedded image

wherein


X is a bond; or X, R14 and R18 taken together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, preferably 5 or 6 members, optionally containing one or more heteroatoms selected from O, N and S; or X, R15 and R22 taken together form an aryl ring or a heteroaryl, cycloalkyl, cycloalkenyl or heterocyclyl ring having 5, 6 or 7 members, preferably 5 or 6 members, wherein said aryl ring or said heteroaryl, cycloalkyl, cycloalkenyl, heterocyclyl ring are optionally substituted;


R13, R16 or R17 are bonded to linker B;


R19, R20 and R21 are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl;


R14 and R18 unless bonded together with Z are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl;


R15 and R22 unless bonded together with Z, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl;


R13, R16 and R17 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl;


In a highly preferred embodiment, said antagonist has the following formula I:
embedded image

Wherein X, Y, Y1, Z and R1 to R29 are as defined above.


More preferably, the antagonist is a compound of formula II:
embedded image

wherein X, Y, Y1, Z and R1 to R29 are as defined above; and


R23, R24, R25 and R26 are each independently hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl, preferably O.


Agonists


In a preferred embodiment, an agonist of an IL-6 receptor is a compound of formula A-B-D, wherein


A is as defined above for antagonists;


D consists of one, or two fused, 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; and


B is an aliphatic linker as defined above for antagonists;


wherein said compound has stereocomplementarity to a ligand binding topographic region of:


(i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or


(ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations;


said ligand binding topographic region being defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281 of said IL-6 receptor, or combinations thereof.


Ring substituents should be selected so as not to disrupt the stereocomplementarity with the IL-6 ligand binding surface. Consequently, substituents are generally selected from C1-C4 alkyl, halogen, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl. The length of the D moiety is less than that of the C moiety of the antagonist compounds. Consequently, substituents should be selected so as avoid the length of the D moiety exceeding the equivalent of two aryl rings. Moreover, where D consists of two fused rings, the rings are preferably joined such that D is in a substantially “straight” configuration in relation to the linker B.


Preferably D has the following formula:
embedded image

wherein


Y2, Y3 or Y4 are each independently C, O, N or S, provided that at least two of Y2, Y3 and Y4 are C;


R13, R17 or R18 are bonded to linker B;


R14, R15 and R16 are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl; or


R16 is hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl, and R14 and R15 together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S; or


R14 is hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl R15 and R16 together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S;


R13, R17 and R18 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl;


In a preferred embodiment, said agonist has the following formula III:
embedded image

Wherein Y, Y1, Y2, Z and R1 to R29 are as defined above. Preferably Y2 is N.


In an alternative preferred embodiment, said agonist has the following formula IV:
embedded image

wherein X, Y, Y1, R1 to R13 and R17 are as defined above;


R14, R15 and R16 are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl; preferably R14, R15 and R16 are each independently, hydrogen, C1-C2 alkyl, NH2, OH or halogen.


R23 is hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl, preferably O.


Reference above to halogen includes I, Br, Cl and F. Preferably the halogen is Br or Cl.


Where reference is made to C1 to C4 alkyl above, C1 to C3 alkyl is preferred, with C1 to C2 alkyl being especially preferred. Where rings include heteroatoms, selected from N, O and S, it is preferred that any particular ring has less than three heteroatoms, preferably less than two heteroatoms. Where reference is made to rings having 5, 6 or 7 members, it is preferred that rings have 5 or 6 members.


Uses of Modulators of IL-6R


The compounds described above may be used to modulate IL-6R activity in cells, i.e. activate or inhibit IL-6R activity. Given that aberrant IL-6R/IL-6 activity is implicated in a range of disorders, the compounds described above may also be used to treat, ameliorate or prevent disorders characterised by abnormal IL-6 signalling. Examples of such disorders include multiple myeloma, rheumatoid arthritis, Castleman's disease, AIDS, mesangial proliferative glomerulonephritis, Kaposi's sarcoma, sepsis, osteoporosis and psoriasis.


Administration


Compounds of the invention, i.e. modulators of IL-6R identified or identifiable by the screening methods of the invention, may preferably be combined with various components to produce compositions of the invention. Preferably the compositions are combined with a pharmaceutically acceptable carrier or diluent to produce a pharmaceutical composition (which may be for human or animal use).


The formulation will depend upon the nature of the compound and the route of administration but typically they can be formulated for topical, parenteral, intramuscular, oral, intravenous, intra-peritoneal, intranasal inhalation, lung inhalation, intradermal or intra-articular administration. The compound may be used in an injectable form. It may therefore be mixed with any vehicle which is pharmaceutically acceptable for an injectable formulation, preferably for a direct injection at the site to be treated, although it may be administered systemically.


The pharmaceutically acceptable carrier or diluent may be, for example, sterile isotonic saline solutions, or other isotonic solutions such as phosphate-buffered saline. The compounds of the present invention may be admixed with any suitable binder(s), lubricant(s), suspending agents), coating agent(s), solubilising agent(s). It is also preferred to formulate the compound in an orally active form.


In general, a therapeutically effective daily oral or intravenous dose of the compounds of the invention, including compounds of the invention and their salts, is likely to range from 0.01 to 50 mg/kg body weight of the subject to be treated, preferably 0.1 to 20 mg/kg. The compounds of the invention and their salts may also be administered by intravenous infusion, at a dose which is likely to range from 0.001-10 mg/kg/hr.


Tablets or capsules of the compounds may be administered singly or two or more at a time, as appropriate. It is also possible to administer the compounds in sustained release formulations.


Typically, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient. The above dosages are exemplary of the average case. There can, of course, be individual instances where higher or lower dosage ranges are merited, and such are within the scope of this invention.


For some applications, preferably the compositions are administered orally in the form of tablets containing excipients such as starch or lactose, or in capsules or ovules either alone or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing flavouring or colouring agents.


The compositions (as well as the compounds alone) can also be injected parenterally, for example intracavernosally, intravenously, intramuscularly or subcutaneously. In this case, the compositions will comprise a suitable carrier or diluent.


For parenteral administration, the compositions are best used in the form of a sterile aqueous solution which may contain other substances, for example enough salts or monosaccharides to make the solution isotonic with blood.


For buccal or sublingual administration the compositions may be administered in the form of tablets or lozenges which can be formulated in a conventional manner.


For oral, parenteral, buccal and sublingual administration to subjects (such as patients), the daily dosage level of the compounds of the present invention and their pharmaceutically acceptable salts and solvates may typically be from 10 to 500 mg (in single or divided doses). Thus, and by way of example, tablets or capsules may contain from 5 to 100 mg of active compound for administration singly, or two or more at a time, as appropriate. As indicated above, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient.


The routes of administration and dosages described are intended only as a guide since a skilled practitioner will be able to determine readily the optimum route of administration and dosage for any particular patient depending on, for example, the age, weight and condition of the patient.


The present invention will now be described further with reference to the following examples, which are illustrative only and non-limiting. The examples refer to the figures:




DETAILED DESCRIPTION OF THE FIGURES


FIG. 1


(a) A MOLSCRIPT diagram of sIL-6R indicating the β-sheet arrangement (shades of green, orange, blue for domains D1, D2 and D3 and respectively). The different β-strand shades represent the separate β-sheets in the structure. Four asparagine (blue spheres) linked sites are shown with their associated carbohydrate moieties represented by yellow spheres linked by yellow bonds. The N-terminal residues 1 to 15 (gray tube), is tethered to strand F of D2 by a disulfide bond. The 310 helices (purple), the loops (red) L1 to L7 (L1 to L7 consist of the polypeptides S106 to N110 (L1), K133 to P138 (L2), A160 to F168 (L3), Q190 to G193 (L4), S227 to R233 (L5), M250 to H256 (L6) and Q276 to Q281 (L7)) that could interact with IL-6 around the juncture of D2 and D3 are also shown. The single cysteine residue disulfide linked to C192 is also shown, as are the disulfide links.


(b) The domain structure (CPK model) of sIL-6R showing domain D1 in green, D2 in red and D3 in blue, with carbohydrate in yellow spheres.



FIG. 2


A view of the surface associated with the WSXWS (green bonds) sequence motif in IL-6R and its association with the other residues of the tryptophane/arginine ladder (yellow bonds) in D3. The backbone of the WSXWS motif is shown as part of the 310 helix found in cytokine binding domains, and a groove on the surface lies above this helix. A stripe of positively charged residues (arginines) lie parallel to this grove. Nitrogen and oxygen atoms of the side chains are coloured blue and red spheres respectively as are the surfaces in proximity to these atoms.



FIG. 3


A MOLSCRIPT diagram of the CBD of sIL-6R viewed from the top of FIG. 1a showing the IL-6 binding loops L1 to L7. The side-chains of the loops are shown as ball and stick representations, and the remaining structure as in FIG. 1a. The IL-6 binding loops are represented by a backbone worm representation coloured red (L1), yellow (L2), green (L3), cyan (L4), pink (L5), dark green (L6) and orange (L7) respectively. The side-chain bonds are colour matched to the respective loops. Oxygen, nitrogen and sulphur atoms in the loop side-chains are coloured as red, blue and yellow spheres respectively (57). The cysteinyl-cysteine C192 is also shown on loop L4.



FIG. 4


The dimer interface of IL-6R showing the region of interaction by red, green, blue, purple and cyan areas representing interdimeric distances of less than 5, 4, 3, 2, and 1 Å respectively. The crystal 2-fold axis is vertical and the membrane is horizontal below the molecule.



FIG. 5


A MOLSCRIPT CPK representation of a crystallographic dimer of IL-6R, viewed towards the membrane anchors, showing the three mutational clusters found on the IL-6R structure. The crystallographic 2-fold axis lies at the centre of the figure and perpendicular to the paper and the dimer interface lies east to west through the two-fold axis. Residues in the upper molecule in the figure are coloured to represent mutations that effect IL-6 binding to IL-6R (red, pink and green spheres representing the atoms of residues which have <25%, 25% to 75% and >75% of the binding activity of the wild type respectively). Residues in the lower molecule are coloured to represent mutations that effect gp130 signalling (red, pink and green spheres representing the atoms of residues which have <25%, 25% to 75% and >75% of the signalling of wild type respectively). Residues in domains D1, D2 and D3 that there is no mutational data or alter the structure are coloured white, light grey and grey respectively. The cysteinyl-cysteine C192 is coloured dark grey, and carbohydrate is represented by yellow bonds.



FIG. 6


(a) Spacefilling model of the proposed hexameric receptor complex, consisting of two molecules each of IL-6, IL-6R and gp130 chains (shown in purple and blue, light and dark green, orange and red respectively). IL-6 is labeled with the binding sites I, II and III.


(b) A modular representation of the hexameric complex (D1 to D6 are labeled 1 to 6), coloured as in (a), but with the additional three fibronectin domains (D4 to D6) of gp130, that have not been modeled, placed to indicate how the association of the two D5′ modules from both gp130 molecules, under the ‘tunnel’ of the IL-6R dimer, can activate signaling by dimerisation of the cytoplasmic domains (white circles).



FIG. 7


Solid phase receptor binding assays. Ten compounds selected from Table 3 (NCI compounds) were tested for their ability to modulate the binding of europium-labelled IL-6 to soluble, recombinant IL-6 receptor (A) or the binding of europium-labelled IGF-1 to soluble, recombinant IGF-1 receptor (B) in a microplate-based format. Binding to the receptor is measured by time resolved fluorescence (TRF) as a % of control.



FIG. 8


Model of the IL-6 receptor showing the site of binding of an antagonist (NCI compound 39914) (A) and an agonist (NCI compound 17791) (B).




EXAMPLES
Example 1
Crystallisation and Structural Analysis of sIL-6

Methods


Expression, Purification and Crystallization of sIL-6R


Monomeric sIL-6R was purified from the conditioned medium of glycosylation defective mutant Chinese hamster ovary (CHO) cell line Lec3.2.8.1 (Stanley, 1989, Mol. Cell Biol. 9: 377-383), transfected with the construct pEE14sIL6RL325, which encodes sIL-6R. Briefly, sIL-6R transformed Lec3.2.8.1 cells were grown in fermentation apparatus with a working volume of 1.25 L (New Brunswick Celligen Plus fermenter, New Brunswick, USA). Conditioned media was concentrated 20-fold by ultrafiltration. sIL-6R was purified from concentrated Lec3.2.8.1 cell media by binding to a 5 mL column of human IL-6-Sepharose. SIL-6R was further purified by preparative size-exclusion chromatography and concentrated to 10 mg/mL using a 10000 MWCO centrifugal concentrator (Centricon, Millipore, USA).


The protein (8 mg/ml in 5 mM Tris-HCl pH 8.0) was crystallised in hanging drops by vapour diffusion against a reservoir containing 50 mM ammonium sulphate, 18% PEG 3350, 2.5 mM Sodium citrate and at pH 5.6. Crystals grew in the space group P43212 (a=51.4 Å, c=305.4 Å) with 51% solvent. To ensure isomorphism for heavy metal derivatives 0.1% gluteraldehyde was added to the well solution for 16 hrs. Crystals were then transferred to a stabilising solution of 0.1M lithium sulphate with 2.5 mM Tris-HCl at pH 6.5, and finally to a freezing solution of 20% PEG 3350, 16% glycerol, 0.1 M lithium sulphate, 2.5 mM sodium citrate and 2.5 mM Tris-HCl at pH 6.5.


Diffraction Data


X-ray data were collected from crystals flash-frozen to 100K in a nitrogen stream. Three data sets were collected for each of the native, and two platinum derivatives (PIP: di-u-iodo-bis(ethylene-diamine)-di-platinum(II)nitrate; PTN: K2Pt(NO2)4). Three data sets were collected for each native and two derivatives. A data set with the c* axis parallel to the spindle, one perpendicular to the spindle axis and one low resolution data set, using R-AXIS II and IV image-plate detectors fitted with both mirror and monocapillary optics (Balaic et al., 1996, J. Synchrotron Rad. 3: 289-295) respectively. A 2.4 Å native data set was collected at the Advance Photon Source (Argonne, U.S.A.) on beam line BM14C using an ADSC Quantum-4 CCD-detector. All data sets were collected at 0.5° or 1.0° oscillations and processed and scaled using HKL (Otwinowski and Minor, 1997, Macromolec. Crystallog. Pt A. 276: 307-326) (Table 1).

TABLE 1Data collection and MIR phasing statisticsNativePIPPTNData CollectionResolution (Å) 2.4 3.0 3.0Completeness* (%)99.4(100)91.6(63.4)95.7(80.2)Multiplicity18.6 6.5 5.6<I/σ(I)>*39.0(3.7)15.4(2.2)10.5(1.0)Rmerge*0.11(0.57)0.05(0.38)0.13(0.58)Wilson B (Å2)85.367.779.2MIR PhasingResolution (Å)50-4.050-4.0Sites per molecule 3 4Total occupation 0.77 0.95Phasing Power‡? 1.4(1.7)[1.1] 1.1(1.3)[1.1]RCullis0.71(0.76)[0.93]0.82(0.82)[0.92]
*Numbers in parentheses are values in the high resolution shell, where Rmerge = Σ(Ii − <I>)/ΣIi summed over all independent reflections.

Each site was split into pairs.

The three numbers represent summation over: centric(acentric)[acentric anomalous] reflections.

?Root mean square (rms) fh/residual = √(Σ(Fderi − FPH)2), where fh and FPH are the calculated heavy-atom and derivative structure factors, respectively.

Rcullis = Σ||fh| − (|Fderi| − |Fnati|)/|Σ||Fderi| − |Fnati||, where Fderi and Fnati are the observed derivative and native structure factors, respectively


Structure Solution.


The heavy atom sites of the PIP and PTN derivatives were determined by Patterson methods (CCP4, 1994, Acta Crystallogr. D. Biol. Crystallogr. 50: 760-763) and SOLVE (Terwilliger and Berendzen, 1999, Acta Crystallogr. D. Biol. Crystallogr. 55: 849-861). The space group and hand of the heavy atom sites was determined using the heavy atom anomalous data from both derivatives. The positions of sites were refined to 4 Å using SHARP (de la Fortelle and Bricogne, 1997, Macromolec. Crystallog. Pt A. 276: 472-494) and phasing statistics are shown on Table 1. The 4 Å solvent-flattened (51% solvent) MIRAS-phased electron density map was skeletonised and used to trace the Cchain of the protein. This map clearly showed the CBD module, and the Ig domain. A partial atomic model of the FnIII domains and part of the Ig domain was built using Lego fragments in “O” (Jones et al., 1991, Acta Crystallogr. A. 47: 110-119). N-linked carbohydrate, were included, at asparagines N36, N74, N2O2 and N226 and 80 solvent molecules. The disulfide bonds were consistent with the earlier mass spectrometric data. The model was then refined using CNS (Brunger et al., 1998, Acta Crystallogr. D. Biol Crystallogr. 54: 905-921), extending the resolution to 2.4 Å by an iterative process involving model rebuilding and gradual identification and refinement of the residues on the longer loops linking the strands. The quality of the electron density and the thermal parameters of the Ig domain were poorer than the CBD indicating a higher degree of disorder. Residues 1 to 5 and 82 to 84 were set to zero occupancy as the density was poor and residues beyond 299 (C-terminus) were not observable. The side chains of loops 49 to 52 and 136 to 138 appeared highly mobile or disordered. Additional solvent (including two sulphate ions) was added during the course of the refinement. A previously reported cysteine disulfide bonded to the free C192 was observed and built into the model. The final R-factor was 0.22 (R-free of 0.29 over 5% of the data) with r.m.s.d. in bond lengths and angles being 0.015 Å and 1.96° respectively with overall anisotropic temperature factors of B11=B22=−6.4 Å2 and B33=12.7 Å2 added to the individual atomic isotropic B-values.


The coordinates of the IL-6R structure are provided herein as Appendix I.


Modeling the Complex


The model of the hexameric interleukin-6 signal transduction complex (see FIG. 6a.) was assembled using a parallel version of the protein-protein docking algorithm FTDOCK 2.0 (Gabb et al., 1997, J. Mol. Biol. 272: 106-120). The protein chains were taken from the NMR structure (PDB code 1IL6) for IL-6 (Xu et al., 1997, J. Mol. Biol. 268: 468-481), the X-ray structure of gp130 from the viral IL-6 gp130 complex (Chow et al., 2001, Science 291: 2150-2155) (PDB code 1I1R) and the IL-6R dimer. Using a grid spacing of 0.5 Å and incorporating electrostatic treatment the following protein/protein docking calculations were undertaken. IL-6 was docked with IL-6R, IL-6 with IL-6R dimer, IL-6 with gp130 and gp130 to IL-6R. Complexes were re-scored (Moont et al., 1999, Proteins 35: 364-373) and the top ranking complexes satisfying the available mutational data (Table 2, Salvati et al., 1995, J. Biol. Chem. 270: 12242-12249) was subject to rigid body refinement and side-chain rotamer optimization using the program MULTIDOCK (Jackson et al., 1998, J. Mol. Biol. 276: 265-285). There was no significant difference in the docking solutions produced for IL-6 to IL-6R and IL-6 to the IL-6R dimer so the solutions of IL-6 to IL-6R dimer were used, with each alpha chain in the dimer binding its cognate IL-6 for assembly of the complex. The final complex was built by superimposing the chains in common from the best solutions using INSIGHTII (Accelrys Inc, San Diego, Calif., USA). The resulting complex was then minimized using CNS (Brunger et al., 1998), with a harmonic restraint applied to the α-carbon backbone.


The coordinates of the IL-6R/IL-6/gp130 hexamer are provided herein as Appendix II.


Results and Discussion


The IL-6R Extracellular Structure


The complete extracellular region of human IL-6R was expressed and secreted from the Chinese hamster ovary (CHO) cell line, Lec3.2.8.1, which produces carbohydrate-deficient glycoprotein (Stanley, 1989). This was necessary because normal CHO cell lines produce heterogeneously glycosylated protein, which did not crystallise. The Lec3.2.8.1 cell secretes a form of the extracellular domain of IL-6R (residues 1 to 325) which was purified and crystallized to diffract to 2.4 Å. The IL-6R 3-dimensional structure was determined by multiple isomorphous replacement with anomalous scattering (MIRAS) from two heavy atom derivatives.


Our X-ray structure of the extracellular domain of the IL-6R (FIG. 1) consists of the N-terminal Ig domain (D1, residues 1 to 93) linked to a classical is CBD (D2, residues 94 to 194; D3, residues 195 to 299). The first 5 residues of the N-terminus are poorly defined and amino acids past residue 299 are not visible in the X-ray structure. The three domains lie on a similar plane making the receptor a long ‘flat’ structure in which the domains D2 and D3 are connected at about 90° to each other and D1 is connected at about 135° to D2. Carbohydrate is attached at 4 sites on one of the faces of the molecule (FIG. 1), indicating that this face is unlikely to be involved in either the binding of IL-6 or the formation of the signalling complex with gp130. This conclusion is supported by the expression of functional sIL-6R in Escherichia coli which lacks carbohydrate.


The N-terminal cysteine residue C6 is disulfide bonded to residue C174 in D2, and there is an unusual β-sheet arrangement of D1 (FIG. 1a). D1 has an S-type Ig topology very similar to the ligand-binding domain of fibroblast growth factor (LBD-FGF)), but the first ‘A’ strand of the 3-stranded β-sheet of the LBD-FGF Ig domain is not observed in IL-6R. It appears that these residues have ‘peeled away’ from the sheet when compared to this strand in LBD-FGF (0.7 Å r.m.s.d. between the α-carbons of the two domain structures excluding the ‘A’ strand atoms). The C-terminus of this domain forms a fifth ‘A′’ strand of what would be a 4-stranded β-sheet in an S-type Ig topology. This arrangement of the ‘A’ strand of the β-sheet could be a result of crystal packing and we suggest there is a degree of flexibility of this domain, which acts as a conformational switch, to allow D1 to rotate up to 20° by β-strand shifting. This flexibility could explain what appears to be a higher degree of disorder in this domain when compared to the other domains as indicated by the higher temperature factors of residues in domain D1 compared to D2 or D3 (mean B values for domains D1, D2 and D3 are 68 Å2, 49 Å2 and 44 Å2 respectively). The molecules of IL-6R in the crystal are packed head to tail in a double helix arrangement with the helical axis along the 4-fold crystal screw axis in direction of the long cell edge (c-axis) and the pitch and diameter of the helix will vary if the angle between domains D1 and D2 changes. Such flexibility of domains would explain the lack of isomorphism between different crystals of IL-6R that were observed in this study.


A characteristic of the CBD of IL-6R is the long tryptophan-arginine ladder (from the N-terminus end of D3 is R239, F246, R237, W287, R274, W284, Q276) which is also found in other class 1 CBD structures. This ladder incorporates the conserved WSXWS motif (residues 284 to 287 in IL-6R) located in the carboxy terminus region of domain D3 of the CBD (FIG. 2). The polypeptide backbone of the WSXWS sequence has an unusual motif: a left-handed 310 helix similar to a polyproline helix. This helix is stabilised by the stacking of the indole side chains of the two tryptophan residues of the WSXWS sequence with arginine side-chains (R237 and R274), and H-bonding of serine side chain hydroxyls but not by main chain H-bonds. The ladder produces a long surface stripe of positive charge (from the guanidinium and tryptophan indol nitrogens) running along D3 near the inside elbow of D2-D3, and a groove formed by the 310 helix running parallel to the stripe (FIG. 2). Although this structure is conserved in other known structures of CBDs its functional role has not as yet been clarified. It could be implicated in the folding of the CBD, complex formation or involved in a general receptor transport system.


Following the paradigm of growth hormone (de Vos et al., 1992), and the available mutational data, it would be expected that IL-6 would bind in the region of the outer elbow formed at the junction of D2 and D3 (FIG. 3), characterised by 4 loops (S106-N110 (L1), K133-P138 (L2), A160-F168 (L3), Q190-G193 (L4)) from D2 and 3 loops (S227-R233 (L5), M250-H256 (L6) and Q276-Q281 (L7)) from D3 (FIG. 3). These loops present a long and narrow (43 Å by 9.5 Å) potential binding area held in place by the rigid D2 and D3 framework of the cytokine binding domains. IL-6 would engage residues in some of these loops.


A Dimer in the Crystal


In the crystal lattice, two molecules of IL-6R are related by a crystallographic 2-fold axis and are closely associated along the length of each molecule (FIG. 4). These two molecules (with a total buried accessible surface area of about 2460 Å2) represent a potential physiological dimer of IL-6R. The association is primarily a hydrophobic contact around the 2-fold axis involving F134 and F168 of domain D2, the salt-link E97-R274, and hydrogen-bonding of the side chain carboxylate of E283 with the main chain amide of T186. The buried accessible surface area (FIG. 4) of each molecule (surface- and electrostatic-complementarity of 0.722 and 0.728 respectively) is much larger than that of a protein-protein interaction surface in solution (typically ˜800 Å2). Furthermore the buried surface of membrane-associated receptor complexes do not have to be as extensive as their counterparts in solution, since their degrees of freedom are restricted to the two-dimensional surface of the membrane. Hence the dimeric association observed here in the crystal may well represent a complex formed on the cell surface.


The orientation of the monomers with the dimer is such that the interaction is along the carbohydrate-free ‘flat’ face of the molecule (FIG. 5). In this crystal structure, if the 2-fold axis of the dimer is perpendicular to the membrane surface, then domain D3 projects away from the membrane at 45° and D2 points down towards the membrane at 30o. D1 would then run parallel to the membrane (FIG. 4). The dimer appears as a bridge structure (FIG. 6) similar to the common β-receptor of IL-3, with a 50 Å long tunnel of triangular cross-section (base ˜40 Å; height ˜22 Å).


Structural Implications of Mutations


Mapping of the mutational data onto the crystal structure of IL-6R provide some insights into regions on the surface of IL-6R that are critical to binding of IL-6 and implicated in signalling through gp130. Analysis of these data indicate that most of the mutations, when mapped onto the crystal structure, cause altered binding due to compromised structural integrity of the molecule, in particular mutations in the tryptophan/arginine ladder, cysteines forming disulfide bonds, and buried hydrophobic residues forming the hydrophobic core. The non-structural mutations occur on three clusters on the surface of IL-6R (Table 2, FIG. 5).


There is clearly a cluster of residues (see FIG. 5, Cluster 1) that form the binding site to IL-6. The mutations of residues having the greatest effect on IL-6 binding are those at P162 and E163 from L3; S228, F229, Y230 and R231 from L5 and E278 and F279 from L7 (Table 1). Two of these loops, L5 and L7, are both situated on D3 and this is consistent with the report that D3 by itself is able to bind IL-6 (18). This site at the juncture of D2 and D3 domains, and can be inferred to be primarily responsible for IL-6 binding.


Non-structural mutations in IL-6R that affect gp130 signalling (FIG. 5) can be classified onto two main clusters. The first occurs at a hydrophobic patch (residues F134, F168 and Y169) at the dimer interface (FIG. 5, Cluster 2) located around the crystallographic two fold axis relating the two molecules of the dimer. Mutations here reduce signalling but have no effect on binding. There are also other residues, not included in Cluster 2, on domain D3 in this interface that reduce signalling but not binding (see FIG. 5). These data indicate that mutations that would be expected to interfere with the formation of the IL-6R dimer have a significant effect on IL-6 signalling but not on IL-6 binding. 14 The other cluster of mutations effecting gp130 signalling occurs on a patch of residues (FIG. 5, Cluster 3) centred around residue H261 on domain D3. Mutational studies in this region of IL-6R were based on the growth hormone paradigm, which demonstrated a contact region between the second CBD domains of the growth hormone complex (de Vos et al., 1992, Science 255: 306-312). This patch is clearly not involved in IL-6 binding or IL-6R dimerisation, but when mutated this region effects signalling. This is most likely a region that is involved in the formation of the IL-6/IL-6R complex with gp130.


It has been predicted previously that two SSFY sequence repeats in IL-6R are involved in binding sites to IL-6. The first repeat S165-Y169 is located in L3 of D2. Mutations in S167 and Y169 reduce signalling, but this is likely due to the structural importance of these buried residues. F168 is solvent exposed and forms the central part of Cluster 2 that is involved in IL-6R dimer formation, not IL-6 binding. A second aromatic residue, F134, is the other major residue in Cluster 2. The second SSFY repeat (S227-Y230) is located in L5 on D3 of IL-6R (FIG. 3). These residues (apart from S227) are critical to IL-6 binding and are all surface exposed residues.


It has been reported previously that N211, H261 and D262 together form a group of residues that affect signalling but not binding of IL-6 to IL-6R. Surprisingly, these residues cluster with W214 and V259, that when mutated (to N and Q respectively) increase the signalling of IL-6 over wild type. The role of the cysteinyl-cysteine C192 remains unsolved. The residue is on the periphery of the IL-6 binding site. When C192 is mutated to alanine, the binding of IL-6 to IL-6R and signalling are slightly increased. The human IL-6R sequence is the only vertebrate sequence in the database to date that has a cysteine in this position: pig and cow having a tyrosine and mouse, and rat having a leucine.

TABLE 2Structural Analysis of Selected Mutational DataLoopBindingSignallingCommentRCluster1 IL-6/IL-6R binding interfaceP162L3<10%N.T.PE-LQ double mutant. Glu mayYE163L3interact with IL-6, Pro unlikely toS228L3100%<25%a range of mutants. possible minorKIL-6 cluster I residueF229L5<5%0%IL-6 binding site residueKY230L5<5%0%a range of mutants. IL-6 binding siteKresidueR231L50%N.T.RL-SI double mutant. R231 is likely toYL232L5be a IL-6 binding site residue. L232 isa buried hydrophobic residueE278L70%N.T.EF-AI double mutant. IL-6 binding siteYresidueF279L7Cluster2 IL-6R dimer interfaceF134L2>100%70%F-L mutation, one of two Fs in IL-6RYdimerS167L3 50-100% 5-25%buried in loop & forms H bonds -Kmany mutantsF168L3 100-130-%50%-110%A range of mutants. Central aromaticKresidue in IL-6R dimer interface. F-Ymutant signals at 50%Y16930-70% 0-15%range of mutants - buried/structuralKresidueCluster3 D3 signalling clusterN21175%N.T.N-D mutant, <10% binding to gp130SW21470%>150%W-Q mutant - solvent exposed,Ypossibly part of site of gp130 bindingfaceV25940%140%exposed V-N mutant - maybeYinvolved in gp130 interactionI2600%N.T.I-D mutation. buried hydrophobicYresidueH261115%10%H-I mutation involved in gp130YinteractionD26230%<10%D-G mutant. H-bonds to stabiliseYbinding loop and maybe interacts withgp130
N.T. = not tested;

R = Reference

K = Kalai et al., 1997, Blood 89: 1319-1333;

Y = Yawata et al., 1993, EMBO J. 12: 1705-1712; and

S = Salvati et al., 1995, J. Biol. Chem. 270: 12242-12249.


The Hexameric Signalling Complex


A hexameric complex (FIG. 6a) incorporating the IL-6R dimer can be constructed in a manner that is compatible with the available mutational and functional data present in the literature except for the mutations in Cluster 3, which have no cognate binding partner in this model. Additionally, the placement of the three C-terminal fibronectin type III domains (D4, D5 and D6) from gp130 has not been considered. Several orientations of these domains are possible but in the absence of any structural information they are difficult to dock with sufficient surety. Using homology models of these three domains, one can orient in such a way so they contact Cluster 3 on D3 of IL-6R and form a disulfide link proposed between D5 of gp130 underneath the complex as shown schematically in FIG. 6b.


The recent structure of the Ig and CBD domain of gp130 complexed with viral IL-6 reveals a possible dimeric relationship between the human IL-6 and the binding domains of gp130. The dimeric relationship of the viral IL-6/gp130 complex is a crystallographic 2-fold relationship, as in the IL-6R dimer. Incorporating the same dimeric relationship between all the IL-6 binding receptor domains, it is possible to construct a model for the signalling complex (FIG. 6a). In this model, two IL-6 molecules bind to the IL-6R dimer via site I (FIG. 6) followed by two gp130 molecules each binding through sites II and III of different IL-6 molecules in a similar way to the viral IL-6/gp130 fragment complex. The signalling molecule gp130 would bind the IL-6/IL-6R complex with D3 pointing away from the membrane. The remaining three FnIII domains of each gp130 would orientate towards the membrane, and the signalling activated by dimerisation of the gp130 membrane proximal FnIII domains (FIG. 6b) under the bridge of the IL-6R dimer likely through disulfide crosslinking. Although this model is substantially different from current proposed models, it is consistent with the available biological data. Other gp130 signalling complexes (e.g., IL-11) could act through a similar mechanism. A notable feature of this model is the binding orientation of IL-6 to IL-6R is rotated 180° compared to the analogous positions in growth hormone, prolactin and erythropoeitin receptor structures, and the IL-12 structure.


The crystal structure of IL-6R and the model of the hexameric complex will provide the basis for the design of mutations of the proteins involved in this complex. It also enables the design/selection of small molecular weight antagonists and agonists to IL-6 signalling that can be developed into therapeutics targeted to the diseases modulated by IL-6 signalling. The selection of a number of small molecular weight compounds and testing for activity as antagonists or agonists to IL-6 signalling is described below.


Example 2
Screening for Agonist/Antagonists of IL-6R

In Silico Screening


The sIL-6R structural information provided in Appendix I was used in an in silico screen for compounds having complementarity to the ligand binding surfaces of IL-6R defined by loops L1-L7. The screen was conducted on compounds in the NCI database using DOCK and in-house ranking and re-scoring protocols. Briefly, the orientation from the docking algorithm considered as the correct orientation was calculated in a normalised rank-by-number strategy (Wang and Wang, 2001, Journal of Chemical Information and Computer Sciences 41(5):1422-6) using scoring functions based on the DOCK energy function (Kuntz et al., 1982, J. Mol Biol 161: 269-288), SCORE (Wang et al, 1988, J Mol Model 4: 379-394), chemscore (Gohlke et al, 2000, J Mol Biol 295: 337-336), potential of mean force (Muegge et al, 1999, J Med. Chem. 4: 379-394), SMOG (DeWitte et al, 1996, J Am Chem Soc. 118: 11733-11744). The scores associated with each are then re-ranked in a rank-by-rank strategy (Wang and Wang, 2001). The top 100 compounds are then sorted by calculated octanol/water solubility (log P) (Wang, 1997, J. Chem. Inf. Comput. Sci. 37: 615), with compounds with a logP greater than 6 excluded from the hit-list. Representative examples of the top 100 compounds are listed in Table 3.


High Capacity Screening Assay for Detection of Agonists/Antagonists of the IL6/IL6Rα/gp130 Interaction.


Ten of the compounds listed in Table 3 were tested for their ability to modulate binding of IL-6 to IL-6R/gp130. The high throughput screening assay used for this purpose detects the binding of europium-labelled human IL6 (Eu-hIL6) to human soluble interleukin 6 receptor α-chain (hsIL6Rα) and hsgp130 immobilised on the surface of wells of a microtitre plate. The protocol followed was essentially as follows:


Europium Labelling of hIL-6


HIL-6 (2.13 mg, 100 nmol) was mixed with 1 mg (1,500 nmol) of Eu labelling reagent in 100 mM NaHCO3 buffer; pH 9.3 and incubated at 4° C. for 48 h. The Eu-labelled IL-6 was then separated from unreacted labelling reagent and high molecular weight protein aggregates using a Pharmacia Superdex-200 column (HR 16/10) equilibrated in a buffer containing 50 mM Tris-HCl; pH 7.75, 0.9% NaCl, and 0.1% sodium azide. The column was run at a flow rate of 0.5 mL/min and fractions of 0.5 mL were collected.


The fractions were assayed for europium by diluting 10,000-fold in 200 μL of DELFIA enhancement solution and measuring europium fluorescence. The Eu-IL6 eluted (14-15 min) after high molecular weight protein aggregates (11-12 min) and before the unreacted labelling reagent (19 min). The concentration of Eu-IL6 was determined from the absorbance at 280 nm (0.5 AU/mg/mL) and the labelling stoichiometry based on a molecular mass of 21,300.


Screening Assay Protocol


Assays were performed in 384-well black microtitre plates. 50 μL of hsgp130 (2.5 μg/mL in PBS; pH 7.2) was added to each well in columns 1-22 of the assay plates. PBS was added only to wells in columns 23-24. The samples were then incubated at 4° C. overnight. Plates were washed 4 times with 100 μL of DELFIA® Wash Buffer using the EMBLA 384-well plate washer. DELFIA® Assay Buffer (25 μL) was added to all wells using the Multidrop 384-well dispenser.


Test compounds were reconstituted in methanol and transferred (3 μL samples) to the appropriate assay plate wells using the SAGIAN Multimek. Methanol was allowed to evaporate from assay plates by incubating at room temperature for 15 minutes.


A solution containing Eu-hIL6 (90 ng/mL) and hsIL6Rα (2 μg/mL) was prepared in DELFIA® Assay Buffer and added (25 μL samples) to all plate wells using the Multidrop 384-well dispenser. The plates were incubated at room temperature for 2 hours. Plate wells were then washed 8 times with 100 μL/well of DELFIA® Wash Buffer using the EMBLA 384-plate washer. DELFIA® Enhancement Solution (75 μL) was added to each well and the plates were then covered and incubated overnight at room temperature. Plates were then read on the Wallac 1420 Victor Multilabel Counter set at the Europium protocol for 384-well plates.


In order to examine specificity, the compounds were also assayed against the IGF receptor system in an identical manner.


Results of the compounds tested using this assay procedure are shown in FIG. 7. FIG. 7A shows the results obtained using the gp130/IL-6Rα/IL-6 system, and FIG. 7B shows the results obtained using the IGF-1R/IGF-1 system. These results show that compound 39914 acts specifically as an antagonist of IL-6 binding to IL-6R/gp130 and that compounds 17791 and 56681 act specifically as agonists of IL-6 binding to IL-6R/gp130.


Modelling studies have shown that compounds 39914 and 17791 are likely to interact with IL-6R via the groove on IL-6R defined by residues Phe229, Tyr230, Phe279, Glu278, Glu163, Cys192, Pro162 and Leu108 (see FIG. 8).

TABLE 3Examples of compounds identified by in silico screening.NSCFormulaCAS No.Sample Name/structure23127C42H25N3O6128-89-2N-(4-((5-(benzoylamino)-9,10-dioxo-9,10-dihydro-1-anthracenyl)amino)-9,10-dioxo-9,10-dihydro-1-anthracenyl)benzamide35855C34H16Cl2N2O66962-69-21,2-bis((3-chloro-1,4-dioxo-1,4-dihydro-2-naphthalenyl)amino)anthra-9,10-quinone39911C29H14ClNO56336-94-31-chloro-N-(9,10-dioxo-9,10-dihydro-2-anthracenyl)-9,10-dioxo-9,10-dihydro-2-anthracenecarboxamide39918C39H21N3O66941-47-5N-(3-methyl-2,7-dioxo-2,7-dihydro-3H-naphtho[1,2,3-de]quinolin-6-yl)-5,8,14-trioxo-5,8,13,14-tetrahydronaphtho[2,3-c]acridine-10-carboxamide51530C28H22N2O46636-38-01,2-dihydroxy-5,8-di(4-toluidino)anthra-9,10-quinone72123C42H45N4O536519-42-3Serpentinine95172C27H19NO31863-38-41-(4-benzo[a]anthracen-7-ylbenzoyl)aziridine102061C24H14O627172-29-8embedded image127622C24H16N2O1025351-81-9dimethyl 5-((4,5-dihydroxy-8-(hydroxy(oxido)amino)-9,10-dioxo-9,10-dihydro-1-anthracenyl)amino)isophthalate137564C30H24O2S51347-04-72,5-bis(diphenylmethylene)tetrahydrothiophene1,1-dioxide219732C30H26N458478-33-4N1, N4-di(9-acridinyl)-1,4-butanediamine244432C30H18O1213191-64-5Aurofusarin252124C28H24N10O380266-34-8N-(4-(((4-((3-(hydroxy(oxido)amino)-9-acridinyl)amino)anilino)carbonyl)amino)phenyl)dicarbonimido/ic diamide/imido299135C44H24N4O84430-70-01-amino-N-(4-(((1-amino-9,10-dioxo-9,10-dihydro-2-anthracenyl)carbonyl)amino)-9,10-dioxo-9,10-dihydro-1-anthracenyl)-9,10-dioxo-9,10-dihydro-2-anthracenecarboxamide304394C31H28Cl2N877476-26-72-(1-(7-(N-((4-chloroanilino)(imino)methyl)ethanehydrazonoyl)-9H-fluoren-2-yl)ethylidene)-N-(4-chlorophenyl)hydrazinecarboximidamide334710C32H18N2O570655-11-7embedded image346568C28H18N4O275357-014-methyl-2-(4-(4-methyl[1,3]oxazolo[4,5-c] quinolin-2-yl)phenyl)[1,3]oxazolo[4,5-c] quinoline351369C28H16O551870-19-06,9-dihydroxy-1,3-diphenylanthra[2,3-c]furan-5,10-dione112929C42H38O2081-27-6Sennoside A661748C42H42O1217912-85-5Hopeaphenol307240C51H43N13O12S656377-79-8Nosiheptide8680C34H28N6O14S4314-13-64-amino-6-((4′-((8-amino-1-hydroxy-5,7-disulfo-2-naphthyl)diazenyl)-3,3′-dimethyl[1,1′-biphenyl]-4-yl)diazenyl)-5-hydroxy-1,3-naphthalene disulfonic acid65849C34H28N6O14S46968-33-84-amino-6-((4′-((8-amino-1-hydroxy-5,7-disulfo-2-naphthyl)diazenyl)-3,3′-dimethyl[1,1-biphenyl]-4-yl)diazenyl)-5-hydroxy-1,3-naphthalenedisulfonic acid210354C25H17FN6O4S42447-68-74-((3-(4-(6-amino-9H-purin-8-yl)benzoyl)anilino)carbonyl)benzenesulfonyl fluoride9618C37H31N3O9S37401-32-34-(4-((4-(4-sulfoanilino)-2,5-cyclohexadien-1-ylidene)(4-(4-sulfoanilino)phenyl)methyl)anilino)benzenesulfonic acid367934C40H29NO684166-05-2embedded image56681C21H12BrNO46633-33-62-(((3-bromo-9-oxo-9H-fluoren-2-yl)amino)carbonyl)benzoic acid252124C28H24N10O380266-34-8N-(4-(((4-((3-(hydroxy(oxido)amino)-9-acridinyl)amino)anilino)carbonyl)amino)phenyl)dicarbonimido/ic diamide/imido367629C23H11N3O481092-84-412-(hydroxy(oxido)amino)naphtho[1′,2′,3′:4,5]quino[2,1-b]quinazoline-5,10-dione180974C34H32N2O551076-20-1Tricordatine17791C23H18NO72666-55-81-[1,1′-biphenyl]-4-yl-2-(25-isoquinolin-2-yl)ethanone39914C29H16N2O56336-97-61-amino-N-(9,10-dioxo-9,10-dihydro-2-anthracenyl)-9,10-dioxo-9,10-dihydro-2-anthracenecarboxamide72275C49H33N7O6S78064-60-6Primuline377102C38H40N4O85335-06-411-(3-ethylidene-1,2,3,4,6,7,12,12b-octahydroindolo[2,3-a]quinolizin-2-yl)-10,11-didehydrostrychnidine39965C29H14N2O4S6937-84-42-(1-amino-9,10-dioxo-9,10-dihydro-2-anthracenyl)anthra[2,1-d][1,3]thiazole-6,11-dione51306C21H10 BrNO36344-57-62-(7-bromo-9-oxo-9H-fluoren-2-yl)-1H-isoindole-1,3(2H)-dione


The disclosure of all publications referred to in this application are incorporated herein by reference.


It will be appreciated by persons skilled in the art that numerous variations and/or modifications may be made to the invention as shown in the specific embodiments without departing from the spirit or scope of the invention as broadly described. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive.

APPENDIX IStructural coordinates of sIL-6Rtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filedtext missing or illegible when filed1CBLEU10.44942.37288.7771.00101.992CGLEU1−0.87241.88889.3921.0094.373CD1LEU1−1.20540.49488.8931.0092.504CD2LEU1−0.74741.89890.9081.0088.115CLEU1−0.37743.40686.6281.0099.906OLEU1−0.15144.61986.5721.0096.657NLEU12.07442.76686.9341.0093.318CALEU10.64442.43187.2491.00100.219NALA2−1.49442.84986.1541.0099.7610CAALA2−2.58043.61585.5481.0095.8411CBALA2−2.91443.03884.1831.0085.1112CALA2−3.78243.54786.5231.00100.6213OALA2−3.61643.86587.6951.0096.4714NPRO3−4.99043.12886.0821.00104.6315CDPRO3−5.58742.85384.7571.00106.8016CAPRO3−6.04843.11187.0961.00103.5817CBPRO3−7.11742.24686.4391.0098.1918CGPRO3−7.09942.78385.0661.0097.5319CPRO3−5.75242.70288.5421.00103.0920OPRO3−4.66942.22288.8811.00100.5421NARG4−6.77042.92489.3701.00103.3122CAARG4−6.78042.66190.8021.00100.6723CBARG4−8.22142.38791.2521.00104.2724CGARG4−9.09943.63691.3151.00102.6525CDARG4−8.75144.50692.5151.00103.4726NEARG4−9.53845.73892.5471.00103.5427CZARG4−9.59846.56493.5891.00103.3728NH1ARG4−8.91846.29494.6971.0099.6129NH2ARG4−10.34047.66393.5231.00104.8630CARG4−5.85441.60091.3871.0096.5931OARG4−4.63041.75691.3831.0096.8732NARG5−6.42640.52791.9171.0090.2433CAARG5−5.58339.53192.5301.0084.7434CBARG5−5.95639.37994.0091.0087.3335CGARG5−7.43839.23894.2891.0086.8336CDARG5−7.70939.27295.7961.0084.6837NEARG5−7.03538.18796.5040.0086.0638CZARG5−7.10637.99597.8180.0086.0539NH1ARG5−7.82238.81798.5730.0086.2540NH2ARG5−6.46036.98198.3780.0086.2541CARG5−5.42538.16391.8771.0081.2442OARG5−6.17437.74690.9831.0067.0043NCYS6−4.38737.49292.3611.0078.2644CACYS6−3.96236.17591.9141.0073.5845CCYS6−3.91235.28993.1481.0063.5246OCYS6−2.84034.92793.6231.0061.3647CBCYS6−2.57136.31491.2851.0061.7148SGCYS6−1.86034.80390.5961.0058.6749NPRO7−5.07934.95393.7021.0064.2750CDPRO7−6.44035.18893.1801.0058.8851CAPRO7−5.10334.09894.8991.0064.8952CBPRO7−6.51334.32095.4411.0062.1953CGPRO7−7.33434.41894.1781.0062.1154CPRO7−4.83132.61094.5771.0066.4855OPRO7−4.79832.20093.3991.0051.4656NALA8−4.63231.80695.6191.0059.5857CAALA8−4.39930.39195.4221.0064.4358CBALA8−3.78529.77996.6541.0064.8359CALA8−5.75829.76895.1521.0071.8660OALA8−6.78730.46095.2021.0063.9261NGLN9−5.75828.46694.8601.0072.6662CAGLN9−6.99127.73694.6011.0072.6763CBGLN9−6.66926.37994.0021.0072.9464CGGLN9−7.34026.14792.6711.0079.1165CDGLN9−6.80027.06291.6121.0077.5666OE1GLN9−7.55227.63790.8271.0061.5167NE2GLN9−5.48027.20391.5791.0079.6068CGLN9−7.78527.56495.8991.0066.3669OGLN9−7.20927.37996.9571.0059.9270NGLU10−9.11027.62295.8071.0077.0671CAGLU10−9.98127.52496.9831.0079.6672CBGLU10−11.43927.76396.5831.0073.1873CGGLU10−11.68929.04695.8111.0071.7374CDGLU10−11.29930.28696.5910.0075.2975OE1GLU10−11.72730.41497.7580.0075.5076OE2GLU10−10.57231.13696.0360.0075.5077CGLU10−9.90926.22897.7941.0084.4378OGLU10−10.11326.24799.0211.0086.0779NVAL11−9.62725.11597.1191.0077.7580CAVAL11−9.55823.79997.7771.0080.2381CBVAL11−8.48123.76098.8971.0081.1182CG1VAL11−8.28122.33999.3561.0083.8183CG2VAL11−7.16424.33598.4001.0080.6484CVAL11−10.91423.43298.3981.0073.9585OVAL11−11.52324.24499.1011.0078.5986NALA12−11.37722.20998.1411.0073.2887CAALA12−12.67321.73598.6461.0075.1188CBALA12−12.98020.36798.0571.0073.6789CALA12−12.81721.693100.1801.0082.5090OALA12−11.83521.822100.9261.0083.3891NARG13−14.05521.515100.6391.0083.5992CAARG13−14.37421.466102.0701.0081.6493CBARG13−15.84821.096102.2610.0081.0994CGARG13−16.82422.105101.6720.0081.0895CDARG13−18.26521.722101.9740.0081.2296NEARG13−18.64220.453101.3560.0081.4997CZARG13−18.75020.262100.0450.0081.6298NH1ARG13−18.51221.25999.2030.0081.7299NH2ARG13−19.09819.07299.5740.0081.72100CARG13−13.50120.515102.8991.0080.15101OARG13−13.34719.335102.5681.0077.92102NGLY14−12.93221.050103.9761.0074.94103CAGLY14−12.09320.266104.8631.0075.60104CGLY14−10.94319.494104.2311.0082.74105OGLY14−10.54318.457104.7721.0082.02106NVAL15−10.43119.958103.0871.0073.38107CAVAL15−9.29419.304102.4531.0057.66108CBVAL15−9.25619.546100.9231.0061.09109CG1VAL15−7.88619.156100.3451.0048.64110CG2VAL15−10.33918.743100.2421.0042.61111CVAL15−8.04419.922103.0991.0066.02112OVAL15−7.99421.114103.3731.0067.62113NLEU16−7.05019.088103.3601.0065.90114CALEU16−5.79919.499103.9751.0054.43115CBLEU16−5.38918.458105.0091.0061.09116CGLEU16−4.32918.691106.0741.0054.68117CD1LEU16−4.92119.504107.2521.0055.60118CD2LEU16−3.87717.332106.5771.0053.84119CLEU16−4.78719.476102.8451.0062.00120OLEU16−4.78418.539102.0341.0056.46121NTHR17−3.91720.479102.7971.0044.10122CATHR17−2.91820.533101.7541.0044.95123CBTHR17−3.07421.817100.9051.0048.77124OG1THR17−3.11922.943101.7851.0048.72125CG2THR17−4.38521.796100.0951.0048.05126CTHR17−1.57720.573102.4271.0049.81127OTHR17−1.49520.786103.6281.0053.80128NSER18−0.53120.358101.6471.0037.06129CASER180.81120.431102.1541.0036.34130CBSER181.13219.315103.1691.0045.34131OGSER180.59718.069102.7751.0063.40132CSER181.75820.344100.9941.0047.68133OSER181.40819.88299.9081.0048.83134NLEU192.96220.820101.2481.0047.97135CALEU194.01620.833100.2711.0053.12136CBLEU194.96321.986100.5791.0051.40137CGLEU194.26223.281100.9841.0056.98138CD1LEU195.31724.279101.3661.0058.94139CD2LEU193.40123.81299.8291.0056.85140CLEU194.76819.526100.3771.0053.21141OLEU194.69418.848101.3931.0060.27142NPRO205.49819.16099.3241.0054.31143CDPRO205.51119.78197.9821.0053.88144CAPRO206.26617.91799.3531.0045.95145CBPRO206.79617.78997.9261.0047.82146CGPRO205.80718.60197.0951.0059.03147CPRO207.41218.122100.3351.0053.81148OPRO208.01519.210100.3721.0043.31149NGLY217.71217.066101.0951.0049.09150CAGLY218.78517.081102.0781.0043.34151CGLY218.30017.368103.4881.0035.95152OGLY219.01917.193104.4571.0039.88153NASP227.06317.823103.6121.0043.97154CAASP226.55318.166104.9341.0048.72155CBASP225.38719.170104.8421.0040.75156CGASP225.85620.621104.6191.0065.71157OD1ASP227.09120.902104.7201.0065.01158OD2ASP224.96921.478104.3491.0059.85159CASP226.04216.927105.6041.0052.26160OASP225.76415.944104.9311.0058.05161NSER235.96116.972106.9311.0044.50162CASER235.37615.889107.6851.0037.87163CBSER236.01315.753109.0491.0032.56164OGSER237.25415.069108.9131.0059.71165CSER233.93116.317107.8471.0038.38166OSER233.65417.504107.9081.0051.39167NVAL243.01315.366107.8831.0041.93168CAVAL241.60415.686108.0611.0046.54169CBVAL240.80415.494106.7591.0040.44170CG1VAL24−0.68215.795106.9701.0036.10171CG2VAL241.38316.388105.6781.0035.61172CVAL241.04714.765109.1361.0052.56173OVAL241.37213.588109.1871.0052.11174NTHR250.23315.316110.0221.0053.11175CATHR25−0.33914.497111.0571.0055.62176CBTHR25−0.19815.153112.4621.0034.57177OG1THR251.18115.211112.8051.0048.14178CG2THR25−0.85314.333113.4981.0046.34179CTHR25−1.79714.214110.7281.0064.24180OTHR25−2.69015.060110.8961.0057.60181NLEU26−2.01313.007110.2181.0065.53182CALEU26−3.34712.556109.9141.0062.85183CBLEU26−3.32111.163109.2801.0066.87184CGLEU26−2.60710.950107.9371.0066.83185CD1LEU26−2.5459.446107.6211.0062.48186CD2LEU26−3.31211.695106.8551.0051.71187CLEU26−3.94112.474111.3041.0061.75188OLEW26−3.35911.878112.2091.0062.35189NTHR27−5.08813.095111.4831.0066.71190CATHR27−5.73813.071112.7761.0067.80191CBTHR27−5.72414.460113.3881.0061.33192OG1THR27−4.36214.908113.4391.0063.93193CG2THR27−6.32014.446114.7741.0060.73194CTHR27−7.15112.578112.5911.0063.45195OTHR27−7.99713.259112.0391.0062.43196NCYS28−7.38711.356113.0331.0078.61197CACYS28−8.70210.763112.9191.0079.00198CCYS28−9.70111.657113.6401.0076.66199OCYS28−9.55211.934114.8391.0070.85200CBCYS28−8.7089.389113.5511.0078.96201SGCYS28−10.2228.474113.1701.0091.84202NPRO29−10.72012.136112.9111.0075.64203CDPRO29−10.84311.998111.4481.0075.90204CAPRO29−11.77113.010113.4491.0078.49205CBPRO29−12.57713.381112.2101.0081.18206CGPRO29−11.54213.263111.0791.0083.97207CPRO29−12.64612.327114.5041.0081.15208OPRO29−12.91612.886115.5741.0078.68209NGLY30−13.06111.103114.1981.0078.44210CAGLY30−13.93210.364115.0901.0080.19211CGLY30−13.3279.653116.2781.0079.60212OGLY30−14.0418.931116.9691.0079.98213NVAL31−12.0339.819116.5301.0073.95214CAVAL31−11.4539.146117.6841.0072.63215CBVAL31−10.3598.142117.2921.0065.98216CG1VAL31−9.8477.464118.5301.0065.37217CG2VAL31−10.9147.087116.3361.0069.40218CVAL31−10.88810.144118.6831.0073.31219OVAL31−10.02910.967118.3571.0068.46220NGLU32−11.38710.051119.9111.0068.79221CAGLU32−10.98910.941120.9951.0075.74222CBGLU32−11.59410.444122.3151.0079.09223CGGLU32−13.11710.366122.3070.0078.48224CDGLU32−13.6929.863123.6200.0078.92225OE1GLU32−14.9339.766123.7280.0079.02226OE2GLU32−12.9069.563124.5430.0079.02227CGLU32−9.47611.104121.1391.0074.42228OGLU32−8.70010.232120.7521.0064.96229NPRO33−9.04012.236121.7141.0082.52230CDPRO33−9.81213.381122.2311.0081.58231CAPRO33−7.60212.459121.8861.0085.29232CBPRO33−7.54113.870122.4881.0079.15233CGPRO33−8.84713.985123.2271.0079.56234CPRO33−6.84811.422122.7321.0083.28235OPRO33−5.61611.386122.6991.0086.54236NGLU34−7.55310.573123.4751.0074.84237CAGLU34−6.8339.609124.3031.0085.61238CBGLU34−6.95410.003125.7821.0085.04239CGGLU34−8.3579.891126.3551.0091.95240CDGLU34−9.29810.957125.8281.0091.82241OE1GLU34−9.13912.136126.2211.0097.09242OE2GLU34−10.19210.615125.0221.0085.80243CGLU34−7.1308.108124.1691.0086.02244OGLU34−6.6587.326124.9881.0086.82245NASP35−7.8777.684123.1531.0088.67246CAASP35−8.1676.258123.0261.0089.92247CBASP35−9.3896.014122.1301.0090.08248CGASP35−10.1254.710122.4811.0094.14249OD1ASP35−9.5123.805123.1101.0085.08250OD2ASP35−11.3214.596122.1171.0090.82251CASP35−6.9725.480122.4721.0089.41252OASP35−6.7115.483121.2631.0094.62253NASN36−6.2624.803123.3691.0086.90254CAASN36−5.0964.013123.0031.0078.62255CBASN36−4.4373.448124.2681.0083.78256CGASN36−3.8414.526125.1581.0090.38257OD1ASN36−2.7955.111124.8431.0090.31258ND2ASN36−4.5154.787126.2741.0091.32259CASN36−5.4822.863122.0701.0076.89260OASN36−4.6312.053121.6831.0072.83261NALA37−6.7612.785121.7081.0073.54262CAALA37−7.2241.707120.8321.0075.00263CBALA37−8.7531.796120.6221.0066.82264CALA37−6.4991.712119.4781.0072.68265OALA37−6.0612.757118.9801.0050.84266NTHR38−6.3520.534118.8891.0070.07267CATHR38−5.7040.481117.6021.0074.82268CBTHR38−5.417−0.956117.1461.0071.82269OG1THR38−4.575−1.593118.1091.0082.67270CG2THR38−4.709−0.957115.7761.0071.84271CTHR38−6.6531.113116.6041.0076.60272OTHR38−7.8710.895116.6611.0074.96273NVAL39−6.1001.941115.7251.0071.86274CAVAL39−6.8942.550114.6771.0069.71275CBVAL39−6.8024.101114.6691.0062.14276CG1VAL39−7.4254.639113.4061.0071.47277CG2VAL39−7.5614.684115.8401.0064.03278CVAL39−6.3141.981113.3771.0067.00279OVAL39−5.1171.700113.2771.0063.20280NHIS40−7.1881.758112.4081.0064.17281CAHIS40−6.7691.257111.1201.0059.06282CBHIS40−7.513−0.014110.7841.0055.54283CGHIS40−7.040−1.184111.5701.0060.37284CD2HIS40−7.433−1.670112.7711.0048.89285ND1HIS40−5.956−1.939111.1831.0059.70286CE1HIS40−5.697−2.838112.1141.0055.97287NE2HIS40−6.576−2.694113.0871.0062.89288CHIS40−7.0452.322110.0791.0059.08289OHIS40−8.0643.023110.1271.0053.03290NTRP41−6.1032.442109.1531.0051.62291CATRP41−6.1973.401108.0801.0049.90292CBTRP41−5.0194.375108.1211.0056.38293CGTRP41−4.9475.165109.3521.0051.16294CD2TRP41−5.4726.471109.5531.0060.65295CE2TRP41−5.1606.846110.8801.0059.64296CE3TRP41−6.1737.369108.7431.0057.13297CD1TRP41−4.3594.798110.5331.0059.46298NE1TRP41−4.4855.802111.4531.0062.96299CZ2TRP41−5.5198.073111.4091.0056.70300CZ3TRP41−6.5358.595109.2741.0065.95301CH2TRP41−6.2058.937110.5961.0062.56302CTRP41−6.1952.713106.7311.0057.85303OTRP41−5.4861.712106.4871.0050.96304NVAL42−6.9973.263105.8411.0052.35305CAVAL42−7.0442.725104.5141.0056.46306CBVAL42−8.4242.103104.1941.0057.82307CG1VAL42−8.4231.565102.7671.0054.15308CG2VAL42−8.7240.963105.1711.0050.16309CVAL42−6.7933.944103.6801.0052.80310OVAL42−7.4984.931103.8231.0054.05311NLEU43−5.7533.888102.8531.0055.97312CALEU43−5.4035.003101.9891.0059.83313CBLEU43−3.8935.250102.0011.0055.98314CGLEU43−3.3295.989100.7691.0053.52315CD1LEU43−3.8767.424100.6831.0041.04316CD2LEU43−1.7785.961100.8341.0048.18317CLEU43−5.8474.704100.5691.0064.86318OLEU43−5.5373.630100.0271.0054.65319NARG44−6.5545.66899.9741.0065.34320CAARG44−7.0515.54798.5971.0071.53321CBARG44−8.5945.50598.5971.0068.85322CGARG44−9.1564.29499.3681.0077.94323CDARG44−10.6694.04399.1701.0078.71324NEARG44−11.0752.76399.7691.0074.34325CZARG44−11.4962.595101.0281.0079.10326NH1ARG44−11.5943.625101.8731.0055.57327NH2ARG44−11.7911.371101.4591.0074.96328CARG44−6.5436.67897.6971.0062.72329OARG44−7.1687.72897.5981.0065.18330NLYS45−5.4026.45397.0561.0058.94331CALYS45−4.7967.43396.1571.0060.24332CBLYS45−3.4556.90295.6531.0053.69333CGLYS45−2.3956.71696.7671.0062.98334CDLYS45−1.0516.22696.2021.0052.29335CELYS45−0.0026.00797.3111.0068.99336NZLYS451.3985.81296.7911.0058.39337CLYS45−5.7197.77794.9771.0066.70338OLYS45−6.5947.00094.6341.0060.79339NPRO46−5.5398.96194.3491.0076.24340CDPRO46−4.86710.14494.9101.0072.65341CAPRO46−6.3959.36093.2181.0074.98342CBPRO46−6.28510.88393.2101.0071.65343CGPRO46−5.89311.21994.6281.0080.40344CPRO46−6.0758.78991.8451.0079.19345OPRO46−6.7759.09590.8801.0082.19346NALA47−5.0257.97891.7411.0082.33347CAALA47−4.6567.39890.4501.0084.51348CBALA47−3.2326.85290.5011.0070.10349CALA47−5.6336.29090.0531.0086.16350OALA47−6.0315.47390.8901.0087.14351NALA48−6.0206.28288.7771.0087.33352CAALA48−6.9415.27888.2431.0087.92353CBALA48−7.1975.53786.7471.0082.34354CALA48−6.3433.88488.4601.0090.04355OALA48−5.1173.70588.3871.0082.32356NGLY49−7.2082.90288.7281.0095.77357CAGLY49−6.7311.55188.9891.0093.69358CGLY49−5.6501.64790.0501.0097.85359OGLY49−4.4951.28989.8071.0093.57360NSER50−6.0432.14591.2251.00100.32361CASER50−5.1402.35492.3531.0098.95362CBSER50−5.7753.33893.3481.0099.14363OGSER50−4.8323.78694.3101.0099.30364CSER50−4.6921.08593.0821.0099.10365OSER50−4.4830.04792.4621.0096.91366NHIS51−4.5691.17394.4041.00100.90367CAHIS51−4.0780.05695.2101.00102.38368CBHIS51−2.582−0.12594.9121.00103.47369CGHIS51−1.8071.16494.9181.00103.87370CD2HIS51−1.6362.10295.8841.0098.07371ND1HIS51−1.1041.61693.8201.00103.86372CE1HIS51−0.5332.77394.1091.00101.19373NE2HIS51−0.8413.09095.3551.0097.91374CHIS51−4.2570.27496.7221.0096.96375OHIS51−3.2660.31597.4551.0095.98376NPRO52−5.5110.38097.2061.0091.94377CDPRO52−6.727−0.01096.4701.0093.73378CAPRO52−5.8220.59798.6291.0087.96379CBPRO52−7.164−0.10598.7991.0088.73380CGPRO52−7.8330.18997.5181.0090.59381CPRO52−4.7750.10199.6251.0079.22382OPRO52−4.414−1.07199.6271.0076.14383NSER53−4.2841.018100.4561.0080.39384CASER53−3.2830.706101.4761.0074.21385CBSER53−2.3231.886101.6551.0084.01386OGSER53−1.2751.858100.7001.0088.72387CSER53−3.9630.404102.8071.0068.27388OSER53−4.9791.023103.1391.0064.18389NARG54−3.406−0.549103.5571.0064.51390CAARG54−3.965−0.931104.8581.0067.89391CBARG54−4.679−2.278104.7341.0068.57392CGARG54−5.920−2.182103.8661.0063.14393CDARG54−6.573−3.523103.5251.0069.89394NEARG54−7.644−3.310102.5511.0077.10395CZARG54−8.744−2.604102.8051.0076.59396NH1ARG54−8.914−2.069104.0041.0074.58397NH2ARG54−9.650−2.394101.8561.0070.56398CARG54−2.924−0.943105.9921.0060.53399OARG54−2.051−1.792106.0501.0053.27400NTRP55−3.0390.021106.9031.0059.18401CATRP55−2.0810.149107.9901.0060.59402CBTRP55−1.2791.462107.8241.0050.36403CGTRP55−0.6491.696106.4471.0049.83404CD2TRP55−0.3542.965105.8391.0042.60405CE2TRP550.2832.696104.5971.0050.18406CE3TRP55−0.5624.294106.2211.0034.80407CD1TRP55−0.1990.752105.5821.0039.67408NE1TRP550.3601.335104.4711.0048.41409CZ2TRP550.7243.716103.7301.0035.19410CZ3TRP55−0.1225.315105.3681.0040.98411CH2TRP550.5155.016104.1381.0052.09412CTRP55−2.7410.159109.3781.0064.36413OTRP55−3.9370.467109.5041.0062.24414NALA56−1.953−0.182110.4061.0056.70415CAALA56−2.417−0.132111.7971.0056.96416CBALA56−2.440−1.510112.4311.0043.05417CALA56−1.5560.794112.6561.0061.79418OALA56−0.3050.688112.6741.0052.89419NGLY57−2.2451.693113.3661.0054.08420CAGLY57−1.5762.606114.2781.0061.77421CGLY57−2.2622.639115.6471.0066.52422OGLY57−3.4522.320115.7651.0066.82423NMET58−1.5293.004116.6951.0057.96424CAMET58−2.1443.084118.0251.0063.85425CBMET58−1.1162.750119.1111.0055.90426CGMET58−1.6662.694120.5161.0057.48427SDMET58−0.4022.166121.7461.0063.83428CEMET58−1.4291.785123.1731.0060.54429CMET58−2.7034.503118.2341.0067.72430OMET58−1.9875.506118.0901.0064.22431NGLY59−3.9894.586118.5521.0067.18432CAGLY59−4.5945.880118.7571.0064.80433CGLY59−5.0406.544117.4621.0071.90434OGLY59−5.0065.932116.3891.0062.06435NARG60−5.4257.818117.5701.0068.35436CAARG60−5.9408.574116.4431.0061.14437CBARG60−6.9529.606116.9511.0062.34438CGARG60−6.36210.882117.5301.0059.80439CDARG60−7.32411.487118.5611.0071.88440NEARG60−7.16712.933118.7521.0074.16441CZARG60−7.95213.859118.1971.0069.68442NH1ARG60−8.97613.511117.4071.0046.24443NH2ARG60−7.69715.144118.4231.0064.41444CARG60−4.9329.263115.5451.0058.42445OARG60−5.3149.766114.5031.0065.21446NARG61−3.6619.292115.9341.0050.09447CAARG61−2.6429.954115.1321.0047.76448CBARG61−1.71710.835116.0061.0057.06449CGARG61−2.35312.125116.5381.0057.54450CDARG61−1.30113.082117.1031.0062.78451NEARG61−1.84214.418117.4361.0076.54452CZARG61−2.63515.164116.6531.0072.22453NH1ARG61−3.02314.729115.4651.0071.05454NH2ARG61−3.02816.371117.0451.0072.76455CARG61−1.7799.006114.3221.0050.59456OARG61−1.5327.875114.7011.0043.89457NLEU62−1.3059.499113.1921.0047.93458CALEU62−0.4718.718112.3171.0047.86459CBLEU62−1.3097.859111.3661.0051.27460CGLEU62−0.4787.072110.3521.0055.21461CD1LEU620.3436.018111.0791.0049.34462CD2LEU62−1.4026.438109.3101.0062.13463CLEU620.2649.758111.5421.0040.31464OLEU62−0.33010.553110.8431.0047.28465NLEU631.5749.738111.6671.0045.58466CALEU632.41010.697111.0101.0048.22467CBLEU633.60310.952111.9151.0052.66468CGLEU634.48012.194111.8081.0063.27469CD1LEU635.35912.129110.5601.0059.94470CD2LEU633.57413.406111.8291.0059.24471CLEU632.86410.227109.6221.0057.85472OLEU633.1479.039109.4041.0052.19473NLEU642.91111.160108.6771.0049.48474CALEU643.38110.845107.3431.0048.87475CBLEU642.37511.278106.2741.0045.23476CGLEU640.93910.742106.3871.0046.41477CD1LEU640.09911.323105.3031.0052.43478CD2LEU640.9319.223106.2611.0050.15479CLEU644.68711.599107.1851.0050.59480OLEU644.73912.821107.2561.0050.61481NARG655.75210.846107.0191.0040.29482CAARG657.05211.416106.8301.0054.71483CBARG658.09810.342107.1091.0048.29484CGARG659.54010.716106.8671.0065.38485CDARG6510.3819.440106.7781.0061.84486NEARG6511.7709.692106.3951.0076.35487CZARG6512.6198.744105.9891.0084.78488NH1ARG6512.2277.467105.9091.0082.41489NH2ARG6513.8639.068105.6511.0078.69490CARG657.18411.926105.3861.0052.15491OARG656.57211.397104.4641.0051.21492NSER667.97612.976105.2371.0048.47493CASER668.31913.597103.9751.0045.49494CBSER669.77913.213103.6741.0045.32495OGSER6610.16813.555102.3641.0070.17496CSER667.37713.291102.7951.0042.85497OSER667.80112.838101.7411.0041.58498NVAL676.09313.562103.0001.0040.45499CAVAL675.05713.369102.0061.0039.39500CBVAL673.79714.105102.4201.0037.84501CG1VAL672.73413.899101.4221.0061.68502CG2VAL673.33513.584103.7241.0034.69503CVAL675.46413.853100.6031.0052.57504OVAL676.25514.788100.4311.0044.66505NGLN684.88113.22299.5961.0041.81506CAGLN685.20713.53798.2181.0045.96507CBGLN686.02012.38297.6001.0040.84508CGGLN687.42312.19098.2561.0041.84509CDGLN688.26813.45598.1751.0050.56510OE1GLN688.26214.13197.1501.0050.20511NE2GLN688.99813.77999.2491.0047.56512CGLN683.90613.72897.5051.0039.57513OGLN682.86213.43898.0651.0044.24514NLEU693.96314.23196.2801.0042.17515CALEU692.75614.46895.5001.0047.79516CBLEU693.13814.97194.0901.0047.86517CGLEU693.84316.35294.0231.0046.84518CD1LEU694.29616.60592.6371.0037.74519CD2LEU692.92917.48794.4781.0039.16520CLEU691.90713.19895.4521.0050.47521OLEU690.69113.23795.6721.0044.43522NHIS702.56812.06495.2271.0048.13523CAHIS701.88410.77595.1731.0047.08524CBHIS702.8559.70094.6601.0051.12525CGHIS704.0689.52195.5201.0057.96526CD2HIS704.1989.31896.8581.0050.12527ND1HIS705.3539.64395.0261.0061.23528CE1HIS706.2209.53996.0201.0051.08529NE2HIS705.5459.34597.1421.0057.79530CHIS701.28710.32096.5141.0051.84531OHIS700.7139.24596.5841.0054.76532NASP711.45511.09197.5901.0047.38533CAASP710.86610.68898.8631.0047.33534CBASP711.70611.097100.0891.0042.05535CGASP712.99210.269100.2281.0049.63536OD1ASP712.9239.019100.2141.0052.98537OD2ASP714.08210.866100.3531.0051.23538CASP71−0.47611.35198.9281.0050.80539OASP71−1.21111.18899.9041.0049.43540NSER72−0.80512.12997.8981.0052.17541CASER72−2.13012.73697.8841.0054.20542CBSER72−2.37613.59696.6431.0039.77543OGSER72−1.77314.85696.7631.0051.83544CSER72−3.05511.55697.8361.0052.86545OSER72−2.73210.53797.2041.0057.51546NGLY73−4.19211.69398.5071.0053.54547CAGLY73−5.16710.63298.5241.0055.41548CGLY73−6.17510.78999.6381.0056.10549OGLY73−6.26911.823100.2891.0057.39550NASN74−6.9549.74599.8541.0062.16551CAASN74−7.9619.775100.8911.0068.78552CBASN74−9.3309.477100.2701.0073.41553CGASN74−10.05810.74299.8471.0071.59554OD1ASN74−10.67711.408100.6681.0077.51555ND2ASN74−9.96311.09298.5751.0081.67556CASN74−7.6098.792102.0021.0069.26557OASN74−7.5477.576101.7861.0064.42558NTYR75−7.3329.327103.1871.0067.46559CATYR75−6.9858.475104.3111.0060.02560CBTYR75−5.8629.092105.1701.0052.32561CGTYR75−4.5719.204104.4101.0041.71562CD1TYR75−4.44310.110103.3691.0033.39563CE1TYR75−3.29910.160102.6001.0038.06564CD2TYR75−3.5128.342104.6661.004565CE2TYR75−2.3508.378103.9041.0043.61566CZTYR75−2.2469.301102.8701.0044.97567OHTYR75−1.0689.415102.1601.0040.63568CTYR75−8.2078.260105.1471.0059.98569OTYR75−8.6269.154105.8791.0057.57570NSER76−8.7787.063105.0121.0069.14571CASER76−9.9526.661105.7781.0074.53572CBSER76−10.8205.689104.9851.0068.66573OGSER76−11.6046.389104.0391.0075.07574CSER76−9.5426.025107.1051.0077.27575OSER76−8.7665.053107.1811.0068.93576NCYS77−10.0836.606108.1591.0078.72577CACYS77−9.8036.158109.4991.0086.89578CCYS77−10.9245.250110.0461.0083.28579OCYS77−12.0955.642110.0741.0083.09580CBCYS77−9.6047.394110.3851.0085.54581SGCYS77−9.4846.913112.1161.00108.50582NTYR78−10.5664.035110.4661.0081.54583CATYR78−11.5513.095111.0121.0078.92584CBTYR78−11.6681.833110.1311.0072.07585CGTYR78−11.9672.094108.6701.0070.82586CD1TYR78−10.9462.389107.7671.0077.93587CE1TYR78−11.2202.626106.4191.0076.94588CD2TYR78−13.2702.048108.1881.0070.90589CE2TYR78−13.5602.290106.8481.0065.48590CZTYR78−12.5322.575105.9651.0080.82591OHTYR78−12.8092.793104.6261.0088.20592CTYR78−11.2352.669112.4561.0080.75593OTYR78−10.1032.263112.7821.0070.45594NARG79−12.2472.782113.3141.0079.63595CAARG79−12.1432.398114.7261.0085.33596CBARG79−12.5423.575115.6201.0082.02597CGARG79−12.6643.218117.0971.0084.64598CDARG79−11.3262.868117.7401.0083.04599NEARG79−11.5102.235119.0511.0088.00600CZARG79−12.1992.763120.0641.0082.53601NH1ARG79−12.7833.954119.9481.0079.11602NH2ARG79−12.3292.080121.1881.0077.31603CARG79−13.0641.193114.9951.0085.02604OARG79−14.2111.351115.4281.0078.72605NALA80−12.545−0.006114.7261.0087.23606CAALA80−13.293−1.252114.8971.0084.46607CBALA80−13.868−1.338116.3181.0079.94608CALA80−14.416−1.346113.8581.0085.99609OALA80−14.426−2.249113.0181.0081.86610NGLY81−15.341−0.389113.9061.0089.93611CAGLY81−16.470−0.379112.9881.0092.08612CGLY81−16.174−0.022111.5451.0087.28613OGLY81−15.677−0.851110.7811.0085.96614NARG82−16.5211.205111.1690.0092.45615CAARG82−16.2911.710109.8200.0095.52616CBARG82−17.3311.149108.8380.0094.95617CGARG82−17.003−0.248108.3130.0094.58618CDARG82−18.068−0.768107.3550.0094.22619NEARG82−17.744−2.101106.8470.0093.88620CZARG82−18.537−2.814106.0530.0093.71621NH1ARG82−19.709−2.325105.6690.0093.58622NH2ARG82−18.160−4.017105.6410.0093.58623CARG82−16.3103.239109.8000.0098.67624OARG82−15.3773.867109.3020.0098.14625NPRO83−17.3733.864110.3371.00103.39626CDPRO83−18.7003.327110.7021.00105.19627CAPRO83−17.4005.331110.3291.00104.87628CBPRO83−18.9015.642110.3721.00103.05629CGPRO83−19.4284.559111.2451.00101.61630CPRO83−16.6206.010111.4731.0099.40631OPRO83−17.0826.064112.6141.0097.91632NALA84−15.4356.525111.1461.0097.87633CAALA84−14.5787.228112.1061.0094.69634CBALA84−13.4336.315112.5741.0091.69635CALA84−14.0218.485111.4201.0091.93636OALA84−13.4279.356112.0621.0080.79637NGLY85−14.2178.552110.1021.0089.61638CAGLY85−13.7879.702109.3251.0076.77639CGLY85−12.7269.461108.2731.0074.02640OGLY85−12.1438.374108.1811.0078.43641NTHR86−12.50610.482107.4491.0066.80642CATHR86−11.46910.450106.4291.0061.26643CBTHR86−12.02110.241105.0101.0062.31644OG1THR86−12.5328.914104.8821.0067.30645CG2THR86−10.91110.467103.9691.0054.98646CTHR86−10.77911.807106.4461.0069.17647OTHR86−11.40012.828106.7421.0069.62648NVAL87−9.48611.807106.1461.0072.13649CAVAL87−8.70613.029106.0791.0065.24650CBVAL87−7.40512.918106.9321.0066.87651CG1VAL87−6.52114.144106.7291.0053.62652CG2VAL87−7.75212.758108.3851.0069.44653CVAL87−8.31413.156104.6101.0062.08654OVAL87−7.52012.356104.1231.0068.19655NHIS88−8.86014.141103.9031.0055.26656CAHIS88−8.51014.342102.4871.0055.55657CBHIS88−9.62915.079101.7501.0045.86658CGHIS88−10.96114.406101.8711.0068.29659CD2HIS88−11.78613.883100.9331.0070.79660ND1HIS88−11.57214.178103.0891.0072.48661CE1HIS88−12.71513.543102.8941.0074.65662NE2HIS88−12.86813.352101.5961.0077.31663CHIS88−7.21715.137102.3741.0054.27664OHIS88−7.18316.320102.6871.0056.68665NLEU89−6.16314.476101.9201.0051.05666CALEU89−4.84815.089101.7681.0055.76667CBLEU89−3.78214.210102.4001.0044.10668CGLEU89−2.33914.683102.2331.0063.17669CD1LEU89−2.13615.929103.0691.0052.92670CD2LEU89−1.35413.579102.6571.0049.65671CLEU89−4.41915.381100.3351.0060.30672OLEU89−4.19714.48199.5221.0063.48673NLEU90−4.26216.658100.0381.0062.02674CALEU90−3.82917.05698.7211.0053.90675CBLEU90−4.86818.02398.1351.0057.62676CGLEU90−4.46418.88796.9471.0065.94677CD1LEU90−4.01117.97495.8461.0049.14678CD2LEU90−5.61619.82096.5361.0051.17679CLEU90−2.43417.68898.8221.0051.80680OLEU90−2.30918.82399.2421.0054.93681NVAL91−1.39116.92898.4841.0044.81682CAVAL91−0.02617.44298.5181.0043.52683CBVAL91616.30398.6841.0037.98684CG1VAL912.41016.85198.8591.0034.55685CG2VAL910.67415.49199.9021.0041.03686CVAL910.15718.16797.1881.0050.08687OVAL91−0.07717.59496.1241.0054.83688NASP920.56119.43397.2351.0050.26689CAASP920.66120.20095.9921.0049.93690CBASP92−0.60821.06895.8331.0044.53691CGASP92−0.87821.49194.3821.0050.72692OD1ASP92−0.00121.31593.5091.0060.21693OD2ASP92−1.98022.01494.1181.0049.19694CASP921.91421.04695.9041.0043.21695OASP922.55821.32896.9031.0050.41696NVAL932.25421.44794.6931.0048.54697CAVAL933.45722.23594.4221.0056.64698CBVAL934.11721.76393.0621.0065.00699CG1VAL935.48222.36792.8921.0057.14700CG2VAL934.16620.24792.9721.0057.19701CVAL933.09423.70794.2241.0054.91702OVAL931.96024.00793.8481.0048.69703NPRO944.02724.63794.5451.0057.90704CDPRO944.92624.39995.6901.0061.07705CAPRO943.84826.09394.3761.0051.73706CBPRO944.91226.69395.2781.0046.67707CGPRO944.92325.77996.4071.0048.86708CPRO944.17126.33892.9101.0046.65709OPRO945.21025.88392.4151.0042.39710NPRO953.30427.07692.1911.0040.25711CDPRO952.08827.78792.5921.0030.77712CAPRO953.61727.29090.7721.0039.13713CBPRO952.43128.09590.2491.0029.26714CGPRO951.36127.90491.2951.0041.20715CPRO954.92327.96790.4621.0043.75716OPRO955.50928.66391.2901.0050.53717NGLU965.40627.71889.2521.0054.99718CAGLU966.61028.39088.7881.0047.41719CBGLU967.14127.70387.5441.0057.60720CGGLU967.31126.22087.6911.0048.45721CDGLU967.90125.59586.4351.0068.71722OE1GLU968.41026.35785.5661.0062.79723OE2GLU967.86024.34286.3271.0070.89724CGLU966.04929.77188.4101.0047.41725OGLU974.82729.89288.1711.0039.12726NGLU976.89230.80888.3691.0042.01727CAGLU976.36132.12988.0051.0044.48728CBGLU977.29633.27688.3841.0035.88729CGGLU976.64334.63288.0481.0052.12730CDGLU977.16035.80688.8631.0056.72731OE1GLU977.25735.69990.1091.0062.38732OE2GLU977.44336.85588.2471.0060.92733CGLU976.13732.11886.5081.0033.61734OGLU977.02931.83985.7631.0030.35735NPRO984.93232.43786.0451.0039.63736CDPRO983.61932.75086.6031.0035.46737CAPRO984.88632.35484.5831.0039.23738CBPRO983.40432.54584.2711.0037.32739CGPRO982.69932.17385.5271.0042.18740CPRO985.80333.33883.8321.0043.37741OPRO986.21334.37284.3751.0034.56742NGLN996.17332.95582.6131.0037.43743CAGLN997.00033.79281.7261.0046.36744CBGLN998.28233.08081.2481.0049.33745CGGLN999.37132.84882.2761.0042.67746CDGLN999.52933.99983.1981.0056.22747OE1GLN999.01533.96684.3211.0055.11748NE2GLN9910.22335.05082.7431.0050.83749CGLN996.05733.90680.5401.0048.35750OGLN995.96633.00479.7301.0048.46751NLEU1005.36935.02180.4321.0048.85752CALEU1004.37235.16879.3961.0046.80753CBLEU1003.25936.12579.8971.0033.22754CGLEU1001.99336.40779.0781.0046.78755CD1LEU1001.17435.16778.9841.0040.64756CD2LEU1001.17537.52579.7321.0036.60757CLEU1004.89335.60578.0541.0040.31758OLEU1005.72436.49277.9491.0046.96759NSER1014.38034.96177.0221.0039.64760CASER1014.74035.29275.6521.0041.21761CBSER1015.49934.14374.9721.0043.96762OGSER1015.89534.52073.6591.0059.52763CSER1013.40835.49074.9741.0034.29764OSER1012.59134.54374.9231.0041.42765NCYS1023.15036.72074.5141.0037.29766CACYS1021.88737.00673.8031.0046.92767CCYS1022.20937.30772.3281.0049.40768OCYS1023.24237.90672.0411.0041.30769CBCYS1021.15738.20174.3971.0045.80770SGCYS1020.36238.04576.0211.0047.25771NPHE1031.34036.89571.4011.0039.80772CAPHE1031.61337.13369.9831.0037.94773CBPHE1032.63836.11469.4771.0031.69774CGPHE1032.14734.65969.5461.0036.43775CD1PHE1031.41034.10968.5111.0031.53776CD2PHE1032.38933.88170.6571.0035.41777CE1PHE1030.92132.80968.5861.0043.33778CE2PHE1031.90432.58070.7411.0046.34779CZPHE1031.16332.03969.7001.0040.86780CPHE1030.36937.03369.1291.0042.42781OPHE103−0.67036.55669.5851.0045.54782NARG1040.48137.48967.8831.0036.44783CAARG104−0.61037.42366.9371.0030.87784CBARG104−1.49938.66567.1081.0037.76785CGARG104−2.45038.93765.9721.0035.03786CDARG104−3.83739.23166.3991.0034.41787NEARG104−4.16840.64766.4141.0038.04788CZARG104−5.41841.09566.5091.0047.99789NH1ARG104−6.41540.23566.5741.0041.92790NH2ARG104−5.67742.39166.6051.0049.53791CARG104−0.10437.24465.4761.0036.29792OARG1040.47338.14064.8711.0034.66793NLYS105−0.32336.05064.9431.0038.88794CALYS1050.06735.70663.5741.0048.56795CBLYS105−0.08434.19763.3891.0038.91796CGLYS1050.77133.36764.3491.0037.33797CDLYS1052.24133.76064.2331.0054.88798CELYS1053.10333.17965.3591.0065.25799NZLYS1053.40031.74265.2011.0049.35800CLYS105−0.63436.48262.4111.0049.69801OLYS1050.03336.91561.4951.0053.72802NSER106−1.95436.66262.4341.0046.97803CASER106−2.61837.42761.3751.0046.49804CBSER106−3.40136.51560.4461.0048.20805OGSER106−4.00635.53761.2091.0047.79806CSER106−3.53338.47161.9801.0048.92807OSER106−3.91438.35263.1371.0049.81808NPRO107−3.90939.50461.2021.0042.09809CDPRO107−3.72239.61059.7441.0031.34810CAPRO107−4.77740.58561.6911.0041.24811CBPRO107−4.97141.48460.4591.0044.71812CGPRO107−3.85541.09859.5371.0046.51813CPRO107−6.12440.20062.3021.0051.12814OPRO107−6.66340.93763.1651.0046.09815NLEU108−6.65439.05861.8561.0048.55816CALEU108−7.95838.58962.3081.0057.21817CBLEU108−8.82738.17861.1061.0046.02818CGLEU108−9.97339.09060.6041.0054.59819CD1LEU108−9.83740.55361.0701.0039.70820CD2LEU108−10.00838.98759.0821.0041.31821CLEU108−7.95837.46863.3241.0058.78822OLEU108−9.01237.17363.8841.0062.46823NSER109−6.81336.83063.5641.0050.34824CASER109−6.77835.76764.5681.0040.58825CBSER109−5.64034.79664.3201.0043.38826OGSER109−4.40535.47564.2721.0070.48827CSER109−6.62336.41965.9281.0041.40828OSER109−6.26837.60366.0391.0036.98829NASN110−6.91935.66066.9711.0039.51830CAASN110−6.83636.21368.3231.0042.42831CBASN110−7.55735.32769.3191.0051.49832CGASN110−9.03535.41869.2001.0052.17833OD1ASN110−9.59336.48968.9501.0052.02834ND2ASN110−9.68634.30769.3991.0045.26835CASN110−5.45036.37068.8401.0040.70836OASN110−4.54235.69268.3891.0034.47837NVAL111−5.28437.27369.8041.0044.99838CAVAL111−3.97937.41070.4321.0030.99839CBVAL111−3.88538.62771.3171.0031.57840CG1VAL111−2.62738.52172.1911.0043.27841CG2VAL111−3.80539.90070.4361.0031.41842CVAL111−3.85536.16271.2811.0031.76843OVAL111−4.80435.76371.9421.0035.82844NVAL112−2.72235.48971.1841.0041.08845CAVAL112−2.49234.31472.0001.0035.85846CBVAL112−2.01433.10971.1961.0043.27847CG1VAL112−1.96331.91272.0981.0036.63848CG2VAL112−2.95332.82470.0221.0048.26849CVAL112−1.43234.66073.0311.0038.53850OVAL112−0.43035.32572.7361.0043.19851NCYS113−1.65334.21574.2611.0046.31852CACYS113−0.69134.46875.3171.0041.85853CCYS113−0.42433.13675.9661.0042.26854OCYS113−1.34932.46776.4121.0043.58855CBCYS113−1.25835.44876.3151.0040.17856SGCYS113−1.47237.15275.7501.0047.14857NGLU1140.85132.75675.9971.0040.00858CAGLU1141.25531.46776.5361.0047.49859CBGLU1141.46130.46975.3791.0038.32860CGGLU1142.39831.01774.3381.0047.09861CDGLU1142.87429.99573.3101.0062.17862OE1GLU1142.03029.22972.7721.0054.04863OE2GLU1144.10729.98573.0311.0062.75864CGLU1142.52331.48877.4081.0046.47865OGLU1143.25232.47977.4791.0043.21866NTRP1152.75730.34578.0421.0039.36867CATRP1153.86730.09278.9251.0038.92868CBTRP1153.58530.58380.3571.0032.46869CGTRP1154.55829.97681.3731.0033.00870CD2TRP1154.20129.39382.6281.0019.31871CE2TRP1155.39829.13683.3381.0025.04872CE3TRP1152.98929.08883.2281.0036.87873CD1TRP1155.95630.01881.3491.0022.44874NE1TRP1156.45029.51682.5421.0031.17875CZ2TRP1155.40128.59684.6031.0046.54876CZ3TRP1152.99528.53984.5031.0051.37877CH2TRP1154.18828.30485.1761.0036.41878CTRP1153.95428.58879.0011.0043.57879OTRP1152.94627.91379.2651.0054.50880NGLY1165.14728.05778.7931.0048.25881CAGLY1165.31926.62578.9061.0034.75882CGLY1166.13626.41080.1701.0045.74883OGLY1167.30226.81580.2061.0040.47884NPRO1175.55225.83381.2391.0034.02885CDPRO1174.11325.58381.4251.0032.14886CAPRO1176.29625.58982.4801.0038.56887CBPRO1175.27724.86183.3481.0044.21888CGPRO1173.96625.48582.9261.0038.49889CPRO1177.51924.72582.2051.0042.91890OPRO1177.55024.02081.2111.0052.27891NARG1188.51724.79583.0801.0052.30892CAARG1189.73823.99982.9621.0060.14893CBARG11810.83524.51383.9141.0053.02894CGARG11811.45425.82383.5111.0067.97895CDARG11812.62026.21984.4081.0076.97896NEARG11812.26826.43185.8181.0080.60897CZARG11811.32627.26986.2521.0086.10898NH1ARG11810.60827.98885.3881.0081.53899NH2ARG11811.11627.40787.5561.0084.72900CARG1189.43922.54383.3221.0059.67901OARG11810.21621.65582.9921.0065.23902NSER1198.32222.32084.0091.0051.40903CASER1197.87120.98584.4421.0060.61904CBSER1198.23520.70685.9221.0061.14905OGSER1199.63620.61486.1561.0061.55906CSER1196.34620.90484.3141.0057.51907OSER1195.65321.92684.3301.0056.20908NTHR1205.82919.68784.2081.0053.08909CATHR1204.39319.46584.0791.0055.91910CBTHR1204.13618.02783.6571.0043.60911OG1THR1204.65317.89682.3371.0060.80912CG2THR1202.67317.67483.6411.0031.27913CTHR1203.64419.83985.3501.0057.63914OTHR1203.78419.21886.4111.0049.94915NPRO1212.83720.89285.2531.0062.50916CDPRO1212.47921.65684.0471.0064.06917CAPRO1212.07321.35986.3981.0063.14918CBPRO1211.44222.64585.8741.0052.64919CGPRO1211.17422.30984.4771.0069.13920CPRO1211.05920.34686.8511.0049.47921OPRO1210.51619.61086.0411.0058.72922NSER1220.80920.35088.1591.0055.49923CASER122−0.14719.46488.8201.0048.82924CBSER122−0.18719.77890.3311.0055.95925OGSER122−1.14120.79290.6641.0054.54926CSER122−1.53219.64088.2111.0044.78927OSER122−1.73720.54787.4421.0059.65928NLEU123−2.49218.79488.5671.0054.52929CALEU123−3.83118.91388.0001.0057.07930CBLEU123−4.56317.55788.0541.0055.73931CGLEU123−3.89616.42087.2471.0067.94932CD1LEU123−4.74015.15287.3631.0056.21933CD2LEU123−3.71716.82685.7641.0038.88934CLEU123−4.65119.99588.6911.0056.28935OLEU123−5.73820.36188.2281.0059.85936NTHR124−4.14520.50389.8061.0052.98937CATHR124−4.84921.56790.5141.0053.57938CBTHR124−4.82821.37392.0611.0056.34939OG1THR124−3.48221.12892.4901.0053.00940CG2THR124−5.74420.25592.4931.0044.62941CTHR124−4.21022.93990.2231.0056.80942OTHR124−4.42223.89190.9891.0057.07943NTHR125−3.44023.03889.1341.0043.79944CATHR125−2.76524.28988.7631.0046.89945CBTHR125−1.30724.03888.3121.0047.29946OG1THR125−0.56223.42689.3701.0043.72947CG2THR125−0.65625.31987.9261.0041.17948CTHR125−3.46324.98987.6071.0048.99949OTHR125−3.29024.58286.4761.0064.28950NLYS126−4.25126.02587.8801.0038.84951CALYS126−4.93626.75386.8201.0045.34952CBLYS126−6.40527.01487.1841.0050.48953CGLYS126−7.16325.77387.6171.0058.30954CDLYS126−6.99224.65886.5971.0061.76955CELYS126−7.61923.34987.0571.0064.05956NZLYS126−7.66322.40285.9031.0068.15957CLYS126−4.26828.08586.5051.0044.50958OLYS126−3.58528.68187.3471.0043.84959NALA127−4.48128.56485.2871.0045.08960CAALA127−3.90229.83784.8601.0037.24961CBALA127−2.82129.60083.8131.0042.13962CALA127−4.99930.67984.2761.0036.63963OALA127−5.96930.16083.7661.0054.88964NVAL128−4.87331.98884.3671.0042.57965CAVAL128−5.87732.84083.7711.0038.06966CBVAL128−7.00933.26684.7701.0046.10967CG1VAL128−7.77132.03885.3131.0036.01968CG2VAL128−6.41034.06685.8871.0039.87969CVAL128−5.12134.07783.3401.0045.97970OVAL128−4.07334.41583.8881.0037.87971NLEU129−5.66634.75782.3521.0039.37972CALEU129−5.04835.94381.8551.0035.49973CBLEU129−5.13635.96380.3191.0028.09974CGLEU129−4.17436.96679.6701.0048.04975CD1LEU129−2.72636.39579.7011.0031.77976CD2LEU129−4.63837.27578.2341.0042.39977CLEU129−5.78737.13482.4691.0038.11978OLEU129−7.00837.24382.3401.0048.40979NLEU130−5.05037.99483.1751.0031.69980CALEU130−5.61539.19483.7571.0035.11981CBLEU130−4.86639.59885.0321.0033.65982CGLEU130−4.84738.61286.2161.0046.79983CD1LEU130−4.50839.39087.4241.0040.22984CD2LEU130−6.19537.94286.4471.0052.37985CLEU130−5.36740.22382.6651.0039.15986OLEU130−4.23140.34782.1841.0047.42987NVAL131−6.42740.91582.2541.0037.89988CAVAL131−6.37041.93081.2131.0033.95989CBVAL131−7.23741.50179.9841.0033.95990CG1VAL131−7.35042.66178.8931.0027.58991CG2VAL131−6.63140.23879.3731.0031.78992CVAL131−6.84543.30981.6881.0043.64993OVAL131−7.89043.43482.3221.0049.13994NARG132−6.06044.33381.3851.0040.64995CAARG132−6.41045.70981.7061.0043.19996CBARG132−5.43846.34182.6831.0036.47997CGARG132−5.96747.62383.3061.0063.76998CDARG132−4.95048.32284.2361.0068.20999NEARG132−4.67847.61185.4871.0075.941000CZARG132−5.58447.37986.4361.0079.151001NH1ARG132−6.83947.79886.2801.0076.471002NH2ARG132−5.23446.74387.5501.0069.951003CARG132−6.35146.45080.3691.0047.071004OARG132−5.30346.60179.7561.0039.171005NLYS133−7.52246.87679.9281.0048.961006CALYS133−7.72447.57478.6831.0044.691007CBLYS133−8.94846.95578.0111.0044.541008CGLYS133−9.42647.64976.7511.0062.071009CDLYS133−10.55246.83976.1281.0062.691010CELYS133−11.03147.43474.8331.0061.891011NZLYS133−12.06846.53474.2261.0071.971012CLYS133−7.92249.07978.9211.0045.451013OLYS133−8.67549.48779.7881.0049.881014NPHE134−7.21249.88378.1511.0041.561015CAPHE134−7.27551.33178.2231.0043.831016CBPHE134−5.87551.93578.4211.0039.491017CGPHE134−5.17651.50179.6911.0043.931018CD1PHE134−4.81950.14879.9011.0039.011019CD2PHE134−4.90252.43480.6971.0039.931020CE1PHE134−4.21749.74181.0991.0037.661021CE2PHE134−4.28352.03781.9241.0037.661022CZPHE134−3.95150.70082.1171.0044.211023CPHE134−7.82451.77576.8491.0061.241024OPHE134−7.06551.91075.8831.0057.591025NGLN135−9.14051.97576.7501.0068.861026CAGLN135−9.73052.39675.4861.0064.841027CBGLN135−10.95651.54875.1551.0065.151028CGGLN135−11.24651.47773.6521.0064.671029CDGLN135−12.46950.64373.3111.0063.721030OE1GLN135−12.69450.30372.1481.0068.971031NE2GLN135−13.27150.31474.3241.0073.321032CGLN135−10.10553.86575.5491.0062.681033OGLN135−9.31354.68976.0101.0072.041034NASN136−11.30354.20075.0790.0068.191035CAASN136−11.77555.58175.1030.0068.931036CBASN136−12.63155.87773.8700.0069.871037CGASN136−11.86655.69672.5730.0070.231038OD1ASN136−10.85756.36072.3350.0070.481039ND2ASN136−12.34654.79471.7250.0070.481040CASN136−12.59755.78776.3700.0069.661041OASN136−13.07656.88676.6500.0069.811042NSER137−12.75254.70577.1250.0069.871043CASER137−13.49554.71378.3790.0069.971044CBSER137−14.64453.70578.3110.0070.201045OGSER137−14.15852.40078.0440.0070.461046CSER137−12.52154.32079.4850.0069.721047OSER137−11.39553.90979.2040.0069.901048NPRO138−12.93454.43880.7560.0069.291049CDPRO138−14.23854.85181.3000.0069.401050CAPRO138−12.01354.06581.8340.0068.421051CBPRO138−12.83254.32983.0980.0069.041052CGPRO138−14.25054.15782.6370.0069.331053CPRO138−11.51052.62381.7350.0067.001054OPRO138−12.24051.72381.3180.0067.591055NALA139−10.25252.42782.1181.0063.131056CAALA139−9.59251.12082.0851.0059.191057CBALA139−8.33551.17382.9311.0063.391058CALA139−10.45449.94382.5391.0062.091059OALA139−10.70849.77583.7291.0067.131060NGLU140−10.89449.11181.6081.0060.461061CAGLU140−11.71747.97681.9961.0064.651062CBGLU140−12.77747.68880.9281.0071.731063CGGLU140−13.72346.52181.2811.0082.731064CDGLU140−14.58646.07080.0921.0092.271065OE1GLU140−14.00945.75779.0181.0088.041066OE2GLU140−15.83446.02280.2321.0082.561067CGLU140−10.86146.72782.2301.0065.141068OGLU140−10.03646.36081.3941.0066.101069NASP141−11.05846.09983.3841.0050.751070CAASP141−10.35544.88283.7711.0055.141071CBASP141−10.06744.89185.2651.0050.521072CGASP141−8.79145.61185.6101.0062.661073OD1ASP141−8.18746.24284.7231.0071.611074OD2ASP141−8.38445.54386.7811.0074.871075CASP141−11.19043.64683.4751.0053.621076OASP141−12.40943.69883.5081.0069.301077NPHE142−10.54342.53583.1591.0052.101078CAPHE142−11.27141.29982.9481.0041.641079CBPHE142−12.31241.43681.8051.0046.981080CGPHE142−11.73641.51580.4191.0047.601081CD1PHE142−11.59840.37179.6451.0051.871082CD2PHE142−11.37542.73679.8731.0048.501083CE1PHE142−11.10340.44678.3281.0058.301084CE2PHE142−10.88542.82078.5691.0048.211085CZPHE142−10.75141.67277.7971.0048.261086CPHE142−10.34140.10382.7661.0046.751087OPHE142−9.11940.25982.7041.0049.751088NGLN143−10.91438.91082.7011.0041.281089CAGLN143−10.11837.71182.5771.0042.261090CBGLN143−10.26536.83683.8031.0038.911091CGGLN143−9.97037.52385.1151.0057.411092CDGLN143−10.01136.54486.2571.0059.521093OE1GLN143−10.38035.38286.0771.0049.801094NE2GLN143−9.63337.00087.4331.0054.071095CGLN143−10.46636.87381.3991.0046.341096OGLN143−11.57436.90680.9071.0063.361097NGLU144−9.49236.10380.9581.0046.311098CAGLU144−9.66535.20379.8431.0049.211099CBGLU144−8.80135.67078.6581.0053.601100CGGLU144−9.25636.94277.9651.0041.811101CDGLU144−10.60636.78277.3141.0050.941102OE1GLU144−10.97935.63176.9931.0050.031103OE2GLU144−11.29837.80677.1121.0054.721104CGLU144−9.12933.88580.3801.0050.351105OGLU144−8.05633.84980.9721.0047.591106NPRO145−9.85332.78280.1861.0051.511107CDPRO145−11.16432.54779.5551.0053.001108CAPRO145−9.28231.54480.7281.0051.961109CBPRO145−10.45030.55880.6301.0058.731110CGPRO145−11.19031.04679.4071.0051.681111CPRO145−8.04931.08479.9481.0053.191112OPRO145−7.73131.63378.8871.0055.411113NCYS146−7.34130.09780.4861.0047.541114CACYS146−6.16029.56179.8131.0050.291115CCYS146−6.24828.07079.9291.0035.561116OCYS146−6.52827.56480.9701.0044.361117CBCYS146−4.85530.01680.4701.0042.181118SGCYS146−4.70531.80280.5811.0055.511119NGLN147−5.95227.36578.8591.0039.891120CAGLN147−6.03325.93978.8861.0042.601121CBGLN147−6.88525.46677.7021.0047.041122CGGLN147−8.22626.23577.5631.0056.191123CDGLN147−8.08527.63176.9031.0060.331124OE1GLN147−8.64828.61477.3821.0066.531125NE2GLN147−7.34927.70475.7941.0059.181126CGLN147−4.65925.33278.8421.0044.771127OGLN147−3.74925.86378.1941.0050.021128NTYR148−4.48124.23779.5701.0052.731129CATYR148−3.20123.57879.5321.0045.081130CBTYR148−2.89822.76780.7681.0036.591131CGTYR148−1.47722.29780.7211.0043.751132CD1TYR148−0.42523.21180.7361.0039.071133CE1TYR1480.89222.79180.6281.0055.871134CD2TYR148−1.17020.94480.6021.0048.151135CE2TYR1480.14720.51180.5031.0032.871136CZTYR1481.16821.43480.5081.0051.391137OHTYR1482.45721.01780.3241.0050.551138CTYR148−3.29722.65478.3691.0052.841139OTYR148−4.29421.97878.1851.0063.181140NSER149−2.25922.64977.5571.0064.151141CASER149−2.24021.81576.3881.0057.561142CBSER149−1.82622.63575.1741.0050.841143OGSER149−1.21821.79874.2081.0070.891144CSER149−1.25420.69976.6391.0068.421145OSER149−0.03020.92776.7511.0056.211146NGLN150−1.79719.48976.7631.0070.671147CAGLN150−0.97018.31676.9771.0070.901148CBGLN150−1.84417.07777.1091.0068.031149CGGLN150−2.50316.98278.4731.0076.851150CDGLN150−1.91615.87179.3041.0075.141151OE1GLN150−2.27214.71479.1211.0086.991152NE2GLN150−0.99916.20880.2041.0068.251153CGLN1500.00418.18175.8191.0066.791154OGLN1501.17417.88176.0431.0068.071155NGLU151−0.46918.43174.5931.0065.241156CAGLU1510.40118.35073.4151.0066.371157CBGLU151−0.33918.75072.1001.0071.261158CGGLU151−1.18217.63571.4211.0072.871159CDGLU151−1.20317.69969.8661.0083.211160OE1GLU151−0.18018.10469.2571.0085.871161OE2GLU151−2.22817.30969.2441.0066.611162CGLU1511.59619.27673.6391.0069.201163OGLU1512.68518.80273.9821.0061.031164NSER1521.37920.58673.4641.0059.321165CASER1522.43021.59173.6251.0061.341166CBSER1521.92322.97373.2311.0064.131167OGSER1520.96523.44774.1631.0063.701168CSER1523.04221.71875.0161.0055.291169OSER1524.14222.24475.1451.0055.791170NGLN1532.36621.23476.0551.0049.351171CAGLN1532.92921.36977.4101.0059.771172CBGLN1534.34920.74677.4811.0058.601173CGGLN1534.41719.25877.8661.0062.841174CDGLN1534.11318.92279.3631.0068.701175OE1GLN1534.90719.34180.2401.0078.041176NE2GLN1532.96618.36979.6471.0066.891177CGLN1533.02422.86877.7791.0051.811178OGLN1534.01623.29178.3651.0054.371179NLYS1542.00423.64777.4041.0046.161180CALYS1541.92725.09677.6531.0050.571181CBLYS1542.28525.88576.4121.0038.451182CGLYS1543.74225.90476.0201.0054.111183CDLYS1543.83026.55174.6411.0048.621184CELYS1545.22426.54774.0821.0041.061185NZLYS1545.21726.88672.6131.0057.601186CLYS1540.54125.59678.0741.0054.921187OLYS154−0.49825.02477.7091.0043.661188NPHE1550.53126.66478.8671.0048.031189CAPHE155−0.73227.26179.2421.0039.541190CBPHE155−0.62728.00780.5731.0041.401191CGPHE155−0.73127.11781.7841.0036.681192CD1PHE1550.41326.75382.5101.0041.111193CD2PHE155−1.96526.65282.2091.0033.061194CE1PHE1550.31925.93583.6441.0039.671195CE2PHE155−2.07125.82783.3581.0036.901196CZPHE155−0.93125.47684.0631.0044.921197CPHE155−0.96628.23078.0781.0039.921198OPHE155−0.03128.86077.6171.0037.121199NSER156−2.20128.32777.5921.0036.691200CASER156−2.48529.17676.4491.0037.611201CBSER156−2.47828.31675.1811.0038.421202OGSER156−2.56629.09574.0091.0046.671203CSER156−3.83329.87576.5961.0044.981204OSER156−4.79629.27476.9871.0039.161205NCYS157−3.88331.15776.2641.0043.711206CACYS157−5.10131.92676.3971.0041.171207CCYS157−5.28232.79975.1791.0039.871208OCYS157−4.33933.12674.4951.0042.771209CBCYS157−5.05432.83377.6511.0038.541210SGCYS157−3.75932.41178.8661.0054.871211NGLN158−6.51733.18674.9171.0043.331212CAGLN158−6.76334.04173.7981.0042.101213CBGLN158−7.57133.30972.7491.0047.261214CGGLN158−6.81232.19272.1451.0042.301215CDGLN158−7.48931.66870.9291.0056.101216OE1GLN158−8.16132.41670.1931.0048.871217NE2GLN158−7.31130.37770.6831.0051.801218CGLN158−7.46935.28574.1991.0038.011219OGLN158−8.27635.30575.1141.0048.281220NLEU159−7.15636.34573.4951.0042.091221CALEU159−7.79837.61173.7541.0042.111222CBLEU159−6.79638.54574.3921.0031.901223CGLEU159−7.07240.04874.4591.0030.381224CD1LEU159−8.29240.31975.3211.0035.271225CD2LEU159−5.83840.72375.0491.0027.231226CLEU159−8.22438.08472.3591.0046.031227OLEU159−7.42738.08971.4161.0041.401228NALA160−9.50138.41372.2271.0040.781229CAALA160−10.05938.88170.9601.0049.801230CBALA160−11.57838.61270.9101.0032.431231CALA160−9.80840.37370.8491.0046.691232OALA160−10.29841.13271.6581.0050.911233NVAL161−9.01240.79569.8751.0053.331234CAVAL161−8.77842.22269.6991.0049.121235CBVAL161−7.27742.54169.6731.0049.301236CG1VAL161−7.06144.00869.3981.0048.161237CG2VAL161−6.64042.156711.0050.621238CVAL161−9.43442.58668.3671.0054.331239OVAL161−9.01542.10267.3301.0063.241240NPRO162−10.50943.40068.3891.0059.461241CDPRO162−11.08944.14569.5211.0048.501242CAPRO162−11.17343.77767.1371.0055.331243CBPRO162−12.28844.71367.6061.0054.211244CGPRO162−11.71045.32768.8361.0050.741245CPRO162−10.17744.47966.2461.0052.411246OPRO162−9.20245.01766.7531.0050.051247NGLU163−10.40644.45964.9311.0054.021248CAGLU163−9.48745.11964.0231.0051.651249CBGLU163−9.74044.68362.5861.0061.561250CGGLU163−8.55443.90561.9441.0077.441251CDGLU163−7.14844.45962.3041.0077.961252OE1GLU163−6.61644.10763.3871.0079.061253OE2GLU163−6.57545.23761.5041.0072.231254CGLU163−9.54946.65664.1381.0057.771255OGLU163−10.59947.23464.4241.0043.251256NGLY164−8.39947.29963.9351.0049.641257CAGLY164−8.31848.73964.0501.0059.591258CGLY164−8.35749.20865.5011.0064.821259OGLY164−8.38350.41765.7661.0063.601260NASP165−8.36048.26266.4421.0052.821261CAASP165−8.41448.59267.8691.0049.581262CBASP165−8.51047.28868.6541.0056.631263CGASP165−8.91847.48670.0941.0066.561264OD1ASP165−8.69748.58970.6401.0071.821265OD2ASP165−9.45146.51970.6841.0053.011266CASP165−7.17149.40568.3031.0052.811267OASP165−6.03948.93768.1871.0056.021268NSER166−7.37450.62968.7741.0052.021269CASER166−6.25851.45769.2231.0051.681270CBSER166−6.48152.90868.8391.0054.841271OGSER166−6.56353.02867.4341.0082.351272CSER166−6.09151.39270.7351.0051.791273OSER166−5.35752.18871.3201.0044.251274NSER167−6.79850.48071.3851.0037.581275CASER167−6.65650.37672.8311.0052.111276CBSER167−7.72349.46673.4381.0039.521277OGSER167−8.97949.62972.8271.0052.251278CSER167−5.29049.78073.1711.0048.801279OSER167−4.65349.12972.3541.0047.941280NPHE168−4.83450.05174.3761.0043.431281CAPHE168−3.61149.46974.8371.0044.421282CBPHE168−2.72650.47875.5391.0026.371283CGPHE168−1.99351.33574.5851.0039.661284CD1PHE168−2.60952.44974.0131.0033.601285CD2PHE168−0.71050.99374.1811.0029.871286CE1PHE168−1.95253.19373.0581.0040.751287CE2PHE168−0.66251.72873.2271.0041.691288CZPHE168−0.04852.83472.6561.0035.651289CPHE168−4.08548.40775.7641.0043.721290OPHE168−5.20148.46676.2991.0044.371291NTYR169−3.25747.39275.9011.0049.151292CATYR169−3.60046.29776.7601.0035.791293CBTYR169−3.97745.07175.9621.0040.551294CGTYR169−5.25145.23175.1951.0045.831295CD1TYR169−5.27045.85573.9401.0048.251296CE1TYR169−6.44945.93973.1981.0044.961297CD2TYR169−6.43744.71775.6831.0050.321298CE2TYR169−7.61744.81174.9481.0049.481299CZTYR169−7.61045.41673.7141.0044.531300OHTYR169−8.77245.48573.0091.0047.891301CTYR169−2.42545.99177.6151.0034.141302OTYR169−1.27746.14077.1811.0036.211303NILE170−2.72245.66478.8681.0033.461304CAILE170−1.70045.23979.7981.0038.421305CBILE170−1.57546.166831.0039.431306CG2ILE170−0.48645.60781.9441.0027.431307CG1ILE170−1.22347.58580.5201.0038.201308CD1ILE170−0.93348.57281.6301.0036.441309CILE170−2.13843.83080.2381.0041.741310OILE170−3.23643.63680.7431.0042.031311NVAL171−1.30142.83779.9841.0042.791312CAVAL171−1.64041.47880.3701.0033.581313CBVAL171−1.80640.51879.1291.0032.571314CG1VAL171−2.9674078.2461.0043.011315CG2VAL171−0.54640.45878.3131.0033.951316CVAL171−0.61440.87081.2991.0039.221317OVAL1710.56441.21381.2751.0045.361318NSER172−1.08839.94482.1071.0037.481319CASER172−0.26839.22883.0461.0037.921320CBSER172−0.20139.99684.3711.0037.351321OGSER1720.80939.44385.1881.0051.101322CSER172−0.94337.87083.2791.0030.611323OSER172−2.16037.76883.3401.0040.901324NMET173−0.17136.82683.4311.0035.711325CAMET173−0.81335.53883.6641.0037.991326CBMET173−0.22534.46082.7611.0034.901327CGMET173−0.81333.07082.9661.0039.851328SDMET1730.14131.88581.9811.0046.541329CEMET173−0.49732.25280.3381.0027.071330CMET173−0.62535.15585.1191.0040.811331OMET1730.46935.23985.6891.0041.351332NCYS174−1.73134.77985.7211.0033.011333CACYS174−1.73934.33787.0731.0036.171334CCYS174−1.86532.81587.0361.0044.261335OCYS174−2.74532.29086.3531.0055.091336CBCYS174−2.93834.93087.7841.0035.891337SGCYS174−3.29534.10489.3451.0062.531338NVAL175−0.96632.11287.7301.0045.181339CAVAL175−1.04830.66587.8351.0043.881340CBVAL1750.14129.95087.2451.0051.651341CG1VAL1750.02628.48487.5451.0045.861342CG2VAL1750.14930.13185.7381.0069.661343CVAL175−1.12630.34489.3091.0053.011344OVAL175−0.35730.88890.1031.0050.611345NALA176−2.07129.47489.6701.0049.031346CAALA176−2.27429.10991.0521.0036.271347CBALA176−3.38929.90091.6351.0049.891348CALA176−2.54727.65191.2521.0046.901349OALA176−3.15327.00090.4141.0044.961350NSER177−2.07627.14192.3841.0039.291351CASER177−2.28725.75492.7491.0046.651352CBSER177−0.94425.02392.8331.0045.221353OGSER177−0.08025.60793.7981.0050.991354CSER177−2.94425.92694.1091.0044.291355OSER177−3.22727.05994.4941.0047.841356NSER178−3.20424.84594.8431.0049.381357CASER178−3.84824.99596.1571.0034.081358CBSER178−4.44223.67296.6131.0046.731359OGSER178−3.49922.62496.4561.0061.091360CSER178−2.91225.50797.2181.0039.701361OSER178−3.36325.84098.3151.0060.421362NVAL179−1.61225.58096.9061.0043.011363CAVAL179−0.61926.05897.8711.0042.311364CBVAL1790.46224.95198.2121.0052.491365CG1VAL179−0.18123.79498.9491.0044.011366CG2VAL1791.19924.49896.9261.0029.171367CVAL1790.16427.32397.4651.0052.421368OVAL1791.21127.61298.0451.0048.691369NGLY180−0.33628.06896.4831.0056.291370CAGLY1800.36229.26496.0451.0047.341371CGLY180−0.07629.77094.6751.0047.021372OGLY180−0.82929.14793.9531.0045.511373NSER1810.39230.94594.3151.0048.021374CASER1810.02531.50193.0441.0040.201375CBSER181−1.28832.26093.1581.0050.451376OGSER181−1.19133.23394.1701.0046.341377CSER1811.13732.42892.6501.0037.491378OSER1811.88832.86193.4911.0039.351379NLYS1821.28232.66591.3571.0030.891380CALYS1822.27733.58990.8791.0036.581381CBLYS1823.56032.87590.4421.0039.171382CGLYS1824.34932.18791.5271.0027.021383CDLYS1824.79233.11192.6131.0038.021384CELYS1826.16032.64393.1481.0056.921385NZLYS1826.89033.62594.0081.0044.061386CLYS1821.70334.36489.6931.0039.371387OLYS1820.73633.95989.0681.0040.661388NPHE1832.29535.49889.3831.0040.261389CAPHE1831.81736.23088.2491.0031.061390CBPHE1830.96837.43688.6601.0037.201391CGPHE1831.62438.35889.6411.0052.971392CD1PHE1832.65939.24689.2291.0052.231393CD2PHE1831.20038.37690.9801.0044.371394CE1PHE1833.27140.14690.1461.0049.031395CE2PHE1831.80639.28491.9171.0049.531396CZPHE1832.83540.15691.4891.0055.761397CPHE1833.04936.61187.5431.0032.621398OPHE1834.10136.76988.1681.0037.471399NSER1842.94236.70186.2291.0026.171400CASER1844.07337.06085.4341.0023.091401CBSER1843.87136.59784.0171.0028.431402OGSER1842.61037.01883.5281.0033.081403CSER1844.22438.54385.3701.0032.481404OSER1843.33839.28685.7711.0030.511405NLYS1855.37038.93784.8271.0031.371406CALYS1855.65740.31184.5021.0037.361407CBLYS1857.02640.44383.8541.0030.261408CGLYS1858.16740.25284.8351.0041.571409CDLYS1859.51140.59984.2011.0039.511410CELYS18510.63740.26985.1751.0052.191411NZLYS18511.99040.71584.6791.0066.481412CLYS1854.60940.62283.4481.0031.961413OLYS1854.12739.75082.7171.0040.831414NTHR1864.26941.88383.3701.0048.131415CATHR1863.27442.39782.4441.0034.031416CBTHR1862.98943.83782.8661.0031.331417OG1THR1861.95343.83783.8751.0045.681418CG2THR1862.63644.66181.7361.0044.071419CTHR1863.74642.34480.9831.0038.661420OTHR1864.93942.35880.6831.0028.831421NGLN1872.78542.26280.0871.0033.991422CAGLN1873.04542.30178.6651.0041.071423CBGLN1872.80940.90778.0851.0033.311424CGGLN1873.05940.79376.6261.0058.841425CDGLN1874.45240.32376.3061.0059.151426OE1GLN1874.72339.90475.1611.0066.551427NE2GLN1875.35440.37977.2991.0043.861428CGLN1872.05043.37778.1431.0033.771429OGLN1870.84443.22778.2831.0038.551430NTHR1882.58944.49577.6401.0042.841431CATHR1881.83045.65377.1201.0029.851432CBTHR1882.32947.01177.6341.0031.081433OG1THR1882.60846.95379.0301.0040.811434CG2THR1881.26448.08677.3621.0027.811435CTHR1881.89045.83575.6211.0030.991436OTHR1882.92745.68675.0141.0052.501437NPHE1890.78546.22575.0341.0032.541438CAPHE1890.72546.47773.6031.0031.881439CBPHE1890.90645.16572.8521.0031.771440CGPHE189−0.09444.16873.1961.0043.971441CD1PHE189−1.34844.20772.6061.0046.621442CD2PHE1890.14843.25174.2141.0044.391443CE1PHE189−2.34943.36173.0281.0037.551444CE2PHE189−0.87142.39974.6391.0035.331445CZPHE189−2.10542.46074.0511.0034.371446CPHE189−0.57047.16173.1141.0037.611447OPHE189−1.63847.11173.7301.0030.141448NGLN190−0.44647.82671.9891.0037.991449CAGLN190−1.59448.44871.3741.0040.931450CBGLN190−1.16649.53370.4281.0046.591451CGGLN190−2.31150.25569.8871.0045.551452CDGLN190−1.90651.60269.4611.0052.981453OE1GLN190−0.89251.76068.7671.0053.301454NE2GLN190−2.68552.60469.8571.0050.891455CGLN190−2.22447.34970.5771.0042.261456OGLN190−1.51146.56069.9901.0043.021457NGLY191−3.55547.29070.5661.0056.591458CAGLY191−4.25746.23969.8531.0045.601459CGLY191−3.69445.98968.4711.0063.131460OGLY191−3.49544.84568.0711.0052.461461NCYS192−3.42347.07267.7461.0066.671462CACYS192−2.89546.99166.3791.0076.681463CCYS192−1.40247.21866.2591.0072.101464OCYS192−0.97348.05965.4691.0079.181465CBCYS192−3.60748.00965.4941.0080.631466SGCYS192−4.01549.53266.3931.0088.301467NGLY193−0.61946.46667.0331.0072.361468CAGLY1930.82646.60067.0061.0054.451469CGLY1931.54145.30867.3511.0058.631470OGLY1932.73945.15967.0841.0053.131471NILE1940.81544.35867.9301.0052.041472CAILE1941.44343.11868.3341.0044.381473CBILE1940.66042.44769.5141.0045.561474CG2ILE194−0.66841.92869.0571.0036.301475CG1ILE1941.49141.30970.1311.0041.081476CD1ILE1940.87340.74171.3411.0039.861477CILE1941.63942.14467.1871.0046.451478OILE1942.53241.29567.2481.0043.391479NLEU1950.84142.29266.1311.0041.081480CALEU1950.93141.42464.9561.0030.611481CBLEU1950.12642.02563.7731.0026.451482CGLEU1950.17141.27962.4201.0023.921483CD1LEU195−0.51839.95162.5321.0027.311484CD2LEU195−0.43942.05561.3591.0023.891485CLEU1952.34241.11864.4711.0034.621486OLEU1953.14242.01264.2321.0035.891487NGLN1962.64139.83564.3501.0028.591488CAGLN1963.90639.37263.7851.0032.441489CBGLN1964.98439.08064.8371.0033.231490CGGLN1966.38039.05564.1631.0043.561491CDGLN1967.49238.35964.9601.0038.911492OE1GLN1968.68338.47764.6301.0039.551493NE2GLN1967.11137.62865.9841.0035.471494CGLN1963.65838.10562.9341.0032.491495OGLN1963.36837.02863.4491.0034.281496NPRO1973.75338.22961.6101.0028.021497CDPRO1974.13039.40260.8061.0034.211498CAPRO1973.52037.04860.7721.0027.671499CBPRO1973.50637.61159.3471.0027.891500CGPRO1973.47939.11059.4841.0024.681501CPRO1974.63636.02460.9361.0033.521502OPRO1975.71436.38361.3491.0030.591503NASP1984.36134.74960.6481.0034.231504CAASP1985.42833.74260.6721.0026.241505CBASP1984.90132.32060.4791.0037.691506CGASP1984.39731.70661.7691.0045.161507OD1ASP1985.12931.76762.7851.0045.511508OD2ASP1983.27331.15861.7581.0048.531509CASP1986.30634.05659.4641.0037.681510OASP1985.86734.83958.5461.0033.161511NPRO1997.55033.46759.4221.0038.921512CDPRO1998.09532.41460.2931.0028.031513CAPRO1998.49533.68458.3181.0029.631514CBPRO1999.70732.84558.7201.0033.411515CGPRO1999.56032.68860.2041.0034.881516CPRO1997.92733.22256.9821.0034.711517OPRO1997.01232.36156.9091.0036.651518NPRO2008.39033.84955.9141.0032.631519CDPRO2008.77635.26455.8711.0034.811520CAPRO2007.88033.42154.6241.0030.351521CBPRO2008.66834.27853.6551.0029.671522CGPRO2008.66535.59954.3511.0034.131523CPRO2008.11931.92754.5121.0028.471524OPRO2008.98631.40455.1551.0034.081525NALA2017.31431.24253.7151.0041.161526CAALA2017.43329.80753.5921.0042.801527CBALA2016.09829.18553.9271.0040.911528CALA2017.91329.30852.2241.0047.511529OALA2017.74229.98051.2021.0042.151530NASN2028.48028.10352.2501.0042.071531CAASN2029.03627.37951.1041.0046.411532CBASN2027.94726.60450.3891.0053.301533CGASN2027.27625.59051.2951.0068.861534OD1ASN2027.92824.94152.1231.0065.651535ND2ASN2025.96825.45051.1361.0067.111536CASN2029.81928.21650.1181.0044.721537OASN2029.41528.39048.9641.0049.521538NILE20310.95128.71750.5991.0039.531539CAILE20311.85229.55849.8301.0039.991540CBILE20312.95930.17850.7031.0041.371541CG2ILE20313.77731.14049.8521.0040.071542CG1ILE20312.39830.77252.0091.0046.011543CD1ILE20311.45031.89451.8641.0057.611544CILE20312.62028.71348.8431.0047.661545OILE20313.33327.81149.2651.0048.781546NTHR20412.50428.98547.5461.0037.381547CATHR20413.31928.22446.6101.0039.501548CBTHR20412.54327.53045.4561.0049.711549OG1THR20411.87928.52844.6801.0054.831550CG2THR20411.58526.50045.9691.0038.241551CTHR20414.22529.20745.9391.0040.011552OTHR20413.81330.32645.6371.0041.741553NVAL20515.46728.79145.7161.0036.981554CAVAL20516.43629.63845.0421.0041.751555CBVAL20517.70029.86645.8751.0039.671556CG1VAL20518.61830.86945.1601.0037.361557CG2VAL20517.32530.36047.2411.0030.871558CVAL20516.76528.78943.8441.0048.461559OVAL20517.02927.58443.9771.0052.221560NTHR20616.75829.40342.6721.0040.561561CATHR20616.97028.62741.4661.0048.051562CBTHR20615.60128.23740.8761.0039.581563OG1THR20614.93627.41441.8261.0048.821564CG2THR20615.72527.49139.5921.0053.891565CTHR20617.78229.34440.4401.0045.981566OTHR20617.53930.51540.1641.0048.971567NALA20718.74528.62239.8731.0043.241568CAALA20719.59329.18838.8451.0045.851569CBALA20720.83328.29438.6141.0042.681570CALA20718.81329.37337.5331.0047.751571OALA20717.96428.56237.1641.0041.361572NVAL20819.09430.47636.8591.0039.021573CAVAL20818.48230.76435.5911.0041.831574CBVAL20818.01132.19735.5271.0035.721575CG1VAL20817.24532.39034.2521.0026.281576CG2VAL20817.20132.52636.7411.0043.471577CVAL20819.51930.54334.4691.0041.281578OVAL20820.43831.32934.2961.0043.821579NALA20919.34529.45333.7321.0047.121580CAALA20920.18229.06132.5991.0033.471581CBALA20919.39328.12531.6991.0026.181582CALA20920.70830.20931.7521.0045.921583OALA20919.94331.10631.3331.0033.821584NARG21022.01730.16731.4951.0037.981585CAARG21022.67531.16230.6431.0039.921586CBARG21021.96131.22429.2711.0042.711587CGARG21021.98229.88528.4431.0035.371588CDARG21020.87029.84227.2861.0050.761589NEARG2102230.89026.2581.0053.731590CZARG21019.98631.42325.5611.0062.221591NH1ARG21018.74831.02625.7791.0056.791592NH2ARG21020.20032.34224.6091.0059.591593CARG21022.83232.57331.2091.0040.241594OARG21023.20833.49330.4881.0049.751595NASN21122.55332.74732.4921.0045.651596CAASN21122.66934.05733.1551.0045.301597CBASN21121.27434.58433.4821.0043.531598CGASN21120.57335.11432.2721.0033.161599OD1ASN21120.69136.29831.9331.0050.541600ND2ASN21119.85634.25331.5961.0038.921601CASN21123.50333.84534.4251.0046.661602OASN21122.98033.62835.5131.0033.691603NPRO21224.82933.90234.2711.0045.451604CDPRO21225.54134.44133.0951.0044.031605CAPRO21225.75433.69735.3761.0048.121606CBPRO21227.11934.03434.7561.0042.311607CGPRO21226.80934.99233.7121.0033.991608CPRO21225.50334.43436.6701.0053.401609OPRO21225.94133.96237.7381.0060.101610NARG21324.77435.54736.6141.0036.451611CAARG21324.57436.30437.8411.0044.531612CBARG21325.09037.71737.5901.0047.171613CGARG21326.58737.73037.2301.0043.481614CDARG21327.36138.34738.3531.0052.401615NEARG21327.29639.80338.2931.0055.091616CZARG21327.62740.62139.2931.0054.961617NH1ARG21328.04740.14340.4561.0051.541618NH2ARG21327.54841.93039.1181.0054.511619CARG21323.15536.31238.4331.0051.871620OARG21322.83337.08639.3391.0058.981621NTRP21422.32235.40537.9511.0043.121622CATRP21420.95135.34038.3811.0037.741623CBTRP21420.04935.27037.1561.0038.751624CGTRP21420.03936.50336.3681.0029.791625CD2TRP21419.05536.89435.4141.0037.851626CE2TRP21419.51938.07634.8021.0034.151627CE3TRP21417.82236.35935.0131.0049.411628CD1TRP21421.01737.43236.3161.0030.361629NE1TRP21420.72338.37935.3771.0044.131630CZ2TRP21418.79638.74733.7981.0046.251631CZ3TRP21417.09937.02034.0171.0053.331632CH2TRP21417.59538.20533.4191.0050.731633CTRP21420.57834.22039.3131.0037.361634OTRP21421.14833.13239.2491.0051.381635NLEU21519.62934.51140.2051.0034.141636CALEU21519.06933.51441.1211.0027.351637CBLEU21519.67233.67342.4931.0034.731638CGLEU21521.08333.10442.6621.0038.601639CD1LEU21521.59333.43044.0651.0043.951640CD2LEU21521.03231.60742.4791.0034.001641CLEU21517.54433.79141.1371.0036.601642OLEU21517.11434.92341.3891.0038.551643NSER21616.72632.79540.7971.0040.811644CASER21615.27733.01740.7931.0042.271645CBSER21614.57432.23239.6951.0036.641646OGSER21613.18032.14739.9181.0041.261647CSER21614.75032.57742.1341.0043.671648OSER21614.78631.39642.4491.0035.431649NVAL21714.22833.52742.9111.0031.291650CAVAL21713.73333.18044.2471.0033.241651CBVAL21714.42034.06845.3551.0030.911652CG1VAL21714.17533.50446.7531.0040.721653CG2VAL21715.90134.11645.1031.0040.981654CVAL21712.22933.32344.3261.0036.871655OVAL21711.63434.34343.9051.0037.091656NTHR21811.62132.27944.8611.0030.791657CATHR21810.17932.24945.0381.0036.091658CBTHR2189.48531.17344.0991.0039.511659OG1THR2189.92829.85344.4461.0036.701660CG2THR2189.81831.44442.5941.0018.811661CTHR2189.98331.91546.5071.0039.461662OTHR21810.96531.67047.2211.0039.201663NTRP2198.72931.92946.9511.0038.621664CATRP2198.35631.62348.3301.0033.931665CBTRP2199.06332.57549.2921.0026.621666CGTRP2198.69034.04549.1291.0026.061667CD2TRP2199.36435.03048.3161.0023.671668CE2TRP2198.72436.27148.5441.0032.011669CE3TRP21910.44134.98347.4271.0025.031670CD1TRP2197.69734.71149.7851.0024.371671NE1TRP2197.71736.05549.4481.0034.141672CZ2TRP2199.12137.44247.9151.0023.851673CZ3TRP21910.84636.13946.8121.0023.841674CH2TRP21910.18137.37447.0581.0026.351675CTRP2196.85031.83748.4571.0043.621676OTRP2196.24532.47447.5981.0044.361677NGLN2206.24231.32749.5201.0039.651678CAGLN2204.80931.57749.7131.0051.571679CBGLN2204.01530.32549.4121.0050.401680CGGLN2204.44829.18050.2281.0059.691681CDGLN2204.16727.91249.5091.0058.331682OE1GLN2204.04427.91548.2911.0051.981683NE2GLN2204.07026.81250.2451.0052.321684CGLN2204.41432.08451.0931.0047.171685OGLN2205.19031.97852.0491.0050.981686NASP2213.20332.64951.1851.0057.451687CAASP2212.65333.16952.4561.0042.111688CBASP2211.18333.60152.2921.0044.431689CGASP2211.02035.00051.6771.0046.931690OD1ASP221−0.14235.42051.4231.0044.501691OD2ASP2212.03635.69151.4531.0040.841692CASP2212.71332.04553.4851.0036.841693OASP2212.63130.85053.1501.0036.071694NPRO2222.91332.39954.7601.0040.081695CDPRO2223.42133.62155.3941.0034.721696CAPRO2222.95131.29355.7071.0033.911697CBPRO2223.53931.93056.9501.0026.831698CGPRO2223.16833.29656.8291.0036.911699CPRO2221.52330.85055.9161.0043.031700OPRO2220.61231.63455.7161.0035.511701NHIS2231.36329.58656.2841.0037.131702CAHIS2230.08628.97756.5811.0047.601703CBHIS2230.31527.59257.2021.0057.921704CGHIS223−0.02326.46556.2841.0071.351705CD2HIS223−0.97325.50356.3691.0075.371706ND1HIS2230.62526.26355.0801.0064.601707CE1HIS2230.09225.22454.4651.0071.901708NE2HIS223−0.88224.74555.2241.0088.571709CHIS223−0.78829.80857.5231.0039.581710OHIS223−1.93930.07857.2111.0052.251711NSER224−0.22930.20658.6631.0040.651712CASER224−0.94730.98359.6611.0035.591713CBSER224−0.05931.27660.8761.0033.271714OGSER2241.11931.96760.4821.0042.921715CSER224−1.49432.29159.1071.0038.411716OSER224−2.39932.88459.6921.0046.861717NTRP225−0.96732.76257.9861.0039.371718CATRP225−1.49234.01157.4451.0026.771719CBTRP225−0.39734.79656.7241.0028.301720CGTRP225−0.79736.17156.2691.0033.601721CD2TRP225−0.73637.40857.0311.0028.411722CE2TRP225−1.11938.44956.1641.0022.111723CE3TRP225−0.38337.72858.3451.0027.001724CD1TRP225−1.21836.51755.0281.0032.531725NE1TRP225−1.41037.88254.9521.0034.651726CZ2TRP225−1.14939.78456.5591.0020.871727CZ3TRP225−0.40839.06358.7351.0037.631728CH2TRP225−0.78740.08257.8341.0023.371729CTRP225−2.60733.63056.5111.0036.751730OTRP225−2.39333.49055.3111.0041.441731NASN226−3.78433.37057.0991.0050.651732CAASN226−4.97933.01156.3311.0045.841733CBASN226−5.95132.08857.0911.0043.861734CGASN226−5.96432.31258.6341.0044.681735OD1ASN226−5.68933.39359.1361.0048.371736ND2ASN226−6.32331.27659.3771.0042.391737CASN226−5.63434.32655.9921.0053.431738OASN226−5.00835.38856.0941.0068.671739NSER227−6.88934.28455.6141.0054.231740CASER227−7.57935.49955.1821.0068.041741CBSER227−7.63436.56956.2691.0060.181742OGSER227−8.60437.54655.8801.0064.301743CSER227−6.88036.08853.9531.0061.231744OSER227−5.65536.12053.8741.0077.301745NSER228−7.67736.55052.9991.0057.481746CASER228−7.17337.11651.7701.0049.041747CBSER228−7.98536.59850.6021.0059.591748OGSER228−9.31137.09150.6961.0075.431749CSER228−7.34238.60751.8621.0044.401750OSER228−6.87139.36951.0191.0054.221751NPHE229−7.99439.03552.9171.0045.521752CAPHE229−8.22140.45853.1021.0051.101753CBPHE229−9.37440.64254.0971.0055.231754CGPHE229−10.65339.97453.6471.0052.411755CD1PHE229−11.57639.49454.5731.0064.421756CD2PHE229−10.91139.78652.2791.0059.121757CE1PHE229−12.75538.82154.1511.0062.791758CE2PHE229−12.07339.12251.8461.0071.291759CZPHE229−12.99838.63752.7891.0061.721760CPHE229−6.96241.21053.5311.0051.961761OPHE229−6.92442.44853.5021.0048.191762NTYR230−5.92440.47253.9261.0048.371763CATYR230−4.67941.13254.3401.0050.081764CBTYR230−4.45140.98955.8601.0046.471765CGTYR230−5.47541.72956.6911.0045.871766CD1TYR230−6.66141.12657.0901.0036.391767CE1TYR230−7.61341.85257.8561.0044.551768CD2TYR230−5.25943.05457.0521.0048.881769CE2TYR230−6.19343.78357.7851.0031.641770CZTYR230−7.35743.18658.2011.0050.471771OHTYR230−8.19643.91459.0421.0056.401772CTYR230−3.49140.58653.5641.0036.441773OTYR230−3.27039.39653.5161.0038.091774NARG231−2.74541.46652.9261.0036.841775CAARG231−1.5934552.1761.0042.221776CBARG231−1.50241.73350.8371.0044.231777CGARG231−2.72541.63649.9831.0053.591778CDARG231−2.65840.42349.0871.0058.621779NEARG231−1.65840.61848.0401.0064.221780CZARG231−1.44239.77347.0431.0057.631781NH1ARG231−2.16838.66846.9551.0058.851782NH2ARG231−0.49240.02546.1451.0063.771783CARG231−0.29841.22452.9631.0036.691784OARG231−0.19442.07553.8691.0033.851785NLEU2320.69340.45052.5761.0034.251786CALEU2322.02640.52153.1521.0026.861787CBLEU2322.52139.11453.3941.0029.551788CGLEU2322.77038.60554.8101.0038.271789CD1LEU2321.99739.42255.8421.0026.311790CD2LEU2322.50237.08854.8271.0025.141791CLEU2322.95041.23052.1651.0030.031792OLEU2322.81841.09650.9611.0048.111793NARG2333.86242.02952.6851.0040.211794CAARG2334.86742.71451.8821.0034.961795CBARG2334.95644.14152.3851.0043.371796CGARG2336.09644.93251.8441.0046.201797CDARG2336.03246.34852.3791.0035.261798NEARG2334.77747.03252.0831.0036.721799CZARG2334.38247.45550.8841.0040.211800NH1ARG2335.13247.25449.8161.0043.901801NH2ARG2333.26348.17250.7691.0036.591802CARG2336.18541.94852.1901.0038.111803OARG2336.36641.50953.3191.0027.191804NPHE2347.10741.76251.2441.0033.421805CAPHE2348.32541.03751.6451.0037.961806CBPHE2348.43539.68550.9461.0033.161807CGPHE2347.26238.79751.1491.0024.951808CD1PHE2346.17038.85850.2741.0037.711809CD2PHE2347.23637.88352.1951.0033.931810CE1PHE2345.08938.01250.4371.0024.301811CE2PHE2346.14537.02152.3751.0033.011812CZPHE2345.06737.08151.4891.0022.441813CPHE2349.71041.70651.5481.0044.481814OPHE2349.95142.65450.8071.0039.311815NGLU23510.63741.19152.3301.0038.101816CAGLU23511.97441.73352.2771.0032.971817CBGLU23512.29542.52553.5421.0028.081818CGGLU23513.61243.32553.4371.0031.101819CDGLU23513.82944.20754.6691.0044.531820OE1GLU23513.76743.65655.7821.0037.001821OE2GLU23514.06745.43754.5361.0043.381822CGLU23512.92940.56052.0861.0026.981823OGLU23512.80339.52052.7351.0027.631824NLEU23613.84140.70651.1321.0035.281825CALEU23614.80239.64950.8701.0038.831826CBLEU23614.57739.14449.4541.0042.591827CGLEU23615.57738.25248.7231.0043.051828CD1LEU23614.84837.47347.6541.0052.721829CD2LEU23616.63339.10748.0771.0049.101830CLEU23616.28440.01751.1111.0037.171831OLEU23616.73441.15150.9631.0031.441832NARG23717.05139.03951.5171.0034.351833CAARG23718.45139.31651.7301.0039.431834CBARG23718.73039.66553.2121.0029.181835CGARG23718.67738.48954.2091.0030.421836CDARG23719.07839.09955.5701.0036.951837NEARG23719.22038.17556.6761.0031.851838CZARG23719.40838.59257.9311.0034.621839NH1ARG23719.49139.88858.2261.0032.341840NH2ARG23719.45737.73758.8951.0023.821841CARG23719.25038.10751.2851.0040.071842OARG23718.80336.94851.4291.0036.371843NTYR23820.40538.39050.6971.0035.861844CATYR23821.27837.33250.2151.0043.631845CBTYR23820.95037.00248.7501.0035.841846CGTYR23821.12738.17247.8081.0031.361847CD1TYR23822.29738.33547.0891.0035.381848CE1TYR23822.47939.43446.2371.0037.611849CD2TYR23820.12439.12947.6521.0039.531850CE2TYR23820.28340.22646.7951.0033.691851CZTYR23821.47440.36446.0961.0044.161852OHTYR23821.67041.42145.2631.0047.161853CTYR23822.74637.70550.3191.0046.651854OTYR23823.12238.87950.3361.0043.791855NARG23923.57736.68550.3701.0045.601856CAARG23925.00536.90750.4231.0050.191857CBARG23925.46937.20851.8651.0040.991858CGARG23925.46736.02652.7871.0033.671859CDARG23925.68436.49454.2421.0036.881860NEARG23925.32235.44655.2101.0033.501861CZARG23925.92334.26255.2961.0041.081862NH1ARG23926.92533.98154.4761.0038.341863NH2ARG23925.50933.35256.1841.0037.141864CARG23925.70035.66949.9001.0049.271865OARG23925.11934.57049.9131.0042.581866NALA24026.93435.85849.4261.0052.311867CAALA24027.75734.74848.9551.0038.151868CBALA24029.04335.27448.3911.0043.961869CALA24027.98733.99150.2541.0038.231870OALA24028.12834.58851.3131.0047.791871NGLU24127.96432.67850.2001.0039.861872CAGLU24128.12231.89751.4071.0050.521873CBGLU24128.15830.42651.0171.0060.701874CGGLU24128.29329.45252.1401.0058.151875CDGLU24128.08728.04151.6421.0072.571876OE1GLU24126.93727.56151.7081.0079.461877OE2GLU24129.06227.42051.1571.0067.861878CGLU24129.38032.29952.1701.0055.561879OGLU24129.41132.26653.4021.0047.411880NARG24230.40432.70551.4201.0062.761881CAARG24231.69533.12251.9681.0063.111882CBARG24232.74733.02350.8901.0060.321883CGARG24232.66734.13049.8651.0069.881884CDARG24233.68033.86548.7641.0071.901885NEARG24233.87435.01647.9011.0075.361886CZARG24234.51934.95746.7421.0085.191887NH1ARG24235.02233.79246.3331.0081.841888NH2ARG24234.65136.04945.9871.0074.211889CARG24231.75134.54052.5561.0071.411890OARG24232.57534.81653.4281.0069.711891NSER24330.89135.44052.0851.0069.391892CASER24330.89636.80752.6001.0071.041893CBSER24330.33837.79151.5571.0079.601894OGSER2433139.05551.6241.0078.061895CSER24330.07936.86553.8791.0069.071896OSER24329.29435.95054.1581.0067.881897NLYS24430.28537.92754.6611.0058.861898CALYS24429.57038.09655.9301.0060.141899CBLYS24430.50338.56557.0521.0060.041900CGLYS24431.25537.46257.7921.0058.681901CDLYS24432.11838.03258.9080.0062.151902CELYS24432.84836.93059.6590.0062.431903NZLYS24433.69137.47260.7620.0062.971904CLYS24428.44539.09555.8341.0061.791905OLYS24427.55039.07256.6681.0062.891906NTHR24528.46539.96354.8231.0055.141907CATHR24527.41140.95254.7241.0046.491908CBTHR24527.99642.39054.6141.0049.941909OG1THR24528.68542.54553.3701.0055.001910CG2THR24528.95942.64955.7821.0060.751911CTHR24526.34540.78753.6491.0053.391912OTHR24526.62840.61352.4501.0049.671913NPHE24625.09940.88454.1041.0050.841914CAPHE24623.94640.77553.2351.0042.461915CBPHE24622.70340.48854.0831.0040.901916CGPHE24622.61339.09354.5941.0042.141917CD1PHE24622.88738.80855.9141.0040.171918CD2PHE24622.22938.05453.7561.0033.931919CE1PHE24622.78437.50256.3961.0040.181920CE2PHE24622.12536.74554.2331.0035.221921CZPHE24622.40036.46455.5461.0041.581922CPHE24623.65242.03052.3861.0039.021923OPHE24623.95443.16352.7571.0040.271924NTHR24723.05041.78651.2371.0036.741925CATHR24722.53942.82150.3371.0035.161926CBTHR24722.93042.52648.8781.0051.271927OG1THR24724.32842.84448.7181.0056.621928CG2THR24722.08843.33947.8811.0041.691929CTHR24721.03942.58550.6621.0038.051930OTHR24720.57741.44150.7251.0037.691931NTHR24820.32143.65650.9881.0046.241932CATHR24818.91843.53551.3951.0046.621933CBTHR24818.70544.00052.8651.0040.531934OG1THR24819.70243.37353.6931.0042.401935CG2THR24817.28743.52453.4141.0032.111936CTHR24818.05444.31450.4621.0048.651937OTHR24818.38545.43950.0901.0044.441938NTRP24916.94243.70050.0781.0040.581939CATRP24916.02844.31849.1201.0050.641940CBTRP24916.14943.61947.7451.0055.421941CGTRP24917.44643.75846.9641.0061.341942CD2TRP24917.80143.02545.7711.0071.581943CE2TRP24918.97743.63045.2381.0065.131944CE3TRP24917.23341.91945.0951.0066.831945CD1TRP24918.40144.72547.1061.0063.111946NE1TRP24919.31544.66046.0721.0065.201947CZ2TRP24919.59843.17444.0531.0074.501948CZ3TRP24917.85141.45743.9051.0077.251949CH2TRP24919.02842.09243.3991.0078.891950CTRP24914.56444.18949.5371.0044.231951OTRP24914.16943.18150.1271.0044.261952NMET25013.76245.20849.2591.0035.131953CAMET25012.32145.07749.4991.0041.641954CBMET25011.64046.43249.7191.0033.961955CGMET25011.96647.136581.0041.531956SDMET25011.14646.31952.4471.0054.101957CEMET2509.49245.72651.5561.0034.591958CMET25011.75744.49248.1781.0045.741959OMET25011.99645.05547.1201.0032.671960NVAL25111.06943.35248.2311.0042.221961CAVAL25110.42342.79447.0491.0038.751962CBVAL2519.85641.44747.4131.0049.521963CG1VAL2519.20740.78746.2001.0037.231964CG2VAL25110.99140.59447.9831.0044.181965CVAL2519.29243.76146.6281.0043.351966OVAL2518.41944.08147.4161.0050.691967NLYS2529.28344.26345.4081.0041.061968CALYS2528.21345.21845.0941.0042.191969CBLYS2528.67246.17643.9851.0037.241970CGLYS2528.56745.59342.5471.0047.211971CDLYS2529.37546.41041.5470.0046.211972CELYS25210.87146.29141.8080.0047.331973NZLYS25211.35244.88641.6750.0047.251974CLYS2526.83544.63744.7301.0026.171975OLYS2526.68743.43844.4541.0047.891976NASP2535.82645.48844.7911.0042.161977CAASP2534.46745.12544.3841.0037.781978CBASP2534.45944.98042.8481.0039.371979CGASP2534.63846.31942.1291.0047.981980OD1ASP2534.80046.28140.8811.0061.761981OD2ASP2534.60247.39642.7981.0045.421982CASP2533.81343.90345.0181.0033.711983OASP2532.97943.24144.4021.0035.541984NLEU2544.20843.60246.2381.0032.031985CALEU2543.65642.49446.9731.0030.301986CBLEU2542.15142.73747.1691.0040.571987CGLEU2541.84744.03147.9231.0045.551988CD1LEU2540.29044.24648.1351.0033.191989CD2LEU2542.60543.92149.2601.0043.391990CLEU2543.91441.11646.3491.0038.421991OLEU2543.28740.10346.7231.0039.221992NGLN2554.87541.05445.4401.0040.771993CAGLN2555.17639.80044.7501.0034.971994CBGLN2556.11140.07743.5661.0037.381995CGGLN2555.48441.04442.5971.0061.251996CDGLN2556.36341.39941.4361.0050.251997OE1GLN2557.43941.95941.6021.0059.721998NE2GLN2555.89341.09540.2441.0055.691999CGLN2555.75638.74345.6481.0035.432000OGLN2556.30039.07046.6991.0036.432001NHIS2565.64037.48645.2261.0024.942002CAHIS2566.17736.35645.9831.0031.812003CBHIS2565.11735.27146.1731.0030.352004CGHIS2564.03435.64947.1271.0033.392005CD2HIS2563.82635.30548.4141.0038.372006ND1HIS2562.99936.48446.7831.0032.742007CE1HIS2562.20136.63847.8171.0026.752008NE2HIS2562.67735.93648.8171.0033.832009CHIS2567.39135.71845.3101.0038.162010OHIS2567.65134.53345.4571.0042.322011NHIS2578.12836.50444.5541.0030.632012CAHIS2579.26435.95743.8851.0037.122013CBHIS2578.85435.20742.6061.0041.872014CGHIS2578.49036.12541.4821.0039.912015CD2HIS2579.25136.67540.5031.0033.872016ND1HIS2577.26736.75841.4201.0047.182017CE1HIS2577.30137.67340.4671.0049.262018NE2HIS2578.49437.64339.8991.0052.842019CHIS25710.02337.15443.4651.0043.282020OHIS2579.47238.25643.4211.0030.962021NCYS25811.29036.92543.1521.0039.412022CACYS25812.11637.98142.6211.0042.812023CBCYS25812.30739.09943.6691.0053.732024SGCYS25813.84339.17444.5891.0055.132025CCYS25813.42137.37042.1101.0037.342026OCYS25813.82936.27642.5021.0043.292027NVAL25914.06038.05741.1921.0039.862028CAVAL25915.28137.52640.6181.0041.362029CBVAL25915.18837.45939.0711.0038.542030CG1VAL25916.53137.00038.4481.0045.292031CG2VAL25914.13336.50138.6971.0029.532032CVAL25916.47438.341451.0036.032033OVAL25916.54139.52240.7241.0039.672034NILE26017.39837.69741.6901.0044.082035CAILE26018.64838.34442.0831.0038.842036CBILE26019.35537.44943.0951.0041.642037CG2ILE26020.82337.96443.3641.0050.742038CG1ILE26018.49037.38544.3701.0028.552039CD1ILE26018.96836.45545.4591.0023.682040CILE26019.43238.48340.7541.0035.502041OILE26019.62837.49040.0311.0033.202042NHIS26119.80739.71040.3951.0026.092043CAHIS26120.53439.94839.1331.0043.382044CBHIS26119.93841.12438.3621.0036.992045CGHIS26118.58940.86537.7761.0061.382046CD2HIS26117.46241.61437.7821.0057.022047ND1HIS26118.31939.77936.9731.0066.712048CE1HIS26117.08939.87536.5001.0062.222049NE2HIS26116.54840.98236.9751.0060.942050CHIS26122.04740.20739.2371.0036.642051OHIS26122.70740.44838.2151.0042.862052NASP26222.60040.14140.4381.0036.002053CAASP26224.02340.43140.6061.0049.392054CBASP26224.21741.88841.0601.0043.722055CGASP26223.52442.17742.3751.0061.272056OD1ASP26223.48343.36542.7821.0055.332057OD2ASP26223.02241.20042.9981.0058.692058CASP26224.69839.51141.5791.0038.222059OASP26225.58839.92642.3071.0054.562060NALA26324.25238.25941.6021.0042.472061CAALA26324.82537.23942.4561.0039.032062CBALA26323.98636.02742.3581.0030.332063CALA26326.26336.93441.9671.0047.652064OALA26326.56737.06640.7731.0058.152065NTRP26427.14636.51542.8621.0056.142066CATRP26428.53936.20942.4681.0059.612067CBTRP26429.42636.11443.7081.0067.312068CGTRP26429.87137.40944.3051.0076.502069CD2TRP26431.22037.91144.3571.0087.222070CE2TRP26431.19239.11045.1111.0091.592071CE3TRP26432.45237.46143.8421.0089.252072CD1TRP26429.10838.30144.9951.0081.582073NE1TRP26429.89239.32345.4881.0082.492074CZ2TRP26432.35039.87045.3641.0095.102075CZ3TRP26433.61038.21744.0961.0087.892076CH2TRP26433.54539.40644.8511.0094.392077CTRP26428.68834.90141.6701.0048.822078OTRP26428.44633.82542.2071.0052.912079NSER26529.13534.99340.4151.0051.222080CASER26529.28033.82139.5381.0049.092081CBSER26530.03634.18338.2521.0065.782082OGSER26529.77633.24437.1891.0069.912083CSER26529.89932.57240.1501.0049.642084OSER26530.92732.62240.7931.0050.722085NGLY26629.22231.45039.9471.0046.942086CAGLY26629.66230.16640.4611.0046.532087CGLY26629.64129.95141.9671.0053.652088OGLY26629.90828.85042.4331.0053.602089NLEU26729.32730.97442.7471.0054.992090CALEU26729.32530.78844.1941.0039.642091CBLEU26729.81332.05144.8831.0047.502092CGLEU26731.21432.55444.5271.0050.962093CD1LEU26731.58933.52145.6171.0052.062094CD2LEU26732.23931.42344.4461.0034.642095CLEU26727.97130.42644.7651.0051.632096OLEU26726.92930.90544.2731.0047.092097NARG26827.99729.57445.7941.0045.442098CAARG26826.79629.14846.5021.0046.752099CBARG26827.10128.03147.5011.0041.612100CGARG26827.60926.72246.9171.0056.542101CDARG26828.50525.98147.9361.0073.312102NEARG26827.77625.26748.9891.0077.492103CZARG26827.05724.16248.7821.0081.372104NH1ARG26826.95023.64147.5601.0077.102105NH2ARG26826.48623.54649.8051.0072.412106CARG26826.42930.38847.2831.0048.782107OARG26827.31131.04547.7931.0046.052108NHIS26925.14530.71947.3711.0038.532109CAHIS26924.72831.91248.0981.0043.122110CBHIS26924.10432.95147.1701.0041.972111CGHIS26925.06433.61746.2461.0031.332112CD2HIS26925.67533.18045.1211.0036.372113ND1HIS26925.34634.96146.3291.0041.382114CE1HIS26926.07135.32745.2881.0035.072115NE2HIS26926.28234.26544.5351.0038.252116CHIS26923.67831.54049.0861.0045.222117OHIS26923.04030.49448.9481.0042.502118NVAL27023.49832.40450.0851.0045.322119CAVAL27022.45532.19351.0871.0043.162120CBVAL27023.00232.20852.5481.0041.992121CG1VAL27021.84132.03053.5331.0052.032122CG2VAL27023.96531.08652.7441.0039.142123CVAL27021.40933.30650.8961.0042.492124OVAL27021.73434.46750.6171.0037.042125NVAL27120.15632.910581.0034.602126CAVAL27119.02333.79950.8421.0034.972127CBVAL27118.28633.47549.5451.0032.812128CG1VAL27117.13834.42849.3191.0038.752129CG2VAL27119.28133.50748.3871.0040.592130CVAL27118.04533.67452.0031.0035.772131OVAL27117.80832.60052.5361.0036.302132NGLN27217.48934.79652.4001.0035.172133CAGLN27216.52434.77553.4851.0048.952134CBGLN27217.22135.13954.8181.0040.502135CGGLN27218.05534.00055.4241.0034.142136CDGLN27218.58034.37156.8161.0034.662137OE1GLN27219.23135.40656.9881.0033.142138NE2GLN27218.27733.54057.8091.0032.172139CGLN27215.37635.74453.1861.0033.432140OGLN27215.57936.72752.4581.0031.692141NLEU27314.20135.46253.7611.0034.832142CALEU27313.00136.29353.5751.0041.032143CBLEU27312.02335.65652.5781.0034.772144CGLEU27312.00935.74251.0481.0035.062145CD1LEU27311.75137.16650.5461.0036.252146CD2LEU27313.26935.14150.5241.0028.962147CLEU27312.21336.53254.8681.0041.752148OLEU27312.11235.65455.7301.0030.432149NARG27411.64237.72854.9781.0028.002150CAARG27410.83438.06456.1231.0037.092151CBARG27411.64338.86157.1501.0037.162152CGARG27412.06740.28356.7561.0023.222153CDARG27412.67340.92257.9861.0028.072154NEARG27413.17842.27457.7461.0029.612155CZARG27413.71243.05658.6781.0029.942156NH1ARG27413.80442.63659.9311.0028.242157NH2ARG27414.16044.25258.3531.0029.722158CARG2749.61638.85855.6201.0049.842159OARG2749.66639.53454.5771.0043.652160NALA2758.52338.76456.3651.0044.602161CAALA2757.29039.42055.9711.0039.292162CBALA2756.27038.34055.6201.0033.602163CALA2756.66540.42256.9551.0043.502164OALA2756.78640.31358.1811.0043.882165NGLN2765.99541.41156.3971.0032.082166CAGLN2765.30242.38757.1991.0033.852167CBGLN2766.13143.64957.3391.0033.942168CGGLN2765.48044.62558.2881.0041.152169CDGLN2766.20045.94758.3271.0048.002170OE1GLN2766.41446.55457.2811.0035.342171NE2GLN2766.57546.41359.5431.0043.262172CGLN2763.94442.73356.5231.0038.592173OGLN2763.85942.88455.3011.0036.702174NGLU2772.88742.82457.3131.0032.282175CAGLU2771.58343.19356.7891.0033.472176CBGLU2770.62143.45157.9521.0032.732177CGGLU277−0.78943.76557.5551.0030.252178CDGLU277−0.93945.22857.2181.0031.542179OE1GLU277−0.27546.06457.8691.0056.802180OE2GLU277−1.71345.56456.3001.0064.092181CGLU2771.76044.44455.8981.0035.012182OGLU2772.37245.43756.2831.0034.512183NGLU2781.18344.36854.7101.0034.292184CAGLU2781.34445.36353.6791.0030.452185CBGLU2780.36745.05152.5191.0038.902186CGGLU278−1.11045.29752.9111.0057.322187CDGLU278−2.11545.00551.7991.0059.362188OE1GLU278−2.85744.00251.9331.0068.352189OE2GLU278−2.17045.77250.8071.0057.172190CGLU2781.27446.82554.0431.0033.012191OGLU2781.83447.64053.3441.0046.382192NPHE2790.62247.17755.1301.0041.522193CAPHE2790.50148.58655.4811.0043.182194CBPHE279−0.95448.91155.8441.0042.012195CGPHE279−1.91348.70554.7001.0042.332196CD1PHE279−2.82847.66354.7151.0039.072197CD2PHE279−1.85649.52153.5881.0047.162198CE1PHE279−3.67347.42453.6431.0048.802199CE2PHE279−2.70349.29452.4931.0055.382200CZPHE279−3.61348.23752.5271.0043.202201CPHE2791.41049.07456.5871.0051.232202OPHE2791.23750.19857.0911.0049.782203NGLY2802.36648.23156.9701.0039.062204CAGLY2803.30048.58458.0261.0048.462205CGLY2802.74048.48959.4301.0036.312206OGLY2803.10349.26560.2981.0059.082207NGLN2811.86547.51959.6371.0045.262208CAGLN2811.21347.25460.9071.0031.452209CBGLN281−0.28746.99160.7041.0032.462210CGGLN281−0.91646.20161.8721.0055.242211CDGLN281−2.41145.85961.6931.0068.872212OE1GLN281−2.95545.01862.4251.0068.402213NE2GLN281−3.07146.50460.7311.0061.432214CGLN2811.86746.00961.4821.0038.872215OGLN2812.09445.02660.7801.0038.482216NGLY2822.17146.04662.7661.0045.522217CAGLY2822.79744.90263.3761.0035.122218CGLY2824.28144.94063.1101.0042.162219OGLY2824.83245.96862.7361.0048.532220NGLU2834.92243.80363.2641.0029.982221CAGLU2836.34343.75063.0951.0033.072222CBGLU2837.00743.19464.3761.0043.382223CGGLU2836.68243.92265.7161.0043.112224CDGLU2837.49345.22365.9541.0038.622225OE1GLU2837.34345.83267.0291.0045.262226OE2GLU2838.27645.65065.0821.0053.212227CGLU2836.67742.82361.9521.0044.102228OGLU2835.83142.04661.4821.0033.282229NTRP2847.93642.89461.5271.0038.662230CATRP2848.42642.02060.5021.0029.412231CBTRP2849.80842.41860.0391.0035.712232CGTRP2849.80443.53859.1451.0033.192233CD2TRP2849.55943.48157.7321.0021.812234CE2TRP2849.63744.81457.2401.0031.282235CE3TRP2849.30042.44156.8371.0028.552236CD1TRP28410.01044.84559.4621.0030.062237NE1TRP2849.91345.63258.3091.0035.132238CZ2TRP2849.44845.12555.8921.0014.922239CZ3TRP2849.12142.75455.4921.0016.122240CH2TRP2849.19844.08255.0351.0023.052241CTRP2848.55140.66761.0911.0030.552242OTRP2848.83640.53062.2761.0043.892243NSER2858.32039.65060.2571.0039.672244CASER2858.52638.27960.6731.0031.642245CBSER2858.11637.26959.5691.0043.692246OGSER2859.00937.22358.4511.0036.012247CSER28510.02738.05860.9091.0032.812248OSER28510.90438.82760.4931.0025.832249NGLU28610.29736.97061.5891.0038.112250CAGLU28611.64136.56461.7691.0032.542251CBGLU28611.64935.37462.6991.0041.342252CGGLU28611.26435.69864.1191.0047.362253CDGLU28612.12336.81364.7061.0059.932254OE1GLU28613.36736.74464.5531.0053.752255OE2GLU28611.54737.75065.3201.0056.442256CGLU28612.06836.13760.3381.0042257OGLU28611.23135.74959.5241.0038.342258NTRP28713.36136.21960.0291.0031.322259CATRP28713.87335.79258.7271.0021.682260CBTRP28715.39736.03558.6171.0027.332261CGTRP28715.74237.44058.5761.0027.172262CD2TRP28715.65838.30457.4351.0024.892263CE2TRP28716.08639.60057.8641.0029.662264CE3TRP28715.26938.11656.0921.0031.752265CD1TRP28716.18338.19859.6071.0027.822266NE1TRP28716.39939.50259.1901.0028.122267CZ2TRP28716.14240.69757.0141.0022.702268CZ3TRP28715.33039.21255.2231.0028.872269CH2TRP28715.76740.48655.6871.0031.962270CTRP28713.63634.30458.5611.0026.432271OTRP28713.67433.53559.5161.0031.242272NSER28813.39133.87857.3441.0025.582273CASER28813.21432.44657.1461.0038.122274CBSER28812.69932.20155.7471.0028.042275OGSER28813.54532.88554.8531.0026.762276CSER28814.57031.74557.2911.0036.422277OSER28815.61232.39257.2211.0033.692278NPRO28914.55530.40857.4881.0040.562279CDPRO28913.32429.59857.6181.0051.452280CAPRO28915.73729.55857.6351.0026.022281CBPRO28915.13828.15557.6671.0037.892282CGPRO28913.84228.35058.2921.0044.212283CPRO28916.50929.80056.3311.0038.962284OPRO28915.89629.98855.2751.0037.402285NGLU29017.84029.77656.4021.0045.452286CAGLU29018.68430.02555.2461.0039.832287CBGLU29020.16429.92655.6171.0056.292288CGGLU29020.54930.70956.8691.0071.732289CDGLU29022.06030.80757.0571.0075.522290OE1GLU29022.71131.52456.2641.0066.312291OE2GLU29022.59130.16858.0001.0081.242292CGLU29018.41929.11654.0941.0044.842293OGLU29018.36827.89154.2481.0048.202294NALA29118.24829.71852.9231.0042.672295CAALA29117.99728.93451.7211.0049.002296CBALA29116.70629.37451.0541.0039.232297CALA29119.17029.22650.8211.0053.162298OALA29119.58630.38650.7141.0057.912299NMET29219.74228.20050.1981.0049.162300CAMET29220.85728.49249.3061.0051.932301CBMET29222.18728.00949.8761.0060.832302CGMET29222.33326.54250.2161.0063.532303SDMET29223.94826.46251.0981.0080.712304CEMET29225.15326.38049.6961.0056.942305CMET29220.71228.01047.8971.0050.872306OMET29219.86527.16247.5991.0041.342307NGLY29321.53028.58647.0171.0037.832308CAGLY29321.47328.18045.6391.0033.632309CGLY29322.67228.74044.9581.0042.012310OGLY29323.31629.60445.5331.0036.892311NTHR29422.99728.24943.7621.0042.182312CATHR29424.14528.79843.0561.0045.712313CBTHR29425.40527.85043.1211.0049.052314OG1THR29425.25326.76042.2291.0063.332315CG2THR29425.54527.26644.4691.0047.912316CTHR29423.76329.10441.6201.0040.852317OTHR29423.02728.35540.9681.0042.822318NPRO29524.23330.23841.1131.0028.042319CDPRO29524.98731.27541.8221.0028.462320CAPRO29523.91030.61639.7341.0029.712321CBPRO29524.58631.98039.5421.0019.022322CGPRO29524.79032.47440.9521.0028.892323CPRO29524.42029.59338.7051.0040.542324OPRO29525.25828.75539.0211.0038.792325NTRP29623.88129.68837.4871.0036.892326CATRP29624.26828.86936.3541.0042.812327CBTRP29623.40129.24535.1561.0035.152328CGTRP29623.89928.71333.8311.0050.032329CD2TRP29624.76129.39032.8961.0043.502330CE2TRP29624.96328.50931.8111.0054.352331CE3TRP29625.37730.65632.8671.0048.952332CD1TRP29623.63127.49033.2891.0046.072333NE1TRP29624.26027.36232.0831.0052.782334CZ2TRP29625.75828.85330.6921.0039.392335CZ3TRP29626.1663131.7531.0048.672336CH2TRP29626.34730.10230.6891.0038.072337CTRP29625.75729.08435.9701.0037.102338OTRP29626.30230.16236.1541.0043.362339NTHR29726.38328.02335.4701.0037.792340CATHR29727.74928.02734.9751.0046.982341CBTHR29728.80527.60436.0381.0048.292342OG1THR29728.54426.27236.4911.0059.022343CG2THR29728.79428.54837.2181.0042.062344CTHR29727.67426.97433.8941.0050.452345OTHR29726.66726.24233.8091.0040.712346NGLU29828.68726.92533.0311.0045.382347CAGLU29828.70425.89531.9981.0051.452348CBGLU29829.48326.35030.7691.0048.682349CGGLU29828.96627.61530.1661.0054.522350CDGLU29829.88128.16629.0891.0071.172351OE1GLU29831.06028.51829.3971.0073.912352OE2GLU29829.40828.24827.9351.0058.352353CGLU29829.42024.74532.6731.0053.072354OGLU29830.31724.97533.4981.0052.232355NSER29929.01923.52532.3171.0054.522356CASER29929.54922.27332.8681.0046.072357CBSER29928.82821.11632.1861.0050.732358OGSER29927.45621.09132.5350.0050.542359CSER29931.06622.06332.7721.0054.722360OSER29931.49221.14432.0390.0052.292361OXTSER29931.80822.82433.4280.0052.322362CBCYS692−2.30851.64365.0011.0091.482363SGCYS692−2.26750.55866.4851.0088.362364CCYS692−4.61452.73165.0001.0095.542365OCYS692−5.17151.92765.7811.0090.962366OXTCYS692−5.25253.35564.1141.0095.282367NCYS692−2.67353.70866.3541.0078.082368CACYS692−3.08652.97765.1171.0091.282369OH2TIP7101.4468.260101.8291.0048.772370OH2TIP711−3.10236.78752.1911.0056.742371OH2TIP7123.90028.65457.0331.0060.012372OH2TIP713−7.62925.597101.2441.0067.182373OH2TIP7149.0486.640103.7201.0043.722374OH2TIP715−12.46242.00487.2601.0052.022375OH2TIP716−4.818−9.266106.9811.0068.442376OH2TIP7176.93332.50577.4601.0044.822377OH2TIP7188.35931.38863.6291.0057.052378OH2TIP71912.04628.03753.2811.0047.272379OH2TIP72021.98643.47156.5971.0047.402380OH2TIP721−10.21115.823105.4471.0085.872381OH2TIP7222.51319.804105.8141.0085.682382OH2TIP72317.3463730.0051.0042.462383OH2TIP7241.53934.48259.8341.0039.642384OH2TIP7251.49532.56448.8991.0049.092385OH2TIP72619.15425.95340.5061.0038.012386OH2TIP7273.01438.70667.4551.0037.922387OH2TIP72821.60432.04337.2501.0032.042388OH2TIP72910.50429.19055.4611.0042.142389OH2TIP7305.40634.31942.9121.0060.512390OH2TIP7316.57929.58557.8771.0042.582391OH2TIP7326.54942.06948.3901.0033.932392OH2TIP73314.86836.50462.5091.0037.942393OH2TIP734−11.91215.524108.2971.0085.952394OH2TIP7368.68128.05457.6151.0063.732395OH2TIP73814.63446.65660.6201.0042.352396OH2TIP739−1.49031.35953.4881.0058.012397OH2TIP740−0.62948.63949.1801.0042.042398OH2TIP7413.35729.65398.1831.0049.622399OH2TIP742−2.41034.70266.3791.0045.292400OH2TIP7430.5045.606118.9121.0048.912401OH2TIP744−8.94732.69276.3631.0046.462402OH2TIP74510.01044.37563.0521.0043.802403OH2TIP746−6.07644.90465.9601.0074.812404OH2TIP74727.15425.04144.3311.0072.572405OH2TIP748−2.10433.50052.4041.0054.462406OH2TIP7493.2806.75198.9721.0056.632407OH2TIP7506.40438.16574.1261.0055.282408OH2TIP751−3.20347.58557.9461.0053.762409OH2TIP7526.1029.12899.8541.0042.592410OH2TIP753−6.38231.30288.6501.0044.762411OH2TIP754−6.11136.46759.8701.0065.592412OH2TIP755−10.40044.28259.9861.0053.422413OH2TIP756−9.045−3.63199.6531.0053.102414OH2TIP7579.96746.11483.3511.0058.782415OH2TIP7587.33847.61339.6851.0058.892416OH2TIP7593.00231.62496.3651.0065.652417OH2TIP7603.26010.81491.7011.0065.072418OH2TIP76116.89729.23427.7811.0043.862419OH2TIP7627.57237.08584.6121.0045.302420OH2TIP7633.21742.80860.2661.0039.352421OH2TIP76414.81024.83843.3431.0049.552422OH2TIP765−3.30843.00866.5341.0047.942423OH2TIP76626.21731.37258.1141.0075.332424OH2TIP76734.95733.16255.7271.0057.922425OH2TIP768−12.43433.60669.6611.0097.002426OH2TIP769−3.81043.92054.1671.0063.212427OH2TIP77011.64916.34599.8591.0054.892428OH2TIP7718.10035.40262.3181.0040.932429OH2TIP7726.77629.80877.7031.0062.932430OH2TIP77327.84338.31048.8441.0045.032431OH2TIP77419.52242.48956.3301.0055.142432OH2TIP77517.08622.83742.3741.0050.942433OH2TIP77621.93345.27454.0461.0054.292434OH2TIP77712.84840.31161.5701.0042.742435OH2TIP7782.85729.23069.6971.0053.352436OH2TIP7799.57740.49142.5211.0042.522437OH2TIP7800.12138.08551.1111.0033.562438OH2TIP7814.72245.41379.9681.0034.362439OH2TIP7826.14947.54381.0111.0039.312440OH2TIP7830.3286.268121.2301.0045.102441OH2TIP784−11.69938.11873.9911.0046.372442OH2TIP806−4.76255.89275.8481.0070.202443OH2TIP807−3.00521.56184.5121.0046.812444OH2TIP8109.01133.98291.8311.0055.392445OH2TIP81613.25629.33142.9981.0043.842446OH2TIP82524.11437.01434.5991.0042.272447C1NAG631−6.96731.49260.6651.0060.682448C2NAG631−8.40830.95460.6601.0067.892449N2NAG631−9.20831.63459.6601.0067.822450C7NAG631−9.24331.18858.4071.0070.872451O7NAG631−8.90830.04458.0881.0066.232452C8NAG631−9.71632.17357.3441.0068.042453C3NAG631−9.06331.10862.0441.0074.122454O3NAG631−10.28330.37762.0771.0073.272455C4NAG631−8.15130.59263.1661.0077.892456O4NAG631−8.6753164.4511.0082.642457C5NAG631−6.75431.16663.0081.0074.102458O5NAG631−6.23430.84961.7041.0067.492459C6NAG631−5.82030.57464.0331.0062.902460O6NAG631−6.15929.22264.2721.0071.832461C1NAG632−8.95530.03065.4091.0078.672462C2NAG632−8.99030.72066.7901.0080.332463N2NAG632−7.62931.04567.1941.0061.312464C7NAG632−7.16832.30167.1571.0059.622465O7NAG632−7.84633.28766.8201.0036.832466C8NAG632−5.71432.48967.5861.0044.942467C3NAG632−9.65529.85667.8851.0083.612468O3NAG632−9.94430.68169.0091.0084.182469C4NAG632−10.95929.18067.4081.0087.972470O4NAG632−11.38328.20168.3981.0087.592471C5NAG632−10.71628.50366.0381.0092.692472O5NAG632−10.23329.47165.0761.0084.662473C6NAG632−11.98527.92965.4371.0089.362474O6NAG632−12.66528.92664.6881.0083.972475C1MAN633−12.68627.68768.3721.0095.322476C2MAN633−12.80126.47969.3531.00101.032477O2MAN633−12.36826.86070.6621.0091.922478C3MAN633−14.25825.99069.4121.0098.592479O3MAN633−14.39124.88170.2971.0098.762480C4MAN633−15.11827.17169.8681.0095.702481O4MAN633−16.47126.77470.0471.0090.822482C5MAN633−14.99228.23768.7791.0093.082483O5MAN633−13.62628.73168.7301.0096.412484C6MAN633−15.96529.40468.8831.0098.622485O6MAN633−15.40330.52869.5921.0099.102486C1MAN635−16.40631.48969.8271.00107.042487C2MAN635−15.81932.86969.9531.00101.172488O2MAN635−16.87833.77370.2341.0099.302489C3MAN635−14.82432.88971.1081.00108.132490O3MAN635−14.37834.22571.3101.00110.392491C4MAN635−15.42632.33872.4341.00109.372492O4MAN635−14.34631.88073.3011.00110.972493C5MAN635−16.39331.13672.2211.00111.452494O5MAN635−17.16731.23570.9991.00112.102495C6MAN635−17.4033273.3331.00105.712496O6MAN635−18.63730.52672.8201.0095.502497C1NAG636−13.38332.81273.7491.00108.372498C2NAG636−12.07532.10574.1021.00106.852499N2NAG636−11.50731.45372.9291.00108.142500C7NAG636−10.79230.33273.0401.00106.542501O7NAG636−9.77830.24573.7371.00104.972502C8NAG636−11.27529.11972.2581.00104.702503C3NAG636−12.31131.09575.2361.00107.632504O3NAG636−11.05630.70675.7881.00104.942505C4NAG636−13.19531.62876.3781.00104.082506O4NAG636−13.87130.52776.9691.0099.702507C5NAG636−14.25332.68475.9871.00102.182508O5NAG636−13.81733.51974.8901.00107.222509C6NAG636−14.49833.63277.1431.0096.312510O6NAG636−13.88234.89076.8911.0092.062511C1NAG6215.29524.27851.6541.0082512C2NAG6213.82124.62851.9241.0083.542513N2NAG6213.75425.61352.9961.0076.262514C7NAG6212.94726.67252.9311.0069.412515O7NAG6213.11027.66053.6471.0066.322516C8NAG6211.78526.65051.9481.0070.852517C3NAG6213.01623.39752.3411.0086.422518O3NAG6211.63123.71352.3191.0089.942519C4NAG6213.27022.19651.4351.0087.262520O4NAG6212.69521.04152.0431.0094.402521C5NAG6214.78121.97551.2211.0087.922522O5NAG6215.40623.18550.7291.0081.542523C6NAG6215.08120.86250.2171.0086.522524O6NAG6215.17221.35848.8851.0084.862525C1NAG611−11.14811.43797.8061.0091.262526C2NAG611−11.42810.36696.7701.0092.622527N2NAG611−10.27310.29195.8931.0086.942528C7NAG611−9.9959.17895.2291.0091.342529O7NAG611−9.6308.14295.7891.0085.622530C8NAG611−10.1519.22493.7051.0087.482531C3NAG611−12.69810.68795.9511.0094.522532O3NAG611−13.1919.48395.3791.0097.372533C4NAG611−13.85211.36196.7421.0095.702534O4NAG611−14.63112.18695.8341.00101.812535C5NAG611−13.36112.25597.8991.0095.742536O5NAG611−12.29611.61898.6251.0095.102537C6NAG611−14.44812.54698.9081.0094.162538O6NAG611−14.54511.49399.8571.0096.642539C1NAG612−15.71611.62495.1531.00107.992540C2NAG612−16.52912.73494.4741.00106.192541N2NAG612−17.08413.64895.4521.00103.352542C7NAG612−17.84213.19296.4441.00102.042543O7NAG612−17.37712.92197.5451.00100.562544C8NAG612−19.33913.01496.1931.0092.282545C3NAG612−17.65412.13893.6321.00107.092546O3NAG612−18.28113.17892.8961.00115.892547C4NAG612−17.12311.07492.6651.00107.172548O4NAG612−18.21210.41492.0281.00107.762549C5NAG612−16.27910.05093.4231.00109.102550O5NAG612−15.21410.72094.1481.00114.002551C6NAG612−15.6469.01492.4991.00103.512552O6NAG612−14.2259.02992.5811.00101.852553C1NAG641−4.1675.922127.1131.0099.542554C2NAG641−5.0495.892128.3711.0099.252555N2NAG641−6.4306.155128.0091.0096.882556C7NAG641−7.2675.139127.8211.00103.112557O7NAG641−7.7154.451128.7511.00100.912558C8NAG641−7.6554.824126.3791.0099.462559C3NAG641−4.6176.895129.4431.0097.282560O3NAG641−5.2406.539130.6711.00102.542561C4NAG641−3.0996.926129.6491.0095.742562O4NAG641−2.7538.057130.4371.0089.412563C5NAG641−2.3696.995128.3021.00102.022564O5NAG641−2.7745.879127.4661.00106.082565C6NAG641−0.8486.924128.4611.0095.692566O6NAG641−0.1757.145127.2261.0083.832567SSO47013.45637.10542.5131.0074.452568O1SO47013.04838.02241.4151.0076.752569O2SO47014.91237.28942.7141.0074.312570O3SO47012.67337.41443.7321.0071.582571O4SO47013.23535.68642.1831.0068.842572SSO470210.16930.76490.1121.0091.702573O1SO470210.56032.17389.9051.0086.642574O2SO470211.29430.04690.7441.0097.682575O3SO47028.97630.73190.9891.0090.282576O4SO47029.87130.09788.8221.0090.492577OH2TIP8265.50345.24077.1771.0040.352578OH2TIP8275.13135.04295.5521.0043.302579OH2TIP8286.98123.68278.4981.0046.282580OH2TIP82912.40245.25862.6731.0041.432581OH2TIP83012.99531.42361.1311.0044.50REMARK coordinates from minimization and B-factor refinementREMARK refinement resolution: 6-2.4 AREMARK starting r= 0.2240 free_r= 0.2963REMARK final    r= 0.2202 free_r= 0.3008REMARK rmsd bonds= 0.014632 rmsd angles= 1.96283REMARK B rmsd for bonded mainchain atoms= 6.394 target= 3.5REMARK B rmsd for bonded sidechain atoms= 7.922 target= 4REMARK B rmsd for angle mainchain atoms= 7.619 target= 4REMARK B rmsd for angle sidechain atoms= 9.105 target= 4.5REMARK target= mlf final wa= 10REMARK final rweight= 0.0800 (with wa= 10)REMARK md-method= cartesian annealing schedule= slowcoolREMARK starting temperature= 2000 total md steps= 20 * 50REMARK cycles= 2 coordinate steps= 20 B-factor steps= 10REMARK sg= P4(3)2(1)2 a= 51.13 b= 51.13 c= 303.388 alpha= 90   beta= 90 gamma= 90REMARK topology file 1 : ../protein.topREMARK topology file 2 : ../carbohydrate.topREMARK topology file 3 : CNS_TOPPAR:water.topREMARK topology file 4 : CNS_TOPPAR:ion.topREMARK parameter file 1 : ../protein_rep.paramREMARK parameter file 2 : ../carbohydrate.paramREMARK parameter file 3 : CNS_TOPPAR:water_rep.paramREMARK parameter file 4 : CNS_TOPPAR:ion.paramREMARK molecular structure file: water_cyc9.5x.mtfREMARK input coordinates: posi_cyc9.6x.pdbREMARK reflection file= ggnew_hl.cvREMARK ncs= noneREMARK B-correction resolution: 6-2.4REMARK initial B-factor correction applied to fobs:REMARK  B11= −6.369 B22= −6.369 B33= 12.737REMARK  B12=   0.000 B13=   0.000 B23=  0.000REMARK B-factor correction applied to coordinate array B:   1.790REMARK bulk solvent: density level= 0.662903 e/A{circumflex over ( )}3,        B-factor= 87.8363 A{circumflex over ( )}2REMARK reflections with |Fobs|/sigma_F < 0.0 rejectedREMARK reflections with |Fobs| > 10000 * rms(Fobs) rejectedREMARK theoretical total number of refl. in resol. range: 15634(100%)REMARK number of unobserved reflections (no entry or |F|=0): 336(2.1%)REMARK number of reflections rejected: 0 (  0.0%)REMARK total number of reflections used:15298 (  97.9%)REMARK number of reflections in working set: 14567 ( 93.2%)REMARK number of reflections in test set:731 (  4.7%)CRYST1  51.130  51.130 303.388 90.00 90.00 90.00 P 43 21 2REMARK FILENAME=“refine_cyc9.7x.pdb”REMARK DATE:16-Oct-00 11:00:40  created by user: joseREMARK VERSION:1.0


Deduction of Dimer Coordinates


For a dimer comprising a first monomer unit and a second monomer unit, the structures coordinates (in Å) of the first monomer unit are defined by X1, Y1 and Z1 as set forth above, and the structure coordinates of the second monomer unit (in Å) are defined by X2, Y2 and Z2, which can be deduced from the following equations:

X2=51.13−Y1  (1)
Y2=51.13−X1  (2)
Z2=151.7−Z1  (3)









LENGTHY TABLE REFERENCED HERE




US20070032640A1-20070208-T00001


Please refer to the end of the specification for access instructions.











LENGTHY TABLE




The patent application contains a lengthy table section. A copy of the table is available in electronic form from the USPTO web site () An electronic copy of the table will also be available from the USPTO upon request and payment of the fee set forth in 37 CFR 1.19(b)(3).





Claims
  • 1. A method of selecting or designing a compound that interacts with an IL-6 receptor and modulates an activity mediated by the receptor, the method comprising: (a) assessing the stereochemical complementarity between a compound and a topographic region of the receptor, wherein the receptor comprises, (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations; (b) obtaining a compound which possesses stereochemical complementarity to a topographic region of the receptor; and (c) testing the compound for its ability to modulate an activity associated with the receptor.
  • 2. A method as claimed in claim 1 wherein the topographic region of the IL-6 receptor is a ligand binding surface defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 or 276-281 and combinations thereof.
  • 3. A method as claimed in claim 1 wherein the topographic region of the IL-6 receptor is a region on the homodimer interface defined by resides 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 or 282-290 and combinations thereof.
  • 4. A method as claimed in claim 1 wherein the topographic region of the IL-6 receptor is defined by residues 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178.
  • 5. A method as claimed in claim 1 wherein the topographic region of the IL-6 receptor is defined by residues 233-239, 244-248 and 270-290.
  • 6. A method for identifying a potential modulator compound for an IL-6 receptor which method comprises: (a) providing a three-dimensional structure of amino acids 1-299 of an IL-6 receptor as defined by the atomic coordinates shown in Appendix I, or atomic coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations; (b) providing the three-dimensional structure of a candidate compound; and (c) assessing the stereochemical complementarity between the three-dimensional structure of step (b) and a topographical region of the three-dimensional structure of step (a).
  • 7. A method as claimed in claim 6 which further comprises: (d) synthesising or obtaining a candidate compound assessed in step (c) as possessing stereochemical complementarity with a topographical region of the three-dimensional structure of step (a); (e) determining the ability of the candidate compound to interact with and/or modulate the activity of the IL-6 receptor.
  • 8. A method as claimed in claim 6 wherein the topographic region of the IL-6 receptor is a ligand binding surface defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 or 276-281 and combinations thereof.
  • 9. A method as claimed in claim 6 wherein the topographic region of the IL-6 receptor is a region on the homodimer interface defined by resides 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 or 282-290 and combinations thereof.
  • 10. A method as claimed in claim 6, wherein the topographic region of the IL-6 receptor is defined by residues 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178.
  • 11. A method as claimed in claim 6 wherein the topographic region of the IL-6 receptor is defined by residues 233-239, 244-248 and 270-290.
  • 12. A computer-assisted method for identifying potential compounds able to interact with an IL-6 receptor and thereby modulate an activity mediated by the receptor, using a programmed computer comprising a processor, an input device, and an output device, comprising the steps of: (a) inputting into the programmed computer, through the input device, data comprising the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations; (b) generating, using computer methods, a set of atomic coordinates of a structure that possesses stereochemical complementarity to the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations, thereby generating a criteria data set; (c) comparing, using the processor, the criteria data set to a computer database of chemical structures; (d) selecting from the database, using computer methods, chemical structures which are similar to a portion of said criteria data set; and (e) outputting, to the output device, the selected chemical structures which are complementary to or similar to a portion of the criteria data set.
  • 13. A method for evaluating the ability of a chemical entity to interact with an IL-6 receptor, said method comprising the steps of: (a) creating a computer model of at least one region of the IL-6 receptor using structure coordinates wherein the root mean square deviation between said structure coordinates and the structure coordinates of amino acids 1-299 of IL-6 receptor as set forth in Appendix I is not more than about 1.5 Å; (b) employing computational means to perform a fitting operation between the chemical entity and said computer model of the binding surface; and (c) analysing the results of said fitting operation to quantify the association between the chemical entity and the binding surface model.
  • 14. A computer for producing a three-dimensional representation of a molecule or molecular complex, wherein the computer comprises: (a) a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the machine readable data comprise the atomic coordinates of amino acids 1-299 of the IL-6 receptor as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations; (b) a working memory for storing instructions for processing the machine-readable data; (c) a central-processing unit coupled to the working memory and to the machine-readable data storage medium, for processing the machine-readable data into the three dimensional representation; and (d) an output hardware coupled to the central processing unit, for receiving the three-dimensional representation.
  • 15. A method of selecting or designing a compound that interferes with the formation of an IL-6, IL-6 receptor, gp130 hexameric complex, the method comprising (a) assessing the stereochemical complementarity between the compound and a topographic region of the complex, wherein the complex is characterised by (i) the amino acids of IL-6, IL-6 receptor and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations; (b) obtaining a compound which possesses stereochemical complementarity to a topographic region of the receptor; and (c) testing the compound for its ability to interfere with the formation of the IL-6, IL-6 receptor, gp130 hexameric complex.
  • 16. A method as claimed in claim 15 wherein the topographic region of the complex is selected from the group consisting of; (i) amino acids 20, 24, 25, 27, 29, 31, 32, 35, 36, 39, 40-42, 45, 51, 52, 56, 60, 62-64, 69-71, 75-79, 93, 94, 110-115, 117-119, 121-123, 125, 126, 128-147, 151, 152, 155, 159, 161-169, 172, 173, 176, 177, 179, 180, 183 and 184 of IL-6 and combinations thereof; and (ii) amino acids 1-5, 8-15, 49, 75-78, 114, 116, 132-137, 140-154, 163-172, 177-184, 193-196, 226, 227, 229, 231, 232, 281-283 and 285 of gp130 and combinations thereof; and (iii) amino acids 1, 6, 107, 108, 135-139, 161-169, 190, 193, 226-231 and 277-281 of IL-6R and combinations thereof.
  • 17. A computer-assisted method for identifying compounds that interfere with the formation of an IL-6, IL-6 receptor, gp130 hexameric complex, using a programmed computer comprising a processor, an input device, and an output device, comprising the steps of: (a) inputting into the programmed computer, through the input device, data comprising the atomic coordinates of amino acids IL-6, IL-6 receptor and gp130 as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations; (b) generating, using computer methods, a set of atomic coordinates of a structure that possesses stereochemical complementarity to the atomic coordinates the IL-6, IL-6 receptor, gp130 hexameric complex as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations, thereby generating a criteria data set; (c) comparing, using the processor, the criteria data set to a computer database of chemical structures; (d) selecting from the database, using computer methods, chemical structures which are similar to a portion of said criteria data set; and (e) outputting, to the output device, the selected chemical structures which are complementary to or similar to a portion of the criteria data set.
  • 18. A method for evaluating the ability of a chemical entity to interact with an IL-6, IL-6 receptor, gp130 hexameric complex, said method comprising the steps of: (a) creating a computer model of at least one region of the IL-6, IL-6 receptor, gp130 hexameric complex using structure coordinates wherein the root mean square deviation between said structure coordinates and the structure coordinates set forth in Appendix I is not more than about 1.5 Å; (b) employing computational means to perform a fitting operation between the chemical entity and said computer model; and (c) analysing the results of said fitting operation to quantify the association between the chemical entity and the model.
  • 19. A computer for producing a three-dimensional representation of a molecule or molecular complex, wherein the computer comprises: (a) a machine-readable data storage medium comprising a data storage material encoded with machine-readable data, wherein the machine readable data comprise the atomic coordinates of the IL-6, IL-6 receptor, gp130 hexameric complex as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å, or one or more subsets of said amino acids, or one or more subsets of the coordinates shown in Appendix II by whole body translations and/or rotations; (b) a working memory for storing instructions for processing the machine-readable data; (c) a central-processing unit coupled to the working memory and to the machine-readable data storage medium, for processing the machine-readable data into the three dimensional representation; and (d) an output hardware coupled to the central processing unit, for receiving the three-dimensional representation.
  • 20. A crystalline composition comprising a crystal of an IL-6 receptor.
  • 21. A composition according to claim 20 wherein the crystal has the structure defined by the atomic coordinates as shown in Appendix I.
  • 22. A method of assessing the interaction between a compound and an IL-6 receptor, the method comprising contacting a crystalline composition according to claim 20 with the compound and measuring the level of binding of the compound to the crystal of the IL-6 receptor.
  • 23. A method of using molecular replacement to obtain structural information about a molecule or a molecular complex of unknown structure, comprising the steps of: (i) crystallising said molecule or molecular complex; (ii) generating an X-ray diffraction pattern from said crystallized molecule or molecular complex; (iii) applying at least a portion of the structure coordinates set forth in Appendix I to the X-ray diffraction pattern to generate a three-dimensional electron density map of at least a portion of the molecule or molecular complex whose structure is unknown.
  • 24. A method according to claim 23 wherein the molecule of unknown structure is an IL-6 receptor or variant.
  • 25. A method according to claim 23 wherein the molecular complex of unknown structure is a complex of an IL-6 receptor, or variant thereof, and a ligand.
  • 26. A method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound identified by a method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of the receptor, wherein the receptor comprises: (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations.
  • 27. A method as claimed in claim 26 wherein the topographic region of the IL-6 receptor is a ligand binding surface defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 or 276-281 and combinations thereof.
  • 28. A method as claimed in claim 26 wherein the topographic region of the IL-6 receptor is a region on the homodimer interface defined by resides 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 or 282-290 and combinations thereof.
  • 29. A method as claimed in claim 26 wherein the topographic region of the IL-6 receptor is defined by residues 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178.
  • 30. A method as claimed in claim 26 wherein the topographic region of the IL-6 receptor is defined by residues 233-239, 244-248 and 270-290.
  • 31. A method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound identified by a method comprising the step of assessing the stereochemical complementarity between the compound and a topographic region of an IL-6, IL-6R, gp130 hexameric complex, wherein the hexameric complex comprises: (i) the amino acids of IL-6, IL-6 receptor and gp130 positioned at atomic coordinates as shown in Appendix II, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix II by whole body translations and/or rotations.
  • 32. A method as claimed in claim 31 wherein the topographic region of the complex is selected from the group consisting of; (i) amino acids 20, 24, 25, 27, 29, 31, 32, 35, 36, 39, 40-42, 45, 51, 52, 56, 60, 62-64, 69-71, 75-79, 93, 94, 110-115, 117-119, 121-123, 125, 126, 128-147, 151, 152, 155, 159, 161-169, 172, 173, 176, 177, 179, 180, 183 and 184 of IL-6 and combinations thereof; and (ii) amino acids 1-5, 8-15, 49, 75-78, 114, 116, 132-137, 140-154, 163-172, 177-184, 193-196, 226, 227, 229, 231, 232, 281-283 and 285 of gp130 and combinations thereof; and (iii) amino acids 1, 6, 107, 108, 135-139, 161-169, 190, 193, 226-231 and 277-281 of IL-6R and combinations thereof.
  • 33. A method as claimed in claim 26 wherein the disease is selected from multiple myeloma, lymphoma, inflammation, rheumatoid arthritis, prostate cancer, Castleman's disease, AIDS, mesangial proliferative glomerulonephritis, Kaposi's sarcoma, sepsis, osteoporosis and psoriasis.
  • 34. A compound comprising an extracellular portion of IL-6R, wherein the extracellular portion is modified at one or more amino acids of IL-6R selected from the group consisting of: (i) amino acids 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281; or (ii) amino acids 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 and 282-290; or (iii) amino acids 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178; or (iv) amino acids 233-239, 244-248 and 270-290; or (v) amino acids 1, 6, 107, 108, 135-139, 161-169, 190, 193, 226-231 and 277-281.
  • 35. A pharmaceutical composition comprising a compound as claimed in claim 34.
  • 36. A method of preventing or treating a disease associated with signalling by the IL-6 receptor which method comprises administering to a subject in need thereof a composition according to claim 35.
  • 37. A method of modulating the activity of an IL-6 receptor which method comprises contacting the IL-6 receptor with a compound of formula A-B-C, wherein A consists of three fused 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; or two non-fused 5- or 6-membered saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; C consists of three fused 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; or two non-fused 5-, 6- or 7-membered saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; and B is an aliphatic linker having a length substantially equivalent to an ethylene moiety; wherein said compound has stereocomplementarity to a ligand binding topographic region of: (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations; said ligand binding topographic region being defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281 of said IL-6 receptor, or combinations thereof.
  • 38. A method according to claim 37 wherein: A has the following formula wherein, (i) Z is a bond; or Z, R4 and R10 taken together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S; or Z, R3 and R6 taken together form an aryl ring or a heteroaryl, cycloalkyl, cycloalkenyl or heterocyclyl ring having 5, 6 or 7 members, wherein said aryl ring or said heteroaryl, cycloalkyl, cycloalkenyl, heterocyclyl ring are optionally substituted; (ii) R7, R8 or R9 are bonded to linker B; (iii) R1, R2 and R5 are each independently, hydrogen, C1-C4 alkyl, halogen, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; (iv) R3 and R6 unless bonded together with Z are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; (v) R4 and R10 unless bonded together with Z, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; (vi) R7, R8 and R9 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; B has the following formula: wherein, (i) Y and Y1 are each independently C, O, S or N, provided that Y and Y1 are not both O, N or S; (ii) R11 and R12 are each independently hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; C has the following formula: wherein, (i) X is a bond; or X, R14 and R18 taken together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S; or X, R15 and R22 taken together form an aryl ring or a heteroaryl, cycloalkyl, cycloalkenyl or heterocyclyl ring having 5, 6 or 7 members, wherein said aryl ring or said heteroaryl, cycloalkyl, cycloalkenyl, heterocyclyl ring are optionally substituted; (ii) R13, R16 or R17 are bonded to linker B; (iii) R19, R20 and R21 are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl; (iv) R14 and R18 unless bonded together with Z are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl; (v) R15 and R22 unless bonded together with Z, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl; (vi) R13, R16 and R17 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl.
  • 39. A method of modulating the activity of an IL-6 receptor which method comprises contacting the IL-6 receptor with a compound of formula A-B-D, wherein A consists of three fused 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted at any position; or two non-fused 5-, 6- or 7-membered saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; D consists of one, or two fused, 5-, 6- or 7-membered, saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted at any position; or two non-fused 5-, 6- or 7-membered saturated, unsaturated or aryl rings, optionally containing one or more heteroatoms and optionally substituted; and B is an aliphatic linker having a length substantially equivalent to an ethylene moiety; wherein said compound has stereocomplementarity to a ligand binding topographic region of: (i) amino acids 1-299 of the IL-6 receptor positioned at atomic coordinates as shown in Appendix I, or structural coordinates having a root mean square deviation from the backbone atoms of said amino acids of not more than 1.5 Å; or (ii) one or more subsets of said amino acids related to the coordinates shown in Appendix I by whole body translations and/or rotations; said ligand binding topographic region being defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 and 276-281 of said IL-6 receptor, or combinations thereof.
  • 40. A method according to claim 39 wherein: A has the following formula: wherein, (i) Z is a bond; or Z, R4 and R10 taken together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S; or Z, R3 and R6 taken together form an aryl ring or a heteroaryl, cycloalkyl, cycloalkenyl or heterocyclyl ring having 5 or 6 members, wherein said aryl ring or said heteroaryl, cycloalkyl, cycloalkenyl, heterocyclyl ring are optionally substituted; (ii) R7, R8 or R9 are bonded to linker B; (iii) R1, R2 and R5 are each independently, hydrogen, C1-C4 alkyl, halogen, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; (iv) R3 and R6 unless bonded together with Z are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; (v) R4 and R10 unless bonded together with Z, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; (vi) R7, R8 and R9 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR24 or NR25R26, where R24, R25 and R26 are each independently hydrogen or C1-C4 alkyl; B has the following formula: wherein, Y and Y1 are each independently C, O, S or N, provided that Y and Y1 are not both O, N or S; D has the following formula: wherein, (i) Y2, Y3 or Y4 are each independently C, O, N or S, provided that at least two of Y2, Y3 and Y4 are C; (ii) R13, R17 or R18 are bonded to linker B; (iii) R14, R15 and R16 are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl; or (iv) R16 is hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl, and R14 and R15 together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S; or (v) R14 is hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl R15 and R16 together form an optionally substituted, saturated, unsaturated or aryl ring having 5, 6 or 7 members, optionally containing one or more heteroatoms selected from O, N and S; (vi) R13, R17 and R18 unless bonded to linker B, are each independently, hydrogen, C1-C4 alkyl, halogen, O, OR27 or NR28R29, where R27, R28 and R29 are each independently hydrogen or C1-C4 alkyl.
  • 41. A method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound as defined in claim 37.
  • 42. A method as claimed in claim 7 wherein the topographic region of the IL-6 receptor is a ligand binding surface defined by residues 106-110, 133-138, 160-168, 190-193, 227-233, 250-256 or 276-281 and combinations thereof.
  • 43. A method as claimed in claim 7 wherein the topographic region of the IL-6 receptor is a region on the homodimer interface defined by resides 1-5, 19-23, 65-69, 93-99, 118, 119, 132-141, 166-172, 179-196, 241-250, 261, 262, 272-276 or 282-290 and combinations thereof.
  • 44. A method as claimed in claim 7 wherein the topographic region of the IL-6 receptor is defined by residues 11, 45, 46, 55, 62-66, 69-72, 75, 81, 88, 90-93, 122-124 and 178.
  • 45. A method as claimed in claim 7 wherein the topographic region of the IL-6 receptor is defined by residues 233-239, 244-248 and 270-290.
  • 46. A method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound as defined in claim 38.
  • 47. A method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound as defined in claim 39.
  • 48. A method for preventing or treating a disease associated with signaling by the IL-6 receptor which method comprises administering to a subject in need thereof a compound as defined in claim 40.
Priority Claims (1)
Number Date Country Kind
7695 Sep 2001 AU national
PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/AU02/01255 9/16/2002 WO 5/20/2004