Hepatocyte growth factor (HGF) also known as scatter factor (SF), is the ligand for Met (Bottaro et al., 1991), a receptor tyrosine kinase encoded by the c-met protooncogene (Cooper et al., 1984). HGF binding to Met induces phosphorylation of the intracellular kinase domain resulting in activation of a complex set of intracellular pathways that lead to cell growth, differentiation and migration in a variety of cell types; several recently published reviews provide a comprehensive overview (Birchmeier et al., 2003; Trusolino and Comoglio, 2002; Maulik et al., 2002). In addition to its fundamental importance in embryonic development and tissue regeneration, the HGF/Met signaling pathway has also been implicated in invasive tumor growth and metastasis and as such represents an interesting therapeutic target (Birchmeier et al., 2003; Trusolino and Comoglio, 2002; Danilkovitch-Miagkova and Zbar, 2002; Ma et al., 2003).
HGF belongs to the plasminogen-related growth factor family and comprises a 69 kDa α-chain, containing the N-terminal finger domain (N) and four Kringle (K1-K4) domains, and a 34 kDa β-chain, which has strong similarity to protease domains of chymotrypsin-like serine proteases from Clan PA(S)/FamilyS1 (Nakamura et al., 1989; Donate et al., 1994; Rawlings et al., 2002). Like plasminogen and other serine protease zymogens, HGF is secreted as a single chain precursor form (scHGF). scHGF binds to heparin sulfate proteoglycans, such as syndecan-1 (Derksen et al., 2002) on cell surfaces or in the extracellular matrix. Heparin sulfate proteoglycans bind to the N domain (Hartmann et al., 1998), which also contributes to the high affinity Met binding together with amino acids located in K1 (Lokker et al., 1994). Although scHGF is able to bind Met with high affinity, it cannot activate the receptor (Lokker et al., 1992; Hartmann et al., 1992). Acquisition of HGF signaling activity is contingent upon proteolytic cleavage (activation) of scHGF at Arg494-Val495 resulting in the formation of mature HGF, a disulfide-linked α/β heterodimer (Lokker et al., 1992; Hartmann et al., 1992; Naldini et al., 1992). The protease-like domain of HGF (HGF β-chain) lacks the Asp [c102]-His [c57]-Ser [c195] (standard chymotrypsinogen numbering in brackets used throughout) catalytic triad found in all serine proteases (Perona and Craik, 1995; Hedstrom, 2002), having a Gln534 [c57] and Tyr673 [c195], and thus is devoid of any enzymatic activity.
Currently, there is no detailed structural information about HGF β-chain or HGF β-chain binding and activation of Met. A completely solved crystal structure of the HGF β-chain can be used, for example, in assays for Met-ligand (e.g., HGF β-chain) interaction and function, modeling the structure-function relationship of Met and other molecules, diagnostic assays for mutation-induced pathologies, and rational design of agents useful in modulating Met or HGF activity.
In some embodiments, the present disclosure provides a crystal of hepatocyte growth factor beta chain (HGF β) and the structural coordinates of the crystal. Coordinates of the crystal structure are listed in Table 5. In some embodiments, HGF β has an amino acid sequence of SEQ ID NO:1, or conservative substitutions thereof.
In some embodiments, the disclosure provides a crystal structure of hepatocyte growth factor beta chain (HGF β), as well as use of the crystal structure to model HGF β activity. This use of the structure includes modeling the interaction of ligands with the HGF β; activation and inhibition of HGF β; and the rational design of modulators of HGF and/or HGF β activity. For example, these modulators include ligands that interact with HGF β and modulate HGF β activities, such as cell migration, HGF β binding to Met, and Met phosphorylation, as well as molecules that mimic HGF β that can bind to a ligand but have altered ability to modulate the activity of a ligand.
In other embodiments, amino acid residues that form the binding site for the Met receptor on HGF β are identified and are useful, for example, in methods to model the structure of HGF binding site and to identify agents that can associate with, bind or fit into the binding site. Other structural features of HGF β have also been identified, including the active site, activation domain, a tunnel, and a HGF β dimerization region. Amino acid residues that form these structural features can also be used in methods to model the structure and to identify agents that can interact with these structural features.
The term “hepatocyte growth factor” or “HGF”, as used herein, refers, unless specifically or contextually indicated otherwise, to any native or variant (whether native or synthetic) HGF polypeptide that is capable of binding to Met and/or activating the HGF/Met signaling pathway under conditions that permit such process to occur, for example, conditions that allow for the formation of the two chain form. The term “wild type HGF sequence” generally refers to an amino acid sequence found in a naturally occurring HGF and includes naturally occurring truncated or secreted forms, variant forms (e.g. alternatively spliced forms) and naturally occurring allelic variants.
“HGF β” or “HGF β-chain”, “HGF-beta” or variations thereof, refers to any HGF β chain having the conformation that is adopted by wild type HGF β chain upon conversion of wild type HGF protein from a single chain form to a 2 chain form (i.e., α and β chain). In some embodiments, the conversion results at least in part from cleavage between residue 494 and residue 495 of the wild type HGF protein. In some embodiments, the conformation refers specifically to the conformation of the activation domain of the protease-like domain in the β chain. In some embodiments, the conformation refers even more specifically to the conformation of the active site of the protease-like domain in the HGF β chain. Generally, adoption of the conformation reveals a Met binding site, as described herein. HGF β includes variants of wild type HGF β, for example, those shown in Table 1 and in SEQ ID NO:1. The HGF β chain may be isolated from a variety of sources such as human tissue or prepared by recombinant or synthetic methods. One embodiment of HGF β chain comprises an amino acid sequence of SEQ ID NO:1 in Table 7. Another embodiment of HGF β chain comprises an amino acid sequence of SEQ ID NO:5 in Table 9.
“HGF β variant” as used herein refers to polypeptide that has a different sequence than a reference polypeptide. In some embodiments, the reference polypeptide is a HGF β polypeptide comprising SEQ ID NO:1 in Table 7. In some embodiments, a variant has at least 80% amino acid sequence identity with the amino acid sequence of Table 7 (SEQ ID NO:1). The variants include those polypeptides that have substitutions, additions or deletions. The variants also include those polypeptides that have at least one conservative amino acid substitution, and preferably all substitutions are conservative. In some embodiments, the HGF β variant has about 1-25 conservative amino amino acid substitutions, more preferably about 1-20 conservative amino acids substitutions, more preferably about 1-10 conservative amino acid substitutions, more preferably about 1-5 conservative amino acid substitutions, and more preferably about 1-2 conservative amino acid substitutions. In some embodiments, the variants have the biological activity of binding to the Met receptor and/or activating it. In other embodiments, the variant can bind to the Met receptor but not activate it.
Ordinarily, a HGF β variant polypeptide will have at least 80% sequence identity, more preferably at least 81% sequence identity, more preferably at least 82% sequence identity, more preferably at least 83% sequence identity, more preferably at least 84% sequence identity; more preferably at least 85% sequence identity, more preferably at least 86% sequence identity, more preferably at least 87% sequence identity, more preferably at least 88% sequence identity, more preferably at least 89% sequence identity, more preferably at least 90% sequence identity, more preferably at least 91% sequence identity, more preferably at least 92% sequence identity, more preferably at least 93% sequence identity, more preferably at least 94% sequence identity, more preferably at least 95% sequence identity, more preferably at least 96% sequence identity, more preferably at least 97% sequence identity, more preferably at least 98% sequence identity, more preferably at least 99% sequence identity or greater with a HGF β polypeptide having an amino acid sequence comprising SEQ ID NO:1 or SEQ ID NO:5.
“Binding site” as used herein, refers to a region of a molecule or molecular complex that, as a result of its shape, distribution of electrostatic charge and/or distribution of non-polar regions, favorably associates with a ligand. Thus, a binding site may include or consist of features such as cavities, surfaces, or interfaces between domains. Ligands that may associate with a binding site include, but are not limited to, cofactors, substrates, receptors, agonists, and antagonists. Binding site refers to a functional binding site and/or a structural binding site. A structural binding site includes “in contact” amino acid residues as determined from examination of a three-dimensional structure. “Contact” can be determined using van der Waals radii of atoms, or by proximity sufficient to exclude solvent, typically water, from the space between a ligand and the molecule or molecular complex. “In contact” amino acid residues may not cause changes, for example, in a biochemical assay, a cell-based assay, or an in vivo assay used to define a functional binding site, but may contribute to the formation of the three-dimensional structure. Typically, at least one or more of “in contact” amino acid residues do not cause any change in these assays. A functional binding site includes amino acid residues that are identified as binding site residues based upon loss or gain of function, for example, loss of binding to ligand upon mutation of the residue. In some embodiments, the amino acid residues of a functional binding site are a subset of the amino acid residues of the structural binding site.
The term “HGF β structural binding site” includes all or a portion of a molecule or molecular complex whose shape, distribution of electrostatic charge and/or distribution of non-polar regions is sufficiently similar to at least a portion of a binding site of HGF β for Met as to be expected to bind Met or related structural analogs of Met. In some embodiments, a structurally equivalent ligand binding site is defined by a root mean square deviation from the structure coordinates of the backbone atoms of the amino acids that make up binding sites in HGF β of at most about 0.70 Å, preferably about 0.5 Å.
In some embodiments, a structural binding site for Met receptor on HGF β comprises, consists essentially of, or consists of at least one amino acid residue corresponding to residues 513, 516, 533, 534, 537-539, 578, 619, 647, 656, 668-670, 673, 692-697, 699, 702, 705 or 707, or mixtures thereof. In some embodiments, a functional binding site comprises, consists essentially of, or consists of at least one amino acid residue corresponding to residues 534, 578, 673, 692, 694 to 696, or mixtures thereof.
“Active site” refers to a substrate binding cleft and a catalytic triad typically associated with a polypeptide with enzymatic activity. The substrate binding cleft includes the “S-1 binding site” where a substrate/enzyme interaction arises. The catalytic triad refers to 3 amino acids that are associated with an enzymatic activity of proteolysis. In typical serine proteases, the catalytic triad residues are Asp [c102], Ser[c195], and His[c57]. In a wild-type HGF molecule, the corresponding catalytic triad residues are Asp578, Tyr673, and Gln534. The active site of HGF β also includes amino acids that are a part of the Met binding site.
“Activation site” of HGF refers to a cleavage site that converts a single chain HGF to a two chain form including an alpha and beta chain. The cleavage at this site results in a conformational change in the molecule, including the “activation domain” and formation of a binding site for Met receptor on the HGF β chain. In a wild-type HGF, an activation site is located at, between or adjacent to amino acid residues 494 and 495.
“Activation domain” refers to the region on a HGF β chain that undergoes conformational change upon cleavage of a single chain HGF. Upon of cleavage of scHGF at, between or adjacent to amino acids 494 and 495, the HGF β chain undergoes a conformational change, including the formation of a Met receptor binding site. In some embodiments, the activation domain in a HGF β comprises, consists essentially of, or consists of at least one amino acid residue corresponding to residues of HGF β from about 495 to 498, amino acid residues from about 615 to about 625, or from about 660 to about 670, from about 692 to about 697, from about 642 to about 652, in some instances, amino acid residues from about 550 to about 560, or mixtures thereof.
“Tunnel” refers to a conformation of a polypeptide, or portion thereof, that forms a void. In a HGF β crystal structure, the void is formed by amino acid residues. In some embodiments, the void is formed by at least one amino acid residue in a position that comprises, consists essentially of, or consists of at least one amino acid residue 643, 673, from about 693 to 706, from about 660 to 670, or 691, or mixtures thereof.
“Dimerization domain” refers to a region of a HGF β chain that interacts with another HGF β chain to form a dimer. Upon cleavage of scHGF, the HGF β chain undergoes a conformational change. The HGF-β N-terminal residue 495 forms a salt bridge with residue Asp 672. In some embodiments, the dimerization region of a HGF β comprises, consists essentially of, or consists of at least one amino acid residue corresponding to residues of HGF β from about 495 to 502, the [c140 loop] amino acids including Y619, T620, G621, the [c180] loop amino acids including 662 to 665, or the [c220] loop amino acids including 700, or mixtures thereof.
As used herein, “crystal” refers to one form of solid state of matter in which atoms are arranged in a pattern that repeats periodically in three dimensions, typically forming a lattice.
“Complementary or complement” as used herein, refers to the fit or relationship between two molecules that permits interaction, including for example, space, charge, three-dimensional configuration, and the like.
The term “corresponding” or “corresponds” refers to an amino acid residue or amino acid sequence that is found at the same positions or positions in a sequence when the amino acid position or sequences is aligned with a reference sequence. In some embodiments, the reference sequence is HGF β having a sequence of SEQ ID NO:1. It will be appreciated that when the amino acid position or sequence is aligned with the reference sequence the numbering of the amino acids may differ from that of the reference sequence or a different numbering system may be employed.
“Heavy atom derivative”, as used herein, refers to a derivative produced by chemically modifying a crystal with a heavy atom such as Hg, Au, or halogen.
“Structural homolog” of HGF β as used herein refers to a protein that contains one or more amino acid substitutions, deletions, additions, or rearrangements with respect to the amino acid sequence of HGF β, but that, when folded into its native conformation, exhibits or is reasonably expected to exhibit at least a portion of the tertiary (three-dimensional) structure of HGF β. Portions of the three dimensional structure include structural features such as the binding site for Met on HGF β, activation domain, activation site, active site, tunnel, dimerization region and combinations thereof. For example, structurally homologous molecules of HGF β include MSP and HGF β variants, preferably variants with one or more conservative amino acid substitutions, preferably only conservative amino acid substitutions. Homolog tertiary structure can be probed, measured, or confirmed by known analytic or diagnostic methods, for example, X-ray, NMR, circular dichroism, a panel of monoclonal antibodies that recognize native HGF β, and like techniques. For example, structurally homologous molecules can contain deletions or additions of one or more contiguous or noncontiguous amino acids, such as a loop or a domain. Structurally homologous molecules also include “modified” HGF β molecules that have been chemically or enzymatically derivatized at one or more constituent amino acids, including side chain modifications, backbone modifications, and N- and C-terminal modifications including acetylation, hydroxylation, methylation, amidation, and the attachment of carbohydrate or lipid moieties, cofactors, and like modifications.
“Ligand”, as used herein, refers to an agent that associates with a binding site on a molecule, for example, Met and/or HGF β binding sites, and may be an antagonist or agonist of Met or the HGF β activity. Ligands include molecules that mimic HGF β binding to Met and in some embodiments, are not capable of activating HGF β/Met signaling pathway.
“Molecular complex”, as used herein, refers to a combination of bound substrate or ligand with polypeptide, such as HGF β bound to Met, or a ligand bound to HGF β.
“Machine-readable data storage medium”, as used herein, refers to a data storage material encoded with machine-readable data, wherein a machine programmed with instructions for using such data is capable of displaying data in the desired format, for example, a graphical three-dimensional representation of molecules or molecular complexes.
“Scalable,” as used herein, refers to the increasing or decreasing of distances between coordinates (configuration of points) by a scalar factor while keeping the angles essentially the same.
“Space group symmetry”, as used herein, refers to the whole symmetry of the crystal that combines the translational symmetry of a crystalline lattice with the point group symmetry. A space group is designated by a capital letter identifying the lattice type (e.g. P, A, F,) followed by the point group symbol in which the rotation and reflection elements are extended to include screw axes and glide planes. Note that the point group symmetry for a given space group can be determined by removing the cell centering symbol of the space group and replacing all screw axes by similar rotation axes and replacing all glide planes with mirror planes. The point group symmetry for a space group describes the true symmetry of its reciprocal lattice.
“Unit cell”, as used herein, refers to the atoms in a crystal that are arranged in a regular repeating pattern, in which the smallest repeating unit is called the unit cell. The entire structure can be reconstructed from knowledge of the unit cell, which is characterized by three lengths (a, b and c) and three angles (α, β and γ). The quantities a and b are the lengths of the sides of the base of the cell and γ is the angle between these two sides. The quantity c is the height of the unit cell. The angles α and β describe the angles between the base and the vertical sides of the unit cell.
“X-ray diffraction pattern” refers to the pattern obtained from X-ray scattering of the periodic assembly of molecules or atoms in a crystal. X-ray crystallography is a technique that exploits the fact that X-rays are diffracted by crystals. X-rays have the proper wavelength (in the Ångström (Å) range, approximately 10−8 cm) to be scattered by the electron cloud of an atom of comparable size. Based on the diffraction pattern obtained from X-ray scattering of the periodic assembly of molecules or atoms in the crystal, the electron density can be reconstructed. Additional phase information can be extracted either from the diffraction data or from supplementing diffraction experiments to complete the reconstruction. A model is then progressively built into the electron density, refined against the data to produce an accurate molecular structure.
X-ray structure coordinates define a unique configuration of points in space. Those of skill in the art understand that a set of structure coordinates for a protein or a protein/ligand complex, or a portion thereof, define a relative set of points that, in turn, define a configuration in three dimensions. A similar or identical configuration can be defined by an entirely different set of coordinates, provided the distances and angles between coordinates remain essentially the same. In addition, a configuration of points can be defined by increasing or decreasing the distances between coordinates by a scalar factor, while keeping the angles essentially the same.
The present disclosure includes a crystalline form of and a crystal structure of hepatocyte growth factor beta chain (HGF β) and methods of using the HGF β crystal structure and structural coordinates to identify homologous proteins and to design or identify agents that can modulate the function of HGF and/or HGF β chain whether alone or as naturally found linked to HGF alpha chain. The present disclosure also includes the three-dimensional configuration of points derived from the structure coordinates of at least a portion of a HGF β molecule or molecular complex, as well as structurally equivalent configurations, as described below. Structurally equivalent configurations can include HGF β variants that have at least one conservative amino acid substitution, preferably all substitutions of a HGF β variant are conservative. The three-dimensional configuration includes points derived from structure coordinates representing the locations of a plurality of the amino acids defining the HGF β binding site, active site, activation domain, tunnel, dimerization region and combinations thereof.
In some embodiments, the three-dimensional configuration includes points derived from structure coordinates representing the locations of the backbone atoms of a plurality of amino acid residues defining the HGF β ligand binding site. Alternatively, the three-dimensional configuration includes points derived from structure coordinates representing the locations of the side chain and the backbone atoms (other than hydrogens) of a plurality of the amino acid residues defining the HGF β ligand binding site, preferably the amino acids listed in Tables 4A and 4B.
The disclosure also includes the three-dimensional configuration of points identifying other structural features of the HGF β domain. Those other structural features include the active site, activation domain, tunnel and/or HGF β dimerization region. A plurality of amino acid residues have been identified as contributing to these structural features of HGF β. In some embodiments, the amino acid residues comprise those listed in Table 4 and/or the figures.
Likewise, the disclosure also includes the three-dimensional configuration of points derived from structure coordinates of molecules or molecular complexes that are structurally homologous to HGF β, as well as structurally equivalent configurations. Structurally homologous molecules or molecular complexes are defined below. Advantageously, structurally homologous molecules can be identified using the structure coordinates of HGF β according to a method of the disclosure.
The configurations of points in space derived from structure coordinates according to the disclosure can be visualized as, for example, a holographic image, a stereodiagram, a model, or a computer-displayed image, and the disclosure thus includes such images, diagrams or models.
The crystal structure and structural coordinates can be used in methods for obtaining structural information of a related molecule, and for identifying and designing agents that modulate HGF β chain activity.
The coordinates of the crystal structure of HGF β have been deposited with the RSCB Databank, Accession No: PDB 1SI5.
1. HGF β Chain Polypeptides, Polynucleotides and Variants Thereof
The present disclosure includes a description of hepatocyte growth factor and/or portions thereof. Hepatocyte growth factor comprises a 69 kDa alpha chain and 34 kDa beta chain. HGF is secreted as a single chain precursor form (scHGF). The 69 kDa alpha chain comprise a N terminal finger domain and four kringle domains (K1-K4). A representative amino acid sequence of human HGF β chain is shown in Table 7 (SEQ ID NO:1). The sequence of Table 7 has one amino acid change from wild type shown in Table 9; the cysteine at amino acid position 604 is changed to a serine. It would be expected that a wild type HGF β would have an equivalent crystal structure. The amino acids of the alpha and beta chain are numbered based on the amino acid numbering system of scHGF. Numbering in brackets are those amino acid positions of the HGF β that correspond to chymotrypsinogen numbering system.
Native or wild type HGF, HGF α chain or HGF β polypeptides are those polypeptides that have a sequence of a polypeptide obtained from nature. Native or wild type HGF, HGF a or HGF β include naturally occurring variants, secreted and truncated forms. Some domains of the α chain and β chain are known to those of skill in the art. Several isoforms of HGF are known such as isoform 1, isoform 2, isoform 3, isoform 4, and isoform 5. Representative sequences can be found at GenBank Accession Numbers NM—000601, NM—001010931, NM—001010932, NM—001010933, NM—001010934, and NP—000592. A wild type HGF β chain comprises an amino acid sequence of SEQ ID NO:5 as shown in Table 9. A wild type HGF sequence of isoform 1 comprises an amino acid sequence of SEQ ID NO:6 and is shown in Table 10.
The present disclosure also includes a polypeptide comprising, consisting essentially of, or consisting of a portion of HGF β starting at any one of amino acid residues 513 to 534 and ending at any one of amino acid residues 696 to 707 or residues corresponding to these positions. This polypeptide includes amino acid positions that form the binding site for the Met receptor on HGF β and in some embodiments, can bind to the Met receptor. In some embodiments, the polypeptide portion may be fused to a heterologous polypeptide or other compound and, preferably, the fusion protein can bind to the Met receptor.
The present disclosure also includes a polypeptide comprising, consisting essentially of, or consisting of a portion of the HGF β starting at amino acid residue 495 and ending at any one of amino acid residues 696 to 704 or residues corresponding to these positions. This polypeptide includes amino acid residues that form the activation domain and in some embodiments, can bind and/or activate the Met receptor. The activation domain is formed upon cleavage of single chain HGF and a change in conformation of HGF β to provide for binding and/or activation of the Met receptor. In some embodiments, this polypeptide can be fused to a heterologous polypeptide or other compound and the fusion protein preferably can bind and/or activate the Met receptor.
In some embodiments, a polypeptide comprises, consists essentially of, or consists of a portion of the HGF β starting at amino acid residues 532 to 534 and ending at any one of amino acid residues 697 to 707 or residues corresponding to these positions. This polypeptide includes amino acid positions that form an active site and in some embodiments, can bind the Met receptor. The active site includes amino acids that correspond to a catalytic triad typically found in proteases and the substrate binding site. The active site of HGF β includes amino acids Asn578, Gln534, Tyr673, as well as amino acids that are involved in binding the Met receptor. In some embodiments, this polypeptide can be fused to a heterologous polypeptide, or other compound, and the fusion protein can bind to Met receptor.
In some embodiments, a polypeptide comprises, consists essentially of, or consists of a portion of the HGF β starting at any one of amino acid residues 634 to 660 and ending at any one of amino acid residues 696 to 706 or residues corresponding to these positions. This polypeptide includes amino acid residues that form a tunnel in the crystal structure in HGF β. The polypeptide includes some of the amino acids that bind or contact the Met receptor, and in some embodiments, can bind to the Met receptor. The polypeptide portion may be fused to a heterologous polypeptide or compound, and preferably, retains binding to the Met receptor.
In some embodiments, a polypeptide position comprises, consists essentially of, or consists of a portion of HGF β starting at amino acid residue 496 and ending at any one of amino residues 670 to 700 or residues corresponding to those positions. This polypeptide includes amino acids that contact another HGF β molecule to form a dimer, and preferably, can dimerize with another HGF β chain. The polypeptide position may be fused to a heterologous polypeptide or compound, and preferably can dimerize with another HGF β chain.
The present disclosure also includes variants of the HGF β. Variants include those polypeptides that have amino acid substitutions, deletions, and additions. Amino acid substitutions can be made, for example, to replace cysteines and eliminate formation of disulfide bonds. Other variants can be made at the binding site for Met, activation site, active site, activation domain, dimerization region, tunnel or combinations thereof. In some embodiments, variants have alterations at amino acid positions other than those amino acid positions associated with Met receptor binding. In some embodiments, a variant of HGF β has at least 90% sequence identity to a polypeptide comprising an amino acid sequence of SEQ ID NO:1 or SEQ ID NO:5 and only has conservative amino acid substitutions. Preferably, the conservative amino acid substitutions are at amino acid positions other than those associated with amino acids of the Met receptor binding site such as at least the core amino acids of the Met receptor binding site as shown in Table 4B. In other embodiments, the variants bind to and/or activate the Met receptor. In other embodiments, the variants bind to but do not activate the Met receptor. Some examples of specific embodiments of variants are listed in Table 1.
HGF β chains, structural homologs, or portions thereof, may be fused to a heterologous polypeptide or compound. The heterologous polypeptide is a polypeptide that has a different function than that of the HGF β chain. Examples of a heterologous polypeptide include polypeptides that may act as carriers, may extend half life, may act as epitope tags, or may provide ways to detect or purify the fusion protein. Heterologous polypeptides include KLH, albumin, salvage receptor binding epitopes, immunoglobulin constant regions, and peptide tags. Peptide tags useful for detection or purification include FLAG, gD protein, polyhistidine tags, hemaglutinin influenza virus, T7 tag, S tag, Strep tag, chloramiphenicol acetyl transferase, biotin, glutathione-S transferase, green fluorescent protein and maltose binding protein. Compounds that can be combined with HGF β, or portions thereof, include radioactive labels, protecting groups, and carbohydrate or lipid moieties.
HGF β chain variants can be prepared by introducing appropriate nucleotide changes into DNA encoding HGF β or by synthesis of the desired polypeptide variants using standard methods.
Amino acid substitutions, include one or more conservative amino acid substitutions. The term “conservative” amino acid substitution as used herein refers to an amino acid substitution which substitutes a functionally equivalent amino acid. Conservative amino acid changes result in silent changes in the amino acid sequence of the resulting polypeptide. For example, one or more amino acids of a similar polarity act as functional equivalents and result in a silent alteration within the amino acid sequence of the peptide. In general, substitutions within a group can be considered conservative with respect to structure and function. However, the skilled artisan will recognize that the role of a particular residue is determined by its context within the three-dimensional structure of the molecule in which it occurs. For example, Cys residues may occur in the oxidized (disulfide) form, which is less polar than the reduced (thiol) form. The long aliphatic portion of the Arg side chain can constitute a feature of its structural or functional role, and this may be best conserved by substitution of a nonpolar, rather than another basic residue. Also, it will be recognized that side chains containing aromatic groups (Trp, Tyr, and Phe) can participate in ionic-aromatic or “cation-pi” interactions. In these cases, substitution of one of these side chains with a member of the acidic or uncharged polar group may be conservative with respect to structure and function. Residues such as Pro, Gly, and Cys (disulfide form) can have direct effects on the main chain conformation, and often may not be substituted without structural distortions.
Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid replacements. Examples of conservative substitutions are shown in Table 11. The variation allowed can be determined by systematically making insertions, deletions or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the native sequence.
Polynucleotide sequences encoding the polypeptides described herein can be obtained using standard recombinant techniques. Desired polynucleotide sequences may be isolated and sequenced from appropriate source cells. Alternatively, polynucleotides can be synthesized using nucleotide synthesizer or PCR techniques. Once obtained, sequences encoding the polypeptides or variant polypeptides are inserted into a recombinant vector capable of replicating and expressing heterologous polynucleotides in a host cell. Many vectors that are available and known in the art can be used for the purpose of the present invention. Selection of an appropriate vector will depend mainly on the size of the nucleic acids to be inserted into the vector and the particular host cell to be transformed with the vector. Each vector contains various components, depending on its function (amplification or expression of heterologous polynucleotide, or both) and its compatibility with the particular host cell in which it resides. The vector components generally include, but are not limited to: an origin of replication (in particular when the vector is inserted into a prokaryotic cell), a selection marker gene, a promoter, a ribosome binding site (RBS), a signal sequence, the heterologous nucleic acid insert and a transcription termination sequence.
In general, plasmid vectors containing replicon and control sequences, which are derived from a species compatible with the host cell, are used in connection with these hosts. The vector ordinarily carries a replication site, as well as marking sequences, which are capable of providing phenotypic selection in transformed cells. For example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species. pBR322 contains genes encoding ampicillin (Amp) and tetracycline (Tet) resistance and thus provides easy means for identifying transformed cells. pBR322, its derivatives, or other microbial plasmids or bacteriophage may also contain, or be modified to contain, promoters which can be used by the microbial organism for expression of endogenous proteins.
In addition, phage vectors containing replicon and control sequences that are compatible with the host microorganism can be used as transforming vectors in connection with these hosts. For example, bacteriophage such as λGEM™-11 may be utilized in making a recombinant vector which can be used to transform susceptible host cells such as E. coli LE392.
Either constitutive or inducible promoters can be used in the present invention, in accordance with the needs of a particular situation, which can be ascertained by one skilled in the art. A large number of promoters recognized by a variety of potential host cells are well known. The selected promoter can be operably linked to cistron DNA encoding a polypeptide described herein by removing the promoter from the source DNA via restriction enzyme digestion and inserting the isolated promoter sequence into the vector of choice. Both the native promoter sequence and many heterologous promoters may be used to direct amplification and/or expression of the target genes. However, heterologous promoters are preferred, as they generally permit greater transcription and higher yields of expressed target gene as compared to the native target polypeptide promoter.
Promoters suitable for use with prokaryotic hosts include the PhoA promoter, the β-galactamase and lactose promoter systems, a tryptophan (trp) promoter system and hybrid promoters such as the tac or the trc promoter. However, other promoters that are functional in bacteria (such as other known bacterial or phage promoters) are suitable as well. Their nucleotide sequences have been published, thereby enabling a skilled worker operably to ligate them to cistrons encoding the polypeptides or variant polypeptides (Siebenlist et al. (1980) Cell 20: 269) using linkers or adaptors to supply any required restriction sites.
In some embodiments, each cistron within a recombinant vector comprises a secretion signal sequence component that directs translocation of the expressed polypeptides across a membrane. In general, the signal sequence may be a component of the vector, or it may be a part of the target polypeptide DNA that is inserted into the vector. The signal sequence selected for the purpose of this invention should be one that is recognized and processed (i.e. cleaved by a signal peptidase) by the host cell. For prokaryotic host cells that do not recognize and process the signal sequences native to the heterologous polypeptides, the signal sequence is substituted by a prokaryotic signal sequence selected, for example, from the group consisting of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II (STII) leaders, LamB, PhoE, PelB, OmpA and MBP.
Prokaryotic host cells suitable for expressing polypeptides include Archaebacteria and Eubacteria, such as Gram-negative or Gram-positive organisms. Examples of useful bacteria include Escherichia (e.g., E. coli), Bacilli (e.g., B. subtilis), Enterobacteria, Pseudomonas species (e.g., P. aeruginosa), Salmonella typhimurium, Serratia marcescans, Klebsiella, Proteus, Shigella, Rhizobia, Vitreoscilla, or Paracoccus. Preferably, gram-negative cells are used. Preferably the host cell should secrete minimal amounts of proteolytic enzymes, and additional protease inhibitors may desirably be incorporated in the cell culture.
Besides prokaryotic host cells, eukaryotic host cell systems are also well established in the art. Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plants and plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells. More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); Chinese hamster ovary cells/−DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA, 77:4216 (1980)); mouse sertoli cells (TM4, Mather, Biol. Reprod., 23:243-251 (1980)); and mouse mammary tumor (MMT 060562, ATCC CCL51).
Host cells are transformed or transfected with the above-described expression vectors and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences.
Transfection refers to the taking up of an expression vector by a host cell whether or not any coding sequences are in fact expressed. Numerous methods of transfection are known to the ordinarily skilled artisan, for example, CaPO4 precipitation and electroporation. Successful transfection is generally recognized when any indication of the operation of this vector occurs within the host cell.
Transformation means introducing DNA into the prokaryotic host so that the DNA is replicable, either as an extrachromosomal element or by chromosomal integrant. Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride is generally used for bacterial cells that contain substantial cell-wall barriers. Another method for transformation employs polyethylene glycol/DMSO. Yet another technique used is electroporation.
Prokaryotic cells used to produce the polypeptides of the invention are grown in media known in the art and suitable for culture of the selected host cells. Examples of suitable media include luria broth (LB) plus necessary nutrient supplements. In preferred embodiments, the media also contains a selection agent, chosen based on the construction of the expression vector, to selectively permit growth of prokaryotic cells containing the expression vector. For example, ampicillin is added to media for growth of cells expressing ampicillin resistant gene.
Any necessary supplements besides carbon, nitrogen, and inorganic phosphate sources may also be included at appropriate concentrations introduced alone or as a mixture with another supplement or medium such as a complex nitrogen source. Optionally the culture medium may contain one or more reducing agents selected from the group consisting of glutathione, cysteine, cystamine, thioglycollate, dithioerythritol and dithiothreitol.
The prokaryotic host cells are cultured at suitable temperatures. For E. coli growth, for example, the preferred temperature ranges from about 20° C. to about 39° C., more preferably from about 25° C. to about 37° C., even more preferably at about 30° C. The pH of the medium may be any pH ranging from about 5 to about 9, depending mainly on the host organism. For E. coli, the pH is preferably from about 6.8 to about 7.4, and more preferably about 7.0.
If an inducible promoter is used in the expression vector, protein expression is induced under conditions suitable for the activation of the promoter. For example, if a PhoA promoter is used for controlling transcription, the transformed host cells may be cultured in a phosphate-limiting medium for induction. A variety of other inducers may be used, according to the vector construct employed, as is known in the art.
Eukaryotic host cells are cultured under conditions suitable for expression of the polypeptides of the invention. The host cells used to produce the polypeptides may be cultured in a variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential Medium ((MEM), (Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's Medium ((DMEM), Sigma) are suitable for culturing the host cells. In addition, any of the media described in one or more of Ham et al., 1979, Meth. Enz. 58:44, Barnes et al., 1980, Anal. Biochem. 102: 255, U.S. Pat. No. 4,767,704, U.S. Pat. No. 4,657,866, U.S. Pat. No. 4,927,762, U.S. Pat. No. 4,560,655, or U.S. Pat. No. 5,122,469, WO 90/103430, WO 87/00195, and U.S. Re. 30,985 may be used as culture media for the host cells. Any of these media may be supplemented as necessary with hormones and/or other growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as sodium chloride, calcium, magnesium, and phosphate), buffers (such as HEPES™), nucleotides (such as adenosine and thymidine), antibiotics (such as GENTAMYCIN™), trace elements (defined as inorganic compounds usually present at final concentrations in the micromolar range), and glucose or an equivalent energy source. Other supplements may also be included at appropriate concentrations that would be known to those skilled in the art. The culture conditions, such as temperature, pH, and the like, are those previously used with the host cell selected for expression, and will be apparent to the ordinarily skilled artisan.
Polypeptides described herein expressed in a host cell may be secreted into and recovered from the periplasm of the host cells. Protein recovery typically involves disrupting the cell, generally by such means as osmotic shock, sonication or lysis. Once cells are disrupted, cell debris or whole cells may be removed by centrifugation or filtration. The proteins may be further purified, for example, by affinity resin chromatography. Alternatively, proteins can be transported into the culture media and isolated there from. Cells may be removed from the culture and the culture supernatant being filtered and concentrated for further purification of the proteins produced. The expressed polypeptides can be further isolated and identified using commonly known methods such as fractionation on immunoaffinity or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; hydrophobic affinity resins, ligand affinity using a suitable antigen immobilized on a matrix and Western blot assay.
Polypeptides that are produced may be purified to obtain preparations that are substantially homogeneous for further assays and uses. Standard protein purification methods known in the art can be employed. The following procedures are exemplary of suitable purification procedures: fractionation on immunoaffinity or ion-exchange columns, ethanol precipitation, reverse phase HPLC, chromatography on silica or on a cation-exchange resin such as DEAE, chromatofocusing, SDS-PAGE, ammonium sulfate precipitation, and gel filtration using, for example, Sephadex G-75.
2. Crystals and Crystal Structure of HGF β Chain
The present disclosure provides crystals of HGF β chain as well as the crystal structure of HGF β chain as determined therefrom. In some embodiments, the crystals can be diffracted to a resolution of 5 Å or better. In some embodiments, the crystal is that of activated HGF β. Activated HGF β refers to the form of HGF β that occurs upon cleavage of scHGF and has a conformational change forming an activation domain and binding site for Met. In some embodiments, HGF β comprises an amino acid sequence of SEQ ID NO:1 or conservative substitutions thereof or portions thereof. In some embodiments, HGF β comprising an amino acid sequence of SEQ ID NO:1 only has conservative amino acid substitutions, preferably at amino acid positions other than those of the binding site for Met.
The crystals are useful to provide the crystal structure and/or to provide a stable form of the molecule for storage. In a specific embodiment, the structure of human HGF β chain comprising SEQ ID NO:1 was solved by molecular replacement using AMoRe (Navaza, 1994) in space group P3121, using parts of the protease domain of coagulation factor VIIa (Dennis et al., 2000) as the search probe. Refinement was performed using X-PLOR98 (MSI, San Diego) and REFMAC (Murshudov et al., 1997). Inspection of electron density maps and model manipulation were performed using XtalView (McRee, 1999).
Each of the constituent amino acids of HGF β is defined by a set of structure coordinates as set forth in Table 5. The term “structure coordinates” refers to Cartesian coordinates derived from mathematical equations related to the patterns obtained on diffraction of a monochromatic beam of X-rays by the atoms (scattering centers) of a HGF β in crystal form. The diffraction data are used to calculate an electron density map of the repeating unit of the crystal. The electron density maps are then used to establish the positions of the individual atoms of the HGF β protein or protein/ligand complex.
Slight variations in structure coordinates can be generated by mathematically manipulating the HGF β or HGF β/ligand structure coordinates. For example, the structure coordinates as set forth in Table 5 could be manipulated by crystallographic permutations of the structure coordinates, fractionalization of the structure coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the structure coordinates, or any combination of the above. Alternatively, modifications in the crystal structure due to mutations, additions, substitutions, deletions, and combinations thereof, of amino acids, or other changes in any of the components that make up the crystal, could also yield variations in structure coordinates. Such slight variations in the individual coordinates will have little effect on overall shape. If such variations are within an acceptable standard error as compared to the original coordinates, the resulting three-dimensional shape is considered to be structurally equivalent. Structural equivalence is described in more detail below.
It should be noted that slight variations in individual structure coordinates of the HGF β would not be expected to significantly alter the nature of chemical entities such as ligands that could associate with an active site. In this context, the phrase “associating with” refers to a condition of proximity between a ligand, or portions thereof, and a HGF β molecule or portions thereof. The association may be non-covalent, wherein the juxtaposition is energetically favored by hydrogen bonding, van der Waals forces, and/or electrostatic interactions, or it may be covalent.
To better interpret Met receptor binding and activity data from HGF mutants, the HGF β structure at 2.53 Å resolution was solved. Data reduction and refinement statistics and final model metrics appear in Table 3. The crystal of HGF β has a space group symmetry of P3121 and comprises a unit cell having dimensions of a=b and c, wherein a and b are about 63.7 angstroms and c is about 135.1 angstroms.
HGF β crystals were formed using three intermolecular contacts for each molecule (
HGF β adopts the fold of chymotrypsin-like serine proteases, comprising two tandem distorted β-barrels. There are two poorly ordered and untraceable segments—His645-Thr651 [c170a-c175] and the C-terminal region beginning with Tyr723 [c245]. The ‘active-site’ region of HGF β clearly differs from those of true enzymes (
HGF β residues involved in interactions with Met are shown in
The structural binding site identified herein is in excellent agreement with the structural Met binding site revealed in the co-crystal structure of an extracellular fragment including the soluble Met Sema domain bound to HGF β as disclosed in application U.S. Ser. No. 60/568,865, filed May 6, 2004, which application is hereby incorporated by reference. For instance, the co-crystal structure shows that residues on the [c220]-loop, such as R695 [c217] contact the Met receptor.
The HGF β chain forms a symmetric dimer in the crystal structure. The amino acid residues that form the dimerization region were identified by making a determination of those residues that lose solvent accessibility when two molecules of HGF β from the crystal structure were analyzed. In some embodiments, the dimerization amino acid residues include at least one amino acid from about 495 to 502, from about 619 to 624, 626, 628, 630, from about 662 to 665, or 700 or mixtures thereof. The HGF β-chain may have functions in receptor activation beyond those involved in direct interactions with Met that would favor a 2:2 complex of HGF:Met. It was found that proHGF β, the single chain ‘unactivated’ form of the HGF β-chain, bound more tightly to Met than several mutants in the ‘activated’ form of HGF β, i.e. Y673A, V692A, and R695A (
The β-chain of HGF comprises a new interaction site with Met, which is similar to the ‘active-site’ region of serine proteases. HGF is bivalent, having a high affinity Met binding site in the NK1 region of the α-chain and a low affinity binding site in the β chain. Other interactions may occur between two HGF β-chains, two HGF α-chains (Donate et al., 1994), and as found with MSP/Ron between two Met Sema domains. Heparin also plays a role in HGF/Met receptor binding. The identification of a distinct Met binding site on the HGF β-chain can be used to design new classes of HGF or Met modulators, such as antagonists, agonists, and like agents, having therapeutic applications, such as, for treating cancer.
3. Structurally Equivalent Crystal Structures
Various computational analyses can be used to determine whether a molecule or portions of the molecule define structural features that are “structurally equivalent” to all or part of HGF β or its structural features. Such analyses may be carried out in current software applications, such as the Molecular Similarity application of QUANTA (Molecular Simulations Inc., San Diego, Calif.), Version 4.1, and as described in the accompanying User's Guide.
The Molecular Similarity application permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure. A procedure used in Molecular Similarity to compare structures comprises: 1) loading the structures to be compared; 2) defining the atom equivalences in these structures; 3) performing a fitting operation; and 4) analyzing the results.
One structure is identified as the target (i.e., the fixed structure); all remaining structures are working structures (i.e., moving structures). Since atom equivalency within QUANTA is defined by user input, for the purpose of this disclosure equivalent atoms are defined as protein backbone atoms (N, Cα, C, and O) for all conserved residues between the two structures being compared. A conserved residue is defined as a residue that is structurally or functionally equivalent. Only rigid fitting operations are considered.
When a rigid fitting method is used, the working structure is translated and rotated to obtain an optimum fit with the target structure. The fitting operation uses an algorithm that computes the optimum translation and rotation to be applied to the moving structure, such that the root mean square difference of the fit over the specified pairs of equivalent atom is an absolute minimum. This number, given in Angstroms, is reported by QUANTA.
Structurally equivalent crystal structures have portions of the two molecules that are substantially identical, within an acceptable margin of error. The margin of error can be calculated by methods known to those of skill in the art. In some embodiments, any molecule or molecular complex or any portion thereof, that has a root mean square deviation of conserved residue backbone atoms (N, Cα, C, O) of less than about 0.70 Å, preferably 0.5 Å. For example, structurally equivalent molecules or molecular complexes are those that are defined by the entire set of structure coordinates listed in Table 5 and/or 6±a root mean square deviation from the conserved backbone atoms of those amino acids of not more than 0.70 Å, preferably 0.5 Å. The term “root mean square deviation” means the square root of the arithmetic mean of the squares of the deviations. It is a way to express the deviation or variation from a trend or object. For purposes of this disclosure, the “root mean square deviation” defines the variation in the backbone of a protein from the backbone of HGF β (as defined by the structure coordinates of HGF β described herein) or a defining structural feature thereof.
4. Structurally Homologous Molecules, Molecular Complexes, and Crystal Structures
Structure coordinates can be used to aid in obtaining structural information about another crystallized molecule or molecular complex. The method of the disclosure allows determination of at least a portion of the three-dimensional structure of molecules or molecular complexes that contain one or more structural features that are similar to structural features of HGF β. In some embodiments, a portion of the three-dimensional structure includes the structural features of a HGF β chain, for example, binding site for Met, activation domain, active site, tunnel and/or dimerization region. These molecules are referred to herein as “structurally homologous” to HGF β. Similar structural features can also include, for example, regions of amino acid identity, conserved active site or binding site motifs, and similarly arranged secondary structural elements (e.g., α helices and β sheets). Preferably, the structural homolog has at least one biological function of HGF β.
Optionally, structural homology is determined by aligning the residues of the two amino acid sequences to optimize the number of identical amino acids along the lengths of their sequences; gaps in either or both sequences are permitted in making the alignment in order to optimize the number of identical amino acids, although the amino acids in each sequence must nonetheless remain in their proper order. Two amino acid sequences can be compared using the BLASTP program, version 2.0.9, of the BLAST 2 search algorithm, as described by Tatusova et al. (56), and available at URL www.ncbi.nlm.nih.gov/BLAST/. Preferably, the default values for all BLAST 2 search parameters are used, including matrix=BLOSUM62; open gap penalty=11, extension gap penalty=1, gap x_dropoff=50, expect=10, wordsize=3, and filter on. In the comparison of two amino acid sequences using the BLAST search algorithm, structural similarity is referred to as “identity.”
In some embodiments, a structurally homologous molecule is a protein that has an amino acid sequence sharing at least 80% identity with a native or recombinant amino acid sequence of HGF β. In some embodiments, HGF β has a sequence of SEQ ID NO:1 or SEQ ID NO:5, and the structurally homologous molecule is a variant that has a % sequence identity to SEQ ID NO:1 or SEQ ID NO:5 of at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater. In some embodiments, HGF β variant or structurally homologous molecule has one or more conservative amino acid substitutions, preferably only conservative amino acid substitutions. In some embodiments, a HGF β variant does not have substitutions in the binding site for Met, including at least the core amino acid residues as shown in Table 4B. In some embodiments, the HGF β variant has about 1-25 conservative amino amino acid substitutions, more preferably about 1-20 conservative amino acids substitutions, more preferably about 1-10 conservative amino acid substitutions, more preferably about 1-5 conservative amino acid substitutions, and more preferably about 1-2 conservative amino acid substitutions. Preferably, the variant retains the globular core structure and/or at least one or more domains such as the binding site for Met, activation domain, active site, tunnel and/or dimerization region.
For example, a structurally homologous protein is the wild type HGF β (SEQ ID NO: 5), which differs from HGF β of SEQ ID NO:1 due to substitution of a cysteine at position 604 with a serine (99.55% identity to SEQ ID NO:1). The substitution of a serine at this position is unlikely to substantially affect the crystal structure because serine is similar to cysteine in size and functionality. More preferably, a protein that is structurally homologous to HGF β includes at least one contiguous stretch of at least 50 amino acids that shares at least 80% amino acid sequence identity with the corresponding portion of the native or recombinant HGF β and preferably, has only conservative amino acid substitutions that maintain the size and functionality of the substituted amino acid. Methods for generating structural information about the structurally homologous molecule or molecular complex are well known and include, for example, molecular replacement techniques.
Therefore, in another embodiment this disclosure provides a method of utilizing molecular replacement to obtain structural information about a molecule or molecular complex whose structure is unknown or incompletely known, comprising:
(a) generating an X-ray diffraction pattern from a crystallized molecule or molecular complex of unknown structure or incompletely known, for example in embodiments a structural homolog of HGF β; and/or
(b) applying at least a portion of the structural coordinates of HGF β or HGFβ/ligand complex to the X-ray diffraction pattern of the unknown or incompletely known structure to generate a three-dimensional electron density map of the molecule or molecular complex whose structure is unknown or incompletely known.
By using molecular replacement, all or part of the structure coordinates of HGF β or the HGF β/ligand complex as provided by this disclosure can be used to determine the unsolved structure of a crystallized molecule or molecular complex more quickly and efficiently than attempting to determine such information ab initio.
Molecular replacement can provide an accurate estimation of the phases for an unknown structure. Phases are one factor in equations that are used to solve crystal structures, and this factor cannot be determined directly. Obtaining accurate values for the phases, by methods other than molecular replacement, can be a time-consuming process that involves iterative cycles of approximations and refinements and greatly hinders the solution of crystal structures. However, when the crystal structure of a protein containing at least a structurally homologous portion has been solved, molecular replacement using the known structure can provide a useful estimate of the phases for the unknown or incompletely known structure.
Thus, this method involves generating a preliminary model of a molecule or molecular complex whose structure coordinates are unknown or incompletely known, by orienting and positioning the relevant portion of HGF β or a HGF β/ligand complex within the unit cell of the crystal of the unknown or incompletely known molecule or molecular complex. This orientation or positioning is conducted so as best to account for the observed X-ray diffraction pattern of the crystal of the molecule or molecular complex whose structure is unknown. Phases can then be calculated from this model and combined with the observed X-ray diffraction pattern amplitudes to generate an electron density map of the structure. This map, in turn, can be subjected to established and well-known model building and structure refinement techniques to provide a final, accurate structure of the unknown crystallized molecule or molecular complex (see for example, Lattman, 1985. Methods in Enzymology 115:55-77).
Structural information about a portion of any crystallized molecule or molecular complex that is sufficiently structurally homologous to a portion of HGF β can be resolved by this method. In addition to a molecule that shares one or more structural features with HGF β as described above, a molecule that has similar bioactivity, such as the same catalytic activity, substrate specificity or ligand binding activity as HGF β, may also be sufficiently structurally homologous to HGF β to permit use of the structure coordinates of HGF β to solve its crystal structure or identify structural features that are similar to those identified in HGF β chain described herein. It will be appreciated that amino acid residues in the structurally homologous molecule identified as corresponding to HGF β chain structural feature may have different amino acid numbering.
In one embodiment of the disclosure, the method of molecular replacement is utilized to obtain structural information about a molecule or molecular complex, wherein the molecule or molecular complex includes at least one HGF β fragment or homolog. HGF β is an inhibitor of full length HGF and can be used to identify or design other like inhibitors. In the context of the present disclosure, a “structural homolog” of HGF β includes a protein that comprises one or more amino acid substitutions, deletions, additions, or rearrangements with respect to the amino acid sequence of HGF β, but that, when folded into its native conformation, exhibits or is reasonably expected to exhibit at least a portion of the tertiary (three-dimensional) structure of HGF β as described above. As mentioned above, homolog tertiary structure can be probed, measured, or confirmed by known analytic and/or diagnostic methods, for example, X-ray, NMR, circular dichroism, panel of monoclonal Abs that recognize native HGF beta. For example, structurally homologous molecules can contain deletions or additions of one or more contiguous or noncontiguous amino acids, such as a loop or a domain. Structurally homologous molecules also include “modified” HGF β molecules that have been chemically or enzymatically derivatized at one or more constituent amino acid, including side chain modifications, backbone modifications, and N- and C-terminal modifications including acetylation, hydroxylation, methylation, amidation, and the attachment of carbohydrate or lipid moieties, cofactors, and like modifications.
A heavy atom derivative of HGF β is also included as a HGF β homolog. The term “heavy atom derivative” refers to derivatives of HGF β produced by chemically modifying a crystal of HGF β. In practice, a crystal is soaked in a solution containing heavy metal atom salts, or organometallic compounds, e.g., lead chloride, gold thiomalate, thiomersal or uranyl acetate, which can diffuse through the crystal and bind to the surface of the protein. The location(s) of the bound heavy metal atom(s) can be determined by X-ray diffraction analysis of the soaked crystal. This information, in turn, is used to generate the phase information used to construct three-dimensional structure of the protein (Blundell, et al., 1976, Protein Crystallography, Academic Press, San Diego, Calif.).
The structure coordinates of HGF β provided by this disclosure are particularly useful in solving the structure of HGF β variants. Variants may be prepared, for example, by expression of HGF β cDNA previously altered in its coding sequence by oligonucleotide-directed mutagenesis as described herein. Variants may also be generated by site-specific incorporation of unnatural amino acids into HGF β proteins using known biosynthetic methods (e.g. Noren, et al., 1989, Science 244:182-88). In this method, the codon encoding the amino acid of interest in wild-type HGF β is replaced by a “blank” nonsense codon, TAG, using oligonucleotide-directed mutagenesis. A suppressor tRNA directed against this codon is chemically aminoacylated in vitro with the desired unnatural amino acid. The aminoacylated tRNA is then added to an in vitro translation system to yield a mutant HGF β with the site-specific incorporated unnatural amino acid.
The structure coordinates of HGF β are also particularly useful to solve or model the structure of crystals of HGF β, HGF β variants, or HGF β homologs co-complexed with a variety of ligands. This approach enables the determination of the optimal sites for interaction between ligand entities, including candidate HGF β ligands and HGF β. Potential sites for modification within the various binding sites of the molecule can also be identified. HGF β variants that may bind to the Met receptor but not activate it may also be identified. This information provides an additional tool for determining more efficient binding interactions, for example, increased hydrophobic interactions, between HGF β and a ligand. For example, high-resolution X-ray diffraction data collected from crystals exposed to different types of solvent allows the determination of where each type of solvent molecule resides. Small molecules that bind tightly to those sites can then be designed and synthesized and tested for their HGF β affinity using standard assays.
In another embodiment, homology modeling can be conducted using the structural coordinates of HGF β and a program designed to generate models of structures, such as Protein Explorer, Swiss Model, or RASMOL. The programs can provide a structural model of a homolog or variant of HGF β by providing the structural coordinates such as provided in Table 5 and an alignment of the sequences.
All of the complexes referred to above may be studied using well-known X-ray diffraction techniques and may be refined versus X-ray data extending to between 1.5 and 3.5 Å to an R-factor of about 0.30 or less using computer software, such as X-PLOR (Yale University, distributed by Molecular Simulations, Inc.) (see for example, Blundell, et al. 1976. Protein Crystallography, Academic Press, San Diego, Calif., and Methods in Enzymology, Vol. 114 & 115, H. W. Wyckoff et al., eds., Academic Press (1985)). This information may thus be used to optimize a candidate HGF β modulator or to design new HGF and/or HGF β modulators.
The disclosure also includes the unique three-dimensional configuration defined by a set of points defined by the structure coordinates for a molecule or molecular complex structurally homologous to HGF β as determined using the method of the present disclosure, structurally equivalent configurations, and storage media, such as magnetic media, including such set of structure coordinates.
5. Homology Modeling
Using homology modeling, a computer model of a HGF β homolog can be built or refined without crystallizing the homolog. First, a preliminary model of the HGF β homolog is created by sequence alignment with HGF β, secondary structure prediction, the screening of structural libraries, or any combination of those techniques. Computational software may be used to carry out the sequence alignments and the secondary structure predictions. Programs available for such an analysis include Protein Explorer (eg available at molvissdsc.edu.protexpl.frontdoor.htm), Swiss Model (eg available at swissmodel.expasy.org) and RASMOL. Structural incoherences, e.g., structural fragments around insertions and deletions, can be modeled by screening a structural library for peptides of the desired length and with a suitable conformation. For prediction of the side chain conformation, a side chain rotamer library may be employed. If the HGF β homolog has been crystallized, the final homology model can be used to solve the crystal structure of the homolog by molecular replacement, as described above. Next, the preliminary model is subjected to energy minimization to yield an energy-minimized model. The energy-minimized model may contain regions where stereochemistry restraints are violated, in which case such regions are remodeled to obtain a final homology model. The homology model is positioned according to the results of molecular replacement, and subjected to further refinement including molecular dynamics calculations.
6. Methods for Identification of Modulators of HGF β
Potent and selective ligands that modulate activity (antagonists and agonists) can be identified using the three-dimensional model of HGF β using structural coordinates of a crystal of HGF β, such as all or a portion of the coordinates of Table 5. Using this model, candidate ligands that interact with HGF β are assessed for the desired characteristics (e.g., interaction with HGF β) by fitting against the model, and the result of the interactions is predicted. Alternatively, molecules that can mimic the binding of HGF β for Met and that are altered in the activation of HGF/Met signaling pathway can also be modeled and identified. Agents predicted to be molecules capable of modulating the activity of HGF β can then be further screened or confirmed against known bioassays. For example, the ability of an agent to inhibit the morphogenic or mitogenic effects of HGF can be measured using assays known in the art. Using the modeling information and the assays described, one can identify agents that possess HGF and/or HGF β-modulating properties. Modulators of HGF β of the present disclosure can include compounds or agents having, for example, allosteric regulatory activity.
Ligands which can interact with HGF β can also be identified using commercially available modeling software, such as docking programs, in which all or a portion of the solved crystal structure coordinates of a crystal of HGF β such as those of Table 5 can be computationally represented and screened against a large virtual library of small molecules or virtual fragment molecules for interaction with a site of interest, such as the binding site for Met, activation domain, active site, tunnel and/or dimerization region. Preferred small molecules or fragments identified in this way can be synthesized and contacted with the HGF β. The resulting molecular complex or association can be further analyzed by, for example, NMR or X-ray co-crystallography, to optimize the HGF β-ligand interaction and/or desired biological activity. Fragment-based drug discovery methods are known and computational tools for their use are commercially available, for example “SAR by NMR” (Shukers, S. B., et al., Science, 1996, 274, 1531-1534), “Fragments of Active Structures” (www.stromix.com; Nienaber, V. L., et al., Nat. Biotechnol., 2000, 18, 1105-1108), and “Dynamic Combinatorial X-ray Crystallography” (e.g., permitting self-selection by the protein molecule of self-assembling fragments; Congreve, M. S., et al., Angew. Chem., Int. Ed., 2003, 42, 4479-4482). Still other molecular modeling, docking, and like methods are discussed below and in the Examples.
The present disclosure also includes identification of allosteric modulators of HGF β. “Allosteric regulation” and like terms refers to regulation of a functional site of HGF β by way of large scale conformational changes in the shape of HGF β which can be caused by, for example, the binding of a regulatory molecule elsewhere (i.e., other than at the functional site) in the HGF β molecule. An “allosteric regulator” or signal molecule is any molecule capable of effecting such allosteric regulation or signaling in the HGF β molecule. An allosteric regulator can be either positive (an activator) or negative (an inhibitor) of HGF β activity. Allosteric regulation of HGF β activity can involve cooperativity, which requires cooperative interaction of its multiple protein subunits, or allosteric regulation of HGF β activity can occur without cooperativity in any of the protein subunits.
The methods of the disclosure also include methods of identifying molecules that mimic HGF β binding to a ligand (such as the Met receptor), but do not activate the HGF/Met signaling pathway. HGF β is an inhibitor of full length HGF and can be used to identify or design other like inhibitors. These molecules can be identified using the three-dimensional model of HGF β using the coordinates of Table 5.
In another embodiment, a candidate modulator can be identified using a biological assay such as binding to HGF β, modulating Met phosphorylation or modulating HGF induced cell migration. The candidate modulator can then serve as a model to design similar agents and/or to modify the candidate modulator for example, to improve characteristics such as binding to HGF β. Design or modification of candidate modulators can be accomplished using the crystal structure coordinates and available software.
The disclosure provides information about the structure and shape of the binding site for Met, active site, activation domain, tunnel and dimerization region of HGF β. These structural features can be used in the methods for identification of modulators of HGF and/or HGF β.
The term “structural binding site,” as used herein, refers to a region of a molecule or molecular complex that, as a result of its structure can favorably associate with a ligand. Binding site structure factors can include, for example, the presence and disposition of amino acids residues in the binding region, and the two- or three-dimensional shape or topology of the HGF β molecule in or near the binding region, such as secondary structure (i.e., helices, sheets, or combinations thereof) or tertiary structure (i.e., the three dimensional disposition of molecular chains and features). Thus, a binding site may include or consist of features such as cavities, surfaces, or interfaces between domains. Ligands that may associate with a binding site include, but are not limited to, cofactors, substrates, agonists, and antagonists.
Binding sites are of significant utility in fields such as drug discovery. The association of natural ligands or substrates with the binding sites of their corresponding receptors or enzymes is the basis of many biological mechanisms of action. Similarly, many drugs exert their biological effects through association with the binding sites of receptors and enzymes. Such associations may occur with all or any part of the binding site. An understanding of such associations helps lead to the design of drugs having more favorable associations with their target, and thus improved biological effects. Therefore, this information is valuable in designing potential modulators of HGF and/or HGF β, as discussed in more detail below.
The amino acid constituents of a HGF β binding site for Met as defined herein are positioned in three dimensions. In one aspect, the structure coordinates defining a binding site of HGF β include structure coordinates of all atoms in the constituent amino acids; in another aspect, the structure coordinates of a binding site include structure coordinates of just the backbone atoms of the constituent amino acids.
In some embodiments, the amino acid residues of the structural HGF β binding site for Met comprise, consist essentially of, or consist of at least one or all of amino acid residues at positions 513, 516, 533, 534, 537-539, 578, 619, 647, 656, 668-670, 673, 692-697, 699, 702, 705, or 707, or mixtures thereof or residues corresponding to these positions.
In another embodiment, HGF β binding site for Met comprises, consists essentially of, or consist of at least one or more or all of amino acid residues Tyr513, Lys516, Arg533, Gln534, Pro537, Ser538, Arg539, Asp578, Tyr619, Arg 647, Glu656, Pro668, Cys669, Glu670, Tyr673, Val692, Pro693, Gly694, Arg695, Gly696, Cys697, Ile699, Arg702, Ile705, Val707, or mixtures thereof, or conservative substitutions thereof. In other embodiments, the amino acid residues of the binding site comprise, consist essentially of, or consist of at least one or more or all of amino acids at a position 514, 534, 537, 578, 619, 621, 673, 692, 694 to 696, 699, or 701, or mixtures thereof. In another embodiment, the HGF β binding site for Met comprises, consists essentially of, or consists of at least one or more or all amino acid residues comprising Arg519, Gln534, Pro537, Asp578, Tyr619, Gly621, Tyr673, Val692, Gly694, Arg695, Gly696, Ile699, Asn 701, or mixtures thereof, or conservative substitutions thereof.
In another embodiment, the HGF β binding site for Met comprises, consists essentially of, or consists of at least one or all of core amino acid residues in positions 534, 578, 673, 692-694, 695, 696, or mixtures thereof. In a further embodiment, the HGF β binding site for Met comprise, consist essentially of, or consist of at least one or more or all of core amino acid residues comprising Gln534, Asp578, Tyr673, Val692, Pro693, Gly694, Arg 695, Gly696, or mixtures thereof, or conservative substitutions thereof. In yet another embodiment, the binding site for Met on HGF β comprises, consists essentially of, or consists of at least one or more or all core amino acid or all amino acid residues in positions 673, 692-694, 695, 696, or mixtures thereof. In a further embodiment, the binding site for Met on HGF β comprises, consists essentially of, or consists of at least one or all amino acid residues Tyr673, Val692, Pro693, Gly694, Arg695, Gly696, or mixtures thereof, or conservative substitutions thereof. The numbering of the corresponding amino acid positions that form HGF β structural binding site in a structurally homologous molecule may change depending on the alignment of the structural homologous molecules with HGF β chain.
Alternatively, the structural binding site of HGF β may be defined by those amino acids whose backbone atoms are situated within about 5 Å of one or more constituent atoms of a bound substrate or ligand. In yet another alternative, the binding site for Met on HGF β can be defined by those amino acids whose backbone atoms are situated within a sphere centered on the coordinates representing the alpha carbon atom of a central amino acid residue Gly694, the sphere having a radius of about 5-6 Å, for example about 5.8 Å.
Accordingly, the disclosure provides molecules or molecular complexes including a HGF β structural binding site, as defined by the sets of structure coordinates of Table 5 and/or Table 6. In some embodiments, a structurally equivalent ligand binding site is defined by a root mean square deviation from the structure coordinates of Table 5 of the backbone atoms of the amino acids that make up the binding site in HGF β of at most about 0.70 Å, preferably about 0.5 Å.
Another structural feature of the HGF β chain is an activation domain. The activation domain in the β-chain can be identified by analogy to amino acid residues in serine proteases that undergo conformational change upon cleavage of chymotrypsinogen-like serine protease single chain pro-enzymes to two-chain enzymes. The activation domain includes parts of the Met binding site and other conformation changes in the HGF β chain. In some embodiments, the activation domain comprises, consists essentially of, or consists of one or more or all of amino acid residues in positions from about 495 to 498, 502 to 505, 618 to 627, 637 to 655, 660 to 672, 692 to 704, from 553 to 562, or mixtures thereof or residues corresponding to these positions. In some embodiments, the activation domain of HGF β comprises, consists essentially of, or consists of at least one or more or all amino acid residues Val495, Val496, Asn497, Gly498, Arg502, Thr503, Asn504, Ile505, Val553, His 554, Gly555, Arg556, Gly557, Asp558, Glu559, Lys560, Cys561, Lys562, Gly618, Tyr619, Thr620, Gly621, Leu622, Ile623, Asn624, Tyr625, Asp626, Gly627, Met637, Gln638, Asn639, Glu640, Lys641, Cys642, Ser643, Gln644, His645, His646, Arg647, Gly648, Lys649, Val650, Thr651, Leu652, Asn653, Glu654, Ser655, Gly660, Ala661, Glu662, Lys663, Ile664, Gly665, Ser666, Gly667, Pro668, Cys669, Glu670, Gly671, Asp672, Val692, Pro693, Gly694, Arg695, Gly696, Cys697, Ala698, Ile699, Pro700, Asn701, Arg702, Pro703, Gly704, or mixtures thereof or conservative amino acid substitutions thereof.
In other embodiments, the amino acid residues of the activation domain comprise, consist essentially of, or consist of one or more or all amino acid residues from about position 495 to 498, 615 to 625, 660 to 670, 692 to 697, or 550 to 560 or mixtures thereof. In some embodiments, the activation domain of HGF β comprises, consists essentially of, or consists of at least one or all amino acid residues Val495, Val496, Asn497, Gly498, Tyr615, Gly616, Trp617, Gly618, Tyr619, Thr620, Gly621, Leu622, Ile623, Asn624, Tyr625, Ile550, His551, Asp552, Val553, His554, Gly555, Arg556, Gly557, Asp558, Glu559, Lys560, Gly660, Ala661, Glu662, Lys663, Ile664, Gly665, Ser666, Gly667, Pro668, Cys669, Glu670, Val692, Pro693, Gly694, Arg695, Gly696, Cys697, or mixtures thereof or conservative amino acids substitutions thereof. In other embodiments, the activation domain of HGF β comprises, consists essentially of or consists of at least one or all core amino acid residue in positions 495-498, 618-627, 660-672, 692-704, or mixtures thereof. In some embodiments, the activation domain of HGF β comprises, consists essentially of, or consists of one or more or even all core amino acid residues Val495, Val496, Asn497, Gly498, Gly618, Tyr619, Thr620, Gly621, Leu622, Ile623, Asn624, Tyr625, Asp626, Gly627, Gly660, Ala661, Glu662, Lys663, Ile664, Gly665, Ser666, Pro668, Cys669, Glu670, Gly671, Asp672, Val692, Pro693, Gly694, Arg695, Gly696, Cys697, Ala698, Ile699, Pro700, Asn701, Arg702, Pro703, Gly704, or mixtures thereof or conservative amino acid substitutions thereof.
Another structural feature identified in the HGF β chain crystal structure is an active site. The “active site” of HGF β refers to features analogous to the substrate binding cleft and catalytic amino acid triad capable of substrate cleavage in true serine protease enzymes. In some embodiments, amino acid residues associated with the active-site region of HGF β are summarized in Table 4 and comprise, consist essentially of, or consist of one or more or all amino acid residues corresponding to the catalytic triad, Asp 578, Tyr 673 and Gln534. The active site also includes amino acids that form the Met binding site including one or more or all amino acid residues from about 667 to 673, from about 532-536, from about 690 to 697, from about 637 to 655, or from about 574 to 579, or mixtures thereof. In some embodiments, the active site of HGF β comprises, consists essentially of, or consists of one or more or all amino acid residues Ala532, Arg533, Gln534, Cys535, Phe536, Pro574, Glu575, Gly576, Ser577, Asp578, Leu579, Met 637, Gly 638, Asn 639, Glu640, Lys641, Cys 642, Ser643, Gln644, His645, His646, Arg647, Gly648, Lys659, Val650, Thr651, Leu652, Asn 653, Glu654, Ser655, Gly667, Pro668, Cys669, Glu670, Gly671, Asp672, Tyr673, Val690, Ile691 Val692, Pro693, Gly694, Arg695, Gly696, Cys697, or mixtures thereof or conservative substitutions thereof.
In other embodiments, amino acid residues in the active site comprise, consist essentially of, or consist of some or all core amino acid residues 534, 578, 673, 693, 695, 696, 697, or 699, or mixtures thereof. In some embodiments, the active site of HGF β comprises, consists essentially of, or consists of one or more or all amino acid residues Gln534, Asp578, Tyr673, Pro693, Arg695, Gly696, Cys697, Ile699, or mixtures thereof or conservative substitutions thereof.
Another structural feature identified in the HGF β crystal is a tunnel. “Tunnel” refers to a pore-like void or aperture present in the HGF β crystal structure. The amino acid positions forming the tunnel can be identified by determining the solvent accessibility of the amino acid positions in the HGF β crystal structure using standard methods. The “tunnel” feature, has an entrance near amino acid residues Tyr673 and Arg695, and comprises, consists essentially of, or consist of some or all amino acid residues 660 to 670, amino acid residues 693 to 706, amino acid residue 691, or amino acid residue 634, or mixtures thereof or residues corresponding to these positions. In some embodiments, the tunnel is formed by one or more or all amino acid residues comprising Tyr673, Arg695, Leu634, Ile691, Gly660, Ala661, Glu662, Lys663, Ile664, Gly665, Ser666, Gly667, Pro668, Cys669, Glu670, Pro693, Gly695, Gly696, Cys697, Ala698, Ile699, Pro700, Asn701, Arg703, Pro703, Gly704, or mixtures thereof or conservative substitutions thereof.
In other embodiments, the tunnel is formed by at least one or more or all core amino residues in positions comprising 669, 670, 673, 693-697, 662, 663, 701, or mixtures thereof. In some embodiments, the tunnel is formed by at least one or more or all core amino acid residues Cys669, Glu670, Tyr673, Pro693, Gly694, Arg695, Gly696, Cys697, Glu662, Lys663, Asn701, or mixtures thereof or conservative substitutions thereof. The tunnel, especially the tunnel entrance, is a likely interaction site for allosteric regulators of HGF β and/or HGF.
Another structural feature identified in the crystal structure of HGF β includes a dimerization region. In the crystal of HGF β a symmetric dimer is formed. The dimerization region includes amino acid residues that contact another HGF β-chain and are identified as those positions that lose solvent accessibility when two HGF β molecules are analyzed as a dimer. The dimerization region amino acid residues comprise, consist essentially of, or consists of some or all amino acid residues from about 495 to 502, 617-630, 660 to 670, or 700, or mixtures thereof. In some embodiments, the dimerization region of HGF β comprises, consists essentially of, or consists of one or more or all amino acid residues Val495, Val496, Asn497, Gly498, Ile499, Pro500, Thr501, Arg502, Trp617, Gly618, Tyr619, Thr620, Gly621, Leu622, Ile623, Asn624, Tyr625, Asp626, Gly627, Leu628, Leu629, Arg630, Gly660, Ala661, Glu662, Lys663, Ile664, Gly665, Ser666, Gly667, Pro668, Cys669, Glu670, Pro700, or mixtures thereof or conservative substitutions thereof.
In other embodiments, the amino acid positions of the dimerization domain comprise, consist essentially of, or consist of some or all amino acid residues from about 495 to 502, 620 to 624, 626, 628, 630, 662 to 665, or 700, or mixtures thereof. In some embodiments, the dimerization region of HGF β comprises, consists essentially of, or consists of one or more or all amino acid residues Val495, Val496, Asn497, Gly498, Ile499, Pro500, Thr501, Arg502, Trp620, Gly621, Leu622, Ile623, Asn624, Asp626, Gly627, Leu628, Arg630, Gly662, Lys663, Ile664, Gly665, Pro700, or mixtures thereof or conservative substitutions thereof.
In some embodiments, the dimerization of HGF β comprises, consists essentially of, or consists of one or more or all core amino acid residues in positions 497, 499, 500, 502, 621-623, 662, 664, or mixtures thereof. In additional embodiments, the dimerization region of HGF β comprises, consists essentially of, or consists of one or more or all core amino acid residues Asn497, Ile499, Pro500, Arg502, Gly621, Leu622, Ile623, Gly662, Ile664 or mixtures thereof or conservative substitutions thereof.
Accordingly, the disclosure provides molecules or molecular complexes including the HGF β activation domain, active site, binding site for Met, tunnel and/or dimerization region as defined by the sets of structural coordinates of a crystal of HGF β, such as provided in Table 5 and/or Table 6. In some embodiments, structurally equivalent sites are defined by a root mean square deviation of at most about 0.70 Å, preferably about 0.50 Å, from the structural coordinates of the backbone of amino acids that makeup the activation domain, active site, binding site for Met, tunnel and/or dimerization region in HGF β. As discussed previously, it is understood that the amino acid numbering of corresponding positions of the structural features defined herein in a structurally homologous molecule may differ than that of the HGF β.
Computational techniques can be used to screen, identify, select, design ligands, and combinations thereof, capable of associating with and/or modulating activity of HGF and/or HGF β or structurally homologous molecules. Candidate modulators of HGF and/or HGF β may be identified using functional assays, such as binding to HGF β or inhibiting binding of HGF β to Met, KIRA assay, or cell migration assay as described herein. Novel modulators can then be designed based on the structure of the candidate molecules so identified. Knowledge of the structure coordinates for HGF β permits, for example, the design, the identification of synthetic compounds, and like processes, and the design, the identification of other molecules and like processes, that have a shape complementary to the conformation of the HGF β binding site, activation domain, active site, tunnel and/or dimerization region. In particular, computational techniques can be used to identify or design ligands, such as agonists and/or antagonists, that associate with and/or modulate activity of a HGF β binding site and/or other structural features, such as the active site, activation domain, dimerization region, and/or the tunnel.
Antagonists may bind to or interfere with all or a portion of an active site, activation domain, tunnel, dimerization region or binding site of HGF β, and can be competitive, non-competitive, or uncompetitive inhibitors. Once identified and screened for biological activity, these agonists, antagonists, and combinations thereof, may be used therapeutically or prophylactically, for example, to block HGF and/or HGF β activity and thus prevent the onset and/or further progression of diseases associated with dysregulation of HGF activity. Structure-activity data for analogues of ligands that bind to or interfere with HGF β binding sites, active sites, activation domain, dimerization region and/or tunnel can also be obtained computationally.
Data stored in a machine-readable storage medium that is capable of displaying a graphical three-dimensional representation of the structure of HGF β or a structurally homologous molecule, as identified herein, or portions thereof may thus be advantageously used for drug discovery. The structure coordinates of the ligand are used to generate a three-dimensional image that can be computationally fit to the three-dimensional image of HGF β or a structurally homologous molecule. The three-dimensional molecular structure encoded by the data in the data storage medium can then be computationally evaluated for its ability to associate with ligands. When the molecular structures encoded by the data is displayed in a graphical three-dimensional representation on a computer screen, the protein structure can also be visually inspected for potential association with ligands.
One embodiment of a method of drug design involves evaluating the potential association of a candidate ligand with HGF β or a structurally homologous molecule, particularly with a HGF β binding site. The method of drug design thus includes computationally evaluating the potential of a selected ligand to associate with any of the molecules or molecular complexes set forth above. This method includes the steps of: (a) employing computational means, for example, such as a programmable computer including the appropriate software known in the art or as disclosed herein, to perform a fitting operation between the selected ligand and a ligand binding site or a region nearby the ligand binding site of the molecule or molecular complex; and (b) analyzing the results of the fitting operation to identify and/or quantify the association between the ligand and the ligand binding site.
In another embodiment, the method of drug design involves computer-assisted design of ligands that associate with HGF β, its homologs, or portions thereof. Ligands can be designed in a step-wise fashion, one fragment at a time, or may be designed as a whole or de novo. Ligands can be designed based on the structure of molecules that can modulate at least one biological function of HGF β.
Other embodiments of a method of drug design involves evaluating the potential association of a candidate ligand with other structural features of HGF β or structurally homologous molecule. The method of drug design includes computationally evaluating the potential of the selected ligand to associate with HGF β and/or portion of the HGF β associated with the structural features. The structural features include activation domain, active site, tunnel, and/or dimerization region as described herein. The method comprises: (a) employing a computational means, for example, such as a programmable computer including the appropriate software to perform a fitting operation between the selected ligand and the structural feature of the HGF β; and (b) analyzing the results of the fitting operation to identify and/or quantify the association between the ligand and structural feature of HGF β chain.
Generally, to be a viable drug candidate, the ligand identified or designed according to the method is capable of structurally associating with at least part of a HGF β structural feature, and is able, sterically and energetically, to assume a conformation that allows it to associate with the HGF β structural feature, such as a binding site. Non-covalent molecular interactions important in this association include hydrogen bonding, van der Waals interactions, hydrophobic interactions, and/or electrostatic interactions. In some embodiments, agents may contact at least one, or any successive integer number up to all of the amino acid positions in the HGF β binding site or other structural feature. Conformational considerations include the overall three-dimensional structure and orientation of the ligand in relation to the ligand binding site, and the spacing between various functional groups of a ligand that directly interact with the HGF β binding site or homologs thereof.
Optionally, the potential binding of a ligand to a HGF β structural feature is analyzed using computer modeling techniques prior to the actual synthesis and testing of the ligand. If these computational experiments suggest insufficient interaction and association between it and the HGF β structural feature, testing of the ligand is obviated. However, if computer modeling indicates a sufficiently and/or desirably strong interaction, the molecule may then be synthesized and tested for its ability to bind to or interfere with, for example, a HGF β binding site. Binding assays to determine if a compound actually modulates HGF and/or HGF β activity can also be performed and are well known in the art.
Several methods can be used to screen ligands or fragments for the ability to associate with a HGF β structural feature. This process may begin by visual inspection of, for example, a HGF β structural feature, such as a binding site, on the computer screen based on the HGF β structure coordinates or other coordinates which define a similar shape generated from the machine-readable storage medium. Selected ligands may then be positioned in a variety of orientations, or docked, within the binding site, or other structural feature. Docking may be accomplished using software such as QUANTA and SYBYL, followed by energy minimization and molecular dynamics with standard molecular mechanics forcefields, such as CHARMM and AMBER.
Specialized computer programs may also assist in the process of selecting ligands. Examples include GRID (Hubbard, S. 1999. Nature Struct. Biol. 6:711-4); MCSS (Miranker, et al. 1991. Proteins 11:29-34) available from Molecular Simulations, San Diego, Calif.; AUTODOCK (Goodsell, et al. 1990. Proteins 8:195-202) available from Scripps Research Institute, La Jolla, Calif.; and DOCK (Kuntz, et al. 1982. J. Mol. Biol. 161:269-88) available from University of California, San Francisco, Calif.
HGF β binding ligands can be designed to fit a HGF β structural feature, based on the binding of a known modulator. There are many ligand design methods including, without limitation, LUDI (Bohm, 1992. J. Comput. Aided Molec. Design 6:61-78) available from Molecular Simulations Inc., San Diego, Calif.; LEGEND (Nishibata, Y., and Itai, A. 1993. J. Med. Chem. 36:2921-8) available from Molecular Simulations Inc., San Diego, Calif.; LeapFrog, available from Tripos Associates, St. Louis, Mo.; and SPROUT (Gillet, et al. 1993. J. Comput. Aided Mol. Design. 7:127-53) available from the University of Leeds, UK.
Once a compound has been designed or selected by the above methods, the efficiency with which that ligand may bind to, modulate and/or interfere with a HGF β binding site or other structural feature may be tested and optimized by computational evaluation. For example, an effective HGF β binding site ligand should preferably demonstrate a relatively small difference in energy between its bound and free states (i.e., a small deformation energy of binding). Thus, an efficient HGF β binding site ligand should preferably be designed with a deformation energy of binding of not greater than about 10 to about 15 kcal/mole, such as about 12 kcal/mole, preferably not greater than about 8 to about 12 kcal/mole, such as about 10 kcal/mole, and more preferably not greater than about 5 to about 10 kcal/mole, such as about 7 kcal/mole. HGF β binding site ligands may interact with the binding site in more than one conformation that is similar in overall binding energy. In those cases, the deformation energy of binding is taken to be the difference between the free energy of the ligand and the average energy of the conformations observed when the ligand binds to the protein.
A ligand designed or selected as binding to, modulating and/or interfering with a HGF β binding site or other structural feature may be further computationally optimized so that in its bound state it would preferably lack repulsive electrostatic interaction with the target molecule and with the surrounding water molecules. Such non-complementary electrostatic interactions include repulsive charge-charge, dipole-dipole, and/or charge-dipole interactions.
Specific computer software is available to evaluate compound deformation energy and electrostatic interactions. Examples of programs designed for such uses include: Gaussian 94, revision C (M. J. Frisch, Gaussian, Inc., Pittsburgh, Pa.); AMBER, version 4.1 (P. A. Kollman, University of California at San Francisco,); QUANTA/CHARMM (Molecular Simulations, Inc., San Diego, Calif.); Insight II/Discover (Molecular Simulations, Inc., San Diego, Calif.); DelPhi (Molecular Simulations, Inc., San Diego, Calif.); and AMSOL (Quantum Chemistry Program Exchange, Indiana University). These programs can be implemented, for instance, using a Silicon Graphics workstation, such as an Indigo2 with IMPACT graphics. Other hardware systems and software packages will be known to those skilled in the art.
Another approach encompassed by this disclosure is the computational screening of small molecule databases for ligands or compounds that can bind in whole, or in part, to a HGF β structural feature, including binding site, active site, activation domain, tunnel, and/or dimerization region. In this screening, the quality of fit of such ligands to the binding site may be judged either by shape complementarity or by estimated interaction energy (Meng, et al., 1992. J. Comp. Chem., 13:505-24).
The disclosure also provides methods of identifying a molecule that mimics HGF β HGF β is an inhibitor of full length HGF and can be used to identify or design other like inhibitors. One method involves searching a molecular structure database with the structural coordinates of Table 5, and selecting a molecule from the database that mimics the structural coordinates of HGF β. The method may also be conducted with portions of the HGF β structural coordinates that define structural features, such as binding site for Met, activation domain, active site, tunnel and/or dimerization region. The selected molecule can also be analyzed for differences between HGF β and the selected molecule at sites of the structural feature or can be tested for the ability to mimic one of the functional activities of HGF β. HGF β can then be modified to incorporate these differences and tested for functional activity and the modified HGF β can be selected for altered functional activity. In some embodiments, the modified HGF molecule can bind Met, but not activate Met/HGF β signaling pathway.
Another method involves assessing agents that are antagonists or agonists of HGF β. A method comprises applying at least a portion of the crystallography coordinates of a crystal of HGF β, such as provided in Table 5 to a computer algorithm that generates a three-dimensional model of HGF β suitable for designing molecules that are antagonists or agonists and searching a molecular structure database to identify potential antagonists or agonists. In some embodiments, a portion of the structural coordinates of the crystal such as in Table 5 that define a structural feature, for example, binding site for Met, activation domain, active site, tunnel and/or dimerization region, may be utilized. The method may further comprise synthesizing or obtaining the agonist or antagonist and contacting the agonist or antagonist with HGF β and selecting the antagonist or agonist that modulates the HGF β and/or HGF activity compared to a control without the agonist or antagonists and/or selecting the antagonist or agonist that binds to HGF β. Activities of HGF β include phosphorylation of Met, stimulation of cell proliferation, and stimulation of cell migration.
A compound that is identified or designed as a result of any of these methods can be obtained (or synthesized) and tested for its biological activity, for example, binding to HGF and/or HGF β and/or modulation of HGF and/or HGF β activity. Other modulators of HGF β include, for example, monoclonal antibodies directed against HGF β, peptide(s) that can modulate HGF β function, or small-molecule compounds, such as organic and inorganic molecules, which can be identified with methods of the present disclosure.
7. Machine-Readable Storage Media
Transformation of the structure coordinates for all or a portion of HGF β or the HGF β/ligand complex or one of its ligand binding sites, for structurally homologous molecules (as defined below), or for the structural equivalents of any of these molecules or molecular complexes (as defined above), into three-dimensional graphical representations of the molecule or complex can be conveniently achieved through the use of commercially-available software.
The disclosure thus further provides a machine-readable storage medium including a data storage material encoded with machine-readable data wherein a machine programmed with instructions for using said data displays a graphical three-dimensional representation of any of the molecule or molecular complexes of this disclosure that have been described above. In one embodiment, the machine-readable data storage medium includes a data storage material encoded with machine-readable data wherein a machine programmed with instructions for using the abovementioned data displays a graphical three-dimensional representation of a molecule or molecular complex including all or any parts of a HGF βligand binding site or a HGF β-like ligand binding site or other structural features, as defined above. In another embodiment, the machine-readable data storage medium includes a data storage material encoded with machine readable data wherein a machine programmed with instructions for using the data displays a graphical three-dimensional representation of a molecule or molecular complex having a root mean square deviation from the atoms of the amino acids of not more than about ±0.05 Å.
In an alternative embodiment, the machine-readable data storage medium can include, for example, a data storage material encoded with a first set of machine readable data which includes the Fourier transform of structure coordinates of HGF β, and wherein a machine programmed with instructions for using the first set of data is combined with a second set of machine readable data including the X-ray diffraction pattern of an unknown or incompletely known molecule or molecular complex to determine at least a portion of the structure coordinates corresponding to the second set of machine readable data.
For example, a system for reading a data storage medium may include a computer including a central processing unit (“CPU”), a working memory which may be, for example, RAM (random access memory) or “core” memory, mass storage memory (such as one or more disk drives or CD-ROM drives), one or more display devices (e.g., cathode-ray tube (“CRT”) displays, light emitting diode (“LED”) displays, liquid crystal displays (“LCDs”), electroluminescent displays, vacuum fluorescent displays, field emission displays (“FEDs”), plasma displays, projection panels, etc.), one or more user input devices (e.g., keyboards, microphones, mice, track balls, touch pads, etc.), one or more input lines, and one or more output lines, all of which are interconnected by a conventional bi-directional system bus. The system may be a stand-alone computer, or may be networked (e.g., through local area networks, wide area networks, intranets, extranets, or the internet) to other systems (e.g., computers, hosts, servers, etc.). The system may also include additional computer controlled devices such as consumer electronics and appliances.
Input hardware may be coupled to the computer by input lines and may be implemented in a variety of ways. Machine-readable data of this disclosure may be inputted via the use of a modem or modems connected by a telephone line or dedicated data line. Alternatively or additionally, the input hardware may include CD-ROM drives or disk drives. In conjunction with a display terminal, a keyboard may also be used as an input device.
Output hardware may be coupled to the computer by output lines and may similarly be implemented by conventional devices. By way of example, the output hardware may include a display device for displaying a graphical representation of a binding site of this disclosure using a program such as QUANTA as described herein. Output hardware might also include a printer, so that hard copy output may be produced, or a disk drive, to store system output for later use.
In operation, a CPU coordinates the use of the various input and output devices, coordinates data accesses from mass storage devices, accesses to and from working memory, and determines the sequence of data processing steps. A number of programs may be used to process the machine-readable data of this disclosure. Such programs are discussed in reference to the computational methods of drug discovery as described herein. Machine-readable storage devices useful in the present disclosure include, but are not limited to, magnetic devices, electrical devices, optical devices, and combinations thereof. Examples of such data storage devices include, but are not limited to, hard disk devices, CD devices, digital video disk devices, floppy disk devices, removable hard disk devices, magneto-optic disk devices, magnetic tape devices, flash memory devices, bubble memory devices, holographic storage devices, and any other mass storage peripheral device. It should be understood that these storage devices can include necessary hardware (e.g., drives, controllers, power supplies, etc.) as well as any necessary media (e.g., disks, flash cards, etc.) to enable the storage of data.
8. Therapeutic Use
HGF modulator compounds obtained by methods of the invention are useful in a variety of therapeutic settings. For example, HGF β antagonists designed or identified using the crystal structure of the HGF β can be used to treat disorders or conditions, where inhibition or prevention of HGF and/or HGF β binding or activity is indicated.
Likewise, HGF β agonists designed or identified using the crystal structure of the HGF β can be used to treat disorders or conditions, where induction or stimulation or enhancement of HGF β activity is indicated.
An indication can be, for example, inhibition or stimulation of Met phosphorylation and the concomitant activation of a complex set of intracellular pathways that lead to cell growth, differentiation, and migration in a variety of cell types. The ability of HGF to stimulate mitogenesis, cell motility, and matrix invasion gives it a central role in angiogenensis, tumorogenesis and tissue regeneration. Another indication can be, for example, in inhibition or stimulation of embryonic development, for example, organogenesis. Still another indication can be, for example, in inhibition or stimulation of tissue regeneration. Another indication can be, for example, in inhibition of angiogenesis, mitogenesis and/or vasculogenesis. Expression of HGF has been associated with thyroid cancer, colon cancer, lymphoma, prostate cancer, and multiple myeloma. Yet another indication can be, for example, in inhibition or stimulation of the HGF/Met signaling pathway. Still yet another indication can be, for example, in inhibition of invasive tumor growth and metastasis.
HGF β antagonists are also useful as chemosensitizing agents, useful in combination with other chemotherapeutic drugs or growth inhibitory compounds, in particular, drugs that induce apoptosis. Examples of other chemotherapeutic drugs that can be used in combination with chemosensitizing HGF β inhibitors include topoisomerase I inhibitors (e.g., camptothecin or topotecan), topoisomerase II inhibitors (e.g., daunomycin and etoposide), alkylating agents (e.g., cyclophosphamide, melphalan and BCNU), tubulin-directed agents (e.g., taxol and vinblastine), and biological agents (e.g., antibodies such as anti CD20 antibody, IDEC 8, anti-VEGF antibody, immunotoxins, and cytokines). Other examples of chemotherapeutic agents include alkylating agents such as thiotepa and CYTOXAN® cyclosphosphamide; alkyl sulfonates such as busulfan, improsulfan and piposulfan; aziridines such as benzodopa, carboquone, meturedopa, and uredopa; ethylenimines and methylamelamines including altretamine, triethylenemelamine, trietylenephosphoramide, triethiylenethiophosphoramide and trimethylolomelamine; acetogenins (especially bullatacin and bullatacinone); a camptothecin (including the synthetic analogue topotecan); bryostatin; callystatin; CC-1065 (including its adozelesin, carzelesin and bizelesin synthetic analogues); cryptophycins (particularly cryptophycin 1 and cryptophycin 8); dolastatin; duocarmycin (including the synthetic analogues, KW-2189 and CB1-TM1); eleutherobin; pancratistatin; a sarcodictyin; spongistatin; nitrogen mustards such as chlorambucil, chlomaphazine, cholophosphamide, estramustine, ifosfamide, mechlorethamine, mechlorethamine oxide hydrochloride, melphalan, novembichin, phenesterine, prednimustine, trofosfamide, uracil mustard; nitrosureas such as carmustine, chlorozotocin, fotemustine, lomustine, nimustine, and ranimnustine; antibiotics such as the enediyne antibiotics (e.g., calicheamicin, especially calicheamicin gamma1I and calicheamicin omegaI1 (see, e.g., Agnew, Chem. Intl. Ed. Engl., 33: 183-186 (1994)); dynemicin, including dynemicin A; bisphosphonates, such as clodronate; an esperamicin; as well as neocarzinostatin chromophore and related chromoprotein enediyne antibiotic chromophores), aclacinomysins, actinomycin, authramycin, azaserine, bleomycins, cactinomycin, carabicin, caminomycin, carzinophilin, chromomycinis, dactinomycin, daunorubicin, detorubicin, 6-diazo-5-oxo-L-norleucine, ADRIAMYCIN® doxorubicin (including morpholino-doxorubicin, cyanomorpholino-doxorubicin, 2-pyrrolino-doxorubicin and deoxydoxorubicin), epirubicin, esorubicin, idarubicin, marcellomycin, mitomycins such as mitomycin C, mycophenolic acid, nogalamycin, olivomycins, peplomycin, potfiromycin, puromycin, quelamycin, rodorubicin, streptonigrin, streptozocin, tubercidin, ubenimex, zinostatin, zorubicin; anti-metabolites such as methotrexate and 5-fluorouracil (5-FU); folic acid analogues such as denopterin, methotrexate, pteropterin, trimetrexate; purine analogs such as fludarabine, 6-mercaptopurine, thiamiprine, thioguanine; pyrimidine analogs such as ancitabine, azacitidine, 6-azauridine, carmofur, cytarabine, dideoxyuridine, doxifluridine, enocitabine, floxuridine; androgens such as calusterone, dromostanolone propionate, epitiostanol, mepitiostane, testolactone; anti-adrenals such as aminoglutethimide, mitotane, trilostane; folic acid replenisher such as frolinic acid; aceglatone; aldophosphamide glycoside; aminolevulinic acid; eniluracil; amsacrine; bestrabucil; bisantrene; edatraxate; defofamine; demecolcine; diaziquone; elfornithine; elliptinium acetate; an epothilone; etoglucid; gallium nitrate; hydroxyurea; lentinan; lonidainine; maytansinoids such as maytansine and ansamitocins; mitoguazone; mitoxantrone; mopidanmol; nitraerine; pentostatin; phenamet; pirarubicin; losoxantrone; podophyllinic acid; 2-ethylhydrazide; procarbazine; PSK® polysaccharide complex (JHS Natural Products, Eugene, Oreg.); razoxane; rhizoxin; sizofuran; spirogermanium; tenuazonic acid; triaziquone; 2,2′,2″-trichlorotriethylamine; trichothecenes (especially T-2 toxin, verracurin A, roridin A and anguidine); urethan; vindesine; dacarbazine; mannomustine; mitobronitol; mitolactol; pipobroman; gacytosine; arabinoside (“Ara-C”); cyclophosphamide; thiotepa; taxoids, e.g., TAXOL® paclitaxel (Bristol-Myers Squibb Oncology, Princeton, N.J.), ABRAXANE™ Cremophor-free, albumin-engineered nanoparticle formulation of paclitaxel (American Pharmaceutical Partners, Schaumberg, Ill.), and TAXOTERE® doxetaxel (Rhône-Poulenc Rorer, Antony, France); chloranbucil; GEMZAR® gemcitabine; 6-thioguanine; mercaptopurine; methotrexate; platinum analogs such as cisplatin and carboplatin; vinblastine; platinum; etoposide (VP-16); ifosfamide; mitoxantrone; vincristine; NAVELBINE® vinorelbine; novantrone; teniposide; edatrexate; daunomycin; aminopterin; xeloda; ibandronate; CPT-1; topoisomerase inhibitor RFS 2000; difluoromethylomithine (DMFO); retinoids such as retinoic acid; capecitabine; and pharmaceutically acceptable salts, acids or derivatives of any of the above.
Also included in the definition of “chemotherapeutic agent” above are anti-hormonal agents that act to regulate or inhibit hormone action on tumors such as anti-estrogens and selective estrogen receptor modulators (SERMs), including, for example, tamoxifen (including NOLVADEX® tamoxifen), raloxifene, droloxifene, 4-hydroxytamoxifen, trioxifene, keoxifene, LY117018, onapristone, and FARESTON toremifene; aromatase inhibitors that inhibit the enzyme aromatase, which regulates estrogen production in the adrenal glands, such as, for example, 4(5)-imidazoles, aminoglutethimide, MEGASE® megestrol acetate, AROMASIN® exemestane, formestanie, fadrozole, RIVISOR® vorozole, FEMARA® letrozole, and ARIMIDEX® anastrozole; and anti-androgens such as flutamide, nilutamide, bicalutamide, leuprolide, and goserelin; as well as troxacitabine (a 1,3-dioxolane nucleoside cytosine analog); antisense oligonucleotides, particularly those which inhibit expression of genes in signaling pathways implicated in abherant cell proliferation, such as, for example, PKC-alpha, Ralf and H-Ras; ribozymes such as a VEGF expression inhibitor (e.g., ANGIOZYME ribozyme) and a HER2 expression inhibitor; vaccines such as gene therapy vaccines, for example, ALLOVECTIN® vaccine, LEUVECTIN® vaccine, and VAXID® vaccine; PROLEUKIN® rIL-2; LURTOTECAN® topoisomerase 1 inhibitor; ABARELIX® rmRH; and pharmaceutically acceptable salts, acids or derivatives of any of the above.
9. Other Uses
The HGF β chain, or variants thereof, form crystals in accord with the methods described herein. The crystals also are useful to store and/or deliver HGF β-chain molecules. HGF β may be useful as an inhibitor or antagonist of HGF. Crystals can be prepared and used to store HGF β-chain molecule for later use.
A variety of methods are known to those of skill in the art for formation of crystals. In some embodiments, for crystals prepared for storage, the crystal size and structure does not have to be so uniform or homogenous as for X-ray diffraction. In other embodiments, the crystals effectively diffract x-rays to a resolution of 5 Å or better. Typically, a purified polypeptide is contacted with a precipitant in the presence of a buffer. Precipitants include salts, polymers, or organic molecules. Organic precipitants include isopropanol, ethanol, hexanediol, and 2-methyl-2,4-pentanediol. Polymeric precipitants include polyethylene glycol and polyamines. Salts used include ammonium sulfate, sodium citrate, sodium acetate, ammonium dichloride, sodium chloride and magnesium formate. Many buffers can be utilized and are known to those of skill in the art.
In some cases, crystals can be cross-linked to one another. Such cross-linking may enhance the stability of the crystal. Methods of cross-linking crystals are know to those of skill in the art and have been described, for example, in U.S. Pat. No. 5,849,296.
The crystals can be maintained in crystallization solution, they can be dried, or combined with other carriers and/or other ingredients to form compositions and formulations. In some embodiments, the crystals can be combined with a polymeric carrier for stability and sustained release. In some embodiments, the HGF β has at least one biological activity when resolubilized. Biological activities of HGF β include binding to Met, phosphorylation of Met, stimulation of cell growth, and stimulation of cell migration.
Formulations of crystals of proteins, such as enzymes, receptors, antibodies, and like molecules, or fragments thereof, can include at least one ingredient or excipient. Ingredient or expedients are known to those of skill in the art and include acidifying agents, aerosol propellants, alcohol denaturants, alkalizing agents, anti-caking agents, antifoaming agents, microbial preservatives, anti-antioxidants, buffering agents, lubricants, chelating agents, colors, desiccants, emulsifying agents, filtering aids, flavors and perfumes, humectants, ointments, plasticizers, solvents (e.g. oils or organic), sorbents, carbon dioxide sorbents, stiffening agents, suppository bases, suspending or viscosity increasing agents, sweetening agents, tablet binders, table or capsule diluents, tablet disintegrants, tablet or capsule lubricants, tonicity agent, flavored or sweetened vehicles, oleaginous vehicles, solid carrier vehicles, water repelling agent, and wetting or solubilizing agents.
In some embodiments, the ingredients enhance storage stability. In other embodiments, the ingredient or excipient is preferably selected from the group consisting of albumin, sucrose, trehalose, lactitol, gelatin, and hydroxyproyl-β-cyclodextran.
All publications, patents, and patent documents are incorporated by reference herein, as though individually incorporated by reference. The disclosure has been described with reference to various specific and preferred embodiments and techniques. However, it should be understood that many variations and modifications can be made while remaining within the spirit and scope of the disclosure.
HGF β proteins were expressed in insect cells using baculovirus secretion vector pAcGP67 (Pharmingen, San Diego, Calif.). All constructs contained a His6 tag at the carboxy terminus and were purified to homogeneity (>95% purity) by Ni NTA metal chelate and gel filtration chromatography. For wildtype HGF β (SEQ ID NO:5), a cDNA fragment encoding the HGF β-chain from residues Val495 [c16] to Ser728 [c250] was cloned by PCR such that Val495 [c16] was inserted immediately after the secretion signal sequence. Site-directed mutagenesis was carried out using QuikChange™ (Stratagene, La Jolla, Calif.) with oligonucleotide 5′CCTAATTATGGATCCACAATTCCTG3′ (SEQ ID NO: 2) to make HGF β containing a Cys604 to Ser mutation (HGF β) (SEQ ID NO:1) HGF β mutants of SEQ ID NO:1 include Q534A [c57], D578A [c102], Y673A [c195], V692A [c214] and R695A [c217] were made as above in the HGFβ construct.
proHGF β (SEQ ID NO:7) encodes HGF from residues Asn479 to Ser728 and has a R494E mutation made using the oligonucleotide 5′CAAAACGAAACAATTGGAAGTTGTAAATGGGATTC 3′ (SEQ ID NO: 3). The cysteine was not altered in this construct to allow putative disulfide formation between Cys487 and Cys604.
Numbering for all amino acid sequences is as follows: full length HGF sequence starting with MWV . . . as numbers 1-3 [chymotrypsinogen numbering is shown in the brackets]. It will be readily apparent that the numbering of amino acids in other isoforms of HGF β may be different than that of the HGF β numbering disclosed herein. The disclosure provides sequential numbering based on sequence only. In some embodiments, an isoform may have structural “differences”, for example, if it carries insertion(s) or deletion(s) relative to the HGF β reference sequence. The chymotrypsinogen numbering convention may be useful for comparison.
The amino acid sequence of a HGF β (SEQ ID NO:1) is shown in Table 7. The amino acid sequence of wild type HGF β (SEQ ID NO:5) is shown in Table 9 and a full length HGF comprising an amino acid sequence of SEQ ID NO:6 is shown in Table 10. Other sequences are known to those of skill in the art.
Baculovirus vectors containing the desired inserts were transfected into Spodoptera frugiperda (Sf 9) cells on plates in TNM-FH media via the Baculogold™ Expression System according to manufacturer's instructions (Pharmingen, San Diego, Calif.). After 2-4 rounds of virus amplification, 10 mL of viral stock was used to infect 1 L of High Five™ cells (Invitrogen, San Diego, Calif.) in suspension at 5×105 cells/mL in TNM-FH media. Cultures were incubated at 27° C. for 72 h before harvesting the culture media by centrifugation at 8,000×g for 15 min. Cell culture media was applied to a 4 mL Ni-NTA agarose column (Qiagen, Valencia, Calif.). After washing with 4 column volumes of 50 mM Tris.HCl, pH 8.0, 500 mM NaCl, 5 mM imidazole, HGF β proteins were eluted with 50 mM Tris.HCl pH 8.0, 500 mM NaCl, 500 mM imidazole. The eluate was pooled and applied to a Superdex™-200 column (Amersham Biosciences, Piscataway, N.J.) equilibrated in 10 mM HEPES pH 7.2, 150 mM NaCl, 5 mM CaCl2. Protein peaks were collected and concentrated using a Centriprep™ YM-10 (Millipore, Bedford, Mass.). Fractions were analyzed by 12% SDS-PAGE stained with Coomassie blue. All mutations were verified by DNA sequencing and mass spectrometry. Protein concentration was determined by quantitative amino acid analysis. N-terminal sequencing revealed a single correct N-terminus present for proHGF β and HGF β. Purified proteins showed the correct molecular mass on SDS-PAGE; multiple bands observed were likely due to heterogeneous glycosylation, consistent with the mass spectrometry data having molecular masses about 2 kDa higher than predicted from the sequence.
Recombinant proteins were produced in 1 L cultures of Chinese hamster ovary (CHO) cells by transient transfection (Peek et al., 2002). pRK5.1 vectors used for CHO expression (Lokker 1992). Amino acid changes were introduced by site-directed mutagenesis (Kunkel, 1985) and verified by DNA sequencing. The expression medium (F-12/Dulbecco's modified Eagle's medium) contained 1% (v/v) ultra low IgG fetal bovine serum (FBS) (Gibco, Grand Island, N.Y.). After 8 days the medium was harvested and supplemented with FBS to give a final content of 5-10% (v/v). Additional incubation for 2-3 days at 37° C. resulted in complete single-chain HGF conversion. This step was omitted for expression of proHGF, an uncleavable single chain form, which has amino acid changes at the activation cleavage site (R494E) and at a protease-susceptible site in the α-chain (R424A) (Peek et al., 2002). Mutant proteins were purified from the medium by HiTrap-Sepharose SP cation exchange chromatography (Amersham Biosciences, Piscataway, N.J.) as described (Peek et al., 2002). Examination by SDS-PAGE (4-20% gradient gel) under reducing conditions and staining with Simply Blue Safestain showed that all mutant HGF proteins were >95% pure and were fully converted into α/β-heterodimers except for proHGF, which remained as a single-chain form. Protein concentration for each mutant was determined by quantitative amino acid analysis.
The mature form of the Met ECD (Glu25 to Gln929) (SEQ ID NO:4) domain containing a C-terminal His6 tag were expressed in insect cells and purified by Ni-NTA metal chelate and gel filtration chromatography using standard protocols described above. Met-IgG fusion protein was obtained as previously described (Mark et al., 1992). A representative amino acid sequence of wild type extracellular domain of the Met receptor is shown in Table 8. (SEQ ID NO: 4) Other sequences are known to those of skill in the art.
The binding affinity between HGF β and Met was determined by surface plasmon resonance using a Biacore 3000 instrument (Biacore, Inc., Piscataway, N.J.). The Met ECD domain was immobilized on a CM5 chip using amine coupling at about 2000 resonance units according to the manufacturer's instructions. A series of concentrations of HGF β in 10 mM HEPES pH 7.2, 150 mM NaCl, 5 mM CaCl2 ranging from 12.5 nM to 100 nM were injected at a flow rate of 20 μL/min for 40 s. Bound HGF β was allowed to dissociate for 10 min. Appropriate background subtraction was carried out. The association (kon) and dissociation (koff) rate constants were obtained by a global fitting program provided with the instrument; the ratio of koff/kon was used to calculate the dissociation constant (Kd).
Microtiter plates (Nunc, Roskilde, Denmark) were coated overnight at 4° C. with 2 μg/mL of rabbit anti-human IgG Fc specific antibody (Jackson ImmunoResearch Laboratory, West Grove, Pa.) in 50 mM sodium carbonate buffer, pH 9.6. After blocking with 1% BSA in HBS buffer (50 mM HEPES pH 7.2, 150 mM NaCl, 5 mM CaCl2 and 0.1% Tween-20), 1 μg/mL Met-IgG fusion protein (Mark et al., 1992) was added and plates were incubated for 1 h with gentle shaking at room temperature. After washing with HBS buffer, HGF β proteins were added for 1 h. Bound HGF β was detected using anti-His-HRP (Qiagen, Valencia, Calif.) followed by addition of TMB/H2O2 substrate (KPL, Gaithersburg, Md.). The reaction was stopped with 1M H3PO4 and the A450 was measured on a Molecular Devices SpectraMax Plus384 microplate reader. The effective concentration to give half-maximal binding (EC50) was determined by a four parameter fit using Kaleidagraph (Synergy Software, Reading, Pa.).
In order to develop a competition ELISA, wildtype HGF β was biotinylated using a 20-fold molar excess of biotin-maleimide (Pierce, Rockford, Ill.) at room temperature for 2 h. Plates were treated as above except biotinylated wildtype HGF β was used and detected using HRP-neutravidin (Pierce, Rockford, Ill.). Competition assays contained a mixture of 250 nM biotinylated wildtype HGF β and various concentrations of unlabeled HGF βvariants, HGF or proHGF. After incubation for 1 h at room temperature, the amount of biotinylated wildtype HGF β bound on the plate was measured as described above. IC50 values were determined by fitting the data to a four-parameter equation (Kaleidagraph, Synergy Software, Reading, Pa.).
Biotinylated HGF was prepared using the Sigma immunoprobe biotinylation kit (Sigma, St. Louis, Mo.). Microtiter plates were coated with rabbit anti-human IgG Fc specific antibody as above. Plates were washed in PBS 0.05% (v/v) Tween-20 followed by a 1 h incubation with 0.5% (w/v) of BSA, 0.05% Tween-20 in PBS, pH 7.4 at room temperature. After washing, 1 nM biotinylated HGF and 0.2 nM Met-IgG fusion protein together with various concentrations of HGF mutants were added to the wells and incubated for 2 h. After washing, bound biotinylated HGF was detected by addition of diluted (1:3000) streptavidin horseradish peroxidase conjugate (Zymed, South San Francisco, Calif.) followed by SureBlue TMB peroxidase substrate and stop solution TMB STOP (KPL, Gaithersburg, Md.). The A450 was measured and IC50 values were determined as described above. Relative binding affinities are expressed as the IC50(mutant)/IC50(wildtype HGF).
The kinase receptor activation assay (KIRA) was run as follows. Confluent cultures of lung carcinoma A549 cells (CCL-185, ATCC, Manassas, Va.), previously maintained in growth medium (Ham's F12/DMEM 50:50 (Gibco, Grand Island, N.Y.) containing 10% FBS, (Sigma, St. Louis, Mo.), were detached using Accutase (ICN, Aurora, Ohio) and seeded in 96 well plates at a density of 50,000 cells per well. After overnight incubation at 37° C., growth media was removed and cells were serum starved for 30 to 60 min in medium containing 0.1% FBS. Met phosphorylation activity by HGF, HGF mutants or HGF β-chain was determined from addition of serial dilutions from 500 to 0.2 ng/mL in medium containing 0.1% FBS followed by a 10 minute incubation at 37° C., removal of media and cell lysis with 1× cell lysis buffer (Cat. #9803, Cell Signaling Technologies, Beverly, Mass.) supplemented with 1× protease inhibitor cocktail set I (Cat. No. 539131, Calbiochem, San Diego, Calif.). Inhibition of HGF dependent Met phosphorylation activity by HGF β-chain was determined from addition of serial dilutions from 156 to 0.06 nM to assay plates followed by a 15 min incubation at 37° C., addition of HGF at 12.5, 25 or 50 nM, an additional 10 min incubation at 37° C., removal of media and cell lysis as above. Cell lysates were analyzed for phosphorylated Met via an electrochemiluminescence assay using an ORIGEN M-Series instrument (IGEN International, Gaithersburg, Md.). Anti-phosphotyrosine mAb 4G10 (Upstate, Lake Placid, MY) was labeled with ORI-TAG via NHS-ester chemistry according to manufacturer's directions (IGEN). Anti-Met ECD mAb 1928 (Genentech) was biotinylated using biotin-X-NHS (Research Organics, Cleveland, Ohio). The ORI-TAG-labeled 4G10 and biotinylated anti-Met mAb were diluted in assay buffer (PBS, 0.5% Tween-10, 0.5% BSA) and the cocktail was added to the cell lysates. After incubation at room temperature with vigorous shaking for 1.5 to 2 h, addition of streptavidin magnetic beads (Dynabeads, IGEN), and another incubation for 45 min, plates were read on the ORIGEN instrument.
Breast cancer cells MDA-MB-435 (HTB-129, ATCC, Manassas, Va.) were cultured in recommended serum-supplemented medium. Confluent cells were detached in PBS containing 10 mM EDTA and diluted with serum-free medium to a final concentration of 0.6-0.8×105 cells/mL. 0.2 mL of this suspension (1.2-1.6×105 total cells) was added in triplicate to the upper chambers of 24-well transwell plates (8 μm pore size) (HTS Multiwell™ Insert System, Falcon, Franklin Lakes, N.J.) pre-coated with 10 μg/mL of rat tail collagen Type I (Upstate, Lake Placid, N.Y.). Wildtype HGF or HGF mutants were added to the lower chamber at 100 ng/mL in serum-free medium, unless specified otherwise. After incubation for 13-14 h cells on the apical side of the membrane were removed and those that migrated to the basal side were fixed in 4% paraformaldehyde followed by staining with a 0.5% crystal violet solution. After washing and air-drying, cells were solubilized in 10% acetic acid and the A560 was measured on a Molecular Devices microplate-reader. Pro-migratory activities of HGF mutants were expressed as percent of HGF controls after subtracting basal migration in the absence of HGF. Photographs of stained cells were taken with a Spot digital camera (Diagnostics Instruments, Inc., Sterling Heights, Mich.) connected to a Leitz microscope (Leica Mikroskope & Systeme GmbH, Wetzlar, Germany). Pictures were acquired by Adobe Photoshop 4.0.1 (Adobe Systems Inc., San Jose, Calif.).
HGF β binding to Met was assessed from the change in resonance units measured by surface plasmon resonance on a CM5 chip derivatized with the extracellular domain of Met (Met ECD). The results show that HGF β binds to Met ECD with a Kd of 90 nM calculated from relatively fast association (kon=1.2×105 M−1s−1) and dissociation rate constants (koff=0.011 s−1) (
Since single-chain HGF binds to Met with comparable affinity to two-chain HGF, but does not induce Met phosphorylation (Lokker et al., 1992; Hartmann et al., 1992). This may be due to the lack of a Met binding site in the uncleaved form of the β-chain. proHGF β, a zymogen-like form of HGF β containing the C-terminal 16 residues from the HGF α-chain and a mutation at the cleavage site (R494E) to ensure that the single-chain form remained intact was expressed and purified. Binding of HGF β and proHGF β to Met was determined with a competition binding ELISA, resulting in IC50 values of 0.86±0.17 and 11.6±1.8 μM, respectively (
Although HGF β binds to Met, it does not induce Met phosphorylation (
To identify the Met binding site in the β-chain, residues were systematically changed in regions corresponding to the activation-domain and the active-site of serine proteases. Initial expression of HGF mutants in CHO cells yielded a mixture of single- and two-chain HGF forms, exemplified by mutant HGF I623A (
The functional consequence of mutating β-chain residues in HGF was assessed by determining the ability of the HGF mutants to stimulate migration of MDA-MB435 cells. The results showed that 3 HGF mutants, R695A [c217], G696A [c219] and Y673A [c195] were severely impaired, having less than 20% of wildtype activity, while 4 mutants Q534A [c57], D578A [c102], V692A [c214] and G694A [c216] had 20%-60% of wildtype activity (
To examine whether reduced activities in cell migration correlated with reduced Met phosphorylation, a subset of HGF mutants was examined in a kinase receptor assay (KIRA). For wildtype HGF and HGF mutants, maximal Met phosphorylation was observed at concentrations between 0.63 and 1.25 nM (
The affinity of each mutant for Met-IgG fusion protein was analyzed by HGF competition binding; 34 HGF mutants had essentially the same binding affinity as two-chain HGF (IC50=0.83±0.32 nM; n=30), indicated by their IC50 ratios (IC50mut/IC50WT), which ranged from 0.36 to 2.0 (Table 1). HGF Y673A [c195], K649A, and proHGF showed about a 4-fold weaker binding to Met-IgG compared to HGF (Table 1). The cell migration activities of selected mutants at 10- and 50-fold higher concentrations was examined; no increase in pro-migratory activity was observed (Table 2). Therefore, the impaired function of HGF mutants is not due to reduced binding to Met, since an increase in concentration of up to 50-fold had no compensatory effect.
The poor correlation between HGF binding to Met and either HGF dependent cell migration or Met phosphorylation is likely due to the relatively high affinity between Met and the HGF α-chain, which could mask any reduced affinity due to the β-chain. Therefore, selected mutations in HGF β itself were made to eliminate any α-chain effects. HGF β mutants Q534A [c57], D578A [c102], Y673A [c195], V692A [c214] and R695A [c217] were tested in a competition ELISA with biotinylated HGF β binding to Met-IgG (
HGF acquires biological activity upon proteolytic conversion of the single chain precursor form into two-chain HGF (Naka et al., 1992; Hartmann et al., 1992; Lokker et al., 1992; Naldini et al. 1992). Based on the structural similarity of HGF with chymotrypsin-like serine proteases (Perona and Craik, 1995; Rawlings et al., 2002; Donate et al., 1994) and plasminogen in particular, whether this activation process is associated with structural changes occurring in the HGF β-chain was studied.
Binding studies with purified HGF β-chains revealed that the ‘activated’ form of HGF β (Val495-Ser728) binds to Met with about a 13-fold higher affinity than its precursor form, proHGF β (Asn479-Ser728), consistent with the view that optimization of the Met binding site is contingent upon processing of single-chain HGF. This suggested that the Met binding site includes the HGF region undergoing conformational rearrangements after scHGF cleavage, i.e. the ‘activation domain’. Indeed, functional analysis of HGF variants with amino acid substitutions in the ‘activation domain’ led to the identification of the functional Met binding site. However, HGF mutants with the greatest losses in pro-migratory activities (Q534A, D578A, Y673A, V692A, G694A, R695A, G696A) displayed essentially unchanged binding affinities for Met, except for Y673A (4-fold loss), because HGF affinity is dominated by the HGF α-chain (Lokker et al., 1994; Okigaki et al., 1992). Consistent with this, the reduced activities remained unchanged upon increasing the concentration of HGF mutants by more than 50-fold. Therefore, the reduced activities of HGF mutants were interpreted as resulting from perturbed molecular interactions of HGF β-chain with its specific, low affinity, binding site on Met. In support of this, it was found that the reduced biological activities of selected HGF mutants were well correlated with reduced Met binding of the corresponding HGF β mutants.
Purified HGF β (SEQ ID NO:1) was concentrated to 10 mg/mL using a Centriprep® YM-10 in 10 mM HEPES pH 7.2, 150 mM NaCl, 5 mM CaCl2. Hanging drops (1 microliter protein and 1 microliter 30% PEG-1500) over a reservoir containing 500 microliter 30% PEG-1500 (Hampton Research, Laguna Niguel, Calif.) yielded crystalline rods (about 25×25×500 micrometers) during incubation at 19° C. overnight. A crystal fragment was preserved directly from the mother liquor by immersion in liquid nitrogen. Data extending to 2.53 Å resolution were collected on a Quantum 4 CCD detector (ADSC, Poway, Calif.) at ALS beam line 5.0.2 with 1.0 Å wavelength λ-rays. Data processing and reduction were performed using HKL (Otwinowski and Minor, 1996) (HKL Research, Charlottesville, Va.) and ccp4 (CCP4, 1994).
The structure was solved by molecular replacement using AMoRe (Navaza, 1994) in space group P3121, using parts of the protease domain of coagulation factor VIIa (Dennis et al., 2000) as the search probe. Refinement was performed using X-PLOR98 (MSI, San Diego) and REFMAC (Murshudov et al., 1997). Inspection of electron density maps and model manipulation were performed using XtalView (McRee, 1999) (Syrrx, San Diego, Calif.). The number in parenthesisis the number of atoms assigned zero occupancy.
1Nmeas is the total number of observations measured.
2Nref is the number of unique reflections measured at least once.
3Complete is the percentage of possible reflections actually measured at least once.
4Rmerge = Σ||I| − |<I>||/Σ|<I>|, where I is the intensity of a single observation and <I> the average intensity for symmetry equivalent observations.
5Rwork = Σ|Fo − Fc|/Σ|Fo|, where Fo and Fc are observed and calculated structure factor amplitudes, respectively.
6Rfree = Rwork for 531 reflections (5%) sequestered from refinement, selected at random from 99 resolution shells. R for all reflections is 0.249.
Each of the constituent amino acids of HGF β is defined by a set of structure coordinates as set forth in Table 5. The term “structure coordinates” refers to Cartesian coordinates derived from mathematical equations related to the patterns obtained on diffraction of a monochromatic beam of X-rays by the atoms (scattering centers) of a HGF β in crystal form. The diffraction data are used to calculate an electron density map of the repeating unit of the crystal. The electron density maps are then used to establish the positions of the individual atoms of the HGF β protein or protein/ligand complex.
Slight variations in structure coordinates can be generated by mathematically manipulating the HGF β or HGF β/ligand structure coordinates. For example, the structure coordinates as set forth in Table 5 could be manipulated by crystallographic permutations of the structure coordinates, fractionalization of the structure coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the structure coordinates, or any combination of the above. Alternatively, modifications in the crystal structure due to mutations, additions, substitutions, deletions, and combinations thereof, of amino acids, or other changes in any of the components that make up the crystal, could also yield variations in structure coordinates. Such slight variations in the individual coordinates will have little effect on overall shape. If such variations are within an acceptable standard error as compared to the original coordinates, the resulting three-dimensional shape is considered to be structurally equivalent. Structural equivalence is described in more detail below.
It should be noted that slight variations in individual structure coordinates of the HGF β would not be expected to significantly alter the nature of chemical entities such as ligands that could associate with an active site. In this context, the phrase “associating with” refers to a condition of proximity between a ligand, or portions thereof, and a HGF β molecule or portions thereof. The association may be non-covalent, wherein the juxtaposition is energetically favored by hydrogen bonding, van der Waals forces, and/or electrostatic interactions, or it may be covalent.
Resolution of the HGF β crystal structure revealed several structural features including the activation-domain, “active-site” region, a binding site for Met, a tunnel, dimerization region and the nature of the catalytic triad.
As shown in the examples, modeling of the crystal structure revealed a novel ligand-binding site for Met on HGF β. In some embodiments, amino acids defining HGF βstructural features include those amino acids summarized in Table 4A. In some embodiments, amino acids defining a “core” set of HGF β structural features include those amino acids summarized in Table 4B.
The atomic coordinates of HGF β are summarized in Table 5. The atomic coordinates of HGF 13 secondary structural features are summarized in Table 6.
To better interpret Met binding and activity data from HGF mutants, the HGF β structure at 2.53 Å resolution was solved. Data reduction and refinement statistics and final model metrics appear in Table 3.
HGF β crystals were formed using three intermolecular contacts for each molecule (
HGF β adopts the fold of chymotrypsin-like serine proteases, comprising two tandem distorted β-barrels. There are two poorly ordered and untraceable segments—His 645-Thr651 [c170a-c175] and the C-terminal region beginning with Tyr723 [c245]. The ‘active-site’ region of HGF β clearly differs from those of true enzymes (
HGF β residues that interact with Met are shown in
The functional importance of the [c220]-loop has precedent in the well-described family of chymotrypsin-like serine proteases (Perona and Craik, 1994; Hedstrom, 2002). The extended canonical conformation of substrates and inhibitors includes residues that can form main chain interactions with amino acid residues 692-696 [c214-c218]. This peptide segment has an amino acid which is inappropriate for “substrate” binding (Pro693) and overall the wrong conformation for “substrate” binding. This region is also recognized as an allosteric regulator of thrombin catalytic activity (Di Cera et al., 1995) and as an interaction site with its inhibitor hirudin (Stubbs and Bode, 1993). In addition, residues in Factor VIIa and thrombin that correspond to HGF R695 [c217] are important for enzyme-catalyzed substrate processing (Tsiang et al., 1995; Dickinson et al., 1996). Moreover, the corresponding residue in MSP, R683 [c217], plays a pivotal role in the high affinity interaction of MSP β-chain with its receptor Ron (Danilkovitch et al., 1999). MSP R683 [c217] is part of a cluster of five surface exposed arginine residues proposed to be involved in high affinity binding to Ron (Miller and Leonard, 1998). Although only R695 [c217] and possibly K649 [c173] are conserved in HGF, these residues are all located within the Met binding region of the HGF β-chain.
The binding site identified herein is in excellent agreement with the Met binding site revealed in the co-crystal structure of soluble Met Sema domain bound to HGF β3 as disclosed in the abovementioned copending application U.S. Ser. No. 60/568,865, filed May 6, 2005. For instance, the co-crystal structure shows that residues on the [c220]-loop, such as R695 [c217], make contacts to the Met receptor.
Our results are in contrast with previous studies demonstrating that HGF β-chain itself neither binds to nor inhibits HGF binding to Met (Hartmann et al., 1992; Matsumoto et al., 1998). In one instance, the HGF β-chain was different from ours, having extra α-chain residues derived from elastase cleavage of HGF, which could adversely affect Met binding. However, it is more likely that, for example, the concentrations used, the sensitivity of the assays, or the extent of pro-HGF processing may have been insufficient to observe binding to this low affinity site (Matsumoto et al., 1998). HGF β-chain has been reported to bind to Met although only in the presence of NK4 fragment from the α-chain (Matsumoto et al., 1998).
In principle, the existence of two Met binding sites in one HGF molecule could support a 2:1 model of a Met:HGF signaling complex, analogous to the proposed 2:1 model of Ron:MSP (Miller and Leonard, 1998). In the related MSP/Ron ligand/receptor system, individual α- and β-chains of MSP, which are devoid of signaling activity, can bind to Ron and compete with full length MSP for receptor binding (Danilkovitch et al., 1999). The same is true in the HGF/Met system. However, biochemical studies have not identified any 2:1 complexes of Met:HGF (Gherardi et al., 2003). In addition, this model of receptor activation requires some as yet unknown molecular mechanism that would prevent one HGF molecule from simultaneously binding to one Met receptor through its α- and β-chains.
The results suggest that the HGF β-chain may have functions in receptor activation beyond those involved in direct interactions with Met that would favor a 2:2 complex of HGF:Met. It was found that proHGF β the single chain ‘inactivated’ form of the HGF β-chain, bound more tightly to Met than several mutants in the ‘activated’ form of HGF β, i.e. Y673A, V692A, and R695A (
Although no structure exists for proHGF β, the most dramatic molecular change between activated and unactivated HGF β-chain almost certainly occurs at the activation cleavage site, where the new N-terminus inserts into the protein to form the salt bridge with the side chain of D672 [c194], akin to molecular changes seen with zymogens and proteases. In the crystal structure, HGF β forms a symmetric dimer. Upon inspection of intermolecular contacts seen in the HGF β crystal lattice, one of the dimer interfaces (
In conclusion, the results presented herein show that the β-chain of HGF contains a new interaction site with Met, which is similar to the ‘active-site’ region of serine proteases. Thus, HGF is bivalent, having a high affinity Met binding site in the NK1 region of the α-chain and a low affinity binding site on the HGF β chain. Other important interactions may occur between two HGF β-chains, two HGF α-chains (Donate et al., 1994), and as found with MSP/Ron (Angeloni et al., JBC, in press), between two Met Sema domains. Furthermore, heparin also plays a key role in HGF/Met receptor binding. The identification of a distinct Met binding site on the HGF β-chain can be used to design new classes of HGF and/or Met modulators, such as antagonists, agonists, inhibitors, and like agents, having therapeutic applications, such as, for treating cancer.
Among proteins with reported molecular structures, the amino acid sequence of HGF β is most homologous with that of plasmin/plasminogen, having 37% identity. Superimposition (Cohen, 1997) of the plasmin protease domain 1BUI (Berman et al., 2000; Parry et al., 1998) with HGF β using Cα atoms yields an rmsd of 1.2 Å for 192 atom pairs (out of 227 in our HGF β structure). A structure-based sequence alignment with plasmin shows HGF β has single amino acid deletions immediately before and after the sequence 505IGWMVSLRYR514 (
The plasmin structure (Parry et al., 1998) includes the C-terminal fragment from the plasmin A-chain, which is connected to the protease domain with two disulfide bonds (
These features lead to the conclusions that plasminogen is a poor structural model for proHGF in the region where the activating cleavage occurs and that is more different from HGF than plasminogen is from plasmin. Based on the MSP pro-sequence, the same conclusions are not applicable to MSP. This result suggests that pro-HGF is unlike single chain MSP or single chain chymotrypsin. This implication, coupled with the result showing that HGF-β (as would be found in 2-chain HGF) is reasonably similar to chymotrypsin, leads to a conclusion that the structural differences between single chain and 2-chain HGF are larger than differences between single chain and 2-chain forms of MSP, or chymotrypsin. This tends to supports the view that HGF-β conversion from single chain to 2-chain form mediates receptor activation.
The nonenzymatic ‘catalytic triad’ of HGF is shared by the acute phase plasma protein haptoglobin (Kurosky et al., 1980), the Trypanosome lytic factor binding protein haptoglobin-related protein (Drain et al. 2001) and the blood coagulation cofactor protein Z (Broze et al., 2001). Like HGF, they retain the intact ‘catalytic triad residue’ Asp [c102], but have changes in residues [c57] (Lys or Gln) and [c195] (Ala or Gly). MSP, the other member of the plasminogen-related growth factors, also has a nonenzymatic ‘catalytic triad’ in which residues [c57] and [c102] are each changed to Gln. Except for MSP, which uses the β-chain for a high affinity interaction with its receptor tyrosine kinase Ron, the role of these other nonenzymatic protease-like domains is not well understood. Their function may involve activation dependent formation of a protein binding epitope similar to that found on the β-chains of HGF and MSP.
Although zymogen forms of proteases are generally not catalytically competent, some are still capable of binding and even cleaving substrates. For example, single-chain forms of t-PA and u-PA still have catalytic activity, albeit somewhat reduced from the corresponding activated forms, (Boose et al., 1989; Lijnen et al., 1990). Thus, binding of the zymogen-like 13-chain of scHGF to Met, would not be without precedent; our binding data of proHGF β to Met supports this idea.
Another HGF β-chain region with the potential for protein-protein interactions corresponds to exosite I of thrombin (fibrinogen binding exosite). Exosite I is present as zymogen and active forms (Vijayalakshmi et al., 1994) and contains a positively charged patch centered around the [c70-80]-loop (Stubbs and Bode, 1993), which is involved in interactions with substrates, cofactors and inhibitors (Stubbs and Bode, 1993). HGF β also has a positively charged surface in this region, suggesting a potential role in protein interactions. Although two mutational changes introduced in this region (I550-E559 [c70-c77]) did not affect HGF function in cell migration assays, the possibility remains of it interacting with cell surface co-stimulatory factors of Met signaling. The positive charge observed is consistent with heparin interactions. Heparin modulates HGF activity. The positively charged region comprises, consists essentially of, or consists of some or all residues 512, 515-517, 545, 547, 550, 553-565 or mixtures thereof. In some embodiments, the amino acid residues comprise, consist essentially of, or consist of one or more of Arg512, Asn515, Lys516, His517, Glu545, Trp547, Ile550, Val553, His554, Gly555, Arg556, Gly557, Asp558, Glu559, Lys560, Cys561, Lys562, Gln563, Val564, Leu565, or mixtures thereof.
This application claims the benefit of 35 USC 119 (e) to U.S. Ser. No. 60/569,301, filed May 6, 2004, which application is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60569301 | May 2004 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 11124635 | May 2005 | US |
Child | 12057291 | US |