The present invention relates generally to structural studies of plasmepsin V in complex with an inhibitor. In one aspect, the present invention relates to a crystal structure of plasmepsin V from Plasmodium vivax in complex with an inhibitor, and to methods of using the crystal structure and related structural information to identify, design and/or screen for inhibitors or redesign known inhibitors that interact with and/or modulate plasmepsin V activity. In another aspect, the present invention relates to a class of compounds based on the inhibitor useful in the treatment of malaria.
Plasmodium falciparum and P. vivax are responsible for the most severe forms of malaria in humans. These parasites infect humans through the bite of an infected Anopheles mosquito. The parasites then migrate to the liver and develop into merozoites that are then released and invade erythrocytes in which they develop and amplify through successive rounds of infection.
Effector proteins exported to the host cell by the parasites (Marti M. et al., 2004; Hiller N. L. et al., 2004) are responsible for a remarkable process of cellular renovation that is induced in parasite-infected erythrocytes (reviewed in Boddey J. A. & Cowman A. F., 2013). Remodelling of the host cell provides the means for the parasite to gain nutrients and remain hidden from host protective responses in a relatively protected niche.
Plasmepsin V is an aspartyl protease expressed by protozoan parasites of Plasmodium spp. and known to play a pivotal role in the recognising and processing of effector proteins for export to the host cell (Boddey J. A., et al. 2010; Russo I., et al. 2010). The protease is a type I integral membrane-bound protease with the active domain located on the luminal side of the endoplasmic reticulum (“ER”) (Boddey J. A., et al. 2010; Russo I., et al. 2010; Sleebs B. E., et al. 2014a).
Plasmepsin V recognises and cleaves a pentameric sequence R×L×E/Q/D called the Plasmodium EXport ELement (“PEXEL”) (Marti M. et al., 2004) or Vacuolar Transport Signal (“VTS”) (Hiller N. L. et al., 2004) on the carboxy side of the leucine residue (Chang H. H. et al., 2008; Boddey J. A., et al. 2009). This motif is located 15 to 30 amino acids C-terminal to the hydrophobic ER-type signal sequence of over 400 proteins destined for export (Marti M. et al., 2004; Hiller N. L. et al., 2004; Sargeant T. J. et al. 2006). The processing step by plasmepsin V uncovers the export signal (xE/Q/D) that is acetylated and directs these proteins to the plasma membrane and the parasitophorous vacuole membrane (Boddey J. A., et al. 2010; Russo I., et al. 2010; Sleebs B. E., et al. 2014a; Boddey J. A., et al. 2009) that surrounds the parasite. The proteins are then recognised by Plasmodium Translocon of EXported proteins (“PTEX”), a protein machine that translocates them into the parasite-infected erythrocyte (Elsworth B., et al. 2014; Beck J. R. et al. 2014).
Cleavage of the PEXEL motif by plasmepsin V is essential for protein export in P. falciparum-infected erythrocytes as the gene encoding this enzyme cannot be disrupted (Boddey J. A., et al. 2010; Klemba M & Goldberg D. E. 2005). Indeed, expression of a dominant negative form of plasmepsin V in P. falciparum has been shown to block the export of proteins to the host erythrocyte suggesting it is required for this trafficking pathway (Russo I., et al. 2010; Sleebs B. E., et al. 2014a).
Also, PEXEL mimetic compounds containing statine, a molecule that possesses a hydroxyl functionality that mimics the transition state of amide bond proteolysis, have been shown to inhibit in vitro activity of P. falciparum and P. vivax plasmepsin V, block protein export and be lethal to P. falciparum growth (Sleebs B. E., et al. 2014a). This demonstrates that plasmepsin V plays a crucial role in protein export and that this process is essential for parasite survival, thus validating plasmepsin V as an important antimalarial target in asexual blood stages.
Currently, there is no structural information of plasmepsin V in an apo form or in complex with a substrate or inhibitor. Thus, there is no structural information detailing how substrates or inhibitors interact with the substrate-binding site of plasmepsin V nor structural information detailing how plasmepsin V functions in export after proteolysis.
Accordingly, there is a need to determine the structure of plasmepsin V in complex with an inhibitor in order to better understand the nature of plasmepsin V substrate or inhibitor interactions and to assist in the rational design or re-design of inhibitors to develop new antimalarials.
The present invention is predicated in part by the determination of the crystal structure of plasmepsin V in complex with inhibitor WEHI-842, which allows visualisation, for the first time, of the structure of plasmepsin V as well as structural information detailing the substrate-binding site of plasmepsin V and its interactions with WEHI-842. The structure also identifies, for the first time, two features that are not present in canonical aspartyl proteases, namely a disulfide-bonded surface loop consisting of 17 amino acids, including four cysteine residues, and a helix-turn-helix motif. The atomic coordinates for the structure are presented in Appendix I.
As mentioned, the crystal structure reveals much needed structural information about the substrate-binding site of plasmepsin V through the binding of WEHI-842. The crystal structure reveals a canonical aspartyl protease fold including an enzyme domain comprised of a crescent-shaped and predominantly β-sheet core about the substrate-binding site. The enzyme domain includes N- and C-terminal subdomains that contact each other along a bottom of the substrate-binding site that contains the active site aspartates (Asp80 and Asp313). The amino and carboxyl ends of the plasmepsin V polypeptide are assembled into a characteristic six-stranded inter-domain β-sheet, which serves to anchor the N- and C-terminal subdomains together. The N-terminal subdomain further includes a distinctive β-hairpin loop structure, known as a “flap”, which lies perpendicular over the substrate-binding site and interacts with the bound WEHI-842.
The present invention is also predicated by development of inhibitor WEHI-842. WEHI-842 was developed from its precursor WEHI-916, which mimics the transition-state of amide bond proteolysis for PEXEL substrates using statine (Sleebs B. E., et al. 2014a). Whilst WEHI-916 was a good inhibitor of plasmepsin V protease activity in vitro, it was not particularly potent in blocking P. falciparum growth (EC50 5 μM) and the present inventors hypothesised that this was due to the highly polarized guanidium group on the P3 Arg of the transition state mimetic that impairs permeability across membranes.
In contrast, WEHI-842 has a non-proteinogenic amino acid canavanine, which replaces the P3 Arg in WEHI-916 and also an N-terminal carbamate that replaces the sulfonamide seen in WEHI-916. WEHI-842 has been observed to be a more potent inhibitor of plasmepsin V and of P. falciparum growth than WEHI-916. From previous studies, it is known that the exchange of the sulfonamide with a carbamate has no effect on binding affinity to plasmepsin V or parasite activity (Gazdik M. et al. 2015), so the improvement in activities seen with WEH-842 is attributed to replacing the P3 Arg with canavanine.
With the foregoing in view, one aspect of the present invention provides a plasmepsin V/WEHI-842 crystalline complex having the atomic coordinates as set forth in Appendix I. Generally, the crystalline complex comprising plasmepsin V and WEHI-842 or derivatives or components thereof are in essentially a pure native form.
Another aspect of the present invention is directed to a dataset of atomic coordinates defining the interaction between plasmepsin V and WEHI-842. In one embodiment, the dataset of atomic coordinates defines the interaction between the substrate-binding site of plasmepsin V and WEHI-842. In another embodiment, the dataset of atomic coordinates defines the interaction between the flap of plasmepsin V and WEHI-842.
Yet another aspect of the present invention is directed to conformational mimetics of the substrate-binding site surface of WEHI-842. The mimetics may interfere directly or interact directly with the substrate-binding site of plasmepsin V and/or with the flap of plasmepsin V or may interact elsewhere causing conformational changes to the substrate-binding site and/or the flap.
In one embodiment, a substrate-binding site and/or flap inhibitor is proposed to, for example, block or at least reduce cleavage of the PEXEL motif of effector proteins by plasmepsin V to inhibit or at least reduce the activity of P. falciparum and P. vivax plasmepsin V, block or at least reduce protein export and optionally be lethal to P. falciparum and/or P. vivax growth.
In another embodiment, a substrate-binding site and/or flap inhibitor is proposed, for example, in the treatment of malaria.
Accordingly, the present invention in one form resides in a plasmepsin V/WEHI-842 complex in crystalline form or a derivative, homologue, component and/or soluble form thereof.
In one embodiment, the complex comprises at least part of a plasmepsin V polypeptide chain in complex with WEHI-842.
In a preferred embodiment, the complex includes an enzyme domain from the plasmepsin V polypeptide chain in complex with WEHI-842.
In another preferred embodiment, the complex includes at least part of an N-terminal subdomain of the enzyme domain from the plasmepsin V polypeptide chain and at least part of a C-terminal subdomain of the enzyme domain from the plasmepsin V polypeptide chain, wherein the at least part of the N-terminal subdomain and the at least part of the C-terminal subdomain together define the substrate-binding site of the plasmepsin V polypeptide chain in complex with WEHI-842.
In a more preferred embodiment, the N-terminal subdomain includes at least part of a disulfide-bonded surface loop.
In another more preferred embodiment, the N-terminal subdomain includes at least part of the β-hairpin structure (known as the “flap”).
In yet another more preferred embodiment, the C-terminal subdomain includes at least part of the helix-turn-helix motif.
In a further preferred embodiment, the complex comprises at least part of the substrate-binding site of plasmepsin V in complex with at least a portion of WEHI-842, preferably the complex also comprises at least part of the flap.
In a yet further preferred embodiment, the complex comprises at least part of the substrate-binding site of plasmepsin V including the S1, S2 and S3 binding pockets, wherein: the S1 binding pocket includes amino acids Ile78, Tyr139, Phe180 and Val188; the S2 binding pocket includes amino acid Gly315; and the S3 binding pocket includes amino acids Glu141 and Gln183.
In a most preferred embodiment, the complex comprises the components of the structure defined by the atomic coordinates shown in Appendix I or a subset a thereof.
The present invention in another form provides a method of identifying, designing and/or screening for a compound that can potentially interact with plasmepsin V, preferably the substrate-binding site and/or the flap of plasmepsin V, including performing structure-based identification, design and/or screening of a compound based on the compound's interaction with a plasmepsin V structure defined by the atomic coordinates of Appendix I or a subset thereof.
In another form, the present invention provides a method of identifying, designing and/or screening for a compound that can potentially mimic WEHI-842 interacting with plasmepsin V, including performing structure-based identification, design and/or screening of a compound based on (i) the compound's interaction with a plasmepsin V structure and/or (ii) the compound's similarity with a WEHI-842 structure in complex with a plasmepsin V structure defined by the atomic coordinates of Appendix I or a subset thereof.
In one embodiment, the method includes identifying, designing and/or screening for a compound which interacts with the three-dimensional structure of the substrate-binding site and/or the flap of plasmepsin V, the structure being defined by the atomic coordinates shown in Appendix I, wherein interaction of the compound with the structure is favoured energetically.
In another embodiment, the method includes identifying, designing and/or screening for a compound based upon the three-dimensional structure of WEHI-842 in complex with components of the substrate-binding site and/or the flap of plasmepsin V, the structure being defined by the atomic coordinates shown in Appendix I or a subset thereof, wherein interaction of the compound with the structure is favoured energetically.
The present invention in another form provides a method of identifying an inhibitor compound comprising an entity selected from the group consisting of an antibody, an antigen-binding fragment, a peptide, a non-peptide molecule and a chemical compound, wherein said inhibitor compound is capable of blocking biological activity resulting from an interaction with plasmepsin V, wherein said process includes:
In one embodiment, the method includes identifying an inhibitor compound capable of interacting with at least part of the substrate-binding site and/or the flap of plasmepsin V as defined by the atomic coordinates shown in Appendix I, preferably the entire substrate-binding site and the flap.
In a further embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define one or more regions of WEHI-842 in complex with plasmepsin V.
In a preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define WEHI-842 in complex with at least part of the substrate-binding site from the plasmepsin V polypeptide chain and at least part of the flap from the N-terminal subdomain of the plasmepsin V polypeptide chain.
In another preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define portions of the molecular surface of the substrate binding site of plasmepsin V and the flap from the N-terminal subdomain from plasmepsin V, which interact with at least a portion of the molecular surface of WEHI-842.
In another embodiment, the atomic coordinates or a subset thereof define one or more amino acids selected from 44 to 240 and 273 to 470 of plasmepsin V of P. vivax in complex with WEHI-842.
In another preferred embodiment, the one or more amino acids selected from 44 to 240 and 273 to 470 of plasmepsin V include one or more amino acids selected from the group consisting of Tyr59, Ala60, Ile78, Asp80, Gly82, Tyr139, Cys140, Glu141, Phe180, Gln183, Val188, Asp313, Gly315 and Thr317.
In another preferred embodiment, the atomic coordinates as shown in Appendix I or a subset thereof define at least part of the substrate-binding site of plasmepsin V including the S1, S2 and S3 binding pockets, wherein: the S1 binding pocket includes amino acids Ile78, Tyr139, Phe180 and Val188; the S2 binding pocket includes residue Gly315; and the S3 binding pocket includes amino acids Glu141 and Gln183.
In another form, the present invention includes use of the atomic coordinates or a subset thereof as shown in Appendix I at least representing:
In another form, the present invention includes use of the atomic coordinates or a subset thereof as shown in Appendix I at least representing:
In another form, the present invention includes a set of atomic coordinates as shown in Appendix I, or a subset thereof, at least representing:
In another form, the present invention includes an inhibitor of a site comprising one or more amino acids selected from 44 to 240 and 273 to 470 of plasmepsin V, including one or more amino acids selected from the group consisting of Tyr59, Ala60, Ile78, Asp80, Gly82, Tyr139, Cys140, Glu141, Phe180, Gln183, Val188, Asp313, Gly315 and Thr317. In one embodiment, the site comprises one or more amino acids forming at least part of the substrate-binding site of plasmepsin V including the S1, S2 and S3 binding pockets, wherein: the S1 binding pocket includes amino acids Ile78, Tyr139, Phe180 and Val188; the S2 binding pocket includes amino acid Gly315; and the S3 binding pocket includes amino acids Glu141 and Gln183. In one embodiment, the inhibitor of the site may be an isolated, synthetic, purified, recombinant and/or non-naturally occurring inhibitor.
The present invention has enabled the identification of molecular surface interactions between WEHI-842 and the substrate-binding site and/or the flap of plasmepsin V. In particular, the present invention has enabled the determination of key amino acids involved in the binding of WEHI-842 to the substrate-binding site and/or the flap. It will be evident to a person skilled in the art that these findings can be transposed on to related aspartyl proteases including homologous plasmepsin V from Plasmodium spp., including P. falciparum.
The present invention is therefore also useful in the identification and/or design of candidate compounds that bind to the substrate-binding site and/or interact with the flap of related aspartyl proteases, including homologous plasmepsin V from Plasmodium spp., including P. falciparum.
In one embodiment, candidate compounds for interacting with plasmepsin V or related aspartyl proteases may be chemically modified as a result of structure-based evaluation using the atomic coordinates as defined in Appendix I or a subset thereof.
Candidate compounds and compounds identified or designed using a method or process of the present invention may be any suitable compound, including naturally occurring compounds, de novo designed compounds, library generated compounds (chemically or recombinantly generated), mimetics etc., and may include organic compounds, new chemical entities, antibodies, binding proteins other than antibody-based molecules (non-immunoglobulin proteins) including, for example, protein scaffolds, designed ankyrin repeat proteins (DARPins, Stumpp et al., 2007) and protein A domains (reviewed in Binz et al, 2005), avimers (Silverman et al., 2005), and other new biological entities such as nucleic acid aptamers (reviewed in Ulrich, 2006).
The present invention is also useful for improving the properties of ligands for the substrate-binding site of plasmepsin V and/or related aspartyl proteases. For example, existing ligands may be screened against the 3D structure of the substrate-binding site and/or the flap of plasmepsin V or a region thereof as defined by the atomic coordinates of Appendix I or a subset thereof, and an assessment made of the potential to energetically interact with the substrate-binding site and/or the flap of plasmepsin V.
Similarly, existing substrate-binding ligands can be screened against the 3D structure of the substrate-binding surface and/or the flap-interacting surface of WEHI-842 bound to plasmepsin V as defined by the atomic coordinates of Appendix I or a subset thereof, and an assessment made of the potential to energetically interact with the substrate-binding site and/or the flap of plasmepsin V.
Thus, the present invention also provides a method of re-designing a compound which is known to bind to plasmepsin V comprising performing structure-based evaluation of the compound based on the compound's interactions with a plasmepsin V structure defined by the atomic coordinates of Appendix I or a subset thereof and re-designing or chemically modifying the compound as a result of the evaluation.
In another form, the present invention provides a method of re-designing a compound which is known to bind to plasmepsin V comprising performing structure-based evaluation of the compound's similarity with a structure of WEHI-842 in complex with plasmepsin V as defined by the atomic coordinates of Appendix I or a subset thereof and re-designing or chemically modifying the compound as a result of the evaluation.
In one embodiment, the compound which is known to bind to plasmepsin V is re-designed or chemically modified to (i) improve affinity for binding to plasmepsin V, and/or (ii) optionally, lower affinity for binding to other aspartyl proteases, such as, e.g., plasmepsin I-IV.
In another embodiment, the method may further include synthesising the compound once re-designed or chemically modified, and optionally incorporating said compound once re-designed or chemically modified in a biological activity assay, preferably to determine whether said compound inhibits the biological activity of plasmepsin V.
The present invention also provides a computer system for identifying one or more compounds that can potentially interact with plasmepsin V, the system containing data representing the structure of: (i) the substrate-binding site of plasmepsin V, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (ii) the flap of plasmepsin V, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (iii) the substrate-binding site surface on WEHI-842, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (iv) the flap-interacting surface on WEHI-842, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (v) a combination thereof, the structure being defined by the atomic coordinates shown in Appendix I or a subset thereof.
In another aspect, the present invention provides a computer-readable medium having recorded data thereon representing a model and/or the atomic coordinates as shown in Appendix I, or a subset of atomic coordinates thereof, the model and/or the atomic coordinates at least representing:
Also provided are a set of atomic coordinates as shown in Appendix I, or a subset thereof, at least representing:
The three-dimensional structure of plasmepsin V and/or substrate-binding site of plasmepsin V and/or the flap of plasmepsin V and/or WEHI-842 and/or the one or more regions of WEHI-842 in complex with plasmepsin V and/or the one or more regions of WEHI-842 in complex with the substrate-binding site of plasmepsin V and/or the one or more regions of WEHI-842 in complex with the flap of plasmepsin V may be used to develop models useful for drug design and/or in silico screening of candidate compounds that interact with and/or modulate plasmepsin V. Other physicochemical characteristics may also be used in developing the model, e.g. bonding, electrostatics, etc.
Generally, the term “in silico” refers to the creation in a computer memory, i.e., on a silicon or other like chip. Stated otherwise “in silico” means “virtual”. When used herein the term “in silico” is intended to refer to screening methods based on the use of computer models rather than in vitro or in vivo experiments.
Accordingly, the present invention also provides a computer-assisted method of identifying a compound that potentially interacts with plasmepsin V, which method comprises fitting the structure of: (i) the substrate-binding site of plasmepsin V, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (ii) portions of plasmepsin V, preferably including the flap, the structure being defined by a subset of the atomic coordinates shown in Appendix I, to the structure of a candidate compound.
Also provided by the present invention is a computer-assisted method of identifying a molecule able to interact with plasmepsin V using a programmed computer comprising a processor, which method comprises the steps of: (a) generating, using computer methods, a set of atomic coordinates of a structure that possesses energetically favourable interactions with the atomic coordinates of: (i) the substrate-binding site of plasmepsin V, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or (ii) at least portions of plasmepsin V, preferably including the flap, the structure being defined by a subset of the atomic coordinates shown in Appendix I, which coordinates are entered into the computer thereby generating a criteria data set; (b) comparing, using the processor, the criteria data set to a computer database of chemical structures; (c) selecting from the database, using computer methods, chemical structures which are complementary or similar to a region of the criteria data set; and (d) optionally, outputting, to an output device, the selected chemical structures which are complementary to or similar to a region of the criteria data set.
The present invention further provides a computer-assisted method of identifying potential mimetics of WEHI-842 using a programmed computer comprising a processor, the method comprising the steps of: (a) generating a criteria data set from a set of atomic coordinates of: (i) the substrate-binding site of plasmepsin V, the structure being defined by a subset of the atomic coordinates shown in Appendix I; (ii) at least a portion of plasmepsin V, preferably including the flap, the structure being defined by a subset of the atomic coordinates shown in Appendix I; and/or WEHI-842, the structure being defined by a subset of the atomic coordinates shown in Appendix I, which coordinates are entered into the computer; (b)(i) comparing, using the processor, the criteria data set to a computer database of chemical structures stored in a computer data storage system and selecting from the database, using computer methods, chemical structures having a region that is structurally similar to the criteria data set; or (ii) constructing, using computer methods, a model of a chemical structure having a region that is structurally similar to the criteria data set; and (c) optionally, outputting to an output device: (i) the selected chemical structures from step (b)(i) having a region similar to the criteria data set; or (ii) the constructed model from step (b)(ii).
The present invention further provides a method of evaluating the ability of a compound to interact with plasmepsin V, the method comprising the steps of: (a) employing computational means to perform: (i) a fitting operation between the compound and the binding surface of a computer model of the substrate-binding site for WEHI-842 on plasmepsin V; and/or (ii) a superimposing operation between the compound and WEHI-842, the substrate-binding site for WEHI-842 on plasmepsin V or a portion thereof, using atomic coordinates wherein the root mean square deviation between the atomic coordinates and a subset of atomic coordinates of Appendix I or a subset of atomic coordinates of one or more thereof at least representing the substrate-binding site of plasmepsin V or a portion thereof, the flap or a portion thereof, plasmepsin V or WEHI-842, is not more than 1.5 Å; and (b) analysing the results of the fitting operation and/or superimposing operation to quantify the association between the compound and a binding surface model.
The present invention also provides a method of using molecular replacement to obtain structural information about a molecule or a molecular complex of an unknown structure, comprising the steps of: (i) generating an X-ray diffraction pattern of the crystallized molecule or molecular complex; and (ii) applying the atomic coordinates of Appendix I, or a subset of atomic coordinates thereof at least representing plasmepsin V, the substrate-binding site of plasmepsin V, the flap of plasmepsin V, WEHI-842, mimetics thereof, derivatives thereof or portions thereof, to the X-ray diffraction pattern to generate a three-dimensional electron density map of at least a region of the molecule or molecular complex whose structure is unknown.
The present invention also encompasses compounds that bind to plasmepsin V designed, re-designed and/or modified using the methods or processes of the present invention. Preferably, such compounds have an affinity (KD) for plasmepsin V of less than 10−7 M. Preferably, the compounds bind to the substrate-binding site of plasmepsin V. More preferably, the compounds bind to the substrate-binding site and the flap of plasmepsin V.
In another aspect, the present invention relates to a compound of Formula I or a pharmaceutically acceptable salt thereof:
wherein:
In some embodiments of the compounds of formula (I), one or more of the following applies:
In one embodiment, the compound of Formula I is a compound of Formula II or a pharmaceutically acceptable salt thereof:
In another embodiment, the compound of formula (I) is the compound:
As used herein, the term “alkyl” refers to a straight chain or branched saturated hydrocarbon group. The alkyl group may have a specified number of carbon atoms, for example, C1-7 alkyl refers to alkyl groups having 1, 2, 3, 4, 5, 6 or 7 carbon atoms in a linear or branched arrangement. Unless otherwise defined, the term “alkyl” may be C1-12alkyl or C1-6alkyl. Examples of suitable alkyl groups may include, but are not limited to, methyl, ethyl, propyl (n-propyl and i-propyl), butyl (n-butyl, i-butyl and t-butyl), n-pentyl, 2-methylbutyl, 3-methylbutyl, 4-methylbutyl, 2-ethylpropyl, n-hexyl, 2-methylpentyl, 3-methylpentyl, 4-methylpentyl, 2-ethylbutyl, 3-ethylbutyl, heptyl, octyl, nonyl, decyl, undecyl and dodecyl.
As used herein, groups such as “haloalkyl”, “alkylOH”, “alkylCO2H”, “alkylNH2” and the like means that the halo, OH, CO2H and NH2 group may be positioned on any suitable carbon of the alkyl chain. In one embodiment, for “alkylOH” the OH group is positioned at a terminal or non-terminal carbon of the alkyl group.
As used herein, the term “alkenyl” refers to a straight-chain or branched hydrocarbon group having one or more double bonds between two carbon atoms. The alkenyl group may have a specified number of carbon atoms, for example, C2-7 alkenyl refers to alkenyl groups having 2, 3, 4, 5, 6 or 7 carbon atoms in a linear or branched arrangement. Unless otherwise defined, the term “alkenyl” may be C2-12alkenyl or C2-6alkenyl. Exemplary alkenyl groups may include, but are not limited to, ethenyl, propenyl, isopropenyl, butenyl, butadienyl, pentenyl, pentadienyl, hexenyl, hexadienyl, heptenyl, octenyl, nonenyl, decenyl, undecenyl and dodecenyl.
As used herein, the term “alkynyl” refers to a straight chain or branched hydrocarbon group having one or more triple bonds between two carbon atoms. The alkynyl group may have a specified number of carbon atoms, for example, C2-7alkynyl refers to alkynyl groups having 2, 3, 4, 5, 6 or 7 carbon atoms in a linear or branched arrangement. Unless otherwise defined, the term “alkynyl” may be C2-12alkynyl or C2-6alkynyl. Exemplary alkynyl groups may include, but are not limited to, ethynyl, propynyl, butynyl, pentynyl, hexynyl, heptynyl, octynyl, nonynyl, decynyl, undecynyl and dodecynyl.
As used herein, the term “alkoxy” refers to the group —O-alkyl. Similarly, “haloalkoxy” refers to the group —O-alkyl, in which one or more hydrogen atoms in the alkyl group is substituted by halogen atoms. “Haloalkyl” includes perhaloalkyl groups in which all hydrogen atoms are replaced with halogen atoms.
As used herein, the term “halo” refers to a halogen atom. Exemplary halo groups include fluoro (fluorine), chloro (chlorine), bromo (bromine) and iodo (iodine); especially fluoro or chloro; most especially fluoro.
As used herein, the term “aryl” refers to any stable, monocyclic, bicyclic or tricyclic ring (or ring system) of up to 7 atoms in each ring, wherein at least one ring is aromatic. The rings may be fused to one another when more than one ring is present. The aryl group may have a specified number of carbon atoms in the ring system. For the avoidance of doubt, the term “aryl” does not encompass a group having a heteroaryl ring. For example, C6-10 aryl refers to an aryl group with 6, 7, 8, 9 or 10 carbon atoms in the ring system (and encompasses phenyl and naphthyl groups). Exemplary aryl groups include, but are not limited to, phenyl, naphthyl, tetrahydronaphthyl, indanyl, biphenyl and binaphthyl.
As used herein, the term “heteroaryl” represents a stable monocyclic, bicyclic or tricyclic ring of up to 7 atoms in each ring, wherein at least one ring is aromatic and at least one ring contains from 1 to 4 heteroatoms selected from the group consisting of O, N and S. The rings may be fused if more than one ring is present. The heteroaryl group may also include at least one carbonyl group attached to an unsaturated carbon in the ring system. The heteroaryl group may include a specified number of carbon atoms in the ring system. For example, C6-10heteroaryl refers to a heteroaryl group with 6, 7, 8, 9 or 10 carbon atoms in the ring system, in which the ring system may include other heteroatoms such as O, S or N. Exemplary heteroaryl groups may include pyrrolyl, furanyl, thienyl, pyrazolyl, imidazolyl, triazolyl, isoxazolyl, oxazolyl, thiazolyl, isothiazolyl, oxadiazolyl, oxatriazolyl, pyridinyl, pyridazinyl, pyrimidinyl, pyrazinyl, triazinyl, azepinyl, oxepinyl, thiepinyl, diazepinyl, benzofuranyl, isobenzofuranyl, benzothienyl, indolyl, indolinyl, isoindolyl, benzimidazolyl, benzisoxazolyl, benzoxazolyl, benzothiazolyl, benzyopyranyl, benzopyranonyl, quinolinyl, tetrahydroquinolinyl, isoquinolinyl, quinazolinyl, quinoxalinyl, tetrahydroquinoxalinyl and naphthyridinyl.
As used herein, the term “cycloalkyl” refers to a saturated cyclic hydrocarbon. The cycloalkyl ring may include a specified number of carbon atoms. For example, a C3-10cycloalkyl group includes 3, 4, 5, 6, 7, 8, 9 or 10 carbon atoms. The cycloalkyl group may include two or three rings. When there are two or three rings, each ring is linked to one or more of the other rings by sharing one or more ring atoms to thereby form a spirane or fused ring system. The cycloalkyl group may also include a carbonyl group attached to a ring carbon atom. For the avoidance of doubt, the term “cycloalkyl” does not encompass a group having an aryl, heteroaryl or heterocyclyl ring. Examples of cycloalkyl groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptanyl and cyclooctanyl.
As used herein, the term “heterocyclyl” refers to a cycloalkyl group or cycloalkenyl group in which one or more carbon atoms have been replaced by heteroatoms independently selected from N, S and O. Between 1 and 4 carbon atoms in each ring may be replaced by heteroatoms independently selected from N, S and O. The heterocyclic group may be monocyclic, bicyclic or tricyclic. If there are two or three rings, one ring may be linked to another by sharing one or more ring atoms to thereby form a spirane or fused ring system. The heterocyclyl group may include a carbonyl group attached to an unsaturated ring carbon. The heterocyclyl group may have a specified number of ring carbon atoms, for example C3-10heterocyclyl refers to a heterocyclyl group having 3, 4, 5, 6, 7, 8, 9 or 10 carbon atoms in the ring system. Exemplary heterocyclyl groups include tetrahydrofuranyl, tetrahydrothiophenyl, pyrrolidinyl, 2-pyrrolidonyl, pyrrolinyl, dithiolyl, 1,3-dioxolanyl, pyrazolinyl, imidazolinyl, imidazolidonyl, piperidinyl, piperazinyl, morpholinyl and tetrahydropyranyl.
The compounds of Formula I or II may be in the form of a pharmaceutically acceptable salt. Suitable pharmaceutically acceptable salts may include, but are not limited to, salts of pharmaceutically acceptable inorganic acids such as hydrochloric, sulphuric, phosphoric, nitric, carbonic, boric, sulfamic, and hydrobromic acids, or salts of pharmaceutically acceptable organic acids such as acetic, propionic, butyric, tartaric, maleic, lactic, citric, benzoic and glutamic acids. Non-pharmaceutically acceptable salts may also fall within the scope of the invention as these may be useful as intermediates in the preparation of pharmaceutically acceptable salts or during storage or transport.
The compounds of Formula I and II possess asymmetric centres. The invention may also relate to compounds in substantially pure isomeric form at one or more of said centres, for example greater than about 90% ee, such as greater than 95% ee or greater than 97% ee or greater than 99% ee. Such isomers may be prepared by asymmetric synthesis, for example using chiral intermediates, or by chiral resolution.
The compounds of Formula I and II may be synthesised employing solution or solid phase peptide chemistry procedures, as appropriately modified to include non-peptidic groups. Solution and solid phase synthetic procedures may be known to a skilled person. For example, solid phase synthetic procedures may be performed on a resin. The peptide chemistry procedures may employ either BOC or Fmoc chemistry. For example, the synthetic procedure may include successive treatments of: (i) forming an amide bond by mixing an amine with an N-protected amino acid, a coupling agent (such as HBTU (N,N,N′,N′-tetramethyl-O-(1H-benzotriazol-1-yl)uronium hexafluorophosphate)) and a base (such as diisopropylethylamine) in a polar aprotic solvent (such as dimethylformamide); and (ii) deprotecting the N-protected amino acid to provide an amine for further reaction.
The present invention may also involve combination therapies, such as the administration to a subject of a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof, together with other agents or procedures for treating or preventing malaria.
The compounds of the present invention may be used as pharmaceuticals. Consequently, in a further aspect the present invention provides a pharmaceutical composition comprising a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof. The pharmaceutical composition may include a pharmaceutically acceptable carrier or excipient.
The carrier(s) must be acceptable in the sense of being compatible with the other ingredients of the composition and not being deleterious to the subject.
Pharmaceutical compositions include those suitable for oral, rectal, nasal, topical (including buccal and sub-lingual), vaginal or parenteral (including intramuscular, sub-cutaneous, intrathecal and intravenous) administration or in a form suitable for administration by inhalation or insufflation. The pharmaceutical composition may be especially suitable for parenteral administration. The compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof may be placed with a pharmaceutically acceptable carrier or excipient in a pharmaceutical composition. Said composition may be in the form of a solid (including tablets, filled capsules, powders, capsules, suppositories, dispersible granules and pessaries), or a liquid (including solutions, suspensions, emulsions, colloids, elixirs, creams, gels and foams). In one embodiment, the pharmaceutical composition may be in the form of a sterile injectable solution for parenteral use. It is believed that the use of various carriers and excipients for pharmaceutically acceptable compounds are well known in the art. Except insofar as any conventional excipient or carrier is incompatible with the active compound, use thereof in the pharmaceutical composition is contemplated.
The nature of the pharmaceutical composition and the carrier or excipient will depend on the route of administration and the nature of the condition and the subject being treated. It is believed that the choice of a particular carrier or delivery system, and route of administration could be readily determined by a person skilled in the art. In some circumstances it may be necessary to protect the compound (in view of the amide bonds) by means known in the art, for example, by micro encapsulation. The route of administration should also be chosen such that the compound reaches its site of action.
The pharmaceutically acceptable carrier or excipient may be either a solid or a liquid. A solid carrier or excipient may act as a diluent, flavouring agent, solubilizer, lubricant, suspending agent, binder, preservative, tablet disintegrating agent or an encapsulating material. Suitable solid carriers and excipients would be known to a skilled person.
If the pharmaceutical composition is a powder, the active component and a carrier or excipient may both be finely divided powders which are mixed together.
If the pharmaceutical composition is a tablet, the active component may be mixed with a suitable amount of a carrier or excipient which has the necessary binding capacity before compaction into a tablet of the desired shape and size.
Exemplary carriers or excipients for powders and tablets may include, for example, magnesium carbonate, talc, sugar, lactose, pectin, dextrin, starch, gelatin, tragacanth, methylcellulose, a low melting wax, cocoa butter and the like.
Liquid form preparations may include, for example, water or water-propylene glycol solutions. For example, parenteral injection liquid preparations can be formulated as solutions in aqueous polyethylene glycol solution.
Liquid pharmaceutical compositions may be formulated in unit dose form. For example, the compositions may be presented in ampoules, pre-filled syringes, small volume infusion or in multi-dose containers. Such compositions may include a preservative. The compositions may also include formulatory agents such as suspending, stabilising and/or dispersing agents. The composition may also be in powder form for constitution with a suitable vehicle (such as sterile water) before use. Liquid carriers and excipients may include colorants, flavours, stabilizers, buffers, artificial and natural sweeteners, dispersants, thickeners, solubilizing agents, suspending agents and the like.
Aqueous solutions for oral use may be prepared by dissolving the active compound in water and adding colourants, thickeners, flavours, and stabilizing agents, as necessary.
Aqueous suspensions for oral use may be prepared by dispersing the active component in water with viscous material, such as natural or synthetic gums, resins, methyl cellulose or other suspending agents.
The pharmaceutical composition may also include solid-form preparations intended to be converted into a liquid form for oral administration. For topical administration to the epidermis the compounds may be formulated as an ointment, cream or lotion, or as a transdermal patch.
For oral administration, the active compound may be incorporated with carriers or excipients and used in the form of ingestible tablets, buccal or sublingual tablets, troches, capsules, elixirs, suspensions, syrups, wafers and the like. Some of these oral administration routes may have the potential to avoid liver metabolism.
The compositions may also be administered by inhalation in the form of an aerosol spray from a pressurised dispenser or container, which contains a propellant such as carbon dioxide gas, dichlorodifluoromethane, nitrogen, propane or other suitable gas or gas combination.
In another aspect, the present invention provides a method of preventing or treating malaria, the method comprising administering to a subject a pharmaceutical composition of the invention, a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof.
In a further aspect, the present invention provides use of a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof in the manufacture of a medicament for the treatment or prevention of malaria.
In another aspect, the present invention provides use of a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof in the manufacture of a medicament for the treatment or prevention of a disease or condition associated with Plasmodium spp.
In a further aspect, the present invention provides a method of treating or preventing a disease or condition associated with Plasmodium spp., the method comprising administering to a subject in need thereof a pharmaceutical composition of the invention, a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof. The “disease or condition associated with Plasmodium spp.” by a disease or condition caused or affected by a Plasmodium spp. In one embodiment, the Plasmodium spp. is P. falciparum or P. vivax. In another embodiment, said disease or condition associated with Plasmodium spp. is malaria.
In yet a further aspect, the present invention provides a method inhibiting plasmepsin V, the method comprising a step of contacting plasmepsin V with a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof. In one embodiment, the plasmepsin V is derived from a Plasmodium spp., especially from P. falciparum or P. vivax. The plasmepsin V may be located in vitro or in vivo. The method may include screening of compound libraries to identify compounds that bind to plasmepsin V, and to experiments to investigate the physiology or pharmacology of plasmepsin V.
As used herein, the terms “treatment” and “prevention” are to be considered in their broadest contexts. For example, the term “treatment” does not necessarily imply that a subject is treated until full recovery. The term “treatment” includes amelioration of the symptoms of a disease or condition, or reducing the severity of a disease or condition. Similarly, “prevention” does not necessarily imply that a subject will never contract a disease or condition. “Prevention” may be considered as reducing the likelihood of onset of a disease or condition, or preventing or otherwise reducing the risk of developing a disease or condition.
An “effective amount” of a compound of the invention including a compound of Formula I or II, a peptide or mimetic of the invention or a pharmaceutically acceptable salt thereof means an amount necessary to at least partly attain a desired response, or to delay the onset or progression of the disease or condition being treated. The amount may vary depending on factors such as: the health and physical condition of the individual to whom the compound is administered, the taxonomic group of the individual to whom the compound is administered, the extent of treatment/prevention desired, the formulation of the composition, and the assessment of the medical situation. It is expected that the “effective amount” will fall within a broad range that can be determined through routine trials.
As used herein, the term “subject” may include mammals, especially humans, primates, livestock animals, laboratory test animals, companion animals and wild animals (whether captive or free). Livestock animals may include sheep, cattle, pigs, horses and donkeys. Laboratory test animals may include mice, rabbits, rats, pigs, and guinea pigs. Companion animals may include dogs and cats. In one embodiment, the subject is a human.
Throughout this specification, preferred aspects and embodiments apply, as appropriate, separately, or in combination, to other aspects and embodiments, mutatis mutandis, whether or not explicitly stated as such.
The present invention will now be described further with reference to the following examples, which are illustrative only and non-limiting.
Various embodiments of the invention will be described with reference to the following drawings. Some of the figures contain colour representations or entities. Coloured versions of the figures are available from the applicant upon request or from an appropriate patent office.
SEQ ID NO: 1—Amino acid sequence of plasmepsin V from P. falciparum.
SEQ ID NO: 2—Amino acid sequence of plasmepsin V from P. vivax.
SEQ ID NO: 3—Nucleotide sequence encoding recombinant P. vivax plasmepsin V with an N-terminal fusion tag comprising a FLAG tag, a SUMO domain and a TEV protease cleavage site.
SEQ ID NO: 4—Nucleotide sequence encoding recombinant P. vivax plasmepsin V including KpnI and XhoI restriction sites for insertion into pTriex2 expression vector.
SEQ ID NO: 5—Amino acid sequence of expressed recombinant P. vivax plasmepsin V with the uncleaved N-terminal fusion tag comprising a FLAG tag, a SUMO domain and a TEV protease cleavage site.
SEQ ID NO: 6—Amino acid sequence of recombinant P. vivax plasmepsin V following cleavage of the N-terminal fusion tag comprising a FLAG tag and a SUMO domain at the TEV cleavage site (the “plasmepsin V polypeptide chain”).
SEQ ID NO: 7—Amino acid sequence of a peptide from knob-associated histidine-rich protein (“KAHRP”) containing the PEXEL sequence (RTLAQ).
SEQ ID NO: 8—Amino acid sequence of a variant KAHRP peptide containing a mutated PEXEL sequence (KTLAQ).
SEQ ID NO: 9—Amino acid sequence of another variant KAHRP peptide containing a mutated PEXEL sequence (RTIAQ).
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art (e.g. in molecular biology, biochemistry, structural biology, and/or computational biology). Standard techniques are used for molecular and biochemical methods (see generally, Sambrook et al., 2001, Ausubel et al. 1999 and Green and Sambrook, 2012, which are incorporated herein by reference) and chemical methods.
In the present specification and claims, the word ‘comprising’ and its derivatives including ‘comprises’ and ‘comprise’ include each of the stated integers but does not exclude the inclusion of one or more further integers.
Reference throughout this specification to ‘one embodiment’ or ‘an embodiment’ means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearance of the phrases ‘in one embodiment’ or ‘in an embodiment’ in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more combinations.
As used herein the term “homologue” means a protein having at least 30% amino acid sequence identity with plasmepsin V and/or portions thereof. Preferably, the percentage identity is 40% or 50%, more preferably 60% or 70% and most preferably 80% or 90%. A 95% or above identity is most particularly preferred such as 95%, 96%, 97%, 98%, 99% or 100%.
As used herein the term “derivative” means plasmepsin V that displays the biological activity of wild-type plasmepsin V, characterised by the replacement of at least one amino acid from the wild-type sequence or the modification of one or more of the naturally-occurring amino acids.
Crystals and Crystal Structure of WEHI-842 in Complex with Plasmepsin V
The present invention provides a crystal comprising a P. vivax plasmepsin V construct (SEQ ID NO: 6) in complex with WEHI-842 (see
The crystal structure of P. vivax plasmepsin V complexed with WEHI-842 is presented in
WEHI-842 is seen to interact with the substrate-binding site and the flap of plasmepsin V. The details of these interactions are presented in
The structure also reveals two features that are not present in aspartyl proteases, namely a NAP1 insert consisting of 17 amino acids, including four cysteine residues, and a helix-turn-helix motif. The latter motif is posited to likely be important in plasmepsin V's function in export after cleavage of effector substrates.
The NAP1 insert consisting of 17 amino acids including four cysteine residues is located in the N-terminal subdomain and has been found to be similar to that found in plant aspartic proteases such as nepenthesin 1. The insert in nepenthesin 1 has been named the ‘nepenthesin 1-type aspartyl protease (NAP1) fold (Athauda S. B., et al. 2004) and is considered to play a role in functional regulation of napenthesin 1 (Athauda S. B., et al. 2004).
The helix-turn-helix motif consists of 43 amino acids and appears to be unique to plasmepsin V including orthologous proteases of Plasmodium spp.
The b-factor putty schematic shown in
The loop region between R241 to E272 (according to SEQ ID NO:2) in P. vivax plasmepsin V had poor electron density and its structure could not be determined. Nevertheless, the structure for P. vivax plasmepsin V indicates that these residues would be located on an opposite side of the molecule to the substrate-binding site and presumably would not interact with effector proteins. Plasmepsin V from other Plasmodium spp. also contained regions of similar size at this location and this loop is observed to have poor conservation of sequence between species (see
The enzyme domain of P. vivax plasmepsin V contains fifteen cysteine residues forming seven disulfide linkages, with four of these located in the N-terminal subdomain and three in the C-terminal subdomain (see
A C1-C8 disulfide-linkage spans the N-terminal subdomain; nestled within this region is a conserved pepsin-like C2-C3 disulfide bond and the NAP1 insert consisting of 17 amino acids (Athauda, S. B., et al. 2004) including four cysteine residues with disulfide linkages between C4-C6 and C5-C7. The P. vivax plasmepsin V NAP1 insert linkage pattern differs to the C4-C7 and C5-C6 architecture previously predicted using sequence alignments and known structures of pepsin-like aspartic proteases (Kay J., et al. 2011; Athauda, S. B., et al. 2004). The unpaired cysteine residue (C7a) located adjacent to the archetypal tyrosine in the flap above the substrate-binding site in the structure was not observed in other Apicomplexans or in closely related plant/fungal aspartic proteases and appears to be unique to Plasmodium spp. (see
In the C-terminal subdomain the C9-C14 linkage stabilises the orientation of the substrate-binding site by securing the six-stranded interdomain β-sheet (seated behind the binding site) to the region of the polypeptide chain leading toward the membrane anchor point of the protein. The C10-C11 linkage in P. vivax plasmepsin V tethers each end of a helix-turn-helix motif to the structure. This structural element is present in Plasmodium orthologues but not in other Apicomplexa, plant or fungal homologues of plasmepsin V or other plasmepsin family proteases, where it is usually replaced by a small, unstructured loop, which may be tethered by the C10-C11 linkage. The C12-C13 linkage found in P. vivax plasmepsin V was also observed in pepsin-like enzymes and appears to be conserved in other Apicomplexan homologues of plasmepsin V. Previous studies with Phytophthora spp. have suggested that similarly located cysteine residues may be involved in intermolecular disulfide bonds (Kay J., et al. 2011). However, sequence alignments within the same studies reveal these residues to be positioned such that they could potentially participate in the C12-C13 linkage found in P. vivax plasmepsin V.
A sequence and secondary structural alignment between P. vivax plasmepsin V and P. falciparum plasmepsin II, another member of the plasmepsin family that is involved in haemoglobin digestion, shows low sequence homology but a high level of preservation of secondary structural elements throughout the enzyme domains (see
In P. vivax plasmepsin V the NAP1 insert forms a surface loop (Tyr116-Gly121), (coloured green in
The helix-turn-helix motif is another key feature of P. vivax plasmepsin V and is conserved only within orthologues of Plasmodium spp. (see
As shown in
The oxo-guanidinium ion of WEHI-842 lies deep within the S3 pocket of plasmepsin V and participates in multiple interactions that anchor it to the substrate-binding site. The carboxylic acid moiety of Glu141 of the flap forms a “side-on” salt bridge with the guanidinium ion while the carbonyl group on the Gln183 side chain interacts with two hydrogen atoms located at the distal end of the same ion (see
The P1 Leu in the PEXEL has been shown to be important for binding affinity of plasmepsin V substrates and inhibitors (Sleebs, B. E., et al. 2014a; Sleebs, B. E., et al. 2014b; Boddey, J. A., et al. 2013). The structure reveals that this residue occupies the S1 pocket and is tightly encapsulated within the hydrophobic environment created by the juxtaposition of residues Ile78, Tyr139 and Val188 (see
For orthologues of Plasmodium spp. aligned, the residues lining the surface of the substrate binding site are essentially identical except for two alternative residues observed in the S5 position for P. falciparum, P. vinckei, and P. berghei plasmepsin V (not shown). Such high levels of conservation indicate that high affinity inhibitors identified for plasmepsin V should be effective against most if not all Plasmodium spp. and suggests animal models can be utilised for in vivo kinetic studies. The cavity surface for the orthologues from other Apicomplexa and nepenthesin 1 show variations in sequence around some of the key interactive residues and various substrate binding pockets throughout the substrate-binding site, suggesting inhibitors optimised for use against Plasmodium spp. may not be as active against other related aspartyl proteases.
Analysis of the crystal structure reveals that not all cavity space has been utilised in this complex. The ability for naturally occurring peptides to efficiently fill this space may be limited, whereas non-peptide-based inhibitors may offer greater scope in geometry and physico-chemical properties that could lead to improved affinities for plasmepsin V. For example, the P2 position on WEHI-842 has been found optimal for Val but Ile and Leu can be tolerated in this position with minimum changes to affinity. It is apparent that the S2 pocket is only partially filled by the Val residue of WEHI-842. Furthermore, the main cavity of the P3 pocket is largely filled by the canavanine side chain, however, a smaller yet equally deep cavity also exists on the floor of this pocket that is not utilised by this inhibitor. Finally, the pockets on the edge of the substrate binding cavity, such as S1′, S2′, S4 and S5 may offer additional opportunities to improve the affinities of future inhibitors to plasmepsin V.
As used herein, the term “crystal” means a structure (such as a three-dimensional (3D) solid aggregate) in which the plane faces intersect at definite angles and in which there is a regular structure (such as an internal structure) of the constituent chemical species. The term “crystal” refers in particular to a solid physical crystal form such as an experimentally prepared crystal.
Crystals according to the invention may be prepared using any plasmepsin V polypeptide containing the enzyme domain, including the N- and C-terminal subdomains, and lacking the C-terminal membrane anchor point (SEQ ID NOs: 2, 5 and 6). Typically, the plasmepsin V polypeptide (SEQ ID NO: 6) comprises residues 35 to 476, according to SEQ ID NO: 2, or the equivalent thereof together with any post-translational modifications of these residues such as N- or O-linked glycosylation.
In a preferred embodiment, the plasmepsin V polypeptide is from P. vivax (SEQ ID NOs: 2, 5 and 6). However, the plasmepsin V polypeptide may be obtained from other species, such as, e.g., P. falciparum (SEQ ID NO: 1).
Crystals may be constructed with wild-type plasmepsin V polypeptide sequence or variants thereof, including allelic variants and naturally occurring mutations as well as genetically engineered variants. Typically, variants have at least 95% or 98% sequence identity with a corresponding wild-type plasmepsin V polypeptide.
Crystals according to the invention may preferably be prepared using inhibitor WEHI-842 (see
Optionally, the crystal of plasmepsin V in complex with WEHI-842 may comprise one or more compounds which bind to plasmepsin V and/or WEHI-842, or otherwise are soaked into the crystal or co-crystallised with plasmepsin V and/or WEHI-842. Such compounds include ligands or small molecules, which may be candidate pharmaceutical agents intended to modulate the interaction between plasmepsin V and biological substrates.
The production of crystals of plasmepsin V in complex with WEHI-842 is described below.
In a preferred embodiment, a crystal of plasmepsin V in complex with WEHI-842 of the invention has the atomic coordinates as set forth in Appendix I.
As used herein, the term “atomic coordinates” or “set of coordinates” refers to a set of values which define the position of one or more atoms with reference to a system of axes. It will be understood by those skilled in the art that the atomic coordinates may be varied, without affecting significantly the accuracy of models derived therefrom. Thus, although the invention provides a very precise definition of a preferred atomic structure, it will be understood that minor variations are envisaged and the claims are intended to encompass such variations.
It will be understood that any reference herein to the atomic coordinates or subset of the atomic coordinates shown in Appendix I shall include, unless specified otherwise, atomic coordinates having a root mean square deviation of backbone atoms of not more than 1.5 Å, preferably not more than 1 Å, when superimposed on the corresponding backbone atoms described by the atomic coordinates shown in Appendix I.
The following defines what is intended by the term “root mean square deviation (‘RMSD’)” between two data sets. For each element in the first data set, its deviation from the corresponding item in the second data set is computed. The squared deviation is the square of that deviation, and the mean squared deviation is the mean of all these squared deviations. The root mean square deviation is the square root of the mean squared deviation.
Preferred variants are those in which the RMSD of the x, y and z coordinates for all backbone atoms other than hydrogen is less than 1.5 Å (preferably less than 1 Å, 0.7 Å or less than 0.3 Å) compared with the coordinates given in Appendix I. It will be readily appreciated by those skilled in the art that a 3D rigid body rotation and/or translation of the atomic coordinates does not alter the structure of the molecule concerned.
In a highly preferred embodiment, the crystal has the atomic coordinates as shown in Appendix I.
The present invention also provides a crystal structure of the substrate-binding site of plasmepsin V comprising the N- and C-terminal subdomains of plasmepsin V, or regions or parts thereof.
The atomic coordinates obtained experimentally for: amino acids 44 to 240 and 273 to 470 of P. vivax plasmepsin V and WEHI-842 are shown in Appendix I. However, a person skilled in the art will appreciate that a set of atomic coordinates determined by X-ray crystallography is not without standard error. Accordingly, any set of structure coordinates for plasmepsin V, optionally in complex with WEHI-842, that has a root mean square deviation of protein backbone atoms of less than 0.75 Å when superimposed (using backbone atoms) on the atomic coordinates listed in Appendix I shall be considered identical.
A structure that “substantially conforms” to a given set of atomic coordinates is a structure wherein at least about 50% of such structure has an RMSD of less than about 1.5 Å for the backbone atoms in secondary structure elements in each domain, and more preferably, less than about 1.3 Å for the backbone atoms in secondary structure elements in each domain, and, in increasing preference, less than about 1.0 Å, less than about 0.7 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å for the backbone atoms in secondary structure elements in each domain.
In a more preferred embodiment, a structure that substantially conforms to a given set of atomic coordinates is a structure wherein at least about 75% of such structure has the recited RMSD value, and more preferably, at least about 90% of such structure has the recited RMSD value, and most preferably, about 100% of such structure has the recited RMSD value.
In an even more preferred embodiment, the above definition of “substantially conforms” can be extended to include atoms of amino acid side chains. As used herein, the phrase “common amino acid side chains” refers to amino acid side chains that are common to both the structure which substantially conforms to a given set of atomic coordinates and the structure that is actually represented by such atomic coordinates.
As used herein, the term “enzyme domain” refers to the core enzymatic aspartyl protease fold of plasmepsin V lacking the C-terminal membrane anchor, and typically comprising residues 35 to 470 of P. vivax plasmepsin V as given in SEQ ID No: 2.
As used herein, the term “N-terminal subdomain” refers an N-terminal region of plasmepsin V that forms a part of the enzyme domain and which together with the C-terminal subdomain and the six-stranded interdomain β-sheet define the substrate-binding site of plasmepsin V.
As used herein, the term “C-terminal subdomain” refers to a C-terminal region of plasmepsin V that forms a part of the enzyme domain and which together with the N-terminal subdomain and the six-stranded interdomain β-sheet define the substrate-binding site of plasmepsin V.
As used herein, the term “six-stranded interdomain β-sheet” refers to an N-terminal region of the plasmepsin V polypeptide and a C-terminal region of the plasmepsin V polypeptide located before the absent membrane anchor that are assembled into a six-stranded interdomain 3-sheet structural motif, which serves to anchor the N- and C-terminal subdomains together.
As used herein, the term “substrate-binding site” for plasmepsin V means the regions of plasmepsin V involved in binding a substrate or inhibitor (also known as a “binding cleft”). The substrate-binding site is formed between the N- and C-terminal subdomains, which are together anchored to the six-stranded interdomain β-sheet to form a crescent-shaped enzyme domain. The substrate-binding site contains the catalytic dyad, Asp80 and Asp313 as given in SEQ ID NO: 2.
As used herein, the term the “flap” refers to a β-hairpin structure in the N-terminal subdomain of plasmepsin V previously described in the literature on aspartyl proteases as interacting with substrate or inhibitors in the substrate-binding site (Baldwin, E. T., et al. 1993), and comprising amino acids 139 to 142 as given in SEQ ID No: 2.
As used herein, the term “NAP1 insert” for plasmepsin V refers to a sequence insert comprising residues 116 to 132 as given in SEQ ID: No 2, which form a surface loop in the N-terminal subdomain.
As used herein, the term “helix-turn-helix motif” refers in plasmepsin V to a structural motif including two c-helices joined by a short loop and located in the C-terminal subdomain. The helix-turn-helix motif comprises amino acids 338 to 381 as given in SEQ ID No: 2.
It will be appreciated that a set of atomic coordinates for one or more polypeptides is a relative set of points that define a shape in three dimensions. Thus, it is possible that an entirely different set of coordinates could define a similar or identical shape. Moreover, slight variations in the individual coordinates will have little effect on overall shape.
The variations in coordinates may be generated due to mathematical manipulations of the atomic coordinates. For example, the atomic coordinates set forth in Appendix I could be manipulated by crystallographic permutations of the atomic coordinates, fractionalisation of the atomic coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the atomic coordinates, or any combination thereof.
Alternatively, modification in the crystal structure due to mutations, additions, substitutions, and/or deletions of amino acids, or other changes in any of the components that make up the crystal could also account for variations in atomic coordinates.
Various computational analyses are used to determine whether a molecular complex or a portion thereof is sufficiently similar to all or parts of the structure of plasmepsin V in complex with WEHI-842 described above. Such analyses may be carried out in current software applications, such as the Sequoia program (Bruns et al., 1999).
The Molecular Similarity program permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure.
Comparisons typically involve calculation of the optimum translations and rotations required such that the root mean square deviation of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number is given in Angstroms (“Å”).
Accordingly, atomic coordinates of plasmepsin V comprising a substrate-binding site in complex with WEHI-842 of the present invention include atomic coordinates related to the atomic coordinates listed in Appendix I by whole body translations and/or rotations. Accordingly, RMSD values listed above assume that at least the backbone atoms of the structures are optimally superimposed which may require translation and/or rotation to achieve the required optimal fit from which to calculate the RMSD value.
A three dimensional structure of a plasmepsin V polypeptide or a region thereof and/or a three dimensional structure of WEHI-842 or a region or portion thereof which substantially conforms to a specified set of atomic coordinates can be modelled by a suitable modelling computer program such as MODELLER (Sali & Blundell, 1993), using information, for example, derived from the following data: (1) the amino acid sequence of plasmepsin V and/or the sequence of WEHI-842; (2) the amino acid sequence of the related portion(s) of the protein represented by the specified set of atomic coordinates having a three dimensional configuration; and (3) the atomic coordinates of the specified three dimensional configuration. A three dimensional structure of plasmepsin V and/or a three dimensional structure of WEHI-842 which substantially conforms to a specified set of atomic coordinates can also be calculated by a method such as molecular replacement, which is described in detail below.
Atomic coordinates are typically loaded onto a machine-readable medium for subsequent computational manipulation. Thus models and/or atomic coordinates are advantageously stored on machine-readable media, such as magnetic or optical media and random-access or read-only memory, including tapes, diskettes, hard disks, CD-ROMs and DVDs, flash memory cards or chips, servers and the internet. The machine is typically a computer.
The atomic coordinates may be used in a computer to generate a representation, e.g. an image of the three-dimensional structure of plasmepsin V in complex with WEHI-842 which can be displayed by the computer and/or represented in an electronic file.
The atomic coordinates and models derived therefrom may also be used for a variety of purposes such as drug discovery, biological reagent (binding protein) selection and X-ray crystallographic analysis of other protein crystals.
The structure coordinates of plasmepsin V in complex with WEHI-842, such as those set forth in Appendix I, or a subset thereof, can also be used for determining the three-dimensional structure of a molecular complex which contains at least N- and/or C-terminal regions of plasmepsin V. In particular, structural information about another crystallised molecular complex may be obtained. This may be achieved by any of a number of well-known techniques, including molecular replacement.
Methods of molecular replacement are generally known by those of skill in the art (generally described in Brunger, 1997; Navaza & Saludjian, 1997; Tong & Rossmann, 1997; Bentley, 1997; Lattman, 1985; Rossmann, 1972; McCoy, 2007).
Generally, molecular replacement involves the following steps. X-ray diffraction data are collected from the crystal of a crystallised target structure. The X-ray diffraction data are transformed to calculate a Patterson function. The Patterson function of the crystallised target structure is compared with a Patterson function calculated from a known structure (referred to herein as a search structure). The Patterson function of the search structure is rotated on the target structure Patterson function to determine the correct orientation of the search structure in the crystal. A translation function is then calculated to determine the location of the search structure with respect to the crystal axes. Once the search structure has been correctly positioned in the unit cell, initial phases for the experimental data can be calculated. These phases are necessary for calculation of an electron density map from which structural differences can be observed and for refinement of the structure. Preferably, the structural features (e.g., amino acid sequence, conserved di-sulphide bonds, and beta-strands or beta-sheets) of the search molecule are related to the crystallised target structure.
The electron density map can, in turn, be subjected to any well-known model building and structure refinement techniques to provide a final, accurate structure of the unknown (i.e., target) crystallised molecular complex (e.g. see Jones et al., 1991; Brünger et al., 1998).
Obtaining accurate values for the phases, by methods other than molecular replacement, is a time-consuming process that involves iterative cycles of approximations and refinements and greatly hinders the solution of crystal structures. However, when the crystal structure of a protein containing at least a homologous portion has been solved, the phases from the known structure provide a satisfactory starting estimate of the phases for the unknown structure.
By using molecular replacement, all or part of the structure coordinates of plasmepsin V in complex with WEHI-842 provided herein (and set forth in Appendix I) can be used to determine the structure of a crystallised molecular complex whose structure is unknown more rapidly and more efficiently than attempting to determine such information ab initio. This method is especially useful in determining the structure of plasmepsin V.
The structure of any portion of any crystallised molecular complex that is sufficiently homologous to any portion of P. vivax plasmepsin V can be solved by this method, such as, e.g., P. falciparum plasmepsin V.
Such structure coordinates are also particularly useful to solve the structure of crystals of plasmepsin V co-complexed with a variety of molecules, such as chemical entities. For example, this approach enables the determination of the optimal sites for the interaction between chemical entities, and the interaction of candidate plasmepsin V inhibitors.
All of the complexes referred to above may be studied using well-known X-ray diffraction techniques and may be refined against 0.8-3.5 Å resolution X-ray data to an R value of about 0.25 or less using computer software, such as X-PLOR (Yale University, distributed by Molecular Simulations, Inc.; see Brünger, 1996) or Phenix (Adams, P. D., et al. 2010), for example. This information may thus be used to optimize known inhibitors, and more importantly, to design new or improved plasmepsin V inhibitors.
The three-dimensional structure of the substrate-binding site and the flap of plasmepsin V in complex with WEHI-842 provided by the present invention (Appendix I) can be used to identify potential target binding sites in the substrate-binding site and/or the flap of P. vivax plasmepsin V and/or P. falciparum plasmepsin V (i.e., to identify those regions of the substrate-binding site and/or the flap of P. vivax plasmepsin V and/or P. falciparum plasmepsin V involved in and important to the binding of WEHI-842) as well as in methods for identifying and/or designing other compounds which can interact with the substrate-binding site and/or the flap of P. vivax plasmepsin V and/or P. falciparum plasmepsin V, e.g., potential inhibitors of P. vivax plasmepsin V and/or P. falciparum plasmepsin V.
In one embodiment, the target binding site may comprise portions of the molecular surface of the substrate-binding site and the flap. In a preferred embodiment, the target binding site may comprise one or more residues from residues 44 to 240 and/or amino acids 273 to 470 of P. vivax plasmepsin V as given in SEQ ID NO: 2. In a more preferred embodiment, the target binding site includes one or more residues selected from the group consisting of Tyr59, Ala60, Ile78, Asp80, Gly82, Tyr139, Cys140, Glu141, Phe180, Gln183, Val188, Asp313, Gly315 and Thr317 of P. vivax plasmepsin V as given in SEQ ID NO: 2.
Design, Selection, Fitting and Assessment of Chemical Entities that Bind Plasmepsin V
Using a variety of known modelling techniques, the crystal structure of the present invention can be used to produce a model of one or more regions of the structure shown to interact with WEHI-842.
As used herein, the term “modelling” includes the quantitative and qualitative analysis of molecular structure and/or function based on atomic structural information and interaction models. The term “modelling” includes conventional numeric-based molecular dynamic and energy minimisation models, interactive computer graphic models, modified molecular mechanics models, distance geometry and other structure-based constraint models.
Molecular modelling techniques can be applied to the atomic coordinates of plasmepsin V in complex with WEHI-842, or at least parts, or regions thereof to derive a range of 3D models and to investigate the structure of the substrate-binding sites, the flap and any other bindings sites, such as the binding sites of monoclonal antibodies, non-immunoglobulin binding proteins and inhibitory peptides.
These techniques may also be used to screen for or design small and large chemical entities which are capable of binding to plasmepsin V, preferably within the substrate-binding site, to, for example, inhibit or at least reduce cleavage of the PEXEL motif of effector proteins to inhibit or at least reduce the activity of P. falciparum and/or P. vivax plasmepsin V, inhibit or at least reduce protein export and optionally be lethal to P. falciparum and/or P. vivax growth.
Such modelling methods are to design or select chemical entities that possess stereochemical complementary to the substrate-binding site of P. falciparum and/or P. vivax plasmepsin V and/or to at least a portion of the flap of P. falciparum and/or P. vivax plasmepsin V with which WEHI-842 interact. By “stereochemical complementarity” it is meant that the compound or a portion thereof makes a sufficient number of energetically favourable contacts with the substrate-binding site and/or the flap so as to have a net reduction of free energy on binding to the substrate binding site and/or the flap.
Such modelling methods may also be used to design or select chemical entities that possess stereochemical similarity to the substrate-binding site surface of WEHI-842 and/or to the portions of WEHI-842 that interact with at least a portion of the flap of P. falciparum and/or P. vivax plasmepsin V. By “stereochemical similarity” it is meant that the compound or portion thereof makes about the same number of energetically favourable contacts with plasmepsin V as WEHI-842 makes as determined by the crystal structure of WEHI-842 in complex with plasmepsin V as set out by the coordinates shown in Appendix I.
Stereochemical complementarity is characteristic of a molecule that matches intra-site surface residues lining the substrate-binding site and in the flap as enumerated by the coordinates set out in Appendix I or a subset thereof. By “match” it is meant that the identified portions interact with the surface residues, for example, via hydrogen bonding or by non-covalent Van der Waals and Coulomb interactions (with surface or residue) which promote desolvation of the molecule within the site, in such a way that retention of the molecule at the binding site is favoured energetically.
It is preferred that the stereochemical complementarity is such that the compound has a KD for the substrate-binding site and/or the flap of less than 10−5M, more preferably less than 10−6M and yet more preferably 10−7M. In a most preferred embodiment, the KD value is less than 10−8 M or better yet less than 10−9 M.
Chemical entities which are complementary to the shape and electrostatics or chemistry of the substrate-binding site and/or the flap characterised by amino acids positioned at atomic coordinates set out in Appendix I will be able to bind to the substrate-binding site and/or the flap, and when the binding is sufficiently strong, substantially inhibit or at least reduce the interaction of P. falciparum and/or P. vivax plasmepsin V with biological target molecules, such as effector proteins with the PEXEL motif.
It will be appreciated that it is not necessary that the complementarity between chemical entities and the substrate-binding site and/or the flap or similarity between the chemical entities and biological target molecules such as effector proteins with the PEXEL motif need extend over all residues of the substrate-binding site and/or the flap or the target molecule in order to inhibit or mimic binding of a molecule or complex that naturally interacts with plasmepsin V.
A number of methods may be used to identify chemical entities possessing stereochemical complementarity to the substrate-binding site of P. falciparum and/or P. vivax plasmepsin V and/or to at least a portion of the flap of P. falciparum and/or P. vivax plasmepsin V with which WEHI-842 interacts. For instance, the process may begin by visual inspection of the entire P. vivax substrate-binding site, or the equivalent region in P. falciparum plasmepsin V, on the computer screen based on the coordinates in Appendix I generated from the machine-readable storage medium. Alternatively, selected fragments or chemical entities may then be positioned in a variety of orientations, or docked, within the substrate-binding site of P. falciparum and/or P. vivax plasmepsin V or relative to the at least a portion of the flap of P. falciparum and/or P. vivax plasmepsin V with which WEHI-842 interacts in a manner similar to WEHI-842 and plasmepsin V as defined above.
Modelling software that is well known and available in the art may be used (Guida, 1994). These include Discovery Studio (Accelrys Software Inc., San Diego), SYBYL (Tripos Associates, Inc., St. Louis, Mo., 1992), Maestro (Schrödinger LLC, Portland), MOE (Chemical Computing Group Inc., Montreal, Canada). This modelling step may be followed by energy minimization with standard molecular mechanics force fields such as AMBER (Weiner et al., 1984), OPLS (Jorgensen and Tirado-Rives, 1988) and CHARMM (Brooks et al., 1983). In addition, there are a number of more specialized computer programs to assist in the process of selecting the binding moieties of this invention.
Specialised computer programs may also assist in the process of selecting fragments or chemical entities. These include, inter alia:
Once suitable chemical entities or fragments have been selected, they can be assembled into a single compound. In one embodiment, assembly may proceed by visual inspection of the relationship of the fragments to each other on the three-dimensional image displayed on a computer screen in relation to the structure coordinates of the substrate-binding site of P. vivax plasmepsin V and/or to at least a portion of the flap of P. vivax plasmepsin V with which WEHI-842 binds. This is followed by manual model building using software such as Discovery Studio, Maestro, MOE or Sybyl. Alternatively, fragments may be joined to additional atoms using standard chemical geometry.
The above-described evaluation process for chemical entities may be performed in a similar fashion for chemical compounds.
Useful programs to aid one of skilled in the art in connecting the individual chemical entities or fragments include:
Other molecular modelling techniques may also be employed in accordance with this invention, see, e.g., Cohen et al. (1990) and Navia & Murcko (1992).
There are two preferred approaches to designing a molecule according to the present invention that complement the stereochemistry of the substrate-binding site of plasmepsin V and/or to at least a portion of the flap of plasmepsin V with which WEHI-842 binds. The first approach is to in silico directly dock molecules from a three-dimensional structural database, to the target binding site, using mostly, but not exclusively, geometric criteria to assess the goodness-of-fit of a particular molecule to the site. In this approach, the number of internal degrees of freedom (and the corresponding local minima in the molecular conformation space) is reduced by considering only the geometric (hard-sphere) interactions of two rigid bodies, where one body (the active site) contains “pockets” or “grooves” that form binding sites for the second body (the complementing molecule).
Flexibility of plasmepsin V, can be incorporated into the in silico screening by the application of multiple conformations of plasmepsin V. The multiple conformations of plasmepsin V can be generated from the coordinates listed in Appendix I or a subset thereof computationally by use of molecular dynamics simulation or similar approaches.
This approach is illustrated by Kuntz et al. (1982) and Ewing et al. (2001), the contents of which are hereby incorporated by reference, whose algorithm for ligand design is implemented in a commercial software package, DOCK version 4.0, distributed by the Regents of the University of California and further described in a document, provided by the distributor, which is entitled “Overview of the DOCK program suite” the contents of which are hereby incorporated by reference. Pursuant to the Kuntz algorithm, the shape of the cavity in which WEHI-842 fits may be defined as a series of overlapping spheres of different radii. One or more extant databases of crystallographic data, such as the Cambridge Structural Database System (The Cambridge Crystallographic Data Centre, Cambridge, U.K.), the Protein Data Bank maintained by the Research Collaboratory for Structural Bioinformatics (Rutgers University, N.J., U.S.A.), LeadQuest (Tripos Associates, Inc., St. Louis, Mo.), Available Chemicals Directory (Symyx Technologies Inc.), and the NCI database (National Cancer Institute, U.S.A) is then searched for molecules which approximate the shape thus defined.
Molecules identified on the basis of geometric parameters, can then be modified to satisfy criteria associated with chemical complementarity, such as hydrogen bonding, ionic interactions and van der Waals interactions. Different scoring functions can be employed to rank and select the best molecule from a database (see, e.g., Bohm & Stahl, 1999). The software package FlexX, marketed by Tripos Associates, Inc. (St. Louis, Mo.) is another program that can be used in this direct docking approach (see Rarey et al., 1996).
The second preferred approach entails an assessment of the interaction of respective chemical groups (“probes”) with the active site at sample positions within and around the site, resulting in an array of energy values from which three-dimensional contour surfaces at selected energy levels can be generated. The chemical-probe approach to ligand design is described, for example, by Goodford, (1985), the contents of which are hereby incorporated by reference, and is implemented in several commercial software packages, such as GRID (product of Molecular Discovery Ltd., Italy).
Pursuant to this approach, the chemical prerequisites for a site-complementing molecule are identified at the outset, by probing the active site with different chemical probes, e.g., water, a methyl group, an amine nitrogen, a carboxyl oxygen, or a hydroxyl. Favoured sites for interaction between the active site and each probe are thus determined, and from the resulting three-dimensional pattern of such sites a putative complementary molecule can be generated. This may be done either by programs that can search three-dimensional databases to identify molecules incorporating desired pharmacophore patterns or by programs which use the favoured sites and probes as input to perform de novo design. Suitable programs for determining and designing pharmacophores include CATALYST (Accelrys Software, Inc), and CERIUS2, DISCO (Abbott Laboratories, Abbott Park, Ill.; distributed by Tripos Associates Inc.).
The pharmacophore can be used to screen in silico compound libraries/three-dimensional databases, using a program such as CATALYST (Accelrys Software, Inc) and Sybyl/3DB Unity (Tripos Associates, Inc., St. Louis, Mo.).
Databases of chemical structures are available from a number of sources including Cambridge Crystallographic Data Centre (Cambridge, U.K.), Molecular Design, Ltd., (San Leandro, Calif.), Tripos Associates, Inc. (St. Louis, Mo.), Chemical Abstracts Service (Columbus, Ohio), the Available Chemical Directory (Symyx Technologies, Inc.), the Derwent World Drug Index (WDI), BioByteMasterFile, the National Cancer Institute database (NCI), Medchem Database (BioByte Corp.), and the Maybridge catalogue.
De novo design programs include LUDI (Accelrys Software Inc., San Diego, Calif.), Leapfrog (Tripos Associates, Inc.), and LigBuilder (Peking University, China).
Once an entity or compound has been designed or selected by the above methods, the efficiency with which that entity or compound may bind to plasmepsin V can be tested and optimised by computational evaluation. For example, a compound that has been designed or selected to function as plasmepsin V binding compound must also preferably traverse a volume not overlapping that occupied by the binding site when it is bound to native plasmepsin V. An effective plasmepsin V binding compound must preferably demonstrate a relatively small difference in energy between its bound and free states (i.e., a small deformation energy of binding). Thus, the most efficient plasmepsin V binding compound should preferably be designed with a deformation energy of binding of not greater than about 10 kcal/mole, preferably, not greater than 7 kcal/mole.
A compound designed or selected as binding to plasmepsin V may be further computationally optimised so that in its bound state it would preferably lack repulsive electrostatic interaction with the target protein.
Such non-complementary (e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole and charge-dipole interactions. Specifically, the sum of all electrostatic interactions between the compound and the protein when the compound is bound to plasmepsin V, preferably make a neutral or favourable contribution to the enthalpy of binding.
Once a plasmepsin V binding compound has been optimally selected or designed, as described above, substitutions may then be made in some of its atoms or side groups to improve or modify its binding properties. Generally, initial substitutions are conservative, i.e., the replacement group will have approximately the same size, shape, hydrophobicity and charge as the original group. It should, of course, be understood that components known in the art to alter conformation should be avoided. Such substituted chemical compounds may then be analysed for efficiency of fit to plasmepsin V by the same computer methods described in detail above.
Specific computer software is available in the art to evaluate compound deformation energy and electrostatic interaction. Examples of programs designed for such uses include: Gaussian 03, (Frisch, Gaussian, Inc., Pittsburgh, Pa.); GAMESS (Gordon et al., Iowa State University); Jaguar (Schrödinger LLC, Portland); AMBER, version 9.0 (Case et al, University of California at San Francisco); CHARMM (Accelrys Software, Inc., San Diego, Calif.); and GROMACS version 4.0 (van der Spoel et al.).
The screening/design methods may be implemented in hardware or software, or a combination of both. However, preferably, the methods are implemented in computer programs executing on programmable computers each comprising a processor, a data storage system (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. Program code is applied to input data to perform the functions described above and generate output information. The output information is applied to one or more output devices, in known fashion. The computer may be, for example, a personal computer, microcomputer, or workstation of conventional design.
Each program is preferably implemented in a high level procedural or object-oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be compiled or interpreted language.
Each such computer program is preferably stored on a storage medium or device (e.g., ROM or magnetic diskette) readable by a general or special purpose programmable computer, for configuring and operating the computer when the storage media or device is read by the computer to perform the procedures described herein. The system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner to perform the functions described herein.
Compounds of the present invention include those designed and/or identified using a screening method of the invention, those encompassed by the compounds of Formulas I and II described above and those which are capable of recognising and binding to the substrate-binding site of plasmepsin V and/or interacting with the flap of plasmepsin V as defined above.
Compounds capable of recognising and binding to the substrate-binding site of plasmepsin V and/or interacting with the flap of plasmepsin V may be produced using (i) a screening method based on use of the atomic coordinates corresponding to the 3D structure of the substrate-binding site and/or the flap in complex with WEHI-842; or (ii) a screening method based on the use of the atomic coordinates corresponding to the 3D structure of WEHI-842 in complex with plasmepsin V. Alternatively, compounds may be identified by screening against a specific target molecule which is indicative of the capacity to bind to the substrate-binding site and/or the flap of plasmepsin V.
The candidate compounds and/or compounds identified or designed using a method of the present invention may be any suitable compound, synthetic or naturally occurring, preferably synthetic. In one embodiment, a synthetic compound selected or designed by the methods of the invention preferably has a molecular weight equal to or less than about 5000, 4000, 3000, 2000, 1000 or 500 daltons. A compound of the present invention is preferably soluble under physiological conditions.
The compounds may encompass numerous chemical classes, though typically they are organic molecules, preferably small organic compounds having a molecular weight of more than 50 and less than about 2,500 daltons, preferably less than 1,500, more preferably less than 1,000 and yet more preferably less than 500. Such compounds can comprise functional groups necessary for structural interaction with proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The compound may comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or more of the above functional groups. Compounds can also comprise biomolecules including peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogues, or combinations thereof.
Compounds may include, for example: (1) peptides such as soluble peptides, including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et al., 1991; Houghten et al., 1991) and combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration amino acids; (2) phosphopeptides (e.g., members of random and partially degenerate, directed phosphopeptide libraries, see, e.g., Songyang et al., 1993); (3) antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, and single chain antibodies as well as Fab, (Fab)2′, Fab expression library and epitope-binding fragments of antibodies); (4) non-immunoglobulin binding proteins such as but not restricted to avimers, DARPins and lipocalins; (5) nucleic acid-based aptamers; and (6) small organic and inorganic molecules.
Ligands can be obtained from a wide variety of sources including libraries of synthetic or natural compounds. Synthetic compound libraries are commercially available from, for example, Maybridge Chemical Co. (Tintagel, Cornwall, UK), AMRI (Budapest, Hungary) and ChemDiv (San Diego, Calif.), Specs (Delft, The Netherlands).
Natural compound libraries comprising bacterial, fungal, plant or animal extracts are available from, for example, Pan Laboratories (Bothell, Wash.), TimTec (Newark, Del.). In addition, numerous means are available for random and directed synthesis of a wide variety of organic compounds and biomolecules, including expression of randomized oligonucleotides.
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts can be readily produced. Methods for the synthesis of molecular libraries are readily available (see, e.g., DeWitt et al., 1993; Erb et al., 1994; Zuckermann et al., 1994; Cho et al., 1993; Carell et al., 1994a; Carell et al., 1994b; and Gallop et al., 1994). In addition, natural or synthetic compound libraries and compounds can be readily modified through conventional chemical, physical and biochemical means (see, e.g., Blondelle and Houghton, 1996), and may be used to produce combinatorial libraries. In another approach, previously identified pharmacological agents can be subjected to directed or random chemical modifications, such as acylation, alkylation, esterification, amidification, and the analogues can be screened for plasmepsin V-modulating activity.
Numerous methods for producing combinatorial libraries are known in the art, including those involving biological libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the “one-bead one-compound” library method; and synthetic library methods using affinity chromatography selection. The biological library approach is limited to polypeptide or peptide libraries, while the other four approaches are applicable to polypeptide, peptide, non-peptide oligomer, or small molecule libraries of compounds (Lam, 1997).
Compounds also include those that may be synthesized from leads generated by fragment-based drug design, wherein the binding of such chemical fragments is assessed by soaking or co-crystallizing such screen fragments into crystals provided by the invention and then subjecting these to an X-ray beam and obtaining diffraction data. Difference Fourier techniques are readily applied by those skilled in the art to determine the location within plasmepsin V at which these fragments bind, and such fragments can then be assembled by synthetic chemistry into larger compounds with increased affinity for plasmepsin V.
Compounds identified or designed using the methods of the invention can be a peptide or a mimetic thereof.
The isolated peptides or mimetics of the invention may be conformationally constrained molecules or alternatively molecules which are not conformationally constrained such as, for example, non-constrained peptide sequences. The term “conformationally constrained molecules” means conformationally constrained peptides and conformationally constrained peptide analogues and derivatives.
The term “analogues” refers to molecules having a chemically analogous structure to naturally occurring a-amino acids. Examples include molecules containing gem-diaminoalkyl groups or alklylmalonyl groups.
The term “derivatives” includes a-amino acids wherein one or more side groups found in the naturally occurring a-amino acids have been modified. Thus, for example the amino acids may be replaced with a variety of uncoded or modified amino acids such as the corresponding D-amino acid or N-methyl amino acid. Other modifications include substitution of hydroxyl, thiol, amino and carboxyl functional groups with chemically similar groups.
With regard to peptides and mimetics thereof, other examples of other unnatural amino acids or chemical amino acid analogues/derivatives which can be introduced as a substitution or addition include, but are not limited to, 2,4-diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, C a-methyl amino acids, N a-methyl amino acids, and amino acid analogues in general.
The mimetic may be a peptidomimetic. A “peptidomimetic” is a molecule that mimics the biological activity of a peptide but is no longer peptidic in chemical nature. By strict definition, a peptidomimetic is a molecule that no longer contains any peptide bonds (i.e., amide bonds between amino acids). However, the term peptide mimetic is sometimes used to describe molecules that are no longer completely peptidic in nature, such as pseudo-peptides, semi-peptides and peptoids. Whether completely or partially non-peptide, peptidomimetics for use in the methods of the invention, and/or of the invention, provide a spatial arrangement of reactive chemical moieties that closely resembles the three-dimensional arrangement of active groups in the peptide on which the peptidomimetic is based. As a result of this similar active-site geometry, the peptidomimetic has effects on biological systems which are similar to the biological activity of the peptide.
There are sometimes advantages for using a mimetic of a given peptide rather than the peptide itself, because peptides commonly exhibit two undesirable properties: (1) poor bioavailability; and (2) short duration of action. Peptide mimetics offer an obvious route around these two major obstacles, since the molecules concerned are small enough to be both orally active and have a long duration of action. There are also considerable cost savings and improved patient compliance associated with peptide mimetics, since they can be administered orally compared with parenteral administration for peptides. Furthermore, peptide mimetics are generally cheaper to produce than peptides.
Suitable peptidomimetics based on WEHI-842 or a fragment thereof can be developed using readily available techniques. Thus, for example, peptide bonds, or in the case of WEHI-842, further peptide bonds can be replaced by non-peptide bonds that allow the peptidomimetic to adopt a similar structure, and therefore biological activity, to the original peptide. Further modifications can also be made by replacing chemical groups of the amino acids with other chemical groups of similar structure. The development of peptidomimetics derived from WEHI-842 or a fragment thereof can be aided by reference to the three dimensional structure of the inhibitor as provided in Appendix I. This structural information can be used to search three-dimensional databases to identify molecules having a similar structure, using programs such as Sybyl/3DB Unity (Tripos Associates, St. Louis, Mo.).
Those skilled in the art will recognize that the design of a peptidomimetic may require slight structural alteration or adjustment of a chemical structure designed or identified using the methods of the invention. In general, chemical compounds identified or designed using the methods of the invention can be synthesized chemically and then tested for ability to modulate and/or inhibit plasmepsin V activity using any of the methods described herein. The methods of the invention are particularly useful because they can be used to greatly decrease the number potential mimetics which must be screened for their ability to modulate and/or inhibit plasmepsin V activity.
The peptides or peptidomimetics of the present invention can be used in assays for screening for candidate compounds which bind to regions of plasmepsin V and potentially interfere with substrate binding within the substrate-binding site to, for example, inhibit or at least reduce cleavage of the PEXEL motif of effector proteins to inhibit or at least reduce the activity of P. falciparum and/or P. vivax plasmepsin V, inhibit or at least reduce protein export and optionally be lethal to P. falciparum and/or P. vivax growth. Peptides or peptidomimetics which mimic target binding sites are particularly useful as specific target molecules for identifying potentially useful ligands for plasmepsin V.
As used herein a “fragment” is a portion of a peptide of the invention which maintains a defined activity of the “full-length” peptide, namely the ability to bind to the substrate-binding site of plasmepsin V and/or interact with the flap of plasmepsin V. Fragments can be any size as long as they maintain the defined activity. Preferably, the fragment maintains at least 50%, more preferably at least 75%, of the activity of the full length polypeptide.
The % identity of a peptide is determined by GAP (Needleman and Wunsch, 1970) analysis (GCG program) with a gap creation penalty=5, and a gap extension penalty=0.3. The query sequence is at least 10 amino acids in length, and the GAP analysis aligns the two sequences over a region of at least 10 amino acids. More preferably, the GAP analysis aligns two sequences over their entire length.
With regard to a defined peptide, it will be appreciated that % identity figures higher than those provided above will encompass preferred embodiments. Thus, where applicable, in light of the minimum % identity figures, it is preferred that the peptide comprises an amino acid sequence which is at least 50%, more preferably at least 55%, more preferably at least 60%, more preferably at least 65%, more preferably at least 70%, more preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at least 91%, more preferably at least 92%, more preferably at least 93%, more preferably at least 94%, more preferably at least 95%, more preferably at least 96%, more preferably at least 97%, more preferably at least 98%, more preferably at least 99%, more preferably at least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more preferably at least 99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% identical to the relevant nominated SEQ ID NO.
Amino acid sequence mutants of the peptides identified or designed using the methods of the invention, and/or of the present invention, can be prepared by introducing appropriate nucleotide changes into a nucleic acid of the present invention, or by in vitro synthesis of the desired peptide. Such mutants include, for example, deletions, insertions or substitutions of residues within the amino acid sequence. A combination of deletion, insertion and substitution can be made to arrive at the final construct, provided that the final peptide product possesses the desired characteristics.
In designing amino acid sequence mutants, the location of the mutation site and the nature of the mutation will depend on characteristic(s) to be modified. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conservative amino acid choices and then with more radical selections depending upon the results achieved, (2) deleting the target residue, or (3) inserting other residues adjacent to the located site.
Substitution mutants have at least one amino acid residue in the peptide removed and a different residue inserted in its place. Sites of interest are those in which particular residues obtained from various strains or species are identical. These sites, especially those falling within a sequence of at least three other identically conserved sites, are preferably substituted in a relatively conservative manner. Such conservative substitutions are shown in Table 1 under the heading of “Exemplary Substitutions”.
In a preferred embodiment a mutant/variant peptide has one or two or three or four conservative amino acid changes when compared to a peptide defined herein. Details of conservative amino acid changes are provided above in Table 1.
Also included within the scope of the invention are peptides which are differentially modified during or after synthesis, e.g., by biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. These modifications may serve to increase the stability and/or bioactivity of the peptide.
With regard to redesigning compounds using a method of the invention, in an embodiment the compound is redesigned to be more structurally similar to the native effector proteins containing the targeted PEXEL motif.
Interaction of Compounds with Plasmepsin V
A compound may interact with the substrate-binding site of plasmepsin V and/or with the flap of plasmepsin V by binding either directly or indirectly to these regions. A compound which binds directly, binds to a specified region. A compound which binds indirectly, binds to a region in close proximity to or adjacent to the substrate-binding site of plasmepsin V and/or to the flap of plasmepsin V with the result that it interferes with the ability of plasmepsin V to bind to native effector proteins containing the targeted PEXEL motif, either antagonistically or agonistically. Such interference may be steric, electrostatic or allosteric. Preferably, a compound interacts with the substrate-binding site of plasmepsin V and/or with the flap of plasmepsin V by binding directly to one or both of the regions. In the case of compounds that bind to specific target molecules, such compounds bind directly to the specific target molecule.
Binding can be either by covalent or non-covalent interactions, or both. Examples of non-covalent interactions include electrostatic interactions, van der Waals interactions, hydrophobic interactions and hydrophilic interactions.
When a compound of the invention interacts with plasmepsin V, it preferably “modulates” plasmepsin V. By “modulate” we mean that the compound changes an activity of plasmepsin V by at least 10%. Suitably, a compound modulates plasmepsin V by reducing or inhibiting plasmepsin V activity. The ability of a candidate compound to reduce or inhibit plasmepsin V activity can be assessed by any one of the plasmepsin V assays described herein.
Compounds of the present invention preferably have an affinity for plasmepsin V sufficient to provide adequate binding for the intended purpose. Suitably, such compounds and compounds which bind to specific target molecules of plasmepsin V have an affinity (KD) of from 10−5 to 10−15 M. For use as a therapeutic, the compound suitably has an affinity (KD) of from 10−7 to 10−15 M, preferably from 10−8 to 10−12 M and more preferably from 10−10 to 10−12 M. Where a compound is to be used as a reagent in a competitive assay to identify other ligands, the compound suitably has an affinity (KD) of from 10−5 to 10−12 M.
As will be evident to the skilled person, the crystal structure presented herein has enabled, for the first time, direct visualisation of the regions binding WEHI-842 in plasmepsin V.
In one embodiment, a compound may have a high specificity for plasmepsin V and/or a specific target molecule of plasmepsin V but not for other aspartyl proteases, i.e., a compound selectively binds to plasmepsin V. In this respect, a compound suitably has an affinity (KD) for plasmepsin V and/or a specific target molecule of plasmepsin V of no more than 10−5 M, preferably no more than 10−7 M, and an affinity for other aspartyl proteases of at least 10−5 M, preferably at least 10−3 M. Such compounds are desirable as, for example, plasmepsin V inhibitors where a propensity to interact with other non-Plasmodium Spp. aspartyl proteases and thus, for example, promote undesirable outcomes, is reduced.
In a preferred embodiment, the plasmepsin V or specific target molecule of plasmepsin V/other aspartyl protease binding affinity ratio for a compound is at least 10, preferably at least 100, more preferably at least 1000.
Compounds of the invention may be subjected to further confirmation of binding to plasmepsin V by co-crystallization of the compound with plasmepsin V and structural determination as described herein.
Compounds designed or selected according to the methods of the present invention are preferably assessed by a number of in vitro and in vivo assays of plasmepsin V function to confirm their ability to interact with and modulate plasmepsin V activity. For example, compounds may be tested for their ability to bind to plasmepsin V and/or for their ability to modulate, e.g., disrupt plasmepsin V activity.
Libraries may be screened in solution by methods generally known in the art for determining whether ligands competitively bind at a common binding site. Such methods may include screening libraries in solution (e.g., Houghten, 1992), or on beads (Lam, 1991), chips (Fodor, 1993), bacteria or spores (U.S. Pat. No. 5,223,409), plasmids (Cull et al., 1992), or on phage (Scott & Smith, 1990; Devlin, 1990; Cwirla et al., 1990; Felici, 1991; U.S. Pat. No. 5,223,409).
Where the screening assay is a binding assay, PEXEL cleavage assays may be used. In such assays, plasmepsin V, potential binding molecules and a labelled PEXEL containing peptide substrate are incubated together, wherein binding efficiency is determined by a detectable signal with signal generated being directly proportional to protease activity. Various labels may be used including radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes, specific binding molecules, particles, e.g., magnetic particles, and the like.
A variety of other reagents may also be included in the screening assay. These include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which are used to facilitate optimal binding and/or reduce non-specific or background interactions. Reagents that improve the efficiency of the assay, such as nuclease inhibitors, antimicrobial agents, etc., may be used. The components are added in any order that produces the requisite binding. Incubations are performed at any temperature that facilitates optimal activity, typically between 4 and 40° C.
Direct binding of compounds to plasmepsin V can also be done by Surface Plasmon Resonance (BIAcore) (reviewed in Morton & Myszka, 1998). Here plasmepsin V may be immobilized on a CM5 or other sensor chip by either direct chemical coupling using amine or thiol-disulphide exchange coupling (Nice & Catimel, 1999) or by capturing plasmepsin V as an Fc fusion protein to an appropriately derivatized sensor surface (Morten & Myszka, 1998). The potential binding molecule (called an analyte) is passed over the sensor surface at an appropriate flow rate and a range of concentrations. The classical method of analysis is to collect responses for a wide range of analyte concentrations. A range of concentrations provides sufficient information about the reaction, and by using a fitting algorithm such as CLAMP (see Morton & Myszka, 1998), rate constants can be determined (Morton & Myszka, 1998; Nice & Catimel, 1999). Normally, the ligand surface is regenerated at the end of each analyte binding cycle. Surface regeneration ensures that the same number of ligand binding sites is accessible to the analyte at the beginning of each cycle.
Incubation periods are selected for optimum activity, but may also be optimized to facilitate rapid high-throughput screening. Normally, between 0.05 and 1 hour will be sufficient. In general, a plurality of assay mixtures is run in parallel with different test agent concentrations to obtain a differential response to these concentrations. Typically, one of these concentrations serves as a negative control, i.e. at zero concentration or below the level of detection.
To measure the efficiency of compounds in vivo in inhibiting plasmepsin V activity, pulse chase analysis may be used. In such analysis, the pulse may include P. falciparum and/or P. vivax-infected erythrocytes being cultured in medium containing potential binding molecules, labelled-substrate and a radiolabel. The chase may include chasing the export of labelled proteins to the erythrocyte by culturing the infected erythrocytes in label-free and inhibitor-free medium for various incubation periods before quantifying the amount of labelled protein exported by densitometry analysis.
Compounds/chemical entities designed or selected by the methods of the invention described above may be used to modulate plasmepsin V activity in cells, i.e., inhibit or at least reduce plasmepsin V activity. Such compounds may interact with the substrate-binding site and/or the flap of plasmepsin V as defined herein.
Given that P. falciparum and P. vivax infections are causative of malaria, the compounds described above may be used to treat, ameliorate or prevent malaria by modulating plasmepsin V activity, preferably inhibit plasmepsin V activity.
Compounds provided by this invention may also be useful as assay reagents for identifying other useful ligands by, for example, competition assays, as research tools for further analysis of plasmepsin V and as potential therapeutics in pharmaceutical compositions.
Compounds provided by this invention may also useful as lead compounds for identifying other more potent or selective compounds.
In one embodiment, one or more of the compounds may be provided as components in a kit for identifying other ligands (e.g., small, organic molecules) that bind to plasmepsin V. Such kits may also include plasmepsin V, or functional fragments thereof. The compound and plasmepsin V or components thereof of the kit may be labelled (e.g., by radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes or other labels), or may be unlabelled and labelling reagents may be provided. The kits may also contain peripheral reagents such as buffers, stabilizers, etc. Instructions for use may also be provided.
Compounds of the invention may preferably be combined with various components to produce compositions of the invention. Preferably the compositions are combined with a pharmaceutically acceptable carrier or diluent to produce a pharmaceutical composition (which may be for human or animal use).
The formulation will depend upon the nature of the compound and the route of administration but typically they can be formulated for topical, parenteral, intramuscular, oral, intravenous, intra-peritoneal, intranasal inhalation, lung inhalation, intradermal or intra-articular administration. The compound may be used in an injectable form. It may therefore be mixed with any vehicle which is pharmaceutically acceptable for an injectable formulation, preferably for a direct injection at the site to be treated, although it may be administered systemically.
The pharmaceutically acceptable carrier or diluent may be, for example, sterile isotonic saline solutions, or other isotonic solutions such as phosphate-buffered saline. The compounds of the present invention may be admixed with any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). It is also preferred to formulate the compound in an orally active form. Pharmaceutically acceptable carriers, diluents and excipients which can be used in the pharmaceutical compositions of the invention will be known to those of skill in the art. The British Pharmacopoeia (BP) and the United States Pharmacopeia and National Formulary (USP-NF) contain details of suitable carriers, diluents and excipients, as does Sweetman S (Ed.), ‘Martindale: The complete drug reference.’ London: Pharmaceutical Press, 37th Ed., (2011), and Rowe R C, Sheskey P J, Quinn M E (Ed.), ‘Handbook of Pharmaceutical Excipients’, 6th Ed., London: Pharmaceutical Press (2009), the contents of which are incorporated herein by cross reference.
In general, a therapeutically effective daily oral or intravenous dose of the compounds of the invention, including compounds of the invention and their salts, is likely to range from 0.01 to 50 mg/kg body weight of the subject to be treated, preferably 0.1 to 20 mg/kg. The compounds of the invention and their salts may also be administered by intravenous infusion, at a dose which is likely to range from 0.001-10 mg/kg/hr.
Tablets or capsules of the compounds may be administered singly or two or more at a time, as appropriate. It is also possible to administer the compounds in sustained release formulations.
Typically, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient. The above dosages are exemplary of the average case. There can, of course, be individual instances where higher or lower dosage ranges are merited, and such are within the scope of this invention.
For some applications, preferably the compositions are administered orally in the form of tablets containing excipients such as starch or lactose, or in capsules or ovules either alone or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing flavouring or colouring agents.
The compositions (as well as the compounds alone) can also be injected parenterally, for example intravenously, intramuscularly or subcutaneously. In this case, the compositions will comprise a suitable carrier or diluent.
For parenteral administration, the compositions are best used in the form of a sterile aqueous solution which may contain other substances, for example enough salts or monosaccharides to make the solution isotonic with blood.
For buccal or sublingual administration the compositions may be administered in the form of tablets or lozenges which can be formulated in a conventional manner.
For oral, parenteral, buccal and sublingual administration to subjects (such as patients), the daily dosage level of the compounds of the present invention and their pharmaceutically acceptable salts and solvates may typically be from 10 to 500 mg (in single or divided doses). Thus, and by way of example, tablets or capsules may contain from 5 to 100 mg of active compound for administration singly, or two or more at a time, as appropriate. As indicated above, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient.
The routes of administration and dosages described are intended only as a guide since a skilled practitioner will be able to determine readily the optimum route of administration and dosage for any particular patient depending on, for example, the age, weight and condition of the patient.
P. vivax plasmepsin V (residues R35-R476 according to SEQ ID NO:2), bearing an N-terminal gp67 signal peptide, and a fusion tag comprised of a FLAG tag, SUMO domain and tobacco etch virus (TEV) protease cleavage site was expressed in High Five insect cells (SEQ ID NO: 5).
Recombinant protein was affinity purified initially from cell supernatant using anti-FLAG M2-agarose (Sigma). Pooled fractions were concentrated and the N-terminal fusion tag removed using TEV protease (1:25 v/v, 5 hr, room temperature; see
Size exclusion chromatography (SEC) was then used to obtain high purity plasmepsin V. SEC (Superdex 75, GE lifesciences) was carried out in 20 mM HEPES pH7.2/100 mM NaCl/0.2 mM DTT and resulted in pure and stable protein that was concentrated for crystallisation (see again
A similar procedure was used for the production of P. falciparum plasmepsin V (residues N80-R528 according to SEQ ID NO:1) except that sf21 insect cells were used for protein expression as less covalent aggregate was produced in this cell system.
The in vitro activity of recombinant P. falciparum and P. vivax plasmepsin V was assessed against a fluorogenic peptide of nine amino acids that contained the PEXEL sequence from knob-associated histidine-rich protein (“KAHRP”) (DABCYL-RNKRTLAQKQ-E-EDANS; SEQ ID NO: 7).
Both plasmepsin V demonstrated specific activity against the PEXEL substrates and cleaved the peptides after the P1 Leucine residue as expected (see
Values for Km were derived from both the inverse Michaelis-Menten and the Lineweaver-Burk plots. These values were of the same magnitude for each enzyme and are also similar to the Km derived from the activity of another recombinant form of P. falciparum plasmepsin V on a different PEXEL substrate (Xiao, H. et al. 2014).
The Km for recombinant plasmepsin V using the fluorogenic PEXEL substrate from KAHRP (SEQ ID NO:7) was 20.2 μM and 6.0 μM for the P. falciparum and P. vivax enzymes, respectively (see
PEXEL cleavage assays (20 μl total volume) consisted of 1 ng/well of P. vivax plasmepsin V or 1.5 ng/well of P. falciparum plasmepsin V in buffer (25 mM Tris.HCl, 25 mM MES, pH 6.4) with 5 μM FRET peptide substrate (DABCYL-RNKRTLAQKQ-E-EDANS (SEQ ID NO:7) or peptides with the sequence DABCYL-RNKKTLAQKQ-E-EDANS (SEQ ID NO:8) or DABCYL-RNKRTIAQKQ-E-EDANS (SEQ ID NO:9)) (Sleebs, B. E. et al. 2014a; see
Compounds WEHI-916 (see
Reactions comprised of a fluorescent peptide of nine amino acids containing the PEXEL sequence (RTLAQ) from KAHRP. The KAHRP PEXEL peptide substrate DABCYL-RNKRTLAQKQ-E-EDANS (SEQ ID NO:7) was obtained commercially and used at a final assay concentration of 7.5 μM.
The end-point for all assays was set within the linear range of activity (approximately 1 hr.).
Tween-20 was used at 0.005% final assay concentration. Final assay buffer concentration was as follows: 25 mM Tris HCl, 25 mM MES (pH 6.4). Final assay volume was 20 μL.
An 11-point 1/3 serial dilution of compounds was generated using DMSO as a diluent (final assay concentration of 1%).
Assay reaction was incubated for 60 min at 37° C. and read using a fluorescence plate reader (Ex 340 nm, Em 495 nm). IC50 values were determined using a nonlinear regression four-parameter fit analysis, where two of the parameters were constrained to 0 and 100%.
WEHI-842 was shown to have a 15-fold increased potency against recombinant P. falciparum plasmepsin V (IC50 of 0.79 nM) compared to WEHI-916 (IC50 of 12.9 nM) (see
P. falciparum strains 3D7, NF54, CS2 and W2mef were cultured in human O+ erythrocytes at 4% haematocrit in RPMI 1640 medium containing 25 mM HEPES, pH 7.4, 0.2% sodium bicarbonate, 0.5% Albumax II (Life Technologies) and 5 nM WR99210 selection where required (a gift from Jacobus Pharmaceuticals) in 5% CO2, 5% O2, 90% N at 37° C.
P. falciparum 3D7 expressing the PEXEL protein erythrocyte membrane protein 3 (PfEMP3) fused to green fluorescent protein (GFP) (PfEMP3-GFP) was generated previously (Boddey, J. A. et al, 2010) (see
Growth assays were performed in 96 well plates by incubating ring-stage P. falciparum parasites with WEHI-916 or WEHI-842 solubilized in DMSO, or chloroquine solubilised in water, at the indicated concentrations and parasitaemia was determined at 72 hr by flow cytometry as described previously (Sleebs, B. E. et al. 2014a).
P. falciparum trophozoites expressing PfEMP3-GFP were magnet-purified (Miltenyi Biotech), incubated with inhibitors for 1-4 hr. at 37° C., treated with 0.09% saponin containing inhibitor, and washed pellets were solubilized in Laemmli's buffer, boiled for 3 min and proteins were separated by SDS-PAGE, transferred to nitrocellulose and blocked in 1% skim milk. Membranes were probed with mouse a-GFP (Roche; 1:1000), rabbit a-Aldolase (1:1000) or rabbit a-HSP70 (1:4000) antibodies followed by horseradish peroxidase-conjugated secondary antibodies (Silenius; 1:2000) and visualized using enhanced chemiluminescence (Amersham). To radiolabel P. falciparum proteins, magnet-purified trophozoites expressing PfEMP3-GFP were treated with 10 μM inhibitors for 3 hr. (the final 30 min in Met/Cys-free medium) at 37° C. before addition of 800 μCi/ml 35S-Met/Cys (Perkin/Elmer) to the medium for 10 min. PfEMP3-GFP protein species were then immunoprecipitated from parasite lysates solubilized in 1% Triton X-100/PBS containing 1× complete protease inhibitors (Roche) using anti-GFP agarose (MBL) for 2 hours at 4° C. and proteins were resolved by SDS-PAGE, visualized by autoradiography and quantified using a GS-800 Calibrated Densitometer (Bio-Rad) (Sleebs, B. E. et al. 2014a).
The ability of WEHI-842 to inhibit growth of P. falciparum (3D7) parasites, a chloroquinesensitive strain, was found to be 10 fold better than parasites treated the same way with WEHI-916 (EC50 0.40 μM and EC50 5.0 μM, respectively) (see
The growth assays also showed that WEHI-842 efficiently inhibited PEXEL cleavage, as shown by accumulation of a 35 kDa band corresponding to full-length PfEMP3-GFP (Boddey, J. A. et al. 2010; Sleebs, B. E. et al. 2014a) (see
In order to provide evidence WEHI-842 specifically inhibited plasmepsin V and no other normal cellular functions, magnet purified trophozoites treated with DMSO or WEHI-842 (2.5, 5, 10 μM) for 3 hr prior to labelling parasite proteins with 35S-methionine/cysteine for 15 min (see
Treatment with 10 jμM WEHI-842 for 3 hours showed no inhibitory effect on translation but potently inhibited plasmepsin V cleavage of PfEMP3-GFP. These results show that WEHI-842 is a potent and specific inhibitor of plasmepsin V cleavage of the PEXEL in P. falciparum parasites.
Pulse chases were performed by radiolabelling proteins as above, before further culture in radiolabel-free, inhibitor-free complete medium for 30 and 60 min at 37° C. before PfEMP3-GFP protein species were purified, resolved, visualized and quantified by densitometry, as described above.
Pulse: P. falciparum trophozoites expressing PfEMP3-GFP were magnet-purified (Miltenyi Biotech) and treated with 10 μM WEHI-842 or DMSO for 3 hr. Parasite proteins were labelled in the presence of inhibitors by addition of 800 μCi/ml 35S-Met/Cys (Perkin/Elmer) to the medium for the last 50 min of the 3 hr. inhibitor treatment time (this included 30 min in Met/Cys-free medium containing inhibitor prior to labeling). Chase: Export of labeled proteins to the erythrocyte was chased by culturing parasites in radiolabel-free, inhibitor-free complete medium for either 20, 40, 60 or 120 min at 37° C. before exported proteins were liberated from the infected erythrocytes by tetanolysin treatment (100 U/mL tetanolysin (Sigma), 0.2% bovine serum albumin (Sigma), 1× complete protease inhibitor cocktail (Roche)) for 5 min at 37° C. and centrifugation at 1500 g for 1 min. PfEMP3-GFP proteins were purified from the tetanolysin supernatant fraction and resolved, visualized and quantified by densitometry, as described above (see
WEHI-842 was found to be 10-fold more potent than WEHI-916 at inhibiting plasmepsin V cleavage of the PEXEL motif in PfEMP3-GFP. Removal of WEHI-842 and continued growth in inhibitor-free medium showed inhibition of PEXEL cleavage was reversible, as the parasite recommenced cleaving full-length PfEMP3-GFP (see
Surface plasmon resonance was performed using a Biacore 4000 (GE Healthcare). Plasmepsin V was immobilized on a Series S CM5 sensor chip using amine-coupling chemistry. The surfaces of flow cells were activated for 10 min with a 1:1 mixture of 0.1 M NHS (N-hydroxysuccinimide) and 0.1 M EDC (3-(N,N-dimethylamino)propyl-N-ethylcarbodiimide) at a flow rate of 5 al/min. Plasmepsin V (20 μg/ml in 10 mM sodium acetate, pH 4.5, was immobilized at a density of approximately 10,000. Spot 3 was activated and deactivated as above and used as a reference surface. All surfaces were blocked with a 7 min injection of 1 M ethanolamine, pH 8.0. WEHI-916 and WEHI-842 11-point titrations were prepared by diluting 1:2 from 1000 nM and injected at a flow rate of 30 μl/min. Buffer used was 10 mM HEPES, 150 mM NaCl, 3 mM EDTA, 0.05% Tween and 2% DMSO. Analysis was performed at 25° C. After compound injection, the chip surface was regenerated with 10 mM glycine-HCl, pH 2 for 30 s. Compounds were allowed to associate and dissociate for 250 s and 600 s, respectively. Data were collected at a rate of 10 Hz and fit to a 1:1 interaction model using the Biacore 4000 Evaluation Software Version 1.0.
The surface plasmon experiments showed that the affinity (KD) of WEHI-842 and WEHI-916 for P. vivax plasmepsin V were 13.4 and 42.0 nM, respectively (see
Furthermore, the kd for WEHI-842 was found to be lower than that for WEHI-916 (kd of 1.11 E-03 (1/s) and 2.47 E-03 (1/s) respectively), this slower off-rate is consistent with the higher affinity of WEHI-842. Taken together these results show that WEHI-842 is a considerably more potent inhibitor for of P. falciparum and P. vivax plasmepsin V than WEHI-916.
Crystallisation, Structure Solution and Refinement of Plasmepsin V in Complex with WEHI-842
Samples of P. vivax plasmepsin V (8 mg/ml) were prepared for crystallisation by combining with a 6× molar ratio of WEHI-842.
A crystallization condition (0.11M ammonium sulfate/5% (v/v) jeffamine M-600/15.5% (w/v) polyethylene glycol 4000/0.1M sodium acetate-acetic acid pH4.16) was detected and refined for P. vivax plasmepsin V/WEHI-842.
Single crystals of the P. vivax plasmepsin V/WEHI-842 were frozen in well solution supplemented with 20% Ethylene Glycol.
All diffraction data was processed using the XDS suite (Kabsch, W., 2010; Evans, P. R., 2011), Pointless (Evans, P. R., 2011) and Aimless (Evans, P. R. & Murshudov, G. N., 2013). Statistics for the dataset is provided in Table 2 below.
The structure was solved by molecular replacement with Phaser (McCoy, A. J. et al., 2007) using a Sculptor (Bunkoczi, G. & Read, R. J., 2011) modified version of Cathepsin E (PDB 1TZS; Ostermann, N. et al., 2004) as the search model.
Further rounds of building and refinement with Coot (Emsley, P. & Cowtan, K., 2004) and Phenix (Adams, P. D., et al. 2010) yielded the final model. Patches of poorly defined density connected residues R241-E272 as per SEQ ID NO:2 but was of inadequate quality for confident model building. Density observed proximal to Asn355 may be due to glycosylation of this residue during protein expression. Refinement statistics are also provided in Table 2 below.
1Numbers in parentheses refer to the statistic in the highest resolution shell.
2Rmerge = hkl i | Ii(hkl) − <I(hkl)> |/hkl i Ii(hkl)
3Rwork and Rfree are computed using R = <|Fhxpct − Fhobs|>/<| Fhobs |> (where Fhxpct is the expectation value of the model structure amplitude; Blanc et al., 2004).
Analytical thin-layer chromatography was performed on Merck silica gel 60F254 aluminum-backed plates and were visualized by fluorescence quenching under UV light or by KMnO4 staining. Flash chromatography was performed with silica gel 60 (particle size 0.040-0.063 mm). NMR spectra were recorded on a Bruker Avance DRX 300 (1H NMR at 300 MHz) or a Varian 600 (1H NMR at 600 MHz) with the solvents indicated. Chemical shifts are reported in ppm on the δ scale and referenced to the appropriate solvent peak. MeOD contains H2O. HRESMS were acquired by Jason Dang at the Monash Institute of Pharmaceutical Sciences Spectrometry Facility using an Agilent 1290 infinity 6224 TOF LCMS. Column used was RRHT 2.1×50 mm 1.8 μm C18. Gradient was applied over the 5 min with the flow rate of 0.5 mL/min. For MS: Gas temperature was 325° C.; drying gas 11 L/min; nebulizer 45 psig and the fragmentor 125V. LCMS were recorded on a Waters ZQ 3100 using a 2996 Diode Array Detector. LCMS conditions used to assess purity of compounds were as follows, column: XBridge™ C18 5 μm 4.6×100 mm, injection volume 10 μL, gradient: 10-100% B over 10 min (solvent A: water 0.1% formic acid; solvent B: AcCN 0.1% formic acid), flow rate: 1.5 mL/min, detection: 100-600 nm. All final compounds were analyzed using ultrahigh performance liquid chromatography/ultraviolet/evaporative light scattering detection coupled to mass spectrometry. Unless otherwise noted, all compounds were found to be >95% pure by this method. WEHI-916 was prepared as previously described (Sleebs, B. E. et al, 2014a; Sleebs, B. E. et al., 2014b).
a) SOCl2, MeOH, 18 h; b) Cbz-OSu, Et3N, THF, H2O, 1 h; c) (Boc)2O, Et3N, THF, H2O, 18 h; d) LiOH, THF, H2O, 4 h; e) phenylethylamine, HBTU, DIPEA, DMF, 18 h; f) 4N HCl in dioxane, 1 h; g) Boc-Val-OH, HBTU, DIPEA, DMF, 18 h; h) 4N HCl in dioxane, 1 h; i) HCl.NH2—Val-Sta-NH(CH2)2Ph 8, HBTU, DIPEA, DMF, 18 h; j) TFA, DCM, 18 h.
Thionyl chloride (529 μL, 7.29 mmol) was added drop-wise to MeOH (8 mL) at 0° C. under a nitrogen atmosphere. H2SO4.H-Cav-OH 1 (1 g, 3.65 mmol) was added and the resulting suspension was allowed to stir for 18 h at 20° C. The reaction mixture was concentrated to dryness in vacuo to obtain 2 as a colourless hygroscopic residue (920 mg, 99%). 1H NMR (600 MHz, DMSO) δ 8.63 (br s, 2H), 7.76 (s, 4H), 4.17 (t, J=6.7 Hz, 1H), 4.02-3.92 (m, 2H), 3.76 (s, 3H), 2.24-2.06 (m, 2H).
A mixture of 2HCl.NH2—Cav-OMe 2 (330 mg, 1.25 mmol), Et3N (262 μL, 1.88 mmol), and Cbz-OSu (281 mg, 1.13 mmol) in a mixture of water (4 mL) and THF (5.3 mL) was allowed to stir for 1 h at 20° C. The reaction was quenched with saturated NaHCO3 and extracted with EtOAc (3×20 mL). The combined organic layers were washed with brine (30 mL) and dried with MgSO4. The solvent was concentrated in vacuo to obtain a crude residue. The crude residue was subjected to silica chromatography gradient eluting with 100% DCM to 20% MeOH/DCM to obtain 3 as a colourless hygroscopic residue (270 mg, 57%). 1H NMR (600 MHz, MeOD) δ 7.39-7.24 (m, 5H), 5.17-5.03 (m, 2H), 4.43-4.10 (m, 1H), 3.95-3.78 (m, 2H), 3.77-3.59 (m, 3H), 2.28-1.86 (m, 2H). 13C NMR (75 MHz, MeOD) δ 174.69, 159.16, 158.67, 138.08, 129.47, 129.02, 128.76, 70.27, 67.72, 67.49, 52.76, 31.65. MS, m/z=325.1 [M+H]+.
A mixture of Cbz-Cav-OMe 3 (270 mg, 0.832 mmol), Et3N (147 μL, 1.25 mmol), and Boc-anhydride (218 mg, 0.999 mmol) in a mixture of water (3 mL) and THF (12 mL) was allowed to stir for 18 h at 20° C. Water was added to the reaction mixture and extracted with EtOAc (3×10 mL). The combined organic layers were dried over MgSO4 and the solvent was concentrated in vacuo to obtain a crude residue. The crude residue was subjected to silica chromatography gradient eluting with 100% cyclohexane to 80% EtOAc/cyclohexane to obtain 4 as a colourless hygroscopic residue (122 mg, 35%). 1H NMR (600 MHz, CDCl3) δ 7.38-7.24 (m, 5H), 6.10 (br s, 1H), 5.77 (d, J=8.2 Hz, 1H), 5.08 (s, 2H), 4.55-4.44 (m, 1H), 3.93-3.83 (m, 2H), 3.76-3.60 (m, 3H), 2.24-1.97 (m, 2H), 1.45 (d, J=12.5 Hz, 9H). 13C NMR (151 MHz, CDCl3) δ 173.29, 156.06, 153.27, 151.25, 136.31, 128.54, 128.17, 128.07, 82.04, 68.81, 67.01, 52.51, 51.58, 31.38, 28.17. MS, m/z=425.3 [M+H]+.
A mixture of Cbz-Cav(N,N-Boc)-OMe 4 (100 mg, 0.236 mmol), and LiOH hydrate (35 mg, 0.825 mmol) in a mixture of water (1 mL) and THF (3 mL) was allowed to stir for 4 h at 20° C. 10% Citric acid solution was added to the reaction mixture. The solution was extracted with EtOAc (3×10 mL). The combined organic layers were washed with brine (20 mL) and dried with MgSO4. The solvent was concentrated in vacuo to obtain 5 as an oil (96 mg, 99%). 1H NMR (600 MHz, MeOD) δ 7.39-7.23 (m, 5H), 5.14-5.09 (m, 2H), 4.40-4.29 (m, 1H), 4.00 (br s, 2H), 2.35-2.23 (m, 1H), 2.03-1.94 (m, 1H), 1.51 (s, 9H). MS, m/z=411.3 [M+H]+.
Compound 6 HCl.NH2—Sta-NH(CH2)2Ph (6)
Compound 6 was synthesized according to previously described procedure (Sleebs, B. E. et al, 2014a; Sleebs, B. E. et al., 2014b).
Compound 7 Boc-Val-Sta-NH(CH2)2Ph (7)
To a stirred solution of Boc-Val-OH (90 mg, 0.414 mmol) in DMF (1.8 mL) was added HBTU (204 mg, 0.538 mmol) and DIPEA (361 μL, 2.07 mmol). The reaction mixture was stirred for 10 min at 20° C. An excess of HCl.NH2—Sta-NH(CH2)2Ph 6 (196 mg, 0.621 mmol) was added and the resulting suspension was allowed to stir for 18 h at 20° C. The reaction mixture was quenched with 10% citric acid solution. The resultant precipitate was filtered off to obtain 7 as an off-white solid (196 mg, 99%). 1H NMR (600 MHz, MeOD) δ 7.30-7.15 (m, 5H), 4.00-3.90 (m, 2H), 3.81 (d, J=6.8 Hz, 1H), 3.40 (t, J=7.4 Hz, 2H), 2.80 (t, J=7.4 Hz, 2H), 2.31-2.20 (m, 2H), 2.10-2.01 (m, 1H), 1.68-1.51 (m, 2H), 1.45 (s, 9H), 1.37-1.27 (m, 1H), 0.96 (dd, J=23.0, 6.8 Hz, 6H), 0.91 (dd, J=12.6, 6.6 Hz, 6H). 13C NMR (75 MHz, MeOD) δ 174.56, 173.86, 158.20, 140.52, 129.78, 129.48, 127.32, 80.79, 71.14, 62.44, 52.28, 42.04, 41.76, 41.68, 36.49, 31.30, 28.75, 25.81, 23.69, 22.32, 19.99, 18.52. MS, m/z=478.6 [M+H]+.
Compound 8 HCl.NH2—Val-Sta-NH(CH2)2Ph (8)
A mixture of Boc-Val-Sta-NH(CH2)2Ph 7 (170 mg, 0.356 mmol), in 4N HCl in dioxane (0.8 mL) was allowed to stir for 1 h at 20° C. The reaction mixture was concentrated to dryness in vacuo. The oil was triturated with Et2O and the supernatant decanted to obtain 8 as a dull yellow solid (145 mg, 99% yield). 1H NMR (600 MHz, MeOD) δ 7.31-7.16 (m, 5H), 4.03-3.95 (m, 2H), 3.80 (d, J=4.8 Hz, 1H), 3.51-3.42 (m, 2H), 2.83 (t, J=7.4 Hz, 2H), 2.42-2.23 (m, 3H), 1.72-1.63 (m, 1H), 1.58-1.51 (m, 1H), 1.43-1.36 (m, 1H), 1.08 (dd, J=38.2, 6.9 Hz, 6H), 0.94 (dd, J=14.4, 6.6 Hz, 6H). 13C NMR (75 MHz, MeOD) δ 174.76, 169.66, 140.13, 129.78, 129.52, 127.44, 71.03, 59.69, 53.45, 42.64, 41.11, 40.97, 36.17, 31.56, 25.73, 23.66, 22.40, 19.28, 17.51. MS, m/z=378.4 [M+H]+.
Compound 9 Cbz-Cav(N,N-Boc)-Val-Sta-NH(CH2)2Ph (9)
To a stirred solution of Cbz-Cav(N,N-Boc)-OH 5 (50 mg, 0.122 mmol) in DMF (1 mL) was added HBTU (60 mg, 0.158 mmol) and DIPEA (127 μL, 0.731 mmol). The reaction mixture was stirred for 10 min at 20° C. An excess of HCl.NH2—Val-Sta-NH(CH2)2Ph 8 (61 mg, 0.146 mmol) was added and the resulting suspension was allowed to stir for 18 h at 20° C. The reaction mixture was quenched with 10% citric acid solution and extracted with EtOAc (2×10 mL). The combined organic layers were then washed with saturated NaHCO3 solution (1×20 mL). The organic layer was washed with brine (20 mL), dried with MgSO4 and the solvent was concentrated in vacuo to obtain a crude residue. The crude residue was subjected to silica chromatography gradient eluting with 100% DCM to 10% MeOH/DCM to obtain 9 as a colourless oil (52 mg, 55%). 1H NMR (600 MHz, CDCl3) δ 7.42 (br s, 1H), 7.35-7.26 (m, 7H), 7.24-7.14 (m, 3H), 6.86 (br s, 1H), 6.62 (br s, 1H), 6.25 (br s, 1H), 6.09 (br s, 1H), 5.14-5.03 (m, 2H), 4.42-4.31 (m, 1H), 4.23 (t, J=6.9 Hz, 1H), 4.03-3.78 (m, 4H), 3.56-3.37 (m, 2H), 2.80 (t, J=7.3 Hz, 2H), 2.39-1.85 (m, 5H), 1.62-1.51 (m, 2H), 1.51-1.41 (m, 9H), 1.39-1.30 (m, 1H), 0.99-0.79 (m, 12H). 13C NMR (75 MHz, CDCl3) δ 172.46, 172.38, 171.58, 157.83, 156.64, 152.76, 139.12, 136.45, 128.87, 128.71, 128.36, 128.12, 127.95, 126.59, 83.45, 71.24, 70.72, 67.30, 60.38, 53.00, 51.69, 41.09, 40.83, 35.77, 32.36, 30.50, 28.39, 28.23, 25.06, 23.23, 22.29, 19.57, 18.32. MS, m/z=770.5 [M+H]+.
WEHI-842 Cbz-Cav(NH2)—Val-Sta-NH(CH2)2Ph.TFA (WEHI-842)
A mixture of Cbz-Cav(N,N-Boc)-Val-Sta-NH(CH2)2Ph 9 (52 mg, 0.068 mmol), in TFA (0.8 mL) and DCM (0.8 mL) was allowed to stir for 18 h at 20° C. The reaction mixture was concentrated to dryness in vacuo. The oil was triturated with Et2O and the supernatant decanted to obtain WEHI-842 as a colourless hygroscopic residue (52 mg, 98%). 1H NMR (600 MHz, MeOD) δ 7.40-7.14 (m, 10H), 5.11 (s, 2H), 4.43-4.26 (m, 1H), 4.26-4.12 (m, 1H), 4.01-3.89 (m, 3H), 3.46-3.33 (m, 2H), 2.86-2.70 (m, 2H), 2.65-2.48 (m, 1H), 2.32-1.90 (m, 4H), 1.68-1.43 (m, 2H), 1.40-1.23 (m, 1H), 1.02-0.77 (m, 12H). 13C NMR (75 MHz, MeOD) δ 174.14, 173.96, 173.55, 160.45, 158.51, 140.48, 137.98, 129.77, 129.70, 129.50, 129.47, 129.10, 128.86, 127.33, 74.53, 71.28, 67.92, 60.80, 53.34, 52.87, 42.06, 41.56, 41.21, 36.50, 31.73, 31.49, 25.86, 23.71, 22.30, 19.97, 18.66. MS, m/z=670.5 [M+H]+. HRMS found: (M+H) 670.3925; C34H51N7O7 require (M+H), 670.3928.
The disclosure of all publications referred to in this application are incorporated herein by reference.
In the present specification and claims (if any), the word ‘comprising’ and its derivatives including ‘comprises’ and ‘comprise’ include each of the stated integers but does not exclude the inclusion of one or more further integers.
Reference throughout this specification to ‘one embodiment’ or ‘an embodiment’ means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearance of the phrases ‘in one embodiment’ or ‘in an embodiment’ in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more combinations.
In compliance with the statute, the invention has been described in language more or less specific to structural or methodical features. It is to be understood that the invention is not limited to specific features shown or described since the means herein described comprises preferred forms of putting the invention into effect. The invention is, therefore, claimed in any of its forms or modifications within the proper scope of the appended claims (if any) appropriately interpreted by those skilled in the art.
Number | Date | Country | Kind |
---|---|---|---|
2015902134 | Jun 2015 | AU | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/AU2016/050459 | 6/8/2016 | WO | 00 |