The present invention relates generally to methods of using the structure coordinates of a Tau binding pocket. In particular, the invention provides a three-dimensional model of a Tau binding pocket and means for identifying, selecting and/or designing modulators of Tau aggregation by rational drug design. The invention also relates generally to using the structure coordinates of a Tau protein that is an intermediate in the process of folding of the Tau protein from a compact conformation comprising the binding pocket to an aggregated confirmation in a paired helical filament.
Disorders related to tau, collectively referred to as neurodegenerative tauopathies (Lee et al., 2001) represent a group of diseases of protein aggregation (Buee et al., 2000). Alzheimer's disease (AD) is part of this group of neurodegenerative diseases. Conditions of dementia such as Alzheimer's disease (AD) are frequently characterised by a progressive accumulation of intracellular and/or extracellular deposits of proteinaceous structures such as β-amyloid plaques and neurofibrillary tangles (NFTs) composed of tau, in the brains of affected patients. The appearance of these lesions largely correlates with pathological neuronal degeneration and brain atrophy, as well as with cognitive impairment (see, e.g., Mukaetova-Ladinska et al., 2000). In AD, tau protein self-assembles to form paired helical filaments (PHFs) and straight filaments that constitute the neurofibrillary tangles within neurons and dystrophic neurites in brain. Protein misfolding to form amyloid fibrils is a hallmark of many different diseases collectively known as the amyloidoses, each of which is characterised by a specific precursor protein (Sipe, 1992).
Tau exists in alternatively-spliced isoforms, which contain three or four copies of a repeat sequence corresponding to the microtubule-binding domain (see, e.g., Goedert, M., et al., 1989). The tau species isolated from proteolytically stable PHF-core preparations from AD brain tissue comprise a mixture of fragments derived from both three- and four-repeat isoforms, but restricted to the equivalent of three repeats, with an N-terminus at residues Ile-297 or His-299 and the C-terminus at residue Glu-391, or at homologous positions in other species (Wischik et al., 1988; Jakes et al., 1991). The present inventors have previously shown that the fragment 297-391 (referred to as dGAE) self-assembles to form Paired helical filament-like structures under specific conditions (Al-Hilaly et al., 2017) and that assembly is enhanced under reducing conditions (in the presence of DTT) or at high concentrations. Assembly is accompanied by a conformational change from random coil in solution to beta sheet structure in the insoluble fraction (Al-Hilaly et al., 2017). Residues 306-378 from this same molecule (referred to as “dGAE73” herein) have been visualised in electron density and confirmed as forming part of the core of the PHF using cryo-electron microscopy analysis of PHFs extracted from AD brain tissue (Fitzpatrick et al., 2017).
However, despite the availability of structural data on amyloid fibrils formed by different proteins and from PHFs, the elucidation of a mechanism of amyloid fibril formation from disjoint monomers remains elusive (Sipe 1992; Chiti & Dobson 2006; Roberts 2007; Kelly 2000). To date there has been no agreed description of the primary stages of nucleation from a pool of disjoint soluble oligomeric species (Roberts 2007, Kodali & Wetzel, 2007; Serio et al., 2000; Modler et al., 2004; Glabe 2008; Meisl et al., 2016). A number of amyloidogenic precursors and oligomeric states have been identified (Campioni et al., 2010; Novo et al., 2018; Kayed et al., 2003; Buccianti et al., 2002 and 2004; Gerson et al., 2016), yet isolation and further characterisation of these structurally heterogeneous and transient aggregates remains a challenge. Evidence of soluble protein oligomers as the primary cause of cell impairment and dysfunction in the pathogenesis of tauopathies has been supported through biochemical and biophysical experiments and in vivo assays (Macdonald et al., 2019; Lasagna-Reeves et al., 2012; Gerson et al., 2016).
The present inventors have shown previously that the methylthioninium (MT) species, which can exist in oxidized (MTC,
As tau aggregation pathology and cognitive decline are closely associated (Okamura et al., 2014; Chien et al., 2013; Mukaetova-Ladinska et al., 2000; Grober et al., 1999; Wilcock and Esiri, 1982; Duyckaerts et al., 1997; Bancher et al., 1993 & 1996; Arriagada et al., 1992), treatment based on the use of inhibitors of tau aggregation offers a potentially promising therapeutic approach for AD (Wischik et al., 2018). A fundamental component for the development of new chemical entities for the treatment of neurodegenerative tauopathies is an understanding of the structure of key intermediates in this aggregation process, and how compounds such as LMT can interfere with this process. This would enable the development of a structure-based hypothesis for ligand design.
In an attempt to address the above-mentioned needs, the present inventors devised multistep computational protocols to explore the conformational flexibility of a previously reported truncated tau fragment corresponding to one of the species isolated from proteolytically stable PHF preparations (residues 297Ile-Glu391) and referred to as “dGAE” (Wischik et al., 1988; Novak et al., 1991; Novak et al., 1993). dGAE assembles spontaneously in physiological conditions in vitro into filaments that closely resemble native PHFs and straight filaments from AD brain morphologically (AI-Hilaly et al., 2017; AI-Hilaly et al., 2020). The analyses were aimed at understanding the flexibility of dGAE, its aggregation into PHFs and the potential of small molecules to inhibit PHF assembly and to cause PHF disaggregation in vivo. In an effort to understand the mechanism of how small molecules could inhibit dGAE assembly into fibrils, the conformation space of a single Tau97 monomer (comprising dGAE and preceding residues 295Asp296Asn) was evaluated with atomistic molecular simulation. The inventors hypothesized that LMT could complex to dGAE monomers and prevent fibril formation. To test this hypothesis, representative protein structures were sampled from the conformational ensemble of apo Tau97. From this, the inventors were able to identify a unique cryptic pocket which would bind to LMT and form a stabilised complex. From the conformation of the complex a pharmacophore model was built and a series of de novo designed compounds were synthesised in order to assess whether stabilising dGAE oligomers would result in inhibition of Tau aggregation. Compounds were assessed in a cell-based tau aggregation assay. More than half of the assayed molecules showed sub micro molar activity. From this, the inventors were able to rationalise the structure-activity relationship from this series of Tau assembly inhibitors.
Thus, the present invention is based, in part, on the discovery of a ligand binding pocket in the Tau protein structure, wherein binding of a ligand in the pocket stabilises the Tau protein in a conformation that is not prone to aggregation into paired helical filaments (PHFs). Interactions between the ligand and any of residues Leu315, Ser341, Glu342, Lys343, Phe346, Lys347, Val350, Ser352, Ile354, Lys369, Ile371, Glu372, Phe378 and Thr373 of Tau were found to be particularly important in stabilising the ligand bound conformation.
The formation of PHF was then analysed through immunostaining dot blots and ELISA assays. These showed that in vitro assembly of PHFs is associated with loss of immunoreactivity with antibodies directed against the PHF core. Furthermore, loss of immunoreactivity could be prevented by including LMT during assembly. Starting from the stabilised conformation of dGAE73 (residues 306-378), a molecular simulation was performed to examine assembly onto a preformed PHF. This simulation supported findings from the dot blots for the assembly of PHFs from stabilized dGAE oligomers. The simulation identified 3 key stages in assembly of PHFs. The first is the electrostatic attraction of oligomers to PHFs and the anchoring of a hairpin turn. This is followed by a proline trans-cis-trans switch which allows for the final step of the formation of stabilising cross-β sheets through hydrophobic zippering of the N- and C-terminal tails of the protein structure. Taken together, the structure-activity results and the dot blots provide strong evidence for the binding of small molecules to the cryptic pocket proposed in our dGAE stabilised conformation, which finds uses in a structure-based approach to design further inhibitors of tau assembly.
Thus the present invention is based, in other aspects, on the discovery of a folding process through which the Tau protein in the compact folded state that includes the binding pocket aggregates with a paired helical filament, where binding of a ligand that inhibits the formation of intermediates in the folding process may stabilise the Tau protein in a conformation that is not prone to aggregation into PHFs. A series of conformational changes from a compact folded state to an aggregated state were found to be particularly important. These conformational changes are such that:
In embodiments, residue Pro332 switching between a trans and a cis configuration causes residues His329, His330 and Lys331 to move close enough to PHF to establish interactions with the partner residues in the next layer of the PHF stack through hydrophobic stacking and a strong hydrogen bond between the side chains of His330 in a preformed layer of the PHF stack and Thr361 of the newly formed dGAE layer on the PHF stack.
In embodiments, step (iii) comprises the following steps: (iii)(1) residues 355-360 move to establish a zipper with the corresponding anti-parallel residues of the PHF; (iii)(2) residues 361-367 move to establish a zipper with the corresponding anti-parallel residues of the PHF; and (iii)(3) residues 306-318 and 368-378 move to form a cross-beta sheet conformation with corresponding residues of the PHF.
According to a first aspect, there is provided a method for selecting or designing a compound for modulating the aggregation of a Tau protein or a truncated form thereof, the method comprising using computer-implemented molecular modelling means to:
According to a related aspect, there is provided a method for selecting or designing a compound for modulating the aggregation of a Tau protein or a truncated form thereof, the method comprising using computer-implemented molecular modelling means to:
As used herein, the step of comparing the three-dimensional structure of a candidate compound with a three-dimensional structure of at least a part of the Tau protein comprises providing the three-dimensional structure of the candidate compound, providing the three-dimensional structure of the part of the Tau protein, and obtaining a molecular model that includes both three-dimensional structures.
In embodiments, the non-covalent interactions comprise interactions with one or more, optionally with two or more of: Leu315, Ser341, Glu342, Lys343, Phe346, Lys347, Val350, Ser352, Lys369, Ile371, Glu372, and Thr373, of SEQ ID NO:1, or equivalent amino acids in a variant or derivative. Non-covalent interactions with the above-mentioned residues of the Tau protein were found to stabilise a compact conformation of the Tau protein (an example of which is provided by the coordinates defined in Table 1), reducing the propensity of the Tau protein to aggregate.
In embodiments, the non-covalent interactions comprise at least a non-covalent interaction with Lys347. Indeed, a potent inhibitor of Tau aggregation, compound 16, was identified by the inventors as binding the Tau protein by establishing an interaction with Lys347, thereby stabilising a conformation of the protein that has reduced propensity to aggregate. Further potent inhibitors of Tau aggregation were also found to interact with Lys347, including LMT, compound 17, compound 12, compound 1, compound 7, compound 5, compound 14, compound 2, compound 8, compound 13 and compound 18.
In embodiments, the non-covalent interactions comprise at least a non-covalent interaction with Lys343. Indeed, a potent inhibitor of Tau aggregation, compound 11, was identified by the inventors as binding the Tau protein by establishing an interaction with Lys343, thereby stabilising a conformation of the protein that has reduced propensity to aggregate. Further potent inhibitors of Tau aggregation were also found to interact with Lys343, including LMT, compound 17, compound 12, compound 3, compound 14, compound 10, compound 13 and compound 18.
In embodiments, the non-covalent interactions comprise at least a non-covalent interaction with Thr373. Indeed, a potent inhibitor of Tau aggregation, compound 9, was identified by the inventors as binding the Tau protein by establishing an interaction with Thr373, thereby stabilising a conformation of the protein that has reduced propensity to aggregate. Further potent inhibitors of Tau aggregation were also found to interact with Thr373, including LMT, compound 3, compound 4, and compound 15.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more of Lys343, Leu315, Lys347, Glu372, and Thr373, of SEQ ID NO:1.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with one or more of Leu315, Val350, and Ile371 of SEQ ID NO:1. Interactions with these residues are preferably hydrophobic interactions, involving the side chain of the residue. Indeed, multiple potent inhibitors of Tau aggregations, such as compounds 16, 9 and 11, were identified by the inventors as binding the Tau protein by establishing an interaction with these residues.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys347, Lys343, Ser352 and Thr373 of SEQ ID NO: 1. A known inhibitor of Tau aggregation, LMT was identified by the inventors as binding the Tau protein in a cryptic binding pocket, where it establishes interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys347, Lys343, Ser341, Leu315, Ile371, Glu372 and Phe346 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 17, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys347, Lys343, Ile371, and Glu372 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 12, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys347, Ile371, Glu342, Phe346 and Glu372 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 1, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Leu315, Lys347, Ile371, Glu342, Phe346 and Glu372 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 7, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with at least Lys343 and Thr373 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 3, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Glu372, Lys369, and Thr373 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 7, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys343, Lys347, Ile371, Glu342, Phe346 and Glu372 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 5, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Leu315, Lys347, and Lys343 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 14, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys343, Lys347, and Glu372 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 2, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys347, Glu342, Phe346 and Glu372 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 8, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Thr373, Lys369, and Lys343 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 10, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Leu315, Ile371, and Lys343 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 6, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Ile371, Glu372, Lys343, Phe346, and Lys347 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 13, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Glu372 and Thr373 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 15, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
In embodiments of any aspect, the non-covalent interactions comprise an interaction with two or more (such as all of) Lys343, Phe346, and Lys347 of SEQ ID NO: 1. A potent inhibitor of Tau aggregation, compound 18, was identified by the inventors as binding the Tau protein by establishing interactions with the above-mentioned residues, thereby stabilising a conformation of the protein that has reduced propensity to aggregate.
The methods of the first and second aspect may comprise determining whether the candidate compound is able to simultaneously form non-covalent molecular interactions with Lys343 and Glu372 of SEQ ID NO:1.
Multiple compounds found to have a strong effect in modulating Tau aggregation were predicted to form stabilising interactions with Lys343 and Glu372, stabilising the Tau protein in a conformation that include the binding pocket exposing these residues.
In embodiments, a non-covalent molecular interaction with Lys343 comprises a cation-pi interaction and/or a hydrogen bond and/or a pi-H interaction. In some embodiments, a cation-pi interaction with Lys343 is between an aromatic ring of the candidate compound and the sidechain amino group of Lys343. In some embodiments, a hydrogen bond with Lys343 is between an acceptor group of the candidate compound and the backbone amino group of Lys343. In some embodiments, a hydrogen bond with Lys343 is between a donor group of the candidate compound and the backbone carbonyl of Lys343. In some embodiments, a pi-H interaction with Lys343 is between an aromatic ring of the candidate compound and the sidechain Cp of Lys343. In some embodiments, a pi-H interaction with Lys343 is between an aromatic ring of the candidate compound and the backbone amino group of Lys343. In some embodiments, a pi-H interaction with Lys343 is between an aromatic ring of the candidate compound and the sidechain CE of Lys343.
In embodiments, a non-covalent interaction with Ser352 comprises a pi-H interaction. In some such embodiments, a pi-H interaction with Ser352 is between an aromatic ring of the candidate compound and the backbone carbonyl of Ser352.
In embodiments, the non-covalent interaction with any of Leu315, Ser341, Phe346, Lys347, Ile371, Glu372, and Thr373, of SEQ ID NO:1 is a hydrogen bond.
In embodiments, the non-covalent interaction with Glu372 is a hydrogen bond. In some such embodiments, a hydrogen bond with Glu372 is between a H donor group of the candidate compound and the sidechain carboxyl (OE1) of Glu372. In some embodiments, a hydrogen bond with Glu372 is between an acceptor group of the candidate compound and the backbone amino group of Glu372. In some embodiments, a hydrogen bond with Glu372 is between an acceptor group of the candidate compound and the sidechain amino group of Glu372.
In embodiments, the non-covalent interaction with Lys347 is a hydrogen bond. In embodiments, the hydrogen bond with Lys347 is between an acceptor group of the candidate compound and the backbone amino group of Lys347.
In embodiments, the non-covalent interaction with Thr373 is a hydrogen bond. In embodiments, a hydrogen bond with Thr373 is between a donor group of the candidate compound and the hydroxyl group of the side-chain of Thr373.
In embodiments, the non-covalent interaction with Leu315 is a hydrogen bond. In embodiments, a hydrogen bond with Leu315 is between a donor group of the candidate compound and the backbone carbonyl of Leu315.
In embodiments, the candidate compound is able to form a non-covalent interaction with Phe378. In embodiments, the non-covalent interaction is a pi-interaction.
In embodiments, the non-covalent interaction with Ile371 is a hydrogen bond. In embodiments, a hydrogen bond with Ile371 is between an acceptor group of the candidate compound and the backbone Ca of Ile371.
In embodiments, the non-covalent interaction with Ser341 is a hydrogen bond. In embodiments, a hydrogen bond with Ser341 is between an acceptor group of the candidate compound and the sidechain Cp of Ser341.
In embodiments, the non-covalent interaction with Phe346 is a hydrogen bond. In embodiments, a hydrogen bond with Phe346 is between an acceptor group of the candidate compound and the backbone Ca of Phe346.
In embodiments, the non-covalent interaction with Glu342 is a hydrogen bond. In some embodiments, a hydrogen bond with Glu342 is between an acceptor group of the candidate compound and the backbone amino group of Glu342.
In embodiments, the non-covalent interaction with Lys369 is a hydrogen bond. In embodiments, a hydrogen bond with Lys369 is between an acceptor group of the candidate compound and the backbone CE of Lys369. In embodiments, a hydrogen bond with Lys369 is between an acceptor group of the candidate compound and the backbone amino group of Lys369.
In embodiments, the interactions with any of Val350, Leu315, Ile354, Ile371, Phe378 and Phe346 are hydrophobic interactions.
In embodiments, the compound is for inhibiting the aggregation of a Tau protein or a truncated form thereof into paired helical filaments.
The compound may be a small molecule, a peptide, a polypeptide or a combination thereof. Preferably, the compound is a small molecule.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 comprises a binding pocket that is capped by Phe378 and Phe346, exposes the hydrophobic side chains of residues Val350, Leu315, Ile354 and/or Ile371, and contains residues Leu315, Lys343, Lys347, Glu372, and/or Thr373 capable of forming hydrogen bonds to a molecule within the binding pocket.
In embodiments, the candidate compound is able to simultaneously form non-covalent molecular interactions with one or more of residues Leu315, Ser341, Glu342, Lys343, Phe346, Lys347, Ser352, Lys369, Ile371, Glu372, and Thr373, of SEQ ID NO:1, or equivalent amino acids in a variant or derivative, without exposing hydrophilic groups at a distance below 4 Å from the hydrophobic side chains of residues Val350, Leu315, Ile354 and Ile371.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 comprises a hairpin loop between residues Val337 and Gly355 of SEQ ID NO:1.
The present inventors have identified that a compact folded state that is able to interact with inhibitors of Tau aggregation such as LMT, and is stabilised by this interaction, comprises a hair pin loop between residues Val337 and Gly355 of SEQ ID NO:1.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 comprises the sequence Pro364-Gly367 located between a loop formed by the sequence Tyr219-Lys331 and the sequence Pro332-Gly335.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 is stabilised at least in part by hydrogen bonds between Glu342 and Val318 and/or Thr319, between Gly367 and Lys340, between Asn368 and Lys340, between Lys369 and Glu372, between Lys370 and Asp358, between Ile371 and Ser316, between Glu372 and Ser356, between Thr373 and Gln351, Ser316, between His374 and Gln351, and between Lys375 and Gln351.
The three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 may be that represented by the structure co-ordinates in Table 1, or a structure modelled on these coordinates.
The three-dimensional structure of the part of the Tau protein may be obtainable by performing molecular dynamics simulations of a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 in the presence of LMT to obtain one or more complex conformations and selecting a three-dimensional structure of the part of the Tau protein by applying a stability criterion and a binding affinity criterion to the one or more complex conformations.
Advantageously, the stability criterion may apply to the distance between complex conformations in consecutive frames of the molecular dynamics simulation after a predetermined amount of time, and/or the binding affinity criterion may apply to the value of a placement scoring function <−80 kcal/mol or a scoring function <−8.5 as determined by the GB IV scoring function within the CCG MOE docking software.
In embodiments, the stability criterion applies to the root-mean-square deviation (RMSD) of atomic positions, for atoms in the backbone of the part of protein in the complexes, between consecutive frames of the molecular dynamics simulation.
In embodiments, the stability criterion applies to the root-mean-square deviation (RMSD) of atomic positions, for atoms in LMT. In embodiments, a stability criterion may be defined as a threshold on the average RMSD of atomic positions, for atoms in the backbone of the part of protein in the complexes, between consecutive frames of the molecular dynamics simulation, where the average is calculated over a predetermined set of frames of a MD simulation. For example, the predetermined set of frames may be defined as the last 10% of frames, the last 5% of frames, or the last 1% of frames in a MD simulation. The predetermined set of frames may be defined as the last 30 k frames, the least 25 k frames, the last 20 k frames, the last 15 k frames, or the last 10 k frames. The threshold on the average RMSD may be chosen as about 0.5 Å, about 0.6 Å, about 0.7 Å, about 0.8 Å or about about 0.9 Å.
In particular, the present inventors have found that a stable LMT-bound conformation could be obtained by running molecular dynamics simulation of a part of the Tau protein in the presence of LMT, and excluding those complexes where after a predetermined amount of simulation time, the LMT was no longer tightly bound (as indicated by the docking score or ligand RMSD fluctuations) and/or the RMSD fluctuations between consecutive frames of the simulation indicated that the conformation was not stable.
In embodiments, the predetermined amount of time is at least 50 ns, at least 100 ns, or about 100 ns.
In embodiments, the methods described herein further comprises designing a pharmacophore model, wherein the pharmacophore includes features representative of non-covalent molecular interactions with two or more of: Leu315, Ser341, Glu342, Lys343, Phe346, Lys347, Val350, Ser352, Ile354, Lys369, Ile371, Glu372, Phe378 and Thr373, of SEQ ID NO:1.
In embodiments, designing a pharmacophore model comprises computing the interaction energy between the three-dimensional structure of each candidate compound in a library of candidate compounds and the three-dimensional structure of at least a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 or a variant or derivative thereof that is structurally equivalent to said region, selecting a subset of the library that has an interaction energy below a threshold, and deriving a pharmacophore model based on the selected subset.
The part of the Tau protein may comprise amino acids 306-378 of SEQ ID NO:1. The part of the Tau protein may comprise amino acids 297-391 of SEQ ID NO:1. The part of the Tau protein may comprise amino acids 295-391 of SEQ ID NO:1. The part of the Tau protein may consist of amino acids 297-391 of SEQ ID NO:1. The part of the Tau protein may consist of amino acids 295-391 of SEQ ID NO:1. The part of the Tau protein may consist of amino acids 306-378 of SEQ ID NO:1. In embodiments, the part of the Tau protein comprises amino acids 306-378 of SEQ ID NO:1 (also referred to herein as dGAE73, SEQ ID NO: 4) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises amino acids 297-391 of SEQ ID NO:1 (also referred to herein as dGAE, SEQ ID NO: 3) or a variant or derivative thereof that is structurally equivalent to said part. In embodiments, the part of the Tau protein comprises amino acids 295-391 of SEQ ID NO:1 (also referred to herein as Tau97, SEQ ID NO: 5) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises the full sequence of the human Tau isoform 2N4R (SEQ ID NO: 1) or a homolog or variant thereof. In embodiment, a variant has at least 90% amino acid identity with the sequence of SEQ ID NO: 1, or the portion thereof that is represented in the three-dimensional structure.
The methods may further comprise repeating the steps of comparing and determining with a further candidate compound that differs from the previous candidate compound in at least one substituent.
In embodiments, one or more candidate compounds may be modelled such that their properties can be compared to thereby define the features that advantageously stabilise the ligand-bound conformation of the part of the Tau protein. Comparing the properties of multiple candidate compounds may comprise comparing, using computational molecular modelling means, the characteristics of the complexes comprising the part of the Tau protein and each of the candidate compounds in terms of e.g. stability, binding affinity, etc. Comparing the properties of multiple candidate compounds may comprise performing one or more experiments to quantify the Tau aggregation modulation associated with each candidate compound.
In embodiments, such comparisons may be performed sequentially to progressively optimise a candidate compound, where the information obtained by comparing a candidate compound to a previously obtained candidate compound is used to design a new candidate compound in a subsequent iteration. Instead or in addition to this, such comparisons may be performed in parallel where multiple candidate compounds are simultaneously compared to derive insights from common properties of subsets of candidate compounds tested. Such insights can be used to design one or more further candidate compounds.
Without wishing to be bound by theory, it is believed that such a process may help to define the features of a candidate compound that optimally modulates Tau aggregation by stabilising a compact folded conformation of the Tau protein.
Comparing the three-dimensional structure of a candidate compound with a three-dimensional structure of at least a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 or a variant or derivative thereof that is structurally equivalent to said region may comprise computing the interaction energy between the candidate compound and the part of the Tau protein represented in the three-dimensional structure.
According to a further aspect, there is provided a computer-implemented method for evaluating the ability of a candidate compound to bind to a binding pocket of a Tau protein, the method comprising the steps of:
In embodiments, step (c) comprises determine whether the candidate compound is able fit at least in part within the binding pocket and form non-covalent molecular interactions with one or more of Lys343, and at least one of Leu315, Ser341, Glu342, Lys343, Phe346, Lys347, Val350, Ser352, Ile354, Lys369, Ile371, Glu372, Phe378 and Thr373, of SEQ ID NO:1, or equivalent amino acids in a variant or derivative.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 is that represented by the structure co-ordinates shown in Table 1, or a structure modelled on these coordinates.
In embodiments, the compound is a small molecule, a peptide, a polypeptide or a combination thereof.
In embodiments, a candidate compound is considered able to bind to the binding pocket if it is able to establish non-covalent interactions with amino acids in the binding pocket. In embodiments, the non-covalent interactions are selected from: hydrophobic interactions, hydrogen bonds, pi-cation interactions, and pi-stack interactions.
In embodiments, a candidate compound that is considered able to bind to the binding pocket is predicted to inhibit the aggregation of the Tau protein or a truncated form thereof. In embodiments, a candidate compound that is considered able to bind to the binding pocket is predicted to inhibit the aggregation of the Tau protein or a truncated form thereof into paired helical filaments.
In embodiments, the method further comprises evaluating the ability of a further candidate compound to bind the Tau protein at a different site from the binding site of the first candidate compound.
In embodiments, evaluating the ability of a further candidate compound to bind the Tau protein at a different site from the binding site of the first candidate compound comprises computing the interaction energy between the further candidate compound and the three-dimensional structure of the part of the Tau protein and determining whether the interaction between the further candidate compound and the Tau protein further stabilises the conformation of the complex between the first candidate compound and the Tau protein.
In embodiments, the first candidate compound is a small molecule or a peptide, and the further candidate compound is a peptide or a polypeptide, such as an antibody or fragment thereof.
In embodiments, the candidate compound is able to bind to the pocket if it is able to simultaneously form non-covalent molecular interactions with two or more of Leu315, Ser341, Glu342, Lys343, Phe346, Lys347, Val350, Ser352, Ile354, Lys369, Ile371, Glu372, Phe378 and Thr373, of SEQ ID NO:1.
In embodiments, the candidate compound is able to bind to the pocket if it is able to simultaneously form non-covalent molecular interactions with Lys343 and at least one of Leu315, Lys347, Glu372 and Thr373.
In some cases, the candidate compound is able to bind to the pocket if it is able to simultaneously form non-covalent molecular interactions with Lys343 and Glu372 of SEQ ID NO:1.
In embodiments, the binding pocket is capped by Phe378 and Phe346, exposes the hydrophobic side chains of residues Val350, Leu315, Ile354 and/or Ile371, and contains residues Leu315, Lys343, Lys347, Glu372, and/or Thr373 capable of forming hydrogen bonds to a molecule within the binding pocket.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 comprises a hairpin loop between residues Val337 and Gly355 of SEQ ID NO:1.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 comprises the sequence Pro364-Gly367 located between a loop formed by the sequence Tyr219-Lys331 and the sequence Pro332-Gly335.
In embodiments, the three-dimensional structure of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 is stabilised at least in part by hydrogen bonds between Glu342 and Val318, Thr319, between Gly367 and Lys340, between Asn368 and Lys340, between Lys369 and Glu372, between Lys370 and Asp358, between Ile371 and Ser316, between Glu372 and Ser356, between Thr373 and Gln351, Ser316, between His374 and Gln351, between Lys375 and Gln351.
In embodiments, the three-dimensional structure coordinates of a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 or equivalent amino acids in a variant or derivative thereof that is structurally equivalent to said part have been obtained by performing molecular dynamics simulations of a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 in the presence of LMT to obtain one or more complex conformations and selecting a complex conformation using a stability criterion and a binding affinity criterion.
In embodiments, step (a) receiving the three-dimensional structure coordinates of a part of the Tau protein comprises obtaining the three-dimensional structure coordinates of a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 or equivalent amino acids in a variant or derivative thereof that is structurally equivalent to said part by:
In embodiments, selecting a complex conformation using a stability criterion comprises:
In embodiments, the predetermined amount of time is at least 50 ns, at least 100 ns, or about 100 ns.
In embodiments, selecting a complex conformation using a binding affinity criterion comprises computing the interaction energy between the candidate compound and the part of the Tau protein represented in the three-dimensional structure.
In embodiments, performing molecular dynamics simulations of a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 in the presence of LMT comprises obtaining the three-dimensional structure coordinates of a part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 or equivalent amino acids in a variant or derivative thereof that is structurally equivalent to said part, wherein the three-dimensional structure coordinates correspond to a conformation of the Tau protein as part of a PHF stack. In embodiments, the three-dimensional structure coordinates correspond to those of PDB ID: 5O3L or a structure modelled on this structure.
The part of the Tau protein may comprise amino acids 306-378 of SEQ ID NO:1. The part of the Tau protein may comprise amino acids 297-391 of SEQ ID NO:1. The part of the Tau protein may comprise amino acids 295-391 of SEQ ID NO:1. The part of the Tau protein may consist of amino acids 297-391 of SEQ ID NO:1. The part of the Tau protein may consist of amino acids 295-391 of SEQ ID NO:1. The part of the Tau protein may consist of amino acids 306-378 of SEQ ID NO:1.
In embodiments, the part of the Tau protein comprises amino acids 306-378 of SEQ ID NO:1 (also referred to herein as dGAE73, SEQ ID NO: 4) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises amino acids 297-391 of SEQ ID NO:1 (also referred to herein as dGAE, SEQ ID NO: 3) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises amino acids 295-391 of SEQ ID NO:1 (also referred to herein as Tau97, SEQ ID NO: 5) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises the full sequence of the human Tau isoform 2N4R (SEQ ID NO: 1) or a homolog or variant thereof. In embodiment, a variant has at least 90% amino acid identity with the sequence of SEQ ID NO: 1, or the portion thereof that is represented in the three-dimensional structure.
In embodiments, the three-dimensional structure coordinates of the part of the Tau protein comprising amino acids 315-378 of SEQ ID NO:1 or equivalent amino acids in a variant or derivative thereof that is structurally equivalent to said part correspond to a conformation that has the following structural characteristics:
In embodiments, the three-dimensional structure coordinates of the part of the Tau protein is such that the distance (RMSD) between the backbone carbonyl oxygen of Gln351 and the sidechain hydroxyl oxygen of Thr373 during the final 10 ns of a 50 ns simulation is between 2 Å and 4 Å, or below 5 Å.
In embodiments, the three-dimensional structure coordinates of the part of the Tau protein is such that the distance (RMSD) between the sidechain carbonyl oxygen of Gln351 and the sidechain amine nitrogen of His374 during the final 10 ns of a 50 ns simulation is between 2 Å and 4 Å, or below 5 Å.
In embodiments, the three-dimensional structure coordinates of the part of the Tau protein is such that the distance (RMSD) between the sidechain carbonyl oxygen of Gln351 and the backbone amine nitrogen of Lys375 during the final 10 ns of a 50 ns simulation is between 2.5 Å and 5 Å, or below 5 Å.
In embodiments, the three-dimensional structure coordinates of the part of the Tau protein is such that the distance (RMSD) between the backbone carbonyl oxygen of Arg349 and the hydroxyl sidechain oxygen of Thr377 during the final 10 ns of a 50 ns simulation is between 2.5 Å and 4 Å, or below 5 Å.
In embodiments, the three-dimensional structure coordinates of the part of the Tau protein is such that the distance (RMSD) between the carboxylic acid sidechain oxygen of Glu372 and the backbone NH of Ser356 during the final 10 ns of a 50 ns simulation is between 2 Å and 4 Å, or below 5 Å.
According to a further aspect, there is provided a method for selecting or designing a compound for modulating the aggregation of a Tau protein or a truncated form thereof, the method comprising:
In embodiments, residue Pro332 switching between a trans and a cis configuration causes residues His329, His330 and Lys331 to move close enough to PHF to establish interactions with the partner residues in the next layer of the PHF stack through hydrophobic stacking and a strong hydrogen bond between the side chains of His330 in a preformed layer of the PHF stack and Thr361 of the newly formed dGAE layer on the PHF stack.
In embodiments, step (iii) comprises the following steps: (iii)(1) residues 355-360 move to establish a zipper with the corresponding anti-parallel residues of the PHF; (iii)(2) residues 361-367 move to establish a zipper with the corresponding anti-parallel residues of the PHF; and (iii)(3) residues 306-318 and 368-378 move to form a cross-beta sheet conformation with corresponding residues of the PHF.
In embodiments, residues 355-360 move to establish a zipper with the corresponding anti-parallel residues of the PHF from the C- to N-direction.
In embodiments, residues 361-367 move to establish a zipper with the corresponding anti-parallel residues of the PHF in the N- to C-direction.
In embodiments, the cross-beta sheet conformation is formed simultaneously starting from Phe378 and closing the residues from C- to N-terminal, and joining residues 318-306 of the N-terminal in the C- to N-direction.
In embodiments, the compound is a small molecule, a peptide, a polypeptide or a combination thereof. In embodiments, the polypeptide is an antibody or a fragment thereof.
In embodiments, the method further comprises selecting or designing a further compound for modulating the aggregation of a Tau protein or a truncated form thereof by generating a model of a complex of the first compound, the further compound and the intermediate.
In embodiments, the further compound is a small molecule, a peptide, a polypeptide or a combination thereof. In embodiments, the polypeptide is an antibody or a fragment thereof.
The compact folded state may have any of the structural characteristics defined in relation to the preceding aspect.
The compact folded state may have the structure co-ordinates shown in Table 1, or a structure modelled on these coordinates.
Generating a model of a complex of the compound and the intermediate may comprise identifying the compound as binding to the intermediate and preventing the occurrence of any of steps (i) to (v).
Generating a model of a complex of the compound and the intermediate may comprise identifying the compound as binding to the intermediate and preventing the occurrence of any of steps (i) to (iii).
In embodiments, simulating the conformational changes of the part of the Tau protein from a compact folded state to an aggregated state comprises performing a molecular dynamics simulation of the assembly of a Tau protein or a part thereof with a PHF stack. In embodiments, the PHF stack comprises at least 2, at least 4, at least 6, at least 8 or at least 10 monomers. In embodiments, the PHF stack comprises the structure defined in PDB ID: 5O3L, or a structure modelled from the structure in PDB ID: 5O3L.
In embodiments, the part of the Tau protein comprises amino acids 306-378 of SEQ ID NO:1 (also referred to herein as dGAE73, SEQ ID NO: 4) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises amino acids 297-391 of SEQ ID NO:1 (also referred to herein as dGAE, SEQ ID NO: 3) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, the part of the Tau protein comprises amino acids 295-391 of SEQ ID NO:1 (also referred to herein as Tau97, SEQ ID NO: 5) or a variant or derivative thereof that is structurally equivalent to said part.
In embodiments, simulating the conformational changes of the part of the Tau protein from a compact folded state to an aggregated state comprises performing a first set of molecular dynamics simulation of the assembly of a Tau protein or a part thereof with a PHF stack, in which the anchoring of the part of the Tau protein to the PHF stack is determined by starting the molecular dynamics simulations in the set with different relative orientations of the part of the Tau protein and the PHF stack.
In embodiments, the molecular dynamics simulation in the set are each started with a random orientation of the part of the Tau protein. Preferably, the molecular dynamics simulation in the set are each started with the part of the Tau protein placed beyond hydrogen-bonding distance (such as e.g. at least 4 Å away) of the PHF stack.
The present inventors have found that such molecular dynamics simulation enabled the identification of an intermediate in which the part of the Tau protein has anchored itself on the PHF stack. Such an intermediate may correspond to one where step (i) has occurred.
In embodiments, simulating the conformational changes of the part of the Tau protein from a compact folded state to an aggregated state comprises performing a nudged elastic band molecular dynamics simulation starting from an intermediate in which step (i) has occurred.
According to a further aspect, there is provided a computing system comprising a processor and a memory storing machine-readable instructions that, when executed by the processor, cause the processor to implement the method of any embodiment of the aspects described above.
According to a further aspect, there is provided a computer readable storage medium (or a plurality of computer readable storage media) storing instructions that, when executed by one or more processors, cause the processor(s) to implement the method of any embodiment of the aspects described above.
According to a further aspect, there is provided a computer software product comprising instructions that, when executed by one or more processors, cause the processor(s) to implement the method of any embodiment of the aspects described above.
All residue numbers of the Tau protein sequence and structure in the present disclosure refer to the residues of SEQ ID NO:1, which is the sequence of the four repeat isoform 2N4R of human Tau protein (Uniprot ID P10636-8), or homologous positions in other species or variants thereof. Human Tau isoform 2N4R (Uniprot ID P10636-8) corresponds to amino acids 1-124, 376-394 and 461-758 of full length Tau, Uniprot ID P10636 or P10636-1, provided as SEQ ID NO:2.
References to residues Leu315, Phe346, Lys347, Val350, Ile354, Ile371, Thr373, and Phe378, refer to the residues in bold and underline in the sequence of SEQ ID NO:1 reproduced below (or the corresponding residues in any fragment, homologue or variant thereof, including e.g. dGAE, Tau97 or dGAE73 as defined below):
dGAE refers to the 95 residues fragment of Tau (2N4R) with N-terminus at residue Ile-297 and C-terminus at residue Glu-391, as described in SEQ ID NO: 3, or at homologous positions in other species (the residues mentioned referring to the human or mouse Tau sequence, which are identical in this region). As will be apparent to the skilled person, dGAE95 also corresponds to the fragment of the large peripheral nervous system (PNS) Tau isoform (P10636-1) with N-ter at Ile-614 and C-ter at Glu-708. This sequence may sometimes be referred to simply as “dGAE”.
dGAE73 refers to the fragment of Tau (2N4R) with N-terminus at residue Val-306 and C-terminus at residue Phe-378, as described in SEQ ID NO: 4, or at homologous positions in other species (the residues mentioned referring to the human or mouse Tau sequence, which are identical in this region). As will be apparent to the skilled person, dGAE73 also corresponds to the fragment of Isoform PNS-Tau (P10636-1) with N-ter at Val-623 and C-ter at Phe-695.
Tau97 refers to the 97 residues fragment of Tau (2N4R) with N-terminus at residue Asp-295 and C-terminus at residue Glu-391, as described in SEQ ID NO: 5, or at homologous positions in other species (the residues mentioned referring to the human or mouse Tau sequence, which are identical in this region). As will be apparent to the skilled person, Tau97 also corresponds to the fragment of the large peripheral nervous system (PNS) Tau isoform (P10636-1) with N-ter at Asp-612 and C-ter at Glu-708. This sequence may also be referred to as “dGAE97” of “AspAsn-dGAE” as it includes the 95 amino acids dGAE and the two preceding amino acids Asp and Asn.
In the context of the present disclosure, polypeptides are considered to be “structurally equivalent” if they are able to fold into conformations that have the same three-dimensional interaction properties. In particular, a polypeptide that folds into at least one conformation that has a binding pocket is structurally equivalent to another polypeptide if the latter folds into at least one conformation that has a binding pocket, where the binding pockets of the two polypeptides can be stabilised by forming molecular interactions with the same ligand at corresponding positions in the ligand and the amino acid sequences of the respective polypeptides. For example, two polypeptides may be structurally equivalent if the primary sequence of the first polypeptide differs from the primary sequence of the second polypeptides such that the two polypeptides have at least one predicted conformation that forms a binding pocket, and the interaction properties of the binding pocket are not materially affected by the difference in primary sequence. This may be the case for example where one or more substitutions are made that do not change the conformation of the binding pocket and that do not affect residues that establish stabilising interactions with the ligand, or do not affect said residues in such a way that the stabilising interaction is no longer present. In particular, a variant of dGAE (or Tau97 or dGAE73) may be considered to be structurally equivalent to dGAE (or Tau97 or dGAE73) if it is able to fold in a conformation that forms a binding pocket that comprises multiple hydrophobic side chains and is able to accommodate LMT in a pose where LMT forms stabilising interactions with the amino acids in the variant equivalent to Lys343 and Thr373 of dGAE (or Tau97 or dGAE73).
Aggregation of the tau protein is a hallmark of diseases referred to as “tauopathies”. Various tauopathy disorders that have been recognized which feature prominent tau pathology in neurons and/or glia and this term has been used in the art for several years. The similarities between these pathological inclusions and the characteristic tau inclusions in diseases such as AD indicate that the structural features are shared and that it is the topographic distribution of the pathology that is responsible for the different clinical phenotypes observed. In particular, cryo-electron microscope structures of aggregated Tau in AD, Frontotemporal dementia (including Pick's disease), chronic traumatic encephalopathy (CTE) and cortico-basal degeneration (CBD) have been previously obtained, and all show common conformational features, indicating that compounds that have the ability to modulate Tau aggregation in e.g. PHFs (as observed in AD), may also modulate aggregation of Tau in other tauopathies. In addition to specific diseases discussed below, those skilled in the art can identify tauopathies by combinations of cognitive or behavioural symptoms, plus additionally through the use of appropriate ligands for aggregated tau as visualised using PET or MRI, such as those described in WO02/075318.
Aspects of the present invention relate to “tauopathies”. As well as Alzheimer's disease (AD), the pathogenesis of neurodegenerative disorders such as Pick's disease and Progressive Supranuclear Palsy (PSP) appears to correlate with an accumulation of pathological truncated tau aggregates in the dentate gyrus and stellate pyramidal cells of the neocortex, respectively. Other dementias include fronto-temporal dementia (FTD); parkinsonism linked to chromosome 17 (FTDP-17); disinhibition-dementia-parkinsonism-amyotrophy complex (DDPAC); pallido-ponto-nigral degeneration (PPND); Guam-ALS syndrome; pallido-nigro-luysian degeneration (PNLD); cortico-basal degeneration (CBD); Dementia with Argyrophilic grains (AgD); Dementia pugilistica (DP) wherein despite different topography, NFTs are similar to those observed in AD (Bouras et al., 1992); Chronic traumatic encephalopathy (CTE), a tauopathy including DP as well as repeated and sports-related concussion (McKee, et al., 2009). Others are discussed in Wischik et al. 2000, for detailed discussion—especially Table 5.1).
Abnormal tau in NFTs is found also in Down's Syndrome (DS) (Flament et al., 1990), and in dementia with Lewy bodies (DLB) (Harrington et al., 1994). Tau-positive NFTs are also found in Postencephalitic parkinsonism (PEP) (Charpiot et al., 1992). Glial tau tangles are observed in Subacute sclerosing panencephalitis (SSPE) (Ikeda et al., 1995). Other tauopathies include Niemann-Pick disease type C (NPC) (Love et al., 1995); Sanfilippo syndrome type B (or mucopolysaccharidosis Ill B, MPS Ill B) (Ohmi, et al., 2009); myotonic dystrophies (DM), DM1 (Sergeant, et al., 2001 and references cited therein) and DM2 (Maurage et al., 2005). Additionally there is a growing consensus in the literature that a tau pathology may also contribute more generally to cognitive deficits and decline, including in mild cognitive impairment (MCI) (see e.g. Braak, et al., 2003, Wischik et al., 2018).
All of these diseases, which are characterized primarily or partially by abnormal tau aggregation, are referred to herein as “tauopathies” or “diseases of tau protein aggregation”. In aspects of the invention relating to tauopathies, preferably the tauopathy is selected from the list consisting of the indications above, i.e., AD, PSP, FTD (including Pick's disease), FTDP-17, DDPAC, PPND, Guam-ALS syndrome, PNLD, and CBD and AgD, DS, SSPE, DP, PEP, DLB, CTE and MCI. In one preferred embodiment the tauopathy is Alzheimer's disease (AD). Without wishing to be bound by theory, the present inventors believe that all structures solved for tauopathies encompass the dGAE region of Tau. As such, findings in relation to stabilising a conformation of dGAE (or Tau97) that is not prone to assembly can reasonably be expected to apply to all tau diseases including but not limited to AD.
Aspects of the present disclosure relate to methods for identifying, selecting and/or designing a compound for modulating Tau aggregation, which make use of computer-implemented molecular modelling means.
As illustrated in
As illustrated on
Candidate compounds may be selected using in silico methods known in the art. For example, in silico methods via substructure search may be employed. The structure of a known modulator of Tau aggregation, such as e.g. LMT, may be used, and various structural features of the compound (such as e.g. hydrophobic features, H-bond acceptor or donor features, etc.) may be submitted to a program that will search through libraries of chemical compounds for chemicals with substructures that have similar features. The candidate structures may then optionally be reviewed by an expert, for example in order to remove those compounds that are too large. Selected candidates may be submitted to a program that creates structural coordinates for the compounds, such as e.g. AMBER. Docking and scoring of these candidates may be performed using e.g. the MOE software from CCG.
Selected or designed compounds may further be synthesised or obtained and tested for their ability to modulate Tau aggregation, for example in a cell-based aggregation assay as described in Rickard et al., 2017.
Molecular dynamics simulations may be performed as known in the art, for example using the tools available as part of the AMBER software.
Overview
In this example, the conformational landscape of dGAE monomers in an aqueous environment was studied using explicit water molecular dynamics simulations based on the model from Fitzpatrick et al. (2017)—from PDB identifier 5O3L (https://www.rcsb.org/structure/5O3L). In particular, a slightly extended version of dGAE (Tau97) was used. However, the results herein are believed to be unaffected by the presence of 2 additional N-terminal residues, and equally apply to dGAE. As such, any references to “Tau97” in this example apply equally to dGAE.
Methods
Protein molecular dynamics (MD) simulations (see
The structural model was then refined using an energy minimisation process, by means of molecular mechanics using the AMBER force field. Proteins were parameterised in LEaP (a module from the Amber suite of programmes which generates force field files for use with the molecular dynamics packages of Amber) using the AMBER ff14SB force field (Maier et al., 2015). The complexes were neutralised by addition of an appropriate number of chloride counter ions and immersed in a truncated octahedral box of pre-equilibrated TIP3P water molecules (Jorgensen, 1982; Jorgensen et al., 1983). Each water box extended 8 Å away from any solute atom. At each stage, parameter, topology and coordinate files were saved. The cut-off distance for non-bonded interactions was 10 Å. Periodic boundary conditions were applied and electrostatic interactions were represented using the smooth PME method (smooth particle mesh Ewald method; Darden et al., 1998), with constant volume conditions applied. Minimisations were performed using the sander module (MD simulation engine) of AMBER 17 package (Case et al., 2017). The simulation protocol involved initial solvent and ion density equilibration and minimisation. A total of 2,000 minimisation steps were performed; initially 1,000 steps of steepest descent, followed by 1,000 steps of conjugate gradient minimisation. The cut-off distance for non-bonded interactions was 10 Å and a force constant of 500 kcal/mol/Å2 was used to restrain the protein. The entire system was then subjected to 2,500 steps of minimisation without restraints.
Following protein refinement using energy minimisation, a 200 ps heating phase with a cut-off distance for non-bonded interactions of 10 Å and a force constant of 10 kcal/mol/Å2 was used to restrain the protein, before a 200 ps equilibration run. At this point, 30 independent replicas were generated by randomly assigning different sets of velocities (adjusted to a temperature of 300 K) to the initial coordinates.
For each replica a 60 ns unrestrained trajectory was simulated at 300 K temperature and 1 atm pressure. SHAKE (Ryckaert et al., 1977; a constraint algorithm that can be applied to molecular dynamics simulations to ensure that constraints such as bond length, bond angle and torsion angle constraints are satisfied—in this case the algorithm constrains the vibrational stretching of hydrogen bond lengths, fixing the bond distance to equilibrium value) was applied to all bonds involving hydrogen atoms, allowing an integration step of 2 fs. System coordinates were saved every 100 ps for further analysis.
Analysis of conformational flexibility (
Results
A total of 30 independent 50 ns molecular simulations were set up and analysed using statistical methods to see whether protein folds followed a similar trajectory or whether protein dynamics would fall into clusters. Principal component analysis (PCA) of the molecular trajectories was performed, identifying 19 clusters by clustering in 6 dimensions (see
Overview
Molecular docking was performed using LMT to identify stable LMT-bound conformations of dGAE. As above, a slightly extended version of dGAE (Tau97) was used. However, the results herein are believed to be unaffected by the presence of 2 additional N-terminal residues, and equally apply to dGAE. As such, any references to “Tau97” in this example apply equally to dGAE.
Methods
Ligand parametrisation: Optimised structures and electrostatic potentials for the ligands were calculated at the MP2/6-311 G* level using General Atomic and Molecular Structure System (GAMESS) (Schmidt et al., 1993; Guest et al., 2005). Non-standard amino acid residues and ligands not described in the parameterisation libraries in LEaP (Schafmeister et al., 1995) were parameterised using the AMBER utility antechamber (Wang et al., 2006) and general AMBER force field (GAFF) (Wang et al., 2004). Ligand parameterisations were checked against the QM calculations by in vacuo MD simulations with sander (Crowley et al., 1997; Pearlman et al., 1995).
Docking and scoring (
Iterative MD sampling of LMT bound conformations (
Results
In particular, protein docking was performed using LMT against the 178 protein conformations identified in Example 1. This generated 15,090 ligand poses which were refined based on the placement scoring function, with a cutoff of −80 kcal/mol) as determined by visual inspection to 65 candidate binding modes with a docking score <−8.5. The docking scores are based on the assumption that the protein is in a stable conformation, which may not be the case. Therefore, the protein ligand complexes were minimised in explicit solvent and then a 1 ns molecular dynamics run was performed. Complexes that were observed to dissociate from the complex during the 1 ns simulation (based on the RMSD of the ligand during the minimization as well as the ligand binding energy as determined from the docking scoring function) were excluded. This excluded 21 complexes where the ligand was weakly bound. The remaining complexes were subject to a further 10 ns molecular dynamics simulation. Complexes where the LMT ligands had dissociated from Tau97 at that point were eliminated (16 complexes). Further rounds of 10 ns simulation were performed, followed by calculation of the RMSD and ligand binding energy and elimination of the complexes where the protein was too flexible and/or the ligand not tightly bound as determined by visual inspection of the trajectories.
After 40 ns (i.e. a total of 50 ns), 22 complexes remained. PCA analysis of these 22 complexes was performed to assess which complexes were exhibiting similar dynamics.
Simulations were continued for a total of 100 ns. A single complex was identified where the ligand remained tightly bound and the protein structure did not vary greatly after a 100 ns of molecular dynamics simulation time, with an average RMSD between frames varying by about 0.5±0.1 Å on average (as shown on
The structure of complex 3 is stabilized by a number of strong hydrogen bonds (see
The LMT-bound conformation is a compact folded state with no beta sheets (or at least no beta sheets that are intrinsic to the stability of the LMT-bound complex). In other words, the LMT-bound conformation is a compact folded state where any beta sheets that may be present would be in the N or C-termini (which are dynamic sections and could adopt beta sheet conformations at least transiently) and would not be intrinsic to the stability of the LMT-bound conformation of the Tau97/dGAE protein. Without wishing to be bound by theory, the present inventors believe that the absence of beta sheets in this conformation that contributes to inhibition of assembly of the protein into oligomers, such as e.g. PHF. This compact folded state is very different from the structured conformation in PHFs. The compact folded state shows a tight hair pin loop between residues Val337-Gly355. Residues Val363-Gly367 contain a PGGG sequence which is threaded between the PGGG sequence Pro332-Gly335 and a loop formed by the sequence Thr319-Lys331. Further, residues Lys369-Thr377 are sandwiched between residues Asp314-Ser316, with close distances (about 2.5-5.0 Å) between the Ser316 beta-carbon and Thr373 backbone carbonyl (see
Comparing the starting conformation in Example 1 (which is based on cryo-EM analysis of paired helical filaments-like structures) to the LMT-bound conformation identified herein reveals that residues Gly355-Gly367 and Asn368-Arg379 are brought together in close proximity in the LMT-bound conformation, from a distance of over 36 Å to within hydrogen bonding distance (see
Overview
The molecular simulations performed in Example 2 suggest that LMT is able to bind dGAE and enhance the stabilisation of a compact folded conformation. The inventors hypothesized that molecules which are able to bind tightly within this pocket would further stabilise dGAE and prevent assembly into PHF. They therefore set out to analyse the pharmacophore features of the LMT binding pocket.
Methods
A pharmacophore model was designed by analysing the interactions between LMT and the LMT binding pocket identified in Example 2. A series of compounds were designed to test the binding hypothesis generated from this model. These compounds were tested in a cell-based tau aggregation assay as described in Rickard J E, Horsley D, Wischik C M, Harrington C R., Methods Mol Biol. 2017; 1523:129-140, using LMT (EC50=0.096 μM) as a reference compound.
Results
As shown on
Using in silico design approaches, a set of compounds were designed that exploit the hydrogen-bonding features observed with LMT, and the shape and lipophilic features of the LMT-induced binding pocket. These compounds were assessed in a cell-based Tau aggregation assay. The features of these compounds and the results of the cell-based Tau aggregation assays are shown in Table 2. All compounds had the same central ring and one of 6 substituents to the left of the central ring and one of 5 substituents to the right of the central ring (by comparison to the central ring, left and right rings or LMT shown on
The results suggest that interactions with Lys343 is important to differentiate compound potency, with both hydrogen bonding to the backbone carbonyl of Lys343 and Pi-cation interactions with Lys343 are favourable for ligand potency. Small substituents that are able to bind in the lipophilic pocket towards Phe378, and the ability to form a H-bond with the NH of Thr373 also appear relevant. The structural differences between compounds 17 and 18 include the removal of the aromatic substituent that binds in the lipophilic pocket towards Phe378, and the introduction of a substituent on one of the heavy atoms that formed a H bond with the NH of Thr373. These changes bring about a marked 16-fold loss in potency, compound 18 (EC50=2.246 μM). The structural differences between compounds 17 and 14 include a reorientation of the right substituent (the left substituent in these compounds being identical), which affect the hydrogen-bonding orientation and hence strength between the left substituent of the ligand and the protein. The interaction with Phe378 is also lost. These changes result in a loss of potency by 8-fold (EC50=1.17 μM for compound 14).
The structure-activity relationships of the 19 compounds examined (including LMT) supports the hypothesis that a stabilised cryptic pocket exists which can be exploited to achieve tau assembly inhibition.
Overview
Molecular dynamics simulation was used to study the assembly of dGAE in a conformation that is able to bind LMT, into a paired helical filament (PHF) arrangement.
Methods
The parameterised 3D protein structures of the PHF 10 oligomer (stack of 10 monomers) modelled from PDB ID: 5O3L (as explained in Example 1), was used as a reference template to model the approach of a LMT-free dGAE monomer (as determined using Tau97 in Example 2) towards the PHF and then complete assembly. First the system was truncated from 97 to 73 residues according to the sequence reported in the literature (Fitzpatrick et al., 2017), which describes based on cryo-EM investigation of tau filaments that filament cores are made of two identical protofilaments comprising residues 306V-to-F378 of the Tau protein. This truncated system will be referred to as dGAE73 and PHF73 for convenience. The LMT-free dGAE73 monomer was obtained from the LMT MD simulation experiment (Example 2) and truncated appropriately. The C-terminal truncated amino acids were left as free acids and the N-terminal residues as free bases. Following an established protocol for MD simulations, system minimisation followed by an equilibration run, in total 1,200 ns, and a production MD simulation was then performed for 258 ns. A structure was extracted from the simulation at 50 ns which represented an average of the flexibility observed in the system over the 258 ns, see
The 50 ns timeframe structure was allowed to assemble following the simulated annealing approach to nudged elastic band (NEB) method using the AMBER software. The 50 ns timeframe was used as a starting reference structure and the fully assembled and minimised state was used as a final reference structure. A total of 144 replica frames were employed in constructing the trajectory for assembly.
The NEB simulation was run in a number of stages, 1) Heating up the system, 2) simulated annealing and equilibration, and finally 3) slow cooling. These are described below.
During the heating stage, the system was allowed to warm up from 0 to 300K, performed with a small spring constant. A run of 20 ps of MD was performed with a 0.5 fs time step (nstlim=40000, dt=0.0005). The SHAKE algorithm was not used (ntc=1, ntf=1). Since the coordinates of the end points are fixed in Cartesian space in NEB and the intervening structures cannot move far due to the springs there is no need to re-centre the coordinates every few hundred steps so this option was turned off (NSCM=0). The generalised Born model was used for the implicit solvent with a sodium chloride salt concentration of 0.2M (igb=1, saltcon=0.2). Nonbonded cutoffs and truncated Born radii calculations were obtained by setting these to 999.0 angstroms (cut=999.0, rgbmax=999.0). The NEB specific options used were (ineb=1 [turn on neb], skmin=10, skmax=10 [spring constants]). The simulation was started with a relatively small spring constant of 10 KCal/mol/Å2. For temperature regulation the langevin thermostat (ntt=3) was used together with a high collision rate of 1000 ps-1 (gamma_ln=1000). NMR weight restraints (nmropt=1) were used to linearly increase the value of temp0 from 0.0 to 300.0 K over 35,000 steps of the 40,000 step simulation (istep1=0, istep2=35000, value1=0.0, value2=300.0). This means that at step 0 the value of the target temperature (temp0) will be 0.0 K. At step 5,000 it will be 42.86 K, at 10,000 it will be 85.72 K, by 20,000 it will be 171.44 K and by 35,000 it will be at 300.0 K. It will then remain at 300.0 K until the end of the 40,000 step run.
In the simulated annealing and equilibration stage, a run of 100 ps of MD at 300K with a larger spring constant was used. The coordinates were saved at a frequency of once every 5,000 steps (ntwx=5000). The next step is to run a simulated annealing run. The NMR restraints option was used to control the value of temp0 during the run. A total of 300 ps of MD was run with the following temperature profile:
The final stage is to slowly cool the system down and this was performed in two stages. In the first stage the system was gradually cooled down in 50 K steps over 120 ps using the following profile:
Finally, to calculate a final energy minimization of the path the velocity Verlet algorithm was used. This was performed in a one final 200 ps long stage of cooling where setting temp0=0.0 K completes the final quenched MD and having turned on the quenched velocity Verlet method (vv=1).
Results
In an effort to further evaluate the hypothetical LMT-stabilised conformation of dGAE, we used molecular dynamics simulations to study the assembly of PHFs from a conformer of dGAE which is able to bind LMT. A PHF arrangement of 5 stacked dGAE73 (assembled into a 10 monomers PHF, referred to herein as PHF73) oligomers and a monomer dGAE73 in the stabilised state were modelled in a fully solvated environment, as shown on
In the first stage (anchor stage), 13 molecular dynamics simulations were started and the orientation of alignment of the dGAE73 monomer relative to the PHF73 stack was observed. The starting orientation was randomly chosen, and the monomer was placed beyond hydrogen-bonding distance to the PHF73 stack. The dGAE73 monomer aligned itself over the residues Val337-Gly355 which form a tight hairpin (as shown on
Then, after a period of 50 ns of production molecular dynamics simulation, a nudged elastic band (NEB) molecular dynamics simulation was used to find the minimum energy path for the rearrangement into the fully assembled PHF73 conformation. The observed folding pathway is shown on
A key step in the formation of PHF73 is the formation of alternating positively charged and negatively charged sidechain stacks in the hairpin loop of PHF73 (residues Val337-Gln355), as best seen on
Through this process, the N-terminal arm (residues Val306-Lys321), closely coupled to the C-terminal arm (residues Gly367-Phe378), of dGAE73 collapses onto the PHF73 driven by electrostatic interactions between dGAE73 and the PHF73. Indeed, temporary hydrogen bonds between Asp314 in dGAE73 and Lys370 in the PHF73 stack and between Gln307 of dGAE73 and Lys375 and Thr377 of the PHF753 stack assist the zipper-like closure, helping to bring together the terminal arms of the dGAE73 and PHF73. During the zipper closure, the preferential binding conformation is maintained through hydrophobic interactions between the two arms (C- and N-ter) of the monomer. Indeed, the hydrophobic residues Ile308, Tyr310 and Pro312 of the N-terminal dGAE73 create well-packed hydrophobic interactions with the C-terminal residues of dGAE73 including Leu375 and Phe378, as shown on
where the indices refer to:
A crystal structure of a hexapeptide corresponding to residues V306QIVYK311 underlined in the above sequence was obtained by Sawaya M R, et al. (2007). This structure was used to design peptide inhibitors of VQIVYK aggregation by Sievers S A, et al. (2011) and Zheng J, et al. (2011). The PHF6 (306VQIVYK311) peptide has been shown to be sufficient for in vitro polymerization to filamentous structures and microcrystals (Goux et al., 2004; Sawaya et al., 2007; von Bergen et al., 2000 and 2001). The PHF6 motif is located in the repeat regions of the microtubule binding domain of tau and has been suggested to play a prominent role in the formation of PHFs and is also part of the PHF core composed of cross-β structure (47-48). Our observations support the finding that the PHF6 sequence in dGAE does form a cross-β structure, although it occurs at the end of the assembly stage and in our experiments is seen to be the last step in completing the stabilisation of the PHF, the closing of the zip.
More recently, the structure of another hexapeptide that is not part of dGAE (VQIINK) was determined and used to design inhibitors of aggregation based on the finding that VQIINK forms a more extensive steric zipper interface than VQIVYK. These inhibitors, peptides WINK and MINK, were found to reduce Tau40 aggregation.
Based on the modelling above, the peptide inhibitors of VQIVYK aggregation would at most be able to prevent the formation of the very last step of the aggregation process.
Overview
In this example, the inventors identified the core region of dGAE filaments (using protease digestion experiments) and monitored the progression of self-assembly of dGAE into filaments by following the surface exposure of specific epitopes to a range of monoclonal antibodies (using immuno-precipitation and ELISA experiments) were performed to examine the folding pathway and the effect that LMT had on the process.
Methods
Protein production: Recombinant tau297-391 (dGAE) was produced and purified from ‘E. coli as previously described in phosphate buffer (20 mM, pH 7.4) (AI-Hilaly et al, 2017) and stored at −20° C. until required.
Filament assembly: dGAE (100 μM) was incubated at 37° C. in phosphate buffer (20 mM, pH7.4), with or without DTT (10 mM) and, with or without agitation (700 rpm), for 24 hours.
Methylthioninium chloride (MTC): Methylthioninium chloride (MTC) was provided by TauRx Therapeutics Ltd. The concentrations are expressed as free methylthioninium base (MT), and MTC added as the specified time with or without DTT.
Protease digestion: 100 μM dGAE with 10 mM DTT in phosphate buffer (20 mM, pH 7.4), with or without MTC at ratios up to 1:5 was agitated at 700 rpm, 37° C. for 72 hours. 10 μg aliquots were digested with increasing concentrations of Proteinase K (PK), Pronase E (PE), or left untreated. The digestion was carried out at 37° C. for 1 hour then the reaction was stopped with 1 mM Pefabloc and samples were placed on ice for 5 minutes. Sample buffer without any reducing agent was added, incubated for 5 minutes at room temperature then loaded onto a gel.
Immunogold labelling transmission electron microscopy: For all dilutions of antibodies and secondary gold probes a modified phosphate buffer saline (PBS, pH 8.2) was used; this buffer was supplemented with BSA (1%), Tween-20 (0.005%), 10 mM Na EDTA, and NaN3 (0.2 g/I). dGAE-C322A (100 μM) was incubated with and without MT (10 μM) and incubated for 24 h at 37 C with agitation at 700 oscillations per min in dark. Preformed fibrils were decorated ‘on grid’ using a polyclonal anti-tau antibody (Sigma-Aldrich, SAB4501831). In summary, 4 μl of dGAE-C322 fibrils were pipetted onto Formvar/carbon coated 400-mesh copper TEM support grids (Agar Scientific, Essex, UK) and left for 1 min, then a filter paper was used to remove the excess. Normal goat serum (1.10 in PBS, pH 8.2) was used for blocking for 15 min at room temperature. Grids were then incubated with (10 μg/ml IgG) rabbit anti-tau polyclonal antibody for 2 h at room temperature, rinsed three times for two minutes using PBS (pH 8.2), and then immunolabeled in a 10-nm gold particle-conjugated goat anti-mouse IgG secondary probe (GaR10 British BioCell International, Cardiff, UK; 1.10 dilution) for 1 h at room temperature. The grids were then rinsed five times for two minutes in PBS (pH 8.2) and rinsed five times for two minutes in distilled water. Finally, the grids were negatively stained using 0.5% uranyl acetate. As a negative control, Aß342 fibrils were examined using the same protocol.
SDS-PAGE: SDS-PAGE was conducted on the entire assembly mixtures as well as the supernatant and pellet fractions used for CD (3 μl of each per lane). Samples were mixed with SDS-PAGE sample buffer (without reducing agent) and separated using Any kDa Mini-Protean™ TGX™ Precast gels (Bio-Rad) at 120 V, until the sample buffer reached the end of the gel. The gel was stained using Imperial Protein Stain (Thermo Scientific), following the manufacturer's instructions, before sealing the gel and scanning on a Canon ImageRunner Advance 6055i scanner.
Immunoblotting: A stock solution of recombinant dGAE was diluted to 100 μM in phosphate buffer (20 mM, pH7.4) with 10 mM DTT. The solution was agitated at 700 rpm, 37° C. and aliquots taken at t0 (following a 10 second vortex), 2, 4, 6, 8, 24 and 72 hours and stored at −20° C. When all the time points had been collected, samples were vortexed well and 3 μL of the whole mixture applied to a nitrocellulose membrane, 1.5 μL at a time, allow to almost dry then the next 1.5 μL applied. The membrane was washed for 2 minutes in TBS-T (tris buffered saline with 0.05% tween), 2 minutes in TBS then blocked in 5% milk in TBS-T for 1 hr. scAbs were diluted in block and incubated with the membrane for 1 hour followed by 3×10 minute washes in TBS-T. Secondary antibody (anti-Human GHRP, Invitrogen) diluted 1:1000 in block was then added to the membrane for 1 hour then washed 3×10 min TBS-T. ECL reagent (BioRad) was applied to the membrane for 3 minutes, drained then imaged using a scanner with a maximum exposure time of 10 minutes.
Results
dGAE Filaments Contain a Protease Resistant Core.
Soluble or fibrils of dGAE (100 μM) in reducing conditions (10 mM DTT) were incubated with increasing concentrations of proteinase K or Pronase E then samples were run on SDS-PAGE. Soluble dGAE runs as a doublet at 10/12Kda under reducing conditions and no bands of this size were observed in the presence of either PK or PE. Soluble dGAE in the supernatant was completely digested with even the lowest concentration of enzyme tested (25 μg/mL). The pellet contains mostly fibrillar dGAE and runs as a monomer ad dimer on SDS-PAGE. With PK, a protease resistant band at around 8 kDa was observed with enzyme concentrations up to around 100 μg/mL, after which the band disappeared indicating the core was fully digested. For PE the core withstood even up to 500 μg/mL enzyme. Mass spectrometry analysis of both protease resistant bands (see
Repeating these experiments in the presence of LMT revealed that LMT prevents the formation of a protease resistant core. Proteolysis revealed that soluble dGAE is digested while filaments retain a protease resistant core. In the absence of DTT, dGAE remains able to self-assemble and remains protease resistant (data not shown) while in the presence of DTT, LMT is able to prevent inhibition and the resulting soluble dGAE can be digested (
Assembly of dGAE Results in Progressive Loss of Epitopes
A series of monoclonal single chain antibodies (scAbs) raised to recognise different regions of the peptide were used to perform epitope mapping to follow the conformational change that accompanies self-assembly. Immunoblots were performed on 100 uM dGAe with 10 mM DTT dGAE assembled in reducing conditions over 72 hours at different time points. These were repeated 10 times and the data was pooled to produce graphical output normalised to the intensity at time 0 and error bars were provided to show the variation in the data (see
I297HVPGGGSVQIVYKPVDLSKV[T319SKCGSLGNIHHK331]AB1PGGGQ[V337EVKSEKLDFKDRV QSKI{G355]AB2SLDNITHVPGG[G367}AB3NKKIETHKLTF378]AB4[{R379ENAKAKTDHGA390}AB6E391]AB5
Binding Profiles in the Absence of LMT
As best seen on
Turning now to AB1 (319-331), the signal from the Dot Blot experiments implies that this epitope conformation is less exposed at t0 than AB2-5 (see
Turning to AB2 (337-355), in the context of PHF structure (Fitzpatrick, 2017) is found at the b-helix (see
The short AB4 (367-378, which binds within the C-shape region of the PHF, see
Together with the data from Example 4, this data confirms that the epitope of AB2 (337-355) is first to bind to PHF, from the middle of the sequence at R349. This is followed by the binding of charged groups E342, D348, followed by the sequences of V350-Q351-K343, Q336-V337-K347. The PGGG repeat between the epitopes of AB1 (319-331) and AB2 (337-355) then binds, followed by residues 1360-T261 in the epitope of AB3 (355-367). Then slowly the C-terminal end of AB1 (319-331) binds sequentially towards to N-terminal end, and the epitope of AB4 (367-378) then binds to PHF. This starts with starts with K375 and then amino acids on both sides are assembled. The final sequences to bind is the P364-GG-G367 sequence in the epitope of AB3 (355-378).
From these experiments, we can infer that as dGAE follows a sequence whereby the association of two molecules initiates the assembly (AB1, 323-328) followed by a folding into a C-shape (AB3, 358-364) and the formation of the b-helical hairpin. The structure then curves to bury (AB4, 370-378) and (AB6, 379-391) regions at the centre and C-termini.
Immunogold labelling of soluble and preformed filaments of dGAE confirmed that the AB3 recognition sequence is exposed in soluble dGAE and buried in PHF (see
LMT Mediated Inhibition of dGAE Aggregates and Restoration of Immunoreactivity
The truncated core repeat domain dGAE (297-391), is the predominant fragment that constitutes bulk of the PHF core in AD (Wischik et al, 1988). During dGAE aggregation in vitro, scAb binding regions on dGAE are ‘hidden’ or ‘occluded’ which leads to a loss of immunoreactivity in aggregation samples. Here we have shown the occlusion of binding regions in aggregated dGAE samples and the recovery of immunoreactivity in the presence of LMTM, a tau-aggregation inhibitor. The scAbs tested for binding are core region specific with binding regions given in Table 3 below). For preparing the aggregates, 10 μl 10 mM DTT was added to 1000 μl 100 μM dGAE and agitated with/without LMTM (1:5 ratio) at 700 rpm for 24 h at 37° C. Following overnight agitation, one third of each sample was kept aside as ‘total’ and the rest was spun down at 16000×g for 30 min and separated into ‘supernatant’ and ‘pellet’. The pellet was then resuspended in half the original volume for further experiments. The immunoreactivities of core region specific scAbs towards dGAE aggregates formed with/without LMTM was tested using a sandwich ELISA format using a ‘E’ specific monoclonal antibody 423 mAb. This mAb has been shown to specifically bind to the Pronase resistant core structure in the PHFs (Wischik et al, 1988). ELISA plates were coated with 10 μg/ml 423 mAb and blocked as normal. Doubling dilutions of dGAE aggregate ‘total’, ‘supernatant’ and ‘pellet’ samples at 10 μg/ml starting concentration were added to designated wells in doubling dilutions in 1×PBS. dGAE monomer (non-aggregated) was included as assay control. All double dilutions were done in final volumes of 100 μl. This was left to incubate on lab bench for 1 h followed by the addition of test scAbs at 10 μg/ml. Anti-HuCk HRP labelled secondary antibody was added as described previously and ELISA data generated is represented using the graph below.
All scAbs tested showed increased binding to aggregated dGAE ‘total’ and ‘supernatant’ samples, when aggregation was conducted in the presence of LMTM. This proves the opening or revealing of occluded antibody binding regions on dGAE where LMTM is preventing the aggregation event, leading to an increased immunoreactivity (
All references cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each individual publication or patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety.
The specific embodiments described herein are offered by way of example, not by way of limitation. Any sub-titles herein are included for convenience only, and are not to be construed as limiting the disclosure in any way.
Throughout this specification, including the claims which follow, unless the context requires otherwise, the word “comprise,” and variations such as “comprises” and “comprising,” will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a pharmaceutical carrier” includes mixtures of two or more such carriers, and the like.
Ranges are often expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes from the one particular value and or to the other particular value. Similarly when values are expressed as approximations, by the use of the antecedent “about,” it will be understood that the particular value forms another embodiment.
Number | Date | Country | Kind |
---|---|---|---|
2010679.5 | Jul 2020 | GB | national |
This Application is a National Stage filing under 35 U.S.C. § 371 of International PCT Application No. PCT/EP2021/068718, filed Jul. 6, 2021, which claims priority to Great Britain Application No. 2010679.5, filed Jul. 10, 2020. The contents of these applications are incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/068718 | 7/6/2021 | WO |