The invention relates to the design and expression of proteins with a symmetrical structures.
The invention relates to synthetic proteins with functional properties such as metal binding and enzymatic activity.
The functional and structural diversity of proteins has inspired researchers to engineer them for various applications. Recent examples have demonstrated the engineering of proteins as enhanced catalysts, vaccines, biosensors, and building blocks for 2D/3D frameworks [McConnell et al. ACS Synth. Biol. 2020, 9, 381-391; Brouwer et al. Cell 2021, 184, 1188-1200; Schuster Biosensors 2018, 8, 40; Chen et al. J. Am. Chem. Soc. 2019, 141, 8891-8895; Pyles et al. Nature 2019, 571, 251-256; Zhang et al. Nat. Commun. 2020, 11, 1-12; Ben-Sasson et al. Nature 2021, 589, 468-473]. In many cases, proteins with new functions are obtained via the redesign of existing proteins. Advances in computational protein design have stimulated the development of unique proteins with various conformations and functionality [Kuhlman & Bradley Nat. Rev. Mol. Cell Bio. 2019, 20, 681-697]. Therefore, protein engineers are not limited to re-purposing natural proteins and are able to expand their toolkit with new molecules with improved properties. Symmetric proteins are highly desirable due to their stability and versatility as building blocks for the development of 2D/3D assemblies [Yeates Annu. Rev. Biophys. 2017, 46, 23-42]. An exceptionally stable, symmetric β-propeller protein called Pizza is described in [Voet et al. Proc. Natl. Acad. Sci. U.S.A. 2014, 111, 15102-15107]. To showcase its functional potential, Pizza was redesigned to obtain protein assemblies, artificial enzymes, and high affinity scaffolds for various metals and metal-oxo clusters [Vrancken et al. J. Struct. Biol.: 4 2020, 100027; Clarke et al. Chem. Commun. 2019, 55, 8880-8883; Voet et al. Angew. Chem. Int. Ed. 2015, 127, 9995-9998; Vandebroek et al. Chem. Commun. 2020]. However, Pizza lacks an extensively modifiable interface which limits its capacity to bind more complex molecules.
Advances in computational protein design have allowed for the development of new proteins with unique properties. Symmetric designer proteins have remarkable stability and can serve as versatile building blocks for the creation of macromolecular assemblies. The present invention describes the development of SAKe: A new symmetric, stable protein building block with modifiable loops. In the present invention polypeptides as claimed will be referred to as SAKE proteins. Following the observation of pH induced 3D self-assembly, metal binding sites were engineered along the protein's internal rotational axis to fabricate 2D surface arrays. Using atomic force microscopy, Cu(II) dependent on-surface 2D self-assembly is demonstrated. The present invention discloses a stable and highly modifiable SAKe protein scaffold, which for use as a building block for the creation of multi-functional macromolecular materials.
In the conserved sequence motif, the amino acids Xn are not numbered. As the sequence listing does number the Xn aminoacids, the numbering of amino acids cterminal of Xn will differ.
Thus X14, X16 and X18 of the sequence motif become respectively positions 29, 31 and 33 in SEQ ID NO 34 of the sequence listing. Throughout the specification the numbering of the conserved sequence motif is used.
Throughout the specification, when wording is used such as “Ile4 has Leu or Phe as alternative” this means that at position 4 of the sequence Ile, Leu or Phe can be present.
The invention is further summarised in the following statements:
1. A polypeptide comprising a sequence having at least 60 or 70% identity with NGRIY5AVG8G9-Xn-LNSVE14AY16DP18ETDEW23SFVAPM29TTPR33SGVG37VAV40L [SEQ ID NO:32] or comprising 2 to 9 repeats of said sequence, wherein Xn are between 1 and 15 amino acids, wherein x can be any amino acid,
2. A polypeptide comprising 2 to 9 repeats of a sequence, each of said sequences having at least 70% identity with [SEQ ID NO: 34] NGRIX5AVG8G9-Xn-LNSVX14AX16DX18ETDEW23SFVAPM29TTPR33SGVG37VAV40L], wherein Xn are between 1 and 15 amino acids, wherein x can be any amino acid, and wherein repeats are separated from each other from between 0 and 15 amino acids, wherein the amino acids, Gly8, Gly9, and Trp23 in each of said sequences are conserved,
An example of this proviso is represented as follows:
A polypeptide with three repeats schematically represented as
Can thus also occur as
3. The polypeptide according to statement 2,
Embodiments for all above combinations of X5, X16 and X18 are explicitly envisaged and disclosed herein.
In a specific embodiment X5 is Tyrosine Phenylalanine or Tryptophane, and
4. The polypeptide according to statement 2 or 3, wherein, in one or more, or in all of said sequences X14 is Glu, Asp, Cys, or Ser, or wherein X14 is Glu or Asp.
5. The polypeptide according to any one of statements 1 to 4, wherein, in one or more, or in all of said sequences,
5. The polypeptide according to any one of statements 2 to 4, wherein, in one or more, or in all of said sequences,
6. The polypeptide according to any one of statements 2 to 5, wherein, in one or more, or in all of said sequences, the amino acids, Trp23, Met29, and Gly37 are conserved.
Embodiments of all possible variations of X5, X14, X16, X18 are herewith envisaged and explicitly disclosed.
As recited above, apart from the absolute conserved Gly8, Gly9, and Trp23 sequence variation is limited at position X5, X14, X16, X18, and sequence variation is less stringent for the remaining positions as long as the overall sequence identity is above the defined percentage.
Further specific embodiments of sequences falling under the definition of SEQ ID NO:34 are sequences wherein:
Sequences wherein Ile4 has Leu as alternative; Alas has Val as alternative; Leu10 has His as alternative; Asn11 has Asp as alternative; X18 is Pro or Val; Met29 has Leu as alternative; Gly35 has Ala as alternative choice; Gly37 is conserved
Sequences wherein Ile4 has Leu as alternative; Ala6 has Val as alternative; Leu10 has His as alternative; X18 is Pro or Val; Met29 has Leu as alternative; Gly35 has Ala as alternative; Gly37 is conserved.
For all the above sequences any sequence obtained by combination of the different possibilities for the recited amino acids is herewith explicitly disclosed.
The amino acids X of Xn wherein n=1 to 15, or the amino acids in between sequence in a repeat can be any of the 20 natural amino acids encountered in polypeptides as well as modified versions thereof as obtained by post-translation modification. Other side chain can be envisaged when synthetic peptides are produced as long as the amino acids can be incorporated via its NH2 and COOH group in a polypeptide.
It is further envisaged that in the non-conserved parts of the sequence other amino acids occur which differ from the regular 20 naturally occurring amino acids as long as they are incorporated via its NH2 and COOH group in a polypeptide.
7. The polypeptide according to any one of statements 1 to 6, wherein, in one or more, or in all of said sequences, the amino acids Tyr5, Gly8, Gly9, Glu14, Tyr16, Pro18, Trp23, Met29, Arg33, and Gly37 and Val40 in said sequence are conserved.
8. The polypeptide according to any one of statements 1 to 7, comprising between 2 to 9 repeats of said sequence.
9. The polypeptide according to any one of statements 1 to 8, comprising 2, 3 or 6 repeats of said sequence.
10. The polypeptide according to any one of statements 1 to 9, wherein one or more, or all of said sequences have at least 80, 90 or 95% identity with SEQ ID NO: 32 or is identical to SEQ ID NO: 32.
As an example, for a polypeptide with 3 repeats of the sequence, one sequence can be for example 82% identical (>80%), a second 92% identical (>90%), and a third 97% identical (>95%).
11. The polypeptide according to any one of statements 1 to 10, wherein Xn are between 5 to 15 amino acids, or between 5 to 10 amino acids.
In these embodiments, for each of Xn the length can differ or can be identical, as long as the length falls within the defined range
12. The polypeptide according to any one statements 1 to 11, wherein one or more of said repeats of a sequence are separated from each other with 1 up to 5 amino acids.
In these embodiments, the length between two sequences can differ or can be identical, as long as the length falls within the defined range.
13. The polypeptide according to any one of statements 1 to 12, wherein one or more of said repeats of said sequence are immediately adjacent to each other.
14. The polypeptide according to any one of statements 1 to 13, wherein all of said repeats of said sequence are immediately adjacent to each other.
15. The polypeptide according to any one of statements 1 to 14, wherein two or more, or all repeats of said sequence are immediately adjacent to each other.
16. The polypeptide according to any one of statements 1 to 15, wherein two or more or all the repeats of said sequence are identical, with exception of the amino acids Xn.
17. The polypeptide according to any one of statements 1 to 16, comprising a first repeat and a second repeat of said sequence wherein said first and second repeat occur alternating in said polypeptide.
18. A multimeric polypeptide which of polypeptides as defined in any one of statements 1 to 17. In such multimer the polypeptides may be non-covalently bound to each other, and optionally via the presence of disulfide cystine bridges.
19. The multimeric polypeptide according to statement 18, which is a hexamer of 3 non non-covalently bound polypeptides as defined in any one of statements 1 to 9 having two repeats.
20. The multimeric polypeptide according to statement 19, which is a hexamer of 2 non non-covalently bound polypeptides as defined in any one of statements 1 to 9 having 3 repeats.
The invention further relates to nucleic acids encoding these polypeptides, as well as expressions vector comprising these nucleic acids, and bacterial yeast or eukaryotic cells comprising these vectors.
21. A method of producing a functional protein comprising the steps of:
22. The method according to statement 21, wherein the function is protein binding, an enzymatic activity, or the binding to an organic molecule.
23. The method according to statement 21 or 22, further comprising the step of determining whether the multimeric protein has rotational symmetry.
24. The method according to any one of statements 21 to 23, further comprising the step of determining whether the multimeric protein is stable at pH below 4, or wherein the multimeric protein is resistant against proteolytic degradation.
25. The method according to any one of statements 21 to 24, further comprising determining whether said multimeric protein assembles into quaternary structures, such as fibres, tubes, three dimensional cages or two-dimensional layers.
Particle analysis of the topography, (E) average radius and (F) maximum diameter.
The present invention discloses the design and engineering an improved protein building block named the Self-Assembling Kelch (SAKe) protein. SAKe has a stable, symmetric design with readily accessible loops that can be varied in both sequence and length to later bind larger molecules or scaffold a catalytic site. To demonstrate SAKe's versatility, its structure was modified to undergo metal-mediated self-assembly into 2D surface arrays. This highlights SAKe as a promising new protein scaffold which can be readily redesigned to target various applications.
Through an investigation into naturally occurring pseudosymmetric proteins, the human keap1 Kelch protein was identified as a template for the development of SAKe [Beamer et al. Acta. Crystallogr., Sect. D.: Biol. Crystallogr. 2005, 61, 1335-1342].
Kelch repeat proteins are β-propeller proteins composed of six nearly identical tandem sequence repeats that fold into 4-stranded anti-parallel sheets around a central cavity [Adams et al. Trends Cell Biol. 2000, 10, 17-24]. This structural family has well-conserved blades, with the first and second strands connected by loops that vary in length and sequence. Using a computational procedure that combines ancestral sequence reconstruction with computational protein backbone optimization and subsequent sequence scoring, a new family of proteins named SAKe were derived from the keap1 β-propeller (
To investigate loop modification and further functionalisation, three loop variants of S6BE were created (
SAKe's symmetry and modifiable interfaces make it an ideal building block for construction of macromolecular assemblies. While dialyzing S6BE to low pH, a pH induced self-assembly was observed (
The hexagonal packing and complementary interactions found in S6BE's self-assembled structures provide a promising starting point for its application as a supramolecular building block. Therefore, this protein was rationally reengineered to coordinate divalent metal-ions, and induce self-assembly into on-surface 2D arrays (
The metal-induced assembly of S6BE and S6BE-3HH proteins was first screened using Dynamic Light Scattering (DLS) in solution. Divalent metals (Cu(NO3)2 and Zn(NO3)2) were titrated into the protein solution in different buffers and pH, resulting in larger structures being formed as the ratio of metal:protein was increased (
To investigate the formation of on-surface 2D protein arrays, in solution amplitude-modulated atomic force microscopy (AFM) was utilized (
The assembly of S6BE in the presence of Cu(II), which does not contain His mutations on its bottom loops was also investigated. At a ratio of 20:1 Cu(II): S6BE, amorphous aggregated proteins were observed with no obvious crystallinity (
Using a computational approach combining consensus design and Rosetta energy scoring, SAKe proteins were developed: a new symmetric, stable protein scaffold with modifiable loops. The loops can be varied in both length and sequence, highlighting their potential to be optimized for the binding of clinically relevant molecules or programming of catalytic activity. Following SAKe's modification with metal binding sites, Cu(II) induced self-assembly was observed of on-surface 2D arrays. The present invention discloses SAKe as a highly modifiable protein scaffold which can double as a building block for the fabrication of 2D protein assemblies. In summary, SAKe's versatility holds great promise for the creation of biotherapeutics and innovative on-surface materials.
The following crystal structures were submitted to RCSB PDB: SAKe6AE (7ON6), SAKe6AR (7ON8), SAKe6AC (7ONA), SAKe6BE (7ONC and 7ONE), SAKe6BE3HH (7OP4, 7OPU and 70PV), SAKe6BE-L1 (7ONG), SAKe6BE-L1 (7ON7) and SAKe6BE-L1 (7ONH).
SEQ ID NO: 1 to SEQ ID NO: 19 (odd numbers): DNA sequences of designed SAKe proteins. The outermost emphasized 5′ and 3′sequences contain a start codon, NdeI restriction site, stop codon and XhoI restriction site.
CGAG
CGG
The proteins examined in this research were designed using the RE3Volutionary protein design method [Voet et al. Proceedings of the National Academy of Sciences 2014, 111, 15102-15107]. The KELCH domain of human Keap1 (PDB code: 1ZGK) was chosen as a template for the SAKe designs. Clustal Omega was used to generate multiple sequence alignments (MSAs) starting from the six repeats of the keap1 β-propeller [Madeira et al. Nucleic acids research 2019, 47, W636-W641]. The MSAs and their accompanying unrooted phylogenetic trees were used to construct lists of putative ancestral sequences using the FastML server [The PyMOL Molecular Graphics System, Version 1.2r3pre, Schrodinger, LLC.], with 250 sequences per node for a total of 10000 sequences for each SAKe construct. Idealized symmetric backbone models were designed using both PyMOL [Ashkenazy et al. Nucleic acids research 2012, 40, W580-W584] and PyRosetta [Chaudhury et al. Bioinformatics 2010, 26, 689-691]. With PyMOL, the second and last blades of the Keap1 Bpropeller were extracted. The first blade was used for the design of type A SAKe (47 amino acids per repeat) and the last blade for type B SAKe (51 amino acids per repeat). The N-termini were truncated to minimize clashing with their symmetry equivalents during the subsequent Rosetta Symmetric Docking procedure [Andre et al. Proc Natl Acad Sci 2007, 104, 17656-17661]. A sixfold rotational symmetry was enforced and generated 20000 Monte Carlo Simulated Annealing (MCSA) optimized models per SAKe type. Results were evaluated on their docking score and RMSD from a manually constructed symmetric backbone. The blades of the best models were reconnected, adding in only the exact amount of amino acids that were removed earlier. The putative ancestral sequences were mapped on their corresponding backbone models using a custom PyRosetta script. For each SAKe type a model was selected with the lowest Talaris2013 energy score (e.g. SAKe6E ‘E’), a model with lowest RMSD from the input backbone (e.g. SAKe6R ‘R’) and a c-optimized model (e.g. SAKe6C ‘C’). For all SAKe constructs except S6AR, cysteine residues were mutated to serine or alanine. Following the interpretation of later experiments, S6BE mutants were rationally designed to have altered self-assembly properties. The S6BE-3HH variant was designed from S6BE via following mutations: E24H-R25H, E118H-R119H and E212H-R213H. Amino acid sequences were reverse translated into DNA sequences, using a codon optimization tool provided by the supplier (Integrated DNA Technologies, Iowa, United States).
DNA sequences were cloned into pET-28a(+) via NdeI and XhoI restriction sites, adding an N-terminal hexahistidine tag to the constructs. For cloning, the recombinant vectors were transformed into E. coli DH5a via heatshock. The vectors were validated via Sanger Sequencing (LGC Ltd, Teddington, United Kingdom), using T7 promotor and terminator primers provided by LGC. Correct plasmids were transformed into E. coli BL21 via heat shock. 1L cultures were grown in a shaking incubator at 37° C. to an OD600 of 0.6. Thereafter, cultures were incubated on ice for 20 min. After adding 1 mM isopropyl β-D-1thiogalactopyranoside (IPTG) incubation was continued at 20° C. for 16-18 h while shaking. Cells were harvested via centrifugation at 3000 g. The supernatant was discarded and pellets were immediately stored at −24° C. Pellets were thawed on ice and suspended in 40 ml of 50 mM NaH2PO4 (pH 8), 200 mM NaCl, 10 mM Imidazole, 1 mM phenylmethylsulfonyl fluoride (PMSF) and 30 mg hen eggwhite lysozyme. They were incubated for 30 min at 15° C., while rotating. After, they were lysed via sonication. Lysates were centrifuged at 3000 g for 30 min. The Supernatant was filtered (0.45 μM) and loaded on a Nickel nitrilotriacetic (NINTA) column equilibrated with a 50 mM NaH2PO4 (pH 8), 200 mM NaCl and 10 mM imidazole buffer. The column was washed with 10 mM and 20 mM imidazole and the proteins eluted with 300 mM imidazole. The fractions containing proteins were collected and dialyzed overnight in 50 mM NaH2PO4 (pH 8) and 200 mM NaCl. At the same time, histidine tags were removed via thrombin (100 U per protein). The dialyzed samples were subjected to an additional Ni-NTA chromatography step and then loaded on a Superdex200 pg 16/600 column equilibrated with 20 mM HEPES (pH 8) and 200 mM NaCl.
Abs280 peaks were collected, dialyzed in 20 mM HEPES (pH8) and concentrated to stocks of 20 mg/ml or more. These stock proteins were stored at 4° C. The Superdex column was standardized using the Bio-Rad “gel filtration standard 151-1901” protein marker.
Proteins were crystallized via sitting-drop vapor diffusion; using Qiagen Nextal Crystal Screening kits, MRC 96-well plates and an ARI Gryphon robot. For native crystallography droplets consisted of 0.3 μL mother liquor and 0.3 μL of 10 mg/ml protein in 20 mM HEPES pH 8.0, 200 mM NaCl. Protein crystals were vitrified after single-step soaking. PEG 400 or glycerol were used as cryoprotectant. Xray diffraction experiments were performed at Diamond Light Source (United Kingdom), Elletra (Italy) and SLS (Switzerland). The diffraction patterns were indexed using XDS or DIALS [Kabsc Acta Crystallographica Section D: Biological Crystallography 2010, 66, 125-132; Winter et al. Acta Crystallographica Section D. 2018, 74, 85-97]. Data reduction was done with Aimless in CCP4 [Evans et al. Acta Crystallographica Section D: Biological Crystallography 2013, 69, 1204-1214; Winn et al. Acta Crystallographica Section D: Biological Crystallography 2011, 67, 235-242]. Molecular Replacement phasing was done with PHASER, using computationally designed models as search ensemble [McCoy et al. Journal of applied crystallography 2007, 40, 658-674]. Refinement was done manually with phenix.refine and Coot [Adams et al. Acta Crystallographica Section D: Biological Crystallography 2010, 66, 213-221; Emsley et al. Acta Crystallographica Section D: Biological Crystallography 2010, 66, 486-501]. The final structures were validated using Molprobity and the PDB validation tool [Chen et al. Acta Crystallographica Section D: Biological Crystallography 2010, 66, 12-21], before being deposited at RCSB PDB. The data for S6BE-L2 was severely anisotropic. The Diffraction Anisotropy Server was used to improve the corresponding MTZ file (merged with Aimless at a 1.95° A cutoff).
Surface electrostatics were calculated at pH 4.0 and 8.0 via PDB2PQR, using a complete
SAKe monomer as input. The calculated surfaces were visualized via PyMOL. pI values were calculated via PROPKA [Dolinsky et al. Nucleic acids research 2004, 32, W665-W667; Li et al. Proteins: Structure, Function, and Bioinformatics 2005, 61, 704-721; Dolinsky et al. Nucleic acids research 2007, 35, W522-W525; Olsson et al. Journal of chemical theory and computation 2011, 7, 525-537].
CD spectroscopy was performed with a JASCO J-1500 spectrometer. To measure the CD spectra, protein samples were diluted to 400 μL of 0.1 mg/ml in 20 mM NaH2PO4 (pH 7.6). Ellipticity was measured at 20° C. from 260 nm to 200 nm, using 1 mm cuvettes. 5 Accumulations were averaged. For determination of melting temperatures, samples were diluted to 400 uL of 0.25 mg/ml in 20 mM NaH2PO4 (pH 7.6). The signal at 233 nm was followed from 0 to 95° C. with intervals of 0.2° C., using sealable 2 mm cuvettes. The data was analyzed with a custom Python script, which fits a sigmoidal curve and extracts its midpoint. The positive signal at 233 nm was previously attributed to interactions between aromatic residues. Disappearance of this signal during melting experiments seemed indicative of tertiary structure disruption [Saxena et al. Biochemistry 1996, 35, 15215-15221; Nikkhah et al. Biomolecular engineering 2006, 23, 185-194].
The tipping point of pH induced self-assembly was found to be approximately 4.5. To show reversibility of assembly, 500 μL of 5 mg/mL S6BE was dialyzed at 20° C. in 50 mM citrate-citric acid buffer at pH 4.0, 4.5 and 5.0. To confirm the pH 4.5 tipping point, a similar experiment was repeated for both S6BE and S6BE-3HH. 500 μL Samples of various protein concentrations (5.0 mg/mL, 2.5 mg/ml, 1.0 mg/ml and 0.5 mg/mL) were dialyzed at 20° C. in 50 mM citrate-citric acid buffer (pH 4.5 and 4.0). For each protein, a self-assembled crystal was soaked in cryo-protectant, vitrified and shipped of for Xray diffraction. Pictures of the self-assembled crystals were taken with a Nikon SMZ800N microscope, outfitted with a TV Lens C 0.45× (Nikon, Japan).
Concentrated Sake6 and Sake6-3HH protein stock solutions (20 mg/ml, HEPES pH 8) were diluted with either 20 mM MES pH 5.6 or MilliQ to a concentration of 1 mg/ml. Metal suspensions (Cu(NO3)2 and Zn(NO3)2, MilliQ) were titrated into the 100 μl of prepared protein solution to achieve the desired ratio of protein:metal. Size measurements were obtained at 25° C. using a Zetasizer Nano ZS instrument (Malvern Instruments) and quartz cuvette (ZEN2112). Data analysis was performed using the Zetasizer software 7.11 (Malvern Instruments).
The protein stock solutions (40 mg/ml, HEPES pH 8) were diluted with the imaging buffer (20 mM MES pH 5.6 or MilliQ) to a desired concentration. The metal salts were suspended in MilliQ (Cu(NO3)2 and Zn(NO3)2, Sigma-Aldrich) and mixed with the protein solutions to obtain the correct ratio of protein:metal, before being left to incubate for 20 minutes. Next, 30 μl of diluted protein/metal solution was drop cast onto freshly cleaved substrates, muscovite mica (Agar Scientific) or HOPG (ZYB grade, Advanced Ceramics Inc.).
An additional 30 μl droplet of the protein/metal solution was then carefully pipetted onto the AFM's cantilever holder, previously loaded with Nanoworld ARROW-UHF AuD20 (Resonance Frequency 0.7-2.0 MHZ) and brought into contact with the surface droplet in the AFM. The sample and AFM system was then left to equilibrate for 1 hour at 25° C. before imaging.
The images were captured on a Cypher ES atomic force microscope (Asylum Research) using the amplitude modulation mode whilst in solution. The imaging force and frequency were both carefully adjusted to reduce any disruption in the self-assembled surface arrays. The AFM data processing was performed with a combination of SPIP and Gwyddion software. The imaging buffer was either 20 mM MES (PH 5.6) or MilliQ, with protein concentrations of 0.5-5 μM and Cu(NO3)2/Zn(NO3)2 concentrations (5 μM-50 μM) depending on the requirements of the experiment.
In vivo half-life is an important factor in developing pharmaceutical molecules, as it directly effects the duration of adequate therapeutic effect. Herein, size is an important determinant for in vivo half-life of proteins. Below 70 kDa, most proteins are quickly removed via renal clearance. SAKe proteins of the present invention are roughly 30 kDa. To avoid clearance, SAKe proteins can either be genetically fused or the formation of larger complexes can be induced to surpass the glomerular filtration cutoff. Various SAKe mutants have been engineered and characterized to increasing biological size (Table 6). Additionally, these SAKe mutants show potential as bi/multi-specifics.
S6BE-3CHR (30.4 kDa), S6BE-3 HR-L3 (34.2 kDa) and mEm-v22S6BE-3 HR (57.7 kDa) are soluble and SDS PAGE confirms their expected sizes (
DS6AC (65.0 kDa), a fusion of two S6AC units, is soluble and SDS PAGE analysis confirms the expected size (
S6BE-3 HR (30.5 kDa) is soluble and SDS PAGE analysis confirms the expected size (
S2BE-3 HR (10.4 kDa) is soluble and SDS PAGE confirms the expected size
(
Concentrated protein samples were loaded on HiLoad Superdex 75 pg 16/600 or HiLoad Superdex 200 pg 16/600 SEC columns (Cytiva). SEC can also be used to assess biological size. All samples were run with a HEPES buffer (20 mM HEPES, pH 8.0, 200 mM NaCl). Proteins expected to interact with metals were incubated with at least 5 mM EDTA prior to SEC injection.
Size exclusion chromatography confirms the expected sizes of each SAKe mutant in solution (
Protein crystals can be diffracted to study the atomic three dimensional structure of the constituents. This way, the mechanism of metal-binding could be unraveled for proteins such as S6BE-3 HR and S6BE-3CHR. Crystals were grown via sitting drop vapor diffusion in MRC 2-well plates (Hampton Research, UK). Droplets were set up using a Gryphon crystallization robot (Art Robbins Instruments, USA): 0.5 μL crystal screening kit buffer was mixed with 0.25 μL Zn(NO3) (in water) and 0.25 μl from a 20 mg/ml protein stock (20 mM HEPES pH 8.0, 200 mM NaCl) (Table 7).
The crystals were vitrified after single-step soaking using 25% PEG400 or glycerol as a cryoprotectant. Xray diffraction experiments were performed at Diamond Light Source (UK). Diffraction patterns were indexed using XDS or DIALS. Data reduction was done with Aimless in CCP4. Molecular Replacement phasing was done with PHASER, using an in-house model of S6BE. Refinement was done manually with phenix.refine and Coot.
Xray diffraction experiments carried out on S6BE-3 HR and S6BE-3CHR highlight their mechanisms for zinc coordination and subsequent cage assembly (
SEC shows that S2BE-3 HR proteins first self-assemble as six-bladed trimers (
Number | Date | Country | Kind |
---|---|---|---|
21183379.3 | Jul 2021 | EP | regional |
22150399.8 | Jan 2022 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2022/068475 | 7/4/2022 | WO |