The present invention relates to biomimetic design of affinity peptide ligands for target protein using molecular simulation methods, and the purification of target protein by affinity chromatography. The invention belongs to the field of in silico modeling and protein purification of biotechnology.
Antibodies (immunoglobulin, Ig) exist in blood and interstitial fluid of animals, which belong to a kind of glycoprotein produced by B lymphocytes in immune response to antigens, Antibodies are widely used in biological studies and clinic treatment on account of their high affinity for antigens. Especially under gradual maturity of genetic technology and clone technology, monoclonal antibodies (mAbs) have become effective therapeutics for inflammation, tumor, and infectious diseases. Up to now, there're approximately 20 mAbs that have been approved by FDA and at least 300 are under research and development. The output value of mAb-based drugs had reached 20.6 billion dollars in 2006. The increasing significance of antibodies in medical treatment calls for an urgent need of an efficient, stable, and low-cost manufacturing technique. Due to the complexity of antibody expression and high-quality standard required for pharmaceutical antibodies, antibody purification has become a critical step in the whole production process. IgG is the main component of Igs in blood serum, accounting for 75% of the total Igs, and also has the highest market demand among all Igs.
Antibody purification is often conducted by multi-steps including salting-out, gel-filtration chromatography, hydrophobic interaction chromatography, and ion-exchange chromatography and affinity chromatography. The feature of efficient and specific purification of target protein has made affinity chromatography into one of the most commonly used chromatographic methods in later period of antibody purification. Affinity chromatography makes use of the affinity ligand that specifically and reversibly binds the target molecule thus to separate it from complicated biological samples. It has the advantage of high selectivity and high purification efficiency. The purification performance of affinity chromatography depends on the affinity between the ligand and the target molecule. Therefore, the exploitation of suitable affinity ligand targeting specific molecule is the priority of building an affinity chromatographic system.
Staphylococal Protein A (SpA), protein G, and protein L have been widely used as affinity ligands for producing high-purity antibodies. Such ligands possess high selectivity, nonetheless need harsh elution condition, which tends to denature target proteins and causes ligands to fall off, leading to a low adsorption capacity. Moreover, protein ligands are difficult to prepare, costly, and are prone to lose some activity upon immobilization. These disadvantages have limited the application of the above protein ligands.
The research into affinity peptide ligands began with Geysen's study of synthetic peptide library in 1986, who proposed that the short peptide containing key residues could simulate the protein determinant. And in most cases, the non-covalent interactions between a few of key residues and the target molecule constitute the main interaction force for the complex binding. The two viewpoints have laid the theoretical basis for affinity peptide ligands. Firstly, peptide ligands are usually comprised of a few amino acids and therefore have little chance of causing immunogenic responses in use. Secondly, peptide ligands have a small molecular size. So even if they fall off from the stationary phase and mix with the products, it is easy to remove them from the end-products. Finally, the peptide ligands interact with target protein in mild conditions, which can make it easier to control the separation conditions and avoid the denaturation of target proteins. Compared with the high affinity ligands such as protein ligands, peptide ligands also exhibit sufficient affinity for binding target proteins. Besides, the conformation and physicochemical properties of peptide ligands are more stable than those of protein ligands, and thus the ligands can sustain harsh acid or base elution and regeneration conditions during the separation process, realizing the large-scale aseptic manufacturing that meets GMP requirements. In recent years, some peptide ligands with great purification performance for antibodies have sprung up, such as TG19318, Peptide H, Mabsorbent A1P and A2P, 8/7, and linear peptide ligand (HWRGWV etc.).
Although affinity peptide ligands have such many advantages, there are only a limited number of peptides that have affinity for specific target proteins in nature. In addition, although the above-mentioned micromolecule ligands have a huge superiority in antibody purification, they have some deficiencies compared with SpA affinity resin, like weak specificity and affinity, etc. Therefore, research in screening and designing ligands for affinity chromatography is vital. The question about how to choose proper peptides as affinity ligands and how to improve the affinity and selectivity of the peptide ligands is crucial in the application of affinity chromatography. The present screening and designing methods are mainly divided into two kinds of approaches, namely experimental screening and rational design. Experimental screening is based on the combinatorial library technology to conduct high-throughput screening. According to the different ways of constructing peptide library, the screening methods for affinity ligands are mainly divided into combinatorial chemistry synthetic peptide library technology, such as TG19318/D-TG19318, Peptide H, Mabsorbent A1P and A2P, and 8/7 mentioned above; phage display peptide library technology, such as HWRGWV, HYFKFD, and HFRRHL; and ribosome display peptide library technology. Rational design is mainly based on the structure and properties of the target protein or the known ligands to design new ligands. As the computer technique, computational chemistry, and medicinal chemistry develop, the design of affinity ligands has entered the rational design phase dominated by computer aided design. The various virtual screening and rational design methods of computer aided ligand design include molecular docking, 3D-QSAR, pharmacophore model, molecular dynamics (MD) simulation, and de novo design, etc.
Molecular docking involves two molecules recognizing each other through geometric matching and energy matching. Molecular docking is a calculation process in which the ligand is put on the binding site of the target protein, and the binding strength is evaluated via the criteria of geometric complementarity, energy complementarity, and chemical environment complementarity. Meanwhile, the best binding conformation can be found. Since molecular docking has considered the interaction between the target protein and the ligand, so in principle, molecular docking is a direct design method on the basis of the receptor. In recent years, as the protein crystal structure data grows rapidly and the small molecule databases updates constantly, molecular docking has become the most important structure-based design method. The common softwares include DOCK, Autodock, and FlexX, etc.
Molecular dynamics (MD) simulation is a kind of molecular simulation method on the basis of Newtonian mechanics. It has been used for studying the particle movement in many-particle system. The basic procedure in MD simulation can be divided into four steps: (1) initialization; (2) calculation of atomic force; (3) updating the atom coordinates and speed; (4) analysis of trajectory. According to the atom coordinates, speed, and force of the last step, the coordinates and speed of next step can be obtained. Steps (2) and (3) are repeated to obtain the change of physicochemical properties in the system with simulation time. The softwares commonly used in MD simulation include GROMACS, NAMD, AMBER, and CHARMM, etc. Based on the analysis of MD simulation trajectories, we can study the physicochemical properties of the simulation system, such as conformation, energy, kinetic property, and the interaction force between the ligand and the target protein, etc.
Combination of a few of rational design methods and proper combination strategy can reduce the cost and realize higher-accuracy ligand design. In early phase, some methods of rapid speed but limited accuracy can be used to enrich potential candidates, like molecular docking. Then, some methods with more calculation but also more accuracy can be adopted to further select the best ligand molecules, like MD simulation. In the final phase, experiments which are time-consuming and costly are used to conduct the last verification.
The purpose of the invention is to provide a method of constructing a novel affinity ligand peptide library of IgG on the basis of Protein A affinity model, and the application of the design method. In this invention, this simulation design method is firstly adopted to identify affinity peptide ligands for antibodies, and is verified to be feasible.
In the invention, the novel affinity ligand peptide library of IgG is constructed on the basis of Protein A affinity model, and the basis is six hot spots of SpA: F132, Y133, H137, E143, R146, and K154. In order to immobilize the ligands, cysteine is added to the middle region of the peptides. Thus eight models for the peptide library are obtained:
The ‘X’ denotes the common 19 kinds of amino acids except Cys.
The application of the design method is using amino acid location method to determine the kinds of amino acids (‘X’) in the peptide models on the basis of above peptide library, which are summarized in the following table:
The detailed peptide sequences are listed in Table 5.
The peptide library is then screened by molecular docking, root-mean-square deviation (RMSD) comparison and rescreened by MD simulations to identify the peptide ligands of high affinity for the target protein. The peptide ligands that are predicted to have high affinity for human IgG (hIgG) are: FYWHCLDE, FYFCRWE, FYIHCLPE, FYYHCKKE, FYCHWALE, FYCHWQDE, FYCHTIDE, FYRHCQRE, FYCHHKTE, FYCHLQKE, FYCHRKAE, FYCHNQDE, FYCHRQEE, and FYNHCASE.
What needs to be stressed is that the overall 2173 peptides contained in the above peptide library are all likely to be affinity ligands of hIgG theoretically; the purpose of carrying out molecular docking screening and MD simulation rescreening is to enrich the peptides that have high affinity for hIgG, and effectively reduce the number of peptide candidates so as to feasibly validate their affinity using later experimental methods. The finally obtained 14 peptides bear the maximum probability of high affinity for hIgG. Although in silico modeling has constantly developed towards maturity so far, the prediction of the intermolecular interaction has not yet been completely in line with the actual situation, resulting in some deficiencies that molecular simulation softwares have revealed unavoidably (e.g., the different softwares or parameters adopted may lead to different results). And therefore the prediction could just approach the actual situation as far as possible. There may exist two cases when molecular simulation is utilized for screening peptides: first, the hit rate of peptides that have high affinity for hIgG is increasing with screening times; second, the chance of missing and excluding affinity peptide ligands for hIgG is also increasing with screening times. As a result, it can't be ruled out that the other 2159 peptides are also likely affinity peptide ligands of hIgG.
The method of this invention obtains affinity peptide ligands of hIgG utilizing molecular simulation and experimental measures, and the method is listed as the 1-5 steps below:
Detailed operations are depicted as follows: first, the 3D coordinate files of Fc fragment and SpA B domain are derived form the crystal structure of hIgG1-SpA complex (PDB ID: 1FC2); second, the coordinate structures of the six hot spots (F132, Y133, H137, E143, R146, and K154) are obtained from the coordinate files of SpA B domain, and the distance between the C and N terminal of two hot spots are calculated. Thereafter, according to the principle of amino acid insertion, the number of residues that should be inserted into between the hot spots is calculated and the peptide models are identified (Table 1); then one of the middle residues is determined to be Cys (Table 2); after that, the corresponding Fc region located between every two hot spots is chosen and covered in the grid box in Vina docking. Then each of the 19 common amino acids except Cys is docked to the Fc region contained in the grid box sequentially, and the top 20 conformations with highest docking scores are selected for each amino acid. The screening criteria for amino acid selection is listed as below: first, the conformation should be proper, namely the C/N terminal of the amino acid conformation should be located end to end with the N/C terminal of the nearby hot spots, thus they could be linked up to form peptide bonds. Second, the affinity binding energy should be less than −2.0 kcal/mol. At last, we identify all the unknown amino acide ‘X’s in the peptide models (Table 3). A perl script invoking CHARMM is used to build the peptide library, which comprises 2173 peptides (Table 5). Every peptide contains four hot spots.
The FlexPepDock docking is performed twice to reduce the randomness of just one docking process. It is found that only a few peptides could not bind to Fc and the other majority could. The docking scores (interface energy score, I_sc) of the majority of peptides are in the range of −4 to −22 and most I_sc values are in the range of −14 to −16. The I_sc value of no more than −16 is selected as the screening criterion, thus it could be possible to avoid the waste of computing resource (considering that the next round of MD simulation rescreening could consume too much computing resource and the speed would be slow), and avoid missing too many potential candidates. Certainly related researchers can set a more proper I_sc value as screening criterion according to their own research conditions. There are 15 peptides with both I_sc values from two parallel dockings less than −16 and they are selected for further MD simulations.
In this invention, free energy calculation and free energy decomposition method coupled with MM/PBSA are utilized to obtain the hot spots of Protein A that has high affinity for human IgG, and a SpA affinity binding model of hIgG is constructed on the basis of the hot spots. Then, amino acid location, insertion of a Cys into the middle region of the peptide, and molecular docking are conducted for building the peptide library; thereafter, molecular docking is used to screen and MD simulation is used to rescreen the peptide library, and a pool of peptide ligands of high affinity for IgG is obtained. We have identified two peptides that could effectively purify hIgG—FYWHCLDE and FYCHWALE. A series of experimental methods including affinity chromatography, SDS-PAGE, and UV-Vis spectrometer are conducted to investigate the IgG purification and related characterizations.
It should be noted that, all the 2173 peptides of the library may be affinity ligands of hIgG in theory. But due to the limitations of molecular simulations, it cannot fully simulate the actual situation and can just approach it as far as possible. Therefore experimental methods are needed to verify the results. Since time and resources were limited, the invention could not verify all the peptides through experimental methods. Only two peptides were randomly selected from the finally obtained 14 peptides to perform experimental characterizations. The main purpose is to explain how to use experimental methods to verify the effectiveness of the peptides as affinity ligands of hIgG rather than to indicate just the two peptides mentioned in the invention are effective. Researchers can verify other peptides of the library by the experimental methods used in this invention.
Combined with the attached figures, the followings are the further description of the invention in detail. The implementation cases are to elucidate rather than confine the invention in any way.
First, MM/PBSA was used to calculate the absolute binding free energy of SpA-hIgG1 complex. Then, free energy decomposition method based on MM/PBSA was utilized to analyze the molecular mechanism of high affinity between SpA and hIgG1 as well as the contribution of residues on the binding surface of SpA-hIgG1 complex to the binding free energy. According to the free energy contribution of each residue and pair residues interaction analysis, the hot spots of the SpA-hIgG1 complex were identified. At last, SpA affinity binding model was constructed on the basis of the molecular mechanism and hot spots obtained from the above analyses.
SpA-hIgG1 molecular system was investigated in MD simulations. The complex model contained a single heavy chain of the Fc fragment and the B domain of SpA. The structure of the SpA-hIgG1 complex was obtained from Protein Data Bank (PDB ID: 1FC2) (
The binding free energy (ΔGbind) for SpA-hIgG1 complex was calculated using the MM/PBSA method. It was estimated as the sum of the gas-phase energy (ΔGgas), the solvation energy (ΔGsol.), and the entropic energy (−TS), according to
ΔGbind=<ΔGgas>+<ΔGsol>−<TΔS>
The brackets, <. . . >, indicate an average of an energy term along the MD simulation trajectory. T is the absolute temperature, and S is the solute entropy.
Ggas contains an intermolecular electrostatic term (Gelec), a van derWaals (vdW) term (Gvdw), and an internal energy term (Ginter).
In this study, “the same trajectory method” was used in all analyses. So, the internal energy term (ΔGinter) is zero. Thus, for the contribution to the bind free energy, ΔGgas is the sum of ΔGelec and ΔGvdw:
ΔGgas=ΔGelec+ΔGvdw
The solvation energy is divided into the electrostatic salvation energy (GPB) and the nonpolar solvation energy (Gnp):
Gsol=GPB+Gnp
GPB was calculated by solving the linear Poisson-Boltzmann (PB) equation using the PBEQ module of the CHARMM program. The solute and solvent dielectric constants were set to 1 and 80 in all BP calculations, respectively. The ionic strength was set to zero. The solvent molecule radius was set to 1.4 Å. Gnp, which could be considered as the sum of a solvent-solvent cavity term and a solute-solvent vdW term, was calculated according to
Gnp=γ×SASA+b
The constants γ and b were set to 0.00542 kcal/(mol·Å2) and 0.92 kcal/mol, respectively. SASA represents the solvent accessible surface area.
The entropy (S) can be decomposed into translational (Strans), rotational (Srot.), and vibrational (Svib) entropies. These terms were calculated according to the statistical mechanics. Strans and Srot. were functions of the mass and moments of inertia of the molecule, respectively. The vibrational entropy (Svib) was estimated by normal-mode analysis (NMA) using the VIBRAN module of the CHARMM program.
The free energy contribution of each residue can be divided into polar (Gpolar) and nonpolar interactions (Gnonpolar).
Herein each part is the sum of two energy terms. In the following analysis, Gpolar is considered as the electrostatic interaction and Gnonpolar as hydrophobic interaction. It is important to note that Gresidue only provides a decomposition of Gpolar and Gnonpolar (i.e., it does not contain the contribution of the entropy). Gresidue=Gpolar+Gnonpolar, Gpolar=Gelec+GPB, Gnonpolar=GvdwGnp. The electrostatic energy of each residue (Gpolar) is the sum of the intermolecular electrostatic energy (Gelec) and the electrostatic solvation energy (GPB). The linear PB equation allowed the decomposition of electrostatic solvation energy as contributions of each atom. The Gvdw contribution of each residue in SpA is half the Gvdw between the residue and Fc fragment and vice versa for the Gvdw contribution of residues in Fc. Gnonpolar of each residue is proportional to the loss of SASA in the residue. From the above analyses we obtained ΔGbind of the SpA-hIgG1 complex, the free energy contribution of each residue, and thereby the hot spots interacted between SpA and hIgG1.
In the SpA-hIgG1 complex, the SpA residues in contact with Fc fragment are discretely located on helix I (K126 to H137), helix II (E144 to D155), and irregular curl (L138 to E143). In this invention, the hot spots are identified as the residues that have large contribution to the binding free energy and that are involved in the important intermolecular interaction formation to compensate the unfavorable salvation. The residues contributing a lot to the free energy are identified on the basis of the criterion of ±2.5 kcal/mol. In order to recognize the significant residues in the complex for guiding the rational design of affinity ligands, a higher criterion than that often used in other literatures was adopted in this invention. For SpA, the residues F132, Y133, H137, R146, and K154 are found to have large contributions to the binding. Though the residue E143 of SpA has only little energy contribution (−1.2 kcal/mol), its free energy deviation is as high as −6.7 kcal/mol. Moreover, the side chain of E143 in SpA has a negatively charged [COO−] group, mainly offering electrostatic force. Therefore, the free energy contribution of E143 is highly sensitive to the conformation of residues. E143 of SpA has strong electrostatic attraction with the residue K317 of hIgG1. On the contrary, it has weak electrostatic repulsion with both D280 and D315 of hIgG1. So, though E143 does not contribute a lot of binding free energy directly, it creates a favorable local binding environment for K317 of hIgG1. Thus E143 is considered as one of the hot spots of SpA.
Helix I of SpA binds the Fc fragment mainly by hydrophobic interaction. Particularly, hot spots F132 and Y133 in helix I offer the majority of hydrophobic inteactions. MD simulation results indicate that the aromatic side chains of F132 and Y133 bind tightly at the shallow slot of the hydrophobic pocket on Fc fragment. In contrast, helix II of SpA binds the Fc fragment mainly by electrostatic interaction. Helix II contains three polar hot spots (E143, R146, and K154), which have strong electrostatic attraction with the polar residues (H310, Q311, D315, K317, and K338) located around the hydrophobic area at the binding site. Hence, hydrophobic interactions and special electrostatic interactions should be both considered in the rational design process to develop novel SpA-mimetic affinity ligands. According to the affinity mechanism of SpA in interaction with hIgG and the hot spots distribution of SpA, a simplified SpA binding motif has been constructed, wherein six hot spots are F132, Y133, H137, E143, R1461, and K154 (
1. Identification of the Length of Peptide Sequences
It is known that the length of a peptide bond and an amino acid backbone is about 1.33 and 2.78 Å, respectively. The length needed to insert an amino acid residue is about the length of two peptide bonds plus one amino acid backbone length (2×1.33+2.78=5.44 Å); the length needed to insert two amino acid residues is about the length of three peptide bonds plus the length of two amino acid backbones (3×1.33+2×2.78=9.55 Å).
On the basis of the distance between the corresponding C and N terminal of two hot spots of SpA (which is calculated through Visual Molecular Dynamics software VMD), the number of amino acid residues inserted between the hot spots was worked out and the peptide construction models were identified as shown in Table 1.
Among them, the No. 1 was an octapeptide mode, and the others were the heptapeptide modes. Considering that the C/N terminal residues of these peptide ligands are consisted of hot spots, which have high affinity for IgG, in order to maintain the high affinity between the ligand and antibodies after its immobilization, Cys was considered to insert into the middle region of the ligand, and thus the peptide could be immobilized onto Thiopropyl Sepharose 6B via disulfide bond. Thus, a residue Cys was inserted in the middle region of the peptide sequence to facilitate the peptide immobilization. This can make sure that sufficient flexibility could be offered to the terminal key residues upon peptide immobilization onto the resin, and then the key residues at two terminals could fully play the role of affinity so as to keep its affinity with Fc fragment of IgG. According to the hot spots of SpA and the number of residues inserted between them, finally eight peptide models were obtained (Table 2).
2. Identification of the Kinds of Amino Acid Residues Inserted Between Hot Spots
Fragment location method was used to identify the amino acids inserted between hot spots. Fragment location method is a kind of simulation methods which uses molecular simulation method to identify the best orientation of a specific atom or fragment in the binding cavity. First, the different interaction regions of the binding cavity should be identified, such as electrostatic area, hydrophobic area, hydrogen bonding donor area, and hydrogen bonding acceptor area. Then, based on the principle of chemical environment matching, the ligand molecule that matches the chemical property of the binding cavity is placed there. That is, the ligand molecule fragment placed in the vicinity of the hydrophobic area of the receptor is also a hydrophobic group, like benzene ring and aliphatic hydrophobic chain, etc; the positively/negatively charged area of the receptor should match the negatively/positively charged area of the ligand. First, the Fc fragment region corresponding to the vacant small area between the six hot spots of SpA was selected, and then all the 19 amino acids (except cysteine, in order for later peptide ligand immobilization) were docked to the Fc fragment region successively using molecular docking software Vina. After the amino acids that had high affinity for the vacant area were all determined, their residues were linked with the hot spots to form candidate peptide ligands.
The identification of the kinds of the inserted amino acids was conducted using amino acid location method, and the peptide library was built accordingly.
The criteria for selecting amino acids are listed as follows: firstly, the candidate amino acid should have a proper binding conformation. Namely, the C/N terminal of its docked conformation is located end to end with the NIC terminal of other residues; secondly, the binding free energy is less than −2.0 kcal/mol. The kinds of unknown ‘X’ residues in the peptides were detected as summarized in Table 3.
Perl scripts invoking CHRAMM software are utilized to acquire the peptide library, which includes 2173 peptide sequences, and every peptide sequence contains four hot spots.
1. Docking of Peptides to Fc Fragment Using Vina
That individual residue having strong affinity for Fc dose not necessarily means that the peptide comprised by these residues must have strong interaction with Fc, so Vina was used to dock all the peptides in the library to the “consensus binding site” of Fc fragment sequentially. It is found that all the peptides of the library could bind Fc, and the predicted binding energies of the docked peptides were in the range of −4.5 to −8.2 kcal/mol. This range is in line with the requirement that affinity ligands should have medium affinity (the binding constant is in the range of 104-108 M−1). Among them, the peptides with binding free energy around −6.5 kcal/mol were accounting for the most. In order to avoid missing the promising ligands, 754 peptides with binding free energy less than −6.5 kcal/mol were selected. The docking scores distribution is presented in
It is expected that the hot spots could fully exhibit their affinity for IgG, so it is necessary to seek out the peptide sequences of which the conformations of the hot spots were similar to those of corresponding hot spots in SpA, and therefore the peptide sequences may simulate effective affinity binding formed between SpA and Fc fragment.
Herein, for analyzing the differences between the conformations of the hot spots in peptide ligands and those in SpA, the auxiliary program g_rms provided by the GROMACS molecular simulation package was used to calculate the RMSD values between the hot spots in 754 peptide sequences obtained through vina docking and those corresponding in SpA. Encad all-atom force field was chosen, and RMSD values for key hot spots excluding hydrogen atoms were calculated. The smaller the RMSD value is, the more alike the two conformations of the corresponding hot spots are. The RMSD values are found in the range of 0.2 to 0.6 run (
2. Rescreening Peptides Using FlexPepDock
Common molecular docking softwares like Autodock, DOCK, PatchDock, and MEDdock are only suitable for the docking of small molecules with a few of rotatable bonds. Since peptides have more side chains and more freedom than small organic molecules, therefore the virtual screening using the above softwares will have some limitations. FlexPepDock is a novel software aimed at docking peptides to target proteins using the Monte-Carlo minimization approach, and fully incorporates the flexibility of backbones and side chains of peptides as well as side chain flexibility for the receptor protein. If the binding site and the approximate binding model of the peptide-protein complex are available, FlexPepDock could realize the high-resolution prediction of the binding conformation between the peptide and target protein. The scoring function of FlexPepDock is a kind of generic full-atom energy function, including Lennard-Jones all-atom attractive and repulsive energy, lazaridis-jarplus solvation energy, and hydrogen bonding, etc. Many reports have indicated that the binding surface energy score (I_sc) can mainly evaluate the binding strength of peptide with proteins.
All the 150 peptides obtained from the last round were docked to Fc using FlexPepDock successively. It is found only a few peptides couldn't bind to Fc and the other majority could. The I_sc values of the other majority of peptides were in the range of −5 to −22, and the less the value of I_sc is, the stronger the binding is. Among them, the peptide sequences with I_sc values less than −16 were selected. Two parallel dockings were conducted, and there were 34 peptide sequences whose I_sc values were less than −16 from the first docking result (
So, herein we chose the 15 peptides for further MD simulations to analyze the affinity for IgG.
MD simulation is an effective tool for studying protein dynamic behaviors. Fast inner movement, slower conformational changes, and folding process in protein can all be investigated using MD simulations. In order to further assess the affinity in the IgG-peptide complexes, not only their static structures (using molecular docking) but their dynamic behaviors should be considered. Next, the binding dynamics of the selected 15 peptides with Fc fragment of IgG were studied by MD simulations.
All MD simulations were performed using the GROMACS 4.5.3 package with GROMOS96 53a6 force field. The peptide-protein complexes were put at the center of a rectangular water box, of which the distance from the edge of the box was at least 0.9 nm. A cutoff value of 0.9 nm was used for Coulomb as well as Lennard-Jones interactions. Particle Mesh Ewald (PME) method was used to calculate the long-range electrostatic interactions with a grid-spacing of 0.12 nm and an interpolation order of 4. Temperature (300 K) and pressure (1 bar) were controlled by the v-rescale thermostat and Parrinello-Rahman pressostat, respectively. An integration time step of 2 fs was used together with the LINCS constraint solver for all covalent bonds. First, 50,000 steps of steepest descent method were used to conduct energy minimization for the system. Then the system went through 100 ps of restrained kinetic equilibrium under NVT and NPT ensemble, sequentially. At last, 20 nm of unrestrained dynamic simulation was performed. Structures were saved every 500 ps for analysis, resulting in 40 conformations for total 20-ns simulation. All MD simulations were run on a 64-CPU Dawning A620r-F server.
First, VMD was used to calculate the changes of interaction and relative locations between peptides and Fc fragments, and it was found only FYTHCAKE in the 15 peptides was detached from Fc fragment and got apart from it gradually along the MD simulation. The change of Cα RMSD values over simulation time was found during the study of FYTHCAKE in interaction with Fc, and it was observed that in the whole 20 ns simulation, the Cα RMSD values of both FYTHCAKE and Fc were severely fluctuating and could not achieve stability. Compared with its initial conformation (Flexpepdock docking conformation), the Cα RMSD value of the final conformation reached 0.45 nm. The contact number and minimum distance between atoms of FYTHCAKE and Fc over simulation time were also analyzed. It was found that at around 15 ns, the peptide began to separate from Fc, and meanwhile the contact number abruptly declined from 600 and correspondingly the minimum distance rapidly increased. The two molecules were completely separate from each other at around 18 ns, indicating weak affinity between FYTHCAKE and IgG. In contrast, the other 14 peptides could all bind Fc in the overall MD simulation process, suggesting they could be able to be effective affinity ligands of IgG, which were FYWHCLDE, FYFCRWE, FYIHCLPE, FYYHCKKE, FYCHWALE, FYCHWQDE, FYCHTIDE, FYRHCQRE, FYCHHKTE, FYCHLQKE, FYCHRKAE, FYCHNQDE, FYCHRQEE, and FYNHCASE.
1. Peptide Ligand Immobilization and Preparation of Affinity Column
Peptide powders purchased from GL Biochem Ltd. (Shanghai, China) had been purified by high performance liquid chromatography (HPLC) with a purity of 96.36%. 1 g of resin (Thiopropyl Sepharose 6B, purchased from GE Healthcare) was weighed and washed for 15 min with 200 mL of deionized water in a buchner funnel. After being filtered and getting rid of water, two aliquots of 1.0 g of drained gel were transferred into two 25-mL Erlenmeyer flasks, respectively. Then 6 mL and 10.7 mL of linking buffer (0.1 M Tris-HCl, pH 7.5, 0.5 M NaCl and 1 mM EDTA) were added to the two flasks respectively, to fully pre-equilibrate the gels. Two peptides (FYWHCLDE and FYCHWALE) were randomly selected from the 14 peptides obtained by MD simulations, weighed and dissolved in 500 μL of 50% (v/v) ethylene glycol solution. The peptides were fully mixed with the gels in corresponding flasks and initial peptide concentrations of 1.0 mg/mL were obtained. The mixtures were reacting in a shaking bath at a rate of 180 rpm, at 25° C. for 2 h. 500 μL of supernatant was acquired each hour for analyzing the residue peptide content in the reaction solution with reversed-phase high pressure liquid chromatography (RP-HPLC). When the peptide content did not change any more, 8 mg cysteine was added and then the reaction was going on for 30 min to block the unreacted groups on the gel so as not to disturb the affinity interaction between proteins and the ligands. After the reaction was finished, the reaction system was centrifuged and the supernatant was removed, then the gel was washed repeatedly with washing buffer (10 mM PBS, pH 7.2, 150 mM NaCl) in a buchner funnel to get rid of the free peptides. Finally, equilibriation buffer (20 mmol/L citrate buffers, pH 5.0-5.5; or 20 mmol/L PBS buffers, pH 5.5-6.0) was used to suspend the gel to 1 mL, and after being degassed, the gel was slowly loaded into the glass column (Tricorn chromatogram column, Tricorn 5×5, GE healthcare). The column was first washed with equilibration buffer at 0.1 mL/min, and then the speed of the buffer was increased to 0.2 mL/min when the column pressure was stable, and the speed of the buffer was increased according to above operation till it reached 1.0 mL/min. Till the height of the gel did not drop any longer, the height adjuster should be screwed to the upper surface of the gel column.
2. Determination of Peptide Content in the Supernatant
The residue peptide content in the supernatant was determined with RP-HPLC, and the detailed assay parameters were shown below: mobile phase A, aqueous solution containing 0.1% trifluoroacetic acid (TFA); mobile phase B, acetonitrile solution containing 0.1% TFA; loading amount: 10 μL; flow rate: 0.5 mL/min; detection wavelength: 220 nm. The results showed that almost all FYWHCLDE molecules could be immobilized onto the gel, and the immobilization ratio of FYCHWALE was 93.7%.
3. Affinity Chromatography
(1) Equilibration: the affinity column was washed with equilibration buffer until a stable baseline was reached. The flow rate of buffer was 0.5 mL/min, keep washing another five column volumes (CVs) before sample injection.
(2) Sample injection: unless specifically stated, all protein samples were prepared by equilibration buffer. After sample injection, the column was equilibrated with 5-10 CVs of equilibration buffer (20 mmol/L citrate buffer, pH≤5.5; or 20 mmol/L PBS buffer, pH≥5.5). The sample loading amount was 100 μL, protein sample concentration was 1.0 mg/mL, and the flow rate was 0.5 mL/min.
(3) Elution: the column was washed with 5-10 CVs of elution buffer, and the flow rate was 0.8 mL/min.
(4) Storage: first 10 CVs of deionized water was used to wash the column, followed by another 10 CVs of 20% ethanol solution. Then the affinity gel was stored in this solution. All buffers should be filtered through a filter and degassed before use.
At pH 5.0-6.0, FYWHCLDE peptide ligand had a strong binding of hIgG. On the contrary, as for BSA, the ligand only had a large binding at pH 5.5. The affinity interaction between FYWHCLDE and hIgG mainly involved specific electrostatic interaction. At pH 5.5-7.0, FYCHWALE ligand could bind a lot of hIgGs. However at pH 6.0, it could bind almost all hIgGs. So the best binding buffer for FYCHWALE was 20 mM PBS at pH 6.0. Again, the affinity between FYCHWALE and hIgG mainly involved electrostatic interactions, and therefore the ionic strength of the binding system should be proper and could not be too high, avoiding a decreased binding of antibodies. FYCHWALE ligand could bind about half of bovine serum albumin (BSA) at pH 5.5, ⅓ of BSA at pH 6.0, and trace BSA at pH 6.5. The above results suggested that both the two peptide ligands had specific binding for hIgG at pH 6.0.
4. Protein Content Determination
The total protein concentration in mixed protein solution was determined by Bradford method, and the operation was as follows: a series of 1 mL of BSA solutions with concentrations of 15, 30, 45, 60, 75, and 90 μg/mL were prepared, and the blank control was 1 mL of distilled water. When the data points were determined, 3 mL of coomassie blue G-250 was added to the protein solutions and the control solution, respectively. After fully mixed, their absorbance was determined at 595 nm within 1-2 min. The protein concentrations were plotted against the absorbance values to obtain the protein content standard curve. The IgG concentration standard curve was produced by preparing hIgG solutions of concentrations ranged from 0 to 2.0 mg/mL, and the standard curve was plotted on the basis of the hIgG solution absorbance values at 280 nm using spectrophotometry method. The IgG concentration standard curve was only used to determine the pure protein solution only containing IgG. The absorbance values of collected fractions from protein chromatography were determined at 280 nm and 595 nm, and then the protein contents were identified according to the standard curve.
5. Chromatographic Purification of IgG from Serum Sample
After the affinity gel was fully equilibrated with equilibration buffers (20 mM PBS at pH 6.0 or 6.5), namely all parameters had reached the baseline, 500 μL of serum sample, which was prepared by diluting human serum with 9 time volumes of binding buffer, was loaded. Another 5 CVs of equilibration buffer were loaded to wash the column, and then 0.5 or 0.2 M NaCl in equilibration buffer was used for elution; or 50 mM citrate buffer (pH 3.0) was used as elution buffer. When the elution peak was completely separate, 0.1 M Gly-HCl buffer (pH 2.4) was used to regenerate the affinity gel. The separation results were shown in
6. Sodium Dodecyl Sulfate Polyacrylamide Gel Electrophoresis (SDS-PAGE)
The main solutions:
Sample preparation: 25 μL of raw sample solution was mixed with 25 μL of sample buffer thoroughly. 5 μL of marker and 10 μL of sample were injected. The separation gel was loaded into the electrophoresis groove of SE/250 vertical plate until a distance of 1.5 cm from the glass plate was reached. Then gently cover the separation gel with a sheet of water. When the separation gel was solidified, the water was poured out, the stacking gel was loaded and a comb was inserted. When the stacking gel was solidified, the comb was pulled out. And the samples to be tested were added to the comb holes using a microsyringe. The protein amount of the sample is proper to be 10 μg. The electrophoresis was conducted at 10 mA until the indicator strip was reaching the bottom of the stacking gel, then the electric current was increased to 25 mA until the indicator strip was reaching the bottom of the separation gel, thus electrophoresis was completed. After that, coomassie blue (R-250) staining method was used. This method had high sensitivity, and the minimum detection limit was 0.3˜10 μg. The gel was drenched in the staining solution for 4 h at room temperature to be fixed and stained. Thereafter, the gel was decolored using destaining solution until the background of the gel was close to colorless. Then pictures were taken for the destained gel by Bio-RAP and the results were analyzed with Gel-Pro software.
The flow-through peak fractions and elution peak fractions collected from the human serum separation were analyzed by SDS-PAGE, and the electrophoretograms were shown in
Compared with other methods of designing novel peptide ligands, such as combinatorial chemistry synthetic peptide library screening, phage display peptide library screening, and ribosome display peptide library screening, the biomimetic method for designing peptide ligands of human IgG using molecular simulation introduced in this invention has the following advantages: first, there is no need for complex and costly raw materials. It is easier to handle than experimental operations and less costly; second, it is possible to construct a library including a huge number of candidates, realizing the large-scale high-throughput screening in a real sense. The peptides obtained from screening are not only the affinity ligands of human IgG, but also applicable to other receptor molecules. That is, they also could efficiently purify IgG of pigs and sheep. The molecular docking screening, molecular dynamics simulation rescreening, and experimental verification introduced in this invention can apply to all the peptides in the peptide library. Due to the various limited factors, we could not verify all the peptides in the library by experimental methods, and could only enrich effective candidate peptides with various screening methods, But one thing is for sure that besides the 14 finally obtained peptides, other peptides distributed in the library are also possible to be affinity peptide ligands of IgG. Related researchers can select some of them to conduct experimental verifications if needed. According to the available experimental data, the peptides in the library are predicated to be effective potential affinity peptide ligands of human IgG. It is hoped that the peptide ligands with good performance can be commercialized and therefore benefit humankind,
Number | Date | Country | Kind |
---|---|---|---|
2012 1 0561815 | Dec 2012 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2013/081761 | 8/19/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2014/094444 | 6/26/2014 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7408030 | Carbonell et al. | Aug 2008 | B2 |
Number | Date | Country |
---|---|---|
1634990 | Jul 2005 | CN |
101704879 | May 2010 | CN |
103014880 | Apr 2013 | CN |
2 495 253 | Sep 2012 | EP |
2 495 254 | Sep 2012 | EP |
2005-147610 | Jun 2005 | JP |
2008-141923 | May 2008 | JP |
2009-041860 | Feb 2009 | JP |
Entry |
---|
Ehrlich et al., “Identification of Model Peptides as Affinity Ligands for the Purification of Humanized Monoclonal Antibodies by Means of Phage Display”, Journal of Biochemical and Biophysical Methods, vol. 49, 2001, pp. 443-454. |
Huang, “Rational Design of Affinity Peptide Ligands for Antibody and Chromatographic Characterizations” 2012. |
Dec. 22, 2014 Office Action issued in Chinese Patent Application No. 201210561815.8. |
Li et al., “Design, Synthesis, and Application of a Protein A Mimetic, Nature Biotechnology”, vol. 16, 1998, pp. 190-195. |
Jun. 20, 2014 Office Action issued in Chinese Patent Application No. 201210561815.8. |
Huang et al., “Molecular Mechanism of the Affinity Interactions between Proten A and Human Immunoglobulin G1 Revealed by Molecular Simulations, The Journal of Physical Chemistry” vol. 115, 2011,4168-4176. |
Oct. 21, 2013 Office Action issued in Chinese Patent Application No. 201210561815.8. |
Dec. 5, 2013 International Search Report issued in International Patent Application No. PCT/CN2013/081761. |
Dec. 5, 2013 International Preliminary Report on Patentability issued in International Patent No. PCT/CN2013/081761. |
Fassina et al., “Protein A Mimetic Peptide Ligand for Affinity Purification of Antibodies, Journal of Molecular Recognition”, vol. 9, 1996, pp. 564-569. |
Number | Date | Country | |
---|---|---|---|
20150355192 A1 | Dec 2015 | US |