The present disclosure relates to a thermostabilized mutant-predicting apparatus for membrane proteins, a thermostabilized mutant-predicting method, and a computer program product.
Membrane proteins occupy 30% of all of proteins to be encoded by genomes and play an important role in cell functions such as signal transduction, substance transportation, bio-energy production and conversion. At the same time, since about 60% of commercially available drugs affect membrane proteins, a membrane protein is an important target in medicinal drug discovery. In particular, G protein-coupled receptor (GPCR), which is a receptor for hormones, neurotransmitters and the like, forms about 800 kinds of families, and among them about 280 kinds are estimated as targets for drug discovery.
Recently, for designing and improving effective drugs with less side effect, Structure-Based Drug Design(SBDD) based on the three-dimensional structure of protein as a drug target has been considered effective. However, GPCR has problems like (1) the thermal stability is inferior and thus mass production is difficult; and (2) the hydrophilic surface for crystallization is small and thus crystallization is difficult. As a result, any detailed structure of human GPCR was not obtained until the year 2007.
In 2007, as described in Non-patent literature 1, a group of B. Kobilka and R. Stevens et al. succeeded in X-ray crystallography by fusing T4 lysozyme (T4L) with an intracellular third loop of human adrenergic receptor, thereby thermally stabilizing the receptor and at the same time extending the hydrophilic surface, and crystalizing by a method that is called a lipidic cubic phase method. That is, the problem of the GPCR (2), namely difficulty in crystallization, can be overcome by use of T4L fusion and antibody. However, the problem (1), namely inferior thermal stability to hinder mass production, cannot be solved sufficiently, and even now 90% or more of the GPCR cannot be produced at a large scale.
In Patent literature 1 or the like, StaR (registered trademark) technique is disclosed. According to the StaR (registered trademark) technique, for enhancing the thermal stability, the respective amino acids of GPCR are substituted exhaustively by alanine, any mutation sites that will improve the thermal stability are examined experimentally, these mutation sites are combined to remarkably improve the thermal stability and the crystal structure is analyzed, and the thermal stability for the other type of GPCR is also improved by using the similarity of the GPCR structure.
However, since the StaR (registered trademark) technique requires experimental exhaustive analysis, the effect of substitution of amino acids other than alanine has not been known. Another problem is that this technique requires considerable time and effort to newly perform a similar analysis for any other membrane proteins such as GPCR.
It is an object of the present disclosure to at least partially solve the aforementioned problems in the conventional technology, by providing a thermostabilized mutant-predicting apparatus capable of predicting with computer any amino acid mutant for thermal stabilization in a membrane protein, a thermostabilized mutant-predicting method, and a computer program product.
A thermostabilized mutant-predicting apparatus according to one aspect of the present disclosure is a thermostabilized mutant-predicting apparatus that predicts a candidate of an amino acid mutant for thermal stabilization of a membrane protein, and the thermostabilized mutant-predicting apparatus includes a storage unit and a control unit. The storage unit stores an amino acid sequence of the membrane protein. The control unit includes a mutation-introducing unit that introduces an amino acid mutation into the amino acid sequence of the membrane protein to create an amino acid sequence of the amino acid mutant, a calculating unit that calculates a solvation entropy change for the membrane protein and each amino acid mutant in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence, and, a candidate-extracting unit that extracts a candidate of the amino acid mutant to be thermostabilized based on a difference between the solvation entropy change in the membrane protein and the solvation entropy change in the amino acid mutant.
The thermostabilized mutant-predicting apparatus according to another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit further calculates for the membrane protein and each of the amino acid mutants an energy change in formation of the tertiary structure from the primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving the structural optimization based on the amino acid sequence, and the candidate-extracting unit extracts the candidate of the amino acid mutant to be thermostabilized, based on a change amount as a sum of a difference between the energy change in the membrane protein and the energy change in the amino acid mutant, and a value obtained by multiplying an absolute temperature to the difference between the solvation entropy change in the membrane protein and the solvation entropy change in the amino acid mutant.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit calculates the solvation entropy change by using an integrated methodology of an integral equation theory and a morphometric representation based on four geometric indices of an excluded volume, an accessible surface area, and integrated mean and Gaussian curvatures of accessible surface.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the storage unit further stores structural data of the membrane protein, and the calculating unit performs structural optimization based on the amino acid sequence and the structural data.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit performs the structural optimization while relaxing a constraint stepwise, by first fixing heavy atoms of the membrane protein and minimizing, then fixing Cα carbon and cβ carbon and minimizing, and finally minimizing without fixation.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit calculates, as the solvation entropy change, a difference between solvation entropy of the tertiary structure subjected to the structural optimization before extracting the transmembrane segment and solvation entropy of the secondary structure from which the tertiary structure has been separated.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit calculates, as the solvation entropy change, a difference between solvation entropy of the tertiary structure subjected to the structural optimization after extracting the transmembrane segment and solvation entropy of the secondary structure subjected to the structural optimization after separating the extracted transmembrane segment.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit calculates, as the solvation entropy change, a difference between solvation entropy of the tertiary structure subjected to the structural optimization before extracting the transmembrane segment and solvation entropy of the secondary structure subjected to the structural optimization after extracting the transmembrane segment and separating the transmembrane segment.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit calculates, as the solvation entropy change, a difference between solvation entropy of the tertiary structure subjected to the structural optimization before extracting the transmembrane segment and solvation entropy of the primary structure subjected to the structural optimization after extracting the transmembrane segment and extending the transmembrane segment.
The thermostabilized mutant-predicting apparatus according to still another aspect of the present invention is the thermostabilized mutant-predicting apparatus, wherein the calculating unit calculates, as the solvation entropy change, a difference between solvation entropy of the tertiary structure subjected to the structural optimization after extracting the transmembrane segment and solvation entropy of the primary structure subjected to the structural optimization after extracting the transmembrane segment and separating and extending the transmembrane segment.
The present invention also relates to a thermostabilized mutant-predicting method for predicting a candidate of an amino acid mutant for thermal stabilization of membrane protein, which is executed in a computer including a storage unit for storing an amino acid sequence of the membrane protein and a control unit. The method includes a mutation-introducing step of introducing an amino acid mutation into the amino acid sequence of the membrane protein to create an amino acid sequence of the amino acid mutant, a calculating step of calculating a solvation entropy change in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence for the membrane protein and each amino acid mutant, and a candidate-extracting step of extracting a candidate of an amino acid mutant to be thermostabilized based on a difference between the solvation entropy change in the membrane protein and the solvation entropy change in the amino acid mutant.
The present invention also relates to a computer program product having a non-transitory tangible computer readable medium including programmed instructions for causing, when executed by a computer including a storage unit for storing an amino acid sequence of the membrane protein and a control unit, to perform a thermostabilized mutant-predicting method for predicting a candidate of an amino acid mutant for thermal stabilization of membrane protein. The method includes a mutation-introducing step of introducing an amino acid mutation into the amino acid sequence of the membrane protein to create an amino acid sequence of the amino acid mutant, a calculating step of calculating a solvation entropy change in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence for the membrane protein and each amino acid mutant, and a candidate-extracting step of extracting a candidate of an amino acid mutant to be thermostabilized based on a difference between the solvation entropy change in the membrane protein and the solvation entropy change in the amino acid mutant.
Further, the present disclosure relates to a recording medium on which the program is recorded.
The present disclosure provides an effect of enabling prediction in silico of an amino acid mutation for thermal stabilization in a membrane protein, by storing an amino acid sequence of the membrane protein, introducing an amino acid mutation into the amino acid sequence of the membrane protein thereby creating an amino acid sequence of the amino acid mutant, then, for the membrane protein and each amino acid mutant, calculating a solvation entropy change in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence, and extracting a candidate of an amino acid mutant to be thermostabilized based on a difference between the solvation entropy change in the membrane protein and the solvation entropy change in the amino acid mutant.
Further, the present disclosure provides an effect of enabling higher speed calculation of the solvation entropy by morphometrically simplifying the solute, which is achieved by calculating the solvation entropy change by using an integrated methodology of an integral equation theory and a morphometric representation based on four geometric indices of an excluded volume, an accessible surface area, and integrated mean and Gaussian curvatures of accessible surface.
Further, the present disclosure provides an effect of enabling an accurate optimization of the structure by utilizing known structural data since in the present disclosure the structural data of the membrane protein are stored and the structural optimization is performed based on the amino acid sequence and the structural data.
Further, the present disclosure provides an effect of obtaining a structure predicted more precisely by performing structural optimization while relaxing a constraint stepwise, by first fixing heavy atoms of a membrane protein and minimizing, next fixing Cα carbon and Cβ carbon and minimizing, and finally minimizing without fixation.
Further, the present disclosure can provide an effect of obtaining a comparatively high prediction hit rate (5/11 in an example), by calculating, as a solvation entropy change, a difference between solvation entropy of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy of a secondary structure from which the tertiary structure has been separated.
Further, the present disclosure can provide an effect of obtaining a high prediction hit rate (1/4 in an example), by calculating, as a solvation entropy change, a difference between solvation entropy of a tertiary structure subjected to structural optimization after extracting a transmembrane segment and solvation entropy of a secondary structure subjected to structural optimization after separating the extracted transmembrane segment.
Further, the present disclosure can provide an effect of obtaining a high prediction hit rate (3/11 in an example), by calculating, as a solvation entropy change, a difference between solvation entropy of a tertiary structure subjected to structural optimization before extracting a transmembrane segment and solvation entropy of a secondary structure subjected to structural optimization after extracting a transmembrane segment and separating the transmembrane segment.
Further, the present disclosure can provide an effect of obtaining a high prediction hit rate (5/11 in an example), by calculating, as a solvation entropy change, a difference between solvation entropy of a tertiary structure subjected to structural optimization before extracting a transmembrane segment and solvation entropy of a primary structure subjected to structural optimization after extracting a transmembrane segment and extending the transmembrane segment.
Further, the present disclosure can provide an effect of obtaining a high prediction hit rate (2/4 in an example), by calculating, as a solvation entropy change, a difference between solvation entropy of a tertiary structure subjected to structural optimization after extracting a transmembrane segment and solvation entropy of a primary structure subjected to structural optimization after extracting a transmembrane segment and separating and extending the transmembrane segment.
Exemplary embodiments of a thermostabilized mutant-predicting apparatus, a thermostabilized mutant-predicting method, and a computer program product according to the present disclosure are described below in detail with reference to the accompanying drawings. The present embodiments are not intended to limit the present disclosure.
Hereinafter, for explaining the summary of embodiment of the present disclosure, first an integrated methodology of a morphometric representation and an integral equation theory developed by the inventors will be explained and then the summary, the constitution, the processing or the like of the present embodiments will be explained in detail.
Integrated Methodology
The present inventors focused on an entropy effect caused by translation of water molecules and succeeded in providing a picture of thermal denaturation of protein in an aqueous solution by an integrated methodology of morphometric approach and a statistical mechanics theory for liquids developed by the inventors.
The present disclosure aims to apply the methodology to focus on an entropy effect caused by the translation of CH, CH2 and CH3 groups (mass of these groups may be regarded as a solvent) constituting hydrophobic chains of phospholipid molecules, thereby theoretically predicting a change in solvation entropy of a membrane protein by amino acid substitution. Here, a free energy function F for the membrane protein is expressed by a formula below.
F=−TS+Λ
(T: absolute temperature, S: entropy component, Λ: energy component)
By dividing the above formula by kBT0 (kB: Boltzmann's constant, T0=298K) for nondimensionalization and setting T=T0, a formula below is obtained.
F/(kBT0)=−S/kB+Λ/(kBT0)
Here,
Here, D is a value of energy lowering at the time of forming one intramolecular hydrogen bond. D may be a value of energy lowering (for example, −14kBT0) obtained when formamide forms one hydrogen bond in a nonpolar solvent.
For example, as shown in
Then, every donor and acceptor between a main chain and a main chain, between a main chain and a side chain, and between a side chain and a side chain are examined, the number of intramolecular hydrogen bonds is counted to calculate “Λ=energy lowering (negative) associated with formation of intramolecular hydrogen bond”. It is assumed that acquisition of van der Waals attractive interaction within the protein molecule mutually cancel with loss of the van der Waals attractive interaction between protein and phospholipid.
As shown in
When the hydrocarbon group mass of the phospholipid molecules is regarded as a solvent, S is solvation entropy (entropy loss of a solvent that occurs when a solute with a fixed three-dimensional structure is inserted into the solvent: negative quantity).
In the present embodiment, a high-speed computation may be conducted in calculating the solvation entropy S by using the integrated methodology of the integral equation theory and the morphometric representation devised by the inventors.
According to the morphometric representation, solvation entropy of a three-dimensional structure of a solute can be calculated based on four geometric indices (an excluded volume V, an accessible surface area A, an integrated mean curvature X of accessible surface, and an integrated Gaussian curvature Y of accessible surface). That is, the expression of solvation entropy S is expressed by a linear combination below.
S/k
B
=C
1
V+C
2
A+C
3
X+C
4
Y
Here, an excluded space is “a space which centers of solvent molecules cannot enter”. The volume of the excluded space is the excluded volume V, and the surface area of the excluded space is the accessible surface area A. The excluded space also forms a conjugate of spheres of various radii, and contribution of a sphere having a radius r to X and Y is as follows.
“Contribution to X=Mean curvature 1/r multiplied by ξ accessible surface area ξ of the sphere; Contribution to Y=Gaussian curvature 1/r2 multiplied by ξ”.
According to the morphometrics, coefficients C1, C2, C3 and C4 of the four morphology indices do not rely on the geometric property of the solute, and thus, they can be processed in a simplified form (for example, sphere). Therefore, the form is regarded as a simplified sphere to calculate an entropy loss associated with insertion of spherical solutes having various diameters. In the present embodiment, the hydrocarbon group mass is modeled as a rigid sphere solvent and calculated by using an integral equation theory. According to morphometric representation with respect to the spherical solute, the following formulae are provided.
S/k
B
=C
1(4πR3/3)+C2(4πR2)+C3(4πR)+C4(4π),
R=(dU+dS)/2
Here, dS is a solvent molecule diameter, and dU is a spherical (rigid sphere) solute diameter. S of isolated rigid-sphere solute having various diameters (about 15 patterns within a range of 0<dU≦30dS) is calculated by using the integral equation theory, and C1-C4 are determined by a least squares method applied with the above formulae. Once the C1-C4 are determined, they are applied also to proteins having optional three-dimensional structures. Namely, S is obtained directly from the formulae by only calculating V, A, X, and Y, though C1-C4 rely considerably on the type of solvent and thermodynamic conditions (such as temperature and pressure).
When the solute is protein and a calculation using the above-described four geometric indices is performed for the diameters and x, y, z-coordinates of the constituent atoms (H, C, N, O, and S), S can be calculated at a high speed (within 1 second) even with a standard workstation per three-dimensional structure by the above-described integrated methodology of the statistical mechanics theory and the morphometric approach. When compared with a case of calculating with a three-dimensional integral equation theory in light of the complicated polyatomic structure, the calculation time is about 1/10,000, and the error is less than ±5%. Even if the calculation of Λ is included, the calculation itself of the free energy function F ends within 1 second.
As mentioned above, a high-speed computation with less processing load is available by applying the integrated methodology, which was developed regarding protein in an aqueous solution, to a membrane protein. Hereinafter, the integral equation theory will be explained.
The integral equation theory starts from the system partition function, and derives relational expressions established among various distribution functions (correlation functions) while defining the distribution functions. Regarding the equilibrium structures and physical properties, this process allows analyses of the same level as a computer simulation. Since this theory targets an indefinitely large system and takes the average of physical quantity with respect to an infinite number of microscopic states, the theory is free from problems such as “the system size may be too small; statistical error is inevitable”.
In a case of a bulk solvent composed of a single component, relational expressions established among the distribution functions are solved numerically with input data of temperature, number density and interaction potential among solvent molecules, whereby microscopic structure and various thermodynamic quantities representing macroscopic properties of the solvent can be obtained (which can be extended to a solvent composed of multicomponents).
Solvation properties of a solute (microscopic structure of solvent in the vicinity of solute; the thermodynamic quantity of solvation) also can be analyzed. Here, the thermodynamic quantity of solvation indicates a change in thermodynamic quantity that occurs when a solute (three-dimensional structure is fixed) is inserted into the solvent. In a case of a simple solvent such as a rigid sphere system or Lennard-Jones fluids, a solute having an optional shape and polyatomic structure can be processed directly (three-dimensional integral equation theory). The integral equation theory gains an advantage over a computer simulation in calculation of thermodynamic quantity of the solvation. However, since a considerable amount of computational load is required for solving basic formulae, this is solved by integrating with the above-described morphometric indices in the present embodiment.
Summary of the Present Embodiment
Hereinafter, summary of the present embodiments will be described.
First in the present embodiment, an amino acid sequence of an amino acid mutant where respective amino acid residues of a membrane protein have been substituted by all of the amino acids other than Gly and Pro is created. For example, a membrane protein of a mutant can be obtained by introducing an amino acid mutation into a wild type membrane protein. The amino acid mutation may be an amino acid sequence formed by deleting, substituting or adding one or a plurality of amino acids from/for/to an original amino acid sequence.
Further in the present embodiment, an amino acid sequence of an amino acid mutant where the respective amino acid residues of a membrane protein have been substituted by all of amino acid including Gly and Pro may be created.
Next, in the present embodiment, for each of the amino acid mutants, the solvation entropy change −ΔS in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization is calculated based on the amino acid sequence.
Stage 1 relates to a stage where a membrane protein forms secondary-structure units from its primary structure. More specifically, structural units of α-helices are stabilized individually within the membrane and form as many intramolecular hydrogen bonds as possible (Step 1). In a lipid bilayer membrane, an α helix has the advantage over a β sheet.
Stage 2 relates to a stage where the membrane protein forms its tertiary structure from secondary-structure units within the membrane. More specifically, side chains between structural units of α-helices are closely packed (Step 2). In the present embodiment, a solvation entropy change up to formation of the tertiary structure from the primary structure through the Stages 1 and 2 may be calculated, or solvation entropy change up to formation of the tertiary structure from the secondary-structure units through the Stage 2 may be calculated.
Then, a candidate of an amino acid mutant to be thermostabilized is extracted based on a difference −ΔΔS between the solvation entropy change −ΔSw in a membrane protein wild type and solvation entropy change −ΔSm in the amino acid mutant.
As shown in
As shown in
As shown in
The present embodiment is summarized above. Examples of the apparatus constitution and processing for executing the present embodiment of the present disclosure will be explained in detail below.
Constitution of Thermostabilized Mutant-Predicting Apparatus
Next, the constitution of a thermostabilized mutant-predicting apparatus 100 in the present embodiment will be explained in detail with reference to
As shown in
The various databases and tables (structure file 106a and sequence file 106b or the like) to be stored in the storage unit 106 are storage units like a fixed disc device. For example, the storage unit 106 stores various programs, tables, files, databases and webpages to be used for various processing.
Among these components of the storage unit 106, the structure file 106a is a structural data storing unit that stores structural data of the membrane protein. The structure file 106a may store structural data or the like of a membrane protein that has been input via the input unit 114 and whose crystal structure has been analyzed. Structural data in the structure file 106a may include coordinates or the like of the respective atoms in the two-dimensional space and a three-dimensional space.
The sequence file 106b is a sequence data storing unit that stores sequence data of the membrane protein. The sequence file 106b may store sequence data or the like of the membrane protein that have been input via the input unit 114.
In
Further in
As shown in
The calculating unit 102b is a calculating unit that calculates solvation entropy changes −ΔSw and −ΔSm in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence for the membrane protein wild type and each mutant. The calculating unit 102b may calculate the solvation entropy by using integrated methodology of integral equation theory and morphometric representation based on four geometric indices of an excluded volume V, an accessible surface area A, an integrated mean curvature X of accessible surface, and an integrated Gaussian curvature Y of accessible surface.
For the structural optimization, the calculating unit 102b may perform the structural optimization based on not only the amino acid sequence stored in the sequence file 106b but the structural data stored in the structure file 106a. Further, the calculating unit 102b may perform structural optimization while relaxing a constraint stepwise, by first fixing heavy atoms of the membrane protein and minimizing, and then fixing Cα carbon and Cβ carbon and minimizing, and finally minimizing without fixation. In addition to that, the calculating unit 102b may perform the structural optimization by using any other methods for structural optimization such as Modeller.
The calculating unit 102b may calculate the solvation entropy change by any of Procedure 1 to Procedure 5 below.
Procedure 1: a difference between solvation entropy of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy of a secondary structure from which the tertiary structure has been separated;
Procedure 2: a difference between solvation entropy of a tertiary structure subjected to structural optimization after extracting a transmembrane segment, and solvation entropy of a secondary structure subjected to structural optimization after separating the extracted transmembrane segment;
Procedure 3: a difference between solvation entropy of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy of a secondary structure subjected to structural optimization after extracting a transmembrane segment and separating the transmembrane segment;
Procedure 4: a difference between solvation entropy of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy of a primary structure subjected to structural optimization after extracting a transmembrane segment and extending the transmembrane segment; and
Procedure 5: a difference between solvation entropy of a tertiary structure subjected to structural optimization after extracting a transmembrane segment, and solvation entropy of a primary structure subjected to structural optimization after extracting a transmembrane segment and separating and extending the transmembrane segment.
The candidate-extracting unit 102c is a candidate-extracting unit that extracts a candidate of a mutant to be thermostabilized based on a difference (−ΔΔS(=−ΔSw−(−ΔSm)) between a solvation entropy change −ΔSw in a membrane protein wild type and a solvation entropy change −ΔSm in a mutant. For example, the candidate-extracting unit 102c may determine that the mutant is thermostabilized when −ΔΔS is a negative value and thermally destabilized when −ΔΔS is a positive value. In one example, the candidate-extracting unit 102c may extract a mutant whose −ΔΔS is equal to or less than a certain value as a candidate of a mutant to be thermostabilized. Hereinafter, each sign and the meaning are listed to correspond to each other.
S: solvation entropy (of a certain structure)
−ΔS: a solvation entropy change (from a certain structure to another structure)
−ΔΔS: a difference in a solvation entropy change (between protein before mutation and a mutant)
The above-described constitution is an example of the thermostabilized mutant-predicting apparatus 100 in the present embodiment. The thermostabilized mutant-predicting apparatus 100 may be connected to an external system 200 via the network 300. In this case, the communication-control interface unit 104 performs a communication control between the thermostabilized mutant-predicting apparatus 100 and the network 300 (or a communicating apparatus such as a router). Namely, the communication-control interface unit 104 has a function of communicating data with other terminals via a communication line. The network 300 has a function of interconnecting the thermostabilized mutant-predicting apparatus 100 and the external system 200, and for example, it is internet.
The external system 200 is interconnected to the thermostabilized mutant-predicting apparatus 100 via the network 300, and it has a function of providing external data base relating to various data such as structural data and sequence data, parameter and simulation result data, and a program for allowing a connected information processing apparatus to execute the thermostabilized mutant-predicting method.
The external system 200 may be constituted as a WEB server, an ASP server or the like. The hardware constitution of the external system 200 may be constituted with a commercially available information processing apparatus like work station and personal computers and their accessories. Further, the respective functions of the external system 200 are provided by the CPU, a disc device, a memory device, an input device, an output device, a communication-controlling apparatus and the like, and also a program or the like controlling thereof.
Constitutions in the present embodiment are as explained above.
Processing by Thermostabilized Mutant-predicting Apparatus 100
Next, an example of processing by the thus constituted thermostabilized mutant-predicting apparatus 100 in the present embodiment will be explained hereinafter in detail with reference to the drawings.
First, an example of processing executed by the thermostabilized mutant-predicting apparatus 100 is explained with reference to
As shown in
Then, the calculating unit 102b calculates the solvation entropy changes −ΔSw and −ΔSm in formation of the tertiary structure from the primary structure, or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence, for the membrane protein wild type Wt and the respective mutants Mt (Step SA-2). The calculating unit 102b may calculate the change in solvation entropy by any of Procedures 1 to 5 described below. Further, the calculating unit 102b may calculate the solvation entropy by using an integrated methodology of an integral equation theory and a morphometric representation based on four geometric indices of an excluded volume V, an integrated mean curvature X of accessible surface and an integrated Gaussian curvature Y of accessible surface. Regarding the structural optimization, the calculating unit 102b may perform the structural optimization based on not only the amino acid sequence stored in the sequence file 106b but the structural data stored in the structure file 106a. The calculating unit 102b may perform structural optimization while relaxing a constraint stepwise by first fixing the heavy atoms of the membrane protein and minimizing, then fixing Cα carbon and Cβ carbon and minimizing, and finally minimizing without fixation.
Then, the candidate-extracting unit 102c calculates a difference −ΔΔS(=−ΔSw−(−ΔSm)) between the solvation entropy change −ΔSw in the membrane protein wild type −ΔSw and the solvation entropy change −ΔSm in a mutant Mt (Step SA-3).
Then, the candidate-extracting unit 102c extracts a candidate of the mutant Mt to be thermostabilized, based on the calculated difference −ΔΔS (Step SA-4). For example, the candidate-extracting unit 102c may determine that the mutant is thermostabilized when the −ΔΔS is a negative value, and thermally destabilized when the −ΔΔS is a positive value. In one example, the candidate-extracting unit 102c may extract a mutant Mt having −ΔΔS equal to or lower than a predetermined threshold as a candidate of the mutant Mt to be thermostabilized.
An example of processing by the thermostabilized mutant-predicting apparatus 100 in the present embodiment is as explained above.
Five Kinds of Procedures
Hereinafter, processing of five kinds of Procedures 1 to 5 will be explained specifically in detail with reference to
As a typical example, as shown in
Specifically, the calculating unit 102b may calculate the change in the solvation entropy by any of the following Procedures 1 to 5.
Procedure 1 is a method of calculating a difference −ΔS between solvation entropy S of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy S of a secondary structure from which the tertiary structure has been separated. In this manner, in Procedure 1, repacking closely the side chain of the separated helices is not taken into consideration.
Procedure 2 is a method of calculating a difference −ΔS between solvation entropy S of a tertiary structure subjected to structural optimization after extracting a transmembrane segment, and solvation entropy S of a secondary structure subjected to structural optimization after separating the extracted transmembrane segment.
Procedure 3 is a method of calculating a difference −ΔS between solvation entropy S of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy S of a secondary structure subjected to structural optimization after extracting a transmembrane segment and separating the transmembrane segment. As shown in
Procedure 4 is a method of calculating a difference −ΔS between solvation entropy S of a tertiary structure subjected to structural optimization before extracting a transmembrane segment, and solvation entropy S of a primary structure subjected to structural optimization after extracting a transmembrane segment and extending the transmembrane segment.
Procedure 5 is a method of calculating a difference −ΔS between solvation entropy S of a tertiary structure subjected to structural optimization after extracting a transmembrane segment, and solvation entropy S of a primary structure subjected to structural optimization after extracting a transmembrane segment and separating and extending the transmembrane segment. As shown in
Procedure 1
As shown in
Next, the calculating unit 102b extracts a transmembrane segment alone and calculates its solvation entropy (−Sw) (Step S1-2).
Then, the calculating unit 102b calculates the sum (−S′w) of solvation entropy of structures where their respective helices have been separated (Step S1-3).
Then, the calculating unit 102b calculates the entropy change −ΔSw=−Sw−(−S′w) associated with packing of the helices (Step S1-4).
Meanwhile, for a mutant Mt, the mutation-introducing unit 102a substitutes the amino acid residue of Structure (1) based on the sequence data stored in the sequence file 106b (Step S1-a).
Then, the calculating unit 102b performs structural optimization (Step S1-b).
Then, the calculating unit 102b extracts the transmembrane segment alone and calculates solvation entropy (−Sm) (Step S1-c).
Then, the calculating unit 102b calculates the sum (−S′m) of the solvation entropy of the structures where their respective helices have been separated (Step S1-d).
The calculating unit 102b calculates the entropy change −ΔSm=−Sm−(−S′m) associated with packing of the helices (Step S1-e).
From the above-described result, the calculating unit 102b can calculate the entropy change −ΔΔS=−ΔSm−(−ΔSw) associated with amino acid substitution, which is a difference between the entropy change −ΔSw of the wild type Wt and the entropy change −ΔSm of the mutant.
Procedure 2
Next, the calculating unit 102b performs structural optimization and calculates the solvation entropy (−Sw) (Step S2-2).
Then, the calculating unit 102b separates helices of Structure (2) (Step S2-3).
Then, the calculating unit 102b performs optimization of the structures of their respective helices and calculates the sum (−S′w) of the solvation entropy (Step S2-4).
Then, the calculating unit 102b calculates an entropy change −ΔSw=−Sw−(−S′w) associated with packing of the helices (Step S2-5).
Meanwhile, for a mutant Mt, the mutation-introducing unit 102a substitutes the amino acid residue of Structure (2) (this structure is referred to as Structure (3)) (Step S2-a).
Then, the calculating unit 102b performs structural optimization and calculates the solvation entropy (−Sm) (Step S2-b).
Then, the calculating unit 102b separates the helices of Structure (3) (Step S2-c).
Then, the calculating unit 102b performs optimization of the structures of their respective helices, and calculates the sum (−S′m) of the solvation entropy (Step S2-d).
Then, the calculating unit 102b calculates an entropy change −ΔSm=−Sm−(−S′m) associated with packing of the helices (Step S2-e).
From the above-described result, the calculating unit 102b can calculate an entropy change −ΔΔS=−ΔSm−(−ΔSw) associated with amino acid substitution, which is a difference between the entropy change −ΔSw of the wild type Wt and entropy change −ΔSm of the mutant.
Procedure 3
Next, the calculating unit 102b extracts a transmembrane segment alone and calculates the solvation entropy (−Sw) (Step S3-2).
Then, the calculating unit 102b extracts a transmembrane segment of Structure (1) and separates helix structures (Step S3-3).
Then, the calculating unit 102b performs optimization of the structures of their respective helices, and calculates the sum (−S′w) of the solvation entropy (Step S3-4).
Then, it calculates an entropy change −ΔSw=−Sw−(−S′w) associated with packing of the helices (Step S3-5).
Meanwhile, for a mutant Mt, the mutation-introducing unit 102a substitutes an amino acid residue of Structure (1) (this structure is referred to as Structure (4)) (Step S3-a).
Then, the calculating unit 102b performs structural optimization of Structure (4) (Step S3-b).
Then, the calculating unit 102b extracts a transmembrane segment alone and calculates the solvation entropy (−Sm) (Step S3-c).
Then, the calculating unit 102b extracts a transmembrane segment alone of Structure (4) and separates the helix structures (Step S3-d).
Then, the calculating unit 102b performs optimization of the structures of their respective helices, and calculates the sum (−S′m) of solvation entropy (Step S3-e).
Then, the calculating unit 102b calculates an entropy change −ΔSm=−Sm−(−S′m) associated with packing of the helices (Step S3-f).
From the above-described result, the calculating unit 102b can calculate an entropy change −ΔΔS=−ΔSm−(−ΔSw) associated with amino acid substitution, which is a difference between the entropy change −ΔSw of the wild type Wt and the entropy change −ΔSm of the mutant.
Procedure 4
Next, the calculating unit 102b extracts a transmembrane segment alone and calculates the solvation entropy (−Sw) (Step S4-2).
Then, the calculating unit 102b extracts a transmembrane segment of Structure (1), and creates a completely-extended structure (Step S4-3).
Then, the calculating unit 102b performs optimization of their respective extended structures, and calculates the sum (−S′w) of the solvation entropy (Step S4-4).
Then, the calculating unit 102b calculates an entropy change −ΔSw=−Sw−(−S′w) associated with helix formation and packing from the extended structure (Step S4-5).
Meanwhile, for a mutant Mt, the mutation-introducing unit 102a substitutes an amino acid residue of Structure (1) (this structure is referred to as Structure (4)) (Step S4-a).
Then, the calculating unit 102b performs structural optimization of Structure (4) (Step S4-b).
Then, the calculating unit 102b extracts a transmembrane segment alone and calculates the solvation entropy (−Sm) (Step S4-c).
Then, the calculating unit 102b extracts a transmembrane segment alone of Structure (4) and creates a completely-extended structure (Step S4-d).
Then, the calculating unit 102b performs optimization of the respectively-extended structures and calculates the sum (−S′m) of the solvation entropy (Step S4-e).
Then, the calculating unit 102b calculates an entropy change −ΔSm=−Sm−(−S′m) associated with helix formation and packing from the extended structures (Step S4-f).
From the result, the calculating unit 102b can calculate an entropy change −ΔΔS=−ΔSm−(−ΔSw) associated with amino acid substitution, which is a difference between the entropy change −ΔSw of the wild type Wt and the entropy change −ΔSm of the mutant.
Procedure 5
Next, the calculating unit 102b performs structural optimization and calculates the solvation entropy (−Sw) (Step S5-2).
Then, the calculating unit 102b separates the helices of Structure (2) and creates a completely-extended structure (Step S5-3).
Then, the calculating unit 102b performs optimization of the respective extended structures and calculates the sum (−S′w) of the solvation entropy (Step S5-4).
Then, the calculating unit 102b calculates an entropy change −ΔSw=−Sw−(−S′w) associated with helix and packing from the extended structures (Step S5-5).
Meanwhile, for a mutant Mt, the mutation-introducing unit 102a substitutes an amino acid residue of Structure (2) (Step S5-a).
Then, the calculating unit 102b performs structural optimization of Structure (3) and calculates the solvation entropy (−Sm) (Step S5-b).
Then, the calculating unit 102b separates the helices of Structure (3) and creates completely-extended structures (Step S5-c).
Then, the calculating unit 102b performs optimization of the respective extended structures and calculates the sum (−S′m) of the solvation entropy (Step S5-d).
Then, the calculating unit 102b calculates an entropy change −ΔSm=−Sm−(−S′m) associated with helix formation and packing from the extended structures (Step S5-e).
From the result, the calculating unit 102b can calculate the entropy change −ΔΔS=−ΔSm−(−ΔSw) associated with amino acid substitution, which is a difference between the entropy change −ΔSw of the wild type Wt and the entropy change −ΔSm of the mutant.
Comparison with Experimental Result
Regarding five kinds of amino acid mutations known to be stabilized or destabilized by substitution, prediction results for thermostabilized mutants by Procedures 1 to 5 were reviewed.
As shown in
As shown in
As shown in
As shown in
As shown in
As shown in
Consequently, the prediction results by Procedures 1 to 5 were reviewed for a group of mutants whose thermal stabilization result had been known due to the experimental results of Tate et al. of the StaR (registered trademark) technique. Tate et al. reports amino acid substitution stabilized at 17 positions from an experimental result by alanine scanning. These results were compared with the −ΔΔS values calculated by Procedures 1 to 5.
As shown in
Example of a thermostabilized mutant-predicting apparatus 100 in the present embodiment will be explained with reference to
As shown in
Then, the calculating unit 102b calculates, for the membrane protein wild type Wt and each of the mutants Mt, solvation entropy changes −ΔSw and −ΔSm in formation of a tertiary structure from a primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence, and, for the membrane protein wild type Wt and each of the mutants Mt, calculates energy changes ΔΛw and ΔΛm in formation of the tertiary structure from the primary structure or formation of the tertiary structure from secondary-structure units within the transmembrane segment involving structural optimization based on the amino acid sequence (Step SB-2).
The calculating unit 102b may calculate a change in the solvation entropy by any of Procedures 1 to 5. Further, the calculating unit 102b may calculate the solvation entropy by using integrated methodology of integral equation theory and morphometric representation based on four geometric indices of an excluded volume V, an accessible surface area A, an integrated mean curvature X of accessible surface, and an integrated Gaussian curvature Y of accessible surface.
Regarding the structural optimization, the calculating unit 102b may perform the structural optimization based on not only the amino acid sequence stored in the sequence file 106b but the structural data stored in the structure file 106a. Further, the calculating unit 102b may perform the structural optimization while relaxing a constraint stepwise, by first fixing heavy atoms of the membrane protein and minimizing, then fixing Cα carbon and Cβ carbon and minimizing, and finally minimizing without fixation.
Hereinafter, an example of energy calculation in this Example will be explained with reference to
In this Example, a free energy function F for a membrane protein is expressed by a formula below.
F=−TS+Λ
(T: absolute temperature, S: entropy component, Λ: energy component)
The above formula is divided by kBT0 (kB: Boltzmann's constant, T0=298K) for nondimensionalization, and T is set as T=T0 to provide a formula below.
F/(kBT0)=−S/kB+Λ/(kBT0)
As shown in
Further as shown in
Every donor and acceptor between a main chain and a main chain, between a main chain and a side chain, and between a side chain and a side chain are examined, and for each of which D is calculated and the total sum is taken to calculate “Λ=energy lowering (negative) associated with intramolecular hydrogen bond formation”. It is assumed that acquisition of van der Waals attractive interaction within the protein molecules mutually cancels with loss of the van der Waals attractive interaction between protein and phospholipid.
Hereinafter, an example of a step of calculating Λ in this Example will be explained with reference to
As shown in
Next, the calculating unit 102b extracts a transmembrane (intercadence) segment alone and calculates the energy (Λw) (Step S6-2).
Then, the calculating unit 102b extracts a transmembrane segment of Structure (1), and acquires a virtually-supposed completely-extended structure (Λ′w=0) (Step S6-3).
Then, the calculating unit 102b calculates an energy change ΔΛw=Λw−Λ′w associated with helix formation and packing from the virtually-supposed completely-extended structure (Λ′w=0) (Step S6-4).
Meanwhile, for a mutant Mt, the mutation-introducing unit 102a substitutes an amino acid residue of Structure (1) (this structure is regarded as Structure (4)) (Step S6-a).
Then, the calculating unit 102b performs structural optimization of Structure (4) (Step S6-b).
Then, the calculating unit 102b extracts a transmembrane segment alone and calculates the energy (Λm) (Step S6-c).
Then, the calculating unit 102b extracts a transmembrane segment of Structure (4) and acquires a virtually-supposed completely-extended structure (Λ′m=0) (Step S6-d).
Then, the calculating unit 102b calculates an energy change ΔΛm=Λm−Λ′m associated with helix formation and packing from the virtually-supposed completely-extended structure (Λ′m=0) (Step S6-e).
From the result, the calculating unit 102b can calculate an energy change amount ΔΔΛ=ΔΛm−ΔΛw associated with amino acid substitution of energy lowering caused by folding, which is a difference between the energy change ΔΛw of the membrane protein wild type Wt and the energy change ΔΛm of the mutant.
Returning to
Then, the candidate-extracting unit 102c extracts a candidate of the mutant Mt to be thermostabilized, based on the sum ΔΔF of the calculated ΔΔΛ and −TΔΔS (Step SB-4). For example, the candidate-extracting unit 102c may determine that the mutant is thermally stabilized when ΔΔF (change amount associated with amino acid substitution of free energy lowering of a system by folding) is a negative value, and that the mutant is thermally destabilized when ΔΔF is a positive value. In an example, the candidate-extracting unit 102c may extract a mutant Mt whose ΔΔF is equal to or less than a predetermined threshold, as a candidate of the mutant Mt to be thermostabilized.
An example of prediction result in the present embodiment will be explained below with reference to
First, among all of the amino acid substitutions performed by using Modeller, five amino acid substitutions (S91R, S91K, L85R, N280R, and N181K) in the order of having smaller ΔΔF value calculated based on Procedure 1 were selected as amino acid substitutions predicted to be stabilized.
Then, as shown in
Namely, as shown in
Then, these five amino acid substitutions were subjected to a verification experiment for checking whether they are experimentally stabilized. As shown in
Further, as shown in
Namely, as shown in
Then, these five amino acid substitutions were subjected to a verification experiment for checking whether they are experimentally stabilized. As shown in
In this manner, it was clarified that both of the thermostabilized mutant prediction using −ΔΔS and the thermostabilized mutant prediction using ΔΔF can provide high prediction success rates in this Example.
The present embodiment of the present disclosure has been explained so far. Besides the foregoing embodiment, the present disclosure can also be carried out in various different embodiments within the scope of the technical idea described in the claims.
For example, the thermostabilized mutant-predicting apparatus 100 may perform processing in a standalone mode, or may perform processing according to a request from a client terminal and then return the results of the processing to the client terminal.
Out of the processes explained in relation to the present embodiment, all or some of the processes explained as being automatically performed may be manually performed, or all or some of the processes explained as being manually performed may be automatically performed by publicly known methods.
Besides, the process steps, the control steps, the specific names, the information including registered data for the processes or parameters such as search conditions, the screen examples, or the database configurations described or illustrated herein or the drawings can be appropriately changed if not otherwise specified.
The constituent elements of the thermostabilized mutant-predicting apparatus 100 shown in the drawings are conceptual functions and do not necessarily need to be physically configured as shown in the drawings.
For example, all or any part of the processing functions included in the units of the thermostabilized mutant-predicting apparatus 100, in particular, the processing functions performed by the control unit 102 may be implemented by the CPU or programs interpreted and executed by the CPU, or may be implemented by wired logic-based hardware. The programs including programmed instructions for causing a computer to execute methods according to the present disclosure described later are recorded in non-transitory computer-readable recording media, and are mechanically read by the thermostabilized mutant-predicting apparatus 100 as necessary. Specifically, the computer programs for giving instructions to the CPU to perform various processes in cooperation with OS are recorded in the storage unit 106 such as a read-only memory (ROM) or a hard disc drive (HDD). The computer programs are loaded into the random access memory (RAM) and executed, and constitute a control unit in cooperation with the CPU.
The computer programs may be stored in an application program server connected to the thermostabilized mutant-predicting apparatus 100 via an appropriate network 300, and may be entirely or partly downloaded as necessary.
The programs according to the present disclosure may be stored in computer-readable recording media or may be formed as program products. The “recording media” include any portable physical media such as a memory card, a USB memory, an SD card, a flexible disc, a magneto optical disc (MO), a ROM, an erasable programmable read only memory (EPROM), an electrically erasable programmable read only memory (EEPROM), a compact disc read only memory (CD-ROM), a digital versatile disc (DVD), and a Blu-ray (registered trademark) disc.
The “programs” constitute data processing methods described in an appropriate language or by an appropriate describing method, and are not limited in format such as source code or binary code. The “programs” are not limited to singly-configured ones but may be distributed into a plurality of modules or libraries or may perform their functions in conjunction with another program typified by an OS. Specific configurations for reading the recording media by the units according to the present embodiment, specific procedures for reading the programs, or specific procedures for installing the read programs may be well-known configurations or procedures.
The various databases and others (structure file 106a, sequence file 106b or the like) stored in the storage unit 106 may be storage units such as any one, some, or all of a memory device such as a RAM or a ROM, a fixed disc device such as a hard disc, a flexible disc, and an optical disc, and may store any one, some, or all of various programs, tables, databases, and web page files for use in various processes and web site provision.
The thermostabilized mutant-predicting apparatus 100 may be an information processing apparatus such as a well-known personal computer, and appropriate peripherals may be connected to the information processing apparatus. The thermostabilized mutant-predicting apparatus 100 may be embodied by providing the information processing apparatus with software (including programs, data, and the like) for implementing the methods according to the present disclosure.
Further, the specific modes of distribution and integration of the devices are not limited to the ones illustrated in the drawings but all or some of the devices may be functionally or physically distributed or integrated by a predetermined unit according to various additions and the like or functional loads. That is, the foregoing embodiments may be carried out in any appropriate combination or may be selectively carried out.
As described above in detail, the present disclosure can provide a thermostabilized mutant-predicting apparatus, a thermostabilized mutant-predicting method and a computer program product capable of suppressing increase in calculation time even when an elongation method is applied to a two-dimensional system or a three-dimensional system, and thus the present disclosure is remarkably useful in various fields such as novel material researches, medical studies, pharmacy, drug discovery, chemical studies, biological studies and clinical examination.
Number | Date | Country | Kind |
---|---|---|---|
2014-130560 | Jun 2014 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2015/068277 | 6/24/2015 | WO | 00 |