1. Field of the Invention
This invention relates to the field of molecular modeling and more particularly to the field of calculating solvation energy.
2. Description to the Related Art
In the computational analysis of molecules, inter-atomic forces are modeled to determine the configurations of individual molecules, multi-molecular entities, and the tendencies of separate molecules to form bound complexes. Because these configuration and affinities are highly dependent on the interaction between the modeled molecules and the surrounding solvent molecules (typically water), a variety of models have been devised to account for the molecule-solvent interactions.
One such method involves explicitly including the individual solvent molecules in the area around the molecules of interest. However, this is very expensive computationally, because thousands of solvent-related atomic positions and inter-atomic forces must often be computed, in addition to those involving the molecules of interest themselves. Accordingly, many attempts have been made to simplify this analysis, and to compute a free energy of solvation for a given configuration of molecule or multi-molecular entity, based on principles of electrostatics that treat the solvent as a dielectric continuum in which the molecules of interest are embedded.
The following formula represents one commonly used model of calculating the solvation energy of a molecule:
Gsol=Gpol+Gnp (Eq.1)
In the above formula, Gsol represents the solvation energy of the molecule, Gpol represents the polarization component of the solvation energy (polar solvation energy), and Gnp represents the non-polar component of the solvation energy (non-polar solvation energy). The non-polar component Gnp can be estimated using the following formula:
Gnp=σSA (Eq.2)
In the above formula, SA is the total solvent accessible area of the molecule, and σ is an empirical parameter. The above equation can be modified to a summation over individual atoms of the molecule, as shown in the following equation:
Gnp=ΣiσiSAi (Eq.3)
The polar solvation energy Gpol can be calculated by methods such as the Poisson-Boltzman (PB) method, the generalized Born (GB) method, the analytical continuum electrostatic potential (ACE) method, and so forth.
In the generalized Born method, the polar solvation energy is determined using the following equation:
Gpol=−166(1/∈m−1/∈w)ΣiΣjqiqj/(rij2+aiajexp(−rij2/4aiaj))0.5. (Eq.4)
In the above formula, ∈m is a dielectric constant of the molecule, ∈w is a dielectric parameter of the solvent, qi is the partial charge of the atom i, qj is the partial charge of the atom j, ai and aj are known as the respective effective Born radii of the atoms i and j, and rij is the distance between atoms i and j. In the above formula, the summations are taken over all atoms of the molecule. If the effective Born radius of every atom is established, then the molecule's polar solvation energy can be calculated.
The effective Born radius of a given atom in a molecule may be generally characterized as the van der Waals radius of the atom increased by an amount characterizing the degree to which the atom is screened from interacting with the solvent by the other atoms of the molecule. More screening equates to a larger effective Born radius. A larger effective Born radius equates to a smaller contribution to the molecule's polar solvation energy computed with Eq.4.
The effective Born radius of an atom i is typically calculated by setting the charge on the atom qi to 1, and approximating the polar component of the solvation energy of the atom Gpol,i of the molecule using a number of numerical methods, or an analytical solution when possible. Once the polar solvation energy for the an atom is estimated, the effective Born radius is calculated as follows:
ai=−166(1/∈m−1/∈w)/Gpol,i. (Eq.5)
Many practical methods of calculating a; and Gpol,i use a Coulomb Field approximation:
ai−1=1/4π∫exr−4dV=Ri−1−1/(4π)∫in,r>Rir−4dV (Eq.6)
Gpol,i=−(1/∈m−1/∈w)166/4π∫exr−4dV=−(1/∈m−1/∈w)166(Ri−1−1/4π∫in,r>Rir−4dV) (Eq.7)
The first term −(1/∈m−1/∈w)166Ri−1 may be termed the “self energy” of the solvated atom if regarded as a single ion having one unit positive charge. The term ∫in,r>Rir−4dV in the above equations Eq.6 and Eq.7 is an integration over the region inside the molecule but outside the van der Waals radius of atom i, which evaluates the screening effect of the rest of the molecule. More details of the above equation are provided in Bashford & Case, Ann. Rev. Phys. Chem., 51, 129–152 (2000), which is incorporated herein by reference in its entirety.
According to a method introduced by Still, the polar component Gpol,i for atom i is calculated using the following discrete sum over the atoms of the molecule rather than a volume integral:
Gpol,i=−166(1/∈m−1/∈w)[1/(P0+Ri)−ΣPVj/rij4]. (Eq.8)
In the above equation, Ri is the van der Waals radius of atom i, Vj is the volume of atom j defined by its van der Waals radius, and P0 and P are empirically determined parameters. As shown in the above equations, the polar solvation energy contribution of an atom i is related to its position relative to other atoms j of the molecule.
The determined polar component Gpol,i is used in Eq.6 to determine the effective Born radius for the atom i.
The determined effective Born radii for the atoms are then used in Eq.4 to determine the polar solvation energy of the molecule. With the polar solvation energy determined, the solvation energy can be determined using Eq.1. More details of the generalized Born method and the Still method are provided in U.S. Pat. No. 5,420,805 and Still et al., J. Am. Chem. Soc., 112, 6127–6129 (1990), both of which are hereby incorporated by reference in their entireties.
A variety of modifications and improvements to the original Still formula have been developed which also involve sums over the atoms of the molecule. For example, the following linear pair-wise approximation equation can also be used to derive a polar component of the solvation energy of an atom in a molecule:
Gpol,i=−(1/∈m−1/∈w)[166/(λRvdw,i)+166*P1/(R2vdw,i)+Σbond,jP2Vj/rij4+Σangle,jP3Vj/rij4+Σnonbond,jP4Vj/rij4C(rij, Rvdw,i, Rvdw,j, P5)]. (Eq.9)
In the above equation, λ, P1, P2, P3, P4, and P5 represent empirical parameters. Rvdw,i and Rvdw,j represent the van der Waals radii of atoms i and j. The summation terms are used in Eq.9 to approximate the polar component of an atom i. In Eq.9, the other atoms j are separated into three types: atoms that form direct bonds with atom i (represented by the Σbond,j term), atoms that form indirect bonds with atom i (represented by the Σangle,j term), and atoms that do not form bonds with atom i (represented by the Σnonbond,j term). C(rij, Rvdw,i, Rvdw,j, P5) represents an analytical function that is at least a function of rij, Rvdw,i Rvdw,j, and P5. The other atoms j include all atoms of the molecule that contribute to the polar component of the atom i. In one embodiment for ease of formulation, the other atoms j include all atoms of the molecule. More details of the above equation are provided in Dominy & Brooks, J. Phys. Chem. B, 103, 3765–3773 (1999), incorporated herein by reference in its entirety.
Another pair-wise approximation equation similar to Eq.9 can also be used:
Gpol,i=−(1/∈m−1/∈w)[166/(Rvdw,I+φ+P1)+Σbond,jP2Vj/rij4+Σangle,jP3Vj/rij4+Σnonbond,jP4Vj/rij4C(rij, Rvdw,i, Rvdw,j, P5)]. (Eq.10)
In the above equation, φ, P1, P2, P3, P4, and P5 represent empirical parameters. C(rij, Rvdw,i, Rvdw,j, P5) represents an analytical function that is at least a function of rij, Rvdw,i, Rvdw,j, and P5. More details of the above equation are provided in Qui et al., J. Phys. Chem. B., 101, 3005–3014 (1997), also incorporated herein by reference in its entirety. As described above with respect to the integral formulations, the first term or terms of these equations which do not involve sums over the atoms of the system can be considered the “self energy” terms, and the sums are the “screening effect” terms.
Compared to explicitly including solvent molecules in the simulation, the generalized Born method requires fewer calculations, while producing results similar to those obtained by a more computationally intensive method, such as by using the Poisson-Boltzmann equation. Therefore, the generalized Born method facilitates dynamic simulation with long trajectories for large molecule systems, such as globular proteins and DNA.
However, the existing generalized Born method assumes that the solvent is homogeneous in all directions surrounding the molecular system being modeled. It has thus not been applied to study solvation effects in non-homogeneous media, or in environments having regions outside the solute molecule(s) which have different dielectric constants. As mentioned above, one example of such a system with biological and clinical significance are proteins which are partially embedded in cell membrane and partially in aqueous solution. Trans-membrane proteins are important drug targets, but since trans-membrane proteins are typically partially embedded in a hydrophobic lipid bilayer of cell membrane and partially embedded in a polar aqueous solution, the generalized Born methods described above have not been suitable for modeling trans-membrane proteins. Furthermore, since methods of crystallizing trans-membrane proteins are under-developed, and the results from solid state NMR measurements are still sparse, little is known about the structure of these proteins in native form and the mechanism of their actions. The limited number of 3-D structures limits the use of the homology modeling method for predicting trans-membrane protein structures.
Although other computational methods such as the Poisson-Boltzmann equation can be used to calculate the solvation energy of a molecule in a more complex solvent system, the computational complexity is significant.
In one embodiment, the invention comprises a method of modeling a molecular system comprising a plurality of atoms in an environment having a first region characterized by a first dielectric constant and a second region characterized by a second dielectric constant. The method may comprise estimating a contribution to the polar energy of solvation for an atom using a function which is dependent on the position of the atom with respect to the first region, and which is independent of the position of the atom with respect to other atoms of the plurality of atoms. The estimated contribution may be combined with another contribution to the polar solvation energy which is dependent on the position of the atom with respect to other atoms of the plurality of atoms; and the combined value is used in a Generalized Born equation to estimate the solvation energy of the molecular system.
In another embodiment, the invention comprises a method of estimating a polar component of a solvation energy of a molecule, the molecule being embedded in a first medium and a second medium, the first medium forming a substantially regular geometric shape. In this embodiment, the method comprises for each atom of the molecule, using an analytical function to estimate a first term of contribution to a polar component of a solvation energy of the atom made by the first medium and other atoms of the molecule that are located in the first medium, and determining a second term of contribution to the polar component of the solvation energy of the atom made by others atoms of the molecule that are located at least partially outside the first medium. The first term and the second term are combined to determine the polar component of the solvation energy of the atom. The determined polar components of solvation energies of all atoms of the molecule are used to determine the polar component of the solvation energy of the molecule.
In another embodiment a method of computing configuration energies of molecules or molecular systems is provided. In this embodiment, the molecules or molecular systems being modeled are in an environment including at least a first region characterized by a first dielectric constant and a second region characterized by a second dielectric constant. The configuration energies are computed at least in part by computing atomic solvation energies of atoms in the molecules or molecular systems. The method comprises modeling at least a portion of the atomic solvation energy of an atom as an analytical function which depends on the position of the atom with respect to one of the regions, and which does not depend on the position of any other atoms of the molecule.
Systems for modeling molecules or molecular systems which are configured or programmed to implement the methods disclosed herein are also provided.
Embodiments of the invention will now be described with reference to the accompanying Figures, wherein like numerals refer to like elements throughout. The terminology used in the description presented herein is not intended to be interpreted in any limited or restrictive manner, simply because it is being utilized in conjunction with a detailed description of certain specific embodiments of the invention. Furthermore, embodiments of the invention may include several novel features, no single one of which is solely responsible for its desirable attributes or which is essential to practicing the inventions herein described.
Since trans-membrane proteins, for example G-protein coupled receptors, are important drug targets, it is desirable to develop methods of estimating electrostatic energy terms in membrane-bound molecules. It is also desirable to develop methods that can compute fairly quickly and produce fairly accurate estimates. It is further desirable to develop general methods of estimating the solvation energy and its polar component for a molecule embedded in different media. In addition, it is desirable to develop a method, whose computational complexity of calculating the solvation energy of a molecule in two different media is not significantly higher than calculating the solvation energy of the molecule in a single medium.
The molecule 112 includes a plurality of atoms. The atoms 131–138 and 140 are shown in
As described above, for an atom i of the molecule 102, its solvation energy polar component Gpol,i may be characterized as including a self-energy contribution plus a screening-effect contribution. In conventional Generalized Born models of molecules embedded in a single homogeneous dielectric media, the self energy contribution is the solvation energy of the charged atom without considering the other atoms of the molecule. The inventors have found that the Generalized Born method can be extended to systems having multiple regions of different dielectric media if a portion of what would normally be considered the screening effect contribution is combined with the conventional self energy contribution. When combined in the appropriate way, the conventional self energy expression plus this portion of the screening contribution can be substituted with a relatively simple and easily calculable analytical expression that produces accurate results with little computational expense.
Turning back to the membrane example of
In formulating this approach, it is assumed that the dielectric constant inside the protein is equal to the dielectric constant of the membrane itself. This is likely a good assumption because the low dielectric constant (˜2) inside the hydrophobic part of lipid membranes is consistent with a similar value (˜2) inside the protein if most electrostatic interactions, but not the induced polarization are treated explicitly. As will be described below in connection with
If, as explained above, it is assumed that the dielectric constant of the protein region within the membrane is equal to the dielectric constant of the membrane itself, the expression for Gpol,i of Equation 7 above can be re-written with two separate integrals as follows:
Gpol,i=−(1/∈m−1/∈w)166[Ri−1−1/(4π)∫in,r>Rir−4dV−1/(4π)∫out,r>Rir−4dV] (Eq.11)
In the above equation Eq.11, ∫in,r>Ri r−4dV represents an integration over the membrane slab 112, and ∫out,r>Ri r−4dV represents an integration over the region of the molecule outside the membrane slab 112.
It has been recognized by the inventors that the first two terms of Equation 11 are the self energy of an ion (e.g. atom i with charge set to 1 as described above) in the presence of only the membrane and surrounding aqueous solvent. Because of the symmetry of this solvent system, which may be modeled as an infinite slab of dielectric constant ∈m, embedded in water of dielectric constant ∈w, this self energy can be computed fairly easily using the Poisson-Boltzmann method, and will depend on the distance Zi between the atom with respect to the center of the membrane.
Based on Poisson-Boltzmann computations, the inventors have further found that this self energy can be advantageously modeled with an analytical function Γ(Zi,Ri,L) that can be used to replace the term (R1−1−1/(4π)∫in, r>Ri r−4dV) in Eq.13, which results in the following equation for Gpol,i:
Gpol,i=−(1/∈m−1/∈w)166(Γ−1/(4π)∫out,r>Rir−4dV) (Eq.12)
It will be appreciated that the alternative discrete sum formulations for Gpol,i can also be modified by replacing their self energy terms with an analytical function Γ such as the equation set forth below:
Gpol,i=−(1/∈m−1/∈w)[Γ+Σbond,j,outP2Vj/Rij4+Σangle,j,outP3Vj/Rij4+Σnonbond,j,outP4Vj/Rij4C(rij, Rvdw,j, P5)]. (Eq.13)
One embodiment of the analytical function Γ is a sigmoid type function including an empirical parameter γ:
Γ(Zi, Rvdw,i, L)=gislv+(gcntr(L)−gislv)/{1+exp[γ(|zi|+Rvdw,i, −L/2)]}. (Eq.14)
This sigmoid type function has been found to fit polar solvation self energies calculated using Poisson-Boltzman. In the above equation, zi is the distance from the atom to the membrane midplane, and L is the thickness of the membrane. In one embodiment, the term gcntr of the above equation is the self energy of a centrally located ion. This value has been calculated, and can be determined by the following equation:
gcntr=−332 ln(2)/L. (Eq.15)
More details of the above equation are provided in Parsegian, Nature, 221, 844–846 (1969). The term gislv in Eq.20 is the self energy at an infinite distance from the membrane, and may be set to 166/Ri, which is the usual value for a solvated ion in water.
In the above equation, the summation terms Σbond,j,out, Σangle,j,out, and Σnonbond,j,out are taken over the molecule's atoms that are located outside the membrane slab 112, such as atoms 131, 132, 136–138. Atom 140, which is partially within, and partially outside the membrane can be considered in a variety of ways. In one embodiment, atom 140 is considered in the membrane if its center is in the membrane, and outside the membrane if its center is outside the membrane. In another embodiment, the term Vj represents the volume of the atom j which is located outside the membrane 112. In this embodiment, Vj will depend on Z.
In some embodiments, therefore, a function Vj(Zi) can be used to replace Vj, resulting in the following equation:
Gpol,i=−(1/∈m−1/∈w)[Γ+Σbond,j,outP2Vj(Zj)/Rij4+Σangle,j,outP3Vj(Zj)/Rij4+nonbond,j,outP4Vj(Zj)/Rij4C(rij, Rvdw,i, P5)]. (Eq.16)
The term Vj(Zj), representing a volume of the atom j outside the membrane, is expressed as a function of the distance from the atom j to the membrane. The term is used to include the contribution of an atom j located partially inside the membrane and partially outside the membrane. In one embodiment, Vj(Zj) can be determined by the following equation:
Vj(Zj)=πh2(Rvdw,j−h/3). (Eq.17)
The above function is used as a smoothing function to include in the screening-effect term the contribution of an atom located partially outside the membrane slab 112, such as atom 140 of
Although the parameters such as γ may be pre-selected, in some embodiments, prior to block 304, the process determines the one or more empirical parameters used by the analytical function. The empirical parameters can be determined by fitting the predictions of the analytical formula to the results of accurate Poisson-Boltzmann calculations by adjusting the parameter or parameters of the formula. Thus, in one embodiment, the process determines a parameter γ, which will be used in Eq.20 to define a specific analytical function Γ. More details on determining empirical parameters will be described below in connection with
Referring back to
The atom's solvation energy polar component is determined by combining the atom's self-energy term, as determined in block 304, with the atom's screening-effect term(s), as determined in block 306. The process proceeds from block 306 to block 308, where each atom's effective Born radius is determined. In one embodiment, since the atom's polar solvation energy has been determined, the atom's effective Born radius is determined using Eq.6. The process proceeds from block 308 to block 310. At block 310, the determined effective Born radius of each atom is used, for example in the generalized Born equation of Eq.4, to determine the polar solvation energy of the molecule. The process proceeds from block 310 to an end block 312.
The parameter determination module 404, when present, determines empirical parameters, which are used by the analytical function determination module 406. The parameter determination module 404 receives as input the polar components of atoms of experimental molecules. The received polar components have been determined using other methods, such as the Poisson-Boltzman method. The parameter determination module 404 generates the empirical parameters of the analytical function to produce a desired fit to Poisson-Boltzmann calculations, for example. More details of determining empirical parameters are described below in connection with
The analytical function determination module 406 receives the determined empirical parameters from the parameter determination module 404, and calculates a self-energy term of an atom's solvation energy polar component, based on the analytical function. In one embodiment, Eq.14 is used to determine the atom's self-energy term.
The atom polar component determination module 408 receives the analytical function value determined by the analytical function determination module 406, and calculates the polar solvation energy of an atom.
The molecule polar solvation energy determination module 410 receives the value of each atom's polar component from the atom polar component determination module 408. The molecule polar solvation energy determination module 410 then determines the polar solvation energy of the molecule. In one embodiment, the molecule polar solvation energy determination module 410 determines each molecule atom's effective Born radius based on the atom's polar component, using Eq.6. In another embodiment, the analytical function determination module 406 or the atom polar component determination module 408 directly calculates the effective Born radius of an atom. The module 410 then determines the molecule's polar solvation energy. In one embodiment, the module 410 uses the determined effective Born radii of the atoms and Eq.4 to determine the molecule's polar solvation energy.
In one embodiment, parameter determination is performed by another system, therefore the parameter determination module 404 is omitted from the computing system 402. As those skilled in the art will appreciate, a module can be separated into multiple modules, and multiple modules can be combined into fewer modules, to perform the same functions. In each of these cases, combinations of modules are included within the definition of “module.”
Once the molecule's polar solvation energy is determined, it can be used to determine the total solvation energy of the molecule, using Eq.1. In the case of membrane bound molecular systems, the non-polar component of the total solvation energy is also an important consideration. It has been found that when modeling globular non-membrane bound proteins in aqueous solution, the non-polar component of the solvation energy is much less significant than the polar component, and that the low energy conformation is not very sensitive to this part of the solvation energy. However, for membrane bound proteins, this component of the solvation energy is a significant force attracting the protein to the membrane interior. Therefore, the σSA term becomes more important to accurately estimate. In some embodiments of the invention, a value for σ of 28 cal/Å2 has been used with good results, wherein the solvent accessible surface area is computed with respect to the atoms outside the membrane only.
The molecule's estimated solvation energy in selected conformations and over the course of a dynamic simulation can be used to predict the physical properties of drug-candidate molecules, and to select drug-candidate molecules for further studying or synthesizing. The method and system described in the present application can be used to study the electrostatic effects of macromolecules, trans-membrane proteins, and other molecules located in two different media.
In addition to membrane containing systems, the described method and system can be used to study the electrostatic effect of polymer absorption. The described method and system are particularly suitable when the discrete dielectric regions are a substantially regular geometric shape, for example a plane formed by a membrane, a substantially circular region formed by an oil droplet, a substantially spherical or cylindrical shell region formed by spherical or cylindrical micelles, and the like. When the solvent system has appropriate symmetries, self energies can be calculated and fitted to a simple function of atomic position. In some cases, the functional form may be derivable from electrostatic theory.
Depending on the molecules and solvents used in the parameter determination process, other values of γ may also be used. For example, in one embodiment with a single 1 angstrom ion in a 30A membrane with dielectric parameter of 1, a value of γ=0.92 was found to achieve a good fit with FDPB values. Therefore, a plurality of γ parameters may be used in the above-described analytical function. For example, Eq.14 may be modified to the following equation:
Γ(Zi, Rvdw,i, L)=gislv+(gcntr(L)−gislv)/{1+exp[γi(zi|+Rvdw,i,−L/2)]}. (Eq.22)
In the above equation, γi is related to the atom i. For example, one value of γi may be used for atoms with small radii, and another value of γi may be used for atoms with larger radii. The parameter γi can be a function of the atom radius. However, increasing the number of γ parameters increases the computational complexity. A number of tests have found that a single γ parameter value has achieved reasonably good fits with FDPB values.
As shown in
The foregoing description details certain embodiments of the invention. It will be appreciated, however, that no matter how detailed the foregoing appears in text, the invention can be practiced in many ways. As is also stated above, it should be noted that the use of particular terminology when describing certain features or aspects of the invention should not be taken to imply that the terminology is being re-defined herein to be restricted to including any specific characteristics of the features or aspects of the invention with which that terminology is associated. The scope of the invention should therefore be construed in accordance with the appended claims and any equivalents thereof.
This application claims priority under 35 U.S.C. §119(e) to U.S. Provisional Patent Application No. 60/307,502, titled “METHOD OF ESTIMATING SOLVATION ENERGIES OF MEMBRANE BOUND MOLECULES” and filed Jul. 23, 2001. The above-referenced Provisional Patent Application is hereby incorporated by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5420805 | Still et al. | May 1995 | A |
6226603 | Freire et al. | May 2001 | B1 |
Number | Date | Country | |
---|---|---|---|
20030033129 A1 | Feb 2003 | US |
Number | Date | Country | |
---|---|---|---|
60307502 | Jul 2001 | US |