The present disclosure generally relates to computer-implemented methods and systems of determining material properties of molecular quantum mechanical systems using artificial intelligence methods. In particular, but not by way of limitation, some embodiments described herein relate to computer-implemented methods and systems of determining material properties of molecular quantum mechanical systems according to artificial intelligence methods including, for example, using knowledge processing systems, such as neural networks, where a reasoning technique is applied to a collection of facts and relationships (i.e., knowledge representation).
Molecular simulation is used to predict experimental observables of molecular systems. For example, molecular simulations may include force field models of a molecular system that represent the energy of the system (i.e., the Hamiltonian) which can be used to determine properties of the molecular system. Artificial intelligence methods such as supervised, unsupervised, and reinforcement learning have been employed to classify and predict physical, chemical, and biological systems. However, using artificial intelligence (AI) techniques to devise interaction models or force field (FF) models for complex molecules has been challenging, especially for those strongly affected by electrostatic and polarization effects. Furthermore, applying AI techniques to large molecular systems has also proven to be difficult. Additionally, some known molecular force field models and simulation methods that do not use quantum mechanical description (e.g., analytical force field models) may either have limited coverage of molecular systems or be unable to determine properties with an acceptable level of chemical accuracy.
Accordingly, a need exists for a wide coverage force field model and a molecular simulation method based on artificial intelligence techniques that are transferable and extendable to previously un-benchmarked chemical species, have little and/or minimum reliance on experimental data, and predict experimentally observable properties of molecular systems within an acceptable degree of chemical accuracy.
In some embodiments, a method includes determining, at a processor, a first set of atomic interactions between a first set of atoms and a second set of atoms using a molecular interaction model (EPh). The EPh can include responses to an external electric field, an inductive accounting of many-body non-additive interactions, and a representation of electrostatic interactions and electrostatic response interactions. The method further includes providing a set of inputs to a set of neural network models (dENN) to determine a second set of atomic interactions between the first set of atoms and the second set of atoms. Each input from the set of inputs is a representation of at least one atom of the first set of atoms or the second set of atoms. The method includes training the dENN by reducing an error between (1) an interaction produced by a combination of the first set of atomic interactions and the second set of atomic interactions and (2) an interaction between the first set of atoms and the second set of atoms computed using quantum mechanical methods. The method further includes modifying the EPh by reducing an error between (1) the interaction produced by the combination of the first set of atomic interactions and the second set of atomic interactions and (2) the interaction between the first set of atoms and the second set of atoms computed using quantum mechanical methods. The method includes receiving a representation of a third set of atoms, inputting the representation of the third set of atoms into the EPh and the dENN to output a prediction associated with an energy or a state of the third set of atoms, and sending a signal to a user device to present the prediction.
A force field is a method that can be used to estimate the forces between atoms within molecules and also between molecules. The force field refers to the functional form and parameter sets used to calculate the potential energy of a system of atoms or coarse-grained particles in molecular mechanics, molecular dynamics, or Monte Carlo simulations. The parameters for a chosen energy function can be derived or adjusted from experiments in physics and chemistry, calculations in quantum mechanics, or both.
The force field parameters and interaction rules in applications to molecular systems can define the energy landscape from which a number of results of interest can be derived. For example, the acting forces on particles can be derived as a gradient of the potential energy with respect to the particle coordinates. For another example, the resulting equations of motion can be propagated forward in time and time averages can predict experimental observables.
Prediction of experimentally observable ensemble and non-ensemble properties of a material system may be obtained from a molecular force field model of the material system. An ensemble property may be obtained via molecular simulation of the model, for example, molecular dynamics (MD), path integral molecular dynamics (PIMD), or Monte Carlo (MC). The way to determine the ensemble properties from the Hamiltonian at non-zero temperature can be, for example, via various thermodynamic sampling methods such as molecular dynamics (MD), path integral molecular dynamics (PIMD), or Monte Carlo (MC).
The molecular interaction models can be roughly separated into two categories: scope limited models that benchmark their interaction energies with Quantum Mechanics (QM) calculations, and wide coverage models that fit at least some of their parameters to selected sets of experimental data. Some known QM parametrized models can employ high functional complexity to reproduce QM energies. The functional complexity of these known QM-parameterized models, however, can make transferability and extending coverage to other chemical species and hetero-chemical ensembles difficult.
Some known wide-coverage molecular force field models derive some or all of the parameters by fitting to empirical observables. There are at least two drawbacks to this approach. First, even available experimental data (e.g., densities, heats of vaporization) can be insufficient to produce models that describe existing compounds precisely; and there can be molecules (for example, molecules that have not been synthesized) that require and/or use more precise description than available data from existing inference. Second, if an empirical model's prediction is erroneous, it can be difficult to decide which parameter(s) of the model to remove, add, correct or adjust.
In a wide coverage force field, a chemical group can be considered described when both its self-interactions and its interactions with other described group(s) are determined. A Force Field with wide coverage can, in some examples, fully describe the interactions between most and/or all chemical groups. In the case of a protein, because of the importance of biology, wide coverage force fields can include the interaction of amino acids and backbone, for example, both between themselves, water, and non-polar solvents. Furthermore, in some instances, the force field can describe the interactions between a significant number of major chemical groups, e.g., alkanes aromatics amides amines sulfides esters, in both homo (self) and hetero (between any distinct number of) interactions. More specifically, in some instances, the force field can describe the behavior of these groups in a variety of solvents: H2O, CHEX (cyclohexane), OCT (octanol), Benzene, etc.
Finally, the simulation results produced with the force-field can correctly describe the entropic behavior of major solvents—H2O, CHEX, OCT, benzene, in order to capture the entropic component of solvation. Moreover, the results can correctly describe the entropic behavior of arbitrary molecular systems.
Some known fixed charge force field models lack flexibilities in the functional form to fit the experimental results well and frequently the errors are uncontrollable. The parameters of these known fixed charge force field models that are determined by QM energies are not well defined, as these models do not have polarization, and are, therefore, unable to transfer the results obtained by QM (in the gas phase) into the liquid phase or the solid phase (condensed phase). Furthermore, when fitting to experimental results, these known models do not have clear guidance on which parameters should be corrected, and therefore errors also arise.
One of the goals of a molecular simulation is to accurately reproduce the true quantum mechanical potential energy with a tractable classical approximation. In general, analytical expressions may not represent the intermolecular quantum potential energy surface at close distances and in strong interaction regimes. Alternatively, a description based on a neural network may be used. However, in some cases, the more accurate neural network approximations may not capture crucial physics concepts (e.g., many-body contributions and application of electric fields).
For such cases, a hybrid intermolecular interaction model may be used. The intermolecular interaction model may include analytical description of the polarizable force-field combined with a short-range neural network correction for the total intermolecular interaction energy. The correction is sub-typed for distinct chemical species to match the underlying force-field, to segment and reduce the amount of quantum training data, and to increase accuracy and computational speed. Simulations of molecular systems using such interaction model may determine properties of complex molecular systems at a fraction of time of some known simulation models.
Example embodiments are illustrated in the drawings. It is to be understood that the embodiments are described only to facilitate better understanding of the embodiments, rather than to limit the scope of the embodiments in any manner.
The memory 120 can be, for example, a random-access memory (RAM) (e.g., a dynamic RAM, a static RAM), a flash memory, a removable memory, a hard drive, a database and/or so forth. In some implementations, the memory 120 can include (or store), for example, a database, process, application, virtual machine, and/or other software modules (stored and/or executed in hardware) and/or hardware modules configured to execute various molecular simulation process(es) as further described below. In such implementations, instructions for executing various molecular simulation process(es) and/or the associated methods can be stored within the memory 120 and executed at the processor 110.
The processor 110 can be, for example, a system of parallel processors, a system of GPUs, and the like, configured to make computations in parallel. For example, processor 110 may be a blade server having multiple processors such as, for example, multiple Intel processors (e.g., Xenon processors), AMD processors (e.g., Ryzen or EPYC processors), and the like. The processor 110 may use any suitable operating system (e.g., Linux, Window, Mac OS, Unix, and the like) to perform various processing functions. Such functions may include, for example, writing data into and reading data from the memory 120, and executing various instructions stored within the memory 120. The processor 110 can also be configured to execute and/or control, for example, the operations of other components of the MAD 100 (such as a network interface card, other peripheral processing components (not shown)).
The memory 120 is configured to store models for performing various molecular simulation process(es). In particular, the memory 120 is configured to store a molecular interaction model (MIM) 124 (herein and elsewhere, MIM is also referred to as EPh) and a computer learning model (CLM) 126 (herein and elsewhere, CLM is also referred to as neural network model or dENN). The MIM 124 and the CLM 126 can be executed by the processor 110. Accordingly, the MIM 124 and the CLM 126 can be software models stored in memory 120 and executed by processor 110. In other embodiments, the MIM 124 and CLM 126 can be hardware-based models.
The MIM 124 provides a physics-based model of a multidimensional potential-energy surface (PES) associated with atomic coordinates in the molecular system. Physical-based models include classical force fields for calculating the total energy of a molecular system. The total energy of the molecular system may be decomposed into low-dimensional bonding two-, three-, and four-body terms representing covalent bonds, bonding angles, and dihedral angles. In addition, electrostatic and van der Waals interactions given by Coulomb's law and the Lennard-Jones potential, may be used. For example, in some cases of physics-based models the total energy may include bond energy terms (e.g., bond energy may be a continuous function of interatomic distance and describes the energy associated with forming bonds between atoms), energy terms associated with three-body valence angle strain and four-body torsional angle strain, as well as electrostatic and van der Waals energy terms.
When computing low-dimensional bonding terms, the immediate environment may be taken into account. In various cases, the bonding properties are strongly influenced by the positions of the neighboring atoms, thus, additional information on the bonding properties of the atoms is included via the introduction of atom types classifying the atoms according to functional groups. In this way, total energy expressions with low-dimensional, approximately additive energy terms can be obtained. Further, reactive force fields (e.g., ReaxFF method) are used for describing the making and breaking of bonds. Some atomistic potentials in the field of materials science are based on the concept of the bond order to take into account the effect of the atomic environments on the bonding properties. In many problems of materials science involving systems such as metals and alloys a good description of very different atomic environments is important due to the importance of many-body contributions to the potential-energy.
In some cases, MIM 124 may include two-body non-bonded interactions including electrostatic, exchange-repulsion and dispersion terms. The electrostatic interaction arises from the attraction between atoms with opposite charges and the repulsion between atoms with like charges. This interaction is described by Coulomb's law. The exchange-repulsion interaction arises from the overlap of the electron clouds of two atoms. This overlap creates a repulsive force that is proportional to the overlap volume and the fourth power of the distance between the atoms. This interaction is typically described by a Lennard-Jones potential, which combines an attractive term that describes the van der Waals interaction between the atoms and a repulsive term that accounts for the exchange-repulsion interaction. The electrostatic and exchange-repulsion terms are multipolar, and their radial dependence may be exponential so that they are better able to describe charge penetration and delocalization. The dispersion interaction arises from the fluctuations in the electron density of an atom, which induce corresponding fluctuations in the electron density of neighboring atoms. This interaction results in an attractive force between the atoms whose leading term is proportional to the inverse sixth power of the distance between them. In molecular mechanics calculations, the dispersion interaction between two atoms is typically described by a function that depends on their polarizabilities and the distance between them. The dispersion term may be conventionally represented by a spherical, 2 term (C6 and C8), Tang-Toennies-damped interaction. Many-body effects can be modeled by anisotropic atomic polarizable dipoles interacting with the electrostatic term and with each other and iterated to self-consistent field (SCF) convergence on the non-bonded steps. The intermolecular parameters of MIM 124 may be determined by agreements with QM values of dimer and multimer energies, electrostatic potentials, and multipole moments of monomers. Further, MTM 124 may include bonded interactions that are fitted to monomer QM energies computed using df-MP2/aug-cc-pVTZ basis set.
In addition to physics-based model of a multidimensional potential-energy surface (PES), neural network-based (NN-based) models may be used to reproduce or encode atomic interactions. Some known NN-based models may be unable to incorporate effects which traditional physics has been built to represent such as, a response to an external electric field, the ability to extrapolate from 2 or 3 molecules to arbitrarily many, and a proper treatment and truncation of long-range interactions. Furthermore, the accuracy of the reproduction of intermolecular potentials by some known NN-based models has plateaued at −1 kcal/mol. Therefore, a combined hybrid intermolecular interaction model may be used. For example, as shown in
The CLM 126 may be any suitable machine learning model for determining interactions between atoms of a molecular system. In some implementations, for example, the CLM 126 may be a system of neural networks (NN) incorporating description of a molecular neighborhood for determining atomic interactions. The CLM 126 may be trained using input data that includes reference data describing molecular structures and their corresponding energies. Further, CLM 126 may be validated using a suitable validation data set (e.g., the validation data set may include various molecular structures and their corresponding energies). In some cases, CLM 126 may be represented by a multilayer feed-forward (MLFF) NN. Such NNs can be useful for the construction of PESs due to their ability to represent arbitrary functions. Such NNs offer a number of advantages for the construction of PESs as energies of molecular structures can be fitted to a high degree of accuracy when compared to the underlying reference data. CLM 126 that uses NNs may be unbiased and generally applicable to various types of bonding and may not require system specific modifications.
In various embodiments, a demand for accuracy may be particularly important for protein-ligand interactions. For example, the energies of interest (interacting ligand and protein functional groups) are diverse and frequently quite strong. For another example, because the system is a liquid crystal and its phase space is constrained, the PES errors may not average out as in a liquid, and the model interactions can correctly encompass the distances and orientations. Consequently, the accuracy of each component of a useful prediction of the free energy of ligand binding can be significantly higher than the desired accuracy of the final answer (˜±0.5 kcal/mol). The components' agreement with Quantum Mechanics may be as narrow as the uncertainty of the underlying ab-initio calculations.
In various embodiments, the MIM 124 and CLM 126 are configured to be used together to describe an interaction of a group of atoms (e.g., atoms forming a molecular system) with predefined accuracy. In various embodiments, CLM 126 may be used for improving approximation of the MIM 124. For example, CLM 126 may be used for correcting a polarizable force field for (molecular) pair interactions at medium-short distances. In some cases, the correction encoding/computational mechanism represented by CLM 126 may be performed using neural networks (NNs). The combination of MIM 124 and CLM 126 is configured to provide energy predictions in agreement with quantum mechanical (QM) energies. Various aspects of a molecular interaction model similar to MIM 124 are described in U.S. patent application Ser. No. 17/967,451, filed on Oct. 17, 2022, and titled “System and Method for Determining Material Properties of Molecular Systems from an Ab-Initio Parameterized Force-Field,” which is incorporated by reference herein in its entirety. The interaction energy is formally assigned to (atomic) pair interactions (e.g., interactions between an atom A and an atom B) and takes into account the molecular neighborhoods of atom A and atom B therefore taking into account interactions with other atoms in the neighborhood of atoms A and B.
It should be noted MIM 124 and CLM 126 may be combined in a variety of ways to describe the interaction of a group of atoms with the predefined accuracy (e.g., accuracy which matches quantum mechanical predictions). For example, MTM 124 may be used for long range interactions and CLM 126 may be used for short range interactions. Corrections due to CLM 126 may be range dependent (e.g., corrections due to CLM 126 may be weighted by a function that is range dependent). Alternatively, MIM 124 may be used for describing interactions between a first set of atoms and CLM 126 may be used for describing interactions between a second set of atoms. In some cases, MIM 124 may be used for describing interactions between atoms of a molecular system (e.g., molecular system that includes two or more interacting molecules) and CLM 126 may be used to describe atomic interactions between a subset of atoms of that molecular system (e.g., atoms between the two molecules within the molecular system that are in proximity to each other). Further details of how MIM 124 and CLM 126 may be combined are discussed below in relation to methods shown in
In some implementations, the hybrid intermolecular interaction model (e.g., a model that combines MTM 124 and CLM 126) can address the following:
Quantum mechanical calculations can become computationally expensive when molecules (or molecular systems) larger than a certain limited size (e.g., a few tens of atoms, a few hundreds of atoms, and/or the like) are considered. Therefore, in some implementations, analytical and computer models (e.g., MIM 124 and CLM 126) can be constructed using self-contained sub-pieces, which may be ‘stitched together’ along, for example, rotatable bonds, by fitting the ‘joining piece’ using monomer calculations or previous typification. For example, anthracene (3 joined aromatic rings) has the atom types identical to benzene except the 4 atoms at the joining sides, which are identical to naphthalene (2 joined aromatic benzene rings), thus, forces between atoms of anthracene may be calculated using MIM 124 and/or CLM 126 based on force calculations made for atoms of benzene.
When using MIM 124, it may be possible to identify many-body contributions from dimer calculations or very few multimer calculations by using an established physics theory and mechanism (polarization). In various cases, neural networks (NNs) may be too flexible and may be both provided and presented in training by the possible conformations to be encountered. If one needs to train many-body to a NN, the inputs can respond to and represent the multi-molecule interactions by including various permutations of such permutations, including, for example, various configurations of 4-mers.
Further, a proper description of electrostatics as well as polarization allows a system to parametrize the models and then conduct correct simulations, for example, in the presence of an electric field (e.g., in a battery). An example NN for interactions without such physics mechanisms can be trained using a large number of possible electric field configurations (e.g., electric fields both internal and externally applied). Thus, having CLM 126 for proper description of electrostatics may be computationally demanding. Therefore, in various implementations, MIM 124 may be used instead of CLM 126 for describing such interactions to reduce computation costs.
When describing a high-dimensional molecular system such as a molecular system having more than a few atoms (e.g., a molecular system with 5-10 atoms, with a tens of atoms, with hundreds of atoms or even with thousands of atoms), using a single neural network to describe interactions of the atoms in the molecular system may result in slow training times for such a network, thereby, CLM 126 may be formulated based on and/or include multiple NNs. Each NN can be configured to determine an energy contribution from a single atom to a total energy of the molecular system, and energy contributions from the atoms in the molecular system are then summed to obtain a total energy of the molecular system. Thus, to obtain a fitted high-dimensional PES, the total energy E of a molecular system can be decomposed into atomic energy contributions Ei of each atom i, which are predicted by separated species-specific neural networks.
Further, when calculating interactions between a pair of atoms A and B, atoms ai in a spherical neighborhood of atom A may be selected to describe the atomic environment functions GA of atom A, and atoms bj in a spherical neighborhood of atom B may be selected to describe the atomic environment functions GB of atom B. The extent of neighborhoods of atoms A and B may be determined by a cutoff radius RC, as schematically shown in
The cutoff radius RC can be chosen sufficiently large such that accurate energy predictions can be obtained. For some systems RC may be about 6 Å in order to reproduce energy predictions obtained using various methods based, for example, on quantum mechanical description such as methods based on, for example, density function theory (DFT), time-dependent density functional theory (TDDFT), Hartree-Fock theory, coupled cluster technique, DFT with projector-augmented wave (PAW) method, and/or Møller-Plesset perturbation theory (e.g., MP2). In various embodiments, the error for predicted energies may have a root mean squared error (RMSE) of only a few meV per atom. The specific geometric arrangement of the neighboring atoms is described by a set of atom-centered symmetry functions Gi, which provide characteristic vectors of numbers for different local environments. The atom-centered symmetry functions Gi may be similar to or the same as the atomic environment functions GA and GB. Herein, such functions are also referred to as atomic neighborhood descriptors.
As the decomposed atomic energies are not defined exactly, the training targets are the total energies, and this approach works with good accuracy for short-ranged interactions, while for long-ranged interactions as electrostatics and dispersion use special fitting and often are not considered. For many applications like liquid phase, biomolecular systems, molecular crystals, drugs, etc., the non-covalent interactions are of significance and can be taken into account. In some implementations, intermolecular interaction energy dEint of two molecules can be calculated with a ‘supermolecular’ method as a difference between the total energies of a dimer, EAB, and molecules/monomers, EA and EB. Thus, the intermolecular interaction energy is given as dEint=EAB−EA−EB. The dEint calculated using ab initio quantum mechanical (QM) methods can be used for neural network training. In various cases, the proposed approach of the neural network potentials (NNP) may be used to describe non-covalent interactions.
In various embodiments, CLM 126 may take a variable input (e.g., the variable input may include symmetry function vectors Gi describing atomic environments for atoms Ai, and return a total energy of the molecular system. The total energy may be the atomization energy (i.e., the energy of forming a molecule from individual atoms). In various cases, CLM 126 may be used for correcting or augmenting the intermolecular energy of a molecular system. In an example embodiment, the molecular system may include a pair of interacting molecules.
In various embodiments, CLM 126 may be constructed and/or defined to be extensible. Thus, CLM 126 may be used for multiple subsets of atoms within a molecular system, and atomic interactions between various subsets of atoms can be combined to obtain an overall energy of the molecular system. In one implementation, the molecular system may be partitioned into pairs of atoms (each such pair may be a subset of atoms of the molecular system). In such a case, the energy of the molecular system may be rewritten as a sum of environment-dependent pair interactions.
The expansion of atomic interactions into interactions of atom pairs offers a physically intuitive description of interaction energies and is related to the concept of the bond order. To explore the effect of such a pair approach on the accuracy of high-dimensional NN potentials, a CLM 426 implementation of CLM 126 may be used, as shown in
Even though individual neural networks such as neural networks N1 . . . N4 of CLM 326 or neural networks N12 . . . N34 of CLM 426 are configured to predict individual energies of atoms (as in the case of neural networks N1 . . . N4) or pairs of atoms (as in the case of neural networks N12 . . . N34), various implementations of CLM (e.g., CLM 326 or CLM 426) can be trained to obtain accurate corrections to the total energy results obtained using the MTM 124. In various embodiments, total energy may be calculated for a molecular system containing a few molecules (e.g., a dimer). The accuracy of the total energy (e.g., the total energy results obtained using MIM 124 with corrections from CLM 126) can be compared directly with a result obtained using quantum mechanical calculations. The training target energy correction can be implemented as an adjustment to the total energy computed by MTM 124. The correction may be computed as a difference between total quantum mechanical energy results and energy results using MIM 124 values.
In various embodiments, a method of generating a total energy for a molecular system includes the steps of (1) generating the atomic neighborhood descriptors (as described above) of the atomic neighborhoods or neighborhoods of atomic pairs, and (2) using these descriptors with an associated CLM (e.g., the CLM 326 for computing energies for individual atoms or the CLM 426 for computing energies for pairs of atoms) to obtain atomic (or atomic pair) energy values (e.g., the energy values E1 . . . E4 as shown in
In some implementations, when atomic pairs are used for determining total energy of a molecular system (e.g., an atomic pair may be a carbon-oxygen pair C—O), neural networks specific to such atomic pairs may be used. For example, a neural network N12 may be specific to a particular type of pair C—O, (e.g., to an aromatic C—carbonyl O pair). Also, note that using pair specific neural networks can allow for smaller networks (e.g., networks with fewer layers) and can result in improved accuracy of the CLM 426. Further, using pair specific neural networks can result in generating improvements to the CLM 426 incrementally (e.g., CLM 426 may not need all the training data for all possible C—O pairs in chemistry to be trained for aromatic C—carbonyl O pair). Further, pairs such as aromatic C—carbonyl O pair are implemented within the MIM 124, and energy obtained using the MTM 124 for such a pair can be readily corrected using the CLM 426, when CLM 426 is configured to compute energy for the same type of pair. A further step of generating a total energy for a molecular system can include combining (e.g., summing) the individual energies from atoms or pair of atoms.
Descriptors are used for representation of a local atomic environment. Some neural networks realizations do not consider electronic interactions explicitly, but instead approximate the potential energy surface as a function of the atomic positions. Therefore, in building neural networks it can be important to obtain a proper representation of local atomic environments, or a model, which encodes relations between atoms and quantifies similarities and differences between atomic structures, thereby enabling mapping of atomic coordinates to a certain property, e.g., the total energy of the system. The quality of this mapping can be used to develop neural networks that are able to distinguish atomic structures and, therefore, to properly describe interatomic interactions.
Similar to force field energies, neural networks can reflect the system symmetry with respect to global rotation/translations and permutations of atoms of the same element. Therefore, representations of the local atomic environment can be realized in terms of descriptors/fingerprints, or feature vectors with a fixed length, that preserve these symmetries. In various embodiments, descriptors are atom-centered, i.e., they are built for the local environment of a certain atom, (e.g., includes a set of few (10-50) atoms within a given cut radius Rc (as shown in
An example embodiment of a descriptor is simply the number of neighbors (the number of neighbors may be denoted as N) within some radius R of an atom A.
There are a few other properties that may be important and/or used for a suitable ‘input’ for neural networks for chemical purposes. The input may be invariant under the same changes that the energy is invariant under. For example, changing the order of counting the atoms in the neighborhood may not change the value of the input. Rotational invariance for inputs centered on atom A (rotating around atom A) may also be present. Conversely the input may not change when energy would change—e.g., shifting the positions of neighboring atoms may change the input. When the descriptors (input) are applied to a pair of atomic neighborhoods (A-B) they may be invariant with respect to the order of A and B. Thus, the total input to a computational model (e.g., to a CLM 126) is symmetrized.
In some implementations, the symmetrization (herein also referred to as an encoding) may be done using atom-based symmetry functions (ABSF) (e.g., based on code stored in memory 120 and executed by processor 110 of
The decomposition of the energy of a molecular system into energy contributions from pairs of atoms or ‘neighborhood-pairs’ of atoms is done by including, in the neighborhood of atom A of molecule A, atoms ai of molecule A but also atoms bj of molecule B. The same thing is then done for atom B of molecule B. This includes an additional symmetrization with respect to permutations between A and B. This approach is different from a typical technique of decomposition of the energy of a molecular system into energy contributions from single atoms, as that approach allows for adding energies for atoms A and B without including symmetrization with respect to permutations between A and B.
In various embodiments, a CLM (e.g., CLM 126) may be used to describe pair interactions between atoms A-B and estimate the intermolecular energy. The total energy of the molecular system may then be obtained by adding energy results obtained using CLM 126 and energy results obtained using MTM 124 (MTM 124 may include determining an energy of the molecular system associated with bonds such as energies associated with bond distortions, and/or the like).
In various embodiments, combining CLM 126 with MTM 124 may alleviate problems associated when CLM 126 is used without MIM 124, such as, for example, (1) predicting the energy of a molecular system having a collection of atoms and molecules in an external electric field, when CLM 126 was trained on input data (e.g., input data including location of atoms within the molecular system) that did not include effects of external electric field and (2) predicting the energy due to multi-body atomic interactions in the far range including effects of polarization, or dispersion. Far range interactions may require expanding the ‘neighborhood’ far enough so that such effects can be accounted. In the case of electrostatic interactions, the expansion may use very large neighborhoods, which may not be computationally achievable (or computationally efficient). Even to describe proper 3 and 4 body interactions (between molecules) the neighborhoods may become exceedingly large. Thus, MTM 124 provides a way to describe physical interaction between atoms in a molecular system using forces between atoms derived based on classical field description, while CLM 126 may be used for multi-body atomic interactions when classical description is difficult to determine explicitly.
In principle it is possible to correct a model describing the atomic interactions (e.g., the model may be MIM 124) using a CLM such as CLM 126. In some cases, CLM 126 may be used alone without MIM 124. However, using MIM 124 allows for incorporation of physical description when determining atomic interactions. Specifically, in some implementations, there are two physical features of a molecular system that a combined MIM 124 and CLM 126 model may describe. First is a proper modeling of molecular induction/polarization (e.g., the modeling of dipole moments in molecules in response to an external electric field). To model such a feature, a physics-based theory may be used (which may be formulated as a part of MM 124) and allows one to describe many-body (molecular) non-additive interactions based on relatively few benchmark calculations. If MIM 124 is not used for describing this physical feature and CLM 126 is used instead to describe this feature, the CLM 126 can be presented with (e.g., trained using) a very large number of trimers, tetramers, and the like to properly model the interaction, and the training input can be enlarged to accommodate such training examples. The second desirable physical feature is a relatively accurate description of electrostatic density and atomic interactions. In various embodiments, the atomic interactions may include (1) long-range atomic interactions, with at least some long-range atomic interactions not accounted for, or modelled, or truncated, to reduce computational complexity (e.g., the computational speed may be reduced from n2 calculations to n·log(n) calculations). For example, when describing long range interactions, a Particle mesh Ewald (PME) method may be used. Note that in some cases, truncation of the long-range interactions using a simple (e.g., smooth truncation function) to zero at some distance may result in prediction errors, thus, a more complex truncation approach may be used. Additionally, or alternatively, a PME, or a fast-multipole method incorporating electrostatic interactions may be used instead of the truncation approach.
In some implementations, the process of determining a total energy of a molecular system may include the following steps and considerations:
The atomic neighborhood (in the many senses of the word, but roughly proportional to ‘bond-distance’) is responsible for the splitting of the interaction determining designation (first splitting is chemical element i.e., O, H, N etc.) into subtypes (i.e., aromatic carbon, aliphatic carbon, etc.). In some embodiments, the sub-type splitting and the correcting neural network are typed in the same manner as the underlying Newtonian model (except, for example, pair-specific and not decomposable). In some implementations, a pair-specific term is roughly representing charge transfer. In some implementations, instead of having a C—O pair interaction and correction a (Aromatic C)-(carbonyl O) pair interaction and correction is defined. It is conceivable that the ‘typification’ can be done in substantially real-time: i.e., there can be inputs that represent the neighborhoods of atoms a and b and therefore adjust the network behavior to shift between types as an integral object. This may increase the complexity of the network and reduce the sensitivity to intermolecular interactions.
Additionally, as typed pair interactions have been made in the underlying model, the same typed corrections can be applied. The typification decision can be human, or algorithmic (e.g., automatically done by processor 110); but in some embodiments it is done a-priori. Note that this implies that molecules cannot disintegrate/break bonds—as it will change the atomic neighborhoods. As above, the decision to stay in the inter-molecular energy range and interactions allows a direct fit to that energy range and therefore permits (much) greater accuracy.
As shown in
In various embodiments, combination of MIM 124 and CLM 126 allows one to preserve at least some analytical physical description of intermolecular interactions. For instance, the proposed combination of an analytical polarizable force field model (MIM 124) with an intermolecular 2-body neural network short-range correction (CLM 126) achieves accurate results while also describing effects of polarization using analytical expressions used in MIM 124. These analytical expressions, for example, for the polarization, allows one to parameterize the model on QM-computable sets of molecules (e.g., dimers), and ensures that the model is correct in describing large numbers of molecules. The analytical terms of MIM 124 allow the intermolecular hybrid model to naturally and properly respond to external perturbations such as an applied electric field. These terms also allow for the full intermolecular hybrid model to be properly truncated at long-range, which is used for efficient and accurate simulation. In the regime of close molecular approach, however, where many simplifying assumptions are used to represent Quantum Mechanics with Newtonian modes-predictions obtained using MIM 124 may become inaccurate, thereby correction using CLM 126 may be desirable. CLM 126 is designed to be computationally efficient (e.g., by training smaller NNs, which characterize the interaction of molecular pairs into pairs of chemical types), readily trainable, and well-behaved even when trained with limited data.
As analytical force-field errors arise mostly at close range, the corrections described by a neural network (NN) model (e.g., by CLM 126) may be tapered beyond a chosen distance between atoms A and B (e.g., such distance may be for example a few angstroms (Å) such as 4, 5, or 6 Å, or the like), beyond which the interactions are computed by MTM 124. Rather than relying on the built-in decay of radial symmetry functions, a Tensorflow lambda layer may be appended to CLM 126 to smoothly force the outputs to zero beyond the cut-off distance.
Like most analytical Force-Fields, the NN correction for the pair of atoms is further subdivided by their atomic types. For example, instead of encoding a general interaction correction NN model for, for example, Carbon-Nitrogen, the interaction corrections can be more specific in a NN model for, for example, aromatic Carbon-Amide Nitrogen. This typification can provide advantages, such as, for example, it breaks up the full chemical interaction space into much smaller training groups so that model improvement is realized in segments and avoids a pre-compute of a large number of possible intermolecular interactions. As a consequence, the resulting total hybrid accuracy of representation of the interaction can be substantially higher. In some cases, the accuracy may be on the order of convergence accuracy of the QM method chosen for the training set. Moreover, the limited amount of encoded information permits a small and therefore fast neural network. Because the fingerprint itself does not encode the chemical environment but the relative orientation and bonded distortion of the interaction, it is also much smaller and more computationally efficient. Moreover, it can be convenient to have the NN correction have the same typification as the underlying analytical force field model.
In recent years various descriptors have been proposed based on local PES, the local atomic density, the radial/angular distribution functions, etc. In some implementations, a descriptor can:
The choice of descriptors can depend strongly on the atomic system under consideration, as well as available computational resources.
In one embodiment, the spherical Bessel descriptors based on the concept of the atomic density are applied [SB]. The local atomic environment around a given point r can be described with atomic neighbor density function:
where rij is the relative position vectors of each neighbor j with respect to the reference point r0i, wkj is the weight factor that can be used to distinguish the kth species in a multicomponent system (in expressions below it is set to be 1). Expanding this density over orthonormal basis functions of full sphere with radius rcut yields:
where gnl(r) are radial basis functions and Ylm(Θ,φ) are spherical harmonics, while nmax specifies the order of the approximation. The expansion coefficients cnlm for the ith atomic pair X-Y (therefore, with respect to the point r0i={ri,qi,ji}), are obtained from the relative spherical coordinates {rij,qij,jij} of the nearest neighbors of atom X of the first molecule and atom Y in the second molecule as shown in (
These coefficients are not invariant with respect to rotation, but the power spectrum calculated as:
where Pl(x) is the Legendre polynomial of order l, includes real-values numbers, is invariant to translation, rotations and inversion as well as to permutations of the same atoms and can be used as descriptors.
In some implementations, various functions can be chosen for gnl(r). It is shown [SB] to be efficient to use a linear combinations of two spherical Bessel functions jl(r) of the first kind as follows:
f
nl(r)=anljl(runl/rcut)+bnljl(run+1,l/rcut), (eq. 5)
where uln is the (n+1)th non-zero root of jl(r), and rcut is the cutoff radius. With a special relation for coefficients anl and bnl these functions can be made twice differentiable even at the cutoff radius. The Gram-Schmidt orthogonalization procedure, applied sequentially to fnl(r) for 0<n£nmax, results in a set of orthogonal and normalized radial functions gnl(r). The main advantage of the described approach is the continuity and twice differentiability of gnl(r) within the sphere of the cutoff radius. In addition, this infinite orthonormal basis set gnl(r) is complete for square-integrable functions with the given boundary conditions, and in general its completeness makes training with neural network become more steady, reliable, and predictable.
The approximation order in the described approach can be determined with the parameter nmax, and the number of generated descriptors Nd is (nmax+1) (nmax+2)/2. The low number of descriptors do not describe properly the atomic environment, and feature vectors might contain insufficient information about the system. The increase in Nd enhances the descriptor resolution and, therefore, the accuracy of the atomic environment representation, but molecular dynamics simulations with the large number of descriptors use substantial computational resources.
The cutoff radius rcut is an important parameter of the local atomic environments and generated descriptors. Small rcut are insufficient to properly describe the atomic environment. For atom-centered representations, increasing the value of rcut implies the inclusion of additional atoms into the local environment and, therefore, uses more computational resources. As a result, in some implementations, it can be considered efficient to set the cutoff radius as small as possible.
Consistent with disclosed embodiments,
Further, the method 1401 includes step 1413 of providing a plurality of inputs related to the first set of atoms and the second set of atoms to a neural network model (NNM) (e.g., NNM may be CLM 126 as described above, herein and elsewhere it may be also referred to as dENN) formed from a plurality of neural networks (e.g., neural networks N1 . . . N4, or N12 . . . N34, as shown in
Additionally, following the step 1413, the method 1401 includes step 1415 of training the NNM to reduce error between (1) the combination of the first plurality of atomic interactions and the second plurality of atomic interactions and (2) atomic interactions obtained using quantum mechanical calculations.
Further, the method 1401 includes step 1417 of modifying the MIM to reduce error between (1) the combination of the first plurality of atomic interactions and the second plurality of atomic interactions and (2) atomic interactions obtained using quantum mechanical calculations.
Further, the method 1401 includes step 1419 of receiving a third set of atoms, step 1421 of inputting the representation of the third set of atoms to output a prediction associated with an energy or a state of the third set of atoms, and step 1423 of sending a signal to a user to present the prediction. In various embodiments, the predictions are obtained using a combined MIM and NNM (e.g., the combined MIM 124 and CLM 126).
In various embodiments, the method 1401 includes predicting at least one of the partition function, free energy, enthalpy, entropy, density, hydration free energy, conductivity, heat of vaporization, diffusion properties, binding free energy, proportion of reactants, or non-equilibrium, or combination thereof of properties of the third set of atoms. Additionally, or alternatively, the method 1401 can be used to predict any other suitable physical quantity that can be derived from the computations of potential energy between the third set of atoms.
As described herein, the method 1401 is configured to combine MIM and NNM models in any suitable way to achieve accurate predictions. In one implementation, of the method 1401, the combination of MIM and NNM is realized by adding the output of the first plurality of atomic interactions (i.e., the interactions obtained using MIM) and the output of the second plurality of atomic interactions (i.e., the interactions obtained using NNM).
Further, the method 1401 may include smoothly, over a predefined interval, switching the output of the NNM to zero at a predefined distance between the first set of atoms and the second set of atoms. In some cases, the smooth switching to zero of the output of neural networks is realized by a neural network analytical layer such as a lambda layer in TensorFlow.
In an implementation of the method 1401, the combination of MIM and NNM is realized by using a substitution of at least one parameter of the MIM by an output of one of neural network models used to form NNM (e.g., neural network model N1, as shown in
Moreover, in an implementation of method 1401, the combination of MIM and NNM is realized by applying the first plurality of atomic interactions and the second plurality of atomic interactions to a first subset of atoms in the third set of atoms and a faster computational method (e.g., MIM) to a second subset of atoms in the third set of atoms. In an example embodiment, the second subset of atoms may not overlap with the first subset of atoms. Further, in some cases the first subset of atoms and the second subset of atoms may be complimentary. In some cases, when using method 1401, the first set of atoms includes only a first single atom and the second set of atoms includes only a second single atom.
In some implementations, the first set of atoms includes a first atom and at least one neighbor atom of the first atom and the second set of atoms includes a second atom and at least one neighbor atom of the second atom.
In some cases, the first set of atoms and the second set of atoms are within at least a portion of a common molecule.
Further, in some embodiments, the first set of atoms is the same as the second set of atoms.
In some cases, the first set of atoms include atoms within a first molecule and the second set of atoms include atoms within a second molecule different from the first molecule.
In various embodiments, as described above, a neighborhood is defined by a predetermined distance from a central atom or atoms or from a geometrically assigned interaction center, or a predetermined number of bonds away from a central atom or atoms. In some implementations, for example, the geometrically assigned interaction center can be a reference point and is located at the center of a line connecting the first atom and the second atom. A geometrically assigned interaction center for a group of atoms may be also a centroid for such a group, when there are more than two atoms in that group. Further a center atom may be an atom that is closest to such a centroid. If more than one atom is located the same distance away from the centroid, than any one of such atoms may be selected as a central atom. Additionally, a neighborhood may be defined for any selected atom of a molecular system. For example, for a water molecule interacting with NaCl, an O atom may be selected to define a neighborhood about that atom. Additionally, or alternatively neighborhoods about Na and CL may be defined. Further, in some cases, a neighborhood about a pair of atoms may be defined (e.g., a neighborhood about a pair of O-CL or O—Na may be defined). In some cases, a neighborhood about a set of atoms may be defined. For example, a neighborhood about a center of a unit cell containing a few atoms within a crystal structure may be defined.
In various embodiments, the representation of the at least one of the first set of atoms or the second set of atoms includes at least one of an orientation of an atom from the first set of atoms or the second set of atoms, a neighborhood of an atom from the first set of atoms or the second set of atoms, a type of an atom first set of atoms or the second set of atoms, or a relative location of an atom from first set of atoms or the second set of atoms.
The representation of at least one of the first set of atoms or the second set of atoms may have a predetermined length not dependent on a number of atoms in the at least one of the first set of atoms or the second set of atoms.
The representation of the at least one of the first set of atoms or the second set of atoms may have a predetermined length not dependent on a rotation, a translation, or a reordering of member atoms in the at least one of the first set of atoms or the second set of atoms, or reordering of the sets themselves.
In various embodiments, the NNM may include the plurality of neural network models each one of the neural network models configured to describe either a distinct atom type or a distinct atomic pair type. An example atomic pair type is a pair of atoms with each atom being of a particular type. For example, an atomic pair type may be a pair formed by an aromatic carbon and an amine nitrogen instead of a carbon nitrogen pair, or a pair formed by an aromatic carbon and an ester oxygen instead of a carbon oxygen pair, and/or the like.
Further, in various embodiments, the plurality of atom types are inferred from the membership of an atom or pairs of atoms in the periodic table, and the chemically designated atoms in the neighborhoods of an atom or a pair of atoms. Furthermore, in some cases, an atomic type of an atom may be inferred based on its location in the periodic table (e.g., its associated chemical element), topological position of its neighboring atoms, neighboring atoms' position in the periodic table (e.g., the neighboring atoms associated chemical elements), and in some cases, neighboring atoms' associated atomic types. Similarly, atomic type for a pair (or a group) of atoms may be inferred based on the atoms' associated chemical elements in the pair (or the group) of atoms, topological position of neighboring atoms of the pair (or the group) of atoms, as well as associated chemical elements of neighboring atoms of the pair (or the group) of atoms, and in some cases, associated atomic types of neighboring atoms of the pair (or the group) of atoms.
In an example embodiment, the plurality of atom types includes an aromatic carbon, an aliphatic carbon, a carbonyl oxygen, and an ether oxygen, or, for example, a hydroxyl oxygen.
In various cases, the interactions between the first set of atoms and the second set of atoms may be computed using quantum mechanical methods and include one or more collections of interactions of atoms within a single molecule, atoms of pairs of molecules, atoms of triples of molecules, atoms of a molecule and an ion, atoms of a molecule and a charge, and/or atoms of a molecule within an applied electric field. The interactions between the first set of atoms and the second set of atoms computed using the quantum mechanical methods include at least one sub-component of the description of the collection of atoms computed by quantum mechanics. The sub-component may be an energy sub-component such as, for example, Dispersion, Exchange-Repulsion, Electrostatic, or Induction Interactions. The sub-component may also be the effective energy change produced by including Nuclear Quantum effects, and polarization induced by external fields and molecules.
In various embodiments, an MIM (e.g., MIM 124) may be used to describe electrostatic response interactions, which may include at least one of induction, polarization or charge transfer. MIM may take as an input at least one of a monopole, dipole, quadrupole, or octupole having a radial accuracy within about one percent, about five percent, about ten percent, about fifteen percent, about a few percent or about a few tens of percent points.
The MIM 124 may be modified (e.g., various parameters of the MIM 124 may be suitably adjusted) to reproduce experimentally observable or measurable macroscopic properties of collections of molecules. For example, the parameters of the MIM 124 may be adjusted to reduce the error between the outputs of MIM 124 and atomic and molecular set properties computed by quantum mechanical methods. The outputs may include energies of interaction between molecules, partial charges, induced charge distribution changes, or correlated charge fluctuation responses. In some cases, the adjustment of the parameters includes fitting at least one of the induction response, the non-additive energies, or various components of a total energy to obtain an accurate prediction for the molecular and/or atomic polarization. Herein, the term “induction response” can refer to the polarization of atoms in a molecule in response to an external electric field. Induction response can arise due to the movement of electrons in a molecule when an external electric field is applied. This movement causes a redistribution of electron density in the molecule, which can lead to the formation of induced dipoles. The magnitude and direction of the induced dipole depend on the strength and orientation of the external electric field and the polarizability of the atoms in the molecule. The induction response may be accounted for in molecular interaction model calculations by introducing an additional term that accounts for the interaction between the induced dipoles in neighboring atoms. This term is known as the “polarization energy” and is added to the non-bonded interaction energy between the atoms. The polarization energy is proportional to the square of the induced dipole moment and the strength of the external electric field. Thus, the induction response is the polarization of atoms in a molecule in response to an external electric field. This effect is important in molecular interaction model calculations as it can affect the stability, conformation, and properties of molecules. To account for induction response, the molecular interaction calculations can include a polarization energy term that accounts for the interaction between induced dipoles in neighboring atoms. The energy prediction may be obtained by computing an energy of a molecule probed by a charge (or an applied electric field) at various positions around the molecule.
The MIM 124 may be truncated at long ranges. For example, truncating the molecular interaction model at long ranges includes using a method such as a Particle Mesh Ewald or Fast
Consistent with disclosed embodiments,
Further, the method 1501 includes step 1513 of providing a plurality of inputs to a plurality of neural network models (NNM) (e.g., NNM may be CLM 126 as described above) formed from a plurality of neural networks (e.g., neural networks N1 . . . N4, or N12 . . . N34, as shown in
Additionally, following the step 1513, the method 1501 includes step 1515 of training the NNM by reducing an error between (1) an interaction produced by a combination of the first plurality of atomic interactions and the second plurality of atomic interactions and (2) an interaction between the first set of atoms and the second set of atoms computed using quantum mechanical methods.
Further, the method 1501 includes step 1519 of receiving a representation of a third set of atoms, step 1521 of inputting the representation of the third set of atoms into MIM and NNM to output a prediction associated with an energy or a state of the third set of atoms, and step 1523 of sending a signal to a user to present the prediction. In various embodiments, the predictions are obtained using a combined MIM and NNM (e.g., the combined MIM 124 and CLM 126).
Example Molecular descriptors
In various embodiments, when predicting intermolecular interactions, binding and solvation free energies can be accurately determined. Intermolecular interactions may be sensitive to geometries of intermolecular contacts. Therefore, to fit QM dimer interaction energies in CLM 126, descriptors may be assigned to each intermolecular atom-atom contact within a certain distance cutoff (e.g., a distance cutoff may be a few Angstroms, such as 4, 5, or 6 Angstroms). These descriptors differ from commonly used atom centered descriptors describing the environment of each atom in the molecular system and, thus, are more appropriate for accurate fitting of intramolecular energies and describe properties such as torsional barriers and vibrational spectra. Descriptors describing an atom-atom contact (an intermolecular bond) can include 2/rij function (rij—the distance of the contact) and sum of terms dependent on relative coordinates of atoms of the contact and environment atoms covalently bound to atoms of the contact versus geometrical center
where 0≤n≤nmax, 0≤l≤lmax,
Pl is the Legendre polynomial of order l and γjk is the triplet angle between atoms i, j and k, radial basis functions gnl(r) are linear combinations of spherical Bessel functions of the type
with expansion coefficients chosen to satisfy the conditions of orthogonality and zero first and second derivatives of gnl(r) vs r at the cutoff radius rc. By construction the descriptors can be invariant to permutations of atoms of the atomic contact and environment atoms. In some implementations, spherical Bessel descriptors with nmax=lmax=6 may be used.
The CLM 126 corrections to the MIM 124 may be trained on multiple conformations of molecular dimers. To cover the space of intermolecular orientations, several procedures may be combined to generate dimer geometries. For example, grid, liquid MD dimers (e.g., dimers in a liquid molecular system) and vacuum MD dimers (e.g., dimers in a gaseous molecular system) can be combined. For a given molecular dimer, a regular grid of molecular conformations can be generated, for example, by translating one monomer vs another along the line connecting their centers of mass with a step of 0.2 Angstrom, then rotating each monomer in spherical angles θ and (p with a step of 10 degrees.
In some cases, internal geometries of monomers may be kept fixed (e.g., for minimal energy geometries). In some cases, liquid MD dimers for molecules A and B are sampled from MD simulations of A solvated in a box of B monomers and vice versa. Extracted dimer geometries have the closest monomer distance in the range of 1.5 to 5 A. MD dimers are also filtered to be close to the grid dimers within a certain threshold of root-mean-square deviation (RMSD) value. The latter is chosen in such a way that the coverage of grid dimers by MD dimers is no less than 80%. To ensure a uniform sampling of grid dimer space by MD dimers, a cap is set that limits a number of MD dimers that are associated with each grid dimer.
In various cases, the purpose of vacuum MD dimers is to augment the liquid MD dimers by supporting sampling of the conformational space that is either not sampled by grid dimers and/or by liquid MD dimers before RMSD filtering. Vacuum MD dimers can be generated by running MD simulations of molecule A in a box of few molecules B, but the non-bonded interactions between A and B are, for example, either completely removed or reduced down to 1-10%. The latter is done, for example, by running TI annihilation simulation for lambdas=0.9-0.99. Then dimers are extracted from these simulations. To use relevant dimers, the vacuum dimers are filtered by computing dimer energies using MTM 124 and leaving the dimers for which dimer energy is less than 40 kcal/mol. In some implementations, another filtering stage can use those vacuum dimers that do not sample the conformational space that is already sampled by liquid MD dimers. The latter is ensured by selecting those vacuum dimers that are conformationally far from the liquid dimers, that is the RMSD between vacuum MD dimers and liquid dimers is above a certain threshold.
In various embodiments, the neural network models used for determining energy correction for the pair of atoms can be formulated for atomic types of the atoms in the pair of atoms. For example, a neural network model describing aromatic carbon-amide nitrogen may be used. Other neural network models may be developed to describe interactions of other atomic types, such as interactions of aliphatic carbon, carbonyl carbon, primary, secondary, ternary, or quaternary carbon, amine nitrogen, nitro nitrogen, pyridine nitrogen, quaternary nitrogen, carbonyl oxygen, hydroxyl oxygen, ether oxygen, acetal oxygen, ester oxygen, carboxylate oxygen, and/or the like with other atoms. This typification has several significant advantages. For example, having a neural network specific for atomic types, breaks-up the chemical interaction space into much smaller training subgroups so that model improvement is realized in segments and avoids pre-computing of all possible intermolecular interactions. For another example, having a neural network specific for atomic types also permits the resulting total hybrid accuracy of representation of the interaction to be high (e.g., on the order of convergence accuracy of the QM method chosen for the training set). Furthermore, as the amount of encoded information is limited, and because the fingerprint itself does not need to encode the chemical environment (‘type’) but can encode the relative orientation and bonded distortion of the interaction, having a neural network specific for atomic types allows for a smaller and therefore faster set of neural networks.
In various cases, polarizability in the force fields and nonadditive effects are important when determining two-body molecular interactions. Ability to respond to an electrical field is called a polarizational response. A polarizational response can make the force field more transferable, because molecules respond to an electrical field in similar manners, disregarding other small factors such as coupling of such response with other smaller effects such as dispersion interactions. Therefore, INM can include a two-body potential. For example, the underlining analytical force field may accurately determine effects of polarization on molecular interactions. In some cases, effects of polarization may include an anisotropic model of polarization in contrast to a simpler, less precise isotropic model of polarization.
In various cases, electrostatic forces may affect long-range molecular interactions and polarizability of molecules. Electrostatic forces can be the strongest of intermolecular forces at close, medium (2.5-5 Angstrom) and far distances (beyond 5 Angstrom). The presence of electrostatic forces in biomolecules is very prominent through water, polar, ion interactions, etc. Correct description of local electrostatic forces, thus, is important to reproduce non-additive energies. These non-additive energies may be caused by local charge distribution, which, in some cases, may lead to polarization self-consistent field “equilibration” of polarizable electronic clouds in molecules.
In various embodiments, electrostatic forces may be well described by long range algorithms like, for example, Particle-mesh Ewald (PME) algorithm. Truncation CLM and using analytical approach (e.g., MIM) for electrostatic treatment beyond e.g., a few Angstroms, such as, for example, 5 Angstrom may allow for fast molecular simulation (using algorithms such as PME, which scales similar to N log(N), where N is number of particles). Further, using MIM allows reducing the amount of dimer data that will be used for training CLMs (e.g., only molecules with distances less than a few Angstroms may be used for training CLMs).
In some implementations, the IHM may include the following features: The two-body non-bonded interactions may include electrostatic, exchange-repulsion and dispersion terms. The electrostatic and exchange-repulsion terms may be multipolar, and their radial dependence may be a Slater-like exponential so that the electrostatic and exchange-repulsion terms are better able to describe charge penetration and delocalization. Multibody effects can be modeled by anisotropic atomic polarizable dipoles interacting with the electrostatic term and with each other and iterated to self-consistent field (SCF) convergence on non-bonded step. In some implementations, there may be no explicit three-body terms, and may not include charge flux. The intermolecular parameters of MIM are determined by agreements with QM values of dimer and multimer energies, electrostatic potentials, and multipole moments of monomers. To aid transferability, the individual components to their corresponding QM counterparts may be matched, in addition to reproducing the total energy. The bonded interactions may be fitted to monomer QM energies computed at the df-MP2/aug-cc-pVTZ level of theory.
In various cases, the screening of the dispersion interaction in the intermediate and far range can be a many-body phenomenon and can be difficult to obtain from ab-initio calculations. Therefore, the close-range dispersion interaction may be obtained from a decomposition of dimer QM calculations, and then smoothly scaled down by a factor that is a property of the two interacting elements. The scaling factor may be specific to the periodic table membership and bond order, and the value is chosen to best reproduce densities of liquids. In some cases, for example, the values cutoff=4.5 Å and asymptotic reduction of 0.5 may be appropriate.
A) MIM Charge Density function
In some implementations, for example, in the MIM calculations, the charge density of the electron cloud of atom ‘a’ located at position Ra can be written as:
where qa is the cloud charge and wA is the MIM atom-type parameter that characterizes the cloud size. The capital letters ‘A’, ‘B’, and so on, may be used to denote the MIM atom-types respectively for atoms ‘a’, ‘b’, . . . ; similar notations are used for bond, bend and torsion atoms types (e.g. ‘AB’ denotes the MIM bond type for the pair of bonded atoms ‘a’ and ‘b’).
In some implementations, for example, the cloud charge may be written in the form:
where ZA is the charge of the “nucleus” for the atom type ‘A’, {a} denotes the set of atoms chemically bonded to atom ‘a’ and qBA is the MTM bond charge transfer parameters (thus the equality qAB=−qBA providing for charge conservation).
In some implementations, for example, differential operator {circumflex over (D)}a can be:
where dimensionless vector ta and 3×3 dimensionless tensor ωa introduce p and d type density anisotropy, respectively. They can be related to the local atomic dipole Pa and atomic quadrupole Qa as:
P
a
=q
a
t
a,
Q
a
=q
aωa (A4)
Vector ta is a sum of permanent and induced vectors:
t
a
=t
a
per
+t
a
ind (A5)
In some implementations, for example, the induced part is written as
t
a
ind
=t
A
maxτa (A6)
where tmaxA is the MIM atom-type parameter and τa is the dimensionless vector representing the MIM dynamic variable, its length being confined within interval (0, 1) by the restraint potential UIN. The induced variable is iterated to convergence (1E-6 kcal/mol/A) of the electrostatic and exchange forces in an SCF (self-consistent-field) manner as is common in polarizable Force Fields. The extended Largrangian techniques may not be employed, instead the dipoles may be iterated explicitly to convergence.
In some implementations, for example, the permanent part in Eq. (A5) is written as:
where nab is the unit vector, nab=Rab/|Rab| and Rab is the vector directed from atom ‘a’ to neighboring atom ‘b’ and tAB is the MIM bond-type parameter associated with permanent ‘chemical’ polarization or “charge shift” along the bond ‘ab’(in general tAB≠tBA). In some implementations, for example, the components of the local quadrupole tensor, can be calculated as:
where α,β=x, y, z and QAB are the MIM bond-type parameters. The atom “nucleus” (proportional to Dirac's delta-function) can be formally considered as the limiting case of Eq. (A1) replacing qa by ZA, dropping operator {circumflex over (D)}a and formally approaching zero by the width parameter.
B) Non-Valence Interactions
In some implementations, for example, based on representation of Equation (A1) the potential φab of electrostatic interaction of two-unit charge distributions ‘a’ and ‘b’ separated by a distance Rab=|Rb−Ra| can be represented in the form:
Expanding equation (B1) one gets:
where F's are the algebraic factors can be defined as:
and potentials φ{k}ab satisfy the recurrence relations:
The ES potential UES between the two clouds can be written as:
U
ab
ES(Rab)=332.064·qaqbφab(Rab)
The potentials are in kcal/mol, the components of vectors Rab and t are in Å, charges are in a.u. This formula is also valid for “nucleus”-cloud and “nucleus”-“nucleus” interactions by replacing the cloud charges with the nucleus charges, nullifying vectors t and tensor o for the core, with the corresponding width parameter(s) formally approaching zero.
In some implementations, for example, the multipolar EX potential UEX can be written similarly to that of ES:
U
ab
EX=332.064·CAEXCBEXχab(Rab) (B6)
where CEXA, CEXB are the force parameters for atom types ‘A’ and ‘B’, and
The algebraic factors F were defined in Eq. (B4), potentials χ{k}ab(Rab) satisfy the recurrence relations in Eq. (B5)
where uA and uB are the EX width parameters that generally differ from ES parameters wA and wB. Φab(0) represents a functional form of the exchange potential.
U
ab
DS=−332.064·(CADS6CBDS6ψ6(Rab)+CADS8CBDSSψ8(Rab)) (B9)
where CDS6, CDS8 are the force constants and ψ's are the Tang-Toennies's functions:
vA and vB being the dispersion width parameters that generally differ from those for electrostatics and exchange.
In some implementations, for example, the anharmonic restraint potential for the atom ‘a’ can be represented as:
where αA is the MIM parameter characterizing polarizability prescribed to atom-type A while bond-type parameters sAB describe the anisotropy of the restraint potential (generally sAB≠SBA), the vector τa was introduced in Eq. (A6). Note that the force due to potential in Eq. (B11) approaches infinity as τa approaches 1. This peculiarity of the MIM restraint potential avoids the polarization catastrophe and provides the existence of a solution of the problem of improvement of electron density in any physically reasonable external field.
C) Induced Dipoles Implementation
It can be seen from equations (B1-B11) that the system depends on polarizable part of clouds taind=tAmaxτa that responds to electrostatic fields of other cloud charges, “nuclei”, cloud dipoles, cloud quadrupoles as well as other polarizable clouds in the system. These equations of self-consistent polarizable clouds can be represented as typical quadratic form of induced polarizable shifts or dipoles E=Eτ-independen+6Στaτb+ΣCaτa+ΣCbτb. This quadratic form is optimized and/or improved with steepest descent with respect to polarizable clouds vector τa at the steps of molecular dynamics until it reaches a small force threshold, which can be, for example, about 1E-6 kcal/mol/A.
D) Valence Interactions
In some implementations, for example, similar to the MMFF-94 force field, the Hamiltonian of valence interaction can be given in the form:
H=H
BS
+H
AB
+H
StBn
+H
OOP
+H
TORS (C1)
where the terms correspond respectively to bond stretching (BS), angle bending (AB), stretch-bend (StBn), out-of-plane bending (OOP), and torsion interactions (TORS).
In some implementations, for example, the bond stretching component is of the form:
In Eq. (C2), the sum is over the bond pairs, I and J are the indices of the atomic types ascribed respectively to atoms i and j, rI,J(0) and kI,J(b) are respectively the equilibrium (or reference) bond length and the bond force constant determined by the atomic type pair and the specific bond type index, and cb is a model parameter.
In some implementations, for example, the angle bending term is given by:
where the summation is performed over the bend triplets i-j, j-k, the indices I, J and K corresponding to the atomic types ascribed respectively to atoms i, j and k. Other notations are:
Δφijk=φijk−φIJK(0) (C5)
where φijk is the angle between i-j and k-j bonds, φIJK{0} and kIJK{a} are the equilibrium (or reference) angle and the force constant determined by the atomic type triplet and the specific angle type index, and ca is a model parameter.
In some implementations, for example, the stretch-bend interaction is given by:
H
St-Bn=ΣΔφijk(kIJK(ba)Δrij+kKJI(ba)Δrkj) (C6)
where the summation is similar to Eq. (C4). The bond length and angle deviations, Δrij, Δrkj and Δφijk are defined by Eqs. (C3) and (C5), k(ba)IJK and k(ba)KJI are the force constants for the interaction of ij and kj bond-angle, respectively. These constants are determined by the atomic type triplet and the specific stretch-bend type index.
In some implementations, for example, the out-of-plane interaction is given by:
where the sum is over bond tetrahedrals i-j-k-1 with the 3-bond central (or vertex) atom j and the bonds i-j, k-j, l-j, the indices I, J, K and L correspond to the atomic types ascribed respectively to atoms i, j, k, and l. The angle deviation Δχikl is defined as the angle between the vector (ri-rj) and the plane defined by vectors (rk-rl) and (rl-rj); that is
In some implementations, for example, the force constant kIJKL is determined by the atomic type quartet IJKL. In some implementations, for example, the torsion component is of the form
where the sum is over bond quartets i-j, j-k, k-l, the indices I, J, K, and L correspond to the atomic types ascribed respectively to atoms i, j, k, and l. ø is a standard torsion angle defined as the angle between the planes i-j-k and j-k-l, given by:
In some implementations, for example, the force constants VIJKL are determined by the atomic type quartet and two specific indices, the first relating to the type of the central bond j-k, while the second one relates to the hybridization state of atoms j and k.
It should be appreciated that the above detailed embodiments of the present disclosure are only to exemplify or explain principles of the present disclosure and not to limit the present disclosure. Therefore, any modifications, equivalent alternatives and improvement, etc. without departing from the scope of the present disclosure shall be included in the scope of protection of the present disclosure. Meanwhile, appended claims of the present disclosure aim to cover all the variations and modifications falling under the scope and boundary of the claims or equivalents of the scope and boundary.
The above-described embodiments can be implemented in any of numerous ways. For example, embodiments may be implemented using hardware, software or a combination thereof. When implemented in software, the software code can be stored (e.g., on non-transitory memory) and executed on any suitable processor or collection of processors, whether provided in a single computer or distributed among multiple computers.
Further, it should be appreciated that a compute device including a computer can be embodied in any of a number of forms, such as a rack-mounted computer, a desktop computer, a laptop computer, netbook computer, or a tablet computer. Additionally, a computer can be embedded in a device not generally regarded as a computer but with suitable processing capabilities, including a smartphone, smart device, or any other suitable portable or fixed electronic device.
Also, a computer can have one or more input and output devices. These devices can be used, among other things, to present a user interface. Examples of output devices that can be used to provide a user interface include printers or display screens for visual presentation of output and speakers or other sound generating devices for audible presentation of output. Examples of input devices that can be used for a user interface include keyboards, and pointing devices, such as mice, touch pads, and digitizing tablets. As another example, a computer can receive input information through speech recognition or in other audible format.
Such computers can be interconnected by one or more networks in any suitable form, including a local area network or a wide area network, such as an enterprise network, and intelligent network (IN) or the Internet. Such networks can be based on any suitable technology and can operate according to any suitable protocol and can include wireless networks, wired networks or fiber optic networks.
The various methods or processes outlined herein can be coded as software that is executable on one or more processors that employ any one of a variety of operating systems or platforms. Additionally, such software can be written using any of a number of suitable programming languages and/or programming or scripting tools, and also can be compiled as executable machine language code or intermediate code that is executed on a framework or virtual machine.
In this respect, various disclosed concepts can be embodied as a computer-readable storage medium (or multiple computer-readable storage media) (e.g., a computer memory, one or more floppy discs, compact discs, optical discs, magnetic tapes, flash memories, circuit configurations in Field Programmable Gate Arrays or other semiconductor devices, or other non-transitory medium or tangible computer storage medium) encoded with one or more programs that, when executed on one or more computers or other processors, perform methods that implement the various embodiments of the disclosure discussed above. The computer-readable medium or media can be transportable, such that the program or programs stored thereon can be loaded onto one or more different computers or other processors to implement various aspects of the present disclosure as discussed above.
Some embodiments described herein relate to a computer storage product with a non-transitory computer-readable medium (also can be referred to as a non-transitory processor-readable medium) having instructions or computer code thereon for performing various computer-implemented operations. The computer-readable medium (or processor-readable medium) is non-transitory in the sense that it does not include transitory propagating signals per se (e.g., a propagating electromagnetic wave carrying information on a transmission medium such as space or a cable). The media and computer code (also can be referred to as code) may be those designed and constructed for the specific purpose or purposes. Examples of non-transitory computer-readable media include, but are not limited to, magnetic storage media such as hard disks, floppy disks, and magnetic tape; optical storage media such as Compact Disc/Digital Video Discs (CD/DVDs), Compact Disc-Read Only Memories (CD-ROMs), and holographic devices; magneto-optical storage media such as optical disks; carrier wave signal processing modules; and hardware devices that are specially configured to store and execute program code, such as Application-Specific Integrated Circuits (ASICs), Programmable Logic Devices (PLDs), Read-Only Memory (ROM) and Random-Access Memory (RAM) devices. Other embodiments described herein relate to a computer program product, which can include, for example, the instructions and/or computer code discussed herein.
The terms “program” or “software” or “software stack” are used herein in a generic sense to refer to any type of computer code or set of computer-executable instructions that can be employed to program a computer or other processor to implement various aspects of embodiments as discussed above. Additionally, it should be appreciated that according to one aspect, one or more computer programs that when executed perform methods of the present disclosure need not reside on a single computer or processor, but can be distributed in a modular fashion amongst a number of different computers or processors to implement various aspects of the disclosure.
Computer-executable instructions can be in many forms, such as program modules, executed by one or more computers or other devices. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Typically the functionality of the program modules can be combined or distributed as desired in various embodiments.
Also, various concepts can be embodied as one or more methods, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments can be constructed in which acts are performed in an order different than illustrated, which can include performing some acts simultaneously, even though shown as sequential acts in illustrative embodiments. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.
This application claims priority to and the benefit of U.S. Provisional Patent Application No. 63/403,120, filed Sep. 1, 2022, and titled “Artificial Intelligence-Based Knowledge Processing Systems,” which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63403120 | Sep 2022 | US |