Drug-metabolizing enzyme prediction apparatus

Information

  • Patent Application
  • 20070016392
  • Publication Number
    20070016392
  • Date Filed
    September 21, 2006
    18 years ago
  • Date Published
    January 18, 2007
    17 years ago
Abstract
A drug-metabolizing enzyme prediction apparatus acquires compound structure information, identifies binding site information and molecular species information based on a binding-site identifying condition, specifies a pinching point that is an atom binding at least between a reactive site and a binding site, acquires pinching point information on the specified pinching point and reactive site information on the reactive site bound to the pinching point, and identifies the acquired reactive site information based on a reactive-site identifying condition and the acquired pinching point information based on a pinching-point identifying condition.
Description
BACKGROUND OF THE INVENTION

1. Field of the Invention


The present invention generally relates to a drug-metabolizing enzyme prediction apparatus, a drug-metabolizing enzyme prediction method, a drug-metabolizing enzyme prediction program, and a recording medium, and more particularly, to a drug-metabolizing enzyme prediction apparatus, a drug-metabolizing enzyme prediction method, a drug-metabolizing enzyme prediction program, and a recording medium, capable of accurately predicting a molecular species of a drug-metabolizing enzyme and a reactive site of a compound with a drug-metabolizing enzyme.


2. Description of the Related Art


In recent years, about 50% or more of drugs developed are dropped out caused by a problem on drug metabolism (for example, drug's power and side effect) in the stage of drug development.


Therefore, it is greatly needed to evaluate the problem on drug metabolism related to compounds and to narrow them down to any compound that clears the problem in the early stage of drug development, in terms of development of drugs with fewer side effects and of reduction in cost required for development of drugs.


Conventionally, to specify, for example, a molecular species and a reactive site of a drug-metabolizing enzyme from a compound, a drug-metabolism screening test and metabolite identification using a mass spectrum are conducted.


A technology for predicting a molecular species and a reactive site of a drug-metabolizing enzyme on a computer is described in, for example, Homepage of Drug metabolism research support system “BioFrontier/P450” developed by Fujitsu Ltd.: “http://venus.netlaboratory.com/material/messe/biofrontierp 450/”, US Patent Application Laid Open No. 2003/0054430, US Patent Application Laid Open No. 2002/0040276, and US Patent Application Laid Open No. 2001/0044699.


Homepage of Drug metabolism research support system “BioFrontier/P450” developed by Fujitsu Ltd.: “http://venus.netlaboratory.com/material/messe/biofrontierp 450/” describes a technology for inputting a compound structure and predicting an unknown metabolite on the analogy of known metabolic reactions collected from documents. More specifically, it describes a technology of implementing an experimental technique of extracting a structure of a reactive site between a substrate and a metabolite from known metabolic reactions, and predicting a metabolite based on known metabolism information if the structure of the reactive site extracted is found in a compound to be predicted. This technology allows accurate prediction of a wide range of compounds based on a population (compounds) of training data used for creating a prediction model.


Furthermore, US Patent Application Laid Open No. 2003/0054430 describes a technology for predicting the relative rate of every metabolic pathway for drugs by regarding a pathway with the fastest reaction rate as a reaction that is the most possible reaction to occur, based on stoichiometric data.


US Patent Application Laid Open No. 2002/0040276 describes a technology for predicting the susceptibility of every metabolic site of drugs by regarding a site with the highest susceptibility as a first candidate for a metabolic site, based on energy theoretical data.


US Patent Application Laid Open No. 2001/0044699 describes a technology for predicting the stability of every metabolic site of drugs by regarding a site with the highest stability as a first candidate for a metabolic site, based on energy theoretical data.


However, all identifications of metabolites using the drug-metabolism screening test and the mass spectrum are performed based on wet experiments, and this causes a problem to arise such that cost and time are required enormously.


In the technology described in Homepage of Drug metabolism research support system “BioFrontier/P450” developed by Fujitsu Ltd.: “http://venus.netlaboratory.com/material/messe/biofrontierp 450/”, because a prediction rate is dependent on the quality of training data used, there is also a problem that it is difficult to predict compounds which are extremely different from the population (compounds) of the training data used for creation of the prediction model. In other words, there is a problem that the range of predictable compounds is limited by the population, and prediction cannot always be applied to all types of compounds. That is, all types of compounds cannot always be accurately predicted.


In the technologies described in US Patent Application Laid Open No. 2003/0054430, US Patent Application Laid Open No. 2002/0040276, and US Patent Application Laid Open No. 2001/0044699, because a cubic form of the compound is not calculated, there is a problem that any compound, of which high reactivity is predicted based on energy calculation but which cannot be bound to a drug-metabolizing enzyme in terms of its form, may also be included in the compounds accordingly.


SUMMARY OF THE INVENTION

It is an object of the present invention to at least partially solve the problems in the conventional technology.


The above and other objects, features, advantages and technical and industrial significance of this invention will be better understood by reading the following detailed description of presently preferred embodiments of the invention, when considered in connection with the accompanying drawings.




BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a schematic for illustrating a basic principle of the present invention;



FIG. 2 is a block diagram of one example of a system to which the present invention is applied;



FIG. 3 is a diagram of one example of information stored in a compound-structure information file;



FIG. 4 is a diagram of one example of information stored in a binding-site identifying condition file;



FIG. 5 is a diagram of one example of information stored in a binding-site/molecular-species information file;



FIG. 6 is a diagram of one example of information stored in a pinching-point information file;



FIG. 7 is a diagram of one example of information stored in a reactive-site information file;



FIG. 8 is a diagram of one example of information stored in a reactive-site identifying condition file;



FIG. 9 is a diagram of one example of information stored in a pinching-point identifying condition file;



FIG. 10 is a diagram of one example of information stored in an open-space-site information file;



FIG. 11 is a diagram of one example of information stored in an open-space-site identifying condition file;



FIG. 12 is a diagram of one example of information stored in a prediction result file;



FIG. 13 is a block diagram of one example of a binding-site/molecular-species-information identifying unit 102b in the system to which the present invention is applied;



FIG. 14 is a block diagram of one example of a pinching-point/reactive-site-information acquiring unit in the system to which the present invention is applied;



FIG. 15 is a block diagram of one example of a reactive-site-information identifying unit in the system to which the present invention is applied;



FIG. 16 is a block diagram of one example of a pinching-point-information identifying unit in the system to which the present invention is applied;



FIG. 17 is a block diagram of one example of an open-space-site-information identifying unit in the system to which the present invention is applied;



FIG. 18 is a flowchart of one example of the main process in the system according to one embodiment of the present invention;



FIG. 19 is a flowchart of one example of a binding-site/molecular-species-information identifying process in the system according to the present embodiment;



FIG. 20 is a flowchart of one example of a pinching-point/reactive-site-information acquiring process in the system according to the present embodiment;



FIG. 21 is a flowchart of one example of a reactive-site-information identifying process in the system according to the present embodiment;



FIG. 22 is a flowchart of one example of a reactive-site coordinate range determining process in the system according to the present embodiment;



FIG. 23 is a flowchart of one example of a pinching-point-information identifying process in the system according to the present embodiment;



FIG. 24 is a flowchart of one example of an open-space-site-information identifying process in the system according to the present embodiment;



FIG. 25 is a schematic of the binding site, the reactive site, the pinching point, and the open space site in a compound;



FIG. 26 is a schematic of one example of a binding-site selection criterion;



FIG. 27 is a schematic of one example of a binding-site identifying condition;



FIG. 28 is a schematic of one example of a reactive-site identifying condition;



FIG. 29 is a schematic of one example of the reactive-site identifying condition;



FIG. 30 is a schematic of one example of an open-space-site identifying condition; and



FIG. 31 is a diagram for explaining one example of the main process in a drug-metabolizing enzyme prediction apparatus according to the present invention.




DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Exemplary embodiments of the present invention are explained in detail below with reference to the accompanying drawings. It should be noted that the present invention is not limited by these embodiments.



FIG. 1 is a schematic for illustrating a basic principle of the present invention.


The present invention schematically includes the following basic features. That is, at first, the present invention acquires compound structure information including at least one of atomic coordinate information being information on coordinates of each of atoms forming a compound, bond information being information on a bond between the atoms, and ring structure information being information on a ring structure formed with a plurality of the atoms (step S-1).


Then, the present invention identifies binding site information being information on an atom forming a binding site in a compound corresponding to the compound structure information and also molecular species information being information on a molecular species, from the compound structure information acquired at step S-1, based on a binding-site identifying condition for identifying a binding site which is a site, where a compound and a drug-metabolizing enzyme are bound to each other, and also a molecular species of the drug-metabolizing enzyme to be bound (step S-2).


From the binding site information identified at step S-2 and the compound structure information acquired at step S-1, the present invention identifies a pinching point which is an atom binding at least a reactive site being a site, where a metabolic reaction occurs in the compound and the drug-metabolizing enzyme, to the binding site, and acquires pinching point information being information on the atom as the pinching point identified and reactive site information being information on an atom forming the reactive site bound to the pinching point corresponding to the pinching point information (step S-3).


Then, the present invention identifies the reactive site information acquired at step S-3, from the molecular species information and the compound structure information, based on a reactive-site identifying condition for identifying a reactive site (step S-4).


Thereafter, the present invention identifies the pinching point information acquired at step S-3, from the molecular species information and the compound structure information, based on the pinching-point identifying condition for identifying a pinching point (step S-5).


Here, if there exists an open space site being a site which is other than the reactive site corresponding to the reactive site information identified at step S-4 and the binding site corresponding to the binding site information identified at step S-2 and which is bound to the pinching point corresponding to the pinching point information identified at step S-5, the present invention may acquire open-space-site information being information on an atom forming an open space site from the binding site information, the reactive site information, the pinching point information, and the compound structure information, and identify the open-space-site information acquired, from the molecular species information and the compound structure information, based on the open-space-site identifying condition for identifying an open space site.


When the reactive site information is identified at step S-4, the present invention may determine an atom forming a reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of atoms in a molecular structure of the reactive site, based on an inhibition determining condition (e.g., a list of predetermined atoms as inhibitors) for determining whether a metabolic reaction is inhibited in the compound of which the reactive site information is identified.


Furthermore, the present invention may output, as a prediction result for a compound, at least one of the binding site information, the molecular species information, the reactive site information, the pinching point information, the open-space site information, and the inhibition determination result in the compound, which are identified (predicted).


As explained above, the present invention can automatically identify the binding site, the pinching point, the reactive site, and the open space site in the compound from the compound structure information. Therefore, it is possible to accurately predict a molecular species of a drug-metabolizing enzyme and a reactive site of a compound with the drug-metabolizing enzyme for all kinds of compounds without using training data for prediction and in consideration of each cubic form of the compounds.


The present invention can further determine (check) whether any atom inhibiting a metabolic reaction is contained in the reactive site identified, for the compound of which the reactive site with the drug-metabolizing enzyme is identified. Therefore, a useful compound group can be narrowed highly accurately, for example, in the early stage of drug development.



FIG. 2 is a block diagram of one example of the system to which the present invention is applied, and only a portion related to the present invention in the configuration is conceptually shown therein.


In FIG. 2, a network 300 such as the Internet has functions of mutually connecting between the drug-metabolizing enzyme prediction apparatus 100 and an external system 200.


In FIG. 2, the external system 200 is mutually connected with the drug-metabolizing enzyme prediction apparatus 100 through the network 300, and has a function of providing a compound library, an external database on the compound structure information, and a website for executing various external programs, to users.


Here, the external system 200 may be configured as a WEB server or an ASP server, and its hardware may be configured by an information processor such as commercially available work station and personal computer, and its attachment device. The functions of the external system 200 are implemented by a CPU, a disk drive, a memory device, an input device, an output device, a communication control device in the hardware of the external system 200, and by programs for controlling these devices.


In FIG. 2, the drug-metabolizing enzyme prediction apparatus 100 schematically includes a control unit 102 such as CPU that integrally controls the whole of the drug-metabolizing enzyme prediction apparatus 100; a communication-control interface unit 104 connected to a communication device (not shown) such as a router connected to a communication line and the like; a memory unit 106 that stores various types of databases and files; and an input/output-control interface unit 108 connected to an input device 112 and an output device 114. These units are communicably connected to one another through arbitrary communication paths. Furthermore, the drug-metabolizing enzyme prediction apparatus 100 is communicably connected to the network 300 through a communication device such as a router and a wired or a wireless communication line such as a dedicated line.


The various types of databases, tables, and files (the compound-structure information file 106a to the prediction result file 106j) stored in the memory unit 106 of FIG. 2 are storage units such as fixed disk devices, which store various programs, tables, files, and databases used for various processes, and files for web pages.


Among these components in the memory unit 106, the compound-structure information file 106a is a compound-structure-information storage unit that stores the compound structure information including at least one of the atomic coordinate information being information on coordinates of each of atoms forming a compound, the bond information being information on a bond between the atoms, and the ring structure information being information on a ring structure formed with a plurality of the atoms. Information stored in the compound-structure information file 106a is explained below with reference to FIG. 3.



FIG. 3 is a diagram of one example of information stored in the compound-structure information file 106a. As shown in FIG. 3, the information stored in the compound-structure information file 106a includes atom identifying information for uniquely identifying an atom forming a compound; an atomic symbol of the atom corresponding to the atom identifying information; atomic coordinate information (x coordinate value, y coordinate value, and z coordinate value) for the atom corresponding to the atom identifying information; bond information that includes bond identifying information for uniquely identifying the bond information, atom identifying information for an atom bonded to the atom corresponding to the atom identifying information in the bond identifying information, and bond type information on bond types; and ring structure information that includes ring identifying information for uniquely identifying the ring structure information and also includes atom identifying information for atoms forming the ring structure, which are mutually associated with one another.


The compound-structure information file 106a may be external compound-structure information databases, which are accessed through the Internet, or may be an in-house database created by copying these databases, storing original compound structure information, and further adding unique annotation information and so forth.


The binding-site identifying condition file 106b is a binding-site identifying condition storage unit that stores the binding-site identifying condition (e.g., condition under which a binding-site coordinate range, which is a range of atomic coordinate information for an atom forming a binding site, is defined for each of predetermined molecular species) for identifying a binding site being a site where a compound and a drug-metabolizing enzyme are bound to each other and also identifying a molecular species of the drug-metabolizing enzyme to be bound. Here, information stored in the binding-site identifying condition file 106b is explained below with reference to FIG. 4.



FIG. 4 is a diagram of one example of information stored in the binding-site identifying condition file 106b. As shown in FIG. 4, the information stored in the binding-site identifying condition file 106b includes molecular species information and ranges (range of ±x direction, range of ±y direction, and range of ±z direction) of atomic coordinate information for an atom forming a binding site, which are mutually associated with one another (in a matrix form).


The binding-site/molecular-species information file 106c is a binding-site/molecular-species information storage unit that stores the binding site information and the molecular species information identified by a binding-site/molecular-species-information identifying unit 102b explained later. Information stored in the binding-site/molecular-species information file 106c is explained below with reference to FIG. 5.



FIG. 5 is a diagram of one example of information stored in the binding-site/molecular-species information file 106c. As shown in FIG. 5, the information stored in the binding-site/molecular-species information file 106c includes binding-site identifying information for uniquely identifying a binding site, atom identifying information for an atom forming the binding site, and molecular species information for a drug-metabolizing enzyme bound to the binding site, which are mutually associated with one another.


The pinching-point information file 106d is a pinching-point information storage unit that stores pinching point information which is identified by the pinching-point-information identifying unit 102e and which is information on a pinching point being an atom for binding at least a reactive site to a binding site, the reactive site being a site where a metabolic reaction occurs in a compound and a drug-metabolizing enzyme. Information stored in the pinching-point information file 106d is explained below with reference to FIG. 6.



FIG. 6 is a diagram of one example of information stored in the pinching-point information file 106d. As shown in FIG. 6, the information stored in the pinching-point information file 106d includes pinching-point identifying information for uniquely identifying a pinching point, atom identifying information for an atom as the pinching point, binding-site identifying information for a binding site bound to the pinching point, reactive-site identifying information for a reactive site bound to the pinching point, and open-space-site identifying information for an open space site bound to the pinching point, which are mutually associated with one another.


The reactive-site information file 106e is a reactive site information storage unit that stores reactive site information identified by the reactive-site-information identifying unit 102d explained later. Information stored in the reactive-site information file 106e is explained below with reference to FIG. 7.



FIG. 7 is a diagram of one example of information stored in the reactive-site information file 106e. As shown in FIG. 7, the information stored in the reactive-site information file 106e includes reactive-site identifying information for uniquely identifying a reactive site, pinching-point identifying information for a pinching point bound to the reactive site, and atom identifying information for atoms forming the reactive site, which are mutually associated with one another.


The reactive-site identifying condition file 106f is a reactive-site identifying condition storage unit that stores the reactive-site identifying condition for identifying a reactive site being a site where a metabolic reaction occurs in a compound and a drug-metabolizing enzyme (e.g., condition under which the ranges as follows are defined for each of the predetermine molecular species: a reactive-site coordinate range being a range of atomic coordinate information for atoms forming the reactive site, a reactive-site angle range being a range of an angle value between each of the atoms forming the reactive site and the pinching point, and a reactive-site distance range being a range of a distance value between each of the atoms forming the reactive site and the pinching point). Information stored in the reactive-site identifying condition file 106f is explained below with reference to FIG. 8.



FIG. 8 is a diagram of one example of information stored in the reactive-site identifying condition file 106f. As shown in FIG. 8, the information stored in the reactive-site identifying condition file 106f includes the molecular species information, ranges of the atomic coordinate information for an atom forming a reactive site (range of ±x direction, range of ±y direction, and range of ±z direction), a range of an angle value between the atom forming the reactive site and the pinching point, and a range of a distance value between the atom forming the reactive site and the pinching point, which are mutually associated with one another (in a matrix form).


The pinching-point identifying condition file 106g is a pinching-point identifying condition storage unit that stores the pinching-point identifying condition for identifying a pinching point which is an atom binding at least between a reactive site and a binding site. Information stored in the pinching-point identifying condition file 106g is explained below with reference to FIG. 9.



FIG. 9 is a diagram of one example of information stored in the pinching-point identifying condition file 106g. As shown in FIG. 9, the information stored in the pinching-point identifying condition file 106g includes the molecular species information, and the ranges of the atomic coordinate information for an atom as a pinching point (range of ±x direction, range of ±y direction, and range of z direction), which are mutually associated with one another (in a matrix form).


The open-space-site information file 106h is an open-space-site-information storage unit that stores open-space site information identified by the open-space-site-information identifying unit 102g explained later. Information stored in the open-space-site information file 106h is explained below with reference to FIG. 10.



FIG. 10 is a diagram of one example of information stored in the open-space-site information file 106h. As shown in FIG. 10, the information stored in the open-space-site information file 106h includes open-space-site identifying information for uniquely identifying an open space site, pinching-point identifying information for a pinching point bound to the open space site, and the atom identifying information for an atom forming the open space site, which are mutually associated with one another.


The open-space-site identifying condition file 106i is an open-space-site identifying condition storage unit that stores the open-space-site identifying condition for identifying an open space site being a site which is other than the reactive site and the binding site and is bound to the pinching point. Information stored in the open-space-site identifying condition file 106i is explained below with reference to FIG. 11.



FIG. 11 is a diagram of one example of information stored in the open-space-site identifying condition file 106i. As shown in FIG. 11, the information stored in the open-space-site identifying condition file 106i includes the molecular species information, and the ranges of the atomic coordinate information for an atom forming the open space site (range of ±x direction, range of ±y direction, and range of ±z direction), which are mutually associated with one another (in a matrix form).


The prediction result file 106j is a prediction-result storage unit that stores a prediction result including at least one of the binding site information, the molecular species information, the reactive site information, the pinching point information, the open-space site information, and the inhibition determination result, for a compound, which are identified. Information stored in the prediction result file 106j is explained below with reference to FIG. 12.



FIG. 12 is a diagram of one example of information stored in the prediction result file 106j. As shown in FIG. 12, the information stored in the prediction result file 106j includes the binding-site identifying information for the binding site identified, the molecular species information identified, the pinching-point identifying information for the pinching point identified, the reactive-site identifying information for the reactive site identified, the open-space-site identifying information for the open space site identified, and the inhibition determination result that is the result of determination on the inhibition of the reactive site in an inhibition determining unit 102h explained later, which are mutually associated with one another.


As other information, the memory unit 106 of the drug-metabolizing enzyme prediction apparatus 100 stores, for example, programs for automatically generating a plurality of conformations.


In FIG. 2, the communication-control interface unit 104 controls communications between the drug-metabolizing enzyme prediction apparatus 100 and the network 300 (or the communication device such as a router). That is, the communication-control interface unit 104 has a function of communicating data with other terminals through the communication line.


In FIG. 2, the input/output-control interface unit 108 controls the input device 112 and the output device 114. As the output device 114, a speaker can be used other than a monitor (including a home television) (hereinafter, the output device 114 may sometimes be described “monitor”). As the input device 112, a keyboard, a mouse, and a microphone can be used. The monitor also implements a function as a pointing device in corporation with the mouse.


Furthermore, in FIG. 2, the control unit 102 includes an internal memory for storing control programs such as OS (Operating System), programs defining various types of procedures, and required data, and performs information processing to execute various processes by these programs. The control unit 102 functionally includes a compound-structure-information acquiring unit 102a, the binding-site/molecular-species-information identifying unit 102b, the pinching-point/reactive-site-information acquiring unit 102c, the reactive-site-information identifying unit 102d, the pinching-point-information identifying unit 102e, an open-space-site-information acquiring unit 102f, the open-space-site-information identifying unit 102g, the inhibition determining unit 102h, and a prediction-result output unit 102i.


Among these, the compound-structure-information acquiring unit 102a acquires the compound structure information including at least one of the atomic coordinate information being information on coordinates of each of atoms forming a compound, the bond information being information on a bond between the atoms, and the ring structure information being information on a ring structure formed with a plurality of the atoms.


The binding-site/molecular-species-information identifying unit 102b identifies the binding site information being information on an atom forming a binding site in a compound corresponding to the compound structure information, and the molecular species information being information on molecular species, based on the binding-site identifying condition. Here, as shown in FIG. 13, the binding-site/molecular-species-information identifying unit 102b further includes a binding-site selecting unit 102j, a binding-site-information acquiring unit 102k, and an acquired-binding-site/molecular-species-information identifying unit 102m.



FIG. 13 is a block diagram of one example of the binding-site/molecular-species-information identifying unit 102b in the system to which the present invention is applied, and only a portion related to the present invention in the configuration is conceptually shown therein. In FIG. 13, the binding-site selecting unit 102j selects a binding site based on a predetermined binding-site selection criterion. The binding-site-information acquiring unit 102k acquires binding site information from the compound structure information. The acquired-binding-site/molecular-species-information identifying unit 102m identifies the binding site information acquired in the binding-site-information acquiring unit 102k from the compound structure information based on the binding-site identifying condition, and identifies the molecular species corresponding to the binding site information identified, as molecular species information. Here, as shown in FIG. 13, the acquired-binding-site/molecular-species-information identifying unit 102m further includes a coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n. The coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n identifies binding site information when the atomic coordinate information for an atom forming a binding site corresponding to the binding site information acquired in the binding-site-information acquiring unit 102k satisfies a binding-site coordinate range defined by the binding-site identifying condition, and identifies the molecular species corresponding to the binding-site coordinate range satisfied, as molecular species information.


Referring back to FIG. 2, the pinching-point/reactive-site-information acquiring unit 102c specifies a pinching point, and acquires pinching point information for the pinching point specified and reactive site information for a reactive site bound to the pinching point specified, from the binding site information identified and the compound structure information. Here, as shown in FIG. 14, the pinching-point/reactive-site-information acquiring unit 102c further includes an atom specifying unit 102p, a ring-structure determining unit 102q, a pinching-point/reactive-site specifying unit 102r, and a specified-pinching-point/reactive-site-information acquiring unit 102s.



FIG. 14 is a block diagram of one example of the pinching-point/reactive-site-information acquiring unit 102c in the system to which the present invention is applied, and only a portion related to the present invention in the configuration is conceptually shown therein. In FIG. 14, the atom specifying unit 102p specifies a directly bonded atom being an atom bonded to the atom forming a binding site corresponding to the binding site information identified, and specifies an atom bonded to the directly bonded atom, based on the bond information included in the compound structure information. The ring-structure determining unit 102q determines whether the atom specified is an atom forming the ring structure, based on the ring structure information included in the compound structure information. The pinching-point/reactive-site specifying unit 102r specifies the atom specified and another atom forming the ring structure together with the atom as atoms forming the reactive site when it is determined that the atom specified forms the ring structure, and specifies the directly bonded atom specified, as a pinching point. The specified-pinching-point/reactive-site-information acquiring unit 102s acquires the pinching point information for the pinching point specified and the reactive site information for the reactive site specified, from the binding site information and the compound structure information.


Referring back again to FIG. 2, the reactive-site-information identifying unit 102d identifies reactive site information from the molecular species information and the compound structure information, based on the reactive-site identifying condition for identifying a reactive site. As shown in FIG. 15, the reactive-site-information identifying unit 102d further includes a coordinate-based reactive-site-information determining unit 102t, an angle-based reactive-site-information determining unit 102u, a distance-based reactive-site-information determining unit 102v, and a determination-result-based reactive-site-information identifying unit 102w.



FIG. 15 is a block diagram of one example of the reactive-site-information identifying unit 102d in the system to which the present invention is applied, and only a portion related to the present invention in the configuration is conceptually shown therein. In FIG. 15, the coordinate-based reactive-site-information determining unit 102t determines whether atomic coordinate information for an atom forming the reactive site corresponding to the reactive site information satisfies the reactive-site coordinate range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified. The angle-based reactive-site-information determining unit 102u calculates an angle value between each of the atoms forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determines whether the angle value calculated satisfies the reactive-site angle range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified. The distance-based reactive-site-information determining unit 102v calculates a distance value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determines whether the distance value calculated satisfies the reactive-site distance range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified. The determination-result-based reactive-site-information identifying unit 102w identifies the reactive site information determined as being satisfied in the coordinate-based reactive-site-information determining unit 102t, the angle-based reactive-site-information determining unit 102u, and the distance-based reactive-site-information determining unit 102v, as reactive site information that satisfies the reactive-site identifying condition.


Referring back again to FIG. 2, the pinching-point-information identifying unit 102e identifies pinching point information from the molecular species information and the compound structure information, based on the pinching-point identifying condition. Here, as shown in FIG. 16, the pinching-point-information identifying unit 102e further includes a coordinate-based pinching-point-information identifying unit 102x.



FIG. 16 is a block diagram of one example of the pinching-point-information identifying unit 102e in the system to which the present invention is applied, and only a portion related to the present invention in the configuration is conceptually shown therein. In FIG. 16, the coordinate-based pinching-point-information identifying unit 102x identifies the pinching point information when the atomic coordinate information for the atom corresponding to the pinching point information satisfies the pinching-point coordinate range defined by the pinching-point identifying condition, in the molecular species corresponding to the molecular species information identified.


Referring back again to FIG. 2, if there exists an open space site which is a site other than the reactive site identified and the binding site identified and is bound to the pinching point identified, the open-space-site-information acquiring unit 102f acquires open-space site information being information on an atom forming the open space site, from the binding site information, the reactive site information, the pinching point information, and the compound structure information.


The open-space-site-information identifying unit 102g identifies open-space-site information from the molecular species information and the compound structure information, based on the open-space-site identifying condition. Here, as shown in FIG. 17, the open-space-site-information identifying unit 102g further includes a coordinate-based open-space-site-information identifying unit 102y.



FIG. 17 is a block diagram of one example of the open-space-site-information identifying unit 102g in the system to which the present invention is applied, and only a portion related to the present invention in the configuration is conceptually shown therein. In FIG. 17, the coordinate-based open-space-site-information identifying unit 102y identifies open-space-site information when the atomic coordinate information for an atom forming an open space site corresponding to the open-space-site information satisfies an open-space-site coordinate range defined by the open-space-site identifying condition, in the molecular species corresponding to the molecular species information identified.


Referring back again to FIG. 2, when the reactive site information is identified by the reactive-site-information identifying unit 102d explained later, the inhibition determining unit 102h determines whether an atom forming the reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of the atoms in a molecular structure at the reactive site inhibits a metabolic reaction in the compound, based on the inhibition determining condition (e.g., a list of predetermined atoms as inhibitors) for determining whether the metabolic reaction is inhibited in the compound of which the reactive site information is identified.


The prediction-result output unit 102i outputs at least one of the binding site information, the molecular species information, the reactive site information, the pinching point information, the open-space site information, and the inhibition determination result identified in a compound, as a result of prediction for the compound.


Details of the processes in these units are explained later.


One of the process in the system according to the present embodiment configured in the above manner is explained in detail below with reference to FIG. 18 to FIG. 24.


Details of the main process are explained first with reference to FIG. 18. FIG. 18 is a flowchart of one example of the main process in the system according to the present embodiment.


At first, in the drug-metabolizing enzyme prediction apparatus 100, the compound-structure-information acquiring unit 102a acquires the compound structure information including at least one of the atomic coordinate information being information on coordinates of each of atoms forming a compound, the bond information being information on a bond between the atoms, and the ring structure information being information on a ring structure formed with a plurality of the atoms, and stores the compound structure information in a predetermined storage area of the compound-structure information file 106a (step SA-1).


At step SA-1, the compound structure information to be acquired may be, for example, compound structure information stored in the external system 200 or the like, compound structure information stored in the compound library, and virtual compound structure information input by a user through the input device 112.


Then, in the drug-metabolizing enzyme prediction apparatus 100, the binding-site/molecular-species-information identifying unit 102b identifies binding site information and molecular species information from the compound structure information acquired at step SA-1, based on the binding-site identifying condition stored in the binding-site identifying condition file 106b, and stores them in a predetermined storage area of the binding-site/molecular-species information file 106c (step SA-2).


At step SA-2, the binding-site/molecular-species-information identifying unit 102b may cause the user to select a binding site based the predetermined binding-site selection criterion, to acquire binding site information corresponding to the binding site selected from the compound structure information, identify the binding site information acquired, from the compound structure information, based on the binding-site identifying condition, and identify a molecular species corresponding to the binding site information identified, as molecular species information (binding-site/molecular-species-information identifying process). It may also cause the user to select at least one of an original point, and the x-axis, the y-axis, and the z-axis with respect to the original point, from an atom forming a binding site.


After the user selects the binding site, the drug-metabolizing enzyme prediction apparatus 100 may generate a plurality of conformations for the compound structure information, using, for example, automatic conformation generating software (see, for example, “http://www.rsi.co.jp/kagaku/cs/info/pdf.files/moe.pdf”, “http://homepage2.nifty.com/ccsnews2/2002/3q/20023Qrsimoe2 00203.htm”, and “http://homepage2.nifty.com/ccsnews2/2003/2q/20032Q/ccs03s rsi.htm”), and perform processes at step SA-3 and thereafter as follows for each conformation generated.


The binding-site selection criterion may be specifically defined such that a site is selected as a binding site, the site being a ring structure in a compound or a pseudo-ring structure externally similar to the ring structure (or pseudo-ring structure containing an acid group (e.g., carboxyl and sulfone)). More specifically, as shown in FIG. 26, it may be defined to select a site as a binding site if there is the site containing a bond “S═O” in its structure (e.g., Omeprazole or H259/31) specific to CYP (cytochrome P450) 2C19, and to select a site, as a binding site, where one pseudo-ring structure is formed with three ring structures if Losartan, in its structure specific to CYP2C9 (e.g., Losartan).


The binding-site identifying condition may be such that each range of a value which each atom forming a binding site can take is conceptually defined for each width (±x direction), depth (±y direction), and thickness (∓z direction). More specifically, for example, the width (±x direction) may conceptually be a range decided based on the condition of functional group as follows. The depth (±y direction) may conceptually be a range decided for each molecular species of CYP, and may be a range where a benzene ring is the maximum if, for example, CYP2C19. The thickness (±z direction) may conceptually be a range where the thickness of a mesyl group is the maximum. As shown in FIG. 27, the condition of functional group is such that R1 and R2 form a planar structure and a carboxyl group is the maximum (e.g., if the bond is “═O” and “═S”), R3 and R4 form a planar structure and a benzene ring is the maximum, and R5 becomes the same as that of R1 and R2 (however, CH3 is also allowed if CYP2C8).


Details of the binding-site/molecular-species-information identifying process are explained below with reference to FIG. 19. FIG. 19 is a flowchart of one example of the binding-site/molecular-species-information identifying process in the system according to the present embodiment.


At first, in the binding-site/molecular-species-information identifying unit 102b, the binding-site selecting unit 102j causes the user to select a binding site based on the predetermined binding-site selection criterion (step SB-1).


Then, in the binding-site/molecular-species-information identifying unit 102b, the binding-site-information acquiring unit 102k acquires binding site information for the binding site selected at step SB-1, and stores the binding site information in a predetermined storage area of the binding-site/molecular-species information file 106c (step SB-2).


In the binding-site/molecular-species-information identifying unit 102b, the acquired-binding-site/molecular-species-information identifying unit 102m identifies the binding site information acquired at step SB-2, from the compound structure information, based on the binding-site identifying condition stored in the binding-site identifying condition file 106b, and identifies molecular species corresponding to the binding site information identified, as molecular species information.


More specifically, in the acquired-binding-site/molecular-species-information identifying unit 102m, the coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n determines whether the x coordinate value, in the atomic coordinate information for the atom forming the binding site corresponding to the binding site information acquired at step SB-2, satisfies the binding-site coordinate range (±x direction) defined by the binding-site identifying condition stored in the binding-site identifying condition file 106b (step SB-3).


Then, in the acquired-binding-site/molecular-species-information identifying unit 102m, if “it is satisfied” is determined at step SB-3 (step SB-4: Yes), the coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n acquires molecular species information corresponding to the binding-site coordinate range (±x direction) satisfied, and stores the molecular species information in a predetermined storage area of the binding-site/molecular-species information file 106c (step SB-5).


In the acquired-binding-site/molecular-species-information identifying unit 102m, the coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n determines whether the y coordinate value in the atomic coordinate information satisfies the binding-site coordinate range (±y direction) defined by the binding-site identifying condition stored in the binding-site identifying condition file 106b, in the molecular species corresponding to the molecular species information acquired at step SB-5 (step SB-6).


Then, in the acquired-binding-site/molecular-species-information identifying unit 102m, if “it is satisfied” is determined at step SB-6 (step SB-7: Yes), the coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n determines whether the z coordinate value in the atomic coordinate information satisfies the binding-site coordinate range (±z direction) defined by the binding-site identifying condition stored in the binding-site identifying condition file 106b, in the molecular species corresponding to the molecular species information acquired at step SB-5 (step SB-8).


Then, in the acquired-binding-site/molecular-species-information identifying unit 102m, if “it is satisfied” is determined at step SB-8 (step SB-9: Yes), the coordinate-based acquired-binding-site/molecular-species-information identifying unit 102n returns again to the process at step SB-3 if there is any atom that is not determined (step SB-10: Yes), and identifies the binding site information acquired at step SB-2 if there are no atoms that are not determined (step SB-10: No), in other words, if all the atoms are determined, and identifies the molecular species information acquired at step SB-5.


When “it is not satisfied” is determined at step SB-3, step SB-6, and step SB-8 (step SB-4: No, step SB-7: No, and step SB-9: No), the process is terminated even if there still remains an atom which is not determined.


Now, the binding-site/molecular-species-information identifying process is completed.


Referring back to FIG. 18, in the drug-metabolizing enzyme prediction apparatus 100, the pinching-point/reactive-site-information acquiring unit 102c specifies a pinching point from the binding site information identified at step SA-2 and from the compound structure information acquired at step SA-1, acquires pinching point information for the pinching point specified and reactive site information for a reactive site bound to the pinching point corresponding to the pinching point information, and stores them in a predetermined storage area of the pinching-point information file 106d and in a predetermined storage area of the reactive-site information file 106e, respectively (step SA-3).


At step SA-3, the pinching-point/reactive-site-information acquiring unit 102c may also be configured to specify a directly bonded atom being an atom bonded to the atom forming the binding site corresponding to the binding site information identified at step SA-2, and also specify an atom bonded to the directly bonded atom, based on the bond information included in the compound structure information; determine whether the atom specified is an atom forming the ring structure, based on the ring structure information included in the compound structure information; specify, if it is determined that the atom specified is an atom forming the ring structure, this atom specified and another atom forming the ring structure together with this atom as atoms forming the reactive site; to specify the directly bonded atom specified, as a pinching point; and acquire the pinching point information for the pinching point specified and the reactive site information for the reactive site specified, from the binding site information and the compound structure information, respectively (pinching-point/reactive-site-information acquiring process).


If it is determined that the atom specified is not an atom forming the ring structure, it is determined whether the directly bonded atom is an atom forming the ring structure. If it is determined that the directly bonded atom is an atom forming the ring structure, an atom, which forms a binding site corresponding to the binding site information identified at step SA-2 and is bonded to the directly bonded atom, is specified as a pinching point, and the directly bonded atom and another atom forming the ring structure together with the directly bonded atom may be specified as atoms forming a reactive site. In other words, the atom forming the binding site may sometimes be a pinching point.


The atom specified as the pinching point may be an atom bonded to the atom forming the ring structure, selected from among atoms each located, for example, one bond away from the atom forming the binding site.


It may also be considered that a compound is suppressed by CYP at the most strict point of an allowable range of the atomic coordinate information (e.g., ±x direction) between an atom-forming the binding site and a pinching point bound to the atom.


The pinching-point/reactive-site-information acquiring unit 102c may acquire a plurality pieces of pinching point information. In this case, the processes for identifying pinching point information and identifying reactive site information explained later are performed on the pinching point information and the corresponding reactive site information, respectively.


Details of the pinching-point/reactive-site-information acquiring process are explained below with reference to FIG. 20. FIG. 20 is a flowchart of one example of the pinching-point/reactive-site-information acquiring process in the system according to the present embodiment.


At first, in the pinching-point/reactive-site-information acquiring unit 102c, the atom specifying unit 102p specifies a directly bonded atom and an atom bonded to the directly bonded atom, based on the bond information included in the compound structure information (step SC-1).


Then, the ring-structure determining unit 102q in the pinching-point/reactive-site-information acquiring unit 102c determines whether the atom specified at step SC-1 is an atom forming the ring structure based on the ring structure information included in the compound structure information (step SC-2).


Then, if it is determined that the atom specified at step SC-2 is an atom forming the ring structure, the pinching-point/reactive-site specifying unit 102r in the pinching-point/reactive-site-information acquiring unit 102c specifies the atom specified and another atom forming the ring structure together with the atom, as atoms forming a reactive site, and specifies the directly bonded atom specified, as a pinching point (step SC-3).


If it is determined at step SC-3 that the atom specified is not an atom forming the ring structure, the ring-structure determining unit 102q in the pinching-point/reactive-site-information acquiring unit 102c determines whether the directly bonded atom is an atom forming the ring structure. If it is determined that the directly bonded atom is an atom forming the ring structure, the pinching-point/reactive-site specifying unit 102r specifies the atom, which forms the binding site corresponding to the binding site information identified and is bonded to the directly bonded atom, as a pinching point.


Then, the specified-pinching-point/reactive-site-information acquiring unit 102s in the pinching-point/reactive-site-information acquiring unit 102c acquires pinching point information for the pinching point specified at step SC-3 and reactive site information for the reactive site specified from the binding site information and the compound structure information, and stores these information in a predetermined storage area of the pinching-point information file 106d and in a predetermined storage area of the reactive-site information file 106e, respectively (step SC-4).


Now, the pinching-point/reactive-site-information acquiring process is completed.


Referring back to FIG. 18, the reactive-site-information identifying unit 102d in the drug-metabolizing enzyme prediction apparatus 100 identifies the reactive site information acquired at step SA-3 from the molecular species information and the compound structure information, based on the reactive-site identifying condition stored in the reactive-site identifying condition file 106f (step SA-4).


The reactive-site identifying condition used at step SA-4 may be a condition such that the reactive-site coordinate range, the reactive-site angle range, and the reactive-site distance range are defined for each of the predetermine molecular species. Alternatively, the reactive-site identifying condition may be processes of determining whether the atomic coordinate information for an atom forming the reactive site corresponding to the reactive site information acquired at step SA-3 satisfies the reactive-site coordinate range in the molecular species corresponding to the molecular species information identified at step SA-2; calculating an angle value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information; determining whether the angle value calculated satisfies the reactive-site angle range in the molecular species corresponding to the molecular species information identified at step SA-2; calculating a distance value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information; determining whether the distance value calculated satisfies the reactive-site distance range in the molecular species corresponding to the molecular species information identified at step SA-2; and identifying the reactive site information determined that it satisfies the reactive-site coordinate range, the reactive-site angle range, and the reactive-site distance range, as reactive site information that satisfies the reactive-site identifying condition (reactive-site-information identifying process).


The reactive-site coordinate range may be such that a range of a value, which each atom forming the reactive site can take, is conceptually defined for each of width (±x direction), depth (±y direction), and thickness (±z direction). More specifically, for example, the width (±x direction) may conceptually be a range (extension in +x direction from, for example, the binding site) similar to the range in the binding-site identifying condition. The depth (±y direction) may conceptually be a range defined for each molecular species of CYP, and may be a narrow range if, for example, CYP2C9, and may be a wide range if CYP2C19. Furthermore, the thickness (±z direction) may conceptually be a range such that the thickness of cyclohexane (chair form) is the maximum.


As shown in FIG. 28, the reactive-site angle range may conceptually be a range of an angle value from a coordinate axis between atoms when the atom as the pinching point and the atom forming the reactive site are projected to a previously defined two-dimensional coordinate system (two-dimensional coordinate system in which the pinching point is set as the origin in FIG. 28).


The reactive-site distance range may conceptually be a range of a linear distance between the atom as the pinching point and the atom forming the reactive site. More specifically, as shown in FIG. 29, the reactive-site distance range may be a range of a distance (e.g., distance on the z coordinate) between a pinching point (in FIG. 29, atom with number of “1”) and an atom apart from the pinching point by five carbons (in FIG. 29, atom with number of “5”).


Details of the reactive-site-information identifying process are explained below with reference to FIG. 21. FIG. 21 is a flowchart of one example of the reactive-site-information identifying process in the system according to the present embodiment.


At first, in the reactive-site-information identifying unit 102d, the coordinate-based reactive-site-information determining unit 102t determines whether the atomic coordinate information for an atom forming the reactive site corresponding to the reactive site information acquired at step SA-3 satisfies the reactive-site coordinate range defined by the reactive-site identifying condition stored in the reactive-site identifying condition file 106f, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SD-1) (reactive-site coordinate range determining process).


Details of the reactive-site coordinate range determining process performed at step SD-1 are explained below with reference to FIG. 22. FIG. 22 is a flowchart of one example of the reactive-site coordinate range determining process in the system according to the present embodiment.


At first, in the reactive-site-information identifying unit 102d, the coordinate-based reactive-site-information determining unit 102t determines whether the x coordinate value in the atomic coordinate information for the atom forming the reactive site corresponding to the reactive site information acquired at step SA-3 satisfies the reactive-site coordinate range (+x direction) defined by the reactive-site identifying condition stored in the reactive-site identifying condition file 106f, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SE-1).


Then, if “it is satisfied” is determined at step SE-1 (step SE-2: Yes), the coordinate-based reactive-site-information determining unit 102t in the reactive-site-information identifying unit 102d determines whether the y coordinate value in the atomic coordinate information satisfies the reactive-site coordinate range (±y direction) defined by the reactive-site identifying condition stored in the reactive-site identifying condition file 106f, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SE-3).


Then, if “it is satisfied” is determined at step SE-3 (step SE-4: Yes), the coordinate-based reactive-site-information determining unit 102t in the reactive-site-information identifying unit 102d determines whether the z coordinate value in the atomic coordinate information satisfies the reactive-site coordinate range (±z direction) defined by the reactive-site identifying condition stored in the reactive-site identifying condition file 106f, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SE-5).


If “it is not satisfied” is determined at step SE-1 and step SE-3 (step SE-2: No and step SE-4: No), the process is terminated.


Now, the reactive-site coordinate range determining process is completed.


Referring back again to FIG. 21, in the reactive-site-information identifying unit 102d, if “it is satisfied” is determined at step SD-1 (if “it is satisfied” is determined at step SE-5) (step SD-2: Yes), the angle-based reactive-site-information determining unit 102u calculates an angle value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determines whether the angle value calculated satisfies the reactive-site angle range defined by the reactive-site identifying condition stored in the reactive-site identifying condition file 106f, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SD-3).


Then, in the reactive-site-information identifying unit 102d, if “it is satisfied” is determined at step SD-3 (step SD-4: Yes), the distance-based reactive-site-information determining unit 102v calculates a distance value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determines whether the distance value calculated satisfies the reactive-site distance range defined by the reactive-site identifying condition stored in the reactive-site identifying condition file 106f, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SD-5).


In the reactive-site-information identifying unit 102d, the determination-result-based reactive-site-information identifying unit 102w returns again to the process at step SD-1 if there is any atom that is not determined yet (step SD-6: Yes), and identifies the reactive site information acquired at step SA-3 if there are no atoms that are not determined (step SB-6: No), in other words, if all the atoms are determined. More specifically, when it is determined that the atomic coordinate information for all the atoms forming the reactive site corresponding to the reactive site information acquired at step SA-3 “is satisfied” at step SD-1, and when it is determined that the atomic coordinate information for the atoms forming the reactive site corresponding to the reactive site information acquired at step SA-3 “is satisfied” at step SD-3 and step SD-5, the reactive site information acquired at SA-3 is identified.


If “it is not satisfied” is determined at step SD-1 (step SD-2: No), the process is terminated even if there still remains an atom which is not determined. If “it is not satisfied” is determined at step SD-3 (step SD-4: No), the process proceeds to step SD-6 as shown in FIG. 21.


Now, the reactive-site-information identifying process is completed.


Referring back again to FIG. 18, in the drug-metabolizing enzyme prediction apparatus 100, the pinching-point-information identifying unit 102e identifies the pinching point information acquired at step SA-3 from the molecular species information and the compound structure information, based on the pinching-point identifying condition stored in the pinching-point identifying condition file 106g (step SA-5).


The pinching-point identifying condition used at step SA-5 may be a condition such that the pinching-point coordinate range, which is a range of the atomic coordinate information for an atom as the pinching point, is defined for each of the predetermine molecular species. Alternatively, the pinching-point-information identifying unit 102e may identify the pinching point information when the atomic coordinate information for the atom as the pinching point corresponding to the pinching point information acquired at step SA-3 satisfies the pinching-point coordinate range, in the molecular species corresponding to the molecular species information identified at step SA-2 (pinching-point-information identifying process).


Details of the pinching-point-information identifying process are explained below with reference to FIG. 23. FIG. 23 is a flowchart of one example of the pinching-point-information identifying process in the system according to the present embodiment.


At first, in the pinching-point-information identifying unit 102e, the coordinate-based pinching-point-information identifying unit 102x determines whether the x coordinate value in the atomic coordinate information for the atom corresponding to the pinching point information acquired at step SA-3 satisfies the pinching-point coordinate range (±x direction) defined by the pinching-point identifying condition stored in the pinching-point identifying condition file 106g, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SF-1).


Then, in the pinching-point-information identifying unit 102e, if “it is satisfied” is determined at step SF-1 (step SF-2: Yes), the coordinate-based pinching-point-information identifying unit 102x determines whether the y coordinate value in the atomic coordinate information for the atom corresponding to the pinching point information satisfies the pinching-point coordinate range (±y direction) defined by the pinching-point identifying condition stored in the pinching-point identifying condition file 106g, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SF-3).


Then, in the pinching-point-information identifying unit 102e, if “it is satisfied” is determined at step SF-3 (step SF-4: Yes), the coordinate-based pinching-point-information identifying unit 102x determines whether the z coordinate value in the atomic coordinate information for the atom corresponding to the pinching point information satisfies the pinching-point coordinate range (±z direction) defined by the pinching-point identifying condition stored in the pinching-point identifying condition file 106g, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SF-5).


Then, in the pinching-point-information identifying unit 102e, if “it is satisfied” is determined at step SF-5, the coordinate-based pinching-point-information identifying unit 102x identifies the pinching point information acquired at step SA-3.


The pinching point identified may serve an atom which forms the binding site as a result.


Now, the pinching-point-information identifying process is completed.


In the drug-metabolizing enzyme prediction apparatus 100, if there exists an open space site being a site which is other than the reactive site corresponding to the reactive site information identified at step SA-4 and the binding site corresponding to the binding site information identified at step SA-2 and is bound to the pinching point corresponding to the pinching point information identified at step SA-5, the open-space-site-information acquiring unit 102f may acquire open-space site information being information on an atom forming the open space site from the binding site information, the reactive site information, the pinching point information, and the compound structure information, and store the open-space site information in a predetermined storage area of the open-space-site information file 106h; and the open-space-site-information identifying unit 102g may also identify the open-space site information acquired, from the molecular species information and the compound structure information, based on the open-space-site identifying condition stored in the open-space-site identifying condition file 106i for identifying the open space site. The open-space-site identifying condition may be a condition such that the open-space-site coordinate range, which is a range of the atomic coordinate information for an atom forming an open space site is defined for each of the predetermine molecular species. Alternatively, in the open-space-site-information identifying unit 102g, the coordinate-based open-space-site-information identifying unit 102y may identify open-space site information when the atomic coordinate information for the atom forming the open space site corresponding to the open-space site information acquired satisfies the open-space-site coordinate range defined by the open-space-site identifying condition stored in the open-space-site identifying condition file 106i, in the molecular species corresponding to the molecular species information identified (open-space-site-information identifying process).


Furthermore, as shown in FIG. 30, the open-space-site coordinate range may conceptually be a range such that the range of a value which each atom forming the open space site can take is defined by each of the width (±x direction), the depth (±y direction), and the thickness (±z direction).


Details of the open-space-site-information identifying process are explained below with reference to FIG. 24. FIG. 24 is a flowchart of one example of the open-space-site-information identifying process in the system according to the present embodiment.


At first, in the open-space-site-information identifying unit 102g, the coordinate-based open-space-site-information identifying unit 102y determines whether the x coordinate value in the atomic coordinate information for the atom forming the open space site corresponding to the open-space site information acquired by the open-space-site-information acquiring unit 102f satisfies the open-space-site coordinate range (±x direction) defined by the open-space-site identifying condition stored in the open-space-site identifying condition file 106i, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SG-1).


Then, in the open-space-site-information identifying unit 102g, if “it is satisfied” is determined at step SG-1 (step SG-2: Yes), the coordinate-based open-space-site-information identifying unit 102y determines whether the y coordinate value in the atomic coordinate information satisfies the open-space-site coordinate range (±y direction) defined by the open-space-site identifying condition stored in the open-space-site identifying condition file 106i, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SG-3).


Then, in the open-space-site-information identifying unit 102g, if “it is satisfied” is determined at step SG-3 (step SG-4: Yes), the coordinate-based open-space-site-information identifying unit 102y determines whether the z coordinate value in the atomic coordinate information satisfies the open-space-site coordinate range (±z direction) defined by the open-space-site identifying condition stored in the open-space-site identifying condition file 106i, in the molecular species corresponding to the molecular species information identified at step SA-2 (step SG-5).


Then, in the open-space-site-information identifying unit 102g, if “it is satisfied” is determined at step SG-5 (step SG-6: Yes), the coordinate-based open-space-site-information identifying unit 102y returns again to the process at step SG-1 if there is any atom that is not determined yet (step SG-7: Yes), and identifies the open-space site information acquired in the open-space-site-information acquiring unit 102f if there are no atoms that are not determined (step SB-7: No), in other words, if all the atoms are determined.


When “it is not satisfied” is determined at step SG-1, step SG-3, and step SG-5 (step SG-2: No, step SG-4: No, and step SG-6: No), the process is terminated even if there still remains an atom which is not determined.


Now, the open-space-site-information identifying process is completed.


In the drug-metabolizing enzyme prediction apparatus 100, if the reactive site information is identified at step SA-4, the inhibition determining unit 102h may determine whether an atom forming the reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of atoms in the molecular structure at the reactive site inhibits a metabolic reaction in a compound, based on the inhibition determining condition (e.g., a list of predetermined atoms as inhibitors) for determining whether the metabolic reaction is inhibited in the compound of which the reactive site information is identified. More specifically, in the drug-metabolizing enzyme prediction apparatus 100, if the molecular species of the drug-metabolizing enzyme bound to the compound and the reactive site where a metabolic reaction occurs in the compound bound to the drug-metabolizing enzyme, are identified by the process, the inhibition determining unit 102h may determine whether any atom (e.g., amide (N)) inhibiting the metabolic reaction is contained in atoms forming the reactive site of the compound.


In the drug-metabolizing enzyme prediction apparatus 100, the prediction-result output unit 102i may store at least one of the binding site information, the molecular species information, the reactive site information, the pinching point information, and the open-space site information, for the compound, and the inhibition determination result, in a predetermined storage area of the prediction result file 106j, as a prediction result for the compound, and output the prediction result through the output device 114.


Now, the main process is completed.


As explained above, according to the present embodiment, in the drug-metabolizing enzyme prediction apparatus 100, the compound-structure-information acquiring unit 102a acquires the compound structure information including at least one of the atomic coordinate information being information on coordinates of each of atoms forming a compound, the bond information being information on a bond between the atoms, and the ring structure information being information on a ring structure formed with a plurality of the atoms. The binding-site/molecular-species-information identifying unit 102b identifies the binding site information being information on the atom forming the binding site in the compound corresponding to the compound structure information and the molecular species information being information on the molecular species, from the compound structure information acquired, based on the binding-site identifying condition for identifying the binding site being a site where the compound and the drug-metabolizing enzyme are bound to each other and identifying the molecular species of the drug-metabolizing enzyme to be bound. The pinching-point/reactive-site-information acquiring unit 102c specifies the pinching point which is an atom binding at least the reactive site where a metabolic reaction occurs in the compound and the drug-metabolizing enzyme to the binding site, and acquires the pinching point information being the information on the atom as the pinching point specified and the reactive site information being information on the atom forming the reactive site bound to the pinching point corresponding to the pinching point information, from the binding site information and the compound structure information identified. The reactive-site-information identifying unit 102d identifies the reactive site information acquired, from the molecular species information and the compound structure information, based on the reactive-site identifying condition for identifying the reactive site. The pinching-point-information identifying unit 102e identifies the pinching point information acquired, from the molecular species information and the compound structure information, based on the pinching-point identifying condition for identifying the pinching point. Therefore, it is possible to accurately predict a molecular species of a drug-metabolizing enzyme and a reactive site of a compound with the drug-metabolizing enzyme, for all types of compounds without using training data for prediction and in consideration of each cubic form of the compounds.


That is, according to the present embodiment, the drug-metabolizing enzyme prediction apparatus 100 allows accurate prediction of a molecular species of a drug-metabolizing enzyme and a reactive site of a compound with the drug-metabolizing enzyme, for all types of compounds without using training data for prediction and in consideration of each cubic form of the compounds. Therefore, a compound group can accurately be narrowed down, for example, in the early stage of drug development.


According to the present embodiment, in the drug-metabolizing enzyme prediction apparatus 100, if the reactive site information is identified at step SA-4, the inhibition determining unit 102h may determine whether an atom forming a reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of atoms in the molecular structure at the reactive site inhibits a metabolic reaction, based on the inhibition determining condition (e.g., a list of predetermined atoms as inhibitors) for determining whether the metabolic reaction is inhibited in the compound of which reactive site information is identified. Therefore, it is possible to determine (check) whether any atom inhibiting the metabolic reaction is contained in the atoms forming the reactive site predicted, and this allows the compound group to be more accurately narrowed.



FIG. 31 is a diagram for explaining an example of the main process in the drug-metabolizing enzyme prediction apparatus 100 according to an example of the present invention.


At first, in the drug-metabolizing enzyme prediction apparatus 100, the compound-structure-information acquiring unit 102a according to the present embodiment acquires a compound library (which corresponds to the compound structure information according to the present embodiment) stored in the external system 200 (step SH-1: compound-library acquiring process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, if the compound library cannot be acquired at step SH-1 (step SH-2: No), the compound-structure-information acquiring unit 102a according to the present embodiment acquires a virtual compound library input by the user (step SH-3: virtual-compound-library acquiring process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, the binding-site/molecular-species-information identifying unit 102b according to the present embodiment identifies the binding site information from the compound library acquired at step SH-1 or the virtual compound library acquired at step SH-3 (step SH-4: binding-site identifying process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, the pinching-point-information identifying unit 102e according to the present embodiment identifies the pinching point information from the compound library acquired at step SH-1 or the virtual compound library acquired at step SH-3 (step SH-5: pinching-point identifying process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, the open-space-site-information identifying unit 102g according to the present embodiment identifies the open-space site information from the compound library acquired at step SH-1 or the virtual compound library acquired at step SH-3 (step SH-6: open-space-site identifying process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, the reactive-site-information identifying unit 102d according to the present embodiment identifies the reactive site information from the compound library acquired at step SH-1 or the virtual compound library acquired at step SH-3 (step SH-7: reactive site identifying process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, the control unit 102 according to the present embodiment in the drug-metabolizing enzyme prediction apparatus 100 confirms the molecular species of a drug-metabolizing enzyme (CYP) based on the identification results at step SH-4 to step SH-7, and the inhibition determining unit 102h according to the present embodiment further determines whether any atom inhibiting the metabolic reaction is contained in the atoms forming the reactive site of the compound in which the binding site, the pinching point, the open space site, and the reactive site are identified at step SH-4 to step SH-7 (step SH-8: CYP confirming process).


Then, in the drug-metabolizing enzyme prediction apparatus 100, the prediction-result output unit 102i according to the present embodiment stores the binding site, the pinching point, the open space site, and the reactive site identified at step SH-4 to step SH-7, and the metabolic prediction result including the molecular species and the metabolic inhibition result confirmed at step SH-8, in the prediction result file 106j according to the present embodiment (metabolic-prediction-result database), and outputs (displays) them to (on) the output device 114 (e.g., monitor) according to the present embodiment (step SH-9).


According to the example, by respectively evaluating the binding site, the pinching point, the reactive site, and the open space site, it is possible to accurately predict a molecular species and a reactive site of a drug-metabolizing enzyme for all types of compounds, and also efficiently verify whether a metabolic reaction actually occurs at the reactive site predicted. This allows a useful compound group to be accurately narrowed, for example, in the early stage of drug development.


Although the invention has been described with respect to a specific embodiment for a complete and clear disclosure, the appended claims are not to be thus limited but are to be construed as embodying all modifications and alternative constructions that may occur to one skilled in the art that fairly fall within the basic teaching herein set forth.

Claims
  • 1. A drug-metabolizing enzyme prediction apparatus comprising: a compound-structure-information acquiring unit that acquires compound structure information including at least one of atomic coordinate information that is information on coordinates of each of atoms forming a compound, bond information that is information on a bond between the atoms, and ring structure information that is information on a ring structure formed with a plurality of the atoms; a binding-site/molecular-species-information identifying unit that identifies binding site information that is information on an atom forming a binding site in the compound corresponding to the compound structure information and molecular species information that is information on a molecular species, from the compound structure information acquired by the compound-structure-information acquiring unit, based on a binding-site identifying condition for identifying the binding site that is a site where the compound and a drug-metabolizing enzyme are bound to each other and the molecular species of the drug-metabolizing enzyme to be bound; a pinching-point/reactive-site-information acquiring unit that specifies a pinching point that is the atom binding at least between the binding site and a reactive site that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing enzyme, from the binding site information identified by the binding-site/molecular-species-information identifying unit and the compound structure information, and that acquires pinching point information that is information on the atom of the specified pinching point and the reactive site information that is information on the atom forming the reactive site bound to the pinching point corresponding to the pinching point information; a reactive-site-information identifying unit that identifies the reactive site information acquired by the pinching-point/reactive-site-information acquiring unit, from the molecular species information and the compound structure information, based on a reactive-site identifying condition for identifying the reactive site; and a pinching-point-information identifying unit that identifies the pinching point acquired by the pinching-point/reactive-site-information acquiring unit, from the molecular species information and the compound structure information, based on a pinching-point identifying condition for identifying the pinching point.
  • 2. The drug-metabolizing enzyme prediction apparatus according to claim 1, wherein the binding-site/molecular-species-information identifying unit includes a binding-site selecting unit that selects the binding site based on a predetermined binding-site selection criterion; a binding-site-information acquiring unit that acquires the binding site information for the binding site selected by the binding-site selecting unit, from the compound structure information; and an acquired-binding-site/molecular-species-information identifying unit that identifies the binding site information acquired by the binding-site-information acquiring unit, from the compound structure information, based on the binding-site identifying condition, and identifies the molecular species corresponding to the identified binding site information as the molecular species information.
  • 3. The drug-metabolizing enzyme prediction apparatus according to claim 2, wherein the binding-site identifying condition is a condition in which a binding-site coordinate range that is a range of the atomic coordinate information for the atom forming the binding site is defined for each predetermined molecular species, and the acquired-binding-site/molecular-species-information identifying unit includes a coordinate-based acquired-binding-site/molecular-species-information identifying unit that identifies the binding site information when the atomic coordinate information for the atom that forms the binding site corresponding to the binding site information acquired by the binding-site-information acquiring unit satisfies the binding-site coordinate range defined by the binding-site identifying condition, and identifies the molecular species corresponding to the satisfied binding-site coordinate range as the molecular species information.
  • 4. The drug-metabolizing enzyme prediction apparatus according to claim 1, wherein the pinching-point/reactive-site-information acquiring unit includes an atom specifying unit that specifies a directly bonded atom that is the atom bonded to the atom forming the binding site corresponding to the binding site information identified by the binding-site/molecular-species-information identifying unit and also specifies the atom bonded to the directly bonded atom, based on the bond information included in the compound structure information; a ring-structure determining unit that determines whether the atom specified by the atom specifying unit is the atom forming the ring structure, based on the ring structure information included in the compound structure information; a pinching-point/reactive-site specifying unit that specifies the atom specified and another atom forming the ring structure together with the atom, as the atoms forming the reactive site, when it is determined by the ring-structure determining unit that the atom specified by the atom specifying unit is the atom forming the ring structure, and that specifies the directly bonded atom specified by the atom specifying unit, as the pinching point; and a specified-pinching-point/reactive-site-information acquiring unit that acquires the pinching point information for the pinching point specified by the pinching-point/reactive-site specifying unit and the reactive site information for the reactive site specified, from the binding site information and the compound structure information.
  • 5. The drug-metabolizing enzyme prediction apparatus according to claim 1, wherein the reactive-site identifying condition is such that a reactive-site coordinate range that is a range of the atomic coordinate information for the atoms forming the reactive site, a reactive-site angle range that is a range of an angle value between each of the atoms forming the reactive site and the pinching point, and a reactive-site distance range that is a range of a distance value between each of the atoms forming the reactive site and the pinching point are defined for each of the molecular species previously specified, and the reactive-site-information identifying unit includes a coordinate-based reactive-site-information determining unit that determines whether the atomic coordinate information for the atoms, which form the reactive site corresponding to the reactive site information acquired by the pinching-point/reactive-site-information acquiring unit, satisfies the reactive-site coordinate range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; an angle-based reactive-site-information determining unit that calculates the angle value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determines whether the angle value calculated satisfies the reactive-site angle range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; a distance-based reactive-site-information determining unit that calculates the distance value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determines whether the distance value calculated satisfies the reactive-site distance range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; and a determination-result-based reactive-site-information identifying unit that identifies the reactive site information determined, as being satisfied, by the coordinate-based reactive-site-information determining unit, the angle-based reactive-site-information determining unit, and the distance-based reactive-site-information determining unit, as the reactive site information that satisfies the reactive-site identifying condition.
  • 6. The drug-metabolizing enzyme prediction apparatus according to claim 1, wherein the pinching-point identifying condition is such that a pinching-point coordinate range that is a range of the atomic coordinate information for the atom as the pinching point is defined for each of the molecular species previously specified, and the pinching-point-information identifying unit includes a coordinate-based pinching-point-information identifying unit that identifies the pinching point information when the atomic coordinate information for the atom, which is the pinching point corresponding to the pinching point information acquired by the pinching-point/reactive-site-information acquiring unit, satisfies the pinching-point coordinate range defined by the pinching-point identifying condition, in the molecular species corresponding to the molecular species information identified.
  • 7. The drug-metabolizing enzyme prediction apparatus according to claim 1, further comprising: an open-space-site-information acquiring unit that acquires open-space site information that is information on the atom forming an open space site, from the binding site information, the reactive site information, the pinching point information, and the compound structure information, if there exists the open space site that is a site other than the reactive site corresponding to the reactive site information identified by the reactive-site-information identifying unit and the binding site corresponding to the binding site information identified by the binding-site/molecular-species-information identifying unit and which is bound to the pinching point corresponding to the pinching point information identified by the pinching-point-information identifying unit; and an open-space-site-information identifying unit that identifies the open-space site information acquired by the open-space-site-information acquiring unit, from the molecular species information and the compound structure information, based on an open-space-site identifying condition for identifying the open space site.
  • 8. The drug-metabolizing enzyme prediction apparatus according to claim 7, wherein the open-space-site identifying condition is such that an open-space-site coordinate range, which is a range of the atomic coordinate information for the atom forming the open space site, is defined for each of the molecular species previously specified, and the open-space-site-information identifying unit includes a coordinate-based open-space-site-information identifying unit that identifies the open-space site information when the atomic coordinate information for the atom, which forms the open space site corresponding to the open-space site information acquired by the open-space-site-information acquiring unit, satisfies the open-space-site coordinate range defined by the open-space-site identifying condition, in the molecular species corresponding to the molecular species information identified.
  • 9. The drug-metabolizing enzyme prediction apparatus according to claim 1, further comprising: an inhibition determining unit that determines whether the atom forming the reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of the atoms in a molecular structure at the reactive site inhibits the metabolic reaction, when the reactive site information is identified by the reactive-site-information identifying unit, based on an inhibition determining condition for determining whether the metabolic reaction is inhibited in the compound of which the reactive site information is identified.
  • 10. A computer-readable drug-metabolizing enzyme prediction method comprising: a compound-structure-information acquiring step of acquiring compound structure information including at least one of atomic coordinate information that is information on coordinates of each of atoms forming a compound, bond information that is information on a bond between the atoms, and ring structure information that is information on a ring structure formed with a plurality of the atoms; a binding-site/molecular-species-information identifying step of identifying binding site information that is information on an atom forming a binding site in the compound corresponding to the compound structure information and molecular species information that is information on a molecular species, from the compound structure information acquired at the compound-structure-information acquiring step, based on a binding-site identifying condition for identifying the binding site that is a site where the compound and a drug-metabolizing enzyme are bound to each other and the molecular species of the drug-metabolizing enzyme to be bound; a pinching-point/reactive-site-information acquiring step of specifying a pinching point that is the atom binding at least between the binding site and a reactive site that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing enzyme, from the binding site information identified at the binding-site/molecular-species-information identifying step and the compound structure information, and acquiring pinching point information that is information on the atom of the specified pinching point and the reactive site information that is information on the atom forming the reactive site bound to the pinching point corresponding to the pinching point information; a reactive-site-information identifying step of identifying the reactive site information acquired at the pinching-point/reactive-site-information acquiring step, from the molecular species information and the compound structure information, based on a reactive-site identifying condition for identifying the reactive site; and a pinching-point-information identifying step of identifying the pinching point acquired at the pinching-point/reactive-site-information acquiring step, from the molecular species information and the compound structure information, based on a pinching-point identifying condition for identifying the pinching point.
  • 11. The drug-metabolizing enzyme prediction method according to claim 10, wherein the binding-site/molecular-species-information identifying step includes a binding-site selecting step of selecting the binding site based on a predetermined binding-site selection criterion; a binding-site-information acquiring step of acquiring the binding site information for the binding site selected at the binding-site selecting step, from the compound structure information; and an acquired-binding-site/molecular-species-information identifying step of identifying the binding site information acquired at the binding-site-information acquiring step, from the compound structure information, based on the binding-site identifying condition, and identifies the molecular species corresponding to the identified binding site information as the molecular species information.
  • 12. The drug-metabolizing enzyme prediction method according to claim 11, wherein the binding-site identifying condition is a condition in which a binding-site coordinate range that is a range of the atomic coordinate information for the atom forming the binding site is defined for each predetermined molecular species, and the acquired-binding-site/molecular-species-information identifying step includes a coordinate-based acquired-binding-site/molecular-species-information identifying step of identifying the binding site information when the atomic coordinate information for the atom that forms the binding site corresponding to the binding site information acquired at the binding-site-information acquiring step satisfies the binding-site coordinate range defined by the binding-site identifying condition, and identifying the molecular species corresponding to the satisfied binding-site coordinate range as the molecular species information.
  • 13. The drug-metabolizing enzyme prediction method according to claim 10, wherein the pinching-point/reactive-site-information acquiring step includes an atom specifying step of specifying a directly bonded atom that is the atom bonded to the atom forming the binding site corresponding to the binding site information identified at the binding-site/molecular-species-information identifying step and also specifying the atom bonded to the directly bonded atom, based on the bond information included in the compound structure information; a ring-structure determining step of determining whether the atom specified at the atom specifying step is the atom forming the ring structure, based on the ring structure information included in the compound structure information; a pinching-point/reactive-site specifying step of specifying the atom specified and another atom forming the ring structure together with the atom, as the atoms forming the reactive site, when it is determined at the ring-structure determining step that the atom specified at the atom specifying step is the atom forming the ring structure, and specifying the directly bonded atom specified at the atom specifying step, as the pinching point; and a specified pinching-point/reactive-site-information acquiring step of acquiring the pinching point information for the pinching point specified at the pinching-point/reactive-site specifying step and the reactive site information for the reactive site specified, from the binding site information and the compound structure information.
  • 14. The drug-metabolizing enzyme prediction method according to claim 10, wherein the reactive-site identifying condition is such that a reactive-site coordinate range that is a range of the atomic coordinate information for the atoms forming the reactive site, a reactive-site angle range that is a range of an angle value between each of the atoms forming the reactive site and the pinching point, and a reactive-site distance range that is a range of a distance value between each of the atoms forming the reactive site and the pinching point are defined for each of the molecular species previously specified, and the reactive-site-information identifying step includes a coordinate-based reactive-site-information determining step of determining whether the atomic coordinate information for the atoms, which form the reactive site corresponding to the reactive site information acquired at the pinching-point/reactive-site-information acquiring step, satisfies the reactive-site coordinate range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; an angle-based reactive-site-information determining step of calculating the angle value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determining whether the angle value calculated satisfies the reactive-site angle range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; a distance-based reactive-site-information determining step of calculating the distance value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determining whether the distance value calculated satisfies the reactive-site distance range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; and a determination-result-based reactive-site-information identifying step of identifying the reactive site information determined, as being satisfied, at the coordinate-based reactive-site-information determining step, the angle-based reactive-site-information determining step, and the distance-based reactive-site-information determining step, as the reactive site information that satisfies the reactive-site identifying condition.
  • 15. The drug-metabolizing enzyme prediction method according to claim 10, wherein the pinching-point identifying condition is such that a pinching-point coordinate range that is a range of the atomic coordinate information for the atom as the pinching point is defined for each of the molecular species previously specified, and the pinching-point-information identifying step includes a coordinate-based pinching-point-information identifying step of identifying the pinching point information when the atomic coordinate information for the atom, which is the pinching point corresponding to the pinching point information acquired at the pinching-point/reactive-site-information acquiring step, satisfies the pinching-point coordinate range defined by the pinching-point identifying condition, in the molecular species corresponding to the molecular species information identified.
  • 16. The drug-metabolizing enzyme prediction method according to claim 10, further comprising: an open-space-site-information acquiring step of acquiring open-space site information that is information on the atom forming an open space site, from the binding site information, the reactive site information, the pinching point information, and the compound structure information, if there exists the open space site that is a site other than the reactive site corresponding to the reactive site information identified at the reactive-site-information identifying step and the binding site corresponding to the binding site information identified at the binding-site/molecular-species-information identifying step and which is bound to the pinching point corresponding to the pinching point information identified at the pinching-point-information identifying step; and an open-space-site-information identifying step of identifying the open-space site information acquired at the open-space-site-information acquiring step, from the molecular species information and the compound structure information, based on an open-space-site identifying condition for identifying the open space site.
  • 17. The drug-metabolizing enzyme prediction method according to claim 16, wherein the open-space-site identifying condition is such that an open-space-site coordinate range, which is a range of the atomic coordinate information for the atom forming the open space site, is defined for each of the molecular species previously specified, and the open-space-site-information identifying step includes a coordinate-based open-space-site-information identifying step of identifying the open-space site information when the atomic coordinate information for the atom, which forms the open space site corresponding to the open-space site information acquired at the open-space-site-information acquiring step, satisfies the open-space-site coordinate range defined by the open-space-site identifying condition, in the molecular species corresponding to the molecular species information identified.
  • 18. The drug-metabolizing enzyme prediction method according to claim 10, further comprising: an inhibition determining step of determining whether the atom forming the reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of the atoms in a molecular structure at the reactive site inhibits the metabolic reaction, when the reactive site information is identified at the reactive-site-information identifying step, based on an inhibition determining condition for determining whether the metabolic reaction is inhibited in the compound of which the reactive site information is identified.
  • 19. A computer-readable recording medium that stores therein a drug-metabolizing enzyme prediction program, wherein the drug-metabolizing enzyme prediction program a computer to execute: a compound-structure-information acquiring procedure of acquiring compound structure information including at least one of atomic coordinate information that is information on coordinates of each of atoms forming a compound, bond information that is information on a bond between the atoms, and ring structure information that is information on a ring structure formed with a plurality of the atoms; a binding-site/molecular-species-information identifying procedure of identifying binding site information that is information on an atom forming a binding site in the compound corresponding to the compound structure information and molecular species information that is information on a molecular species, from the compound structure information acquired at the compound-structure-information acquiring procedure, based on a binding-site identifying condition for identifying the binding site that is a site where the compound and a drug-metabolizing enzyme are bound to each other and the molecular species of the drug-metabolizing enzyme to be bound; a pinching-point/reactive-site-information acquiring procedure of specifying a pinching point that is the atom binding at least between the binding site and a reactive site that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing enzyme, from the binding site information identified at the binding-site/molecular-species-information identifying procedure and the compound structure information, and acquiring pinching point information that is information on the atom of the specified pinching point and the reactive site information that is information on the atom forming the reactive site bound to the pinching point corresponding to the pinching point information; a reactive-site-information identifying procedure of identifying the reactive site information acquired at the pinching-point/reactive-site-information acquiring procedure, from the molecular species information and the compound structure information, based on a reactive-site identifying condition for identifying the reactive site; and a pinching-point-information identifying procedure of identifying the pinching point acquired at the pinching-point/reactive-site-information acquiring procedure, from the molecular species information and the compound structure information, based on a pinching-point identifying condition for identifying the pinching point.
  • 20. The computer-readable recording medium according to claim 19, wherein the binding-site/molecular-species-information identifying procedure includes a binding-site selecting procedure of selecting the binding site based on a predetermined binding-site selection criterion; a binding-site-information acquiring procedure of acquiring the binding site information for the binding site selected at the binding-site selecting procedure, from the compound structure information; and an acquired-binding-site/molecular-species-information identifying procedure of identifying the binding site information acquired at the binding-site-information acquiring procedure, from the compound structure information, based on the binding-site identifying condition, and identifies the molecular species corresponding to the identified binding site information as the molecular species information.
  • 21. The computer-readable recording medium according to claim 20, wherein the binding-site identifying condition is a condition in which a binding-site coordinate range that is a range of the atomic coordinate information for the atom forming the binding site is defined for each predetermined molecular species, and the acquired-binding-site/molecular-species-information identifying procedure includes a coordinate-based acquired-binding-site/molecular-species-information identifying procedure of identifying the binding site information when the atomic coordinate information for the atom that forms the binding site corresponding to the binding site information acquired at the binding-site-information acquiring procedure satisfies the binding-site coordinate range defined by the binding-site identifying condition, and identifying the molecular species corresponding to the satisfied binding-site coordinate range as the molecular species information.
  • 22. The computer-readable recording medium according to claim 19, wherein the pinching-point/reactive-site-information acquiring procedure includes an atom specifying procedure of specifying a directly bonded atom that is the atom bonded to the atom forming the binding site corresponding to the binding site information identified at the binding-site/molecular-species-information identifying procedure and also specifying the atom bonded to the directly bonded atom, based on the bond information included in the compound structure information; a ring-structure determining procedure of determining whether the atom specified at the atom specifying procedure is the atom forming the ring structure, based on the ring structure information included in the compound structure information; a pinching-point/reactive-site specifying procedure of specifying the atom specified and another atom forming the ring structure together with the atom, as the atoms forming the reactive site, when it is determined at the ring-structure determining procedure that the atom specified at the atom specifying procedure is the atom forming the ring structure, and specifying the directly bonded atom specified at the atom specifying procedure, as the pinching point; and a specified pinching-point/reactive-site-information acquiring procedure of acquiring the pinching point information for the pinching point specified at the pinching-point/reactive-site specifying procedure and the reactive site information for the reactive site specified, from the binding site information and the compound structure information.
  • 23. The computer-readable recording medium according to claim 19, wherein the reactive-site identifying condition is such that a reactive-site coordinate range that is a range of the atomic coordinate information for the atoms forming the reactive site, a reactive-site angle range that is a range of an angle value between each of the atoms forming the reactive site and the pinching point, and a reactive-site distance range that is a range of a distance value between each of the atoms forming the reactive site and the pinching point are defined for each of the molecular species previously specified, and the reactive-site-information identifying procedure includes a coordinate-based reactive-site-information determining procedure of determining whether the atomic coordinate information for the atoms, which form the reactive site corresponding to the reactive site information acquired at the pinching-point/reactive-site-information acquiring procedure, satisfies the reactive-site coordinate range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; an angle-based reactive-site-information determining procedure of calculating the angle value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determining whether the angle value calculated satisfies the reactive-site angle range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; a distance-based reactive-site-information determining procedure of calculating the distance value between the atom forming the reactive site corresponding to the reactive site information and the pinching point corresponding to the pinching point information, and determining whether the distance value calculated satisfies the reactive-site distance range defined by the reactive-site identifying condition, in the molecular species corresponding to the molecular species information identified; and a determination-result-based reactive-site-information identifying procedure of identifying the reactive site information determined, as being satisfied, at the coordinate-based reactive-site-information determining procedure, the angle-based reactive-site-information determining procedure, and the distance-based reactive-site-information determining procedure, as the reactive site information that satisfies the reactive-site identifying condition.
  • 24. The computer-readable recording medium according to claim 19, wherein the pinching-point identifying condition is such that a pinching-point coordinate range that is a range of the atomic coordinate information for the atom as the pinching point is defined for each of the molecular species previously specified, and the pinching-point-information identifying procedure includes a coordinate-based pinching-point-information identifying procedure of identifying the pinching point information when the atomic coordinate information for the atom, which is the pinching point corresponding to the pinching point information acquired at the pinching-point/reactive-site-information acquiring procedure, satisfies the pinching-point coordinate range defined by the pinching-point identifying condition, in the molecular species corresponding to the molecular species information identified.
  • 25. The computer-readable recording medium according to claim 19, wherein the drug-metabolizing enzyme prediction program further causes the computer to execute: an open-space-site-information acquiring procedure of acquiring open-space site information that is information on the atom forming an open space site, from the binding site information, the reactive site information, the pinching point information, and the compound structure information, if there exists the open space site that is a site other than the reactive site corresponding to the reactive site information identified at the reactive-site-information identifying procedure and the binding site corresponding to the binding site information identified at the binding-site/molecular-species-information identifying procedure and which is bound to the pinching point corresponding to the pinching point information identified at the pinching-point-information identifying procedure; and an open-space-site-information identifying procedure of identifying the open-space site information acquired at the open-space-site-information acquiring procedure, from the molecular species information and the compound structure information, based on an open-space-site identifying condition for identifying the open space site.
  • 26. The computer-readable recording medium according to claim 25, wherein the open-space-site identifying condition is such that an open-space-site coordinate range, which is a range of the atomic coordinate information for the atom forming the open space site, is defined for each of the molecular species previously specified, and the open-space-site-information identifying procedure includes a coordinate-based open-space-site-information identifying procedure of identifying the open-space site information when the atomic coordinate information for the atom, which forms the open space site corresponding to the open-space site information acquired at the open-space-site-information acquiring procedure, satisfies the open-space-site coordinate range defined by the open-space-site identifying condition, in the molecular species corresponding to the molecular species information identified.
  • 27. The computer-readable recording medium according to claim 19, wherein the drug-metabolizing enzyme prediction program further causes the computer to execute: an inhibition determining procedure of determining whether the atom forming the reactive site corresponding to the reactive site information or a partial-structure atom group which is a group of part of the atoms in a molecular structure at the reactive site inhibits the metabolic reaction, when the reactive site information is identified at the reactive-site-information identifying procedure, based on an inhibition determining condition for determining whether the metabolic reaction is inhibited in the compound of which the reactive site information is identified.
Continuations (1)
Number Date Country
Parent PCT/JP04/04746 Mar 2004 US
Child 11524208 Sep 2006 US