VISUALIZING THOUSAND-DIMENSIONAL CHEMICAL DIVERSITY

Information

  • Research Project
  • 6386403
  • ApplicationId
    6386403
  • Core Project Number
    R44GM058919
  • Full Project Number
    5R44GM058919-03
  • Serial Number
    58919
  • FOA Number
  • Sub Project Id
  • Project Start Date
    9/1/1998 - 27 years ago
  • Project End Date
    11/30/2003 - 22 years ago
  • Program Officer Name
    PREUSCH, PETER C.
  • Budget Start Date
    7/1/2001 - 24 years ago
  • Budget End Date
    11/30/2003 - 22 years ago
  • Fiscal Year
    2001
  • Support Year
    3
  • Suffix
  • Award Notice Date
    6/25/2001 - 24 years ago
Organizations

VISUALIZING THOUSAND-DIMENSIONAL CHEMICAL DIVERSITY

The goal of this work is to provide interactive computer visualizations which research scientists can use to interpret high throughput screening data and to make combinatorial chemistry choices. The simplest drug discovery principle is that compounds similar in enough properties are usually similar in biological activity. Similarity often involves measures in high-dimensional spaces, such as molecular fingerprints or shape descriptors. Uses of similarity in drug discovery research may apply to millions of compounds from virtual libraries of potentially synthesizable compounds. To examine relationships among vast numbers of compounds in diversity space, by simple graphical interactions with two dimensional maps of the space, allows the intuition of experienced scientists to come into play. The algorithms for visualization of thousand dimensional diversity spaces rely on horizons, which are distances beyond which the distance matrix need not be resolved, and on efficient subsampling methods. These concepts also enable selection of optimal descriptors to cluster compounds for predictive use, when combined in genetic algorithms. Optimal descriptors help not only in visualizing important features of diversity space, but in deciding which compounds to make and test next during early analoging of active substances. PROPOSED COMMERCIAL APPLICATION: Software that performs diversity selections needs these visualization tools. Compound libraries offered for random screening or following up on hits are more valuable when their designs can be illustrated. The tools apply to new areas such as differential gene expression data analysis. New methods for analyzing HTS data have commercial potential of improving the process of early drug discovery research.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R44
  • Administering IC
    GM
  • Application Type
    5
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    380538
  • Sub Project Total Cost
  • ARRA Funded
  • CFDA Code
    821
  • Ed Inst. Type
  • Funding ICs
    NIGMS:380538\
  • Funding Mechanism
  • Study Section
    ZRG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    TRIPOS, INC.
  • Organization Department
  • Organization DUNS
    099666216
  • Organization City
    SAINT LOUIS
  • Organization State
    MO
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    63144
  • Organization District
    UNITED STATES