Bayesian Support Vector Machines for the Prediction of Molecular-Genetic Network Motifs Across Organisms

Information

  • NSF Award
  • 0523643
Owner
  • Award Id
    0523643
  • Award Effective Date
    8/1/2005 - 20 years ago
  • Award Expiration Date
    7/31/2008 - 17 years ago
  • Award Amount
    $ 249,999.00
  • Award Instrument
    Standard Grant

Bayesian Support Vector Machines for the Prediction of Molecular-Genetic Network Motifs Across Organisms

Progress in the biological sciences in the post-genomic era depends on our ability to make sense of genome-scale information. The genome and the proteome together establish an intricate network of interactions that exhibits many similarities to social and political networks. This interaction network forms a simple, conceptual representation of the molecular machinery in the living cell. Knowledge of individual interactions and patterns of interaction therefore significantly enhances our understanding of the mechanisms of biological function. <br/><br/>Unfortunately, experimental determination of molecular interactions at the scale of the entire genome is often error-prone: many interactions revealed by such high-throughput experiments are false, and conversely, many of the actual interactions are not revealed at all. It is important, therefore, to establish rigorous computational methods that utilize high-throughput data from a variety of sources to predict the existence (and lack thereof) of an interaction, and to assign confidence levels to each prediction in a systematic manner. Our project utilizes state-of-the-art predictive methods from the field of Artificial Intelligence-Bayesian support vector machines-to predict molecular interactions at the whole-genome level. The project is initiated by an exhaustive data collection effort involving a variety of data sources that supply putative predictors for the presence or absence of interactions among protein/protein or gene/protein pairs. Dominant predictors among these will be isolated, and the prediction system will be applied to the genomes of several organisms, including the budding yeast, worm, and fly. The accuracy of the method will be tested and refined by computational and biological means. Successful completion of this project will significantly enhance our ability to decipher genomic information and apply these findings to discover novel functional pathways of biological, agricultural, and medical importance. <br/><br/>All methods and results will be publicly disseminated, the former with stand-alone executable programs, and the latter via publications and web pages. The project will support interdisciplinary training of graduate and postdoctoral students, and should provide research opportunities for undergraduate students.

  • Program Officer
    Mitra Basu
  • Min Amd Letter Date
    7/26/2005 - 20 years ago
  • Max Amd Letter Date
    7/26/2005 - 20 years ago
  • ARRA Amount

Institutions

  • Name
    Keck Graduate Institute
  • City
    Claremont
  • State
    CA
  • Country
    United States
  • Address
    535 Watson Drive
  • Postal Code
    917114817
  • Phone Number
    9096079313

Investigators

  • First Name
    Animesh
  • Last Name
    Ray
  • Email Address
    animesh_ray@kgi.edu
  • Start Date
    7/26/2005 12:00:00 AM
  • First Name
    Amarnath
  • Last Name
    Gupta
  • Email Address
    gupta@sdsc.edu
  • Start Date
    7/26/2005 12:00:00 AM
  • First Name
    Alpan
  • Last Name
    Raval
  • Email Address
    Alpan_Raval@kgi.edu
  • Start Date
    7/26/2005 12:00:00 AM

FOA Information

  • Name
    Computer Science
  • Code
    912