Algorithmic identification of binding specificity mechanisms in proteins

Information

  • Research Project
  • 10164894
  • ApplicationId
    10164894
  • Core Project Number
    R01GM123131
  • Full Project Number
    3R01GM123131-02S1
  • Serial Number
    123131
  • FOA Number
    PA-18-591
  • Sub Project Id
  • Project Start Date
    9/20/2019 - 6 years ago
  • Project End Date
    8/31/2023 - 2 years ago
  • Program Officer Name
    RAVICHANDRAN, VEERASAMY
  • Budget Start Date
    9/1/2020 - 5 years ago
  • Budget End Date
    8/31/2021 - 4 years ago
  • Fiscal Year
    2020
  • Support Year
    02
  • Suffix
    S1
  • Award Notice Date
    9/18/2020 - 5 years ago
Organizations

Algorithmic identification of binding specificity mechanisms in proteins

Project Summary Variations in protein binding preferences are a critical barrier to the precision treatment of disease. When high resolution structures of a protein are available, and many isoforms of the protein have been connected to dif- fering binding preferences, it is possible in principle to model the structures of all isoforms and discover the mechanisms that cause variations in binding preferences. Unfortunately, this discovery process depends on human expertise for examining molecular structure, and given that hundreds of isoforms may exist, a human would be overwhelmed to objectively examine many similar isoforms. To fill this gap, this project will (A1) de- velop software that identifies structural mechanisms that cause differential binding preferences, categorizes similar structural mechanisms, and explains the mechanisms in English. The second aim of this project (A2) is to validate the software at a large scale on families of proteins that exhibit a variety of well-examined binding preferences, and through blind predictions with experimental collaborators. Our approach involves creating software that mimics the visual reasoning techniques employed by structural biologists when examining molecular structures. Not only are these techniques responsible for most major dis- coveries in structural biology, but they are also straightforward to understand by non-computational research- ers. This property will enable our software to immediately integrate into existing workflows at labs that do not focus on computational methods. This property also contrasts from existing methods, which generally output structural models, potential energies, p-values and structural scores which are difficult for non-experts to un- derstand or incorporate into their research. Often, an expert in biophysics is required to interpret the outputs so that they can be operationalized in laboratory environments. In preliminary results, our methods have already identified molecular mechanisms that govern specificity in several families of proteins. Verification against peer-reviewed experimentation has proven the preliminary results correct in almost all cases. Our methods have also been applied to make a blind prediction of binding mechanisms in the ricin toxin, which binds to and damages the human ribosome. With experimental collabo- rators, we showed that our methods correctly identified and predicted the roles of several amino acids with a hitherto unknown role in recognizing the ribosome. Using our methodological approach and our rigorous valida- tion strategy, this project will produce a highly validated, usable software package that will bridge a critical gap in the development of precision therapies and diagnostics.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R01
  • Administering IC
    GM
  • Application Type
    3
  • Direct Cost Amount
    64959
  • Indirect Cost Amount
    35397
  • Total Cost
    100356
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
    BIOMED ENGR/COL ENGR/ENGR STA
  • Funding ICs
    NIGMS:100356\
  • Funding Mechanism
    Non-SBIR/STTR RPGs
  • Study Section
    BDMA
  • Study Section Name
    Biodata Management and Analysis Study Section
  • Organization Name
    LEHIGH UNIVERSITY
  • Organization Department
    BIOSTATISTICS & OTHER MATH SCI
  • Organization DUNS
    808264444
  • Organization City
    BETHLEHEM
  • Organization State
    PA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    18015
  • Organization District
    UNITED STATES