Shortening the development cycle time of Trans Proteomic Pipeline tools with high performance computing

Information

  • Research Project
  • 10389405
  • ApplicationId
    10389405
  • Core Project Number
    R01GM087221
  • Full Project Number
    3R01GM087221-11S1
  • Serial Number
    087221
  • FOA Number
    PA-20-272
  • Sub Project Id
  • Project Start Date
    9/1/2010 - 14 years ago
  • Project End Date
    4/30/2022 - 2 years ago
  • Program Officer Name
    RAVICHANDRAN, VEERASAMY
  • Budget Start Date
    5/1/2021 - 3 years ago
  • Budget End Date
    4/30/2022 - 2 years ago
  • Fiscal Year
    2021
  • Support Year
    11
  • Suffix
    S1
  • Award Notice Date
    8/20/2021 - 3 years ago

Shortening the development cycle time of Trans Proteomic Pipeline tools with high performance computing

Project Summary Mass spectrometry (MS) based proteomics is currently the most widely used technology for the analysis of complex protein mixtures. It has the ability to detect and quantify the abundance of thousands of proteins and their variants, post-translational modifications, protein:protein and protein:drug interactions per experiment. The Trans-Proteomic Pipeline (TPP) is a free and open-source suite of software tools for the analysis of mass spectrometry (MS) proteomics data that has been widely used by the community for over 15 years. Our ongoing award for the maintenance and development of the TPP has four aims in which we 1) enhance the TPP to process datasets from emerging workflows, 2) add statistical analysis and visualization capabilities for large datasets, 3) respond to the needs of the community with functional and usability enhancements, and 4) expand our user base with improved training and on-line learning. Great progress has been made on these aims during the project period. However, current development for one of the main tools that we are currently focusing on has been severely hindered by aging computational servers purchased in 2012 and inadequate development of computing resources. We propose to address this problem via the purchase and deployment of a pair of high- memory, high-core count, fast-disk compute servers, substantially accelerating our development cycle so that the algorithm development can be completed more rapidly, and the completed algorithm can be further rapidly optimized to run with more modest hardware requirements routinely. Deployment of these machines will streamline our software development by providing rapid feedback on software changes and further developments and their impact on search speed and enable us to provide tools to our users more rapidly, accelerating the progress of many other research programs that use the TPP for data processing.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R01
  • Administering IC
    GM
  • Application Type
    3
  • Direct Cost Amount
    100000
  • Indirect Cost Amount
    0
  • Total Cost
    100000
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
  • Funding ICs
    NIGMS:100000\
  • Funding Mechanism
    Non-SBIR/STTR RPGs
  • Study Section
    BDMA
  • Study Section Name
    Biodata Management and Analysis Study Section
  • Organization Name
    INSTITUTE FOR SYSTEMS BIOLOGY
  • Organization Department
  • Organization DUNS
    135646524
  • Organization City
    SEATTLE
  • Organization State
    WA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    981095263
  • Organization District
    UNITED STATES