Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement

Information

  • Research Project
  • 10406630
  • ApplicationId
    10406630
  • Core Project Number
    R01GM132600
  • Full Project Number
    3R01GM132600-03S1
  • Serial Number
    132600
  • FOA Number
    PA-20-272
  • Sub Project Id
  • Project Start Date
    9/20/2019 - 5 years ago
  • Project End Date
    7/31/2023 - a year ago
  • Program Officer Name
    RAVICHANDRAN, VEERASAMY
  • Budget Start Date
    8/1/2021 - 3 years ago
  • Budget End Date
    7/31/2022 - 2 years ago
  • Fiscal Year
    2021
  • Support Year
    03
  • Suffix
    S1
  • Award Notice Date
    9/16/2021 - 3 years ago
Organizations

Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement

Summary The goal of this parent grant for this supplement request is to develop Machine Learning approaches to improve both accuracy and speed of highly-sensitive sequence database search and alignment. We have developed three software tools associated with this effort of correctly annotating genomes: (i) ULTRA, which labels repetitive sequence, (ii) PolyA which integrates such labels with other sequence annotations in a probabilistic framework, computing uncertainty and improving accuracy, and (iii) SODA, which aids in visualization of annotations and supporting evidence. Here, we describe a plan to refactor these software tools and their documentation to improve robustness and reliability, and to improve their availability through package management systems and incorporation into cloud-based analysis frameworks.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R01
  • Administering IC
    GM
  • Application Type
    3
  • Direct Cost Amount
    149935
  • Indirect Cost Amount
    71969
  • Total Cost
    221904
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
    SCHOOLS OF ARTS AND SCIENCES
  • Funding ICs
    NIGMS:221904\
  • Funding Mechanism
    Non-SBIR/STTR RPGs
  • Study Section
    GVE
  • Study Section Name
    Genetic Variation and Evolution Study Section
  • Organization Name
    UNIVERSITY OF MONTANA
  • Organization Department
    BIOSTATISTICS & OTHER MATH SCI
  • Organization DUNS
    010379790
  • Organization City
    MISSOULA
  • Organization State
    MT
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    598124104
  • Organization District
    UNITED STATES