Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement

Information

Research Project
10406630

ApplicationId
10406630
Core Project Number
R01GM132600
Full Project Number
3R01GM132600-03S1
Serial Number
132600
FOA Number
PA-20-272
Sub Project Id

Project Start Date
9/20/2019 - 6 years ago
Project End Date
7/31/2023 - 2 years ago
Program Officer Name
RAVICHANDRAN, VEERASAMY
Budget Start Date
8/1/2021 - 4 years ago
Budget End Date
7/31/2022 - 3 years ago
Fiscal Year
2021
Support Year
03
Suffix
S1
Award Notice Date
9/16/2021 - 4 years ago

Organizations

UNIVERSITY OF MONTANA

Information

Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement

Summary The goal of this parent grant for this supplement request is to develop Machine Learning approaches to improve both accuracy and speed of highly-sensitive sequence database search and alignment. We have developed three software tools associated with this effort of correctly annotating genomes: (i) ULTRA, which labels repetitive sequence, (ii) PolyA which integrates such labels with other sequence annotations in a probabilistic framework, computing uncertainty and improving accuracy, and (iii) SODA, which aids in visualization of annotations and supporting evidence. Here, we describe a plan to refactor these software tools and their documentation to improve robustness and reliability, and to improve their availability through package management systems and incorporation into cloud-based analysis frameworks.

IC Name

NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES

Activity
R01
Administering IC
GM
Application Type
3

Direct Cost Amount
149935
Indirect Cost Amount
71969
Total Cost
221904
Sub Project Total Cost

ARRA Funded
False
CFDA Code
859
Ed Inst. Type
SCHOOLS OF ARTS AND SCIENCES
Funding ICs
NIGMS:221904\
Funding Mechanism
Non-SBIR/STTR RPGs
Study Section
GVE
Study Section Name
Genetic Variation and Evolution Study Section

Organization Name
UNIVERSITY OF MONTANA
Organization Department
BIOSTATISTICS & OTHER MATH SCI
Organization DUNS
010379790
Organization City
MISSOULA
Organization State
MT
Organization Country
UNITED STATES
Organization Zip Code
598124104
Organization District
UNITED STATES

Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement

Information

ApplicationId

Core Project Number

Full Project Number

Serial Number

FOA Number

Sub Project Id

Project Start Date

Project End Date

Program Officer Name

Budget Start Date

Budget End Date

Fiscal Year

Support Year

Suffix

Award Notice Date

Organizations

Machine learning approaches for improved accuracy and speed in sequence annotation: supplement for software enhancement

IC Name

Activity

Administering IC

Application Type

Direct Cost Amount

Indirect Cost Amount

Total Cost

Sub Project Total Cost

ARRA Funded

CFDA Code

Ed Inst. Type

Funding ICs

Funding Mechanism

Study Section

Study Section Name

Organization Name

Organization Department

Organization DUNS

Organization City

Organization State

Organization Country

Organization Zip Code

Organization District