BD Spokes: PLANNING: WEST: Collaborative: Increasing collaborations in proteogenomics applications of genetic data

Information

NSF Award
1636903

Owner

Institute for Systems Biology

Award Id
1636903
Award Effective Date
9/1/2016 - 8 years ago
Award Expiration Date
8/31/2017 - 7 years ago
Award Amount
$ 71,000.00
Award Instrument
Standard Grant

Information

BD Spokes: PLANNING: WEST: Collaborative: Increasing collaborations in proteogenomics applications of genetic data

Many well-known diseases can be caused by genetic variants (such as point mutations) that affect important protein features such as enzyme active sites. The scientific community has catalogued millions of genetic variants (in genomic databases) and thousands of protein structures (in the Protein Data Bank). However, these two types of information are not linked, or easily linkable, in a manner that makes it easy to explore the relationships between variants and their structural locations. Integrative research, including genetic variation and protein sequence and 3D structure, has been rare or just focusing on a few proteins individually. In this planning project we will promote and facilitate interactions between experts from these communities with the shared goal of developing methods for integrating these data comprehensively. The project will lay groundwork for precision medicine efforts, and will have a significant impact on research on many species for which exploration of the genetic variation among strains or breeds is important. Furthermore, this project will directly impact education: we currently teach several courses per year in proteomics informatics and systems biology, and we will create and use redistributable teaching modules to help students learn to apply these concepts to their research.<br/><br/>Our proposed methodology will enable researchers to "think beyond linear" when interpreting genetic variation. There is currently a strong tendency for scientists in the genomic and mass-spectrometry proteomics communities to think about genome function in linear terms. However, the functional implications of variants (and post-translational modifications) are strongly influenced by their 3-dimensional location on a protein structure. Due to the lack of readily available tools, this leap from a linear position to a 3-dimensional location is rarely made. Our infrastructure will enable analysis at all scales, from mapping individual variants to a single protein, to mapping millions of variants to all available protein sequences and structures. This will in turn enable the discovery and interpretation of spatial patterns as a function of variant frequencies, affected amino acids, tendency to be post-translationally modified, and location within substructures.

Program Officer
Fen Zhao
Min Amd Letter Date
8/24/2016 - 8 years ago
Max Amd Letter Date
8/24/2016 - 8 years ago
ARRA Amount

Institutions

Name
Institute for Systems Biology
City
SEATTLE
State
WA
Country
United States
Address
401 Terry Avenue North
Postal Code
981095263
Phone Number
2067321200

Investigators

First Name
Eric
Last Name
Deutsch
Email Address
edeutsch@systemsbiology.org
Start Date
8/24/2016 12:00:00 AM

Program Element

Text
BD Spokes -Big Data Regional I

Program Reference

Text
BD Spokes Planning Grants

Text
CyberInfra Frmwrk 21st (CIF21)
Code
7433

Text
Big Data Science &Engineering
Code
8083

BD Spokes: PLANNING: WEST: Collaborative: Increasing collaborations in proteogenomics applications of genetic data

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

BD Spokes: PLANNING: WEST: Collaborative: Increasing collaborations in proteogenomics applications of genetic data

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

Program Element

Text

Program Reference

Text

Text

Code

Text

Code