A portal and integrative collaborative analysis platform for GTEx

Information

  • Research Project
  • ApplicationId
    10405210
  • Core Project Number
    U41HG009494
  • Full Project Number
    3U41HG009494-05S1
  • Serial Number
    009494
  • FOA Number
    PA-20-272
  • Sub Project Id
  • Project Start Date
    8/15/2017
  • Project End Date
    5/31/2022
  • Program Officer Name
    GUPTA, JYOTI
  • Budget Start Date
    9/2/2021
  • Budget End Date
    5/31/2022
  • Fiscal Year
    2021
  • Support Year
    05
  • Suffix
    S1
  • Award Notice Date
    9/2/2021

Project Summary/Abstract: Our capability to apply high-throughput molecular profiling technologies to increasingly large cohorts and sample sets is significantly expanding our understanding of human biology and complex disease. The Genotype-Tissue Expression (GTEx) project is creating a unique resource of genetic variation and gene expression across a wide range of human tissues. Upon completion, this will include RNA sequence data from over 25,000 samples spanning 53 human tissues/organs, and whole genome and exome sequence data from 960 donors. Additional data types not yet generated will include miRNA-seq, protein levels, DNA methylation, ChIP-seq, and DNase I hypersensitive site data, among others. The ability of a wide range of users, with varying needs and skills, to easily access, interpret and integrate these large data sets is becoming critically important for leveraging the full utility of the data. The GTEx Portal (http://gtexportal.org/) is the most widely accessed resource for the GTEx project, hosting all unprotected data, analysis results and numerous visual exploration tools, and has been enthusiastically received by the scientific community. To maximize the impact of this resource, we plan to expand the portal to: host data currently in production and new data types still to be generated; present novel and integrative analyses of existing data and of data from external sources; and develop and share flexible tools for data analysis, visualization and access.

Aim 1. We will host and support all open-access GTEx data and analysis results, performing systematic re-analyses of the data with new methods to reflect the state of the art in RNA-seq analysis. We will add all new data sets to the portal, including novel assays (e.g. miRNA-seq, protein, methylation assays, etc.), derived analysis results (e.g. trans-eQTLs, splice-QTLs, GWAS enrichment analyses, protein-QTLs, etc.), and RNA-seq data sets from external investigators.

Aim 2. We will work closely with small focus groups of tool developers and engage our large user base to identify and prioritize new features for displaying and integrating multiple data types, and collaborate with other large genomic resources (e.g. ENCODE, UCSC and ENSEMBL browsers) to enable better integration of data sources and to enhance the utility and accessibility of the GTEx resource.

Aim 3. We will automate and share all analysis pipeline tools with the scientific community. To support a wide range of user access needs, we will develop an open-source API to provide comprehensive data access, and also improve visualization tools and user-driven data analyses on the portal. To maximize use of the resource, we will design and offer training videos and outreach workshops.

  • IC Name
    NATIONAL HUMAN GENOME RESEARCH INSTITUTE
  • Activity
    U41
  • Administering IC
    HG
  • Application Type
    3
  • Direct Cost Amount
    421005
  • Indirect Cost Amount
    85360
  • Total Cost
    506365
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    172
  • Ed Inst. Type
  • Funding ICs
    NHGRI:506365
  • Funding Mechanism
    RESEARCH CENTERS
  • Study Section
    ZHG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    BROAD INSTITUTE, INC.
  • Organization Department
  • Organization DUNS
    623544785
  • Organization City
    CAMBRIDGE
  • Organization State
    MA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    021421027
  • Organization District
    UNITED STATES