Elucidation of the organizing principles of the regulatory genome through large-scale data integration

Information

  • Research Project
  • 10251060
  • ApplicationId
    10251060
  • Core Project Number
    R35HG011317
  • Full Project Number
    5R35HG011317-02
  • Serial Number
    011317
  • FOA Number
    RFA-HG-18-006
  • Sub Project Id
  • Project Start Date
    9/1/2020 - 3 years ago
  • Project End Date
    6/30/2025 - a year from now
  • Program Officer Name
    GILCHRIST, DANIEL A
  • Budget Start Date
    7/1/2021 - 2 years ago
  • Budget End Date
    6/30/2022 - a year ago
  • Fiscal Year
    2021
  • Support Year
    02
  • Suffix
  • Award Notice Date
    6/17/2021 - 3 years ago

Elucidation of the organizing principles of the regulatory genome through large-scale data integration

PROJECT SUMMARY The human genome contains the structural and operational instructions for living cells, yet exactly what these instructions are and how they are utilized and encoded in the primary genomic sequence is poorly understood. Arguably the only well-understood portions of the genome are protein-coding regions, which make up less than 2% of the genome. It has become increasingly clear that the non-coding genome encodes vast numbers of regulatory elements important for controlling gene expression levels in a cell type specific manner. Moreover, the overwhelming majority of disease- and trait-associated variants identified by genome-wide association studies (GWAS) lie in non-coding regions of the genome, and are strongly enriched in regulatory elements. Despite this clear relevance, we still lack a complete understanding of the global organizing principles of the regulatory genome, such as how regulatory elements are distributed across the genome, what their occurrence patterns are across cell types, and how they are encoded in the genomic sequence. We hypothesize that the main reason for our limited understanding is not lack of data, but that most data sets are generated and ultimately analyzed in isolation, limiting their full potential. To further our understanding of the organizing principles of the regulatory genome, it is therefore essential to take an ?en masse approach to data analysis, exploiting the dynamics across large numbers of observations. In this project, we will use this notion to develop methods for defining the first comprehensive and pragmatically useful human regulatory genome annotation based on the coordinated occurrence patterns of regulatory elements across hundreds of cell types and states. Beyond individual elements, we will define multi-kilobase domains of shared regulatory activity, which will shed light on the regulatory landscapes around genes and higher-order regulatory domains. In addition, we will integrate regulatory annotations with orthogonal information based on functional genomics chromatin state data to arrive at a rich composite view of the regulatory genome. Lastly, we will develop the first fully data-driven system for designing and validating context-specific synthetic regulatory elements. We anticipate that our results will provide a new lens on the human regulatory genome, which will open up new research avenues in the areas of systems and synthetic biology, ultimately contributing to the understanding and treatment of human disease. We are determined to provide the genomics community with pragmatically useful regulatory genome annotations and tools to utilize these resources.

IC Name
NATIONAL HUMAN GENOME RESEARCH INSTITUTE
  • Activity
    R35
  • Administering IC
    HG
  • Application Type
    5
  • Direct Cost Amount
    293529
  • Indirect Cost Amount
    76318
  • Total Cost
    369847
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    172
  • Ed Inst. Type
  • Funding ICs
    NHGRI:369847\
  • Funding Mechanism
    Non-SBIR/STTR RPGs
  • Study Section
    ZHG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    ALTIUS INSTITUTE FOR BIOMEDICAL SCIENCES
  • Organization Department
  • Organization DUNS
    079715609
  • Organization City
    SEATTLE
  • Organization State
    WA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    981211692
  • Organization District
    UNITED STATES