Establishing the GWAS Catalog as a resource for large-scale association studies

Information

  • Research Project
  • ApplicationId
    9955305
  • Core Project Number
    U41HG007823
  • Full Project Number
    5U41HG007823-07
  • Serial Number
    007823
  • FOA Number
    PAR-14-191
  • Sub Project Id
  • Project Start Date
    9/1/2014
  • Project End Date
    6/30/2022
  • Program Officer Name
    WILEY, KENNETH L
  • Budget Start Date
    7/1/2020
  • Budget End Date
    6/30/2021
  • Fiscal Year
    2020
  • Support Year
    07
  • Suffix
  • Award Notice Date
    7/16/2020

The GWAS Catalog's objective is to summarise GWAS data acquired from scientific publications and to give the results structure, in order to present research findings to a broad scientific community. The Catalog is used by a growing community of biologists and bioinformaticians worldwide. Over the next five years, the Catalog will continue to provide the most thoroughly curated resource for human variation data, by engaging journals in data recruitment and by allowing co-submission and data transfer from other resources such as dbGaP and the EGA. To underpin the Catalog's relevance, a multi-stranded approach will be adopted, combining data generation, infrastructure development and liaison with the Catalog's user community.

The first Aim for the next five years is to continue to deliver the Catalog as a community resource with high-quality content. The curation system will evolve from manual curation towards identification of data for automated extraction and review of submitted metadata, support for author deposition, and the development of supporting QC processes.

In Aim 2, the scope of the Catalog will be broadened to include new GWAS study designs, additional associated data, and emerging technologies. The Catalog's eligibility criteria will ensure alignment with current research and the needs of the user community, and will be monitored and re-evaluated as needed. Building on previous pilots, the focus of Aim 2 will be on the inclusion of targeted array data and other genotyping methods, such as sequencing or imputation from family members.

In Aim 3, the Catalog will be delivered as a scalable and sustainable resource for the future, which will allow for an extended scope of data. The development and promotion of standard formats for GWAS study design and results will be critical to ensure an efficient process for incorporating data into the Catalog. Authors will be encouraged to submit all SNP-trait associations, irrespective of p-value; this will vastly expand the depth of data available and the utility of the Catalog. The manual curation system will be redeveloped, with process automation to increase curator efficiency. Curation resources will be allocated to prioritise the studies with the highest utility, thereby expediting the publication of these data in the Catalog. Finally, the Catalog's resources, interfaces, and data access will be improved for all researchers by enhancing data representation, search functionality, data visualization and integration with data from other relevant resources. User needs will be identified through surveys and combined with feedback from other communication routes; existing data curation processes will then be modified to improve data representation, visualization, access and versatility.

The continuation of the Catalog, as the main resource for published data on diseases with complex genetic traits, is of crucial importance to the biomedical research community, because it offers a more efficient and effective way to understand, prevent, or cure diseases such as cardiovascular conditions, cancer and diabetes.

  • IC Name
    NATIONAL HUMAN GENOME RESEARCH INSTITUTE
  • Activity
    U41
  • Administering IC
    HG
  • Application Type
    5
  • Direct Cost Amount
    758000
  • Indirect Cost Amount
    60640
  • Total Cost
    818640
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    172
  • Ed Inst. Type
  • Funding ICs
    NHGRI:818640
  • Funding Mechanism
    RESEARCH CENTERS
  • Study Section
    ZHG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    EUROPEAN MOLECULAR BIOLOGY LABORATORY
  • Organization Department
  • Organization DUNS
    321691735
  • Organization City
    HEIDELBERG
  • Organization State
  • Organization Country
    GERMANY
  • Organization Zip Code
    69117
  • Organization District
    GERMANY