High-throughput Epistasis Screening Using Genetical Genomics

Information

  • Research Project
  • 8002139
  • ApplicationId
    8002139
  • Core Project Number
    R43HG005936
  • Full Project Number
    1R43HG005936-01
  • Serial Number
    5936
  • FOA Number
    PAR-09-220
  • Sub Project Id
  • Project Start Date
    9/27/2010 - 14 years ago
  • Project End Date
    8/31/2012 - 12 years ago
  • Program Officer Name
    STRUEWING, JEFFERY P
  • Budget Start Date
    9/27/2010 - 14 years ago
  • Budget End Date
    8/31/2012 - 12 years ago
  • Fiscal Year
    2010
  • Support Year
    1
  • Suffix
  • Award Notice Date
    9/26/2010 - 14 years ago
Organizations

High-throughput Epistasis Screening Using Genetical Genomics

DESCRIPTION (provided by applicant): High-throughput Epistasis Screening using Genetical Genomics A fast software tool is proposed for identifying potential sets of interacting genes involved in human disease pathways. A meta-analysis of marker and expression-trait studies is performed using penalized regression software running in parallel on commodity graphics cards. The research team includes experts from genomics, statistics and software acceleration. Data will come from published studies. Initial results suggest promise for our approach. Epistasis is a key area of investigation in the elucidation of human- disease pathways. eQTL experiments have shown promise in identifying epistasis for given expression traits. We will leverage the success of eQTLs by employing the results of GWAS experiments to suggest specific expression traits to study. In this way we will exploit the findings of multiple, disparate studies in an overall meta-analysis of a disease trait. Various forms of regression analysis are currently used to screen eQTL data for epistasis, especially stepwise linear regression. We will employ penalized regression techniques, because of their speed advantage, their ability to identify multiple candidates simultaneously and their relative novelty. We will apply several distinct types of penalized regression, each with its own predictor-selection characteristics. We have strong in-house expertise in penalized regression. As more and larger genomic data sets become available, effective means for combining and mining them become essential. The sheer mass of the data, moreover, will require high-performance software in order to provide analysis in reasonable time. Parallel computation is one promising area for improving software performance. We will employ the new generation of inexpensive, widely-available graphics coprocessors to run our software in parallel. Successful application will demonstrate that relevant, large- data bioinformatics solutions can be implemented on modestly-priced desktop hardware. PUBLIC HEALTH RELEVANCE: Personalized medicine is based on the observation that susceptibility to disease has a strong genetic component. This genetic component consists of groups of highly interacting genes. We will develop high- speed software able to process the huge amounts of data needed to identify these interactions and the role they play in disease susceptibility.

IC Name
NATIONAL HUMAN GENOME RESEARCH INSTITUTE
  • Activity
    R43
  • Administering IC
    HG
  • Application Type
    1
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    181857
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    172
  • Ed Inst. Type
  • Funding ICs
    NHGRI:181857\
  • Funding Mechanism
    SBIR-STTR
  • Study Section
    ZRG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    INSILICOS
  • Organization Department
  • Organization DUNS
    126643241
  • Organization City
    SEATTLE
  • Organization State
    WA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    981094955
  • Organization District
    UNITED STATES