Statistical Methods for MicroRNA-Seq Experiments

Information

  • Research Project
  • 10261580
  • ApplicationId
    10261580
  • Core Project Number
    R01GM139928
  • Full Project Number
    5R01GM139928-02
  • Serial Number
    139928
  • FOA Number
    PA-19-056
  • Sub Project Id
  • Project Start Date
    9/11/2020 - 3 years ago
  • Project End Date
    6/30/2025 - a year from now
  • Program Officer Name
    BRAZHNIK, PAUL
  • Budget Start Date
    7/1/2021 - 2 years ago
  • Budget End Date
    6/30/2022 - a year ago
  • Fiscal Year
    2021
  • Support Year
    02
  • Suffix
  • Award Notice Date
    9/13/2021 - 2 years ago

Statistical Methods for MicroRNA-Seq Experiments

Project Summary/Abstract MicroRNAs (miRNAs) are a class of small (18-24 nucleotide) RNAs that are essential regulators of gene expression, which act within the RNA-induced silencing complex (RISC) to bind mRNAs and suppress translation. Alterations in miRNA expression have been shown to disrupt entire cellular pathways, substantially contributing to a variety of human diseases. Despite nearly 25 years of research, miRNAs remain dicult to measure due to their short length, relatively small number, sequence similarity, and diculty to isolate from other small RNA fragments. While qPCR- and microarray-based miRNA assays are still widely used, the majority of recent studies use small RNA-seq (sRNA-seq) because it allows for the quanti cation of isomiRs (miRNA isoforms) and the possibility of identifying novel miRNAs. The processing of reads generated from sRNA-seq data globally distinguish between miRNA reads and those from other small RNAs, but do not necessarily capture the full spectrum of miRNA variation. Subsequent statistical analyses of processed sRNA-seq data are still performed using methods developed for mRNA-seq data despite the fact that sRNA-seq data violate several of the assumptions of these methods. Speci cally, methods for mRNA-seq data assume approximate independence between feature counts; however, the small total number of miRNAs and presence of a small number of very highly expressed miRNAs result in a lack of independence between miRNA counts. Additionally, normalization methods for mRNA-seq data assume either the overall level of transcription is constant across samples or an equal number of features are over- and under-expressed when comparing any two samples, neither of which hold for sRNA-seq data. The development of statistical methods that address the challenges of sRNA-seq data represents a critical need for miRNA research. Our long-term goal is to advance miRNA research by developing statistical methods that are tailored to the speci c complexities of miRNA expression data. The overall objective of this application is to improve the analysis of sRNA-seq data by developing statistical methods that account for challenges speci c to sRNA-seq data and outperform methods designed for mRNA-seq data. This addresses an urgent need for statistical methods to appropriately analyze sRNA-seq data, which are now routinely generated by large consortia such as TCGA and FANTOM. The rationale that underlies the proposed research is that methods that explicitly address the challenges inherent in measuring miRNAs are necessary to fully elucidate the role miRNAs play in many human disease processes.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R01
  • Administering IC
    GM
  • Application Type
    5
  • Direct Cost Amount
    297813
  • Indirect Cost Amount
    94500
  • Total Cost
    392313
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
    SCHOOL OF MEDICINE & DENTISTRY
  • Funding ICs
    NIGMS:392313\
  • Funding Mechanism
    Non-SBIR/STTR RPGs
  • Study Section
    BMRD
  • Study Section Name
    Biostatistical Methods and Research Design Study Section
  • Organization Name
    UNIVERSITY OF ROCHESTER
  • Organization Department
    BIOSTATISTICS & OTHER MATH SCI
  • Organization DUNS
    041294109
  • Organization City
    ROCHESTER
  • Organization State
    NY
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    146270140
  • Organization District
    UNITED STATES