A method to determine mRNA isoform frequencies using novel primer generators

Information

  • Research Project
  • 9622969
  • ApplicationId
    9622969
  • Core Project Number
    R43GM130245
  • Full Project Number
    1R43GM130245-01
  • Serial Number
    130245
  • FOA Number
    PA-17-302
  • Sub Project Id
  • Project Start Date
    9/1/2018 - 6 years ago
  • Project End Date
    2/28/2019 - 5 years ago
  • Program Officer Name
    RAVICHANDRAN, VEERASAMY
  • Budget Start Date
    9/1/2018 - 6 years ago
  • Budget End Date
    2/28/2019 - 5 years ago
  • Fiscal Year
    2018
  • Support Year
    01
  • Suffix
  • Award Notice Date
    8/3/2018 - 6 years ago
Organizations

A method to determine mRNA isoform frequencies using novel primer generators

Abstract Current methods for RNA-seq library preparation attempt to uniformly sample all sequences across every mRNA molecule, optimally with sufficient overlap to allow de novo reassembly of the mRNA sequences from which they derive, or alternatively, to allow inference of mRNA sequence by alignment with reference sequences. Genes that encode mRNAs in multiple isoforms present a challenge: given a complete set of short sequence reads that span every exon and splice junction, certain alternative underlying mRNA isoform models cannot be deconvoluted using data of this nature. This confounding situation occurs when more than one isoform model can explain the frequencies of exon and junction sequence reads, and it is mathematically unavoidable: ultimately, short sequence reads do not contain the information needed to unambiguously identify the correct isoform model for certain common splicing patterns. We propose to test a method to preserve the information to reconstruct isoform models. In this method, we generate a small barcoded collection of overlapping sequence reads for every individual mRNA molecule, such that sequence reads from the same mRNA molecule contain the same barcode, but other transcripts from the same gene are each associated with a different molecule-specific barcode. Assembly of contigs entails the alignment of the gene-derived sequences associated with the same barcode. This will be done by random primed synthesis of cDNA in an emulsion format using beads each of which carries random primers flanked by a bead-specific barcode. A novelty in this proposal is a method to generate a bead library in which each bead carries only one barcode, but in which the overall complexity of barcodes is very high. These beads are used to generate barcoded random primers in emulsion droplets that also contain cDNA, such that multiple randomly primed products all contain the same barcode. In principle, this method produces molecule-specific collections of barcoded cDNAs, which, upon high throughput sequencing, can be aligned to reveal the specific structural details of mRNA isoforms on a molecule-by-molecule basis. This approach would solve the isoform model identifiability problem. 1

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R43
  • Administering IC
    GM
  • Application Type
    1
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    225000
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
  • Funding ICs
    NIGMS:225000\
  • Funding Mechanism
    SBIR-STTR RPGs
  • Study Section
    ZRG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    VLP BIOTECH, INC.
  • Organization Department
  • Organization DUNS
    160242579
  • Organization City
    San Diego
  • Organization State
    CA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    921211126
  • Organization District
    UNITED STATES