Rephase Update-a Database of Repetitive Sequences

Information

  • Research Project
  • 7236216
  • ApplicationId
    7236216
  • Core Project Number
    P41LM006252
  • Full Project Number
    5P41LM006252-10
  • Serial Number
    6252
  • FOA Number
  • Sub Project Id
  • Project Start Date
    9/30/1996 - 27 years ago
  • Project End Date
    4/30/2009 - 15 years ago
  • Program Officer Name
    FLORANCE, VALERIE
  • Budget Start Date
    5/1/2007 - 17 years ago
  • Budget End Date
    4/30/2008 - 16 years ago
  • Fiscal Year
    2007
  • Support Year
    10
  • Suffix
  • Award Notice Date
    3/28/2007 - 17 years ago

Rephase Update-a Database of Repetitive Sequences

0DESCRIPTION (provided by applicant): Repbase Update (http://www.girinst.org) is a database of repetitive elements currently representing over 3,200 families and subfamilies of transposable elements (TEs) from eukaryotic species. Each family is identifiable by a unique name, and its annotation includes biologically meaningful information such as species of origin, its systematic classification, keywords, reference to the scientific literature or names of contributors, brief commentaries, etc. Most of the 1,799 repetitive families added to Repbase Update (RU) during the last cycle are either unreported anywhere else, or have been thoroughly revised in terms of their consensus sequences and biological classification. Our approach is based on computer-assisted reconstruction and analysis of publicly available DNA sequence data. Additional contributions come via the peer-reviewed electronic journal Repbase Reports. RU has been routinely used by public institutions, genome sequencing consortia, and private companies throughout the world in routine gene discovery, polymorphism studies, sequence assembly and probe design. RU content has been partially described in original peer-reviewed articles, reviews and book chapters. This database became a unique resource for individual research projects of biological and medical importance and for creation of secondary databases. During the next five years RU is expected to double its size. This information will be extracted, reconstructed, analyzed, annotated, classified, indexed and made available to researchers. We will also add an undetermined number of unannotated repetitive families. We propose the following specific aims to meet the challenge: (1) Continue detection, reconstruction, annotation and electronic distribution of reference sequences for repetitive families from all sequenced eukaryotic species. (2) Continue systematic analysis and classification of repetitive families and prepare comprehensive dictionaries cross-referencing their nomenclature and biological classification. (3) Facilitate external submissions to RU and provide collaborative support to smaller databases compiled by different research groups. (4). Upgrade CENSOR program for annotation and analysis of repetitive DNA and continue generating maps of repetitive elements for selected genomes. (5) Pursue development of automated de novo identification and analysis of transposable elements. (6) Organize small workshops devoted to training and standardization of repeat nomenclature.

IC Name
NATIONAL LIBRARY OF MEDICINE
  • Activity
    P41
  • Administering IC
    LM
  • Application Type
    5
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    467234
  • Sub Project Total Cost
  • ARRA Funded
  • CFDA Code
    879
  • Ed Inst. Type
  • Funding ICs
    NLM:467234\
  • Funding Mechanism
  • Study Section
    BLR
  • Study Section Name
    Biomedical Library Review Committee
  • Organization Name
    GENETIC INFORMATION RESEARCH INSTITUTE
  • Organization Department
  • Organization DUNS
  • Organization City
    MOUNTAIN VIEW
  • Organization State
    CA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    940430808
  • Organization District
    UNITED STATES