Rephase Update-a Database of Repetitive Sequences

Information

Research Project
7236216

ApplicationId
7236216
Core Project Number
P41LM006252
Full Project Number
5P41LM006252-10
Serial Number
6252
FOA Number
Sub Project Id

Project Start Date
9/30/1996 - 28 years ago
Project End Date
4/30/2009 - 16 years ago
Program Officer Name
FLORANCE, VALERIE
Budget Start Date
5/1/2007 - 18 years ago
Budget End Date
4/30/2008 - 17 years ago
Fiscal Year
2007
Support Year
10
Suffix
Award Notice Date
3/28/2007 - 18 years ago

Organizations

GENETIC INFORMATION RESEARCH INSTITUTE

Information

Rephase Update-a Database of Repetitive Sequences

0DESCRIPTION (provided by applicant): Repbase Update (http://www.girinst.org) is a database of repetitive elements currently representing over 3,200 families and subfamilies of transposable elements (TEs) from eukaryotic species. Each family is identifiable by a unique name, and its annotation includes biologically meaningful information such as species of origin, its systematic classification, keywords, reference to the scientific literature or names of contributors, brief commentaries, etc. Most of the 1,799 repetitive families added to Repbase Update (RU) during the last cycle are either unreported anywhere else, or have been thoroughly revised in terms of their consensus sequences and biological classification. Our approach is based on computer-assisted reconstruction and analysis of publicly available DNA sequence data. Additional contributions come via the peer-reviewed electronic journal Repbase Reports. RU has been routinely used by public institutions, genome sequencing consortia, and private companies throughout the world in routine gene discovery, polymorphism studies, sequence assembly and probe design. RU content has been partially described in original peer-reviewed articles, reviews and book chapters. This database became a unique resource for individual research projects of biological and medical importance and for creation of secondary databases. During the next five years RU is expected to double its size. This information will be extracted, reconstructed, analyzed, annotated, classified, indexed and made available to researchers. We will also add an undetermined number of unannotated repetitive families. We propose the following specific aims to meet the challenge: (1) Continue detection, reconstruction, annotation and electronic distribution of reference sequences for repetitive families from all sequenced eukaryotic species. (2) Continue systematic analysis and classification of repetitive families and prepare comprehensive dictionaries cross-referencing their nomenclature and biological classification. (3) Facilitate external submissions to RU and provide collaborative support to smaller databases compiled by different research groups. (4). Upgrade CENSOR program for annotation and analysis of repetitive DNA and continue generating maps of repetitive elements for selected genomes. (5) Pursue development of automated de novo identification and analysis of transposable elements. (6) Organize small workshops devoted to training and standardization of repeat nomenclature.

IC Name

NATIONAL LIBRARY OF MEDICINE

Activity
P41
Administering IC
LM
Application Type
5

Direct Cost Amount
Indirect Cost Amount
Total Cost
467234
Sub Project Total Cost

ARRA Funded
CFDA Code
879
Ed Inst. Type
Funding ICs
NLM:467234\
Funding Mechanism
Study Section
BLR
Study Section Name
Biomedical Library Review Committee

Organization Name
GENETIC INFORMATION RESEARCH INSTITUTE
Organization Department
Organization DUNS
Organization City
MOUNTAIN VIEW
Organization State
CA
Organization Country
UNITED STATES
Organization Zip Code
940430808
Organization District
UNITED STATES

Rephase Update-a Database of Repetitive Sequences

Information

ApplicationId

Core Project Number

Full Project Number

Serial Number

FOA Number

Sub Project Id

Project Start Date

Project End Date

Program Officer Name

Budget Start Date

Budget End Date

Fiscal Year

Support Year

Suffix

Award Notice Date

Organizations

Rephase Update-a Database of Repetitive Sequences

IC Name

Activity

Administering IC

Application Type

Direct Cost Amount

Indirect Cost Amount

Total Cost

Sub Project Total Cost

ARRA Funded

CFDA Code

Ed Inst. Type

Funding ICs

Funding Mechanism

Study Section

Study Section Name

Organization Name

Organization Department

Organization DUNS

Organization City

Organization State

Organization Country

Organization Zip Code

Organization District