COMPUTER STUDY OF SEQUENCES OF AMINO ACIDS IN PROTEINS

Information

Research Project
3180493

ApplicationId
3180493
Core Project Number
R01CA040474
Full Project Number
5R01CA040474-27
Serial Number
40474
FOA Number
Sub Project Id

Project Start Date
3/1/1986
Project End Date
2/28/1991
Program Officer Name
Budget Start Date
3/1/1989
Budget End Date
2/28/1990
Fiscal Year
1989
Support Year
27
Suffix
Award Notice Date
2/21/1989

Organizations

NATIONAL BIOMEDICAL RESEARCH FOUNDATION, INC.

Information

We are examining theoretical aspects of the structure, function, and evolution of proteins, with emphasis on protein sequences and on those problems for which a computer is essential. Using sequence data, we detect distant relationships and infer evolutionary trees of proteins and phylogenetic trees of the species in which they occur. We organize all known sequences into the Superfamily List, a hierarchical tabulation with five levels of distinction based on sequence similarity. We plan to develop an improved computer model of the evolutionary process by incorporating additional data on point mutations, parameters for deletion-insertion events, and parameters to allow variable mutability at different positions in the chain. Groups of simulated sequences of known evolutionary distances will be constructed and used to test and improve the performance of our programs for detecting relationships and constructing trees. This grant also partially supports the Atlas of Protein Sequence and Structure Reference Data Center, which contains a complete, currently correct, continuing collection of protein sequence data and files of background information including evolutionary history, distant relationships, alignments, genetic relationships, and three-dimensional structures. The protein sequence data are made available to the scientific community in several forms: published volumes of the Atlas of Protein Sequence and Structure and of the Protein Segment Dictionary, and computer-readable tapes of the sequence data. These are periodically updated. Data searches and other computer services using the up-to-date sequence data collection are performed at cost for other research workers upon request. In the 1980-81 grant year we obtained an administrative supplement to partially support the preparation of the information and the development of an efficient computer retrieval system for our Nucleic Acid Sequence Database.

IC Name

NATIONAL CANCER INSTITUTE

Activity
R01
Administering IC
CA
Application Type
5

Direct Cost Amount
Indirect Cost Amount
Total Cost
Sub Project Total Cost

ARRA Funded
CFDA Code
396
Ed Inst. Type
Funding ICs
Funding Mechanism
Study Section
SSS
Study Section Name

Organization Name
NATIONAL BIOMEDICAL RESEARCH FOUNDATION
Organization Department
Organization DUNS
Organization City
WASHINGTON
Organization State
DC
Organization Country
UNITED STATES
Organization Zip Code
20007
Organization District
UNITED STATES
