Identification of Structural Correlate Patterns for Protein Functional Domains

Information

  • NSF Award
  • 8715633
Owner
  • Award Id
    8715633
  • Award Effective Date
    9/1/1988 - 36 years ago
  • Award Expiration Date
    9/1/1991 - 33 years ago
  • Award Amount
    $ 496,497.00
  • Award Instrument
    Continuing grant

Identification of Structural Correlate Patterns for Protein Functional Domains

This project's aim is to develop a method to identiy patterns which represent the structural correlates of protein functional domains. These will be constructed from protein primary sequences annotated with structural inferences derived from the primary sequences. The basis for the approach is that common functions generally correlate with common protein structures, domains and/or regions of invariant or equivalent amino acids. This is true even for functionally related proteins with very different primary sequences. The proposed method involves comparative analysis of sets of functionally related proteins for a pattern consisting of elements of the common structure, invariant amino acids and other properties which can be predicted statistically from the primary sequence. The approach utilizes the input from the disciplines of molecular genetic, biochemistry, and computer science. The project: will begin by extending newly developed methods, which have been proven successful upon initial application; and will culminate with the generation of a pattern-indexed library of protein functional domain pattern descriptors. This will be coupled with the software development required for their identification in new sequences. The generation of this library will thus aid in the identification of the function(s) and domain substructure of newly sequenced DNA coding regions, which will be inmportant given the ease and anticipated rate of genome sequencing in the near future. Advances in biotechnology have led to the determination of the primary structure, or ordering of amino acids, of many types of proteins. A computer based pattern recognition program would analyze known primary structures to find common patterns, and this information could then be applied to the study of newly determined sequences and to the design of new proteins with specific functions. This capability would increase fundamental knowledge in biology and lead to improved methods in biotechnology.

  • Program Officer
    Gerald Selzer
  • Min Amd Letter Date
    8/29/1988 - 36 years ago
  • Max Amd Letter Date
    8/28/1990 - 34 years ago
  • ARRA Amount

Institutions

  • Name
    Dana-Farber Cancer Institute
  • City
    Boston
  • State
    MA
  • Country
    United States
  • Address
    Office of Grants and Contracts
  • Postal Code
    022155450
  • Phone Number
    6176323940

Investigators

  • First Name
    Temple
  • Last Name
    Smith
  • Email Address
    tsmith@darwin.bu.edu
  • Start Date
    9/1/1988 12:00:00 AM

FOA Information

  • Name
    Health
  • Code
    203000
  • Name
    Life Science Biological
  • Code
    61