A Web-Enabled Database for Rapid Metagenomic Biocatalyst Discovery and Validation

Information

  • Research Project
  • 8832535
  • ApplicationId
    8832535
  • Core Project Number
    R43GM113357
  • Full Project Number
    1R43GM113357-01
  • Serial Number
    113357
  • FOA Number
    PA-14-071
  • Sub Project Id
  • Project Start Date
    12/1/2014 - 10 years ago
  • Project End Date
    5/31/2015 - 9 years ago
  • Program Officer Name
    FABIAN, MILES
  • Budget Start Date
    12/1/2014 - 10 years ago
  • Budget End Date
    5/31/2015 - 9 years ago
  • Fiscal Year
    2015
  • Support Year
    01
  • Suffix
  • Award Notice Date
    11/20/2014 - 10 years ago
Organizations

A Web-Enabled Database for Rapid Metagenomic Biocatalyst Discovery and Validation

DESCRIPTION (provided by applicant): Enzyme biocatalysis plays an important role in pharmaceutical synthesis as it can afford precise control over stereochemistry while achieving many conversions under physiological conditions. These benefits can both lower manufacturing costs and reduce environmental impact by eliminating solvent and heavy-metal catalyst steps in a synthetic strategy. Some recent examples of this include the enzyme-catalyzed synthesis of the antidiabetic medication sitagliptin, and the biomanufacturing of the antimalarial medication, artemisinin. Efficient enzyme discovery, engineering, and validation methods are required as the foundation of these efforts. As organisms occupy diverse environments, enzymes involved in identical conversions often possess dramatically different catalytic capacity and can harbor auxiliary domains which are critical for performance. As a result, it is has been routinely shown that broadening search efforts by drawing from diverse source organisms is the most efficient strategy for exploring catalytic fitness landscapes. Due to the cost and time associated with accessing, through DNA synthesis, large numbers of enzyme variants, researchers typically focus on testing a subselection of variants which have been previously characterized or those that are easy to obtain. This approach is fundamentally limited as: 1) the overwhelming majority of available biocatalysts have not been studied in detail due to cultivation bottlenecks and 2) there are no robust methods of predicting enzyme activity from primary sequence. To address this limitation, we propose to create a platform in which all computationally-identified enzyme variants in a database search can be immediately isolated, engineered, and delivered to end users through a publicly available, bioinformatics and LIMS-driven automation platform. Our strategy is ~1,000x less expensive while being up to 300x faster than current DNA synthesis theoretical limits. The platform is built upon a massive metagenomic library, the largest reported in the world, which overcomes the primary limitation of studying enzymes from cultivated sources. This collection contains several orders of magnitude greater enzyme diversity than can be found in culture collections. To populate the database, we will apply our patented high-efficiency sequencing method, to our metagenomic library, generating an unprecedented data set containing an N50 of >50kb. End-users will be able to search and identify enzyme variants in this dataset and both native sequences and combinatorial libraries of variants can be retrieved, produced, and automatically delivered to end users in less than a week and for far less cost than DNA synthesis. Overall, this system comprises an end-to-end, rapid biocatalyst discovery, engineering, and delivery system that will be a powerful resource for end users in basic research and industrial biotechnology.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R43
  • Administering IC
    GM
  • Application Type
    1
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    216675
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
  • Funding ICs
    NIGMS:216675\
  • Funding Mechanism
    SBIR-STTR RPGs
  • Study Section
    ZRG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    RADIANT GENOMICS, INC.
  • Organization Department
  • Organization DUNS
    078535589
  • Organization City
    BERKELEY
  • Organization State
    CA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    947102451
  • Organization District
    UNITED STATES