A Web-Enabled Database for Rapid Metagenomic Biocatalyst Discovery and Validation

Information

  • Research Project
  • ApplicationId
    9251856
  • Core Project Number
    R44GM113357
  • Full Project Number
    5R44GM113357-03
  • Serial Number
    113357
  • FOA Number
    PA-14-071
  • Sub Project Id
  • Project Start Date
    12/1/2014
  • Project End Date
    3/31/2018
  • Program Officer Name
    FABIAN, MILES
  • Budget Start Date
    4/1/2017
  • Budget End Date
    3/31/2018
  • Fiscal Year
    2017
  • Support Year
    03
  • Suffix
  • Award Notice Date
    3/11/2017

DESCRIPTION (provided by applicant): Radiant Genomics proposes to develop an integrated enzyme discovery service, the Enzyme Variant Engine (EVE), built upon the largest cloned metagenomic sequence collection reported to date. The goal is to combine a publicly accessible search engine, a richly annotated sequence database, an arrayed sample library, and a LIMS automation platform to deliver novel enzyme variants to end users at lower cost, in less time, and from a greater pool of biodiversity than alternative options such as DNA synthesis. Importantly, this service overcomes a major bottleneck in enzyme discovery, which has traditionally focused on easily cultivated organisms that are now known to represent less than 1% of biodiversity.
Phase I research and development milestones were met or exceeded. In particular, we successfully demonstrated a high-efficiency sequencing workflow that will allow us to sequence and assemble our clone library, which is predicted to encode ~600M genes, >99% of which are derived from uncultivated and essentially unstudied organisms. We next demonstrated a combinatorial barcoding strategy that yields assemblies with an average length of >30 kilobases, a dramatic improvement in metagenomic contiguity. This feature enables the discovery of clusters of functionally related genes, such as those responsible for complex natural product biosynthesis and nutrient fixation. These services were successfully integrated into an online search engine and e-commerce platform available at www.eve.bio. Finally, we developed and demonstrated infrastructure for an automated LIMS gene recovery system that can recover thousands of genes of interest from our arrayed library per week. The success of Phase I research was complemented by general improvements in sequencing cost-efficiency and cloud computing.
The EVE service has gained commercial traction, and we believe further development will benefit basic research while positively impacting a broad range of biomanufacturing processes. Based on customer feedback, the aims of this proposal are 1) continued sequencing of the library using contiguity-preserving strategies, 2) scaling of computational infrastructure, 3) development of advanced enzyme selectors, and 4) third-party database integration. The overall outcome of this program will be a centralized search engine that allows end users to rapidly select and receive genes identified in bioinformatic analyses. These genes will be accessible at lower cost, in less time, and from a greater pool of genetic diversity than existing services offer. Overall, we believe that our platform will improve our understanding of sequence-to-function relationships and annotation for metagenomic environments, helping to bridge the gap between in silico and biochemical characterization of unexplored pools of genetic diversity.

IC Name
NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
  • Activity
    R44
  • Administering IC
    GM
  • Application Type
    5
  • Direct Cost Amount
  • Indirect Cost Amount
  • Total Cost
    542276
  • Sub Project Total Cost
  • ARRA Funded
    False
  • CFDA Code
    859
  • Ed Inst. Type
  • Funding ICs
    NIGMS:542276
  • Funding Mechanism
    SBIR-STTR RPGs
  • Study Section
    ZRG1
  • Study Section Name
    Special Emphasis Panel
  • Organization Name
    RADIANT GENOMICS, INC.
  • Organization Department
  • Organization DUNS
    078535589
  • Organization City
    EMERYVILLE
  • Organization State
    CA
  • Organization Country
    UNITED STATES
  • Organization Zip Code
    94608
  • Organization District
    UNITED STATES