The contamination of groundwater and sediment poses a significant risk to human health. One approach to clean up such sites involves the use of microorganisms to transform the pollutants. Some of the most beneficial microorganisms for this involve bacteria containing one or more of a particular group of enzymes (monooxygenases). However, the quantities of these enzymes in the environment are generally unknown and therefore it is difficult to predict biodegradation removal times. This research will develop methods to quantify the genes encoding for these enzymes using freely available sequencing data (DNA sequences) from groundwater, sediments and soils. Additionally, contaminant removal rates will be correlated with gene quantities in laboratory experiments. The expected findings will be particularly important to environmental engineers responsible for contaminated site clean-up.<br/><br/>Bioremediation often involves multiple lines of evidence for proof of effective contaminant biodegradation. One line of evidence concerns the quantification of genetic biomarkers in environmental samples. Many contaminated sites are aerobic and thus rely on degradative processes carried out by aerobic bacteria. Typically, quantitative PCR (qPCR) is used to determine the presence and abundance of these key functional genes in such samples. However, these assays are frequently based on pure cultures sequences and thus often do not take into account the gene sequences actually present in groundwater or subsurface sediments. To address this, this project research will examine freely available whole genome sequencing (WGS) data, as well as newly generated WGS data, from environmental samples to determine gene sequences actually found in the environment. Then, qPCR assays will be re-evaluated, and if needed, redesigned to be specific to environmental sequences. The use of WGS in environmental engineering to examine bioremediation potential at contaminated sites is still in its infancy, primarily due to sequencing costs and challenges relating to data analyses. This project will address both limitations by taking advantage of freely available WGS data and using the Department of Energy Systems Biology Knowledgebase (KBase), a free and easy to use platform for WGS analysis. The developed qPCR assays will be designed specifically to the gene sequences actually present in situ and will therefore enable a more accurate representation of biodegradation capabilities. The project will provide numerous training opportunities for both undergraduate and graduate students, with emphasis on outreach to undergraduate students from underrepresented communities.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.