Microbiome science has contributed greatly to the understanding of microbial life and provided insights on the essential roles of microbial communities on the planet, from global elements cycling to human health. However, there is still lack of knowledge on how these communities are assembled, maintained, and function as a system. Most importantly, microbe-microbe interactions and how microbes and communities react to perturbations are poorly understood. As a consequence, microbiome science today is mostly descriptive and correlation-based, rather than predictive and based on mechanistic understanding. In order to achieve predictive microbiome science, there is a need to comprehensively elucidate the metabolic role of each microbe, and its interactions with others. Such knowledge would allow to change a microbe's trajectory within a community, for example by selectively promoting or limiting its growth.
Ribo-Seq, a promising technology to study translational regulation, was first applied to yeast in 2009 and then extended to animal stem cells and recently to a handful of model bacteria. The Zengler lab was among the first labs to streamline the protocol and to extend the technology to strictly anaerobic bacteria, like the ones present in the human gut. Recently, Fremin et al. publish a paper that describes also metaRibo-Seq measures translation in microbiomes (Fremin et al., Nature Communications 11, 3268 (2020). However, Fremin's measures using a protocol that does not integrate RNA-Seq with Ribo-Seq to identify TE, determine guilds and microbial niches.
Microbes tightly control their genotype-to-phenotype relationship by controlling resource allocation through the complex regulation of their transcription and translation. Determining how a microbe allocates its cellular resources to achieve the optimal balance between mRNA levels and protein numbers reveals its niche. Since translation is the most expensive process in a cell, bacteria use translational regulation to prioritize functions essential for their niche adaptation. Translation efficiency (TE), calculated by analyzing the number of ribosomes on a given transcript, can be used as a direct readout of functional prioritization in pure cultures. TE can be measured using ribosome sequencing (Ribo-Seq), as the ratio between translated mRNA over total mRNA (RNA-Seq). Ribo-seq is based on blocking the ribosomes on the mRNA during the elongation step of mRNA translation to protein, and then sequencing all (and only) the mRNA fragments that are covered (thus actively translated) by the ribosomes. In comparison with proteomics, which involves measuring all the proteins that are present in a sample independently of when these proteins were produced, Ribo-Seq (translatomics) provides a snapshot of active protein translation at a given instant, thus giving a time-resolved view of bacterial resource allocation.
The present disclosure provides a method for a design and intervention of microbial consortia. In certain embodiment, such method comprises:
In certain embodiments, the translational efficiency is determined using genome annotation, transcription and translation information of the identified member. In certain embodiments, the modulation of microbial consortia comprises one or more of the following steps:
In certain embodiments, the modulation improves performance of the microbial consortia to perform the particular task. In certain embodiments, the preferred conditions are based on the niche information and competition is determined by association to guilds.
The present disclosure also provides a method for predicting the effect of perturbations on one or more members of a microbial consortia. In certain embodiments, such method comprises:
In certain embodiments, the perturbations comprises changes of an organism's growth (e.g., its absolute or relative abundance, its size, or its growth rate and yield), metabolic activity (e.g., respiration or chemical transformation, antibiotic production, quorum sensing), chemical composition, physical properties (e.g., cell surface properties, surface charge, surface structure, speed of movement, frequency and kind of motion, such as tumbling, gliding, oscillating, production of extracellular matrices), or behavior changes (e.g., association to other organism through physical contact or chemical exchanges).
The present disclosure further provides a method for designing a microbial consortia for a particular task, wherein the members of a microbial consortia are selected from the method disclosed herein. In certain embodiments, a method for selecting one or more members of a microbial consortia for a particular task comprises:
In certain embodiments, the method for selecting one or more members of a microbial consortia for a particular task, wherein the microbial consortia are accompanied by conditions defined based on niche information.
As used herein, a microbial consortia type refers to one or more bacteria (e.g., mycoplasma, coccus, bacillus, rickettsia, spirillum), fungi (e.g., filamentous fungi, yeast), nematodes, protozoans, archaea, algae, dinoflagellates, viruses (e.g., bacteriophages), viroids and/or a combination thereof. In one embodiment, the one or more microorganism strains is one or more bacteria (e.g., mycoplasma, coccus, bacillus, rickettsia, spirillum), fungi (e.g., filamentous fungi, yeast), nematodes, protozoans, archaea, algae, dinoflagellates, viruses (e.g., bacteriophages), viroids and/or a combination thereof. In another embodiment, the one or more microorganism strains is one or more fungal species or fungal subspecies. In another embodiment, the one or more microorganism strains is one or more bacterial species or bacterial subspecies.
As used herein, the microbial consortia originates from animal, human, plant, soil (e.g., bulk soil or rhizosphere), air, saltwater, freshwater, wastewater sludge, built environment, sediment, oil, an agricultural product, an industrial product or process (e.g. fermentation process), a microbial sample, or an extreme environment. In certain embodiments, the animal or human sample is a blood, tissue, tooth, perspiration, fingernail, skin, hair, feces, urine, semen, mucus, saliva, gastrointestinal tract, rumen, muscle, brain, tissue, or organ sample. In certain embodiments, the plant sample is a root, stem, leaf, flower, fruit, seed, xylem, phloem, or juice sample.
As used herein, a member of a microbial consortia comprises one taxonomic unit (e.g., strain, species, genus, family, order, class, phylum, kingdom) present in a microbial consortium as defined above. In certain embodiments, a member of a microbial consortia comprises one single living unit (e.g., one single bacterium, fungus, protozoan, archaea, algae, dinoflagellate, virus, viroid).
As used herein, a composition comprises of any number of different microorganisms (e.g., strain, species, genus, family, order, class, phylum, kingdom) present in a microbial consortium or consortia as defined above. In certain embodiments, a composition comprises one single living unit (e.g., one single bacterium, fungus, protozoan, archaea, algae, dinoflagellate, virus, viroid). In certain embodiments, composition comprises different actives, such as transcriptional, translational, enzymatic, as defined above.
As used herein, the genome annotation comprises nucleic acid sequencing, measuring the number of unique genomic DNA markers and/or a metagenomic approach. In certain embodiments, measuring the number of first unique markers in the sample comprises measuring the number of unique RNA markers. In certain embodiments, measuring the number of unique first markers in the sample comprises measuring the number of unique protein markers. In certain embodiments, measuring the number of unique first markers in the sample comprises measuring the number of unique metabolite markers. In certain embodiments, measuring the number of unique metabolite markers in the sample comprises measuring the number of unique carbohydrate markers, unique lipid markers or a combination thereof.
As used herein, transcriptomics is measuring the level of expression of one or many RNA molecules (e.g., miRNA, tRNA, rRNA, and/or mRNA) in the sample for expression analysis. In certain embodiments, the gene expression analysis comprises a sequencing reaction. In certain embodiments, the RNA expression analysis comprises a quantitative polymerase chain reaction (qPCR), metatranscriptome sequencing, and/or transcriptome sequencing.
As used herein, translation is determining RNA binding to ribosomes by metaribosome profiling, or ribosome profiling. In certain embodiments, measuring translation comprises determining the number of unique proteins, measured by mass spectrometry analysis.
As used herein, the translation efficiency (TE) comprises transcription (e.g., miRNA, tRNA, rRNA, and/or mRNA), and/or translation (e.g., metaribosome profiling, ribosome profiling, or proteomics), and/or a ratio of translation to transcription. In certain embodiments, TE comprises ribosome binding site (RBS) affinity with ribosome (e.g., RBS sequence optimization level).
As used herein, a niche comprises the condition or set of conditions that permit a member or a set of members to exist or to thrive within a microbial consortia or biotope (Hutchinson-Macfayden, 1957), as defined by having high TE level in genome annotated processes (e.g., unique DNA, RNA, protein, or metabolite marker) associated with the said condition or set of condition. In certain embodiments, a niche comprises one substrate or set of substrates (e.g., metabolites, carbohydrates, lipids, proteins, nutrients) for which a member or a set of members have a high TE on processes relative to import, metabolism or processing of the said substrate. In certain embodiments, a niche comprises biochemical, chemical, biophysical, physical or geological conditions for which a member or a set of members have a high TE on genome annotated processes relative to import, metabolism, processing, resistance to the said condition. In certain embodiments, a niche comprises a biological condition (e.g., presence, absence of abundance of an organism or set of organisms) modifying the nutrient, biochemical, biophysical or biogeological conditions associated with high TE on genome annotated processes relative to import, metabolism, processing or resistance to the modified condition.
As used herein, a guild comprises a functional or ecological unit of member or members within microbial consortia characterized based on similarities and distances calculated using a network and/or clustering analysis method to measure distances between each member's TE profiles. In certain embodiments, the clustering analysis comprises network analysis, cluster analysis, linkage analysis, partitioning analysis, variance analysis, or a combination thereof. In another embodiment, distances between members are calculated from the clustering analysis, normalized, Z-scored or a combination thereof. In certain embodiments, the clustering analysis integrates TE measured on annotated genomes (e.g., unique DNA, RNA, protein, or metabolite marker). In certain embodiments, guilds are molded by adaptation to the same class of resources. In certain embodiments, guilds are molded by competition between its members.
As used herein, a competition comprises negative interactions between members using similar resources, where resources are e.g., organic compounds or inorganic compounds and salt, unnatural amino acids, gaseous chemicals, using similar physical properties, e.g., light, temperature, pH, osmolarity, physical space, or using similar biological entities, e.g., living organisms or dead organisms, for maintaining or advancing their life, lifetime, reproduction, growth, and sustainability. In certain embodiments, competition involves the production of antimicrobials, e.g., chemical, biochemical or biophysical compounds to reduce fitness of other members.
As used herein, a predicting effect of perturbation comprises changes of an organism's growth (e.g., its absolute or relative abundance, its size, or its growth rate and yield), metabolic activity (e.g., respiration or chemical transformation, antibiotic production, quorum sensing), chemical composition, physical properties (e.g., cell surface properties, surface charge, surface structure, speed of movement, frequency and kind of motion, such as tumbling, gliding, oscillating, production of extracellular matrices), or behavior changes (e.g., association to other organism through physical contact or chemical exchanges).
As used herein, interventions comprise addition of entire organisms (dead or live) or parts of organisms (e.g., DNA, RNA, proteins or peptides, lipids, cell membranes), physical removal of entire organisms e.g., removal through binding to antibodies, filtration by size exclusion or binding to a matrix, biological removal of entire organisms, e.g., through antibiotics, antimicrobial peptides, predators (e.g., amoeba), or viral infection. In certain embodiments, the interventions comprise changes (i.e., addition, removal or variations in concentrations) of chemical properties (e.g., organic compounds or inorganic compounds and salt, unnatural amino acids, gaseous chemicals). In certain embodiments, the interventions comprise changes (i.e., addition, removal or variations in concentrations) of physical properties, e.g., light, temperature, pH, osmolarity, physical space and environment.
Other systems, methods, features, and advantages of the present disclosure will be or become apparent to one with skill in the art upon examination of the following drawings and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the present disclosure, and be protected by the accompanying claims. In addition, all optional and preferred features and modifications of the described embodiments are usable in all aspects of the disclosure taught herein. Furthermore, the individual features of the dependent claims, as well as all optional and preferred features and modifications of the described embodiments are combinable and interchangeable with one another.
Many aspects of the present disclosure can be better understood with reference to the drawings and supplemental drawings, tables, data and description attached as Appendix I, which is incorporated by reference by its entity. The components in the drawings and supplemental drawings are not necessarily to scale, emphasis instead being placed upon clearly illustrating the principles of the present disclosure. Moreover, in the drawings, like reference numerals designate corresponding parts throughout the several views.
Additional advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or can be learned by practice of the invention. The advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
The present disclosure provides a new method integrating transcriptional and translational regulation measurements, revealing how each microbe allocates its resources for optimal proteome efficiency. Protein translation is the most expensive process in a cell, so microbes closely regulate their resource allocation by prioritizing essential functions through differential translational efficiency (TE) (AI-Bassam et al. 2018). Thus, direct measurement of TE in a microbial community sample would shine light on the metabolic role of each member of the community and allow to better understand interactions with other members.
In certain embodiments, the present disclosure provides a method for a design and intervention of microbial consortia comprising the following steps:
In certain embodiments, the translational efficiency is determined using genome annotation, transcription and translation information of the identified member. In certain embodiments, the modulation of microbial consortia comprises one or more of the following steps:
In certain embodiments, the modulation improves performance of the microbial consortia to perform the particular task. In certain embodiments, the preferred conditions are based on the niche information and competition is determined by association to guilds.
A method for predicting the effect of perturbations on one or more members of a microbial consortia and a method for designing microbial consortia for a particular task are also provided herein. In certain embodiments, the perturbations comprises changes of an organism's growth (e.g., its absolute or relative abundance, its size, or its growth rate and yield), metabolic activity (e.g., respiration or chemical transformation, antibiotic production, quorum sensing), chemical composition, physical properties (e.g., cell surface properties, surface charge, surface structure, speed of movement, frequency and kind of motion, such as tumbling, gliding, oscillating, production of extracellular matrices), or behavior changes (e.g., association to other organism through physical contact or chemical exchanges). In certain embodiments, the microbial consortia are accompanied by conditions defined based on niche information.
Many modifications and other embodiments disclosed herein will come to mind to one skilled in the art to which the disclosed compositions and methods pertain having the benefit of the teachings presented in the foregoing descriptions and the associated drawings. Therefore, it is to be understood that the disclosures are not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. The skilled artisan will recognize many variants and adaptations of the aspects described herein. These variants and adaptations are intended to be included in the teachings of this disclosure and to be encompassed by the claims herein.
Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.
As will be apparent to those of skill in the art upon reading this disclosure, each of the individual embodiments described and illustrated herein has discrete components and features which may be readily separated from or combined with the features of any of the other several embodiments without departing from the scope or spirit of the present disclosure.
Any recited method can be carried out in the order of events recited or in any other order that is logically possible. That is, unless otherwise expressly stated, it is in no way intended that any method or aspect set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not specifically state in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including matters of logic with respect to arrangement of steps or operational flow, plain meaning derived from grammatical organization or punctuation, or the number or type of aspects described in the specification.
All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited. The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided herein can be different from the actual publication dates, which can require independent confirmation.
While aspects of the present disclosure can be described and claimed in a particular statutory class, such as the system statutory class, this is for convenience only and one of skill in the art will understand that each aspect of the present disclosure can be described and claimed in any statutory class.
It is also to be understood that the terminology used herein is for the purpose of describing particular aspects only and is not intended to be limiting. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the disclosed compositions and methods belong. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the specification and relevant art and should not be interpreted in an idealized or overly formal sense unless expressly defined herein.
Prior to describing the various aspects of the present disclosure, the following definitions are provided and should be used unless otherwise indicated. Additional terms may be defined elsewhere in the present disclosure.
As used herein, “comprising” is to be interpreted as specifying the presence of the stated features, integers, steps, or components as referred to, but does not preclude the presence or addition of one or more features, integers, steps, or components, or groups thereof. Moreover, each of the terms “by”, “comprising,” “comprises”, “comprised of,” “including,” “includes,” “included,” “involving,” “involves,” “involved,” and “such as” are used in their open, non-limiting sense and may be used interchangeably. Further, the term “comprising” is intended to include examples and aspects encompassed by the terms “consisting essentially of” and “consisting of.” Similarly, the term “consisting essentially of” is intended to include examples encompassed by the term “consisting of.
As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a catalyst,” “a metal,” or “a substrate,” includes, but are not limited to, mixtures or combinations of two or more such catalysts, metals, or substrates, and the like.
It should be noted that ratios, concentrations, amounts, and other numerical data can be expressed herein in a range format. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms a further aspect. For example, if the value “about 10” is disclosed, then “10” is also disclosed.
When a range is expressed, a further aspect includes from the one particular value and/or to the other particular value. For example, where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the disclosure, e.g. the phrase “x to y” includes the range from ‘x’ to ‘y’ as well as the range greater than ‘x’ and less than ‘y’. The range can also be expressed as an upper limit, e.g. ‘about x, y, z, or less’ and should be interpreted to include the specific ranges of ‘about x’, ‘about y’, and ‘about z’ as well as the ranges of ‘less than x’, less than y’, and ‘less than z’. Likewise, the phrase ‘about x, y, z, or greater’ should be interpreted to include the specific ranges of ‘about x’, ‘about y’, and ‘about z’ as well as the ranges of ‘greater than x’, greater than y’, and ‘greater than z’. In addition, the phrase “about ‘x’ to ‘y’, where ‘x’ and ‘y’ are numerical values, includes “about ‘x’ to about ‘y’.
It is to be understood that such a range format is used for convenience and brevity, and thus, should be interpreted in a flexible manner to include not only the numerical values explicitly recited as the limits of the range, but also to include all the individual numerical values or sub-ranges encompassed within that range as if each numerical value and sub-range is explicitly recited. To illustrate, a numerical range of “about 0.1% to 5%” should be interpreted to include not only the explicitly recited values of about 0.1% to about 5%, but also include individual values (e.g., about 1%, about 2%, about 3%, and about 4%) and the sub-ranges (e.g., about 0.5% to about 1.1%; about 5% to about 2.4%; about 0.5% to about 3.2%, and about 0.5% to about 4.4%, and other possible sub-ranges) within the indicated range.
As used herein, the terms “about,” “approximate,” “at or about,” and “substantially” mean that the amount or value in question can be the exact value or a value that provides equivalent results or effects as recited in the claims or taught herein. That is, it is understood that amounts, sizes, formulations, parameters, and other quantities and characteristics are not and need not be exact but may be approximate and/or larger or smaller, as desired, reflecting tolerances, conversion factors, rounding off, measurement error and the like, and other factors known to those of skill in the art such that equivalent results or effects are obtained. In some circumstances, the value that provides equivalent results or effects cannot be reasonably determined. In such cases, it is generally understood, as used herein, that “about” and “at or about” mean the nominal value indicated ±10% variation unless otherwise indicated or inferred. In general, an amount, size, formulation, parameter or other quantity or characteristic is “about,” “approximate,” or “at or about” whether or not expressly stated to be such. It is understood that where “about,” “approximate,” or “at or about” is used before a quantitative value, the parameter also includes the specific quantitative value itself, unless specifically stated otherwise.
As used herein, the terms “optional” or “optionally” means that the subsequently described event or circumstance can or cannot occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.
Unless otherwise expressly stated, it is in no way intended that any method set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not actually recite an order to be followed by its steps or it is not otherwise specifically stated in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including: matters of logic with respect to arrangement of steps or operational flow; plain meaning derived from grammatical organization or punctuation; and the number or type of embodiments described in the specification.
Unless otherwise specified, temperatures referred to herein are based on atmospheric pressure (i.e., one atmosphere).
In certain embodiments of the present disclosure, metatranscriptomics and metatranslatomics analysis were performed to directly measure TE in situ, in a 16-member synthetic community (SynCom) compiled from rhizosphere isolates grown in a complex culture medium. It allowed to perform a guild-based microbiome classification, grouping microbes according to the metabolic pathways they prioritize, independently of their taxonomic relationships. It was shown that guilds predicted competition between members of the same guild with 100% sensitivity and 74% specificity (77% accuracy) in the SynCom. Further, gene-level analysis of TE allowed to predict each microbe's substrate preferences, i.e., their niche in the community. Such Microbial Niche Determination (MiND) successfully predicted which particular microbes would benefit from substrates supplementation with 54% sensitivity and 83% specificity (78% accuracy) in the SynCom. As microbes adapt their translational regulation to community settings, such predictions could not be achieved using axenic culture approaches (i.e., phenotypic microarray, growth curves), or partially functional measurements (i.e., metagenomics, metatranscriptomics).
Further, combining TE-based MiND and guilds predictions allowed to selectively manipulate the SynCom, by increasing or decreasing abundance of targeted members either by providing preferred substrates or by giving an advantage to their competitors. Most importantly, the method disclosed herein is scalable to more complex, natural samples. In certain embodiments, MiND was applied to native soil and human fecal samples and its applicability to predict changes and manipulate microorganisms in complex microbiomes were demonstrated.
Now having described the aspects of the present disclosure, in general, the following Examples describe some additional aspects of the present disclosure. While aspects of the present disclosure are described in connection with the following examples and the corresponding text and figures, there is no intent to limit aspects of the present disclosure to this description. On the contrary, the intent is to cover all alternatives, modifications, and equivalents included within the spirit and scope of the present disclosure.
The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated and are intended to be purely exemplary of the disclosure and are not intended to limit the scope of what the inventors regard as their disclosure. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in ° C. or is at ambient temperature, and pressure is at or near atmospheric.
A 16-member microbial SynCom was established from the rhizosphere of switchgrass (Panicum virgatum) from agricultural crops, consisting of one strain each of Arthrobacter, Bosea, Bradyrhizobium, Brevibacillus, Burkholderia, Chitinophaga, Lysobacter, Marmoricola, Methylobacterium, Mucilaginibacter, Mycobacterium, Niastella, Paenibacillus, Rhizobium, Rhodococcus and Variovorax (Coker et al. 2022). These isolates were obtained from the rhizosphere and soil surrounding a single switchgrass plant grown in marginal soils described elsewhere (Ceja-Navarro et al. 2021). Isolates and details on their isolation are available from the Leibniz Institute German Collection of Microorganisms and Cell Cultures GmbH (DSMZ) under accession numbers DSM 113524 (Arthrobacter OAP107), DSM 113628 (Bosea OAE506), DSM 113701 (Bradyrhizobium OAE829), DSM 113525 (Brevibacillus OAP136), DSM 113627 (Burkholderia OAS925), DSM 113563 (Chitinophaga OAE865), DSM 113522 (Lysobacter OAE881), DSM 114042 (Marmoricola OAE513), DSM 113562 (Mucilaginibacter OAE612), DSM 113602 (Methylobacterium OAE515), DSM 113539 (Mycobacterium OAE908), DSM 113593 (Niastella OAS944), DSM 113526 (Paenibacillus OAE614), DSM 113517 (Rhizobium OAE497), DSM 113518 (Rhodococcus OAS809), DSM 113622 (Variovorax OAS795).
Precultures of individual isolates were made in 5 mL of liquid 1×R2A medium (Teknova, cat #R0005) in aerobic conditions at 30° C. for 7 days, without shaking. Note, one isolate (Bradyrhizobium OAE829) was grown in 0.1×R2A due to poor growth in 1×R2A, as described previously (Coker et al. 2022).
Optical density readings at 600 nm (OD600), from isolates pre-cultures were taken with a Molecular Devices SpectraMax M3 Multi-Mode Microplate Reader (VWR, cat #89429-536). Precultures were diluted to a starting OD600 of 0.02 in 5 mL 0.1×R2A. The SynCom was assembled in large volumes to minimize pipetting error and maximize reproducibility. Briefly, 1 mL of each normalized culture (OD600=0.02) was diluted in a final volume of 250 mL 0.1×R2A and spread into 20 mL aliquots in Falcon tubes. Falcon tubes containing the SynCom inoculum were then incubated at 30° C. for 7 days, in aerobic conditions, in four biological replicates. After 7 days of culture, the SynCom samples were harvested by centrifugation and pellets were immediately treated for multi-omics analysis as detailed below (DNA-, RNA- and metaRibo-Seq).
For targeted modifications experiments carried out in the SynCom, the SynCom was assembled and cultured as described above, with modifications. Specific isolates were omitted from the SynCom assembly for the dropout experiments, as described in
A soil associated with switchgrass lowland reference genome clone (Missaoui, Boerma, and Bouton 2005) that was sampled from a field located in Texas, USA (28.3325, −98.1175) was used. Fifty gram (50 g) of soil was added to 250 mL of 0.1×R2A culture medium or 0.1×R2A+SynCom inoculum prepared as described above, and this volume was spread in 14 mL culture tubes (5 mL in each tube). 50-500 μL of concentrated, filter-sterilized stocks of the desired substrates and/or 20 μL of the SynCom isolate pure cultures diluted at an OD600 of 0.02 (i.e. approximately 8.105 CFU/mL) were then added. Soil samples were then grown at 30° C. for 7 days, in aerobic conditions, in two biological replicates (three replicates for reference soil sample). After 7 days of culture, the soil samples were harvested by centrifugation and pellets were stored at −80° C. prior to metagenomics analysis.
MetaRibo-Seq sample preparation was performed as detailed in the protocol provided below. This protocol shares similarities with the recently published MetaRibo-Seq protocol from Fremin et al. 2021, with modifications (Fremin, Sberro, and Bhatt 2020). Briefly, bacterial lysis was performed in a solution containing Chloramphenicol to stop protein elongation. Monosome recovery was performed following MNase treatment, using RNeasy Mini spin size-exclusion columns (Qiagen) and RNA Clean & Concentrator-5 kit (Zymo). rRNA removal was performed using the QIAseq FastSelect-5S/16S/23S kit (Qiagen). MetaRibo-Seq libraries were prepared using the NEBNext Small RNA Library Prep set for Illumina, with modifications. Amplification was followed in real time using SYBR-Green and stopped when reaching a plateau. PCR products were purified using Select-a-size DNA Clear & Concentrator kit (Zymo). Leftover lysate prior to MNase treatment was saved and stored at −80° C. for metagenomics and metatranscriptomics analysis.
DNA and RNA from SynCom samples were extracted from leftover lysates from metaRibo-Seq sample preparation, stored in Trizol at −80 C. DNA from soil samples was extracted using ZymoBIOMICS DNA miniprep kit (Zymo). DNA-Seq libraries were prepared using Nextera XT library preparation kit with 700 pg DNA input per sample and 6:30 min tagmentation at 55° C. and barcoded using Nextera XT indexes (Illumina). RNA was extracted using RNeasy mini kit (Qiagen), and rRNA was removed using QIAseq FastSelect-5S/16S/23S kit (Qiagen). RNA-Seq libraries were prepared using KAPA RNA HyperPrep kit (Roche) and barcoded using TruSeq indexes (Illumina). Amplification was followed in real time using SYBR-Green and stopped when reaching a plateau.
The quality and average size of the libraries was controlled using a 4200 TapeStation System (Agilent). Libraries concentrations were quantified using Qubit dsDNA HS Assay kit and QuBit 2.0 Fluorometer (Invitrogen). Libraries were pooled by -omic and experiment and sequenced on a Illumina NovaSeq, PE100 platform. Minimum sequencing depth was 10 million reads for metagenomics samples, 50 million reads for metatranscriptomics samples, and 100 million reads for metatranslatomics samples.
Genomics data from individual cultures of the 16 SynCom members were used to assemble genomes using FLASh2 merging and SPAdes, and quality controlled using CheckM. Genomes were annotated at the gene level using PROKKA version 1.14.5 (Seemann 2014), and KEGG pathway annotation was performed using BlastKOALA version 2.2 (Kanehisa, Sato, and Morishima 2016). A custom SynCom metagenome database was built from the 16 isolates' genomes using bowtie2 version 2.3.2 (Langmead and Salzberg 2012).
Adapter sequences were removed from multi-omics sequencing data using TrimGalore (Cutadapt version 1.18), and quality controlled using FastQC version 0.11.9 (Andrews et al. 2010). Trimmed reads were aligned to a custom SynCom database using bowtie2 version 2.3.2. Gene count tables were obtained using Woltka version 0.1.1 (Zhu et al. 2022). Multi-omics gene counts were normalized to reads per kilobase per million (RPKM).
Feature count tables were imported and analyzed using R version 3.6.3. HCPC analysis was performed using the FactomineR package (Lê, Josse, and Husson 2008). Graphical representations were performed using the ggplot2, gplots, cowplot and factoextra packages (Wickham 2016; Irnawati et al. 2020).
TE for each of the 275 KEGG pathways present amongst the 16 SynCom members was calculated as the ratio between Ribo-Seq and RNA-Seq reads per kilobase per million (RPKM). Hierarchical clustering on the principal components (HCPC) (Le, Josse, and Husson 2008) of TE data allowed to group community members according to the metabolic pathways they prioritize, i.e. their guild.
TE was computed as:
TE
_(i,j)=Ribo-Seq
_(i,j)/
RNA-Seq
_(i,j);
Where Ribo-Seqi,j and RNA-Seqi,j are Ribo-Seq and RNA-Seq RPKM for each feature i in each bacteria j. TEi,j was calculated at two different levels: i) considering features as KEGG pathway and ii) considering features as genes. KEGG-pathway analysis was performed on all Ribo-Seqi,j and RNA-Seqi,j>10 RPKM.
For each pair of bacteriai,j, we defined a competition scorei,j, referring to the likelihood of bacteria j increasing upon removal of bacteria i, as a function of the distance between i and j in the guild clustering as follows:
Score
_(i,j)=−(
dist
_(i,j)−“μ”_i)/“σ”_i;
Where disti,j is the euclidean distance between bacteria i and j in the guild clustering, μi and σi are the average and the standard deviation of the population distances to bacteria i. Scorei,j was then adjusted to zero for bacteria having a relative abundance <0.5% after 7 days of growth in the SynCom.
Sensitivity and specificity of the prediction of community outcomes upon modifications of the community composition were calculated by considering a Scorei,j>0 as “likely” and Scorei,j<0 as “unlikely” for bacteria j to increase upon removal of bacteria i from the SynCom.
All 16 SynCom member isolates were grown axenically on 285 different substrates using Biolog Phenotypic Microarray (PM) plates (PM 1, 2A, 3B) following company instructions (Biolog). Briefly, isolates were streaked on 1×R2A agar plates (1.5% w/v) (Bradyrhizobium was streaked on 0.1×R2A plates), colonies picked and resuspended in inoculation fluid IF0a GN/GP (Biolog Part #72268) up to an OD600 of 0.07 and inoculated into the microarrays in triplicate. Microarrays were stored at 30 C incubator without shaking, with lids coated with an aqueous solution of 20% ethanol and 0.01% Triton X-100 (Sigma, cat #X100-100 ML) to prevent condensation (Coker et al. 2022). Absorbance readings were taken at 0 hr and 144 hr time points. Growth in PM2A wells were indicated by blank subtracted OD600 increases greater than 0.02, which was the minimum absorbance reached by all isolates in 0.1×R2A at the start of their exponential growth phase. PM1 and PM3B assays were performed in the same way but supplemented with 1×RedoxDyeMix G (Biolog Part #74227) (for gram negative isolates) or RedoxDyeMix H (Part #74228) (for gram positive isolates) with growth in wells indicated by increases greater than 0.02 in blank subtracted OD590 readings. Axenic growth capabilities on these substrates were used to train individual GEM models for all 16 isolates.
Genome-scale metabolic models (GEMs) of 16 SynCom members were simulated on in silico media that includes the uptake fluxes of metabolites using Flux Balance Analysis (FBA). The predicted growth rates of SynCom members were recorded for comparison purposes. To predict the effect of media supplemented with different substrates, we incorporated the uptake flux of each substrate at a time and simulated the model for each change in the media. The models were analyzed using COBRApy software package (version 0.17.1) with IBM CPLEX solver (version 22.1.0) (IBM) in Python (version 3.7.11).
Protein translation is the most expensive process in a cell, and bacteria use translational regulation to accurately allocate finite resources and prioritize functions essential for their adaptation (AI-Bassam et al. 2018; Le Scornet and Redder 2019; Fris and Murphy 2016). Ribosome profiling (Ribo-Seq), i.e. translatomics, allows the direct measurement of protein translation in vivo in real time (Latif et al. 2015). It was shown that translational efficiency (TE), calculated by analyzing the number of ribosomes on a given transcript as the ratio between translated mRNA over total mRNA (Ribo-Seq/RNA-Seq), can be used as a direct readout of functional prioritization in axenic bacterial cultures (AI-Bassam et al. 2018).
Here, metagenomics, metatranscriptomics and metatranslatomics (also called metaribosome profiling or metaRibo-Seq) (Fremin, Sberro, and Bhatt 2020) were applied to simultaneously measure TE in multiple organisms in a 16-member microbial community from rhizosphere isolates grown in complex medium (see methods,
Metabolic pathway prioritization was different when bacteria were grown axenically or in the SynCom, showing the importance of performing functional analysis in community settings directly, rather than in isolated cultures. On another hand, TE-based metabolic guilds were substantially different from phylogenetic clustering, showing that they define functional categories that are independent of taxonomic relationships (
As early as 1967, Root defined ecological guilds—observed outside of the microbial world at this time—as functional categories molded by adaptation to the same class of resources, but also by competition between its members. To proof such within-guild competition also applied to microbes, individual members were dropped out from the SynCom and the effect on relative abundance of the remaining 15 members was evaluated.
To further validate the observed competition interactions, an in-depth analysis was conducted on two of the strong competition pairs observed in the dropout experiments (i.e., Chitinophaga-Mucilaginibacter and Burkholderia-Rhizobium,
Overall, members of the same guild specifically compete between each other. Removal of a competitor allows another member of this guild to fill that niche, while addition of a competitor would have the opposite effect. It was also observed production of microbial compounds by one SynCom member, specifically targeted against its closest competitor. This confirms that TE-based metabolic guilds accurately predict competition interactions in a microbial community. Of note, a similar analysis based on metatranscriptomics data alone did not efficiently predict competition (37.5% sensitivity and 76% specificity only).
Next, TE was used to identify substrate preferences, i.e., metabolites that would specifically increase the abundance of targeted members of the SynCom, akin to a prebiotic. High TE for genes coding for import proteins would indicate prioritized metabolism for the corresponding substrates, thus defining a microbe's niche.
A total of 88 genes coding for import proteins were detected in the SynCom at the metagenomics level, of which 40 (45%) were expressed and translated based on metatranscriptomics and metaRibo-Seq data. MiND was performed by calculating TE for each of these 40 import protein genes in each SynCom member, thus defining their substrate preferences, i.e., their niche (
Phenotypic microarray (Biolog) confirmed the ability of each SynCom member to utilize MiND-predicted preferred substrates in isolation, with 88% of high TE measured in the SynCom being confirmed in isolation by Biolog. Ability to utilize a substrate in axenic culture, however, did not necessarily translate into a high priority for this substrate's intake in the SynCom, thus only 33% of substrate use abilities detected by Biolog in axenic conditions translated into a high TE in the SynCom. Similar results were observed by comparing axenic growth curves of each SynCom member in 0.1×R2A+selected substrate with the MiND predictions. This shows that while bacteria have the ability to utilize a range of substrates in axenic cultures, they only prioritize a fraction of them once put in more complex community settings.
Culture medium was supplemented with metabolites identified in the MiND analysis, in order to specifically benefit microbes that prioritize the import of these substrates (primary targets). Furthermore, metabolite-induced increase of targeted bacteria would result in a concomitant decrease of this bacteria's nearest guild competitors (secondary targets).
A total of 14 different compounds were tested in three different concentrations, including six sugars (fructose, galactose, maltose/maltodextrin, ribose, trehalose, xylose), three amino acids (cystine, glutamate, methionine), two diamines (putrescine, spermidine), one vitamin (cobalamin), one peptide (glutathione), and one inorganic compound (sulfate/thiosulfate). Addition of cobalamin, cystine or methionine did not induce any significant change in relative abundance in the community, likely because these substrates were already present in excess in the non-modified culture medium and were thus discarded from the analysis.
Bacteria with high TE for a given metabolite's import protein were specifically increased in relative abundance after metabolite addition (
In conclusion, combining MiND and guild classification accurately predicts substrate preferences and competition in a 16-members SynCom, and thus can be used to design and predict the outcome of targeted interventions in a microbial community.
MiND and guild analyses predict which SynCom members are likely to increase in relative abundance upon substrate addition, and which competitions influence the outcome of prebiotic intervention. However, if two competitors both prioritize the import of a specific substrate, the outcome of this competition could not be predicted, as observed for the competition pair Burkholderia-Rhizobium, both prioritizing ribose import (
After benchmarking MiND for the 16-member SynCom, it was evaluated if principles observed from the SynCom study can be extrapolated to the soil environment, one of the most complex microbiomes. MiND predicted interactions and prebiotic intervention outcomes were tested in non-sterile natural soil samples. In analogy to results from the SynCom study (
Soil samples from a switchgrass crop similar to the site from which the SynCom strains were originally isolated, were incubated in 33 different conditions: 0.1×R2A alone (soil control), with and without metabolite additions (prebiotic), and with and without individual SynCom members (probiotic, 8.105 CFU/mL), or the whole 16-member SynCom (probiotic consortium, 8.105 CFU/mL of each member) (see methods and
The SynCom members resulted in on average 1% of all metagenomic reads, showing that these SynCom members were detectable in the soil samples, but did not dominate the rhizosphere microbiome. This proportion was higher in the soil+probiotic consortium samples, while decreasing overtime (SynCom members making on average 6.8, 4.6, 2.6 and 1.6% of total reads after 2, 4, 7 and 14 days of incubation), showing the transitional nature of probiotic intervention outcome. Data showed good reproducibility between replicates.
As an example, the relative abundance of Burkholderia in the natural soil samples was sought to increase, through either i) single probiotic intervention (Burkholderia); ii) combined pre- and single probiotic (fructose+Burkholderia); iii) combined prebiotic and probiotic consortium (fructose+SynCom); or iv) prebiotic-only treatment (fructose). The results show that i) probiotic treatment alone induced a non-significant increase of Burkholderia (fold change=1.3,
Similarly, it was aimed to increase Chitinophaga and predict its closest competitor, Mucilaginibacter, to decrease. Probiotic treatment alone successfully increased Chitinophaga (fold change=12.3), while inducing a non-significant decrease of Mucilaginibacter (
In 5 out of 7 conditions tested, and despite the very low amount of probiotic added (OD600=0.00008, i.e., about 8×105 CFU/mL), probiotic intervention successfully increased the primary target, without dominating the rest of the community. In 10 out of 10 tested conditions, prebiotic+single probiotic treatment resulted in a successful increase of the primary target, together with a predicted decrease of secondary targets in 8 out of these 10 cases. Prebiotic+probiotic consortium (SynCom) treatment also increased the primary target in 6/7 tested conditions, with a successful decrease of secondary targets in all these 6 cases. Thus, prebiotic supplementations allowed to modulate the effect of the probiotic consortium, by increasing either Burkholderia through the addition of SynCom+fructose, or Chitinophaga through the addition of SynCom+glutathione, for example. Finally, prebiotic treatment alone successfully increased the predicted primary targets in 3 out of 7 conditions tested and decreased the secondary targets in 3 out of these 3 cases.
Combined prebiotic and probiotic, or prebiotic treatments alone often outperformed results from probiotic treatment alone. For example, Paenibacillus probiotic treatment alone did not significantly increase its abundance (fold change=1.5). However, combined prebiotic and probiotic treatment with Paenibacillus's+preferred substrates substantially increased Paenibacillus and caused a 28.8-fold change for Paenibacillus+ribose and 14.7 fold-change for Paenibacillus+trehalose. Prebiotic treatment with ribose and trehalose alone resulted in a smaller but significant increase of Paenibacillus (fold change=3.5 and 5.5, respectively).
These results also indicate that the strong competition interactions identified in the SynCom only experiments (i.e., Burkholderia—Rhizobium, Chitinophaga—Mucilaginibacter) are maintained in soil conditions. As such, it was repeatedly observed that a successful increase of primary target Burkholderia results in a decrease of secondary targets Rhizobium and Variovorax, across multiple experiments both in SynCom (
Overall, the results presented herein reveal the potential of MiND to identify and predict bacterial competition, and to design and predict the effect of targeted interventions in complex communities.
It should be emphasized that the above-described embodiments of the present disclosure are merely possible examples of implementations set forth for a clear understanding of the principles of the disclosure. Many variations and modifications may be made to the above-described embodiment(s) without departing substantially from the spirit and principles of the disclosure. All such modifications and variations are intended to be included herein within the scope of this disclosure and protected by the following claims.
This application claims the benefit of U.S. Provisional Application No. 63/310,476, filed on Feb. 15, 2022, the entire content of which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/061546 | 1/30/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63310476 | Feb 2022 | US |