The human intestinal microbiota constitutes a complex ecosystem now well recognized for its impact on human health and well being. It does contribute to maturation of the immune system and direct barrier against colonization by pathogens. Over the second half of the past century, infectious diseases have been dramatically reduced and major pathogens have been put under control. During the same period, a number of “immune” diseases have followed a constant increase in prevalence, especially in western societies. This has been the case for allergies, inflammatory bowel diseases, irritable bowel syndrome and possibly metabolic and degenerative disorders such as obesity, metabolic syndrome, diabetes and cancer. The sequence of the human genome has lead to the observation of genes associated with an increased risk for immune diseases but mutations in these genes will most often only explain a small fraction of the actual cases and genetic predisposition will require environmental triggers to actually cause a disease. Among environmental components, the intestinal microbiota has recently gained a marked recognition as a key player.
The analysis of the molecular composition of the intestinal microbiota in healthy humans indicates marked inter-individual variations which may seem paradoxical considering the high degree of conservation of major functions of the intestinal microbiota such as anaerobic digestion of alimentary fibres. Recent high throughput and culture independent molecular observations have lead to the description of a core within the human intestinal microbiota, in terms of species but also at the level of genes; i.e. a set of conserved entities that could be responsible for major conserved functionalities.
The current knowledge permits to define criteria qualifying the normal state of the human intestinal microbiota, i.e. normobiosis. This further allows identifying specific distortions from normobiosis, i.e. dysbiosis, in immune, metabolic or degenerative diseases. The exploration of dysbiosis may be viewed as a primary step providing key information for the design of strategies aiming at restoring or maintaining homeostasis and normobiosis. In addition, criteria qualifying dysbiosis in a strictly defined, well phenotyped, disease context will be valuable elements to design diagnosis models. Although so far restricted to microbiota composition and/or diversity, dysbiosis has been suspected for several diseases and in a few cases it has already been partially documented, e.g. in obesity. Indeed, nutrition plays a crucial role in directly modulating our microbiomes and health phenotypes. Poorly balanced diets can turn the gut microbiome from a partner for health to a “pathogen” in chronic diseases. Accumulating evidence supports the hypothesis that obesity and related metabolic diseases develop because of low-grade, systemic and chronic inflammation induced by diet-disrupted gut microbiota. There is thus still a need for a new, reliable method allowing a consistent diagnosis of obesity.
Most intestinal commensals cannot be cultured. Genomic strategies have been developped to overcome this limitation (Hamady and Knight, Genome Res, 19: 1141-1152, 2009). These strategies have allowed the definition of the microbiome as the collection of the genes comprised in the genomes of the microbiota (Turnbaugh et al., Nature, 449: 804-8010, 2007; Hamady and Knight, Genome Res., 19: 1141-1152, 2009). The existence of a small number of species shared by all individuals constituting the human intestinal microbiota phylogenetic core has been demonstrated (Tap et al., Environ Microbiol., 11(10): 2574-2584, 2009). Recently, a metagenomic analysis has led to the identification of an extensive catalogue of 3.3 million non-redundant microbial genes of the human gut, corresponding to 576.7 gigabases of sequence (Qin et al., Nature, 2010, doi:10.1038/nature08821).
The inventors have used a method based on the isolation and sequencing of DNA fragments from human faeces in different individuals. Since an extensive catalogue of microbial genes from the gut is now available (Qin et al., Nature, 2010, doi:10.1038/nature08821), the number of copies and the frequency of a specific sequence in a specific population (e.g. patients suffering from obesity) can be calculated. It is thus possible to identify any correlation between the presence or absence of a specific gene and the presence or absence of a specific pathology. In addition, the number of copies of a specific gene in an individual can be determined.
The inventors were able to identify genes which are significantly different between a group of obese patients, and a control group of lean, healthy people. These genes are listed in Table 1. The said genes are more numerous in lean individuals than in the patients. This observation is statistically significant, since the total number of microbial genes is not different in both populations. There is thus a loss of specific human's gut microbial genes in individuals suffering from obesity.
A first aspect of this invention is a method for diagnosing obesity, said method comprising a step of determining whether at least one gene is absent from an individual's gut microbiome. By “individual's gut microbiome”, it is herein understood all the genes constituting the microbiota of the said individual. The term “individual's gut microbiome” thus corresponds to all the genes of all the bacteria present in the said individual's gut.
A gene is absent from the microbiome when its number of copies in the microbiome is under a certain threshold value. According to the present invention, a “threshold value” is intended to mean a value that permits to discriminate samples in which the number of copies of the gene of interest corresponds to a number of copies in the individual's microbiome that is low or high. In particular, if a number of copies is inferior or equal to the threshold value, then the number of copies of this gene in the microbiome is considered low, whereas if the number of copies is superior to the threshold value, then the number of copies of this gene in the microbiome is considered high. A low copy number means that the gene is absent from the microbiome, whereas a high number of copies means that the gene is present in the microbiome. For each gene, and depending on the method used for measuring the number of copies of the gene, the optimal threshold value may vary. However, it may be easily determined by a skilled artisan based on the analysis of the microbiome of several individuals in which the number of copiesl (low or high) is known for this particular gene, and on the comparison thereof with the number of copies of a control gene.
The method of the invention thus allows the skilled person to diagnose a pathology solely on the basis of the presence or the absence of a gene from the individual's gut microbiome. There is a direct correlation between the number of copies of a specific gene and the number of bacterial cells carrying this gene. The method of the invention thus allows the skilled person to detect a dysbiosis, i.e. a microbial imbalance, by analysis of the microbiome. Not all the species in the gut have been identified, because most cannot be cultured, and identification is difficult. In addition, most species found in the gut of a given individual are rare, which makes them difficult to detect (Hamady and Knight, Genome Res., 19: 1141-1152, 2009). In this first aspect of the invention, no prior identification of the bacterial species the said gene belongs to is required. The method of diagnosis of the invention is thus not restricted to the determination of a change in the population of known gut's bacterial species, but encompasses also the bacteria which have not yet been characterized taxiconomically.
There are several ways to obtain samples of the said individual's gut microbial DNA (Sokol et al., Inflamm. Bowel Dis., 14(6): 858-867, 2008). For example, it is possible to prepare mucosal specimens, or biopsies, obtained by coloscopy. However, coloscopy is an invasive procedure which is ill-defined in terms of collection procedure from study to study. Likewise, it is possible to obtain biopies through surgery. However, even more than coloscopy, surgery is an invasive procedure, which effects on the microbial population are not known. Preferred is the faecal analysis, a procedure which has been reliably been used in the art (Bullock et al., Curr Issues Intest Microbiol.; 5(2): 59-64, 2004; Manichanh et al., Gut, 55: 205-211, 2006; Bakir et al., Int J Syst Evol Microbiol, 56(5): 931-935, 2006; Manichanh et al., Nucl. Acids Res., 36(16): 5180-5188, 2008; Sokol et al., Inflamm. Bowel Dis., 14(6): 858-867, 2008). An example of this procedure is described in the Methods section of the Experimental Examples. Faeces contain about 1011 bacterial cells per gram (wet weight) and bacterial cells comprise about 50% of faecal mass. The microbiota of the faeces represent primarily the microbiology of the distal large bowel. It is thus possible to isolate and analyse large quantities of microbial DNA from the faeces of an individual. By “microbial DNA”, it is herein understood the DNA from any of the resident bacterial communities of the human gut. The term “microbial DNA” encompasses both coding and non-coding sequences; it is in particular not restricted to complete genes, but also comprises fragments of coding sequences. Faecal analysis is thus a non-invasive procedure, which yields consistent and directly-comparable results from patient to patient.
Therefore, in a preferred embodiment, the method of the invention comprises a step of obtaining microbial DNA from faeces of the said individual. In a further preferred embodiment, the faeces from said individual are collected, DNA is extracted, and the presence or absence from an individual's gut microbiome of at least one gene is determined. The presence or absence of a gene may be determined by all the methods known to the skilled person. For instance, the whole microbiome of the said individual may be sequenced, and the presence or absence of the said gene searched with the help of bioinformatics methods. One instance of such a strategy is described in the Methods section of the Experimental Examples. Alternatively, the gene of interest may be looked for in the microbiome by hybridization with a specific probe, e.g. by Southern hybridization. It will be immediately apparent to the person of skills in the art that, in this particular embodiment, although Southern hybridization is perfectly suitable, it is nevertheless more convenient and sensitive to use microarrays. In yet another embodiment, the presence of the gene of interest may be detected by amplification, in particular by quantitative PCR (qPCR). These technologies (Southern, microarrays, qPCR, etc) are now used routinely by those skilled in the art and thus do not need to be detailed here.
In another preferred embodiment, the gene which absence or presence from the individual's gut microbiome is determined is selected from the group of genes listed in Tables 1. The skilled person will have no difficulty in realizing that the more genes are tested, the higher the degree of confidence of the result. According to another further preferred embodiment, the method of the invention comprises determining the presence or absence of at least 50% of the genes listed in Table 1, more preferably, at least 75% of the genes of Table 1, even more preferably, at least 90% of the genes of Table 1.
Even though a great number of the bacterial species found in the microbial flora have not been identified, it is known that most bacteria belong to the genera Bacteroides, Clostridium, Fusobacterium, Eubacterium, Ruminococcus, Peptococcus, Peptostreptococcus, and Bifidobacterium. Other genera such as Escherichia and Lactobacillus are present to a lesser extent. Some individual species belonging to these genera have been identified, and some of the genes of these species are known. The extensive metagenomic study which has led to the identification of 3.3 million non-redundant microbial genes has also permitted the assignment of most new sequences. A gene belonging to a given species is present in an individual at the same frequency as all the other genes of the said species. It is thus possible for each of the genes identified through the method of the invention to determine whether there is a correlation between the presence or absence of the said gene and the presence or absence of a set of genes known to belong to a specific bacterial species in various individuals. Such a correlation indicates that the unknown gene belongs to the said specific bacterial species. The inventors have thus shown that some bacterial species are associated with obesity whereas other bacterial species are associated with the lean phenotype. The obese phenotype can be predicted by a linear combination of the said species, i.e. the more bacterial species associated with the obese phenotype are present in an individual's gut, and the lesser species associated with the lean phenotype in the said individual's gut, the higher the probability that the said individual suffers from obesity. For example, the absence of Bacteronides pectinophilus, Eubacterium siraeum and Clostridium phyto fermentans and the presence of Anaerotruncus colihominis in the gut of a person indicates that this person suffers from obesity.
It will be clear for the person skilled in the art that the genes of the invention can be used as biomarkers, for example during the treatment of patients suffering from obesity. Therefore, in another embodiment, the invention includes a method for monitoring the efficacy of a treatment for obesity. When a treatment is efficacious against obesity, the dysbiosis initially observed gradually disappears. Whereas some specific genes are absent from the individual's guts when that said individual is obese (e.g. the genes of Table 1), these genes reappear during the treatment. In this embodiment, the method of the invention thus comprises the steps of first determining whether at least one gene is absent from the said patient's microbiome, administering the treatment, determining if the said at least one gene is present in the patient's microbiome. In a preferred embodiment, the method of the invention comprises the steps of obtaining microbial DNA from faeces of the said individual, before and after the treatment. In a further preferred embodiment, the faeces from said individual are collected before and after the treatment, DNA is extracted, and the presence or absence from an individual's gut microbiome of at least one gene is determined.
In another preferred embodiment, the gene which absence or presence from the individual's gut microbiome is determined is selected from the group of genes listed in Tables 1. In a particular embodiment, the method of the invention comprises determining the presence or absence of at least 50% of the genes listed in Table 1, more preferably, at least 75% of the genes of Table 1, even more preferably, at least 90% of the genes of Table 1.
The present invention also includes a kit dedicated to the implementation of the methods of the invention, comprising all the genes which are absent in a patient suffering from obesity and which are present in a lean, healthy person. In particular, the present invention relates to a microarray dedicated to the implementation of the methods according to the invention, comprising probes binding to all the genes absent in a patient suffering from obesity and present in a lean person. In a preferred embodiment, said microarray is a nucleic acid microarray. According to the invention, a “nucleic microarray” consists of different nucleic acid probes that are attached to a substrate, which can be a microchip, a glass slide or a microsphere-sized bead. A microchip may be constituted of polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, or nitrocellulose. Probes can be nucleic acids such as cDNAs (“cDNA microarray”) or oligonucleotides (“oligonucleotide microarray”, the oligonucleotides being about 25 to about 60 base pairs or less in length). Alternatively to nucleic acid technology, quantitative PCR may be used and amplification primers specific for the genes to be tested are thus also very useful for performing the methods according to the invention. The present invention thus further relates to a kit for diagnosing obesity in a patient, comprising a dedicated microarray as described above or amplification primers specific for genes absent in a patient suffering from obesity and present in a healthy person. Whereas these kits may allow the skilled person to detect 10%, 25%, 50% or 75% of the said genes, they are most useful when they allow the detection of 90%, 95%, 97.5% or even 99% of the said genes. Thus a microarray according to the invention will comprise probes binding to at least 10%, 25%, 50% or 75%, and preferably 90%, 95%, 97.5%, and even more preferably at least 99% of the said genes. Likewise a kit for quantitative PCR will contain primers allowing the amplification of at least 10%, 25%, 50% or 75%, and preferably 90%, 95%, 97.5%, and even more preferably at least 99% of the said genes. In a preferred embodiment, the genes which are absent in an obese patient and are present in lean people are the genes listed in Table 1.
Human Faecal Sample Collection.
Danish individuals were from the Inter-99 cohort (Toft. et al., Prev. Med., 47: 378-383, 2008), varying in phenotypes according to BMI (body/mass index) and status towards obesity/diabetes. Patients and healthy controls were asked to provide a frozen stool sample. Fresh stool samples were obtained at home, and samples were immediatelyfrozen by storing them in their home freezer. Frozen samples were delivered to the hospital using insulating polystyrene foam containers, and then they were stored at −80° C. until analysis.
DNA Extraction.
A frozen aliquot (200 mg) of each faecal sample was suspended in 250 μl of guanidine thiocyanate, 0.1M Tris (pH 7.5) and 40 μl of 10% N-lauroyl sarcosine. Then, DNA extraction was conducted as previously described (Manichanh et al., Gut, 55: 205-211, 2006). The DNA concentration and its molecular size were estimated by nanodrop (Thermo Scientific) and agarose gel electrophoresis.
DNA Library Construction and Sequencing.
DNA library preparation followed the manufacturer's instruction (Illumina). We used the same workflow as described elsewhere to perform cluster generation, template hybridization, isothermal amplification, linearization, blocking and denaturization and hybridization of the sequencing primers. The base-calling pipeline (version IlluminaPipeline-0.3) was used to process the raw fluorescent images and call sequences. We constructed one library (clone insert size 200 bp) for each of the first 15 samples, and two libraries with different clone insert sizes (135 by and 400 bp) for each of the remaining 109 samples for validation of experimental reproducibility. To estimate the optimal return between the generation of novel sequence and sequencing depth, we aligned the Illumina GA reads from samples MH0006 and MH0012 onto 468,335 Sanger reads totalling to 311.7 Mb generated from the same two samples (156.9 and 154.7 Mb, respectively), using the Short Oligonucleotide Alignment Program (SOAP) (Li et al., Bioinformatics, 25: 1966-1967, 2009). and a match requirement of 95% sequence identity. With about 4 Gb of Illumina sequence, 94% and 89% of the Sanger reads (for MH0006 and MH0012, respectively) were covered. Further extensive sequencing, to 12.6 and 16.6 Gb for MH0006 and MH0012, respectively, brought only a moderate increase of coverage to about 95%. More than 90% of the Sanger reads were covered by the Illumina sequences to a very high and uniform level, indicating that there is little or no bias in the Illumina GA sequence. As expected, a large proportion of Illumina sequences (57% and 74% for M0006 and M0012, respectively) was novel and could not be mapped onto the Sanger reads. This fraction was similar at the 4 and 12-16 Gb sequencing levels, confirming that most of the novelty was captured already at 4 Gb.
We generated 35.4-97.6 million reads for the remaining 122 samples, with an average of 62.5 million reads. Sequencing read length of the first batch of 15 samples was 44 by and the second batch was 75 bp.
Public Data Used
The sequenced bacteria genomes (totally 806 genomes) deposited in GenBankwere downloaded from the NCBI database (http://www.ncbi.nlm.nih.gov/) on 10 Jan. 2009. The known human gut bacteria genome sequences were downloaded from HMP database (http://www.hmpdacc-resources.org/cgi-bin/hmp_catalog/main.cgi), GenBank (67 genomes), Washington University in St Louis (85 genomes, version April 2009, http://genome.wust1.edu/pub/organism/Microbes/Human_Gut_Microbiome/), and sequenced by the MetaHIT project (17 genomes, version September 2009, http://www.sanger.ac.uk/pathogens/metahit/). The other gut metagenome data used in this project include: (1) human gut metagenomic data sequenced from US individuals (Zhang et al., Proc. Natl Acad. Sci. USA, 106: 2365-2370, 2009), which was downloaded from NCBI with the accession SRA002775; (2) human gut metagenomic data from Japanese individuals (Kurokawa et al., DNA Res. 14: 169-181, 2007), which was downloaded from P. Bork's group at EMBL (http://www.bork.embl.de). The integrated NR database we constructed in this study included NCBI-NR database (version April 2009) and all genes from the known human gut bacteria genomes.
Illumina GA Short Reads De Novo Assembly.
High-quality short reads of each DNA sample were assembled by the SOAP de novo assembler (Li. & Zhu, Genome Res., 20(2): 265-272, 2010). In brief, we first filtered the low abundant sequences from the assembly according to 17-mer frequencies The 17-mers with depth less than 5 were screened in front of assembly, for these low-frequency sequences were very unlikely to be assembled, whereas removing them would significantly reduce memory requirement and make assembly feasible in an ordinary supercomputer (512 GB memory in our institute). Then the sequences were processed one by one and the de Bruijn graph data format was used to store the overlap information among the sequences. The overlap paths supported by a single read were unreliable and removed. Short low-depth tips and bubbles that were caused by sequencing errors or genetic variations between microbial strains were trimmed and merged, respectively. Read paths were used to solve the tiny repeats. Finally, we broke the connections at repeat boundaries, and outputted the continuous sequences with unambiguous connections as contigs. The metagenomic special model was chosen, and parameters ‘-K 21’ and ‘-K 23’ were used for 44 by and 75 by reads, respectively, to indicate the minimal sequence overlap required. After de novo assembly for each sample independently, we merged all the unassembled reads together and performed assembly for them, as to maximize the usage of data and assemble the microbial genomes that have low frequency in each read set, but have sufficient sequence depth for assembly by putting the data of all samples together.
Validating Illumina Contigs Using Sanger Reads.
We used BLASTN (WUBLAST 2.0) to map Sanger reads from samples MH0006 and MH0012 (156.9 Mb and 154.7 Mb, respectively) to Illumina contigs (single best hit longer than 75 by and over 95% identity) from the same samples. Each alignment was scanned for breakage of collinearity where both sequences have at least 50 bases left unaligned at one end of the alignment. Each such breakage was considered an assembly error in the Illumina contig at the location where collinearity breaks. Errors within 30 by from each other were merged. An error was discarded if there exists a Sanger read that agrees with the contig structure for 60 by on both sides of the error. For comparison, we repeated this on a Newbler2 assembly of 454 Titanium reads from MH0006 (550 Mb reads). We estimate 14.12 errors per Mb of contigs for the Illumina assembly, which is comparable to that of the 454 assembly (20.73 per Mb). 98.7% of Illumina contigs that map at least one Sanger read were collinear over 99.55% of the mapped regions, which is comparable to 97.86% of such 454 contigs being collinear over 99.48% of the mapped regions.
Evaluation of Human Gut Microbiome Coverage.
The Illumina GA reads were aligned against the assembled contigs and known bacteria genomes using SOAP by allowing at most two mismatches in the first 35-bp region and 90% identity over the read sequence. The Roche/454 and Sanger sequencing reads were aligned against the same reference using BLASTN with 1×10−8, over 100 by alignment length and minimal 90% identity cutoff. Two mismatches were allowed and identity was set 95% over the read sequence when aligned to the GA reads of MH0006 and MH0012 to Sanger reads from the same samples by SOAP.
Gene Prediction and Construction of the Non-Redundant Gene Set.
We use MetaGene (Noguchi et al., Nucleic Acids Res., 34, 5623-5630, 2006)—which uses di-codon frequencies estimated by the GC content of a given sequence, and predicts a whole range of ORFs based on the anonymous genomic sequences—to find ORFs from the contigs of each of the 124 samples as well as the contigs from the merged assembly. The predicted ORFs were then aligned to each other using BLAT (Kent et al., Genome Res., 12: 656-664, 2002). A pair of genes with greater than 95% identity and aligned length covered over 90% of the shorter gene was grouped together. The groups sharing genes were then merged, and the longest ORF in each merged group was used to represent the group, and the other members of the group were taken as redundancy. Therefore, we organized the non-redundant gene set from all the predicted genes by excluding the redundancy. Finally, the ORFs with length less than 100 by were filtered. We translated the ORFs into protein sequences using the NCBI Genetic Codes (Ley et al., Nature Rev. Microbiol., 6: 776-788, 2008).
Identification of Genes.
To make a balance between identifying low-abundance genes and reducing the error-rate of identification, we explored the impact of the threshold set for read coverage required to identify a gene in individual microbiomes. The number of genes decreased about twice when the number of reads required for identification was increased from 2 to 6, and changed slowly thereafter. Nevertheless, to include the rare genes into the analysis, we selected the threshold of 2 reads.
Gene Taxonomic Assignment.
Taxonomic assignment of predicted genes was carried out using BLASTP alignment against the integrated NR database. BLASTP alignment hits with e-values larger than 1×10−5 were filtered, and for each gene the significant matches which were defined by e-values<10×e-value of the top hit were retained to distinguish taxonomic groups. Then we determined the taxonomical level of each gene by the lowest common ancestor (LCA)-based algorithm that was implemented in MEGAN (Huson et al., Genome Res., 17: 377-386, 2007). The LCA-based algorithm assigns genes to taxa in the way that the taxonomical level of the assigned taxon reflects the level of conservation of the gene. For example, if a gene was conserved in many species, it was assigned to the LCA rather than to a species.
Gene Functional Classification.
We used BLASTP to search the protein sequences of the predicted genes in the eggNOG database (Jensen et al., Nucleic Acids Res., 36: D250-D254, 2008) and KEGG database (Kanehisa et al., Nucleic Acids Res., 32: D277-D280, 2004) with e-value<1×10−5. The genes were annotated as the function of the NOGs or KEGG homologues with lowest e-value. The eggNOG database is an integration of the COG and KOG databases. The genes annotated by COG were classified into the 25 COG categories, and genes that were annotated by KEGG were assigned into KEGG pathways.
Determination of Minimal Gut Bacterial Genome.
The number of non-redundant genes assigned to the eggNOG clusters was normalized by gene length and cluster copy number. The clusters were ranked by normalized gene number and the range that included the clusters encoding essential Bacillus subtilis genes was determined, computing the proportion of these clusters among the successive groups of 100 clusters. Analysis of the range gene clusters involved, besides iPath projections, use of KEGG and manual verification of the completeness of the pathways and protein machineries they encode.
Determination of Total Functional Complement and Minimal Metagenome.
We computed the total and shared number of orthologous groups and/or gene families present in random combinations of n individuals (with n=52 to 124, 100 replicates per bin). This analysis was performed on three groups of gene clusters: (1) known eggNOG orthologous groups (that is, those with functional annotation, excluding those in which the terms [Uu]ncharacteri[sz] ed, [Uu]nknown, [Pp]redicted or [Pp]utative occurred); (2) all eggNOG orthologous groups; (3) all orthologous groups plus gene families constructed from remaining genes not assigned to the two above categories. Families were clustered from all-against-all BLASTP results using MCL (van Dongen, Ph. D. Thesis, Univ. Utrecht, 2000) with an inflation factor of 1.1 and a bit-score cutoff of 60.
Rarefaction Analysis.
Estimation of total gene richness was done using EstimateS on 100 randomly picked samples due to memory limitations. Because the CV value was >0.5, both chao2 (classic) and ICE richness estimators were calculated and the larger estimate of the two (ICE) was used. The estimate for this sample size was 3,621,646 genes (ICE) whereas Sobs (Mao Tau) was 3,090,575 genes, or 85.3%. The ICE estimator curve did not completely saturate, indicating that additional samples will need to be added to achieve a final, conclusive estimate.
Common Bacterial Core.
To eliminate the influence of very similar strains and assess the presence of known microbial species among the individuals of the cohort, we used 650 sequenced bacterial and archaeal genomes as a reference set. The set was composed from 932 publicly available genomes, which were grouped by similarity, using a 90% identity cutoff and the similarity over at least 80% of the length. From each group only the largest genome was used. Illumina reads from 124 individuals were mapped to the set, for species profiling analysis and the genomes originating from the same species (by differing in size >20%) curated by manual inspection and by using the 16S-based clustering when the sequences were available.
Relative Abundance of Microbial Genomes Among Individuals.
We computed the genome coverage by uniquely mapping Illumina reads and normalized it to 1 Gb of sequence, to correct for different sequencing levels in different individuals. The coverage was summed over all species of the non-redundant bacterial genome set for each individual and the proportion of each species relative to the sum calculated.
Species Co-Existence Network.
For the 155 species that had genome coverage by the Illumina reads ≧1% in at least one individual we calculated the pair-wise inter-species Pearson correlations between sequencing depths (abundance) throughout the entire cohort of 124 individuals. From the resulting 11,175 inter-species correlations, correlations less than −0.4 or above 0.4 (n=342) were visualized in a graph using Cytoscape (Shannon et al., Genome Res. 13: 2498-2504, 2003). displaying the average genome coverage of each species as node size in the graph.
A Summary Description of the Cohort & the Method Used.
A total of 177 Danish individuals were studied. They comprised 67 people with a BMI<27.5 (lean, healthy controls) and 110 individuals with a BMI>27.5 (obese patients). The entire gene catalog of 3.3 million genes was searched by ranksum search for those that are significantly different between the two groups. Gene frequency was normalized by the gene size (larger genes are bigger targets and are seen more often) and the difference in the sequencing extent for different individuals. The number of significantly different genes is affected by the thresholds and the splits into groups. In brief, 1327 “BMI-related genes” (also referred to herein as BMI genes) were found at p<10−4.
Overall Analysis of the BMI Genes.
The significantly different genes, i.e. BMI-related genes, were plotted by individual (
Comparison of the Distribution of All Genes and BMI Genes.
The distribution of all genes of the microbiome and of the BMI genes was compared. There is much less difference in all gene numbers and frequency between the two groups than the BMI genes. The BMI gene distribution does not reflect simply the all gene distribution. The loss of genes in the obese patients is thus significant.
BMI-Related Species.
The BMI genes were allocated to species, using the taxonomic assignments attributed to the genes in the 3.3 million catalog (Qin et al., Nature, 2010, in press, doi:10.1038/nature08821). It was found that 59.8% of the BMI genes, but only 32.8% of all genes, were from Firmicutes. On the other hand, the frequency of Bacteroidetes was 8.1% for BMI genes and 18.4% for all the genes of the microbiome. Therefore, obesity is associated to changes in Firmicutes. The species were first identified by the number of genes assigned to them amongst the BMI genes. Then other genes from the same species were pulled out of the catalog and the presence of 50 representative genes for each species assessed in different individuals (this compared very favorably with the use of a single 16S gene, which is currently done to identify a species). The species was considered present if at least half of the marker genes were found in an individual. The significance of the distribution between the healthy and the patients was estimated by the comparison with the all cohort distribution (67 to 110) using the Chi2 test. Bacteronides pectinophilus, Eubacterium siraeum and Clostridium phyto fermentans were associated with the healthy population (p=2.1×10−3, p=3.5×10−4, and p=6.1×10−4, respectively), i.e. they tended to be absent from the obese patients. On the other hand, Anaerotruncus colihominis was associated with the patient cohort (p=1.4×10−2). On the basis of the identification of species, it was demonstrated that the linear combination of these 4 species fully predicts the obesity phenotype (FIG. 2A). Healthy individuals and patients are shown as blue and red dots, respectively. The species presence (the ordinate) corresponds to the sum of the genes the of “good species” (anti-associated with obesity) minus the genes of the “bad species” (associated with obesity). The individuals are ranked by the species presence (the abscissa). If an individual has excess of the “good species” genes, he or she will be on the top of the rank and tend to be healthy, while if there is an excess of “bad species” genes, he or she will be at the right and tend to be sick. This is also illustrated in
Faecalibacterium prausnitzii
Eubacterium ventriosum
Alistipes putredinis
Clostridium leptum
Alistipes putredinis
Bacteroides ovatus
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Cupriavidus pinatubonensis
Anaerostipes caccae
Eubacterium hallii
Clostridium cellulolyticum
Clostridium cellulolyticum
Clostridium
Desulfovibrio piger
Alistipes putredinis
Bacteroides capillosus
Clostridium asparagiforme
Caldicellulosiruptor saccharolyticus
Anaerocellum thermophilum
Anaerococcus hydrogenalis
Bacteroides pectinophilus
Clostridium difficile
Bacteroides capillosus
Bacteroides capillosus
Desulfitobacterium hafniense
Bacteroides capillosus
Bacteroides capillosus
Heliobacterium modesticaldum
Anaerofustis stercorihominis
Clostridium bartlettii
Clostridium
Coprococcus comes
Dorea formicigenerans
Clostridium bolteae
Bacteroides capillosus
Clostridium difficile
Anaerotruncus colihominis
Clostridium perfringens
Clostridium
Clostridium hylemonae
Clostridium
Coprococcus comes
Clostridium phytofermentans
Ruminococcus lactaris
Clostridium bolteae
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Roseburia inulinivorans
Faecalibacterium prausnitzii
Clostridium butyricum
Clostridium
Coprococcus eutactus
Clostridium
Roseburia inulinivorans
Clostridium
Haemophilus influenzae
Pedobacter
Faecalibacterium prausnitzii
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Bacteroides pectinophilus
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Chthoniobacter flavus
Eubacterium siraeum
Clostridium phytofermentans
Mesoplasma forum
Bacteroides intestinalis
Clostridium asparagiforme
Desulfovibrio desulfuricans
Clostridium thermocellum
Faecalibacterium prausnitzii
Roseburia inulinivorans
Clostridium leptum
Candidatus Amoebophilus asiaticus
Clostridium
Clostridium phytofermentans
Clostridium phytofermentans
Clostridium bolteae
Coprococcus comes
Blautia hydrogenotrophica
Ruminococcus torques
Clostridium
Bacteroides capillosus
Eubacterium ventriosum
Ruminococcus torques
Clostridium phytofermentans
Clostridium hiranonis
Thermoanaerobacter tengcongensis
Collinsella stercoris
Ruminococcus obeum
Heliobacterium modesticaldum
Dorea formicigenerans
Clostridium phytofermentans
Clostridium phytofermentans
Anaerofustis stercorihominis
Dorea longicatena
Clostridium phytofermentans
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Bacteroides
Bacteroides intestinalis
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Bacteroides
Clostridium phytofermentans
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Clostridium
Clostridium
Clostridium methylpentosum
Clostridium leptum
Clostridium hiranonis
Eubacterium ventriosum
Faecalibacterium prausnitzii
Alistipes putredinis
Clostridium leptum
Eubacterium siraeum
Roseburia inulinivorans
Trichomonas vaginalis
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Anaerotruncus colihominis
Ruminococcus obeum
Faecalibacterium prausnitzii
Caulobacter
Eubacterium siraeum
Clostridium acetobutylicum
Lactobacillus
Ruminococcus obeum
Faecalibacterium prausnitzii
Clostridium leptum
Bacteroides capillosus
Alistipes putredinis
Faecalibacterium prausnitzii
Gramella forsetii
Faecalibacterium prausnitzii
Clostridium thermocellum
Clostridium
Ruminococcus torques
Elusimicrobium minutum
Alkaliphilus oremlandii
Clostridium scindens
Roseburia inulinivorans
Coprococcus eutactus
Clostridium methylpentosum
Bacteroides capillosus
Desulfococcus oleovorans
Alkaliphilus oremlandii
Lactobacillus delbrueckii
Clostridium
Clostridium
Clostridium methylpentosum
Clostridium phytofermentans
Clostridium
Roseburia inulinivorans
Bacteroides pectinophilus
Clostridium
Clostridium
Eubacterium hallii
Clostridium kluyveri
Clostridium phytofermentans
Eubacterium ventriosum
Clostridium
Clostridium
Clostridium phytofermentans
Streptomyces griseus
Eubacterium ventriosum
Clostridium phytofermentans
Clostridium phytofermentans
Desulfitobacterium hafniense
Bacteroides
Bacteroides
Bacteroides capillosus
Bacteroides
Bacteroides
Faecalibacterium prausnitzii
Alistipes putredinis
Opitutus terrae
Bacteroides capillosus
Hyphomonas neptunium
Bacteroides intestinalis
Bacteroides pectinophilus
Clostridium
Clostridium nexile
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Eubacterium siraeum
Faecalibacterium prausnitzii
Alistipes putredinis
Eubacterium siraeum
Clostridium methylpentosum
Eubacterium siraeum
Methanococcus maripaludis
Eubacterium siraeum
Eubacterium siraeum
Faecalibacterium prausnitzii
Clostridium leptum
Eubacterium siraeum
Eubacterium siraeum
Alistipes putredinis
Clostridium
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Bacteroides
Gloeobacter violaceus
Trichomonas vaginalis
Roseburia inulinivorans
Anaerotruncus colihominis
Clostridium
Anaerofustis stercorihominis
Bifidobacterium adolescentis
Clostridium thermocellum
Schizosaccharomyces
Clostridium
Bacteroides capillosus
Bacteroides capillosus
Lentisphaera araneosa
Clostridium thermocellum
Thermoanaerobacter
Carboxydothermus hydrogenoformans
Ruminococcus torques
Clostridium bolteae
Ruminococcus gnavus
Clostridium
Eubacterium dolichum
Clostridium
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Geobacillus
Faecalibacterium prausnitzii
Clostridium
Roseburia inulinivorans
Clostridium
Clostridium
Clostridium
Clostridium
Clostridium asparagiforme
Clostridium asparagiforme
Faecalibacterium prausnitzii
Lysinibacillus sphaericus
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Clostridium bolteae
Syntrophomonas wolfei
Ruminococcus lactaris
Roseburia inulinivorans
Eubacterium siraeum
Clostridium hylemonae
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Bacteroides capillosus
Butyrivibrio
Trichomonas vaginalis
Bacteroides capillosus
Eubacterium siraeum
Eubacterium siraeum
Clostridium asparagiforme
Eubacterium siraeum
Clostridium thermocellum
Desulfitobacterium hafniense
Desulfitobacterium hafniense
Eubacterium siraeum
Eubacterium siraeum
Bacteroides capillosus
Ruminococcus lactaris
Clostridium
Clostridium bolteae
Eubacterium siraeum
Eubacterium siraeum
Faecalibacterium prausnitzii
Eubacterium siraeum
Bacteroides
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Faecalibacterium prausnitzii
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Faecalibacterium prausnitzii
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium biforme
Eubacterium siraeum
Faecalibacterium prausnitzii
Eubacterium siraeum
Eubacterium siraeum
Bifidobacterium dentium
Eubacterium biforme
Bacteroides pectinophilus
Eubacterium siraeum
Eubacterium hallii
Alkaliphilus oremlandii
Eubacterium siraeum
Bacteroides plebeius
Roseburia inulinivorans
Bacteroides pectinophilus
Ruminococcus lactaris
Clostridium phytofermentans
Bordetella
Burkholderia
Clostridium kluyveri
Bacteroides
Eubacterium siraeum
Eubacterium biforme
Eubacterium hallii
Bacteroides pectinophilus
Anaerofustis stercorihominis
Eubacterium hallii
Clostridium
Akkermansia muciniphila
Eubacterium siraeum
Akkermansia muciniphila
Akkermansia muciniphila
Akkermansia muciniphila
Eubacterium siraeum
Eubacterium siraeum
Akkermansia muciniphila
Dictyoglomus thermophilum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Ruminococcus lactaris
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Desulfitobacterium hafniense
Bacteroides pectinophilus
Eubacterium siraeum
Eubacterium siraeum
Synechococcus
Bacteroides capillosus
Trichodesmium erythraeum
Clostridium cellulolyticum
Clostridium
Blautia hydrogenotrophica
Clostridium
Clostridium
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Clostridium ramosum
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Clostridium leptum
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Clostridium bolteae
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Finegoldia magna
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Synechococcus
Bacteroides
Bacteroides intestinalis
Streptococcus infantarius
Clostridium
Clostridium asparagiforme
Clostridium bolteae
Clostridium
Catenibacterium mitsuokai
Clostridium
Moorella thermoacetica
Clostridium
Eubacterium siraeum
Bacteroides pectinophilus
Clostridium scindens
Faecalibacterium prausnitzii
Bacteroides capillosus
Desulfatibacillum alkenivorans
Clostridium phytofermentans
Clostridium
Clostridium
Cyanobacteria
Bacteroides capillosus
Faecalibacterium prausnitzii
Clostridium
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Clostridium bolteae
Eubacterium siraeum
Bacteroides pectinophilus
Eubacterium hallii
Eubacterium siraeum
Eubacterium siraeum
Clostridium phytofermentans
Clostridium
Eubacterium ventriosum
Clostridium bolteae
Syntrophomonas wolfei
Clostridium botulinum
Eubacterium siraeum
Bacteroides cellulosilyticus
Bacteroides
Roseburia inulinivorans
Ruminococcus torques
Gammaproteobacteria
Anaerotruncus colihominis
Bacteroides pectinophilus
Bacteroides pectinophilus
Bacteroides pectinophilus
Syntrophomonas wolfei
Eubacterium siraeum
Bacteroides pectinophilus
Faecalibacterium prausnitzii
Roseburia inulinivorans
Bacteroides pectinophilus
Roseburia inulinivorans
Bacteroides pectinophilus
Ruminococcus lactaris
Clostridium
Clostridium bartlettii
Bacteroides pectinophilus
Bacteroides pectinophilus
Roseburia inulinivorans
Bacteroides pectinophilus
Eubacterium siraeum
Bacteroides pectinophilus
Bacteroides uniformis
Bacteroides uniformis
Clostridium butyricum
Clostridium leptum
Blautia hydrogenotrophica
Faecalibacterium prausnitzii
Blautia hydrogenotrophica
Clostridium bolteae
Faecalibacterium prausnitzii
Bacteroides
Heliobacterium modesticaldum
Anaerostipes caccae
Fervidobacterium nodosum
Alistipes putredinis
Clostridium
Erwinia tasmaniensis
Bacteroides pectinophilus
Heliobacterium modesticaldum
Bacteroides pectinophilus
Clostridium nexile
Clostridium
Parabacteroides distasonis
Coprococcus comes
Eubacterium siraeum
Clostridium phytofermentans
Synechococcus
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Gloeobacter violaceus
Ruminococcus obeum
Bacteroides pectinophilus
Coprococcus comes
Coprococcus eutactus
Coprococcus eutactus
Clostridium phytofermentans
Clostridium
Clostridium
Clostridium
Clostridium
Ruminococcus gnavus
Roseburia inulinivorans
Dorea formicigenerans
Bacteroides pectinophilus
Clostridium
Clostridium
Faecalibacterium prausnitzii
Coprococcus comes
Catenibacterium mitsuokai
Eubacterium ventriosum
Ruminococcus lactaris
Coprococcus eutactus
Eubacterium hallii
Ruminococcus obeum
Eubacterium hallii
Clostridium cellulolyticum
Clostridium nexile
Ruminococcus obeum
Bacteroides pectinophilus
Blautia hydrogenotrophica
Roseburia inulinivorans
Bacteroides pectinophilus
Clostridium hylemonae
Catenibacterium mitsuokai
Dorea formicigenerans
Roseburia inulinivorans
Bacteroides pectinophilus
Roseburia inulinivorans
Ruminococcus obeum
Desulfitobacterium hafniense
Eubacterium hallii
Clostridium acetobutylicum
Clostridium bolteae
Bacteroides
Bacteroides
Faecalibacterium prausnitzii
Bacteroides
Bacteroides cellulosilyticus
Bacteroides
Bacteroides
Bacteroides capillosus
Bacteroides
Bacteroides
Bacteroides
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Roseburia inulinivorans
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Ruminococcus lactaris
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Clostridium
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Coprococcus eutactus
Eubacterium siraeum
Clostridium bartlettii
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Clostridium
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Clostridium leptum
Eubacterium siraeum
Eubacterium siraeum
Eubacterium siraeum
Bifidobacterium
Listeria
Eubacterium ventriosum
Clostridium thermocellum
Faecalibacterium prausnitzii
Bacteroides
Bacteroides
Bacteroides pectinophilus
Bacteroides capillosus
Eubacterium biforme
Bacteroides capillosus
Clostridium tetani
Bacteroides capillosus
Bacteroides capillosus
Alistipes putredinis
Anaerostipes caccae
Clostridium
Bacteroides capillosus
Cryptosporidium
Faecalibacterium prausnitzii
Faecalibacterium prausnitzii
Atopobium rimae
Alistipes putredinis
Bacteroides pectinophilus
Clostridium thermocellum
Ruminococcus torques
Eubacterium siraeum
Anoxybacillus flavithermus
Clostridium bolteae
Clostridium cellulolyticum
Clostridium hylemonae
Brachyspira
Clostridium
Clostridium nexile
Clostridium
Clostridium thermocellum
Bacteroides
Clostridium
Roseburia inulinivorans
Eubacterium hallii
Heliobacterium modesticaldum
Eubacterium siraeum
Clostridium
Bacteroides capillosus
Clostridium scindens
Ruminococcus lactaris
Clostridium botulinum
Eubacterium siraeum
Eubacterium siraeum
Bacteroides capillosus
Clostridium
Anaerostipes caccae
Roseburia inulinivorans
Clostridium phytofermentans
Proteobacteria
Dethiobacter alkaliphilus
Halothermothrix orenii
Anaerocellum thermophilum
Clostridium butyricum
Geobacter bemidjiensis
Clostridium
Coprococcus eutactus
Roseburia inulinivorans
Anaerofustis stercorihominis
Clostridium leptum
Xanthomonas
Thermoanaerobacter pseudethanolicus
Mollicutes
Moorella thermoacetica
Clostridium phytofermentans
Eubacterium ventriosum
Bacillus
Roseburia inulinivorans
Anaerostipes caccae
Clostridium phytofermentans
Clostridium
Clostridium
Alkaliphilus metalliredigens
Clostridium phytofermentans
Ruminococcus lactaris
Clostridium scindens
Desulfitobacterium hafniense
Clostridium botulinum
Ruminococcus obeum
Heliobacterium modesticaldum
Clostridium methylpentosum
Parabacteroides distasonis
Clostridium thermocellum
Anaerostipes caccae
Bacteroides capillosus
Clostridium thermocellum
Bacteroides capillosus
Clostridium asparagiforme
Bacteroides capillosus
Eubacterium hallii
Coprococcus comes
Salmonella enterica
Trichoplax
Clostridium phytofermentans
Acholeplasma laidlawii
Clostridium
Bacilli
Clostridium
Methylocella silvestris
Eubacterium dolichum
Roseburia inulinivorans
Roseburia inulinivorans
Clostridium
Faecalibacterium prausnitzii
Clostridium
Eubacterium siraeum
Alkaliphilus oremlandii
Bacillus
Clostridium hylemonae
Clostridium beijerinckii
Desulfitobacterium hafniense
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2011/053041 | 3/1/2011 | WO | 00 | 8/31/2012 |
Number | Date | Country | |
---|---|---|---|
61309333 | Mar 2010 | US |