The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Oct. 27, 2021, is named 56686-703_301_SL.txt and is 29,705 bytes in size.
This invention is in the field of diagnosis and in particular the diagnosis of bile acid malabsorption (BAM).
Bile acid malabsorption (BAM) is a cause of several gut-related problems, in particular chronic diarrhea. It can result from malabsorption secondary to gastrointestinal disease, or be a primary disorder, associated with excessive bile acid production. A proportion of patients incorrectly diagnosed as suffering from diarrhea-predominant irritable bowel syndrome (IBS-D) and alternating or mixed-type irritable bowel syndrome (IBS-M) actually suffer from BAM (1, 2), for which the treatment is different to that for irritable bowel syndrome (IBS) (3). Importantly, BAM is a clinically distinct entity from IBS—not all patients suffering from BAM have IBS, and not all IBS patients suffer from BAM. BAM is estimated to account for 25% to 50% of patients with functional diarrhea or diarrhea-predominant irritable bowel syndrome (IBS-D) and 1% of the population suffer from it (4).
BAM may be treated with bile acid sequestrants. Bile acids are produced in the liver, secreted into the biliary system, stored in the gallbladder and are released after meals stimulated by cholecystokinin. Usually over 95% of the bile acids are absorbed in the terminal ileum and are taken up by the liver and resecreted. When large amounts of bile acids enter the large intestine, they stimulate water secretion and intestinal motility in the colon, which causes symptoms of chronic diarrhea. Bile acid transporters including apical sodium-dependent bile salt transporter (ASBT, IBAT, gene symbol SLC10A2), cytoplasmic ileal bile acid binding protein (IBABP, ILBP, gene symbol FABP6) and the basolateral heterodimer of OSTα and OSTβ transfer bile acids. If expression of these transporters is reduced, the intestine is less able to absorb bile acids (Type 1 bile acid malabsorption). If intestinal motility is affected by gastro-intestinal surgery, or bile acids are deconjugated by small intestinal bacterial overgrowth, absorption is less efficient (Type 3 bile acid malabsorption). Primary bile acid diarrhea (Type 2 bile acid “malabsorption”) may be caused by an overproduction of bile acids. A very small proportion of the patients with no obvious disease (Type 2 bile acid malabsorption) may have mutations in ASBT.
BAM can be diagnosed by the 75Selenium (Se) homocholic acid taurine (SeHCAT) test, which detects inability to resorb and metabolize bile acids. The test involves administering a radiolabeled bile compound and measuring its retention after one week (5). BAM is a clinically distinct entity from IBS which can be successfully managed by e.g. with a bile acid sequestrant. The SeHCAT test is the definitive test for BAM diagnosis in the clinic (6) which is expensive and not widely available.
IBS is a common condition that affects the digestive system. Symptoms include cramps, bloating, diarrhoea and constipation and occur over a long time period, generally years. Disorders such as anxiety, major depression, and chronic fatigue syndrome are common among people with IBS. There is no known cure for IBS and treatment is generally carried out to improve symptoms. Treatment may include dietary changes, medication, probiotics, and/or counselling. Dietary measures that are commonly suggested as treatments include increasing soluble fiber intake, a gluten-free diet, or a short-term diet low in fermentable oligosaccharides, disaccharides, monosaccharides, and polyols (FODMAPs). The medication loperamide is used to help with diarrhea while laxatives are be used to help with constipation. Antidepressants may improve overall symptoms and pain. Like most chronic non-communicable disorders, IBS appears to be heterogeneous (7). It ranges in severity from nuisance bowel disturbance to social disablement, accompanied by marked symptomatic heterogeneity (8). Although frequently considered a disorder of the brain-gut axis (9, 10), it is unclear if IBS begins in the gut or in the brain or both. The occurrence of post-infectious IBS (11) suggests that a proportion of cases are initiated in the end-organ, albeit with susceptibility risk factors, some of which may be psychosocial. Advances in microbiome science, with emerging evidence for a modifying influence by the microbiota on neurodevelopment and perhaps on behaviour, have broadened the concept of the mind/body link to encompass the microbiota-gut-brain axis (12). However, progress in understanding and treating IBS has been limited by the absence of reliable biomarkers and IBS is still defined by symptoms. Moreover, the current approach to stratification of patients into clinical subtypes based on predominant symptoms (diarrhea-predominant (IBS-D) or constipation-predominant (IBS-C)) has significant limitations including failure to inform treatment of patients who alternate between subtypes sometimes within days (13). Pharmaceutical agents designed to tackle polar opposite symptoms have the potential for severe unwanted adverse effects if prescribed for a patient misclassified (14).
Investigations have been carried out into gut microbiota alterations in patients with bowel disorders compared to control groups (15-18). Interaction of the microbiome with diet, antibiotics and enteric infections, all of which may be involved in bowel disorders, is consistent with the hypothesis that microbiome alterations could activate or perpetuate pathophysiological mechanisms in the syndrome (19, 20). However, robust microbiome signatures or biomarkers that separate patients with bowel disorders from controls and that help inform therapies are lacking, though signatures have been suggested for IBS severity. Furthermore, most microbiota studies to date have employed 16S rRNA profiling, and did not analyse bacterial metabolites.
There is a requirement for further and improved methods for diagnosing bowel disorders such as BAM. There is also a requirement for further and improved methods for diagnosing bowel disorders such as BAM in patients already diagnosed with another disease with similar symptoms, for example IBS.
The inventors have developed new and improved methods for diagnosing bile acid malabsorption (BAM). A comprehensive and detailed analysis of the microbiome and the metabolome in patients and control (non-BAM) individuals has allowed new indicators of disease to be identified. The invention provides a method of diagnosing BAM in a patient comprising detecting: a bacterial species of a taxa associated with BAM and/or a metabolite associated with BAM.
As shown in the example, the inventors have developed methods for diagnosing bile acid malabsorption (BAM) that are effective and significantly cheaper, more accessible and safer than the 75Selenium (Se) homocholic acid taurine (SeHCAT) test. The SeHCAT test is the technique that is currently most widely used for diagnosing BAM, but it exposes patients to radiation, requires a clinical setting and is very expensive, unlike the methods of the invention. In the SeHCAT test, a capsule containing radiolabelled 75SeHCAT (with 370 kBq of Selenium-75 and less than 0.1 mg SeHCAT) is administered orally with water. Measurements are taken using an uncollimated gamma camera 1-3 hours after taking the capsule and then at 7 days. The percent retention of SeHCAT at 7 days is then calculated, with a 7-day SeHCAT retention value greater than 15% considered to be normal, and with values less than 15% signifying excessive bile acid loss, as found in bile acid malabsorption.
In one embodiment, the present invention provides a method for diagnosing patients with BAM. In a particular embodiment, the present invention provides a method for diagnosing patients with mild BAM. In a particular embodiment, the present invention provides a method for diagnosing patients with moderate BAM. Moderate BAM may be characterised by retention of 10% of the labelled bile acid analogue in the SeHCAT test. In a particular embodiment, the present invention provides a method for diagnosing patients with severe BAM. The data show that the methods of the invention are particularly effective for diagnosing severe BAM. Severe BAM may be characterised by retention of less than or equal to 5% of the labelled bile acid analogue in the SeHCAT test. In some embodiments, the method of the invention is for diagnosing BAM in patients that have been diagnosed with IBS. In some embodiments, the method of the invention is for diagnosing BAM in patients that have been diagnosed with IBS-M. In some preferred embodiments, the method of the invention is for diagnosing BAM in patients that have been diagnosed with IBS-D. In a particular embodiment, the method of the invention is for diagnosing severe BAM in patients that have been diagnosed with IBS.
In one embodiment, the method comprises diagnosing a patient as suffering from severe BAM based on their microbiota composition. In a particular embodiment, patients suffering from IBS and severe BAM have a distinct microbiota composition. In a particular embodiment, IBS patients suffering from severe BAM have a distinct microbiota composition to IBS patients with normal, mild, moderate or borderline BAM diagnoses.
In one embodiment, the present invention provides a method for diagnosing patients with BAM, comprising detecting a distinct fecal metabolome signature. In a particular embodiment, the present invention provides a method for diagnosing IBS patients with severe BAM, comprising detecting a distinct fecal metabolome signature. In one embodiment, machine learning is applied to fecal metabolome data to predict BAM.
In one embodiment, the present invention provides a method for diagnosing patients with BAM, comprising detecting one or more metabolites predictive of BAM. Generally, detecting a metabolite predictive of BAM or associated with BAM in the methods of the invention comprises measuring the concentration of the metabolite in a sample or measuring changes in the concentration of a metabolite and optionally comparing the concentration to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In a particular embodiment, detecting a metabolite predictive of BAM or associated with BAM in the methods of the invention comprises measuring the concentration of the metabolite in a sample or measuring changes in the concentration of a metabolite and comparing the concentration to a corresponding sample from a patient suffering from IBS. In one embodiment, the one or more metabolites predictive of BAM are selected from: PG(P-16:0/14:0), 2-Ethylsuberic acid, Glu-Glu-Gly-Tyr, 1,2,3-Tris(1-ethoxyethoxy)propane, PG(O-30:1), Ursodeoxycholic acid (UDCA), MG(22:2(13Z,16Z)/0:0/0:0), L-Lysine, O-Phosphoethanolamine, PE(22:5(7Z,10Z,13Z,16Z,19Z)/24:0) and/or Heptadecanoic acid. In another embodiment, the one or more metabolites predictive of BAM are selected from: 1,3-di-(5Z,8Z,11Z,14Z,17Z-eicosapentaenoyl)-2-hydroxy-glycerol (d5), Dimethyl benzyl carbinyl butyrate, 1-18:0-2-18:2-monogalactosyldiacylglycerol, PG(P-16:0/14:0), Glu-Glu-Gly-Tyr, PC(22:2(13Z,16Z)/15:0), PG(34:0), PE(18:3(6Z,9Z,12Z)/P-18:0), MG(22:2(13Z,16Z)/0:0/0:0), Arg-Ile-Gln-Ile, PE(22:5(7Z,10Z,13Z,16Z,19Z)/24:0), PC(18:1(9Z)/15:0), Thiophanate-methyl, Asn-Ser-His-His, 1,2,3-Tris(1-ethoxyethoxy)propane, PS(39:6), 2-Hydroxylauroylcarnitine, Hypoxanthine, Adenosine, PC(40:6), Asp-Phe-Phe-Val, 3-Dehydroxycarnitine, Inosine, PG(O-34:3), 11-Deoxocucurbitacin I, Methyl caprate, Linoleoyl ethanolamide, His-Met-Phe-Phe, 1-Decanol, Gravelliferone, Uridine, Arachidyl carnitine, Guanosine, Methyl nonylate, 3-Epidemissidine, Momordol, N-[2-(1H-Indol-3-yl)ethyl]docosanamide, Methyl caproate, Ascorbic acid, N-Acetyl-leu-leu-tyr, 4-Hydroxybutyric acid, [ST dimethyl(4:0/3:0)] (5Z,7E,17Z)-(1S,3R)-26,27-dimethyl-9,10-seco-5,7,10(19),17(20)-cholestatetraen-22-yne-1,3,25-triol, N-Methylindolo[3,2-b]-5alpha-cholest-2-ene and gamma-Glutamyl-S-methylcysteinyl-beta-alanine. In a preferred embodiment, the method comprises detecting ursodeoxycholic acid. In a preferred embodiment, the method comprises detecting L-lysine. In a preferred embodiment, the method comprises detecting 1,3-di-(5Z,8Z,11Z,14Z,17Z-eicosapentaenoyl)-2-hydroxy-glycerol (d5). In a preferred embodiment, the method comprises detecting Dimethyl benzyl carbinyl butyrate. In a preferred embodiment, the method comprises detecting 1-18:0-2-18:2-monogalactosyldiacylglycerol. In a preferred embodiment, the method comprises detecting PG(P-16:0/14:0). In a preferred embodiment, the method comprises detecting Glu-Glu-Gly-Tyr. In any such embodiments, detecting the metabolite comprises measuring the relative concentration of the metabolite in a sample, for example relative to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In a preferred embodiment, detecting the metabolite comprises measuring the relative concentration of the metabolite in a sample, for example relative to a corresponding sample from a patient suffering from IBS. In some embodiments, the method comprises detecting a precursor or breakdown product of the above metabolites.
In one embodiment, the present invention provides a method for diagnosing patients with BAM, comprising detecting an increase in the concentration of one or more metabolites predictive of BAM. In some embodiments, detecting a metabolite predictive of BAM or associated with BAM in the methods of the invention comprises measuring the concentration of the metabolite in a sample and optionally comparing the concentration to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In some embodiments, metabolites that are predictive of BAM have a higher concentration compared to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In a particular embodiment, detecting a metabolite predictive of BAM or associated with BAM in the methods of the invention comprises measuring the concentration of a metabolite and comparing the concentration to a corresponding sample from a patient suffering from IBS. In some embodiments, metabolites that are predictive of BAM have a higher concentration compared to a corresponding sample from a patient suffering from IBS. In a particular embodiment, the one or more metabolites that are predictive of BAM are selected from: 1,3-di-(5Z,8Z,11Z,14Z,17Z-eicosapentaenoyl)-2-hydroxy-glycerol (d5), Dimethyl benzyl carbinyl butyrate, PG(P-16:0/14:0), PG(34:0), PE(18:3(6Z,9Z,12Z)/P-18:0), Thiophanate-methyl, PS(39:6), Asp-Phe-Phe-Val, PG(O-34:3), 1-Decanol, 3-Epidemissidine and/or Momordol.
In one embodiment, the present invention provides a method for diagnosing patients with BAM, comprising detecting a decrease in the concentration of one or more metabolites predictive of a control individual (non-BAM). In some embodiments, detecting a metabolite predictive of BAM or associated with BAM in the methods of the invention comprises measuring the concentration of the metabolite in a sample and optionally comparing the concentration to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In some embodiments, metabolites that are predictive of BAM have a lower concentration compared to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In a particular embodiment, detecting a metabolite predictive of BAM or associated with BAM in the methods of the invention comprises measuring the concentration of a metabolite and comparing the concentration to a corresponding sample from a patient suffering from IBS. In some embodiments, metabolites that are predictive of BAM have a lower concentration compared to a corresponding sample from a patient suffering from IBS. In a particular embodiment, the one or more metabolites that are predictive of a control (non-BAM) individual are selected from: 1-18:0-2-18:2-monogalactosyldiacylglycerol, Glu-Glu-Gly-Tyr, PC(22:2(13Z,16Z)/15:0), MG(22:2(13Z,16Z)/0:0/0:0), Arg-Ile-Gln-Ile, PE(22:5(7Z,10Z,13Z,16Z,19Z)/24:0), PC(18:1(9Z)/15:0), Asn-Ser-His-His, 1,2,3-Tris(1-ethoxyethoxy)propane, 2-Hydroxylauroylcarnitine, Hypoxanthine, Adenosine, PC(40:6), 3-Dehydroxycarnitine, Inosine, 11-Deoxocucurbitacin I, Methyl caprate, Linoleoyl ethanolamide, His-Met-Phe-Phe, Gravelliferone, Uridine, Arachidyl carnitine, Guanosine, Methyl nonylate, N-[2-(1H-Indol-3-yl)ethyl]docosanamide, Methyl caproate, Ascorbic acid, N-Acetyl-leu-leu-tyr, 4-Hydroxybutyric acid, [ST dimethyl(4:0/3:0)] (5Z,7E,17Z)-(1S,3R)-26,27-dimethyl-9,10-seco-5,7,10(19),17(20)-cholestatetraen-22-yne-1,3,25-triol, N-Methylindolo[3,2-b]-5alpha-cholest-2-ene and/or gamma-Glutamyl-S-methylcysteinyl-beta-alanine.
In some embodiments, detecting a metabolite associated with BAM in the methods of the invention comprises measuring the concentration of a precursor of the metabolite and optionally comparing the concentration to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In some embodiments, detecting a metabolite associated with BAM in the methods of the invention comprises measuring the concentration of a breakdown product of the metabolite and optionally comparing the concentration to a corresponding sample from a control (non-BAM) individual or relative to a reference value. In certain embodiments, the method comprises detecting a bacterial taxa known to produce a metabolite predictive of BAM.
In one embodiment, the present invention provides a method for diagnosing patients with BAM, comprising detecting metabolites which are predictive of BAM selected from table 1 and/or table 7. In a particular embodiment, the present invention provides a method for diagnosing IBS patients with severe BAM, comprising detecting metabolites which are predictive of BAM selected from table 1 and/or table 7. In preferred embodiments, the method of the invention comprises detecting metabolites associated with fatty acid metabolism. In preferred embodiments, the method of the invention comprises detecting ursodeoxycholic acid. In one embodiment, machine learning is used to diagnose BAM. In any such embodiments, detecting the metabolite comprises measuring the relative concentration of the metabolite in a sample, for example relative to a corresponding sample from a control (non-BAM) individual or relative to a reference value.
The inventors have identified bacterial taxa that are associated with BAM, as demonstrated in the example. Accordingly, the invention provides methods for diagnosing BAM comprising detecting the presence of certain bacterial taxa. Preferably, these methods comprise detecting bacterial strains in a fecal sample from a patient. Alternatively, the bacteria (i.e. one or more bacterial strains) may be detected from an oral sample, such as a swab. Generally, detecting a bacterial taxa associated with BAM in the methods of the invention comprises measuring the relative abundance of the bacteria (i.e. one or more bacterial strains) in a sample, for example relative to a corresponding sample from a control (non-BAM) individual or relative to a reference value.
In one embodiment, the invention provides a method for diagnosing BAM, comprising detecting bacterial species of one or more of the following families: Lachnospiraceae, Bacteroidaceae, Ruminococcaceae, Bifidobacteriaceae, Prevotellaceae, Veillonellaceae and Coriobacteriaceae. In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacterial species of one or more of the following genera: Blautia, Bacteroides, Faecalibacterium, Oscillibacter, Ruminococcus, Bifidobacterium, Coprococcus, Paraprevotella, Gemmiger, Dialister and Megamonas. In any such embodiments, detecting the bacteria (i.e. one or more bacterial strains) comprises measuring the relative abundance of the bacteria in a sample, for example relative to a corresponding sample from a control (non-BAM) individual or relative to a reference value. The examples demonstrate that methods detecting these bacteria are particularly effective. The bacterial taxa used in the invention may be defined with reference to 16S rRNA gene sequences, or the invention may use Linnaean taxonomy. Bacteria of either category of taxa may be detected using clade-specific bacterial genes, 16S sequences, transcriptomics, metabolomics, or a combination of such techniques. In certain embodiments, the bacteria (i.e. one or more bacterial strains) may be detected using clade-specific bacterial genes, 16S sequences, transcriptomics or metabolomics.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting one or more bacterial strains belonging to an operational taxonomic unit (OTU) associated with BAM. As is well known in the art, an operational taxonomic unit (OTU) is an operational definition used to classify groups of closely related individuals. As used herein, an “OTU” is a group of organisms which are grouped by DNA sequence similarity of a specific taxonomic marker gene (39). In some embodiments, the specific taxanomic marker gene is the 16S rRNA gene. In some embodiments, the Ribosomal Database Project (RDP) taxonomic classifier is used to assign taxonomy to representative OTU sequences. For example, the sequence information in Table 3 can be used to classify whether bacteria (i.e. one or more bacterial strains) belong to the OTUs listed in Table 2.
Bacteria having at least 97% sequence identity to the sequences in Table 3 belong to the corresponding OTUs in Table 2. In preferred embodiments, the OTU is selected from table 2. In any such embodiments, detecting the bacteria (i.e. one or more bacterial strains) comprises measuring the relative abundance of the bacteria in a sample, for example relative to a corresponding sample from a control (non-BAM) individual or relative to a reference value.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting one or more bacterial strains belonging to an operational taxonomic unit (OTU) associated with BAM. In preferred embodiments, the OTU is selected from table 2. In one embodiment, the OTU associated with BAM is classified as belonging to one of the following phyla: Firmicutes, Bacteroidetes or Actinobacteria. In a particular embodiment, the OTU associated with BAM is classified as belonging to one of the following classes: Clostridia, Bacteroidia, Actinobacteria or Negativicutes. In a particular embodiment, the OTU associated with BAM is classified as belonging to one of the following orders: Clostridiales, Bacteroidales, Selenomonadales or Coriobacteriales. In a particular embodiment, the OTU associated with BAM is classified as belonging to one of the following families: Lachnospiraceae, Bacteroidaceae, Ruminococcaceae, Bifidobacteriaceae, Prevotellaceae, Veillonellaceae or Coriobacteriaceae. In a particular embodiment, the OTU associated with BAM is classified as belonging to one of the following genera: Blautia, Bacteroides, Faecalibacterium, Oscillibacter, Lachnospiracea_incertae_sedis, Ruminococcus2, Bifidobacterium, Coprococcus, Paraprevotella, Gemmiger, Dialister or Megamonas.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacterial strains belonging to one or more OTUs listed in Table 2. The sequences in Table 3 can be used to classify bacteria (i.e. one or more bacterial strains) as belonging to the OTUs listed in Table 2. Bacteria (i.e. one or more bacterial strains) having at least 97% sequence identity to the sequences in Table 3 belong to the corresponding OTUs in Table 2. The alignment is across the length of the sequence. In both Metaphlan2 and HUMAnN2 runs, alignment for species composition is done using bowtie 2. Bowtie2 is run with “very-sensitive argument” and the alignment performed is “Global alignment”.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 1. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Blautia genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 2. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Bacteroides genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 3. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 4. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Faecalibacterium genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 5. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Oscillibacter genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 6. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiracea genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 6. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 7. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 8. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 9. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcaceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 10. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcus genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 11. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 12. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Bifidobacterium genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 13. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Coprococcus genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 14. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 15. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Paraprevotella genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 16. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Bacteroides genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 17. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcaceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 18. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Gemmiger genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 19. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcaceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 20. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Dialister genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 21. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 22. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 23. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 24. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 25. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Faecalibacterium genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 26. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 27. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Megamonas genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 28. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Coriobacteriaceae family.
In preferred embodiments, the invention provides a method for diagnosing BAM, comprising detecting different bacteria (i.e. one or more bacterial strains) having 16S rRNA gene sequences at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to two or more of SEQ ID No:1-28, such as 5, 8, or all of SEQ ID No:1-28.
In one embodiment, the present invention provides a further step of diagnosing IBS, comprising detecting bacterial strains belonging to one or more OTUs listed in Table 5. The sequences in Table 6 can be used to classify bacteria (i.e. one or more bacterial strains) as belonging to the OTUs listed in Table 5. Bacteria (i.e. one or more bacterial strains) having at least 97% sequence identity to the sequences in Table 6 belong to the corresponding OTUs in Table 5. The alignment is across the length of the sequence. In both Metaphlan2 and HUMAnN2 runs, alignment for species composition is done using bowtie 2. Bowtie2 is run with “very-sensitive argument” and the alignment performed is “Global alignment”.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 29. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 30. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Firmicutes phylum.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 31. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Butyricicoccus genus.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 32. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 33. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Clostridiales order.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 34. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcaceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 35. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcaceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 36. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Firmicutes phylum.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No: 37. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Ruminococcaceae family.
In one embodiment, the present invention provides a method for diagnosing BAM, comprising detecting bacteria (i.e. one or more bacterial strains) having a 16S rRNA gene sequence at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to SEQ ID No:38. In certain such embodiments, the bacteria (i.e. one or more bacterial strains) is classified as belonging to the Lachnospiraceae family.
In preferred embodiments, the invention provides a method for diagnosing BAM, comprising detecting different bacteria (i.e. one or more bacterial strains) having 16S rRNA gene sequences at least 97% (e.g. 98%, 99%, 99.5% or 100%) identical to two or more of SEQ ID No: 29-38, such as 5, 8, or all of SEQ ID No: 29-38.
In certain embodiments, the bacterial species belongs to a sequence-based taxon. In preferred embodiments, the sequence-based taxon is selected from table 2.
In preferred embodiments, the method for diagnosing BAM comprises detecting at least one metabolite as set out above and detecting at least one bacterial strain or species as set out above. While the metabolomics model performed with 100% accuracy for severe and moderate BAM, the OTU model resulted in fewer misclassifications (five) compared to the fecal metabolomics model (nine). There was no overlap in misclassified subjects between the models, indicating that a combined microbiome-metabolome model would increase BAM prediction accuracy.
In one embodiment, the present invention provides a method for diagnosing BAM in patients already diagnosed with a disease that is co-morbid with BAM. In another embodiment, the diagnosis of BAM using the BAM metabolomic signature distinguishes patients suffering from BAM to patients suffering from other diseases, for example diseases that are co-morbid with BAM.
In a particular embodiment, the present invention provides a method for diagnosing BAM in patients already diagnosed with inflammatory bowel disease, e.g. ulcerative colitis or Crohn's disease. In a particular embodiment, the diagnosis of BAM using the BAM metabolomic signature distinguishes patients suffering from BAM to patients suffering from other diseases, for example inflammatory bowel disease, e.g. ulcerative colitis or Crohn's disease.
In one embodiment, the present invention provides a method for diagnosing BAM in patients already diagnosed with anorexia nervosa. In another embodiment, the diagnosis of BAM using the BAM metabolomic signature distinguishes patients suffering from BAM to patients suffering from other diseases, for example anorexia nervosa.
In certain embodiments, the method for diagnosing BAM comprises detecting one or more bacterial species and one or more metabolite.
Integrative Analysis of Diet, Microbiome and Metabolome in BAM Patients
In certain embodiments, the invention provides a method of diagnosing BAM comprising one or more of i) detecting a bacterial species, for example as discussed above, ii) detecting metabolites, for example as discussed above. In any such embodiments, detecting the bacteria, gene or metabolite comprises measuring the abundance or concentration of said marker in a sample, for example the relative to a corresponding sample from a control (non-BAM) individual or relative to a reference value.
Diagnostic Methods
The inventors have developed new and improved methods for diagnosing BAM.
In preferred embodiments, the methods of the invention are for use in diagnosing a patient resident in Europe, such as Northern Europe, preferably Ireland or a patient that has a European, Northern European or Irish diet. The examples demonstrate that the methods of the invention are particular effective for such patients. In other embodiments, the patient may be resident in the United States of America.
In certain embodiments of any aspect of the invention, the abundance of bacteria, genes or metabolites is assessed relative to control (non-BAM) individuals. Such reference values may be generated using any technique established in the art.
In certain embodiments of any aspect of the invention, comparison to a corresponding sample from a control (non-BAM) individual is a comparison to a corresponding sample from a healthy individual.
Preferably the method of diagnosing BAM has a sensitivity of greater than 40% (e.g. greater than 45%, 50% or 52%, e.g. 53% or 58%) and a specificity of greater than 90% (e.g. greater than 93% or 95%, e.g. 96%).
In certain embodiments, the method of diagnosis is a method of monitoring the course of treatment for BAM.
In certain embodiments, the step of detecting the presence or abundance of bacteria, such as in a fecal sample, comprises a nucleic acid based quantification methodology, for example 16s rRNA gene amplicon sequencing. Methods for qualitative and quantitative determination of bacteria in a sample using 16s rRNA gene amplicon sequencing are described in the literature and will be known to a person skilled in the art. Other techniques may involve PCR, rtPCR, qPCR, high throughput sequencing, metatranscriptomic sequencing, or 16S rRNA analysis.
In alternative aspects of any embodiment of the invention, the invention provides a method for diagnosing the risk of developing BAM.
In any embodiment of the invention, modulated abundance of a bacterial strain, species or metabolite is indicative of BAM. In preferred embodiments, the abundance of the bacterial strain, species or OTU as a proportion of the total microbiota in the sample is measured to determine the relative abundance of the strain, species or OTU. In preferred embodiments, the concentration of a metabolites is measured. In preferred embodiments, the abundance of bacterial strains as a proportion of the total microbiota in the sample is measured to determine the relative abundance of the strains. Then, in such preferred embodiments, the relative abundance of the bacterium or OTU or the concentration of the metabolite or gene sequence in the sample is compared with the relative abundance or concentration in the same sample from a reference control (non-BAM) individual. A difference in relative abundance of the bacterium or OTU in the sample, e.g. a decrease or an increase, compared to the reference is a modulated relative abundance. As explained herein, detection of modulated abundance can also be performed in an absolute manner by comparing sample abundance values with absolute reference values. Therefore, the invention provides a method of determining BAM status in an individual comprising the step of assaying a biological sample from the individual for a relative abundance of one or more BAM-associated bacteria and/or a modulated concentration of a metabolite, wherein a modulated relative abundance of the bacteria or modulated concentration of a metabolite is indicative of BAM. Similarly, the invention provides a method of determining whether an individual has an increased risk of having BAM comprising the step of assaying a biological sample from the individual for a relative abundance of one or more BAM-associated oral bacteria or BAM-associated metabolites, wherein modulated relative abundance or concentration is indicative of an increased risk.
In any embodiment of the invention, detecting bacteria may comprise detecting “modulated relative abundance”. As used herein, the term “modulated relative abundance” as applied to a bacterium or OTU in a sample from an individual should be understood to mean a difference in relative abundance of the bacterium or OTU in the sample compared with the relative abundance in the same sample from a reference control (non-BAM) individual (hereafter “reference relative abundance”). In one embodiment, the bacterium or OTU exhibits increased relative abundance compared to the reference relative abundance. In one embodiment, the bacterium or OTU exhibits decreased relative abundance compared to the reference relative abundance. Detection of modulated abundance can also be performed in an absolute manner by comparing sample abundance values with absolute reference values. In one embodiment, the reference abundance values are obtained from age and/or sex matched individuals. In one embodiment, the reference abundance values are obtained from individuals from the same population as the sample (i.e. Celtic origin, North African origin, Middle Eastern origin). Method of isolating bacteria from oral and fecal sample are described below, as are methods for detecting abundance of bacteria (i.e. one or more bacterial strains). Any suitable method may be employed for isolating specific species or genera of bacteria, which methods will be apparent to a person skilled in the art. Any suitable method of detecting bacterial abundance may be employed, including agar plate quantification assays, fluorimetric sample quantification, qPCR, 16S rRNA gene amplicon sequencing, and dye-based metabolite depletion or metabolite production assays.
The invention also provides kits comprising reagents for performing the methods of the invention, such as kits containing reagents for detecting one or more, such as two or more of the bacterial species, genes or metabolites set out above. Also provided are kits that find use in practicing the subject methods of diagnosing BAM, as mentioned above. The kit may be configured to take a biological sample from an individual, for example a urine sample or a fecal sample. The individual may be suspected of having BAM. The individual may be suspected of being at increased risk of having BAM. A kit can comprise a sealable container configured to receive the biological sample. A kit can comprise polynucleotide primers. The polynucleotide primers may be configured for amplifying a 16S rRNA polynucleotide sequence from at least one BAM-associated bacterium to form an amplified 16S rRNA polynucleotide sequence. A kit may comprise a detecting reagent for detecting the amplified 16S rRNA sequence. A kit may comprise instructions for use.
Summary
Background & Aims: Diagnosis of BAM is based on SeHCAT analysis. Some patients have an alteration in their microbiota. Therefore, microbiome and metabolomic profiling was conducted to identify biomarkers for the condition.
Methods:
Anthropometric, medical and dietary information were collected with fecal samples for microbiome and metabolomic analyses. Shotgun and 16S rRNA amplicon sequencing were performed on feces, and fecal metabolites were analysed by gas chromatography (GC)—and liquid chromatography (LC) mass spectrometry (MS). Bile acid malabsorption (BAM) was identified in patients with diarrhea by retention of radiolabelled 75Selenium (Se) homocholic acid taurine (SeHCAT).
Results: BAM was accurately distinguished within IBS by fecal metabolomics.
Conclusion: BAM can be identified by species-, metagenomics and fecal metabolomic-signatures which are from those of IBS. These findings are useful for diagnosing BAM and for developing precision therapeutics for IBS and BAM.
Materials and Methods
Subject recruitment: Eighty patients aged 16-70 years with IBS meeting the Rome IV criteria were recruited at Cork University Hospital. Clinical subtyping of the patients (21) was as follows: IBS with constipation (IBS-C), mixed IBS (IBS-M) or IBS with diarrhea (IBS-D). Sixty-five controls of the same age range and of the same ethnicity and geographic region were recruited. Descriptive statistics for the study population are presented in Table 4.
Exclusion criteria included the use of antibiotics within 6 weeks prior to study enrolment, other chronic illnesses including gastrointestinal diseases, severe psychiatric disease, abdominal surgery other than hernia repair or appendectomy. Standard-of-care blood analysis was carried out on all participants if recent results were not available, and all subjects were tested by serology to exclude coeliac disease. The inclusion/exclusion criteria for the control population were the same as for the IBS population with the exception of having to fulfil the Rome IV criteria for IBS. Gastrointestinal (GI) symptom history, psychological symptoms, diet, medical history and medication data were collected on each participant (both IBS and controls) and using the following questionnaires: Bristol Stool Score (BSS), Hospital Anxiety and Depression Scale (HADS) (22); Food Frequency Questionnaire (FFQ) (23). IBS-D and IBS-M patients reporting diarrhoea as well as a subset of consenting control subjects were assessed for bile acid malabsorption by SeHCAT, a radiolabelled synthetic bile acid which is used to clinically diagnosis of BAM which is not metabolized by bacteria and passes through the enterohepatic circulation as endogenous bile acids. Ethical approval for the study was granted by the Cork Research Ethics Committee (protocol number: 4DC001) before commencing the study and all participants provided written informed consent to take part.
Sample collection: Fecal and urine samples were collected from all participants for microbiome and metabolomics profiling. Subjects collected a freshly voided fecal sample at home using a collection kit and brought the sample to the clinic that day, when a fresh urine sample was collected. Samples were kept at 4° C. until brought to the laboratory for storage at −80° C. which was within a few hours of the sample collection.
Microbiome profiling and metagenomics: Genomic DNA was extracted and amplified from frozen fecal samples (0.25 g) using the method described by Browne et al. (24). The modifications from the methods described by Brown et al (18) included bead beating tubes consisted of 0.5 g of 0.1 mm zirconia beads and 4×3.5 mm glass beads. Fecal samples were homogenised via bead beating for 3×60 s cycles and cooled on ice between each cycle. Genomic DNA was visualised on 0.8% agarose gel and quantified using the SimpliNano Spectrometer (Biochrom™, US). The PCR master mix used 2× Phusion Taq High-Fidelity Mix (Thermo Scientific, Ireland) and 15 ng of DNA. The resulting PCR products were purified, quantified and equimolar amounts of each amplicon were then pooled before being sent for sequencing to the commercial supplier (GATC Biotech AG, Konstanz, Germany) on the MiSeq (2×250 bp) chemistry platforms. Sequencing was performed by GATC Biotech, Germany on an Illumina MiSeq instrument using a 2×250 bp paired end sequencing run.
Microbiome profiling and metagenomics—16S amplicon sequencing: Using the Qiagen DNeasy Blood & Tissue Kit and following the manufacturer's instructions, microbial DNA was extracted from 0.25 g of each of 144 frozen fecal samples (IBS: n=80 and control (n=64). No fecal sample was available for one control subject. The 16S rRNA gene amplicons preparation and sequencing was carried out using the 16S Sequencing Library Preparation Nextera protocol developed by Illumina (San Diego, Calif., USA). 15 ng of each of the DNA fecal extracts was amplified using PCR and primers targeting the V3-V4 variable region of the 16S rRNA gene using the following gene-specific primers:
The amplicon size was 531 bp. The products were purified and forward and reverse barcodes were attached by a second round of adapter PCR.
Microbiome profiling and metagenomics—Shotgun sequencing: Genomic DNA was extracted as described above. The DNA quality was checked on 0.8% agarose gel and quantified using the Simplinano (Thermo Scientific, Ireland). For shotgun sequencing, 1 μg (concentration>5 ng/μl) of high molecular weight DNA for each sample was sent to GATC Biotech, Germany for sequencing on Illumina HiSeq platform (HiSeq 2500) using 2×250 bp paired-end chemistry. This returned 2,714,158,144 raw reads (2,612,201,598 processed reads) of which 45.6% were mapped to an average of 222,945 gene families per sample with a mean count value of 8,924,302±2,569,353 per sample.
Bioinformatics analysis (16S amplicon sequencing): Miseq 16S sequencing data was returned for 144 subjects. Data generated for 3 samples (2 IBS and 1 control) were removed as the number of reads returned from sequencing was too low for analysis, leaving 141 samples (control: n=63, IBS n=78). Raw amplicon sequence data were merged and the reads trimmed using the flash methodology (25). The USEARCH pipeline was used to generate the OTU table (26). The UPARSE algorithm was used to cluster the sequences into OTUs at 97% similarity (27). UCHIME chimera removal algorithm was used with Chimeraslayer to remove chimeric sequences (28). The Ribosomal Database Project (RDP) taxonomic classifier was used to assign taxonomy to the representative OTU sequences (26) and microbiota compositional (abundance and diversity) information was generated.
Bioinformatics analysis (Shotgun metagenomic sequencing): For shotgun metagenomics, 6 control samples were not sequenced due to data not passing QC or no sample available (control: n=59; IBS n=80). The number of raw read pairs obtained after sequencing, varied from 5,247,013 to 21,280,723 (Mean=9,763,159±2,408,048). Reads were processed in accordance with the Standard Operating Procedure of Human Microbiome Project (HMP) Consortium (29). Metagenomic composition and functional profiles were generated using HUMAnN2 pipeline (30). For each sample, multiple profiles were obtained, including: microbial composition profiles from clade-specific gene information (using MetaPhlAn2) and Gene family abundance.
Fecal GC/LC MS: 1 g samples of frozen feces were sent on dry ice to Metabolomic Discoveries (now Metabolon), Potsdam, Germany. For LC-MS, the samples were dried and resuspended to a final concentration of 10 mg per 400 μl before analysis. GC-MS and SCFA analysis were performed using wet samples. Untargeted metabolomics analysis was performed using liquid chromatography (LC) and Solid Phase Microextraction (SPME) gas chromatography (GC) and metabolites were identified using electrospray ionization mass spectrometry (ESI-MS). SCFA analysis was also performed by LC-tandem mass spectrometry.
Bioinformatics analysis of fecal metabolome data: Fecal MS metabolomics data was returned for all IBS subjects (n=80) and all but 2 controls (n=63) as these did not pass QC or no sample was available. 2,933 metabolites were returned from untargeted fecal metabolomics analysis carried out by the service provider of which 753 were identified. Metabolites identified using LC-MS were not normalized, since the fecal samples were already normalized with dry weight (10 mg per 400 μl) during sample preparation. Metabolites identified using GC-MS were normalized with corresponding sample wet weights. Only the identified metabolites were considered for further analyses. Machine learning analysis was carried out as described below. Summary statistics for all datasets were generated using the Wilcoxon rank sum test with q-value adjustment for multiple testing.
BAM SeHCAT assay: SeHCAT was administered at Cork University Hospital as a single capsule dose containing less than 0.1 mg of tauroselcholic acid (GE Healthcare, UK) and with a radioactivity dose of 370 kBq at the reference date. A baseline whole-body absorption reading using an uncollimated gamma counter (Siemens Ecam camera) was taken for each subject 2-3 hours after capsule administration. A follow-up scan was taken 7 days later and the proportion of bile acid retention was calculated; a value of <15% retention indicated mild to severe BAM with a SeHCAT score of 15-20% representing a borderline classification as discussed by Watson et al. (31).
Machine learning: An in-house machine learning pipeline was applied to each datatype (16S, shotgun and BAM-fecal MS metabolomics) using a twostep approach applying the Least Absolute Shrinkage and Selection Operator (LASSO) feature selection followed by Random Forest (RF) modelling (32). The models were implemented using R software version 3.4.0, using package glmnet version 2.0-10 for LASSO feature selection, and RF package randomForest version 4.6-12.
First, feature selection was performed using the LASSO algorithm to improve accuracy and interpretability of models by efficiently selecting the relevant features. This process was tuned by parameter lambda, which was optimized for each dataset using a grid search. The training data was filtered to include only the features selected by the LASSO algorithm, and RF was then used for modelling whereby 1500 trees were built. Both LASSO feature selection and RF modelling were performed using 10-fold cross validation (CV), which generated an internal 10-fold prediction yielding an optimal model that predicts the IBS or Control classification of samples.
For BAM-fecal metabolomics data analysis, machine learning was performed in a similar manner with the only difference being that instead of ten-fold cross-validation, Leave-One-Out (LOO) CV was used at every cross-validation step
Model 1; BAM (borderline to severe BAM or SeHCAT retention <20%) or Normal bile acid (SeHCAT retention >20%) for IBS and control subjects.
Model 2; BAM (mild to severe BAM or SeHCAT retention <15%) or Normal bile acid (SeHCAT retention >20%) for IBS subjects only.
Co-inertia analysis of the data types: The microbiome derived datasets were Hellinger transformed. Co-inertia analysis was performed using ade4 (v. 1.7.2) package in R (v 3.2.0). Principal component analysis (PCA) was performed on each of the profile in the comparison pair, followed by co-inertia analysis on these PCA objects on the first 5 principal axes. Significance of the co-inertia was calculated by permutation test using the randtest function.
Results
IBS Patients with Bile Acid Malabsorption have Altered Fecal Microbiome
Since some patients with IBS-D may have bile acid malabsorption (BAM) which will influence transit time and possibly microbiota composition, 19/21 patients with IBS-D, 26/29 subjects with IBS-M and 9/65 controls were tested for SeHCAT retention, the gold standard for identifying BAM (33). Failure to retain >15% of the labelled bile acid analogue was classified as BAM, 10-15% retention classified as mild BAM, 5-10% as moderate BAM and <5% as severe BAM (
Eighteen of the 45 IBS patients (54%) tested were diagnosed with BAM, of which 4 had severe BAM, 7 had moderate BAM and 7 had mild BAM. A further 5 patients were borderline BAM (16-20% retention). Using the 15% threshold, a positive BAM classification was reported in 40% of the IBS population tested. Mild BAM was identified in one control subject, who was subsequently diagnosed with IBS. As expected, a positive BAM diagnosis was more common in IBS-D (74%) than in IBS-M (35%) (p-value=0.03) (
Only IBS patients in the severe BAM category showed a distinct separation in their microbiota from the microbiota of patients with normal, mild, moderate, or borderline BAM diagnoses (using analysis as set out, see
Machine learning applied to the microbiome OTU dataset identified BAM (AUC: 0.95, sensitivity: 0.88 and specificity: 0.93) (Table 2). While the metabolomics model performed with 100% accuracy for severe and moderate BAM, the OTU model resulted in fewer misclassifications (five) compared to the fecal metabolomics model (nine). There was no overlap in misclassified subjects between the models, indicating that a combined microbiome-metabolome model would increase BAM prediction accuracy.
Discussion
It has been shown that the subset of IBS-D and IBS-M patients that have severe bile acid malabsorption have an altered microbiome and fecal metabolome.
The microbiome among IBS clinical subtypes does not significantly differ, and the clinical utility of assigning patients to these categories is debatable. However, a subset of IBS-D and IBS-M patients with BAM were identified who were distinguishable by a microbiome and metabolomic signature. Others have reported altered microbiota in IBS-D but did not stratify for BAM (15). It is also noteworthy that transit time (reflected by Bristol Stool Score) is a major co-variate with microbiota composition (37) but the tendency for IBS patients to alternate between the mixed, constipation and diarrhea subtypes, may mask or ‘average out’ microbiota associations with transit time. Regardless, all three subtypes can be distinguished from controls by a common fecal microbiome signature.
BAM was detected by SeHCAT in over half of the combined IBS-D and M subjects tested. Differences in the microbiome were most evident in the severe BAM group. The unrecognized presence of appreciable numbers of subjects with BAM may have contributed to low treatment success rates compared to placebo in previous trials of various IBS therapeutics (38). While subjects in the severe BAM category had a significantly altered microbiome, a fecal metabolomics signature was identified for all BAM-diagnosed subjects. This fecal metabolomics signature for BAM will readily have clinical application as it requires instrumentation that is more convenient, more accessible and less expensive than SeHCAT (which is not currently available in the USA).
Materials and Methods
Subject recruitment: Eighty patients aged 16-70 years with IBS meeting the Rome IV criteria were recruited at Cork University Hospital. Clinical subtyping of the patients (21) was as follows: IBS with constipation (IBS-C), mixed IBS (IBS-M) or IBS with diarrhea (IBS-D). Sixty-five controls of the same age range and of the same ethnicity and geographic region were recruited. Descriptive statistics for the study population are presented in Table 4.
Exclusion criteria included the use of antibiotics within 6 weeks prior to study enrolment, other chronic illnesses including gastrointestinal diseases, severe psychiatric disease, abdominal surgery other than hernia repair or appendectomy. Standard-of-care blood analysis was carried out on all participants if recent results were not available, and all subjects were tested to exclude coeliac disease. The inclusion/exclusion criteria for the control population were the same as for the IBS population with the exception of having to fulfil the Rome IV criteria for IBS. Gastrointestinal (GI) symptom history, psychological symptoms, diet, medical history and medication data were collected on each participant (both IBS and controls) and using the following questionnaires: Bristol Stool Score (BSS), Hospital Anxiety and Depression Scale (HADS) (22); Food Frequency Questionnaire (FFQ) (23). IBS-D and IBS-M patients reporting diarrhoea as well as a subset of consenting control subjects were assessed for bile acid malabsorption by SeHCAT, a radiolabelled synthetic bile acid which is used to clinically diagnosis of BAM which is not metabolized by bacteria and passes through the enterohepatic circulation as endogenous bile acids. Ethical approval for the study was granted by the Cork Research Ethics Committee (protocol number: 4DC001) before commencing the study and all participants provided written informed consent to take part.
Sample collection: Fecal and urine samples were collected from all participants for metabolomics profiling. Subjects collected a freshly voided fecal sample at home using a collection kit and brought the sample to the clinic that day, when a fresh urine sample was collected. Samples were kept at 4° C. until brought to the laboratory for storage at −80° C. which was within a few hours of the sample collection.
Fecal GC/LC MS: 1 g samples of frozen feces were sent on dry ice to Metabolomic Discoveries (now Metabolon), Potsdam, Germany. For LC-MS, the samples were dried and resuspended to a final concentration of 10 mg per 400 μl before analysis. GC-MS and SCFA analysis were performed using wet samples. Untargeted metabolomics analysis was performed using liquid chromatography (LC) and Solid Phase Microextraction (SPME) gas chromatography (GC) and metabolites were identified using electrospray ionization mass spectrometry (ESI-MS). SCFA analysis was also performed by LC-tandem mass spectrometry.
Bioinformatics analysis of fecal metabolome data: Fecal MS metabolomics data was returned for all IBS subjects (n=80) and all but 2 controls (n=63) as these did not pass QC or no sample was available. 2,933 metabolites were returned from untargeted fecal metabolomics analysis carried out by the service provider of which 753 were identified. Metabolites identified using LC-MS were not normalized, since the fecal samples were already normalized with dry weight (10 mg per 400 μl) during sample preparation. Metabolites identified using GC-MS were normalized with corresponding sample wet weights. Only the identified metabolites were considered for further analyses. Machine learning analysis was carried out as described below. Summary statistics for all datasets were generated using the Wilcoxon rank sum test with q-value adjustment for multiple testing.
BAM SeHCAT assay: SeHCAT was administered at Cork University Hospital as a single capsule dose containing less than 0.1 mg of tauroselcholic acid (GE Healthcare, UK) and with a radioactivity dose of 370 kBq at the reference date. A baseline whole-body absorption reading using an uncollimated gamma counter (Siemens Ecam camera) was taken for each subject 2-3 hours after capsule administration. A follow-up scan was taken 7 days later and the proportion of bile acid retention was calculated; a value of <15% retention indicated mild to severe BAM with a SeHCAT score of 15-20% representing a borderline classification as discussed by Watson et al (2015) (31).
Machine learning: An in-house machine learning pipeline was applied to the fecal metabolomic data. The machine learning pipeline used in this example is similar to the machine learning pipeline used in Example 1, but comprised additional optimization and validation steps, using a two step approach within a ten-fold cross-validation. Within each validation fold Least Absolute Shrinkage and Selection Operator (LASSO) feature selection was carried out followed by Random Forest (RF) modelling and an optimised model was validated against the cross validation test data which is external to the cross-validation training subset.
The models were implemented using R software version 3.4.0, using package glmnet version 2.0-10 for LASSO feature selection, and RF package randomForest version 4.6-12.
The fecal metabolome sample profiles were log10 transformed before they were analysed in the machine learning pipeline. Only IBS samples having SeHCAT information were transformed. Samples with borderline BAM were then removed, and the remaining samples classified as BAM (19 samples) or Normal (21 samples). The classified samples were then analysed in the machine learning pipeline.
After determination of the lambda (λ) range, the samples were assigned weights based on their class probabilities. The weights assigned to the training samples in this step were used in all subsequent applicable steps.
A LASSO algorithm substantially as described in Example 1 was then applied to the weighted training samples. In this example, the LASSO algorithm used the previously calculated optimal lambda (λ) range, and used the Caret (version 6.0-84 in this example) and glmnet (version 2.0-18 in this example) packages, The ROC AUC (receiver operating characteristic, area under curve) metric was calculated using Leave-One-Out cross validation. Leave-One-Out cross validation was used to maximise the number of samples available for model optimization. The feature coefficients identified by the optimized LASSO algorithm were extracted and features with non-zero coefficients were selected for further analysis. In
Following feature selection using LASSO, an optimized random forest classifier (with 1500 trees) was generated using the selected features. Random forest generation was performed using Caret (version 6.0-84) and internal cross validation, by tuning the ‘mtry’ parameter to maximise the ROC AUC metric. The optimized random forest classifier was then applied to the test set and the performance of the classifier was calculated via the AUC, sensitivity, and specificity metrics.
Results
Fecal Metabolome is Predictive of BAM Classes
Fecal metabolome profile was investigated for its predictive ability to classify samples as BAM or non-BAM. Cross-validation was Leave-One-Out CV. Leave-One-Out CV was used to ensure the maximal number of samples available for model optimization.
Machine learning to the fecal metabolome dataset of the IBS patients who underwent the SeHCAT assay, but excluding patients with a borderline BAM diagnosis. The predictive model successfully identified the subjects that had BAM with an AUC of 0.85 in all three BAM grades. The model performed with 100% accuracy for severe BAM, 75% for moderate BAM and 43% for mild BAM. The performance summary, and feature details are described in table 7 and shown in
The classification threshold was optimized to achieve maximum sensitivity and specificity using pROC package (version 1.15.0) and Youden J score. The obtained optimized values for Sensitivity and Specificity were 0.684, and 0.904, respectively.
The metabolites identified using this pipeline as predictive for BAM are listed in Table 7. Among the main predictive metabolites were a range of glycerophospholipids. Elevated levels of these compounds have been associated with altered fatty acid metabolism and disease. Among the main predictive metabolites for BAM were 1,3-di-(5Z,8Z,11Z,14Z,17Z-eicosapentaenoyl)-2-hydroxy-glycerol (d5) and dimethyl benzyl carbinyl butyrate.
Discussion
It has been shown via machine learning analysis that fecal metabolome is predictive of BAM status in IBS. It is shown that the subset of IBS-D and IBS-M patients with bile acid malabsorption have an altered fecal metabolome that can potentially be used to distinguish these subjects without requiring a SeHCAT test.
The microbiome among IBS clinical subtypes does not significantly differ, and the clinical utility of assigning patients to these categories is debatable. However, a subset of IBS-D and IBS-M patients with BAM were identified who were distinguishable by metabolomic signature. Others have reported altered microbiota in IBS-D but did not stratify for BAM (15). BAM was detected by SeHCAT in over half of the combined IBS-D and M subjects tested. Differences in the microbiome were most evident in the severe BAM group. The unrecognized presence of appreciable numbers of subjects with BAM may have contributed to low treatment success rates compared to placebo in previous trials of various IBS therapeutics (38). While subjects in the severe BAM category had a significantly altered microbiome, a fecal metabolomics signature was identified for all BAM-diagnosed subjects. This fecal metabolomics signature for BAM will readily have clinical application as it requires instrumentation that is more convenient, more accessible and less expensive than SeHCAT (which is not currently available in the USA).
The above described pipeline for recognising the fecal metabolomics signature of BAM will also have clinical application as it similarly utilises instrumentation that is more convenient, more accessible and less expensive than SeHCAT.
The findings of the current study have clinical implications. A fecal metabolomic profile has been linked with BAM which can accurately distinguish it from non-BAM related IBS.
The taxa and metabolites that distinguish BAM subjects from non-BAM related IBS subjects identified here may be targeted by a range of microbiota-directed therapies such as fecal transplants, antibiotics, probiotics or live biotherapeutics.
Blautia
Bacteroides
Faecalibacterium
Oscillibacter
Lachnospiracea_incertae_sedis
Ruminococcus2
Bifidobacterium
Coprococcus
Paraprevotella
Bacteroides
Gemmiger
Dialister
Faecalibacterium
Megamonas
Butyricicoccus
Number | Date | Country | Kind |
---|---|---|---|
19167114.8 | Apr 2019 | EP | regional |
19167118.9 | Apr 2019 | EP | regional |
1909052.1 | Jun 2019 | GB | national |
1915143.0 | Oct 2019 | GB | national |
1915156.2 | Oct 2019 | GB | national |
This application is a continuation of International Application No. PCT/EP2020/059460, filed Apr. 2, 2020, which claims the benefit of European Application No. 19167114.8, filed Apr. 3, 2019, European Application No. 19167118.9, filed Apr. 3, 2019, Great Britain Application No. 1909052.1, filed Jun. 24, 2019, Great Britain Application No. 1915143.0, filed Oct. 18, 2019, and Great Britain Application No. 1915156.2, filed Oct. 18, 2019, all of which are hereby incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/EP2020/059460 | Apr 2020 | US |
Child | 17491564 | US |