Autism spectrum disorder (ASD) is a complex group of developmental disorders characterized by impaired social interactions and communication together with repetitive behaviors. The purpose of this study is to determine bacterial biomarkers for individuals with autism, as well as to identify probiotic/therapeutic bacteria for autism. The gut bacterial profile is different between autism children and typically developing children, and the profile also evolves as children grow and develop. Gut microbiota is regarded as an important factor in the development of ASD as well as indicator of growth and development age of children. Currently there are no effective methods for diagnosing or treating autism and particularly no existing model using microbial markers to predict risk of autism in children, nor any model using microbial markers for assessing developmental age of children. This invention provide a new method for predicting risk of autism in children, new method for improve behavioral symptom in autism patients by microbial transfer and/or supplementation, and new method for assessing children's developmental age based on their gut microbial profile.
The invention relates to novel methods and compositions useful for treating the symptoms of autism spectrum disorder (ASD). In particular, the present inventor discovered that certain microorganism species, especially certain bacteria, are at an altered level, in the gastrointestinal (GI) tract of children at risk for ASD or suffering from ASD. Health benefits such as improving behavioral symptoms and alleviating detrimental effects can be achieved by modulating the level of pertinent microorganisms in patients' gut, for example, by fecal microbiota transplantation (FMT) treatment or oral administration of beneficial bacterial species or by suppressing the level of harmful bacterial species. These findings also provide new methods indicating the presence or risk of ASD in a child. Thus, in the first aspect, the present invention provides a novel method for treating ASD, including alleviating ASD symptoms, by increasing the level of one or more bacterial species named in Table 1 in the gastrointestinal tract of a child afflicted by ASD.
In some embodiments, the introducing step comprises oral administration to the subject a composition comprising an effective amount of the one or more of the bacterial species. In some embodiments, the introducing step comprises delivery to the small intestine, ileum, or large intestine of the subject a composition comprising an effective amount of the one or more of the bacterial species. In some embodiments, the introducing step comprises fecal microbiota transplantation (FMT). In some embodiments, the FMT comprises administration to the child a composition comprising processed donor fecal material. In some embodiments, the composition is orally administered; or the composition is directly deposited to the child's gastrointestinal tract. In some embodiments, the level or relative abundance of the one or more of the bacterial species is determined in a first stool sample obtained from the child prior to the introducing step and in a second stool sample obtained from the child after the introducing step. In some embodiments, the level of the one or more of the bacterial species is determined by polymerase chain reaction (PCR), especially quantitative PCR.
In a second aspect, the present invention provides a method for treating ASD, including alleviating ASD symptoms, by reducing the level of one or more bacterial species in Table 2 in the gastrointestinal tract of a child afflicted by ASD.
In some embodiments, the reducing step comprises FMT. In some embodiments, the reducing step comprises treating the subject with an anti-bacterial agent. In some embodiments, a composition comprising processed donor fecal material is introduced to the gastrointestinal tract of the subject after the subject is treated with the anti-bacterial agent. For example, the composition is orally administered, or the composition is directly deposited to the gastrointestinal tract of the child. In some embodiments, the level or relative abundance of the one or more of the bacterial species is determined in a first stool sample obtained from the child prior to the reducing step and in a second stool sample obtained from the child after the reducing step. In some embodiments, the level of the one or more bacterial species is determined by PCR, especially by quantitative PCR.
In a related aspect, a kit is provided for treating the symptoms of ASD. The kit comprises: a first container containing a first composition comprising (i) an effective amount of a first one of the bacterial species set forth in Table 1, or (ii) an effective amount of an anti-bacterial agent that suppresses growth of a first one of the bacterial species set forth in Table 2, and a second container containing a second composition comprising (i) an effective amount of a second one of the bacterial species set forth in Table 1, or (ii) an effective amount of an anti-bacterial agent that suppresses growth of a second one of the bacterial species set forth in Table 2.
In some embodiments, the first composition comprises processed donor fecal material for FMT, for example, the material has been processed and formulated for oral administration, such as dried, frozen or lyophilized, and placed in a capsule suitable for oral ingestion. In some embodiments, the second composition is formulated for oral administration. In some embodiments, both the first and second compositions are formulated for oral administration. In some cases, the kit may include two or more compositions each comprising an effective amount of at least one, possibly two or more, of the bacterial species set forth in Table 1, and/or (ii) an effective amount of an anti-bacterial agent that suppresses growth of at least one, possibly two or more, of the bacterial species set forth in Table 2. The compositions in the kit may each comprise a physiologically acceptable carrier or excipient.
In a third aspect, a method is provided for determining risk for autism spectrum disorder (ASD) in a human child. The method comprises these steps: (1) determining, in a stool sample taken from the child, the relative abundance of any one of the bacterial species set forth in Table 1 or 2; and (2) detecting the relative abundance from step (1) being no lower than the cutoff value in Table 1 or a standard control value or being lower than the cutoff value in Table 2 or a standard control value and determining the child as not having increased risk for ASD; or detecting the relative abundance from step (1) being lower than the cutoff value in Table 1 or a standard control value or being no lower than the cutoff value in Table 2 or a standard control value and determining the child as having an increased risk for ASD. In some embodiments, the relative abundance of the bacterial species in the child's stool sample is determined by PCR, e.g., quantitative PCR.
In a related aspect, a method is provided for assessing risk for autism spectrum disorder (ASD) in two human children. The method comprises these steps: (1) determining, in a stool sample from each of the two children, the relative abundance of any one of the bacterial species set forth in Table 1 or 2; and (2) determining the relative abundance of a bacterial species set forth in Table 1 from step (1) being higher in the stool sample from the first child or the relative abundance of a bacterial species set forth in Table 2 from step (1) being lower in the stool sample from the first child; and (3) determining the second child as having a higher risk for ASD than the first child. In some embodiments, the relative abundance of the bacterial species in both children's stool samples is determined by PCR, e.g., quantitative PCR.
Further, a method is provided for determining risk for ASD in a human child including these steps: (1) determining in a stool sample from the child a value of (a) the relative abundance of Alistipes indistinctus (Ai) or Anaerotruncus colihominis (Ac), or (b) the combined score of levels of three bacterial species Ai, Ac, and Eubacterium hallii (Eh), which is calculated by I1+β1*Ai+β2*Eh+β3*Ac; and (2) detecting the value to be higher than a standard control value and determining the individual as having increased risk of ASD.
Similarly, a method is provided for determining risk of ASD in a human child including these steps: (1) determining in a stool sample from the child relative abundance of Eubacterium hallii (Eh); and (2) detecting the relative abundance from step (1) to be lower than a standard control value and determining the individual as having increased risk of ASD.
In a four aspect, a method is provided for assessing risk for autism spectrum disorder (ASD) in a human child. The method comprises these steps: (1) determining, in a stool sample from the child, the level or relative abundance of one or more of the bacterial species set forth in Table 3; (2) determining the level or relative abundance of the same bacterial species in a stool sample from a reference cohort comprising normal and ASD children; (3) generating decision trees by random forest model using data obtained from step (2) and running the level or relative abundance of one or more of the bacterial species from step (1) down the decision trees to generate a risk score; and (4) determining the child with a risk score greater than 0.5 as having an increased risk for ASD and determining the child with a risk score no greater than 0.5 as having no increased risk for ASD.
In some embodiments, the one or more bacterial species comprise or consist of Alistipes indistinctus. In some embodiments, the one or more bacterial species comprise or consist of Alistipes indistinctus, candidate division TM7 single-cell isolate TM7c, and Streptococcus cristatus. In some embodiments, the one or more bacterial species comprise or consist of Alistipes indistinctus, candidate division TM7 single-cell isolate TM7c, Streptococcus cristatus, Eubacterium_limosum, and Streptococcus_oligofermentans.
In a related aspect, the present invention provides a kit for assessing individuals' risk of developing autism spectrum disorder (ASD). The kit comprises reagents for detecting one or more of the bacterial species set forth in Table 1, 2, or 3. In some embodiments, the reagents comprise a set of oligonucleotide primers for amplification of a polynucleotide sequence from any one of the bacterial species set forth in Table 1, 2, or 3. In some embodiments, the amplification is PCR, for example, quantitative PCR.
In a fifth aspect, the present invention provides a method for determining the growth or developmental age of a child. The method comprises these steps: (a) quantitatively determining the relative abundance of one or more bacterial species selected from Table 8 or 9 in a stool sample taken from the child; (b) quantitatively determining the relative abundance of the one or more bacterial species in a stool sample taken from a reference cohort consisting of typically developing children; (c) generating decision trees by random forest model using data obtained from step (b); and (d) running the relative abundances obtained from step (a) down the decision trees from step (b) to generate a developmental age for the child. In some embodiments, the one or more bacterial species comprise Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, Clostridium hathewayi, and Corynebacterium durum. In some embodiments, the one or more bacterial species comprise Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, and Clostridium hatheway. In some embodiments, the one or more bacterial species comprise Streptococcus gordonii, Enterococcus avium, and Eubacterium_sp_3_1_31. In some embodiments, the one or more bacterial species comprise Streptococcus gordonii and Enterococcus avium. In some embodiments, the one or more bacterial species comprise Streptococcus gordonii. In some embodiments, the child is between about 3 to about 6 years old.
In a related aspect, a kit is provided for determining the growth or developmental age of a child. The kit includes a first container containing a first reagent for detecting a first bacterial species set forth in Table 8 or 9 and a second container containing a second reagent for detecting a second and different bacterial species set forth in Table 8 or 9. In some embodiments, the kit includes three or more containers, each of which containing a reagent for detecting a different bacterial species set forth in Table 8 or 9. In some embodiments, the kit includes two or more containers, each of which containing a reagent for detecting a different bacterial species selected from the group consisting of (1) Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, Clostridium hathewayi, and Corynebacterium durum; (2) Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, and Clostridium hatheway; (3) Streptococcus gordonii, Enterococcus avium, and Eubacterium_sp_3_1_31; or (4) Streptococcus gordonii and Enterococcus avium. In some embodiments, the reagents comprise a set of oligonucleotide primers for amplification of a polynucleotide sequence from any one of the bacterial species set forth in Table 8 or 9. In some embodiments, the amplification is PCR, for example, quantitative PCR (qPCR).
In a sixth aspect, the present invention provides a method for promoting growth and development of a child, comprising administering to the child an effective amount of one or more bacterial species selected from Table 8. In some embodiments, the child is between about 3 to about 6 years old in biological age.
In a related aspect, a kit is provided for promoting growth and development of a child. The kit includes a first container containing a first composition comprising (i) an effective amount of one of the bacterial species set forth in Table 8 and a second container containing a second composition comprising (i) an effective amount of another, different one of the bacterial species set forth in Table 8. In some embodiments, the first or second composition comprises processed donor fecal material for FMT. In some embodiments, the first or second composition is formulated for oral administration. In some embodiments, both the first and second compositions are formulated for oral ingestion.
The term “fecal microbiota transplantation (FMT)” or “stool transplant” refers to a medical procedure during which fecal matter containing live fecal microorganisms (bacteria, fungi, viruses, and the like) obtained from a healthy individual is transferred into the gastrointestinal tract of a recipient to restore healthy gut microflora that has been disrupted or destroyed by any one of a variety of medical conditions, for example, autism spectrum disorder (ASD). Typically, the fecal matter from a healthy donor is first processed into an appropriate form for the transplantation, which can be made through direct deposit into the lower gastrointestinal tract such as by colonoscopy, or by nasal intubation, or through oral ingestion of an encapsulated material containing processed (e.g., dried and frozen or lyophilized) fecal material.
The term “inhibiting” or “inhibition,” as used herein, refers to any detectable negative effect on a target biological process, such as RNA/protein expression of a target gene, the biological activity of a target protein, cellular signal transduction, cell proliferation, and the like. Typically, an inhibition is reflected in a decrease of at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or greater in the target process (e.g., growth or proliferation of a microorganism of certain species, for example, one or more of the bacterial species shown in Table 2), or any one of the downstream parameters mentioned above, when compared to a control. “Inhibition” further includes a 100% reduction, i.e., a complete elimination, prevention, or abolition of a target biological process or signal. The other relative terms such as “suppressing,” “suppression,” “reducing,” “reduction,” “decrease,” “decreasing,” “lower,” and “less” are used in a similar fashion in this disclosure to refer to decreases to different levels (e.g., at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or greater decrease compared to a control level, i.e., the level before suppression) up to complete elimination of a target biological process or signal. On the other hand, terms such as “activate,” “activating,” “activation,” “increase,” “increasing,” “promote,” “promoting,” “enhance,” “enhancing,” “enhancement,” “higher,” and “more” are used in this disclosure to encompass positive changes at different levels (e.g., at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, or greater such as 3, 5, 8, 10, 20-fold increase compared to a control level (before activation), for example, the control level of one or more of the bacterial species shown in Table 1) in a target process or signal. In contrast, the term “substantially the same” or “substantially lack of change” indicates little to no change in quantity from a comparison basis (such as a standard control value), typically within ±10% of the comparison basis, or within ±5%, 4%, 3%, 2%, 1%, or even less variation from the comparison basis.
“Standard control” as used herein refers to a value corresponding to either the average level of a pre-selected bacterial species found in a particular type of samples (e.g., stool samples) obtained from individuals who did not suffer from ASD or developmental delay or a composite score calculated from the average levels of multiple bacterial species found in the type of samples taken from such individuals. For example, for the purpose of examining the risk of ASD in a child, a “standard control” value is established to provide a cut-off value to indicate whether or not the child being examined has an elevated risk for ASD. In order for a “standard control” to be properly established, a sufficient number of individuals (e.g., at least 10, 12, 15, 20, 24 or more individuals) must be included in the control group to provide samples for determination of the average level(s) of one or more pre-selected bacterial species or the composite score calculated from the levels of multiple bacterial species representative of the risk for ASD.
The term “anti-bacterial agent” refers to any substance that is capable of inhibiting, suppressing, or preventing the growth or proliferation of bacterial species, respectively, especially those of shown in Table 2. Known agents with anti-bacterial activity include various antibiotics that generally suppress the proliferation of a broad spectrum of bacterial species as well as agents such as antisense oligonucleotides, small inhibitory RNAs, and the like that can inhibit the proliferation of specific bacterial species. The term “anti-bacterial agent” is similarly defined to encompass both agents with broad spectrum activity of killing virtually all species of bacteria and agents that specifically suppress proliferation of target bacteria species. Such specific anti-bacterial agent may be short polynucleotide in nature (e.g., a small inhibitory RNA, microRNA, miniRNA, lncRNA, or an antisense oligonucleotide) that is capable of disrupting the expression of a key gene in the life cycle of a target bacterial species and is therefore capable of specifically suppressing or eliminating the species only without substantially affecting other closely related bacterial species.
“Percentage relative abundance,” when used in the context of describing the presence of a particular bacterial species (e.g., any one of those shown in any one of Tables 1-11) in relation to all bacterial species present in the same environment, refers to the relative amount of the bacterial species out of the amount of all bacterial species as expressed in a percentage form. For instance, the percentage relative abundance of one particular bacterial species can be determined by comparing the quantity of DNA specific for this species (e.g., determined by quantitative polymerase chain reaction) in one given sample with the quantity of all bacterial DNA (e.g., determined by quantitative polymerase chain reaction (PCR) and sequencing based on the 16s rRNA sequence) in the same sample.
“Absolute abundance,” when used in the context of describing the presence of a particular bacterial species (e.g., any one of those shown in Tables 1-11) in the feces, refers to the amount of DNA derived from the bacterial species out of the amount of all DNA in a fecal sample. For instance, the absolute abundance of one bacterium can be determined by comparing the quantity of DNA specific for this bacterial species (e.g., determined by quantitative PCR) in one given sample with the quantity of all fecal DNA in the same sample.
“Total bacterial load” of a fecal sample, as used herein, refers to the amount of all bacterial DNA, respectively, out of the amount of all DNA in the fecal sample. For instance, the absolute abundance of bacteria can be determined by comparing the quantity of bacteria-specific DNA (e.g., 16s rRNA determined by quantitative PCR) in one given sample with the quantity of all fecal DNA in the same sample.
As used herein, the term “autism spectrum disorder (ASD),” refers to a condition related to brain development that impacts how a person perceives and socializes with others, resulting in difficulties in social interaction and communication. ASD begins in early childhood and eventually causes problems in the suffers' inability to function properly in society-socially, in school, and at work. The term “spectrum” in autism spectrum disorder refers to the wide range of symptoms and severity. ASD includes conditions that were previously considered separate, such as autism, Asperger's syndrome, childhood disintegrative disorder, and an unspecified form of pervasive developmental disorder. The disorder also includes limited and repetitive patterns of behavior.
The term “treat” or “treating,” as used in this application, describes an act that leads to the elimination, reduction, alleviation, reversal, prevention and/or delay of onset or recurrence of any symptom of a predetermined medical condition. In other words, “treating” a condition encompasses both therapeutic and prophylactic intervention against the condition, including facilitation of patient recovery from the condition.
The term “effective amount,” as used herein, refers to an amount of a substance that produces a desired effect (e.g., an inhibitory or suppressive effect on the growth or proliferation of one or more detrimental bacterial species (e.g., the bacterial species shown in Table 2) for which the substance (e.g., an anti-bacterial agent) is used or administered. The effects include the prevention, inhibition, or delaying of any pertinent biological process during bacterial proliferation to any detectable extent. The exact amount will depend on the nature of the substance (the active agent), the manner of use/administration, and the purpose of the application, and will be ascertainable by one skilled in the art using known techniques as well as those described herein. In another context, when an “effective amount” of one or more beneficial or desirable bacterial species (e.g., those listed in Table 1) are artificially introduced into a composition intended to be introduced into the gastrointestinal tract of a patient, e.g., to be used in FMT, it is meant that the amount of the pertinent bacteria being introduced is sufficient to confer to the recipient health benefits such as reduced recovery time or reduced needs for therapeutic intervention for a pertinent disorder such as ASD, including but not limited to medication (such as antipsychotic drugs and antidepressants) and any of the variety of therapies such as behavior and communication therapy, educational therapy, family therapy, speech or physical therapy, and the like.
The term “growth/development age,” as used herein, refers to a child's developmental stage that is expressed in time units and assessed based on the status/profile of microorganism present in his or her GI tract. A comparison between a child's biological (birth) age and growth/development age reflects whether or not the child's growth and development is consistent with his or her birth or chronological age or “age-appropriate.”
As used herein, the term “about” denotes a range of value that is +/−10% of a specified value. For instance, “about 10” denotes the value range of 9 to 11 (10+/−1).
I. Introduction
The invention provides a novel approach for assessing the risk for developing autism spectrum disorder (ASD) among children, for assessing the growth or development age among children, as well as for treating ASD symptoms. During their studies, the present inventors discovered that the presence and relative abundance of certain bacterial species alter significantly in the gastrointestinal tract of patients due to ASD, with increase or decrease of particular species correlating with disease severity. For example, the presence of bacterial species shown in Table 2 is found to be at an elevated level in the GI tract of ASD children, whereas the presence of bacterial species such as those shown in Table 1 have been found to be at a reduced level in the GI tract of ASD children. On the other hand, the level or relative abundance of certain bacterial species (such as those shown in Table 3) in children's stool samples has been observed to correlate with likelihood of children developing ASD at a later time. Lastly, the inventors discovered that the microorganism presence and profile within a child's GI tract evolves as the child progresses along with his growth and development process. Thus, the results of these latest discoveries provide useful tools for treating ASD symptoms, for assessing ASD risk among children, for guiding the necessary treatment such as medication and/or therapies described herein for children who have been identified as at high risk of ASD or are exhibiting symptoms of ASD, as well as for assessing a child's growth and developmental age to determine whether he is appropriate in his development process in relation to his biological or birth age, which can then facilitate subsequently determining whether or not certain treatment is needed, for example, supplementary administration of certain bacterial species found to be deficient in the GI tract of the child for the purpose of promoting his growth and development.
II. FMT Donor/Recipient Selection and Preparation
ASD children suffer from a disrupted state of GI tract microflora are considered as recipients for FMT treatment in order to restore the normal healthy profile for microorganisms. As revealed by the present inventors, the presence or risk of ASD tends to lead to a depressed level of bacterial species such as those shown in Table 1, a FMT donor whose fecal material contains an higher than average level of one or more of these bacterial species is favored as particularly advantageous for this purpose. For example, a desirable donor may preferably have higher than about 0.01%, 0.02%, 0.05%, 0.10%, 0.20%, 0.40%, 0.50%, 0.60%. 0.80%, 1.0%, 2.0%, 3.0%, 4.0%, 5.0%, 6.0%, 7.0%, 8.0%, 8.5%, 9.0%, or higher of total bacteria in relative abundance for each of these bacterial species in his stool sample.
On the other hand, ASD children have abnormally high level of the bacterial species listed in Table 2. Thus, to restore their normal and healthy GI bacterial profile, FMT is appropriate using fecal material donated from a healthy person whose level of these bacterial species (in Table 2) in the stool sample is either naturally low or artificially depressed, for example, by the use of a specific anti-bacterial agent that specifically kills or suppresses certain target bacterial species without significantly impacting other bacterial species. Preferably, each of these bacterial species should have no more than about 0.01%, 0.02%, 0.05%, 0.07%, 0.08%, 0.10%, 0.13%, 0.15%, 0.20%, 0.25%, 0.30%, 0.50%, 0.70% or higher of total bacteria in relative abundance in the fecal material before being processed for use in FMT.
Fecal matter used in FMT is obtained from a healthy donor and then processed into appropriate forms for the intended means of delivery in the upcoming FMT procedure. While a healthy individual from the same family or household often serves as donor, in practicing the present invention the donor microorganism profile is an important consideration and may favor the choice of an unrelated donor instead. The process of preparing donor material for transplant includes steps of drying, freezing or lyophilizing, and formulating or packaging, depending on the precise route of delivery to recipient, e.g., by oral ingestion or by rectal deposit.
In preparation for FMT treatment, an intended recipient, e.g., a patient who has been diagnosed with ASD or who was deemed to have an increased risk of developing ASD but has not yet exhibited any definitive symptoms for the disease (e.g., has family history or other known risk factors for ASD), may first receive a treatment to suppress bacterial level in his GI tract prior to FMT. The treatment may involve administration of an anti-bacterial agent, either a broad spectrum antibiotic or a specific anti-bacterial agent, to eliminate or reduce the level of undesirable bacterial species that has risen due to the ASD presence or risk, such as one or more of the bacteria named in Table 2.
Various methods have been reported in the literature for determining the levels of all bacterial species in a sample, for example, amplification (e.g., by PCR) and sequencing of bacterial polynucleotide sequence taking advantage of the sequence similarity in the commonly shared 16S rRNA bacterial sequences. On the other hand, the level of any given bacterial species may be determined by amplification and sequencing of its unique genomic sequence. A percentage abundance is often used as a parameter to indicate the relative level of a bacterial species in a given environment.
III. Treatment Methods by Modulating Bacterial Level
The discovery by the present inventors reveals the direct correlation between ASD and the increase or decrease of certain bacterial species (e.g., those shown in Table 1 or 2) in ASD children's gut. This revelation enables different methods for treating ASD symptoms, especially for aiding ASD children to benefit from different treatment regimens such as medication and/or various therapies, by adjusting or modulating the level of these bacterial species in these patient's GI tract via, e.g., an FMT procedure, to either deliver to the patients' GI tract an effective amount of one or more of the bacterial species of those shown in Table 1 or to decrease the level of one or more bacterial species listed in Table 2, e.g., by delivering an anti-bacterial agent to suppress the target bacterial species.
When a proposed FMT donor whose stool is tested and found to contain an insufficient level of one or more of the beneficial bacterial species such as those shown in Table 1 (e.g., each is less than about 0.01%, 0.05%, 0.10%, 0.20%, 0.40%, 0.50%, 0.80%, 1.0%, 2.0%,3.0%,4.0%,5.0%,6.0%,7.0%, or 8.0% of total bacteria in the stool sample), the proposed donor is deemed as an unsuitable donor for FMT intended to treat ASD symptoms or to reduce a recipient's (e.g., a child's) risk for developing ASD in the future, he may be disqualified as a donor in favor of anther individual whose stool sample exhibits a more favorable bacterial profile, and his fecal material should not be immediately used for FMT due to the lack of prospect of conferring such beneficial health effects unless the stool material is adequately modified. In these cases of expected lack of health benefits from FMT treatment can be readily improved in view of the inventors' discovery, for example, one or more of the bacterial species such as those shown in Table 1 may be introduced from an exogenous source into a donor fecal material so that the level of the bacterial species in the fecal material is increased (e.g., to reach at least about 0.01%, 0.02%, 0.05%, 0.10%, 0.20%, 0.40%, 0.50%, 0.60%. 0.80%, 1.0%, 2.0%, 3.0%, 4.0%, 5.0%, 6.0%, 7.0%, 8.0%, 8.5%, 9.0%, or 10% of total bacteria in the fecal material) before it is processed for use in FMT for the treatment of ASD symptoms or for reducing ASD risk in a child. Pre-treatment schemes with similarly intended goals can be employed to prepare patients who are soon to receive FMT treatment in order to maximize their potential to receive health benefits such as those stated above and herein.
As an alternative, the beneficial bacterial species (one or more of those shown in Table 1) may be obtained from a bacterial culture in a sufficient quantity and then formulated into a suitable composition, which is without any fecal material taken from a donor, for delivery into an ASD patient's gut. Similar to FMT, such composition can be introduced into a patient by oral, nasal, or rectal administration.
On the other hand, certain bacterial species (e.g., those in Table 2) are found to rise in their relative abundance as a result of the presence of ASD or risk of ASD. Thus, ASD patients or those at heightened risk for ASD are treated to reduce the level of these bacterial species in order to ameliorate the patients' symptoms related to the illness. There are several options to reduce the level of these bacterial species: first, the patient may be given a specific anti-bacterial agent to specifically kill or suppress the targeted bacterial species, thereby lowering the abnormally high level of these bacteria.
Second, the patient may be first given an anti-bacterial agent, such as a broad spectrum antibiotic to kill or suppress all bacterial species, or a specific anti-bacterial agent to specifically kill or suppress the targeted bacterial species; then a composition may be administered to the patient (e.g., by FMT) to introduce a well-balanced mixed bacterial culture into the GI tract of the patient.
Each of these options can be performed in one combined step to achieve the first and second treatment method goals, i.e., to increase the level of certain bacterial species (such as one or more of those shown in Table 1) and to decrease the level of certain other bacterial species (for example, one or more of those listed in Table 2), using one single composition (such as processed fecal material from an FMT donor) containing the pertinent bacterial species within the appropriate ratio range to one another.
Immediately upon completion of the step of introducing an effective amount of the desired bacterial species into a patient's GI tract (e.g., via an FMT procedure) and/or the step of suppressing undesirable bacterial level, the recipient may be further monitored by continuous testing of the level or relative abundance of the bacterial species in the stool samples on a daily basis for up to 5 days post-procedure while the clinical symptoms of ASD being treated as well as the general health status of the patient are also being monitored in order to assess treatment outcome and the corresponding levels of relevant bacteria in the recipient's GI tract: the level of bacterial species (one or more of those shown in Table 1) may be monitored in connection with observation of health benefits achieved such as improvement in behavior, language or social skills.
IV. Assessing Disease Severity and Corresponding Treatment
The present inventors also discovered that the altered level of certain bacterial species can indicate the presence or risk of ASD: they revealed the correlation between reduced level of certain bacterial species (e.g., those shown in Table 1) in human children's stool samples and the likelihood of a later diagnosis of ASD in these children. Similarly, a correlation between increased level of certain other bacterial species (e.g., those shown in Table 2) in a child's GI tract and the likelihood of the child later developing ASD has been established. Further, the level or relative abundance of certain bacterial species (such as one or more of the species shown in Table 3) have been revealed to indicate a subject's risk for later developing ASD when properly calculated using certain specified mathematic tools.
For example, when stool samples taken from two or more children, the level or relative abundance of bacterial species in Table 1 or 2 in the samples may be determined, for example, by PCR especially quantitative PCR. For the bacterial species listed in Table 1, a lower level found in a child's stool sample indicates a higher likelihood for the presence or increased risk of ASD in the child; conversely, for the bacterial species listed in Table 2, a higher level found in a child's stool sample indicates a higher likelihood of the presence or risk for ASD in the child. In the event that the level of multiple species are measured and compared, the rick determination is made based on the indication from the majority of the pertinent bacterial species measured.
Once the ASD risk assessment is made, for example, a child is deemed to have ASD or is at an increased risk of later developing ASD, appropriate treatment steps can be taken as a measure to address the heightened risk for the child. For example, the child may be given medication such as antipsychotic and/or antidepressant drugs or may be given therapies such as those specifically designed to address behavioral problems and/or to improve language, communication, or social skills
V. Assessing Growth/Development Age in Children
The present inventors have in addition revealed that the profile of bacterial species present in a child's gastrointestinal tract continues to evolve as the child continues to develop as a part of the normal growth process. Thus, the results disclosed therein further allows one to devise an effective and accurate means for assessing children's developmental age based on the levels of certain relevant gut bacterial species using the methods described herein. More specifically, a stool sample is first taken from the child who is being tested for his growth or development age. The level or relative abundance of a plurality of pre-selected bacterial species (such as the bacterial species shown in Table 8 or 9) are then quantitatively determined using methods known in the pertinent field or described herein. Using the levels of these bacterial species one can subsequently calculate the child's development age using mathematic tools specifically described in this disclosure.
Once a child's growth or developmental age is determined using the method of this invention, if needed the child may be given appropriate treatment for the purpose of promoting his growth or development. For example, if a child's development age is found to be well behind his biological age, e.g., more than about 6 or 9 or 12 months behind his biological age, or more than about 10%, 20%, 25%, 33%, or even 50% behind his biological age, he may be given treatment by way of administration of an effective amount of one or more of the bacterial species named in Table 8 or 9 and found to be deficient in his gastrointestinal tract. One method of treatment is FMT, e.g., oral administration or direct deposit of pre-processed material enriched with the desired bacterial species.
VI. Kits and Compositions for Use in ASD Treatment
The present invention provides novel kits and compositions that can be used for alleviating the symptoms and conferring health benefits in the therapeutic and/or prophylactic treatment of ASD, including facilitation of patient improvement by way of conventional therapies designed for treating ASD. For example, a kit is provided that comprises a first container containing a first composition comprising (i) an effective amount of one or more of the bacterial species set forth in Table 1 or 14, or (ii) an effective amount of an anti-bacterial agent that suppresses growth of one or more of the bacterial species set forth in Table 2 or 13, and a second container containing a second composition comprising an effective amount of a medicine known for use in the treatment of ASD (such as an antipsychotic or antidepressant drug). In some variations, the kit may contain two or more compositions each of which comprises an effective amount of (1) one or more of the beneficial bacterial species of Table 1 or 14, (2) an anti-bacterial agent, and (3) a medicine for treating ASD, either alone or in any combination.
In some cases, the first composition comprises a fecal material from a donor, which has been processed, formulated, and packaged to be in an appropriate form in accordance with the delivery means in the FMT procedure, which may be by direct deposit in the recipient's lower gastrointestinal track (e.g., wet or semi-wet form) or by oral ingestion (e.g., frozen, dried/lyophilized, encapsulated). Alternatively, the first composition may not contain any donor fecal material but is an artificially mix containing the preferred bacterial species, such as one or more of the bacterial species set forth in Table 1 or 14, at an appropriate ratio and quantity. Further, the first composition may contain an adequate amount of an anti-bacterial agent that suppresses growth of one or more of the bacterial species set forth in Table 2 or 13. The anti-bacterial agent may be a broad-spectrum anti-bacterial agent in some cases; or in other cases it may be a specific anti-bacterial agent targeting the specific bacterial species only (e.g., those in Table 2 or 13): it may be a short polynucleotide, e.g., a small inhibitory RNA, microRNA, miniRNA, lncRNA, or an antisense oligonucleotide, that is capable of specifically targeting one or more of predetermined bacterial species without significantly affecting other closely related bacterial species.
In other cases, the first composition may be a composition (e.g., a processed FMT donor fecal material) comprising the preferred bacterial species (such as one or more of the bacterial species set forth in Table 1 or 14) at an appropriate ratio and quantity along with a specific anti-bacterial agent targeting the specific bacterial species only (e.g., those named in Table 2 or 13). The first composition is formulated and packaged in accordance with its intended means of delivery to the patient, for example, by oral ingestion, nasal delivery, or rectal deposit.
The second composition in some cases may comprises an adequate or effective amount of a therapeutic agent effective for treating ASD, for example, an antipsychotic or antidepressant drug. The composition is formulated for the intended delivery method of the prebiotic or therapeutic agent(s), for example, by injection (intravenous, intraperitoneal, intramuscular, or subcutaneous injection) or by oral/nasal administration or by local deposit (e.g., suppositories).
The first and second compositions are often kept separately in two different containers in the kit. In some cases, the composition for increasing the level of certain bacterial species (such as one or more of the bacterial species set forth in Table 1 or 14) and the composition for suppressing other bacterial species (e.g., one or more of those listed in Table 2 or 13) may be combined to form a single composition for administration to the patient together, for example, by oral or local delivery, at the same time. In some cases, the first and second compositions may be combined in a single composition so that they can be administered to the patient together, for example, by oral or local delivery, at the same time.
Moreover, a kit is provided for the quantitative detection of one or more bacterial species such as the bacterial species set forth in Tables 1, 2, 13, and 14. The kit comprises reagents for quantitative detection of each of the bacterial species, for example, such reagents may comprise a set of oligonucleotide primers for the amplification, such as PCR especially quantitative PCR, of a polynucleotide sequence derived from, and preferably unique to, each one of the pertinent bacterial species (such as any one or more of the bacterial species set forth in Tables 1-3), especially those set forth in Tables 1, 2, 13, and 14.
In addition, the present invention provides kits and compositions for assessing a child's growth or development age as well as for promoting or enhancing a child's growth or development. Typically, a kit for determining developmental age of a child includes a first container containing a first reagent for detecting a first bacterial species set forth in Table 8 or 9 and a second container containing a second reagent for detecting a second (different from the first) bacterial species set forth in Table 8 or 9. For example, the kit may include three or more containers each of which containing a reagent for detecting a different bacterial species set forth in Table 8 or 9. As another example, the kit may include two or more containers each of which containing a reagent for detecting a different bacterial species selected from any of the following groups consisting of (1) Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, Clostridium hathewayi, and Corynebacterium durum; (2) Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_131, and Clostridium hatheway; (3) Streptococcus gordonii, Enterococcus avium, and Eubacterium_sp_31_31; or (4) Streptococcus gordonii and Enterococcus avium. The reagents included in the kit for detection of a pre-selected bacterial species may include a set of oligonucleotide primers for amplification of a polynucleotide sequence from (and preferably unique to) the bacterial species, e.g., any one of the bacterial species set forth in Table 8 or 9. A frequently used method of amplification is PCR, such as quantitative PCR (qPCR).
A kit for promoting growth and development of a child (e.g., a child of about 3 to about 6 years of biological or birth age) by way of administering to the child an effective amount of one or more bacterial species selected from Table 8 typically includes a first container containing a first composition comprising (i) an effective amount of one of the bacterial species set forth in Table 8 and a second container containing a second composition comprising (i) an effective amount of another (and different from the first) one of the bacterial species set forth in Table 8. In exemplary embodiments, the first and/or second composition(s) may be or include a processed donor fecal material for FMT. Either or both of the first and second compositions may be formulated for oral administration, for example, to be used in an FMT process. All compositions described herein may contain one or more physiologically acceptable excipients or carriers in addition to the active components.
The following examples are provided by way of illustration only and not by way of limitation. Those of skill in the art will readily recognize a variety of non-critical parameters that could be changed or modified to yield essentially the same or similar results.
The present inventors studied changes in gut microbiota due to the presence or risk for autism spectrum disorder (ASD) by comparing the profile of bacterial species present in the gastrointestinal tract of autistic children with that of developmentally normal children. The bacterial species that have been found to be present at a decreased level or relative abundance in autistic children, e.g., any one of those set forth in Table 1, and the bacterial species that have been found to be present at an increased level or relative abundance in autistic children, e.g., any one of those set forth in Table 2, can be quantitatively measured to assess an individual's risk of later developing ASD. On the other hand, these bacterial species may be subject to modulation of their level or relative abundance in order to treat ASD by alleviating at least some of its symptoms.
A total of 128 Chinese children (aged between 3 and 6 years) were recruited, including 64 children with autism spectrum disorder (ASD) and 64 typically developing children. There were more male (83%) than female. Majority of case were diagnosis with ASD at around 3 years old. The study was approved by The Joint Chinese University of Hong Kong, New Territories East Cluster Clinical Research Ethics Committee (The Joint CUHK-NTEC CREC, CREC Ref. No: 2016.607). All subjects consented to donate fecal samples and to the questionnaire investigation, where written informed consents were obtained. Fecal samples from the study subjects were stored at −80° C. for downstream microbiome analyses. Children diagnosed with ASD by pediatrician or clinical psychologist according to the standard of the fourth or fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV or DSM-V) were included. Children without ASD, delays in motor and language development, as well as behaviors as reported by their parents, and those do not have first-degree relatives with ASD were included as typically developing children.
Fecal bacterial DNA was extracted by Maxwell® RSC PureFood GMO and Authentication Kit (Promega) with modifications to increase the yield of DNA. Approximately 100 mg from each stool sample was pretreated: stool sample suspended in 1 ml ddH2O and pelleted by centrifugation at 13,000×g for 1 min. Washed sample was added 800ul TE buffer (PH 7.5), 16ul beta-Mercaptoethanol and 250U lyticase sufficiently mixed and digestion at 37° C. for 90 minutes. Pelleted by centrifugation at 13,000×g for 3 minutes.
After pretreatment, precipitate was re-suspended in 800ul CTAB buffer (Maxwell® RSC PureFood GMO and Authentication Kit following manufacturer's instructions) and mixed well. After samples were heated at 95° C. for 5 minutes and cooled down, nucleic acid was released from the samples by vortexing with 0.5 mm and 0.1 mm beads at 2850 rpm for 15 minutes. Following this, 40ul Proteinase K and 20ul RNase A were added and nucleic acid digested at 70° C. for 10 minutes. Finally, supernatant was obtained after centrifugation at 13,000×g, 5 minutes and placed in a Maxwell® RSC instrument for DNA extraction. The extracted fecal DNA was used for ultra-deep metagenomics sequencing via Ilumina Novaseq 6000 (Novogene, Beijing, China).
Raw sequence reads were trimmed by Trimmomatic1(v0.38) firstly and then separation of non-human reads from contaminant host reads. There were some steps to acquire clean reads: 1) Remove adapters; 2) Scan the read with a 4-base wide sliding window, removing reads when the average quality per base drop below 20; 3) Drop reads below the 50 bases long. T rimmed sequence reads were mapped to human genome (Reference database: GRCh38 p12) by KneadData (v0.7.2) to remove reads originated from the host. Pair-end two reads were concatenated together.
Profiling of the composition of bacterial communities was performed on metagenomic trimmed reads via MetaPhlAn2 (v2.7.5)2. Mapping reads to clade-specific markers gene and annotation of species pangenomes was done through Bowtie2 (v2.3.4.3)3. The output table contained bacterial species and its relative abundance in different levels, from kingdom to strain level. The resulting data were analyzed in R v3.6.1 using tidyverse (v1.2.1)4, ggpubr (v0.2, website: github.com/kassambara/ggpubr) and phyloseq (v1.24.2)5. Human gut bacteria composition and defined the differential bacterial species were compared between children with autism spectrum disorder (ASD) and typically developing children via Linear discriminant analysis effect size (LEfSe) analysis6.
Random forest (RF) was chosen to build ASD versus typically developing children prediction model using fecal microbes because of its superior performance for classification with binary features. Random Forest7 is one of the most popular approaches in metagenomic data analysis to identify the discriminative features and build prediction models. As a widely used ensemble learning algorithm, Random Forest consists of a series of classification and regression trees (CARTs) to form a strong classifier. A subset of data randomly sampled from the original dataset with replacement is known as bootstrap sampling, applying to build the trees. When the training dataset for the current tree is drawn by the bootstrap method,
observations are left out from the overall dataset. With infinite N, there are 36.8% data not occurred in the training samples called out-of-bag (OOB) observations, which would not be used for constructing the trees. In addition, extra randomness introduced to the random forest as each decision tree splits nodes based on a random subset of features selected from the overall features. The features with the least Gini (Gini are used to evaluate the purity of the node) would be utilized to split the nodes in each iteration to generate the trees. With different subsets of data and features, the algorithm is able to train different trees and obtain the final classification by averaging the result from the tree models. In addition to the prediction model, Random Forest has the capability to assess the importance of variables8. The OOB observations are used to estimate the classification error for each tree in the forest. To measure the importance of a given variable, the values of the variable in the OOB data are randomly altered, and then the changed OOB data is used to generate new predictions. The difference of the error rate between the altered and the original OOB observations divided by the standard error is calculated as the importance of a variable. To classify a new sample, the features of the sample passed down to each tree to estimate the probability for classification. The Random Forest used the average probability of all trees to determine the final result of the classification.
A total of 64 children with ASD patients and 64 typically developing children were included as the discovery cohort for modeling. The importance value of each species to the classification model was evaluated by recursive feature elimination. According to descending importance value, the selected species were added one by one to the random forest model if its Pearson correlation value with any already existing probe in the model was <0.7. Each time a new feature was added to the model, the performance of the model was re-evaluated using 10-fold cross-validation. These models were compared in terms of binary classifiers with Area Under the Curve (AUC) in Receiver Operating Characteristic (ROC) curves. The final model was chosen when best accuracy and kappa were achieved. These analysis was done using R packages randomForest v4.6-147 and pROC v1.15.39.
With LEfSe analysis, the species Faecalibacterium prausnitzii, Roseburia inulinivorans, Eubacterium hallii, Dorea longicatena, Eubacterium siraeum (
Faecalibacterium prausnitzi
Roseburia inulinivorans
Eubacterium hallii
Dorea longicatena
Eubacterium siraeum
Clostridium nexile
Dialister invisus
Clostridium bolteae
Clostridium symbiosum
Eubacterium limosum
Clostridium ramosum
Anaerotruncus colihominis
Clostridium citroniae
Alistipes indistinctus
Bacteria listed in Table 1 and Table 2 can be used in different combinations to determine the risk of ASD. For example, the relative abundance can be determined using as a panel of qPCR primer or by metagenomics sequencing to calculate the risk.
Furthermore, bacteria listed in Table 1 can be administered to children with ASD or at risk for developing ASD to ameliorate symptoms of ASD or reduce risk for later developing ASD. Conversely, bacteria listed in Table 2 can be targeted for suppression in children with ASD or at risk for developing ASD to ameliorate symptoms of ASD or reduce risk for later developing ASD.
Five bacterial markers were used in the machine learning model, including Alistipes indistinctus, candidate division_TM7 single_cell_isolate_TM7c, Streptococcus cristatus, Eubacterium_limosum, Streptococcus_oligofermentans (Table 3). The final models using these 5 markers has an Area Under the Curve (AUC) in Receiver Operating Characteristic (ROC) curves of 79.1% (
Alistipes indistinctus
Streptococcus cristatus
Eubacterium
—
limosum
Streptococcus
—
oligofermentans
To determine the risk of ASD in a subject, the following steps will be carried out:
In performing the above step (1), the bacterial species selected from Table 3 should comprise of (a) Alistipes indistinctus (top 1 species; AUC: 61.6%;
The relative abundance of 5 species listed in Table 3 from 64 children with ASD and 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 4). Decision trees were generated by random forest from data in Table 4 with parameter: trees=801, mtry=2.
The risk of ASD of a 3-year-old child was determined. The relative abundance of the 5 species listed in Table 3 in fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method (Table 5). The relative abundances were run down the decision trees and a risk score was generated. The score of the child was 0.78 (
The relative abundance of Alistipes indistinctus, candidate division TM7 single-cell isolate TM7c, Streptococcus cristatus selected from Table 3 from 64 children with ASD and 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 4). Decision trees were generated by random forest from data in Table 4 with parameter: trees=801, mtry=2.
The risk of ASD of a 3-year-old child was determined. The relative abundance of the 3 species above in the fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method (Table 5). The relative abundances were run down the decision trees and a risk score was generated. The score of the child was 0.833 (
The relative abundance of 5 species listed in Table 3 from 64 children with ASD and 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 4). Decision trees were generated by random forest from data in Table 4 with parameter: trees=801, mtry=2.
The risk of ASD of a 20-year-old female subject was determined. The relative abundance of the 5 species listed in table 3 in fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method (Table 6). The relative abundances were run down the decision trees and a risk score was generated. The score of the child was 0.77 (
The relative abundance of Alistipes indistinctus, candidate division TM7 single-cell isolate TM7c, Streptococcus cristatus selected from Table 3 from 64 children with ASD and 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 4). Decision trees were generated by random forest from data in Table 4 with parameter: trees=801, mtry=2.
The risk of ASD of a 20-year-old female was determined. The relative abundance of the 3 species above in the fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method (Table 6). The relative abundances were run down the decision trees and a risk score was generated. The score of the child was 0.79 (
This subject was diagnosed with ASD since 2020.
Alistipes
Streptococcus
indistinctus
cristatus
Eubacterium
—
limosum
Streptococcus
—
oligofermentans
Alistipes
Streptococcus
indistinctus
cristatus
Eubacterium
—
limosum
Streptococcus
—
oligofermentans
Alistipes
Streptococcus
indistinctus
cristatus
Eubacterium
—
limosum
Streptococcus
—
oligofermentans
An independent validation cohort of ASD (n=8) and typically developing (TD) children (n=10) were recruited to validate the machine learning model described in example 1 (PART I). As in example 1 (PART I), a machine learning model is generated using 5 species listed in Table 3. Briefly, the relative abundance of 5 species listed in Table 3 from 64 children with ASD and 64 TD children was determined by metagenomics sequencing and taxonomy assigned as described in METHODS of PART I (resulting relative abundance are listed in Table 4). Decision trees were generated by random forest from data in Table 4 with parameter: trees=801, mtry=2.
This machine learning model generated from 64 ASD and 64 TD children was used to determine the risk of ASD in each of the 18 children in the validation cohort. The relative abundance of the 5 species in fecal samples from the validation cohort were determined by metagenomics sequencing and taxonomy assigned as described in METHODS of PART I. Resulting relative abundance are listed in Table 7. These relative abundances were run down the decision trees and a risk score was generated. The model showed an AUC of 0.762 in discriminating ASD and TD in the validation cohort (
Alistipes
Streptococcus
Eubacterium
Streptococcus
indistinctus
cristatus
limosum
oligofermentans
A total of 64 typically developing children (aged between 3 and 6 years) were recruited. There were more male (84%) than female. The study was approved by The Joint Chinese University of Hong Kong, New Territories East Cluster Clinical Research Ethics Committee (The Joint CUHK-NTEC CREC, CREC Ref. No: 2016.607). All subjects consented to donate fecal samples and to the questionnaire investigation, where written informed consents were obtained. Fecal samples from the study subjects were stored at −80° C. for downstream microbiome analyses. Children were included as typically developing children without ASD, delays in motor and language development, as well as behaviors as reported by their parents, and those do not have first-degree relatives with ASD.
Fecal bacterial DNA was extracted by Maxwell® RSC PureFood GMO and Authentication Kit (Promega) with modifications to increase the yield of DNA. Approximately 100 mg from each stool sample was pretreated: stool sample suspended in 1 ml ddH2O and pelleted by centrifugation at 13,000×g for 1 min. Washed sample was added 800ul TE buffer (PH 7.5), 16ul beta-Mercaptoethanol and 250U lyticase sufficiently mixed and digestion at 37° C. for 90 minutes. Pelleted by centrifugation at 13,000×g for 3 minutes.
After pretreatment, precipitate was re-suspended in 800ul CTAB buffer (Maxwell® RSC PureFood GMO and Authentication Kit following manufacturer's instructions) and mixed well. After samples were heated at 95° C. for 5 minutes and cooled down, nucleic acid was released from the samples by vortexing with 0.5 mm and 0.1 mm beads at 2850 rpm for 15 minutes. Following this, 40ul Proteinase K and 20ul RNase A were added and nucleic acid digested at 70° C. for 10 minutes. Finally, supernatant was obtained after centrifugation at 13,000×g, 5 minutes and placed in a Maxwell® RSC instrument for DNA extraction. The extracted fecal DNA was used for ultra-deep metagenomics sequencing via Ilumina Novaseq 6000 (Novogene, Beijing, China).
Raw sequence reads were trimmed by Trimmomatic1(v0.38) firstly and then separation of non-human reads from contaminant host reads. There were some steps to acquire clean reads: 1) Remove adapters; 2) Scan the read with a 4-base wide sliding window, removing reads when the average quality per base drop below 20; 3) Drop reads below the 50 bases long. Trimmed sequence reads were mapped to human genome (Reference database: GRCh38 p12) by KneadData (v0.7.2) to remove reads originated from the host. Pair-end two reads were concatenated together.
Profiling of the composition of bacterial communities was performed on metagenomic trimmed reads via MetaPhlAn2 (v2.7.5)2. Mapping reads to clade-specific markers gene and annotation of species pangenomes was done through Bowtie2 (v2.3.4.3)3. The output table contained bacterial species and its relative abundance in different levels, from kingdom to strain level. Bacterium species and chronological age correlations performed by Spearman's correlation analysis was conducted via psych package (1.9.12.31) in R.
Random forest (RF) was chosen to build microbiota age prediction model using fecal microbes from 64 typically developing children because of its superior performance for mean prediction with learning method for regression. Random Forest7 is one of the most popular approaches in metagenomic data analysis to identify the discriminative features and build prediction models. As a widely used ensemble learning algorithm, Random Forest consists of a series of classification and regression trees (CARTs) to form a strong mean prediction. A subset of data randomly sampled from the original dataset with replacement is known as bootstrap sampling, applying to build the trees. When the training dataset for the current tree is drawn by the model votes or averaging, into a single ensemble model that ends up outperforming any individual decision tree's output. Bootstrap method,
observations are left out from the overall dataset. With infinite N, there are 36.8% data not occurred in the training samples called out-of-bag (OOB) observations, which would not be used for constructing the trees. In addition, extra randomness was introduced to the random forest as each decision tree splits nodes based on a random subset of features selected from the overall features. The features with the higher % IncMSE (Increased in mean squared error (%)) represent features that have greater contribution in prediction model. With different subsets of data and features, the algorithm is able to train different trees and obtain the final result by averaging the result from the tree models. In addition to the prediction model, Random Forest has the capability to assess the importance of variables8. To obtain a single prediction for a single OOB observation, these predicted responses can be averaged. To measure the importance of a given variable, the values of the variable in the OOB data are randomly altered, and then the changed OOB data is used to generate new predictions. The difference of the error rate between the altered and the original OOB observations divided by the standard error is calculated as % IncMSE (estimated with out-of-bag) the importance of a variable. To predict a new sample, the features of the sample were passed down to each tree to estimate the average value. The Random Forest used the average probability of all trees to determine the final result.
A total 64 typically developing children were included as the discovery cohort for modeling. The importance value of each species to the regression model was evaluated by recursive feature elimination. According to descending importance value, top 5 bacterial taxa were selected to build model. These analysis was done using R packages randomForest v4.6-147.
Gut bacterial species correlated with chronological age in typically developing children
To assess the correlation between bacterial species and children chronological age, Spearman's correlation coefficient between the 2 factors was calculated. Statistical significance was determined for all pairwise comparisons. There are both positive (relative abundance increase with age) and negative correlations (relative abundance decrease with age). Only statistically significant correlations with absolute coefficient>0.2 were shown in the table below. For example, the species Bacteroides thetaiotaomicron was significantly increased with children's age. Bacteroides thetaiotaomicron can help children metabolizing a diverse range of polysaccharides when they have abundant carbohydrate-rich diets.
Enterococcus avium
Streptococcus vestibularis
Anaerostipes unclassified
Streptococcus gordonii
Blautia hansenii
Clostridium hathewayi
Parabacteroides
Ruminococcus albus
Adlercreutzia equolifaciens
Eubacterium_sp_3_1_31
Mitsuokella multacida
Alistipes onderdonkii
Bacteroides fragilis
Roseburia intestinalis
Bacteroides uniformis
Eubacterium brachy
Bacteroides
thetaiotaomicron
Dorea formicigenerans
Bilophila unclassified
Bacteroides xylanisolvens
Faecalibacterium
prausnitzii
As such, bacteria listed in Table 8 can be used in different combinations to build an assessment model to determine the age of growth and development in a child and whether microbiome restoration therapy or supplementation is required. The relative abundance can be determined using as a panel of qPCR primer or by metagenomics sequencing to determine the development of gut microbiota.
Furthermore, bacteria listed in Table 8 that have a positive correlation coefficient (Spearman's correlation coefficient) may be supplemented to children to support growth and development in children. The relative abundance should increase to a level higher than or equal to the mean relative abundance of typically developing children listed in Table 8.
With regression random forest analysis, it was discovered that the species Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, Clostridium hathewayi, Corynebacterium durum (
Streptococcus gordonii
Enterococcus avium
Eubacterium sp. 3_1_31
Clostridium hathewayi
Corynebacterium durum
Thus, to determine the risk of growth and developmental delay in a child, the following steps will be carried out:
4. Two species (Streptococcus gordonii and Enterococcus avium;
5. One species (Streptococcus gordonii;
Five bacterial species (Streptococcus gordonii, Enterococcus avium, Fubacterium_sp_3_1_31, Clostridium hathewayi, Corynebacterium durum) listed in Table 9 from 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 10). Decision trees were generated by random forest from data in Table 10 with parameter: ntree=10000, proximity=TRUE, importance=TRUE, nPerm=10.
The risk of growth and developmental delay of a 3-year-old child was determined. The relative abundance of the 5 species above in fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method. The relative abundances were run down the decision trees and a predicted age of growth and development was generated. The predicted age of this child was 48.3 month (
Four species (Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31, Clostridium hathewayi) listed in Table 9 from 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 10). Decision trees were generated by random forest from data in Table 10 with parameter: ntree=10000, proximity=TRUE, importance=TRUE, nPerm=10.
The risk of growth and developmental delay of a 3-year-old child was determined. The relative abundance of the 4 species above in fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method. The relative abundances were run down the decision trees and a predicted age of growth and development was generated. The predicted microbiota age in child was 48.3 month (
Three species (Streptococcus gordonii, Enterococcus avium, Eubacterium_sp_3_1_31) listed in Table 9 from 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 10). Decision trees were generated by random forest from data in Table 10 with parameter: ntree=10000, proximity=TRUE, importance=TRUE, nPerm=10.
The risk of growth and developmental delay of a 3-year-old child was determined. The relative abundance of the 3 species above in the fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method. The relative abundances were run down the decision trees and a predicted age of growth and development was generated. The predicted age of this child was 52.6 month (
The relative abundance of Streptococcus gordonii and Enterococcus avium in 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 10). Decision trees were generated by random forest from data in Table 10 with parameter: ntree=10000, proximity=TRUE, importance=TRUE, nPerm=10.
The risk of growth and developmental delay of a 3-year-old child was determined. The relative abundance of the 2 species above in the fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method. The relative abundances were run down the decision trees and a predicted age of growth and development was generated. The predicted age of this child was 53.2 month (
The relative abundance of Streptococcus gordonii in 64 typically developing children was determined by metagenomics sequencing and taxonomy assigned as described in methods (relative abundance listed in Table 10). Decision trees were generated by random forest from data in Table 10 with parameter: ntree=10000, proximity=TRUE, importance=TRUE, nPerm=10.
The risk of growth and developmental delay of a 3-year-old child was determined. The relative abundance of the one species above in the fecal sample of this child was determined by metagenomics sequencing and taxonomy assigned as described in method. The relative abundances were run down the decision trees and a predicted age of growth and development was generated. The predicted age of this child 64.9 month (
Streptococcs
Enterococcus
Eubacterium
Clostridium
Corynebacterium
gordonii
avium
hathewayi
durum
Streptococcus
Enterococcus
Eubacterium
Clostridium
Corynebacterium
gordonii
avium
hathewayi
durum
Chinese children aged 3 to 6 years in Hong Kong (64 ASD versus 64 TD children) were enrolled and examined the effects of host factors on configuration of children's fecal microbiome. Among host factors examined, ASD, chronological age and Body Mass Index (BMI) showed the most significant impact on the fecal microbiome ranked according to effect size (
The gut microbiome composition in children with ASD was altered at multiple taxonomic levels compared with TD children. Microbiome richness was higher in children with ASD than age- and BMI-matched TD children (BMI: 15.31±1.87 versus 15.38±1.42 respectively) (t-test, p-value=0.021,
At the genus level, Clostridium and Coprobacillus were largely enriched in children with ASD (
Gut Bacterium-Bacterium Ecological Network is Impaired in Children with ASD
The ecological interaction of bacterium-bacterium in the gut of ASD and TD group was next assessed by evaluating spearman's correlation between bacterial species. Majority of bacterium-bacterium correlations in both ASD and TD children were positive correlations (
Pathways Related to Neurotransmitter Biosynthesis were Decreased in the Gut Microbiome of ASD
To understand alterations in gut microbiome functions in relation to compositional changes in ASD, the genetic abundance of constituent functional modules (gene families) was profiled using HUMAnN2 (Franzosa et al., Nature methods 15(11): 962-968, 2018). Pathways for essential amino acid biosynthesis (L-threonine, L-isoleucine, L-leucine, L-valine), glucose metabolism, nucleotide biosynthetic and vitamin B biosynthetic were significantly decreased in ASD children (
Collectively, neurotransmitters enable signal transmission across synapses to nerve cells, where synaptic dysfunction is thought to crucially contribute to the pathophysiology of ASD (Zoghbi and Bear (2012), “Synaptic dysfunction in neurodevelopmental disorders associated with autism and intellectual disabilities.” Cold Spring Harbor perspectives in biology 4(3): a009886). As such, changes in these pathways, particularly for tryptophan and glycine anabolism/metabolism, in the functionality of ASD microbiome could lead to abnormal neurotransmitter synthesis and therefore relays to the host. The species Ruminococcus sp. 5_1_39BFAA, Eubacterium rectale and Ruminococcus bromii, Faecalibacterium prausnitzii were dominant contributors to the biosynthesis of L-tryptophan and glycine respectively (
Beyond that, the abundance of microbial genes encoding for glutamate synthase was also significantly decreased in ASD children compared to TD children (
Building upon the gut microbiome functionality profile and clinical parameters of the study subjects, the inventors explored the relationship between the abundance of microbiome functional modules and host factors, via correlation analysis. It was found that age had the most profound effect in shaping the functionality of the gut microbiome in children, as demonstrated by the most abundant associations between children's age, among the examined host factors, and abundances of microbial functional modules (30 significant associations, P value<0.05, Spearman's correlation value>0.2 or <−0.2,
Given the impact of host chronological age on composition and functionality of the gut microbiome, it was hypothesized that age-related bacteria seen in healthy children may develop abnormally in the gut of ASD children. Age-discriminatory taxa were identified in TD children and subsequently investigated their abundance in association with age in ASD children. The relative abundance of fecal bacterial species regressed against the chronologic age of TD children at the time of fecal sample collection, via Random Forest with five times ten-fold cross-validation. Consequently, 26 age-discriminatory bacterial species were discerned, as proxy of “normal” development of children's gut microbiome with age (
To validate their finding, the inventors developed a sparse microbiome-age prediction model as a function of the chronological age in TD children based the abundance of the 26 age-discriminatory species (
Autism spectrum disorder (ASD) is a complex group of developmental disorders characterized by impaired social interactions and communication together with repetitive behaviors. The purpose of this study is to determine bacterial biomarkers for individuals with autism, as well as to pinpoint probiotic/therapeutic bacteria for autism. The gut bacterial profile is different between autistic children and typically developing children. Gut microbiota is regarded as an important factor in the development of ASD. The practical use of this discovery includes predicting risk for autism in children and microbial transfer and/or supplementation as a potential means to improve behavioral symptom in autistic individuals.
A total of 120 Chinese children (aged between 3 and 6 years) were recruited: 61 autism spectrum disorder children and 59 typically developing children. Cohort recruited more male (83%) than female (17%), majority of ASD children were diagnosis with ASD around 3 years old.
The study was approved by The Joint Chinese University of Hong Kong, New Territories East Cluster Clinical Research Ethics Committee (The Joint CUHK-NTEC CREC, CREC Ref. No: 2016.607). All subjects consented to donate fecal samples and to the questionnaire investigation, where written informed consents were obtained. Fecal samples from the study subjects were stored at −80° C. for downstream microbiome analyses.
Families with their children diagnosed with ASD by pediatrician or clinical psychologist according to the standard of the fourth or fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV or DSM-V) will be included. Children without ASD, delays in motor and language development, as well as behaviors as reported by their parents, and those do not have first-degree relatives with ASD will be included as typically developing children. Sixty-five families with children of ASD of Chinese origin and 65 families with typically developing children were divide into 2 groups: case group and control group.
Fecal bacterial DNA was extracted by Maxwell® RSC PureFood GMO and Authentication Kit (Promega) with modifications to increase the yield of DNA. Approximately 100 mg from each stool sample was pretreated: stool sample suspended in 1 ml ddH2O and pelleted by centrifugation at 13,000×g for 1 min. Washed sample was added 800ul TE buffer (PH 7.5), 16 ul beta-Mercaptoethanol and 250U lyticase sufficiently mixed and digestion at 37° C. for 90 minutes. Pelleted by centrifugation at 13,000×g for 3 minutes.
After pretreatment, precipitate was resuspended in 800ul CTAB buffer (Maxwell® RSC PureFood GMO and Authentication Kit following manufacturer's instructions) and mixed well. After samples were heated at 95° C. for 5 minutes and cooled down, nucleic acid were released from the samples by vortexing with 0.5 mm and 0.1 mm beads at 2850 rpm for 15 minutes. Following this, 40ul Proteinase K and 20ul RNase A were added and nucleic acid digested at 70° C. for 10 minutes. Finally, supernatant was obtained after centrifugation at 13,000×g, 5 minutes and placed in a Maxwell® RSC instrument for DNA extraction. The extracted fecal DNA was used for ultra-deep metagenomics sequencing via Ilumina Novoseq 6000 (Novogen, Beijing, China).
Raw sequence reads were trimmed by Trimmomatic1 (Trimmomatic-0.36) firstly and then separation of non-human reads from contaminant host reads. There were some steps to acquire clean reads: 1) Remove adapters; 2) Scan the read with a 4-base wide sliding window, removing reads when the average quality per base drop below 20; 3) Drop reads below the 50 bases long. Trimmed sequence reads were used by KneadData (Reference database: GRCh38 p12) to separate the non-human reads from human reads. Paired-end two reads were concatenated together.
Profiling of the composition of bacterial communities was performed on metagenomic trimmed reads via MetaPhlAn2 (v2.7.5)2. Mapping reads to clade-specific markers gene and annotation of species pangenomes was done through Bowtie2 (v2.3.4.3)3. The output table contained bacterial species and its relative abundance in different levels, from kingdom to strain level.
Primer and probe sequences for the internal control were designed manually on the basis of the conservative fragments in bacterial 16S rRNA genes, and then they were tested using the tool PrimerExpress v3.0 (Applied Biosystems) for determination of Tm, GC content, and possible secondary structures. Degenerate sites were included in the primers and probes to increase target coverage; degenerate sites were not close to 3′ ends of primers and 5′ end of the probes. Amplicon target was nt 1,063-1,193 of the corresponding E. coli genome.
Three bacterial marker candidates identified by previous metagenome sequencing were selected for qPCR quantification, including Alistipes indistinctus (Ai), Anaerotruncus colihominis (Ac) and Eubacterim hallii (Eh). These candidates were identified by AUC value ranking in metagenome study. Primer and probe sequences targeting the gene markers which extracted from MetaPhlAn2 database. Primers were designed using Primer-BLAST in NCBI and probes were designed manually. The primer-probe sets specifically detect the targets and not any other known sequences, as confirmed by Blast search. Each probe carried a 5′reporter dye FAM (6-carboxyfluorescein) or VIC (4,7,20-trichloro-70-phenyl-6-carboxyfluores-cein) and a 3′quencher dye TAMRA (6-carboxytetramethyl-rho-damine). Primers and hydrolysis probes were synthesized by BGI. Nucleotide sequences of the primers and probes are listed below. PCR amplification specificity was confirmed by direct Sanger sequencing of the PCR products.
Alistipes
indistinctus (Ai)
Anaerotruncus
colihominis (Ac)
Eubacterium
hallii (Eh)
Quantitative PCR (qPCR) amplifications were performed in a 20 uL reaction system of TaqMan Universal Master Mix II (Applied Biosystems) containing 0.3 mmol/L of each primer and 0.2 mmol/L of each probe in MicroAmp fast optical 96-well reaction plates (Applied Biosystems) with adhesive sealing. Thermal cycler parameters of an ABI PRISM 7900HT sequence detection system was 95° C. 10 minutes and (95° C. 15 seconds, 60° C. 1 minute) x45 cycles. A positive/reference control and a negative control (H2O as template) were included within every experiment. Measurements were performed in duplicates for each sample. qPCR data was analyzed using the Sequence DetectionSoftware (Applied Biosystems) with manual settings of threshold=0.05 and baseline from 3-15 cycles for all clinical samples. Experiments were disqualified if their negative control Cq value was<42. Data analysis was carried out according to the ΔCq method, with ΔCq=Cqtarget−Cqcontrol and relative abundances=POWER (2−ΔCq).
According to the performance of the classification, the species Alistipes indistinctus and Anaerotruncus colihominis (Table 13) showed higher relative abundance in children with ASD than typically developing children. In contrast, the species Eubacterium hallii (Table 14) was depleted in children with ASD as compared to typically developing children. The performance of each marker alone and in combination are shown in Table 15.
Alistipes indistinctus (Ai)
Anaerotruncus colihominis (Ac)
Eubacterium hallii (Eh)
Alistipes indistinctus
Eubacterium hallii
Anaerotruncus colihominis
These bacterial markers can be used separately or in combination to determine the risk of developing ASD in a subject. Standard control value (relative abundance of bacterial species or their combined scores found in typically developing children) can be established to provide a cut-off value to indicate whether or not the subject being examined has an elevated risk for ASD. For both single markers and combined scores, cutoff values are determined by receiver operating characteristic (ROC) analyses that maximized the Youden index (J=Sensitivity+Specificity−1). Pairwise comparison of areas under ROC (AUROCs) for each method/marker was performed using a nonparametric approach.
For example, the cut-off values of Ai, and Ac in this cohort are 0.000000019 and 0.000000758 respectively. The cut-off value of combined score in this cohort is 0.531 (
All patents, patent applications, and other publications, including GenBank Accession Numbers and the like, cited in this application are incorporated by reference in the entirety for all purposes.
This application claims priority to U.S. Provisional Patent Application No. 63/039,034, filed Jun. 15, 2020, and U.S. Provisional Patent Application No. 63/121,198, filed, Dec. 3, 2020, the contents of all of the above are hereby incorporated by reference in the entirety for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2021/099932 | 6/15/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63121198 | Dec 2020 | US | |
63039034 | Jun 2020 | US |