The ASCII file, entitled 90821SequenceListing.txt, created on Jan. 3, 2022, comprising 335,609 bytes, submitted concurrently with the filing of this application is incorporated herein by reference. The sequence listing submitted herewith is identical to the sequence listing forming part of the international application.
The present invention, in some embodiments thereof, relates to a method of selecting a ruminating animal for a desired hereditable trait based on the presence of particular bacteria in the microbiome thereof.
The bovine rumen microbiome essentially enables the hosting ruminant animal to digest its feed by degrading and fermenting it. In this sense this relationship is unique and different from the host-microbiome interactions that have evolved between in humans and non-herbivorous animals, where such dependence does not exist. This strict obligatory host-microbiome relationship, which was established approximately 50 million years ago, is thought to play a major role in host physiology. Despite its great importance, the impact of natural genetic variation in the host—brought about through sexual reproduction and meiotic recombination—on the complex relationship of rumen microbiome components and host physiological traits is poorly understood. It is known that associations between specific components of the rumen microbiome to animals physiology exist, mainly exemplified by the ability of the animal to harvest energy from its feed [Kruger Ben Shabat S, et al., 2016. ISME J 10:2958-2972].
These recent findings position the bovine rumen microbiome as the new frontier in the effort to increase the feed efficiency of milking cows. As human population is continually increasing this could have important implications for food security issues as an effort towards replenishing food sources available for human consumption while lowering environmental impact in global scale. Despite its great importance, the complex relationship of rumen microbiome components and host genetics and physiology is poorly understood.
Background art includes WO2019/030752, WO2017/187433 and WO2014/141274, Guan L L, et al., 2008. FEMS Microbiology Letters 288:85-9; Roehe R, et al., 2016. PLoS Genet 12:e1005846; Li Z, et al., 2016. Microbiology Reports 8:1016-102.
According to an aspect of some embodiments of the present invention there is provided a method of selecting a ruminating animal having a desirable, hereditable trait comprising analyzing in the microbiome of the animal for an amount of at least one hereditable bacteria which is associated with the hereditable trait, wherein the amount of the hereditable bacteria is indicative as to whether the animal has a desirable hereditable trait, wherein the hereditable bacteria is of any one of the operational taxonomic units (OTUs) set forth in Table 1, wherein the trait is the corresponding trait to the at least one hereditable bacteria as set forth in Table 1, thereby selecting the ruminating animal having a desirable hereditable trait.
According to an aspect of some embodiments of the present invention there is provided a method of managing a herd of ruminating animals comprising:
(a) analyzing in the microbiome of a ruminating animal of the herd for an amount of at least one hereditable bacteria which is associated with the hereditable trait, wherein the amount of the hereditable bacteria is indicative that the animal has a non-desirable hereditable trait, wherein the hereditable bacteria is of any one of the operational taxonomic units (OTUs) set forth in Table 1, wherein the trait is the corresponding trait to the at least one hereditable bacteria as set forth in Table 1; and
(b) removing the animal with the non-desirable trait from the herd.
According to an aspect of some embodiments of the present invention there is provided a method for breeding a ruminating animal comprising breeding a ruminating animal that has been selected according to the methods described herein, thereby breeding the ruminating animal.
According to an aspect of some embodiments of the present invention there is provided a method of increasing the number of ruminating animals having a desirable microbiome comprising breeding a male and female of the ruminating animals, wherein the rumen microbiome of either of the male and/or the female ruminating animals comprises a hereditable microorganism having an OTU as set forth in Table 3 above a predetermined level, thereby increasing the number of ruminating animals having a desirable microbiome.
According to an aspect of some embodiments of the present invention there is provided a method of altering a trait of a ruminating animal comprising providing a microbial composition to the ruminating animal which comprises at least one microbe having an operational taxonomic unit (OTU) set forth in Table 2 and having a 16S rRNA sequence as set forth in SEQ ID NOs: 38-50 and 314-615, thereby altering the trait of the ruminating animal, wherein the microbial composition does not comprise a microbiome of the ruminating animal, wherein the trait is the corresponding trait to the at least one microbe as set forth in Table 2.
According to an aspect of some embodiments of the present invention there is provided a method of altering a trait of a ruminating animal comprising providing an agent which specifically downregulates an OTU set forth in Table 2 to the ruminating animal, thereby altering the trait of the ruminating animal, wherein the trait is the corresponding trait to the at least one microbe as set forth in Table 2.
According to an aspect of some embodiments of the present invention there is provided a microbial composition comprising at least one microbe having an OTU set forth in Table 2, the microbial composition not being a microbiome.
According to embodiments of the present invention, the hereditable bacteria is of the family lachnospiraceae or of the genus Prevotella.
According to embodiments of the present invention, the ruminating animal is a cow.
According to embodiments of the present invention, the method further comprises using the selected animal or a progeny thereof for breeding.
According to embodiments of the present invention, the analyzing an amount is effected by analyzing the expression of at least one gene of the genome of the at least one bacteria.
According to embodiments of the present invention, the analyzing an amount is effected by sequencing the DNA derived from a sample of the microbiome.
According to embodiments of the present invention, the microbiome comprises a rumen microbiome or a fecal microbiome.
According to embodiments of the present invention, the ruminating animal that has been selected is a female ruminating animal, the method comprises artificially inseminating the female ruminating animal with semen from a male ruminating animal.
According to embodiments of the present invention, the male ruminating animal has been selected according to the methods described herein.
According to embodiments of the present invention, when the ruminating animal that has been selected is a male ruminating animal, the method comprises inseminating a female ruminating animal with semen of the male ruminating animal.
According to embodiments of the present invention, the hereditable microorganism is associated with a hereditable trait.
According to embodiments of the present invention, the microbial composition comprises no more than 20 microbial species.
According to embodiments of the present invention, the microbial composition comprises no more than 50 microbial species.
According to embodiments of the present invention, the at least one microbe has an OTU set forth in Table 1.
According to embodiments of the present invention, the at least one microbe has a 16S rRNA sequence as set forth in SEQ ID NOs: 7-37 and 51-313.
According to embodiments of the present invention, the at least one microbe has an OTU set forth in Table 1.
According to embodiments of the present invention, the microbial composition comprises no more than 15 bacterial species.
According to embodiments of the present invention, the microbial composition comprises no more than 20 bacterial species.
Unless otherwise defined, all technical and/or scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, exemplary methods and/or materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be necessarily limiting.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description taken with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.
The present invention, in some embodiments thereof, relates to a method of selecting a ruminating animal for a desired hereditable trait based on the presence of particular bacteria in the microbiome thereof.
Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not necessarily limited in its application to the details set forth in the following description or exemplified by the Examples. The invention is capable of other embodiments or of being practiced or carried out in various ways.
Ruminants sustain a long-lasting obligatory relationship with their rumen microbiome dating back 50 million years. In this unique host-microbiome relationship the host's ability to digest its feed is completely dependent on its coevolved microbiome. This extraordinary alliance raises questions regarding the dependence between ruminants' genetics and physiology and the rumen microbiome structure, composition and metabolism. To elucidate this relationship, the present inventors examined association of host genetics to phylogenetic and functional composition of the rumen microbiome. They accomplished this by studying a population of 1000 cows in four different European countries, using a combination of rumen microbiota data and other phenotypes from each animal with genotypic data from a subset of animals. This very large population size uncovered novel and unexpected bacteria that can be used to regulate desirable traits in these animals.
Thus, according to a first aspect of the present invention there is provided a method of selecting a ruminating animal having a desirable, hereditable trait comprising analyzing in the microbiome of the animal for an amount of at least one hereditable bacteria which is associated with the hereditable trait, wherein the amount of the hereditable bacteria is indicative as to whether the animal has a desirable hereditable trait, wherein the hereditable bacteria is of any one of the operational taxonomic units (OTUs) set forth in Table 1, wherein the trait is the corresponding trait to the at least one hereditable bacteria as set forth in Table 1, thereby selecting the ruminating animal having a desirable hereditable trait.
Ruminating animals contemplated by the present invention include for example cattle (e.g. cows), goats, sheep, giraffes, American Bison, European Bison, yaks, water buffalo, deer, camels, alpacas, llamas, wildebeest, antelope, pronghorn, and nilgai.
According to a particular embodiment, the ruminating animal is a bovine cow or bull—e.g. Bos taurus bovines or Holstein-Friesian bovines.
According to a particular embodiment, the animal which is selected is a newborn, typically not more than one day old. According to another embodiment, the animal which is selected is not more than two days old. According to another embodiment, the animal which is selected is not more than three days old. According to another embodiment, the animal which is selected is not more than 1 week old. According to another embodiment, the animal which is selected is not more than 2 weeks old. According to another embodiment, the animal which is selected is not more than 1 month old. According to another embodiment, the animal which is selected is not more than 3 months old. According to still another embodiment, the animal is an adult.
The phrase “hereditable trait” (also referred to as “heritable trait”) as used herein, refers to a trait of which the variation between the individuals in a given population is due in part (or in whole) to genetic variation. Due to these genetic variations, the relative or absolute abundance of particular microbial populations in the microbiome (which serve as markers) is similar from one generation to the next generation in a statistically significant manner.
A microorganism can be classified as being hereditable when changes in its abundance amongst a group of animals can be explained by the genetic variance amongst the animals.
Statistical methods which can be used in the context of the present invention include, but are not limited to Single component GRM approach, MAF-Stratified GREML (GREMLLMS), LDL and MAF-Stratified GREML (GREMLLLDMS), Single Component and MAF-Stratified LD-Adjusted Kinships (LDAK-SC and LDAK-MS), Extended Genealogy with Thresholded GRMs, Treelet Covariance Smoothing (TCS), LD-Score Regression and BOLT-REML.
According to a particular embodiment, the hereditable bacteria is set forth in Table 1, herein below. Thus, for example the hereditable bacteria may belong to the family lachnospiraceae or to the genus Prevotella.
In one embodiment, the trait is the corresponding trait to the bacteria as set forth in Table 1. Thus, the trait may be rumen propionate, rumen acetate, rumen butyrate, milk lactose, milk yield, milk fat, rumen pH and rumen Beta-Hydroxybutyric Acid (BHB).
Table 1, herein below also provides the correlation between the host trait and the amount of the particular bacteria in the rumen microbiome. Thus, for example, the first row of Table 1 relates to a bacteria (having a 16S rRNA sequence as set forth in SEQ ID NO: 7) whose abundance negatively correlates with rumen propionate. If the desired trait is low rumen propionate, the selected animal will have an amount of bacteria having a 16S rRNA sequence as set forth in SEQ ID NO: 7 above a predetermined level. If the desired trait is high rumen propionate, the selected animal will have an amount of bacteria having a 16S rRNA sequence as set forth in SEQ ID NO: 7 below a predetermined level. The other bacteria in Table 1 and their corresponding traits can be selected in the same way.
According to one embodiment, an animal can be classified as having a low trait (e.g. one that appears in Tables 1 or 2) when it has at least 0.05, 1, 2, 3, 4, 5 or even 6 standard deviations below the average amount of that trait of the herd (with a herd being at least 15 animals).
According to one embodiment, an animal can be classified as having a high trait (e.g. one that appears in Tables 1 or 2) when it has at least 0.05, 0.5, 1, 2, 3, 4, 5, or even 6 standard deviations above the average amount of that trait of the herd (with a herd being at least 15 animals).
The term “microbiome” as used herein, refers to the totality of microbes (bacteria, fungi, protists), their genetic elements (genomes) in a defined environment.
A microbiota sample comprises a sample of microbes and or components or products thereof from a microbiome.
According to a particular embodiment, the microbiome is a rumen microbiome. In still other embodiments, the microbiome is a fecal microbiome.
According to another embodiment, the microbiome is derived from a healthy animal (i.e. the microbiome is a non-pathogenic microbiome).
In order to analyze the microbes of a microbiome, a microbiota sample is collected from the animal. This is carried out by any means that allow recovery of microbes or components or products thereof of a microbiome and is appropriate to the relevant microbiome source e.g. rumen.
Rumen may be collected using methods known in the art and include for example use of a stomach tube with a rumen vacuum sampler. Typically rumen is collected after feeding.
In some embodiments, in lieu of analyzing a rumen sample, a fecal sample is used which mirrors the microbiome of the rumen. Thus, in this embodiment, a fecal microbiome is analyzed.
According to one embodiment of this aspect of the present invention, the abundance of particular bacterial taxa are analyzed in a microbiota sample.
Methods of quantifying levels of microbes (e.g. bacteria) of various taxa are described herein below.
In some embodiments, determining a level or set of levels of one or more types of microbes or components or products thereof comprises determining a level or set of levels of one or more DNA sequences. In some embodiments, one or more DNA sequences comprise any DNA sequence that can be used to differentiate between different microbial types. In certain embodiments, one or more DNA sequences comprise 16S rRNA gene sequences. In certain embodiments, one or more DNA sequences comprise 18S rRNA gene sequences. In some embodiments, 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, 100, 1,000, 5,000 or more sequences are amplified.
Taxonomy assignment of species may be performed using a suitable computer program (e.g. BLAST) against the appropriate reference database (e.g. 16S rRNA reference database).
In determining whether a nucleic acid or protein is substantially homologous or shares a certain percentage of sequence identity with a sequence of the invention, sequence similarity may be defined by conventional algorithms, which typically allow introduction of a small number of gaps in order to achieve the best fit. In particular, “percent identity” of two polypeptides or two nucleic acid sequences is determined using the algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264-2268, 1993). Such an algorithm is incorporated into the BLASTN and BLASTX programs of Altschul et al. (J. Mol. Biol. 215:403-410, 1990). BLAST nucleotide searches may be performed with the BLASTN program to obtain nucleotide sequences homologous to a nucleic acid molecule of the invention. Equally, BLAST protein searches may be performed with the BLASTX program to obtain amino acid sequences that are homologous to a polypeptide of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST is utilized as described in Altschul et al. (Nucleic Acids Res. 25:3389-3402, 1997). When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., BLASTX and BLASTN) are employed.
According to one embodiment, in order to classify a microbe as belonging to a particular genus, it must comprise at least 90% sequence homology, at least 91% sequence homology, at least 92% sequence homology, at least 93% sequence homology, at least 94% sequence homology, at least 95% sequence homology, at least 96% sequence homology, at least 97% sequence homology, at least 98% sequence homology, at least 99% sequence homology to a reference microbe known to belong to the particular genus. According to a particular embodiment, the sequence homology is at least 95%.
According to another embodiment, in order to classify a microbe as belonging to a particular species, it must comprise at least 90% sequence homology, at least 91% sequence homology, at least 92% sequence homology, at least 93% sequence homology, at least 94% sequence homology, at least 95% sequence homology, at least 96% sequence homology, at least 97% sequence homology, at least 98% sequence homology, at least 99% sequence homology to a reference microbe known to belong to the particular species. According to a particular embodiment, the sequence homology is at least 97%.
In some embodiments, a microbiota sample is directly assayed for a level or set of levels of one or more DNA sequences. In some embodiments, DNA is isolated from a microbiota sample and isolated DNA is assayed for a level or set of levels of one or more DNA sequences. Methods of isolating microbial DNA are well known in the art. Examples include but are not limited to phenol-chloroform extraction and a wide variety of commercially available kits, including QJAamp DNA Stool Mini Kit (Qiagen, Valencia, Calif.).
In some embodiments, a level or set of levels of one or more DNA sequences is determined by amplifying DNA sequences using PCR (e.g., standard PCR, semi-quantitative, or quantitative PCR). In some embodiments, a level or set of levels of one or more DNA sequences is determined by amplifying DNA sequences using quantitative PCR. These and other basic DNA amplification procedures are well known to practitioners in the art and are described in Ausebel et al. (Ausubel F M, Brent R, Kingston R E, Moore D, Seidman J G, Smith J A, Struhl K (eds). 1998. Current Protocols in Molecular Biology. Wiley: New York).
In some embodiments, DNA sequences are amplified using primers specific for one or more sequence that differentiate(s) individual microbial types from other, different microbial types. In some embodiments, 16S rRNA gene sequences or fragments thereof are amplified using primers specific for 16S rRNA gene sequences. In some embodiments, 18S DNA sequences are amplified using primers specific for 18S DNA sequences.
In some embodiments, a level or set of levels of one or more 16S rRNA gene sequences is determined using phylochip technology. Use of phylochips is well known in the art and is described in Hazen et al. (“Deep-sea oil plume enriches indigenous oil-degrading bacteria.” Science, 330, 204-208, 2010), the entirety of which is incorporated by reference. Briefly, 16S rRNA genes sequences are amplified and labeled from DNA extracted from a microbiota sample. Amplified DNA is then hybridized to an array containing probes for microbial 16S rRNA genes. Level of binding to each probe is then quantified providing a sample level of microbial type corresponding to 16S rRNA gene sequence probed. In some embodiments, phylochip analysis is performed by a commercial vendor. Examples include but are not limited to Second Genome Inc. (San Francisco, Calif.).
In some embodiments, determining a level or set of levels of one or more types of microbes or components or products thereof comprises determining a level or set of levels of one or more microbial RNA molecules (e.g., transcripts). Methods of quantifying levels of RNA transcripts are well known in the art and include but are not limited to northern analysis, semi-quantitative reverse transcriptase PCR, quantitative reverse transcriptase PCR, and microarray analysis. These and other basic RNA transcript detection procedures are described in Ausebel et al. (Ausubel F M, Brent R, Kingston R E, Moore D D, Seidman J G, Smith J A, Struhl K (eds). 1998. Current Protocols in Molecular Biology. Wiley: New York).
In some embodiments, determining a level or set of levels of one or more types of microbes or components or products thereof comprises determining a level or set of levels of one or more microbial proteins. Methods of quantifying protein levels are well known in the art and include but are not limited to western analysis and mass spectrometry. These and all other basic protein detection procedures are described in Ausebel et al. (Ausubel F M, Brent R, Kingston R E, Moore D D, Seidman J G, Smith J A, Struhl K (eds). 1998. Current Protocols in Molecular Biology. Wiley: New York). In some embodiments, determining a level or set of levels of one or more types of microbes or components or products thereof comprises determining a level or set of levels of one or more microbial metabolites. In some embodiments, levels of metabolites are determined by mass spectrometry. In some embodiments, levels of metabolites are determined by nuclear magnetic resonance spectroscopy. In some embodiments, levels of metabolites are determined by enzyme-linked immunosorbent assay (ELISA). In some embodiments, levels of metabolites are determined by colorimetry. In some embodiments, levels of metabolites are determined by spectrophotometry.
In some embodiments, what is determined is the distribution of microbial families within the microbiome. However, characterization may be carried to more detailed levels, e.g. to the level of genus and/or species, and/or to the level of strain or variation (e.g. variants) within a species, if desired (including the presence or absence of various genetic elements such as genes, the presence or absence of plasmids, etc.). Alternatively, higher taxanomic designations can be used such as Phyla, Class, or Order. The objective is to identify which microbes (usually bacteria, but also optionally fungi (e.g. yeasts), protists, etc.) are present in the sample from the ruminating animal and the relative distributions of those microbes, e.g. expressed as a percentage of the total number of microbes that are present, thereby establishing a micro floral pattern or signature for the animal being tested.
In other embodiments of the invention, when many taxa are being considered, the overall pattern of microflora is assessed, i.e. not only are particular taxa identified, but the percentage of each constituent taxon is taken in account, in comparison to all taxa that are detected and, usually, or optionally, to each other. Those of skill in the art will recognize that many possible ways of expressing or compiling such data exist, all of which are encompassed by the present invention. For example, a “pie chart” format may be used to depict a microfloral signature; or the relationships may be expressed numerically or graphically as ratios or percentages of all taxa detected, etc. Further, the data may be manipulated so that only selected subsets of the taxa are considered (e.g. key indicators with strong positive correlations). Data may be expressed, e.g. as a percentage of the total number of microbes detected, or as a weight percentage, etc.
In order to identify microbial species where significant proportions of their variation in abundance profiles can be attributed to heritable genetic factors, the microbiota sample is analyzed so as to uncover taxa (e.g. species) of microbes showing similar abundance (either absolute or relative) in animals that share a similar genetic background.
Methods of analyzing the similarity of the genetic background of two ruminating animals may be carried out using genotyping assays known in the art.
As used herein, the term “genotyping’ refers to the process of determining genetic variations among individuals in a species. Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation that are used for genotyping and by definition are single-base differences at a specific locus that is found in more than 1% of the population. SNPs are found in both coding and non-coding regions of the genome and can be associated with a phenotypic trait of interest such as a quantitative phenotypic trait of interest. Hence, SNPs can be used as markers for quantitative phenotypic traits of interest. Another common type of genetic variation that are used for genotyping are “InDels” or insertions and deletions of nucleotides of varying length. For both SNP and InDel genotyping, many methods exist to determine genotype among individuals. The chosen method generally depends on the throughput needed, which is a function of both the number of individuals being genotyped and the number of genotypes being tested for each individual. The chosen method also depends on the amount of sample material available from each individual or sample. For example, sequencing may be used for determining presence or absence of markers such as SNPs, e.g. such as Sanger sequencing and High Throughput Sequencing technologies (HTS). Sanger sequencing may involve sequencing via detection through (capillary) electrophoresis, in which up to 384 capillaries may be sequence analysed in one run. High throughput sequencing involves the parallel sequencing of thousands or millions or more sequences at once. HTS can be defined as Next Generation sequencing, i.e. techniques based on solid phase pyrosequencing or as Next-Next Generation sequencing based on single nucleotide real time sequencing (SMRT). HTS technologies are available such as offered by Roche, Illumina and Applied Biosystems (Life Technologies). Further high throughput sequencing technologies are described by and/or available from Helicos, Pacific Biosciences, Complete Genomics, Ion Torrent Systems, Oxford Nanopore Technologies, Nabsys, ZS Genetics, GnuBio. Each of these sequencing technologies have their own way of preparing samples prior to the actual sequencing step. These steps may be included in the high throughput sequencing method. In certain cases, steps that are particular for the sequencing step may be integrated in the sample preparation protocol prior to the actual sequencing step for reasons of efficiency or economy. For instance, adapters that are ligated to fragments may contain sections that can be used in subsequent sequencing steps (so-called sequencing adapters). Primers that are used to amplify a subset of fragments prior to sequencing may contain parts within their sequence that introduce sections that can later be used in the sequencing step, for instance by introducing through an amplification step a sequencing adapter or a capturing moiety in an amplicon that can be used in a subsequent sequencing step. Depending also on the sequencing technology used, amplification steps may be omitted.
Low density and high density chips are contemplated for use with the invention, including SNP arrays comprising from 3,000 to 800,000 SNPs. By way of example, a “50K” SNP chip measures approximately 50,000 SNPs and is commonly used in the livestock industry to establish genetic merit or genomic estimated breeding values (GEBVs). In certain embodiments of the invention, any of the following SNP chips may be used: BovineSNP50 v1 BeadChip (Illumina), Bovine SNP v2 BeadChip (Illumina), Bovine 3K BeadChip (Illumina), Bovine LD BeadChip (Illumina), Bovine HD BeadChip (Illumina), Geneseek® Genomic Profiler™ LD BeadChip, or Geneseek® Genomic Profiler™ HD BeadChip.
In one embodiment, in order to measure the genetic similarity between the animals the genetic relatedness between the animals based on the SNP data is calculated. To this end a matrix that estimates the genetic relatedness between each unique pair of animals can be produced. This matrix is based on the count of shared alleles, weighted by the allele's rareness:
where Ajk represents the genetic relationship estimate between animals j and k; xij and xik are the counts of the reference alleles in animals j and k, respectively; pi is the proportion of the reference allele in the population; and n is the total number of SNPs used for the relatedness estimation.
In one embodiment, microbes or OTUs that exhibits a significant heritable component are considered as such if their heritability estimate is of >0.01 and P value of <0.1. It will be appreciated that the confidence level may be increased or decreased according to the stringency of the test. Thus, for example in another embodiment, microbes that exhibits a significant heritable component are considered as such if their heritability estimate is of >0.01 and P value of <0.05. Other contemplated heritability estimates contemplated by the present inventors include >0.02 and P value of <0.1, >0.03 and P value of <0.1, >0.04 and P value of <0.1, >0.05 and P value of <0.1, >0.06 and P value of <0.1, >0.07 and P value of <0.1, >0.08 and P value of <0.1, >0.09 and P value of <0.1, >0.1 and P value of <0.1, >0.2 and P value of <0.1, >0.3 and P value of <0.1, >0.4 and P value of <0.1, >0.5 and P value of <0.1, >0.6 and P value of <0.1, >0.7 and P value of <0.1, >0.8 and P value of <0.1.
Other contemplated heritability estimates contemplated by the present inventors include >0.02 and P value of <0.05, >0.03 and P value of <0.05, >0.04 and P value of <0.05, >0.05 and P value of <0.05, >0.06 and P value of <0.05, >0.07 and P value of <0.05, >0.08 and P value of <0.05, >0.09 and P value of 0.05, >0.1 and P value of 0.05, >0.2 and P value of 0.05, >0.3 and P value of 0.05, >0.4 and P value of 0.05, >0.5 and P value of 0.05, >0.6 and P value of 0.05, >0.7 and P value of 0.05, >0.8 and P value of 0.05.
According to a particular embodiment, the heritability estimate is >0.7 and a P value of <0.05.
To increase the confidence of the analysis, the heritability analysis may be limited exclusively to bacterial taxa which are present in at least 20%, 25%, 30%, 40%, 50% or higher of the genotyped subset. In addition, heritability analyses for each bacterial taxa may be performed a number of times, e.g. on a number of different sampling days (e.g. 2, 3, 4, 5, or more days). Only bacterial taxa that exhibited a significant heritable component (e.g. heritability estimate of >0.7 and p-value <0.05) in all individual sampling days, could be considered as heritable.
The term “OTU” as used herein, refers to a terminal leaf in a phylogenetic tree and is defined by a nucleic acid sequence, e.g., the entire genome, or a specific genetic sequence, and all sequences that share sequence identity to this nucleic acid sequence at the level of species. In some embodiments the specific genetic sequence may be the 16S sequence or a portion of the 16S sequence. In other embodiments, the entire genomes of two entities are sequenced and compared. In another embodiment, select regions such as multilocus sequence tags (MLST), specific genes, or sets of genes may be genetically compared. In 16S embodiments, OTUs that share greater than 97% average nucleotide identity across the entire 16S or some variable region of the 16S are considered the same OTU. See e.g., Claesson et al., 2010. Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions. Nucleic Acids Res 38: e200. Konstantinidis et al., 2006. The bacterial species definition in the genomic era. Philos Trans R Soc Lond B Biol Sci 361: 1929-1940. In embodiments involving the complete genome, MLSTs, specific genes, other than 16S, or sets of genes OTUs that share, greater than 95% average nucleotide identity are considered the same OTU. See e.g., Achtman and Wagner. 2008. Microbial diversity and the genetic nature of microbial species. Nat. Rev. Microbiol. 6: 431-440; Konstantinidis et al., 2006, supra. The bacterial species definition in the genomic era. Philos Trans R Soc Lond B Biol Sci 361: 1929-1940. OTUs can be defined by comparing sequences between organisms. Generally, sequences with less than 95% sequence identity are not considered to form part of the same OTU. OTUs may also be characterized by any combination of nucleotide markers or genes, in particular highly conserved genes (e.g., “house-keeping” genes), or a combination thereof. Such characterization employs, e.g., WGS data or a whole genome sequence. As used herein, a “type” of bacterium refers to an OTU that can be at the level of a strain, species, clade, or family.
The present invention further contemplates analysing a plurality of the above described OTUs. Thus, at least one OTU, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least 11, at least 12, at least 13, at least 14, at least 15 or all of the above described OTUs are analysed.
It will be appreciated that once the animal has been classified as having sufficient quantity of a heritable microorganism that correlates with a desirable phenotype, it may be selected (e.g. separated from the rest of the herd) and classified as having that phenotype. According to one embodiment, the animal branded such that it is clear that it comprises this phenotype.
As well as selecting the particular animal which has the desirable phenotype, the present inventors also contemplate removing (e.g. culling) animals from a herd that do not have the desirable phenotype. The animal may be branded as having the non-desirable phenotype. Thus, the present invention may be used to manage herds ensuring that the percentage of animals with a desirable phenotype in the herd is at its maximum and/or the percentage of animals with a non-desirable phenotype in the herd is at its minimum.
In one embodiment, the animal that has been deemed as having a desirable trait is selected as a candidate for breeding. Thus, the animal may be deemed suitable as a gamete donor for natural mating, artificial insemination or in vitro fertilization.
Thus, according to another aspect of the present invention there is provided a method for breeding a ruminating animal comprising: inseminating a female ruminating animal that has been selected according to the methods described herein with semen from a male ruminating animal, thereby breeding the ruminating animal.
In one embodiment, the male ruminating animal has also been selected as described herein.
According to another aspect of the present invention there is provided a method for breeding a ruminating animal comprising: inseminating a female ruminating animal with semen from a male ruminating animal that has been selected as described herein above, thereby breeding the ruminating animal.
The breeding of the one or more bovine bulls with the bovine cows is preferably by artificial insemination, but may alternatively be by natural insemination.
In one embodiment, the female ruminating animal has also been selected as described herein.
The present inventors have uncovered additional hereditable bacteria in the rumen microbiome. The hereditable bacteria are summarized in Table 3. By breeding animals that have rumen microbiomes containing one of these hereditable bacteria, it is possible to ensure that offspring of that animal will also contain that bacteria in their rumen microbiome. If the hereditable bacteria are associated with a particular trait (see Table 1), then by breeding animals that have rumen microbiomes containing one of these hereditable bacteria and the associated trait, it is possible to ensure that offspring of that animal will also contain that bacteria in their rumen microbiome, and therefore by virtue that trait.
Thus, according to another aspect of the present invention there is provided a method of increasing the number of ruminating animals having a desirable microbiome comprising breeding a male and female of said ruminating animals, wherein the rumen microbiome of either of said male and/or said female ruminating animals comprises a hereditable microorganism having an OTU as set forth in Table 3 above a predetermined level, thereby increasing the number of ruminating animals having a desirable microbiome.
As mentioned herein above, as well as selecting the particular animal which has the desirable microbiome, the present inventors also contemplate removing (e.g. culling) animals from a herd that do not have the desirable microbiome. Thus, the present invention may be used to manage herds ensuring that the percentage of animals with a desirable microbiome in the herd is at its maximum and/or the percentage of animals with a non-desirable microbiome in the herd is at its minimum.
The present inventors have also uncovered numerous bacteria that are associated with traits. Accordingly, the present inventors propose dictating the trait of a ruminating animal by altering its rumen microbiome.
According to this aspect of the present invention, the desirable microbiome is a microbiome which comprises a hereditable bacteria. Thus, the present inventors conceive that the hereditable bacteria itself may be considered as a hereditable trait.
Thus, according to another aspect of the present invention, there is provided a method of altering a trait of a ruminating animal comprising providing a microbial composition to the ruminating animal which comprises at least one microbe having an operational taxonomic unit (OTU) set forth in Table 2 and having a 16S rRNA sequence as set forth in SEQ ID NOs: 38-50 and 314-615, thereby altering the trait of the ruminating animal, wherein the microbial composition does not comprise a microbiome of the ruminating animal, wherein the trait is the corresponding trait to said at least one microbe as set forth in Table 2.
According to a particular embodiment, the bacteria is one that has a 16S rRNA sequence as set forth in SEQ ID NOs: 38-50 and 314-615.
In one embodiment, the microbial composition comprises at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least, eight, at least nine, at least ten, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19 at least 20 or more microbial species mentioned in Table 2.
Preferably, the microbial compositions of this aspect of the present invention comprise at least two microbial species. In one embodiment, the microbial compositions of this aspect of the present invention comprise less than 100 microbial species, less than 50 microbial species, less than 40 microbial species, less than 30 microbial species. Exemplary ranges of microbial species include 2-100, 2-50, 2-25, 2-20, 2-15. 2-10.
The microbial composition may be derived directly from a microbiota sample of the high energy efficient animal. Alternatively, the microbial composition may be artificially created by adding known amounts of different microbes. It will be appreciated that the microbial composition which is derived from the microbiota sample of an animal may be manipulated prior to administrating by increasing the amount of a particular species (e.g. increasing the amount of/or depleting the amount of a particular species). In another embodiment, the microbial compositions are not treated in any way which serves to alter the relative balance between the microbial species and taxa comprised therein. In some embodiments, the microbial composition is expanded ex vivo using known culturing methods prior to administration. In other embodiments, the microbial composition is not expanded ex vivo prior to administration.
According to one embodiment, the microbial composition is not derived from fecal material.
According to still another embodiment, the microbial composition is devoid (or comprises only trace quantities) of fecal material (e.g., fiber).
Prior to administration, the animal may be pretreated with an agent which reduces the number of naturally occurring rumen microbiome (e.g. by antibiotic treatment). According to a particular embodiment, the treatment significantly eliminates the naturally occurring rumen microflora by at least 20%, 30% 40%, 50%, 60%, 70%, 80% or even 90%.
As well as increasing the above mentioned bacterial populations in the rumen microbiome of the animals, the present inventors further contemplate decreasing any one of the bacterial species set forth in Table 2 herein below to alter a corresponding trait.
According to a particular embodiment, the bacteria has a 16S rRNA sequence as set forth in SEQ ID NOs: 1-37 and 51-313.
According to one embodiment, the agent which decreases the abundance of a bacteria is not an antibiotic agent.
According to another embodiment, the agent which decreases the abundance of the bacteria is an antimicrobial peptide.
According to still another embodiment, the agent which decreases the abundance of a bacteria is a bacteriophage.
According to still another embodiment, the agent which decreases the abundance of a bacteria is capable of downregulating an essential gene of at least one of the bacterial species described herein below.
Thus, for example, the present inventors contemplate the use of meganucleases, such as Zinc finger nucleases (ZFNs), transcription-activator like effector nucleases (TALENs) and CRISPR/Cas system to downregulate the essential gene.
CRISPR-Cas system—Many bacteria and archea contain endogenous RNA-based adaptive immune systems that can degrade nucleic acids of invading phages and plasmids. These systems consist of clustered regularly interspaced short palindromic repeat (CRISPR) genes that produce RNA components and CRISPR associated (Cas) genes that encode protein components. The CRISPR RNAs (crRNAs) contain short stretches of homology to specific viruses and plasmids and act as guides to direct Cas nucleases to degrade the complementary nucleic acids of the corresponding pathogen. Studies of the type II CRISPR/Cas system of Streptococcus pyogenes have shown that three components form an RNA/protein complex and together are sufficient for sequence-specific nuclease activity: the Cas9 nuclease, a crRNA containing 20 base pairs of homology to the target sequence, and a trans-activating crRNA (tracrRNA) (Jinek et al. Science (2012) 337: 816-821.). It was further demonstrated that a synthetic chimeric guide RNA (gRNA) composed of a fusion between crRNA and tracrRNA could direct Cas9 to cleave DNA targets that are complementary to the crRNA in vitro. It was also demonstrated that transient expression of Cas9 in conjunction with synthetic gRNAs can be used to produce targeted double-stranded brakes in a variety of different species (Cho et al., 2013; Cong et al., 2013; DiCarlo et al., 2013; Hwang et al., 2013a,b; Jinek et al., 2013; Mali et al., 2013).
The CRIPSR/Cas system for genome editing contains two distinct components: a gRNA and an endonuclease e.g. Cas9.
The gRNA is typically a 20 nucleotide sequence encoding a combination of the target homologous sequence (crRNA) and the endogenous bacterial RNA that links the crRNA to the Cas9 nuclease (tracrRNA) in a single chimeric transcript. The gRNA/Cas9 complex is recruited to the target sequence by the base-pairing between the gRNA sequence and the complement genomic DNA. For successful binding of Cas9, the genomic target sequence must also contain the correct Protospacer Adjacent Motif (PAM) sequence immediately following the target sequence. The binding of the gRNA/Cas9 complex localizes the Cas9 to the genomic target sequence so that the Cas9 can cut both strands of the DNA causing a double-strand break. Just as with ZFNs and TALENs, the double-stranded brakes produced by CRISPR/Cas can undergo homologous recombination or NHEJ.
The Cas9 nuclease has two functional domains: RuvC and HNH, each cutting a different DNA strand. When both of these domains are active, the Cas9 causes double strand breaks in the genomic DNA.
A significant advantage of CRISPR/Cas is that the high efficiency of this system coupled with the ability to easily create synthetic gRNAs enables multiple genes to be targeted simultaneously. In addition, the majority of cells carrying the mutation present biallelic mutations in the targeted genes.
However, apparent flexibility in the base-pairing interactions between the gRNA sequence and the genomic DNA target sequence allows imperfect matches to the target sequence to be cut by Cas9.
Modified versions of the Cas9 enzyme containing a single inactive catalytic domain, either RuvC- or HNH-, are called ‘nickases’. With only one active nuclease domain, the Cas9 nickase cuts only one strand of the target DNA, creating a single-strand break or ‘nick’. A single-strand break, or nick, is normally quickly repaired through the HDR pathway, using the intact complementary DNA strand as the template. However, two proximal, opposite strand nicks introduced by a Cas9 nickase are treated as a double-strand break, in what is often referred to as a ‘double nick’ CRISPR system. A double-nick can be repaired by either NHEJ or HDR depending on the desired effect on the gene target. Thus, if specificity and reduced off-target effects are crucial, using the Cas9 nickase to create a double-nick by designing two gRNAs with target sequences in close proximity and on opposite strands of the genomic DNA would decrease off-target effect as either gRNA alone will result in nicks that will not change the genomic DNA.
Modified versions of the Cas9 enzyme containing two inactive catalytic domains (dead Cas9, or dCas9) have no nuclease activity while still able to bind to DNA based on gRNA specificity. The dCas9 can be utilized as a platform for DNA transcriptional regulators to activate or repress gene expression by fusing the inactive enzyme to known regulatory domains. For example, the binding of dCas9 alone to a target sequence in genomic DNA can interfere with gene transcription.
There are a number of publically available tools available to help choose and/or design target sequences as well as lists of bioinformatically determined unique gRNAs for different genes in different species such as the Feng Zhang lab's Target Finder, the Michael Boutros lab's Target Finder (E-CRISP), the RGEN Tools: Cas-OFFinder, the CasFinder: Flexible algorithm for identifying specific Cas9 targets in genomes and the CRISPR Optimal Target Finder.
In order to use the CRISPR system, both gRNA and Cas9 should be expressed in a target cell. The insertion vector can contain both cassettes on a single plasmid or the cassettes are expressed from two separate plasmids. CRISPR plasmids are commercially available such as the px330 plasmid from Addgene.
The compositions described herein (e.g. microbial compositions) may be administered per se (e.g. using a catheter or syringe) or may be administered together in the feed (e.g. as a feed additive) of the animal or the drink of the animal.
These ruminants may be fed the feed additive composition of the present invention at any time and in any amount during their life. That is, the ruminant may be fed the feed additive composition of the present invention either by itself or as part of a diet which includes other feedstuffs. Moreover, the ruminant may be fed the feed additive composition of the present invention at any time during its lifetime. The ruminant may be fed the feed additive composition of the present invention continuously, at regular intervals, or intermittently. The ruminant may be fed the feed additive composition of the present invention in an amount such that it accounts for all, a majority, or a minority of the feed in the ruminant's diet for any portion of time in the animal's life. According to one embodiment, the ruminant is fed the feed additive composition of the present invention in an amount such that it accounts for a majority of the feed in the animal's diet for a significant portion of the animal's lifetime.
Examples of additional rumen active feed additives which may be provided together with the feed additive of the present invention include buffers, fermentation solubles, essential oils, surface active agents, monensin sodium, organic acids, and supplementary enzymes.
Also contemplated is encapsulation of the microbes in nanoparticles or microparticles using methods known in the art including those disclosed in EP085805, EP1742728 A1, WO2006100308 A2 and U.S. Pat. No. 8,449,916, the contents of which are incorporated by reference.
The compositions may be administered orally, rectally or any other way which is beneficial to the animal such that the microbes reach the rumen of the animal.
In another embodiment, the present invention provides novel processes for raising a ruminant by feeding the ruminant such a feed additive composition.
As used herein the term “about” refers to ±10%
The terms “comprises”, “comprising”, “includes”, “including”, “having” and their conjugates mean “including but not limited to”.
The term “consisting of” means “including and limited to”.
The term “consisting essentially of” means that the composition, method or structure may include additional ingredients, steps and/or parts, but only if the additional ingredients, steps and/or parts do not materially alter the basic and novel characteristics of the claimed composition, method or structure.
As used herein, the singular form “a”, “an” and “the” include plural references unless the context clearly dictates otherwise. For example, the term “a compound” or “at least one compound” may include a plurality of compounds, including mixtures thereof.
Throughout this application, various embodiments of this invention may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.
Whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range. The phrases “ranging/ranges between” a first indicate number and a second indicate number and “ranging/ranges from” a first indicate number “to” a second indicate number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.
As used herein the term “method” refers to manners, means, techniques and procedures for accomplishing a given task including, but not limited to, those manners, means, techniques and procedures either known to, or readily developed from known manners, means, techniques and procedures by practitioners of the chemical, pharmacological, biological, biochemical and medical arts.
It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable subcombination or as suitable in any other described embodiment of the invention. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.
Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples.
Reference is now made to the following examples, which together with the above descriptions illustrate some embodiments of the invention in a non limiting fashion.
Generally, the nomenclature used herein and the laboratory procedures utilized in the present invention include molecular, biochemical, microbiological and recombinant DNA techniques. Such techniques are thoroughly explained in the literature. See, for example, “Molecular Cloning: A laboratory Manual” Sambrook et al., (1989); “Current Protocols in Molecular Biology” Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al., “Current Protocols in Molecular Biology”, John Wiley and Sons, Baltimore, Md. (1989); Perbal, “A Practical Guide to Molecular Cloning”, John Wiley & Sons, New York (1988); Watson et al., “Recombinant DNA”, Scientific American Books, New York; Birren et al. (eds) “Genome Analysis: A Laboratory Manual Series”, Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; “Cell Biology: A Laboratory Handbook”, Volumes I-III Cellis, J. E., ed. (1994); “Culture of Animal Cells—A Manual of Basic Technique” by Freshney, Wiley-Liss, N. Y. (1994), Third Edition; “Current Protocols in Immunology” Volumes I-III Coligan J. E., ed. (1994); Stites et al. (eds), “Basic and Clinical Immunology” (8th Edition), Appleton & Lange, Norwalk, Conn. (1994); Mishell and Shiigi (eds), “Selected Methods in Cellular Immunology”, W. H. Freeman and Co., New York (1980); available immunoassays are extensively described in the patent and scientific literature, see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521; “Oligonucleotide Synthesis” Gait, M. J., ed. (1984); “Nucleic Acid Hybridization” Hames, B. D., and Higgins S. J., eds. (1985); “Transcription and Translation” Hames, B. D., and Higgins S. J., eds. (1984); “Animal Cell Culture” Freshney, R. I., ed. (1986); “Immobilized Cells and Enzymes” IRL Press, (1986); “A Practical Guide to Molecular Cloning” Perbal, B., (1984) and “Methods in Enzymology” Vol. 1-317, Academic Press; “PCR Protocols: A Guide To Methods And Applications”, Academic Press, San Diego, Calif. (1990); Marshak et al., “Strategies for Protein Purification and Characterization—A Laboratory Course Manual” CSHL Press (1996); all of which are incorporated by reference as if fully set forth herein. Other general references are provided throughout this document. The procedures therein are believed to be well known in the art and are provided for the convenience of the reader. All the information contained therein is incorporated herein by reference.
Materials and Methods
Experimental Design and Subject Details
The primary objective of this research was to relate the animal genome to the rumen microbiome, feed efficiency, and methane emissions in lactating dairy cows. The following research questions were specified at the outset: Does host genetics have a significant effect on the overall microbiome composition and to what extent? How consistent is the rumen microbiome across geographic locations, breeds and diets? On discovery of a heritable core rumen microbiome, the following additional research questions arose: Do heritable rumen microbes interact with the rest of the core rumen microbes? How do heritable microbes integrate in the overall microbe-host phenotype interaction network?
The objectives were addressed in an observational study involving collection of phenotypic data describing animal metabolism, digestion efficiency and emissions of methane and nitrogen. Samples of rumen digesta and blood were collected for molecular analysis and subsequent statistical analysis to identify correlations and genetic associations.
The final population sampled was 1016 cows to allow a small margin in case any individuals or samples had to be excluded.
Prospective inclusion criteria for animal selection were that cows must be between 10 and 40 weeks postpartum, have received the standard diet for at least 14 days, and had no health issue in the current lactation. Prospective data exclusion criteria were missing samples (e.g. milk, blood, rumen, feces), sample processing issues (e.g. inadequate DNA yield, assay problems, laboratory mishaps), and implausible outliers. Statistical outliers were defined as values greater than three standard deviations from the mean. All statistical outliers were investigated and calculations corrected or assays repeated where appropriate. Otherwise, outliers were retained for data analysis unless they were implausible. Data for any excluded sample were omitted, but the remaining data for the individual were retained.
Six milk samples were missing due to a faulty sampling device, and one blood sample was missing from a cow that could not be sampled. Two rumen fluid samples were lost during laboratory analysis. Two estimates of feed intake were considered implausible (200% of expected) due to abnormal fecal alkane values.
Animal work was conducted by four research teams in United Kingdom (UK), Italy (IT), Sweden (SE) and Finland (FI). In total, 1,016 cows on seven farms were sampled, and associated data collected. UK sampled 409 cows on two farms (UK1: N=243, and UK2: N=164); IT sampled 409 cows on three farms (IT1: N=185, IT2: N=176, and IT3: N=48); SE sampled 100 cows on one farm (SE1); and FI sampled 100 cows on one farm (FI1).
Recordings and collection of biological samples were performed over a 5-day period for each cow that had received the standard diet for at least 14 days. To reach 1,016 cows, sampling was conducted over a period of 26 months in 78 sessions with between 1 and 40 cows per session. At time of recording and sampling, all cows were in established lactation (between 10 and 40 weeks postpartum) when energy balance is close to zero and methane output is relatively stable (26).
Housing and Feeding Systems:
Cows on all farms were group-housed in loose housing barns, except in FI where cows were housed in individual standings during the sampling period. To minimize environmental variation, all cows were offered diets that were standardized within farms, i.e. all cows on a farm were fed on the same diet at any sampling period, and any changes to diet formulation when batches of forage changed were made at least 14 days before sampling commenced. Diets were based on maize silage, grass silage or grass hay, and concentrates in UK and IT, and were based on grass silage and concentrates in SE and FI. Diets were fed as ad libitum total-mixed rations (TMR) in IT, SE and FI, and as ad libitum partial-mixed rations (PMR) plus concentrates during robotic milking in UK. The PMR and TMR were delivered along feed fences in UK and IT, and TMR were delivered into individual feed bins in SE and FI.
Milk and Body Weight Recording:
Milk yield was recorded at every milking and daily mean calculated for each cow. Cows were milked twice daily in herringbone parlors in IT and SE, twice daily at their individual standings in FI, and in automatic milking stations (Lely Astronaut A3, Lely UK Ltd., St Neots, UK), on average 2.85 times per day, in UK.
Milk samples were collected from each cow at four milkings during the sampling period, preserved with broad spectrum microtabs II containing bronopol and natamycin (D & F Control Systems Inc, San Ramon Calif.) or Bronopol (Valio Ltd., Finland), and stored at 4° C. until analyzed. Milk samples were analyzed for fat, protein, lactose and urea concentrations using mid-infrared instruments (Foss Milkoscan, Foss, Denmark, or similar). Mean concentrations of milk components were calculated by weighting concentrations proportionally to respective milk yields from evening and morning milkings.
Body weight was recorded three (SE) or two (IT, FI) times during each sampling period, and automatically at each milking in UK. Mean body weight was calculated for each cow.
Feed Intake Measurement and Estimation
Feed intake was recorded individually on a daily basis throughout each sampling period using Roughage Intake Control (RIC) feeders (Insentec B. V., Marknesse, The Netherlands) in SE and manually in FI. Feed intake was estimated using indigestible markers (alkanes) in feed and feces (27) in UK and IT. Alkanes (C30 and C32) were administered via concentrates fed during milking in UK, and via a bolus gun whilst cows were restrained in locking head yokes during feeding in IT. Validation of the alkane method for estimating feed intake was provided by concurrent direct measurement of individual feed intake in 50 cows in UK via RIC feeders (Fullwood Ltd., Ellesmere, UK), and by applying the method to individually fed cows in a research herd in IT (28).
Collection of Rumen Samples
The method of sampling rumen fluid was standardized at all centers and involved using a ruminal probe specially designed for cattle (Ruminator; profs-products(dot)com). The probe comprises a perforated brass cylinder attached to a reinforced flexible pipe, a suction pump and a collection vessel. The brass cylinder was pushed gently to the back of a cow's mouth and gentle pressure applied until the device was swallowed as far as a ring on the pipe that indicates correct positioning in the rumen. The first liter of rumen fluid was discarded to avoid saliva contamination and the next 0.5 L was retained for sampling. The device was flushed thoroughly with tap water between cows.
Rumen fluid samples were collected on one day during the sampling period between 2 and 5 hours after feed was delivered to cows in the morning. For all samples, pH of rumen fluid was recorded immediately. After swirling, four aliquots of 1 ml each were pipetted into freeze resistant tubes (2 ml capacity), immediately frozen in liquid nitrogen or dry ice, stored at −80° C. and freeze dried within one month from the sampling date. Four additional aliquots of 2.5 mL were pipetted into centrifuge tubes with 0.5 mL of 25% metaphosphoric acid for VFA and ammonia-N analysis, centrifuged at 1000 g for 3 min, and supernatant was transferred to fresh tubes. Tubes were sealed and frozen at −20° C. until laboratory analysis.
Rumen Volatile Fatty Acids Measurement
Volatile fatty acid concentrations were determined by gas chromatography using the method of Playne (29). Ammonia-N concentration was determined by a photometric test with a Clinical Chemistry Autoanalyzer using an enzymatic ultraviolet method (e.g. Randox Laboratories Ltd, Crumlin, UK).
DNA Extraction
Total genomic DNA was isolated from 1 ml freeze dried rumen samples according to Yu and Morrison (30). This method combines bead beating with the column filtration steps of the QIAamp DNA Stool Mini Kit (Qiagen, Hilden, Germany).
Amplicon Sequencing
Primers for PCR amplification of bacterial and archaeal 16S rRNA genes, ciliate protozoal 18S rRNA genes and fungal ITS1 genes were designed in silico using ecoPrimers (31), the OBITools software suite (32) and a database created from sequences stored in GenBank. For each sample, PCR amplifications were performed in duplicate. An eight-nucleotide tag unique to each PCR duplicate was attached to the primer sequence, in order to enable the pooling of all PCR products for sequencing and the subsequent assignation of sequence reads to their respective samples. PCR amplicons were combined in equal volumes and purified using QIAquick PCR purification kit (Qiagen, Germany). After library preparation using a standard protocol with only five PCR cycles, amplicons were sequenced using the MiSeq technology from Illumina (Fasteris, S A, Geneva, Switzerland), which produced 250-base paired-end reads for all markers, except for the archaeal marker which was sequenced with the HiSeq technology from Illumina, generating 100-base pair-end reads.
Methane and CO2 Emission Measurement
Methane was measured using breath sampling either during milking in UK (33) or when cows visited a bait station in IT and SE (GreenFeed) (34). Methane was measured in FI by housing cows in respiration chambers for 5 days (35). Carbon dioxide was measured simultaneously with methane in IT, SE and FI.
Blood Sampling and Analysis
Blood samples were collected at the same time as rumen sampling using jugular venipuncture and collection into evacuated tubes (Vacutainer). One tube containing Lithium heparin or Na-EDTA as anticoagulant was collected for metabolic parameters, and two tubes containing sodium citrate were collected for genotyping. Tubes were gently inverted 8-10 times following collection to ensure optimal additive activity and prevent clotting. Tubes were chilled at 2-8° C. immediately after collection by placing in chilled water in a fridge or in a mixture of ice and water. Tubes collected for metabolic parameters were centrifuged for 10-15 min (3500 g at 4° C.) and the plasma obtained was divided into four aliquots. Blood samples collected for genotyping were not centrifuged. All samples were stored at −20° C. until analyzed.
Plasma non-esterified fatty acids, beta-hydroxybutyrate, glucose, albumin, cholesterol, urea and creatinine were analyzed at each center using commercial kits (Instrumentation Laboratory, Bedford, Mass., USA; Wako Chemicals GmbH, Neuss, Germany; Randox Laboratories Ltd, Crumlin, UK). Blood samples from each center were sent to IT for haptoglobulin determination, according to the method of Skinner et al. (36).
Quantitative PCR of 16S and 18S rRNA Genes
DNA was diluted to 0.1 ng/μl in 5 μg/ml herring sperm DNA for amplification with universal bacterial primers UniF (GTGSTGCAYGGYYGTCGTCA—SEQ ID NO: 1) and UniR (ACGTCRTCCMCNCCTTCCTC—SEQ ID NO: 2) (37) and 1 ng/μl in 5 μg/ml herring sperm DNA for amplification of other groups (38). Quantitative PCR was carried out using a BioRad CFX96 as described by Ramirez-Farias et al. (39). Amplification of archaeal 16S RNA genes was carried out using the primers Met630f (GGATTAGATACCCSGGTAGT—SEQ ID NO: 3) and Met803r (GTTGARTCCAATTAAACCGCA—SEQ ID NO: 4) as described by Hook et al. (40) and calibrated using DNA extracted from Methanobrevibacter smithii PS, a gift from M. P. Bryant, University of Illinois. For total bacteria amplification efficiency was evaluated using template DNA from Roseburia hominis A2-183 (DSM 16839T). Amplification of protozoal 18S rRNA gene was carried out using primers 316f (GCTTTCGWTGGTAGTGTATT—SEQ ID NO: 5) and 539r (CTTGCCCTCYAATCGTWCT—SEQ ID NO: 6) (41) and calibrated using DNA amplified from bovine rumen digesta with primers 54f and 1747r (41). Bacterial abundance was calculated from quadruplicate Ct values using the universal bacterial calibration equation.
Bovine Genotyping
From blood samples, genomic DNA was extracted and quantified for SNP genotyping. All animals were genotyped on the Bovine GGP HD (GeneSeek Genomic Profilers). The 200 cows coming from Finland and Sweden were genotyped using the Bovine GGP HD chip v1 (80K) that included 76.883 SNPs, while the 800 samples from UK and Italy were genotyped using the Bovine GGP HD chip v2 (150K) that included 138.892 SNPs, as the v1 of the chip was no longer available from the manufacturer. The v2 of the chip includes all the SNPs that were present in the previous v1 of the chip, while at the same time providing more markers for the same final processing cost. Neogen company performed the DNA hybridisation, image scanning and data acquisition of the genotyping chips according to the manufacturer's protocols (Illumina Inc.) All individuals had a call rate higher than 0.90 (93.5% of individuals with call rate higher than 0.99). More than 99% of SNPs had a call rate higher than 0.99, (93.2% of SNPs with call rate higher than 0.99). Minor allele frequency (MAF) distribution evidences more than 90% of markers with a MAF>5% and nearly 4% of monomorphic SNPs.
Quantification and Statistical Analysis
Statistical methods and software used are detailed in subsequent sections, and in figure legends and results. Statistical significance was declared at P<0.05, P<0.01 and P<0.001, as appropriate.
Utilization of Primer Sets Derived Microbiome Data in the Statistical Analysis
Associations of microbial domain richness were based on amplicon sequencing data from the following primer sets: Bact (bacteria), Arch (archaea), Neoc (fungi), Cili (protozoa). Associations of individual microbes (as species-level OTUs) were based on amplicon sequencing data from the following primer sets: ProkA (bacteria and archaea), Neoc (fungi), Cili (protozoa).
Converting OBITools Intermediate Fasta Files to QIIME Ready Format
Amplicon sequences were initially processed with OBITools (32) which removed barcodes and split each sample from each of the two sequencing rounds into an individual FASTQ file. Within each domain's amplicon sequences, individual samples sequences from both rounds were then pooled together into a single FASTQ file in the format required for further processing in QIIME (42) for picking OTU. In detail, the header of each FASTQ entry was appended with a prefix following the format [round_id] [sample_id] [running_number] [space].
Clustering of Microbial Marker Gene Amplicon Sequences and Picking Representative Denovo Species OTU
The marker gene sequences coming from each domain's primer-set (Archaea, Bacteria, Prokaryote, Ciliate, protozoa, and Fungi) were clustered using 97% nucleotide sequence similarity threshold, using the UCLUST algorithm (43), following the QIIME command: pick_otus.py-m uclust-s 0.97). Representative OTUs for each OTU cluster were chosen with QIIME command: pick_rep_set.py-m most_abundant.
Assigning Taxonomy to OTU
The OTU within each domain were assigned taxonomy using the Ribosomal Database Project (RDP) classifier (44), following QIIME command: assign_taxonomy.py-m rdp. The OTUs from the amplicon domains of Prokaryotic, Archaea and Bacteria were assigned taxonomy according to GreenGenes database (45). The OTU from Ciliate protozoa were assigned taxonomy according to SILVA database; release 123 (46). Fungal OTU were assigned taxonomy according to a Neocallimastigomycota ITS ldatabase from Koetschan (47).
Creation of OTU Tables, Samples Subsetting and Subsampling
Amplicon domain OTU tables were created from the representative OTU set counts in each sample along with their assigned taxonomy, using QIIME command: make_otu_table.py. Each OTU table was then subsetted to include only the sample from each animal (out of the two samples sequenced in two different sequencing rounds) that gained the highest sequence depth. Further on, amplicon domain OTU tables were subsampled to 7,000 reads depth for all analyses, with the following exceptions: domain richness (8,000 reads) and microbe abundance to trait association (8,000 reads) and inter-domain microbial interaction analysis, where no subsampling was taking place.
Correlating Microbial Domains Cell Count
The quantitative PCR derived microbial counts in each domain were correlated to each other using Spearman r correlation using R (48) cor function. The P-values for all inter-domain correlations within each farm were corrected using Bonferroni-Hochberg (49) procedure (BH).
Correlating Microbial Domain Cell Counts to Experimental Variables
Within each farm, each experimental variable was correlated to each microbial domain's cell count (Spearman r). Next, the analysis proceeded only with experimental variable—domain count pairs whose correlation direction was identical in all farms. Subsequently, P-values for the correlation of the selected experimental variable—domain cell count pairs from within each farm were combined by meta-analysis using the weighted sum of z procedure (50,51), weighted by the farm size. Meta-analysis was carried by using R package metap (52). Finally, combined P-values were corrected using the BH procedure.
Correlating Microbial Domain Richness to Experimental Variables
Separately within farms, each experimental variable was correlated to each microbial domain's richness, as observed species count (Spearman r), using domain specific primers. Next, the analysis proceeded only with experimental variable—domain richness pairs whose correlation direction was identical in all farms. Subsequently, P-values for the correlation of the selected experimental variable—domain richness pairs from within each farm were combined by meta-analysis using the weighted sum of z procedure, weighted by number of cows on each farm.
Meta-analysis was carried by R package metap (52). Finally, combined P-values were corrected using the BH procedure.
Prediction of Phenotypes and Other Experimental Variables by Core Microbiome
The abundances of the core microbes within each farm were used as features fed into a Ridge regression (56) in order to predict each of the traits (separately). Our approach followed a k-fold cross-validation methodology (k=10) where each fold was omitted once from the entire set and the model built from all the other folds (training set) was used to predict the trait value of the excluded samples (animal). This was implemented using the function cv.glmnet (alpha=0, k=10) from the GLMNET R package (57). Then, the overall prediction r2 was calculated using R code
1—model_fit$cvm[which(model_fit$glmnet.fit$lambda==model_fit$lambda.min)]/var(exp_covar). Cross-Validation Procedure was Repeated 1.00 Times and R2 Measurements were Averaged.
Prediction of Phenotypes by Core Microbiome while Correcting for Diet
In order to estimate the phenotypic variability explained by core microbes with omission of diet components effect, we repeated the analysis above with one difference. That is, prior to the running the regression, both phenotypic values and microbial OUT counts were corrected for diet. In detail, a Ridge regression (19) was used based on diet components as independent variables and the phenotype or OUT as the dependent variable. Thereafter, the phenotype residuals (diet predicted phenotype—actual phenotype) and OUT residuals (diet predicted OTU count—actual OTU count) were used to feed the GLMNET function (20).
Prediction of Phenotypes by Diet Components
Diet components within each farm were used as features fed into a Ridge regression (19) in order to predict each of the phenotypes (separately). Our approach followed a k-fold cross-validation methodology (k=10) where each fold was omitted once from the entire set and the model built from all the other folds (training set) was used to predict the trait value of the excluded samples (animal). This was implemented using the function cv.glmnet (alpha=0, k=10) from the GLMNET R package (20). Then, the overall prediction r2 was calculated using R code
1—model_fit$cvm[which(model_fit$glmnet.fit$lambda==model_fit$lambda.min)]/var(exp_covar). Cross-Validation Procedure was Repeated 1.00 Times and R2 Measurements were Averaged.
Prediction of Phenotypes and Other Experimental Variables by Core Microbiome Using Random Forest
As an additional analysis in order to further verify our findings of core microbiome explainability (by prediction) of host phenotypes and experimental variables, we repeated that analysis using RandomForest (RF) regression. The abundances of the core microbes within each farm were used as features fed into a RF regression model (21,22) in order to predict each of the traits (separately). Our approach followed a Leave-one-out cross-validation methodology where in each iteration one sample (animal) was omitted from the entire set and the model built from all the other animals (training set) was used to predict the trait value of the excluded sample (animal). Thereafter, the prediction R2 value between vector of actual and predicted values was calculated using R
CARET package function R2.
Bovine genotypes quality control Genotypes of the two breed types were processed independently. Genotypes were first subjected to QC filtering including 5% minor frequency allele, 5% genotype missingness and 5% individual missingness, following PLINK (54) command: plink --noweb --cow --maf 0.05 --geno 0.05 --mind 0.05. The QC for the genotypes used for association/heritability analysis (Holstein excluding Farm UK2) resulted with 5377 SNPs failed missingness, 14119 SNPs failed frequency and 48 of 635 individuals removed for low genotyping, resulting with 587 individuals and 121066 remaining.
Testing association of the global rumen prokaryotic core with host genetics Within each farm, the first 30 principal components (PCs) for core OTU were extracted (R prcomp). In addition, first genotypes PCs were extracted using R snpgdsPCA (55). Then, canonical correlation analysis (CCA) (56) was performed between the matrices of OTUs PCs and genotypes PCS, and total fraction of OTUs variance accounted for genotypes variables, through all canonical variates were calculated. This actual value was than compared to that of 1,000 random permutations, where the order of phenotypes PCs was shuffled.
Creation of Genetic Relationship Matrix
A genetic relatedness matrix (GRM) was created including all Holstein animals except Farm UK2, (57), using the command: gcta64 --make-grm-bin --make-bed --autosome-num 29 --autosome.
Heritability Estimation
For estimating OTUs heritability, the core microbes counts were quantile-normalized and were then provided to GCTA to estimate phenotypic variance explained by all SNPs with GREML method (57,58), with farms as qualitative covariates and the first five GRM PCs and diet components as quantitative covariates, following the GCTA command: gcta64--rend-pheno [phenotype_file] -mpheno [phneotype_index]--grm --autosome-num 29 -covar [farms_covars_file]--qcovar [quant_covariates_file].
Heritability Confidence Intervals Estimation
Heritability confidence intervals at 95% were estimated based on the heritability estimates and the GRM using the GRM eigenvalues and farms as covariates with the program FIESTA (59). The command used: fiesta.py --kinship_eigenvalues [GRM_eigenvalues_file] --[heritability_estimates_file] --covariates [farms_covariate_file] --confidence 0.95 --iterations 100 --output_filename [otu_file].
Bovine Genome SNPs—Microbe Association Effort
Microbial species-level OTUs phenotypes within the Holstein subset (excluding UK2 cohort that showed a different genetic makeup by genotypes PCA and ADMIXTURE ancestral background analysis) relative abundance data were transformed using quantile-normalization. Moreover, the top five genotypes top principal components (PCs) and the farm identity were used as a continuous and categorical covariate, respectively. The analysis was performed with the mixed-linear-model option (mlma) where SNP under inspection was accounted as fixed effect along with the covariates, and GRM effect as random. No association p-value surpassed the Bonferroni corrected significance threshold (9.076876e-10) for the number of phenotypes (455) and the number of SNPs included in the association analysis (121,066).
Estimating Kinship Matrix
Farm wise animal genetic kinship matrices as estimated based on genomic relatedness inferred from common single nucleotide polymorphisms (SNPs) that were filtered-in after the above quality control procedure. The tool used for the estimation was EMMAX, with the following command line: emmax-kin-intel64 -v -M 10 farm_genotypes_typed_file -o farm.hBN.kinf
Genomic Prediction
Genomic prediction was performed based on the each farm's kinship matrix. The GAPIT tool was used to predict phenotypic values, with the function GAPIT (parameters PCA.total=3, SNP.test=FALSE). creareFolds command from R caret package (53) was used to create three folds, where in each one fold observations are omitted and are predicted by the model built from the remaining two folds. R2 is estimated between the observed and predicted trait values were then correlated using caret R2 function. The process was repeated 10 times for a given trait in a given farm and mean of all measurements was then calculated.
Associating Microbes' Abundance with Experimental Variables
Separately for each farm and domain, OTUs occupying more than 10% of the animals in that farm were pairwise correlated (Spearman) to each of the experimental variables. Following that, all P-values resulted from correlation tests within a given domain and farm were subjected to multiple correction using BH procedure. Finally, an OTU that showed a significant correlation (corrected P<0.05) to a certain experimental variable in most (>3) of the farms with same r coefficient sign and no significant correlation with opposite r sign in the remaining farms, was identified as associated with that variable.
Inference of Microbial Interaction Network within Domains
Within each domain and farm, an OTU-table with subset of samples (animals) that contain a depth of at least 5,000 reads was created, followed by removal of OTU present in <50% of animals. The raw counts in the OTU table were fed into the R SpiecEasi (60) framework and edges were identified using spiec.easi function (‘mb’ method). Edges were given weights using symBeta function as suggested by the package authors. Thereafter, the resulting network was filtered to include only edges whose absolute weight was greater than 0.2. Finally, all individual farms within a certain domain were merged and edges connecting nodes (microbes) with the same taxonomic annotation were removed.
Inference of Inter-Domain Microbial Network
Within each farm, OTU from different domains were correlated to each other using Spearman correlation, followed by BH correction for all the correlations examined the farm and filtering in correlations with corrected P<0.05. Then, significant correlations were aggregated from all farms. Finally, correlations with correlation coefficient r<0.5 were removed.
Comparing Phylogenetic Relatedness of Core Prokaryotic Microbes to Random Sampling
Multiple sequence alignment between all core prokaryotic microbes was calculated using MAFFT (61,62) with default parameters. A phylogenetic tree-based distance matrix was obtained from aligned sequences using Fasttree (63,64), following the command: fasttree -nt -makematrix. Thereafter, the median phylogenetic between core microbes was calculated. Next, random sets (n=100) of OTU sequences were subjected to the same procedure. The P-value was calculated as P=(I(mcsd>mrsd)+1)/101, where mcsd represents median core phylogenetic distance and mrsd represents a vector of median phylogenetic distances calculated for the randomly sampled set.
Examining Core and Trait-Related Microbiome for Taxonomic Enrichment
The odds-ratio (O.R.) of each prokaryotic order appearing in the examined group (either core microbiome or trait-related microbiome), between the examined group and the whole prokaryotic microbiome catalog, was calculated. Next, orders showing an O.R.>1 (higher in the examined group) were filtered in. Finally, the O.R. P-value was calculated (Fisher Exact, two-tailed) and corrected using the BH procedure.
Comparing Heritable Microbes to Other Core Miocrobes' Ability to Explain Experimental Variables
In order to compare the ability of heritable microbes vs. other core microbes to explain the experimental variables, we used Ridge regression fit the heritable microbes as independent variables and the experimental variable as the predictable variable. We then contrasted this R2 value with other 1.00 R2 values achieved from random samples of non-heritable core microbes of same size (39 random microbes). Ridge regression was performed by the R glmnet package. We then compared the R2 of heritable microbes to the mean R2 of non-heritable core microbes for all the experimental variables altogether, using a paired Wilcoxon rank-sums test.
Seasonality Test:
In each farm core microbes were corrected for diet. Thereafter, the samples in the farm were partitioned into two groups, winter (fall equinox to spring equinox) and summer (spring equinox to fall equinix). Following, each microbial OTU abundance were compared using Wilcoxon rank-sums test that was used to test for difference between the abundance of the given OTU between the two seasons, followed by a multiple comparison correction using the Bonferroni method. Core microbial OTU with corrected P<0.05 in at least one farm were considered as showing a seasonal association.
Results
The study cohort consisted of 1016 animals, with 816 Holstein dairy cows from two UK and three Italian farms. Additionally, two hundred Nordic Red dairy cows were sampled from Sweden and Finland. The Holsteins received a maize silage-based diet, while the Nordic Reds received a nutritionally equivalent diet based on grass silage as forage. Animals were genotyped using common single nucleotide polymorphisms (SNPs) and measured for milk output and composition; feed intake and digestibility; plasma components; methane and CO2 emissions; and rumen microbiome based on ss rRNA gene analysis.
The abundance and richness of the bacterial, protozoal, fungal and archaeal communities were mutually dependent, and correlated to multiple host phenotypes in ways that have become widely understood, including rumen metabolites, milk production indices and plasma metabolites. In order to focus down on host-microbiome-phenotype relationships, the present inventors proceeded to investigate (i) how many and which species were common in our large animal cohorts, (ii) if a common, or core, group could be identified, (iii) if the core was influenced by the host genome, and (iv) how the core and non-core species determined phenotypic and production characteristics.
Taxonomic analysis revealed a core group of rumen microbes (512 species-level microbial operation taxonomic units (OTUs), 454 prokaryotes, 12 protozoa and 46 fungi) present in at least 50% of animals, within each of the seven farms studied. The group comprised eleven prokaryotic orders, one fungal and two protozoal orders that share some similarity with published core microbial communities (4,15). The core group was shared between Holstein and Nordic Red dairy breeds, and the results are particularly useful because they apply to the most popular and productive milking cow breed used in developed countries, the Holstein, and the smaller breed used in northern European latitudes, the Nordic Red. The results demonstrate once again, however, that this microbial community is representative of ruminants in general, especially with respect to bacterial and protozoal species. This core community is significantly enriched in Bacteroidales, Spirochetales and the WCHB1-4 order. The core microbiome consists of less than 0.25% of the overall microbial species pool (512 out of 250,000 OTUs), yet it is highly abundant, representing 30-60% of the overall microbiome. The core group is also tightly associated with the overall microbiome, as reflected by high correlation between the beta diversity metrics of the identified core microbiome and the overall microbiome across farms (R value between 0.45 and 0.7), this strengthens the notion of strong connectivity between microbes in such a metabolically complex ecosystem where multiple microbial interactions are potentially facilitated. These core microbes show highly conserved abundance rank structure across geography, breed and diet, where the species abundance order is kept almost identical across different individuals. Furthermore, core members are more closely related to each other than to non-core microbiome members, as indicated by differences in phylogenetic distances determined by ss rRNA gene tree. Thus, such relatedness between the members of the rumen core microbiome could indicate that they are sharing a set of functional traits, integral to this environment and potentially compatible with host requirements as suggested for species relatedness in other ecosystems (16). Although the rumen microbiome contains many hundreds of species, these core species generally belong to a rather narrow section of the whole bacterial phylome (17).
The core microbiome was found to be significantly correlated with host genetics as revealed by Canonical Correlation Analysis (CCA) which was calculated for each farm separately (
These mainly belonging to Bacteroidales and Clostridiales orders, but also including representatives from five other bacterial phyla and two fungi, of the genus Neocallimastix (
Table 1 summarizes all the hereditable bacteria that are associated with traits.
Table 2 summarizes all bacteria which correlated with a trait identified in this study.
Table 3 summarizes all hereditable bacteria identified in this study.
Neocallimastigales; Neocallimastigaceae;
Overall, when microbial co-occurrence networks were inferred within individual farms, it became evident that heritable microbes are significantly more connected than non-heritable microbes, consistent with the central positions of heritable microbes in the rumen co-occurrence networks (
The demonstration here of heritable, interacting microbes raises possibilities of breeding animals for particular microbiomes and thus phenotypic and production properties, on condition that the core can be shown to control these properties. Co-occurrence networks were further investigated for the core abundances relation to phenotypic outcomes.
The associations found here are hugely complex (
The present example shows that a small number of host-determined, heritable microbes make higher contribution to explaining experimental variables and host phenotypes (
The present study focused on two bovine dairy breeds, but the results are likely to be applicable to beef animals and other ruminant species. Given the high importance of diet in performance and the composition of the rumen microbiome, such programs should take special cognizance of likely feeding regimes. Within that context, following the overall predictive impact of identified trait-associated heritable microbes on production indices should result in a more efficient and more environmentally friendly ruminant livestock industry.
Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.
It is the intent of the applicant(s) that all publications, patents and patent applications referred to in this specification are to be incorporated in their entirety by reference into the specification, as if each individual publication, patent or patent application was specifically and individually noted when referenced that it is to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting. In addition, any priority document(s) of this application is/are hereby incorporated herein by reference in its/their entirety.
improvements in performance and usability. Molec. Biol. Evol. 30, 772-780 (2013).
This application is a Continuation of PCT Patent Application No. PCT/IL2020/050742 having International filing date of Jul. 2, 2020, which claims the benefit of priority under 35 USC § 119(e) of U.S. Provisional Patent Application No. 62/869,616 filed on Jul. 2, 2019. The contents of the above applications are all incorporated by reference as if fully set forth herein in their entirety.
Number | Date | Country | |
---|---|---|---|
62869616 | Jul 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/IL2020/050742 | Jul 2020 | US |
Child | 17567238 | US |