The present invention relates to a method for determining resistance to mastitis in a bovine subject comprising detecting at least one genetic marker associated with mastitis resistance. Furthermore, the present invention relates to a kit for detecting the presence or absence of at least one genetic marker associated with resistance to mastitis.
Mastitis is the inflammation of the mammary gland or udder of the cow resulting from infection or trauma and mastitis is believed to be the most economically important disease in cattle. The disease may be caused by a variety of agents. The primary cause of mastitis is the invasion of the mammary gland via the teat end by microorganisms. Mastitis may be clinical or sub-clinical, with sub-clinical infection preceding clinical manifestations. Clinical mastitis (CM) can be detected visually through observing red and swollen mammary glands i.e. red swollen udder, and through the production of clotted milk. Once detected, the milk from mastitic cows is kept separate from the vat so that it will not affect the overall milk quality. Sub-clinical mastitis is a type of mastitis characterized by high somatic cell counts (SCC), a normal or elevated body temperature, and milk samples that should test positive on culture. Thus, sub-clinical mastitis cannot be detected visually by swelling of the udder or by observation of the gland or the milk produced. Because of this, farmers do not have the option of diverting milk from sub-clinical mastitic cows. However, this milk is of poorer quality than that from non-infected cows and can thus contaminate the rest of the milk in the vat.
Mastitis can be detected by the use of somatic cell counts (SCC) in which a sample of milk from a cow is analysed for the presence of somatic cells (white blood cells). Somatic cells are part of the cow's natural defense mechanism and cell counts rise when the udder becomes infected. The number of somatic cells in a milk sample can be estimated indirectly by rolling-ball viscometer and Coulter counter.
As mastitis results in reduced quantity and quality of milk and products from milk, mastitis results in economic losses to the farmer and dairy industry. Therefore, the ability to determine the genetic basis of resistance to mastitis in a bovine is of immense economic significance to the dairy industry both in terms of daily milk production but also in breeding management, selecting for bovine subjects with resistance to mastitis. A method of genetically selecting bovine subjects with improved resistance that will yield cows less prone to mastitis would be desirable.
Many studies have attempted to detect quantitative trait loci (QTL) affecting mastitis (e.g. Schrooten et al. 2000; Boichard et al. 2003), so that the QTL information could be utilized through marker assisted selection (MAS). Most studies, so far, have identified QTL for somatic cell score (SCS), an indicator trait for clinical mastitis (CM), and not directly for CM. Although these two traits have a high genetic correlation (Lund et al. 1999), it is not known if the QTL that have been identified for SCS also affect CM. It has been shown that persistently high somatic cell count (SCC) levels are mainly a sign of subclinical mastitis which is most often caused by contagious bacteria such as Streptococcus aureus and Streptococcus agalactiae (de Haas et al., 2002). Incidences of acute clinical mastitis are more often caused by environmental bacteria such as Escherichia coli and in these infections the SCC levels increase rapidly but are soon dropping to normal level when the infection is cured. Therefore an acute infection may not be detected by high SCC levels. Another limitation of earlier studies is that the QTL were detected by linkage analysis (LA) with low precision for QTL position and, furthermore, LA associations between markers and the trait can only be used for selection within families. On the contrary, a combined linkage disequilibrium and linkage analysis (LDLA) can potentially fine-map a QTL to a chromosomal region less than 1 cM using closely linked markers (Meuwissen & Goddard 2000). The markers within the LDLA confidence interval can be used to identify haplotypes with predictive ability in the general population. These haplotypes are easier to use in MAS than the LA markers.
Once mapped, a genetic marker can be usefully applied in marker assisted selection. In the present invention genetic markers associated to clinical mastitis and/or SCS have been identified in the bovine genome, which allows for a method for determining whether a bovine subject and its off-spring will be resistant to mastitis.
It is of significant economic interest within the cattle industry to be able to select bovine subjects with increased resistance to mastitis and thereby avoid economic losses in connection with animals suffering from mastitis. The genetic predisposition for resistance to mastitis may be detected by the present invention. The present invention offers a method for determining the resistance to mastitis in a bovine subject based on genetic markers which are associated with and/or linked to resistance to mastitis.
One aspect of the present invention relates to method for determining resistance to mastitis in a bovine subject, comprising detecting in a sample from said bovine subject the presence or absence of at least one genetic marker that is associated with at least one trait indicative of mastitis resistance of said bovine subject and/or off-spring therefrom, wherein said at least one genetic marker is located in a region of the bovine genome selected from the group consisting of regions 1-61 identified in table 2, wherein said regions are delineated by the SNP markers identified in columns 3 and 5, and/or delineated by the genomic position identified in columns 4 and 6.
In another aspect, the present invention relates to a method for selecting a bovine subject for breeding purposes, said method comprising determining resistance to mastitis of said bovine subject and/or off-spring therefrom by a method of the invention, and then selecting or not selecting said bovine subject for breeding based on said determined breeding value.
A third aspect of the present invention relates to a kit for use in detecting the presence or absence in a bovine subject of at least one genetic marker associated with resistance to mastitis, comprising at least one detection member for determining a genetic marker located in a region of the bovine genome selected from the group consisting of regions 1-61 identified in table 2, wherein said regions are delineated by the SNP markers identified in columns 3 and 5, and/or delineated by the genomic position identified in columns 4 and 6.
In a fourth aspect, the invention relates to the use of the kit mentioned above for detecting the presence or absence in a bovine subject of at least one genetic marker associated with resistance to mastitis.
In a fifth aspect, the present invention relates to a method for estimating a breeding value in respect of susceptibility to mastitis in a bovine subject, comprising detecting in a sample from said bovine subject the presence or absence of at least one genetic marker that is associated with at least one trait indicative of mastitis resistance of said bovine subject and/or off-spring therefrom, wherein said at least one genetic marker is located in a region of the bovine genome selected from the group consisting of regions 1-61 of table 2, wherein said regions are delineated by the SNP markers identified in columns 3 and 5, and/or delineated by the genomic position identified in columns 4 and 6.
The present invention relates to genetic determinants of mastitis resistance in dairy cattle. The occurrence of mastitis, both clinical and sub-clinical mastitis involves substantial economic loss for the dairy industry. Therefore, it is of economic interest to identity those bovine subjects that have a genetic predisposition for mastitis resistance. Bovine subjects with such genetic predisposition are carriers of desired traits, which can be passed on to their offspring.
The term “genetic marker” refers to a variable nucleotide sequence (polymorphism) of the DNA on the bovine chromosome and distinguishes one allele from another. The variable nucleotide sequence can be identified by methods known to a person skilled in the art for example by using specific oligonucleotides in for example amplification methods and/or observation of a size difference. However, the variable nucleotide sequence may also be detected by sequencing or for example restriction fragment length polymorphism analysis, or by different hybridization techniques, such as southern blotting or array technologies using oligonucleotide probes. The variable nucleotide sequence may be represented by a deletion, an insertion, repeats, and/or a point mutation.
One type of genetic marker is a microsatellite marker, which may be located in/or coupled to a quantitative trait locus. Microsatellite markers refer to short sequences repeated after each other. In short sequences are for example one nucleotide, such as two nucleotides, for example three nucleotides, such as four nucleotides, for example five nucleotides, such as six nucleotides, for example seven nucleotides, such as eight nucleotides, for example nine nucleotides, such as ten nucleotides. However, changes sometimes occur and the number of repeats may increase or decrease. The specific definition and locus of the polymorphic microsatellite markers can be found in the USDA genetic map (Kappes et al. 1997; or by following the link to U.S. Meat Animal Research Center http://www.marc.usda.gov/genome/cattle/cattle.html). Another type of genetic marker is a single nucleotide polymorphism (SNP). In cattle, it is possible to simultaneously genotype large numbers of SNP markers using the commercially available kits, for example the bovine SNP genotyping kits provided by Illumina Inc.
It is appreciated that the genetic markers of the present invention are genetically linked to traits for mastitis resistance in a bovine subject. However, it is also understood that a number of additional genetic markers may be found in neighbouring DNA regions, and that these markers can be used to infer the identity of genetic markers associated with mastitis provided herein, when such additional genetic markers are genetically coupled to the markers provided by the present invention. Such additional genetic markers are obvious equivalents of the markers provided herein, and such markers are also within the scope of the present invention.
The term ‘Quantitative trait locus (QTL)’ is a region of DNA that is associated with a particular trait (e.g., mastitis resistance, somatic cell count, or clinical mastitis). Though not necessarily genes themselves, QTLs are regions of DNA that are closely linked to the genes that underlie the trait in question.
The term “associated with” as used herein in regards to the genetic marker allele and/or combination of genetic marker alleles and phenotypic traits, is meant to comprise both direct and indirect genetic linkages. Thus, a genetic marker allele and/or combination of genetic marker alleles which are associated with a trait according to the present invention may be coupled to said trait by direct or indirect genetic linkages. Moreover, the term “trait associated with” as used herein in regards to a specific phenotype, relates to any phenotypic traits, which to any extent contribute to said phenotype. For example, the traits somatic cell count (SCC), somatic cell score (SCS), udder conformation (which comprises several quantitative measures, such as fore udder attachment, udder depth, udder texture etc.), and diagnostic variables (such as treated cases of clinical mastitis within a specific timeframe) contribute to the overall mastitis phenotype. Thus, the “traits associated with mastitis resistance”, or “mastitis resistance phenotypic traits” comprise SCC, SCS, CM11, CM12, CM2, CM3, CM, SCC3, SCC2, SCC1, SCC and diagnostic variables, including the subindexes of any of said phenotypic traits.
The term “genetically coupled” is used herein about two genomic loci, which tend to segregate together. Thus, an SNP marker allele, which is genetically coupled to another genetic marker allele associated with a specific phenotypic trait according to the present invention, is indicative of said genetic marker, and may consequently be detected in a sample as an alternative of detecting said genetic marker associated with said phenotypic traits, for example traits associated with mastitis resistance.
It is furthermore appreciated that the nucleotide sequences of the genetic marker allele or combination of marker alleles of the present invention are genetically associated with phenotypic traits of the present invention in a bovine subject. Consequently, it is also understood that a number of genetic markers may be comprised in the nucleotide sequence of the DNA region(s) flanked by and including the genetic markers according to the method of the present invention.
The term “gene” is as used herein is meant to comprise coding regions as well as non-coding region of any genes, as well as upstream and downstream regions of the open reading frame. Thus, a genetic marker “located in a gene” may be located in exons, introns, or upstream or downstream of the open reading frame, for example in the area of 1000 nucleotides or more upstream or downstream of the open reading frame of the gene in question.
Specifically, the transcribed region of a gene is considered to be comprised in the term “gene”, and markers located in a gene, thus, includes any marker located in a transcribed region of that gene.
Linkage Disequilibrium Linkage disequilibrium (LD) reflects recombination events dating back in history and the use of LD mapping within families increases the resolution of mapping. LD exists when observed haplotypes in a population do not agree with the haplotype frequencies predicted by multiplying together the frequency of individual genetic markers in each haplotype. In this respect the term haplotype means a set of closely linked genetic markers present on one chromosome which tend to be inherited together. In order for LD mapping to be efficient the density of genetic markers needs to be compatible with the distance across which LD extends in the given population. Linkage disequilibrium reflects the extent to which different genetic markers tend to be co-inherited in a population. In cattle the level of LD is high compared to for example human, due to i.a. inbreeding and historical bottlenecks. Therefore, the identity of one genetic marker can often be inferred from the identity of alternative genetic markers, which are in LD.
The granddaughter design includes analysing data from DNA-based markers for grand sires that have been used extensively in breeding and for sons of grand sires where the sons have produced offspring. The phenotypic data that are to be used together with the DNA-marker data are derived from the daughters of the sons. Such phenotypic data could be for example milk production features, features relating to calving, meat quality, or disease. One group of daughters have inherited one allele from their father whereas a second group of daughters have inherited the other allele form their father. By comparing data from the two groups information can be gained whether a fragment of a particular chromosome is harbouring one or more genes that affect the trait in question. It may be concluded whether a QTL is present within this fragment of the chromosome. A prerequisite for performing a granddaughter design is the availability of detailed phenotypic data. In the present invention such data have been available (http://www.lr.dk/kvaeg/diverse/principles.pdf). Genes conferring quantitative traits to an individual may be found in an indirect manner by observing pieces of chromosomes that act as if one or more gene(s) is located within that piece of the chromosome. In contrast, DNA markers can be used directly to provide information of the traits passed on from parents to one or more of their off spring when a number of DNA markers on a chromosome have been determined for one or both parents and their off-spring. The markers may be used to calculate the genetic history of the chromosome linked to the DNA markers.
The term “bovine subject” refers to cattle of any breed and is meant to include both cows and bulls, whether adult or newborn animals. No particular age of the animals are denoted by this term. One example of a bovine subject is a member of the Holstein breed. In one preferred embodiment, the bovine subject is a member of the Holstein-Friesian cattle population. In one embodiment, the bovine subject is a member of the Danish and/or Swedish Holstein cattle population. In another embodiment, the bovine subject is a member of the Holstein Swartbont cattle population. In another embodiment, the bovine subject is a member of the Deutsche Holstein Schwarzbunt cattle population. In another embodiment, the bovine subject is a member of the US Holstein cattle population. In one embodiment, the bovine subject is a member of the Red and White Holstein breed. In another embodiment, the bovine subject is a member of the Deutsche Holstein Schwarzbunt cattle population.
In one embodiment, the bovine subject is a member of any family, which include members of the Holstein breed. In one preferred embodiment the bovine subject is a member of the Danish Red population. In another preferred embodiment the bovine subject is a member of the Finnish Ayrshire population. In yet another embodiment the bovine subject is a member of the Swedish Red and White population. In a further embodiment the bovine subject is a member of the Danish Holstein population. In another embodiment, the bovine subject is a member of the Swedish Red and White population. In yet another embodiment, the bovine subject is a member of the Nordic Red population. In yet another embodiment, the bovine subject is a member Nordic Holstein, Danish Jersey and Nordic Red breed
In one embodiment of the present invention, the bovine subject is selected from the group consisting of Swedish Red and White, Danish Red, Finnish Ayrshire, Holstein-Friesian, Danish Holstein and Nordic Red. In another embodiment of the present invention, the bovine subject is selected from the group consisting of Finnish Ayrshire and Swedish Red and White cattle. In another embodiment of the present invention, the bovine subject is selected from the group consisting of Finnish Ayrshire and Swedish Red and White cattle.
The term “mastitis” relates to the inflammation of the mammary gland of the udder of a cow. In the present application the term “mastitis” is used to describe both clinical mastitis and sub-clinical mastitis, which can be characterized for example by high somatic cell score (SCS).
The terms “mastitis resistance” and ‘resistance to mastitis’ are used interchangeable and relates to the fact that some bovine subjects are not as prone to mastitis as are other bovine subjects, in other words, some bovine subjects are less susceptible to mastitis than other bovine subjects. Thus, the term “resistance” as used herein, refers to any level of reduction in mastitis, ranging from a minute reduction of 0.5% or less to complete absence of mastitis, i.e. complete resistance. When performing analyses of a number of bovine subjects as in the present invention in order to determine genetic markers that are associated with resistance to mastitis, the traits implying resistance to mastitis may be observed by the presence or absence of genetic markers linked to occurrence of clinical mastitis and/or sub-clinical mastitis in the bovine subjects analyzed. It is understood that mastitis resistance comprise resistance to traits, which affect udder health in the bovine subject or its off-spring. Thus, mastitis resistance of a bull is physically manifested by its female off-spring.
Mastitis resistance is inversely correlated with susceptibility to mastitis, i.e. a bovine subject with high mastitis resistance has low susceptibility to mastitis. Thus, the term “susceptible to mastitis” as used herein is meant to indicate that a bovine subject has a relatively higher likelihood of suffering from mastitis, or having a trait indicative of mastitis.
Daughters of bulls can be scored for mastitis resistance on the basis of a number of different quantitative and qualitative parameters. Specifically, mastitis resistance may be observed according to the present invention on the basis of specific traits, which are indicative of mastitis resistance. One such trait indicative of mastitis resistance in a population of cattle is recorded cases of clinical mastitis. Other examples of traits are somatic cell count (SCC), or somatic cell score (SCS), which is defined as the mean of log10 transformed somatic cell count values (in 10,000/mL) obtained from the milk recording scheme. The mean is for example taken over the period 10 to 180 days after calving. Estimated breeding values (EBV) for traits of sons may be calculated using a single trait Best Linear Unbiased Prediction (BLUP) animal model ignoring family structure. Examples of specific quantitative traits indicative of mastitis resistance are provided in the table below:
In one embodiment of the present invention, the methods and kits described herein relates to mastitis resistance, such as resistance to clinical mastitis and/or resistance to sub-clinical mastitis, such as detected by somatic cell counts or SCS. More specifically, the methods and kits of the invention relates in one embodiment to genetic markers associated with at least one trait indicative of mastitis, such trait in a preferred embodiment being selected from CM11 (Clinical mastitis (1) or not (0) between −15 and 50 days after 1st calving), CM12 (Clinical mastitis (1) or not (0) between 51 and 305 days after 1st calving), CM2 (Clinical mastitis (1) or not (0) between −15 and 305 days after 2nd calving), CM3 (Clinical mastitis (1) or not (0) between −15 and 305 days after 3rd calving), CM (Clinical mastitis: 0.25*CM11+0.25*CM12+0.3*CM2+0.2*CM3), SCC1 (Log. somatic cell count average in 1st lactation), SCC2 (Log. somatic cell count average in 2nd lactation), SCC3 (Log. somatic cell count average in 3rd lactation) and SCC (Log somatic cell count: 0.5*SCC1+0.3*SCC2+0.2*SCC3). In a preferred embodiment, the trait is clinical mastitis, for example any trait selected from CM11, CM12, CM2, CM3 or CM. As specified in table 1, CM is an index for clinical mastitis based on CM11, CM12, CM2 and CM3.
In yet another embodiment, the method and kit of the present invention primarily relates to resistance to clinical mastitis in combination with resistance to sub-clinical mastitis such as detected by somatic cell counts or SCS, for example SCC1, SCC2, SCC3 or SCC. The methods and kits of the present invention comprise detecting the presence or absence of at least one genetic marker that is associated with at least one trait indicative of mastitis resistance of a bovine subject or off-spring therefrom, wherein said at least one trait is selected from somatic cell count (SCC), somatic cell score (SCS) and/or clinical mastitis.
In general, increased levels of SCS are indicative of mastitis, e.g. subclinical mastitis. The level of SCC may be increased compared to previous measures for the same bovine subject, or compared to an average SCC for the given population, breed, or family. The SCS level may be measured at any time, and may be separate measures or a mean value over one lactation period. For example, an SCC level above 100.000 cells/ml milk, such as above 200.000, for example above 300.000 cells/ml milk, such as above 400.000, for example above 500.000 cells/ml milk, such as above 600.000, cell/ml milk is indicative of mastitis, such as clinical or subclinical mastitis. Therefore, SCC levels of such magnitudes are considered as traits indicative of reduced susceptibility to mastitis according to the present invention. However, the level of SCC indicative of mastitis resistance or susceptibility to mastitis may vary for different bovine subjects, breeds and families.
The present invention can be used to estimate breeding values in respect of mastitis resistance or susceptibility to mastitis. True breeding value is the genetic merit of an individual which can be conceptually defined as twice the average deviation of its offspring from the population mean when mated randomly to an infinite population. It is an estimate of the ability of an individual to produce superior offspring. True breeding values are not known but can be estimated from the animals own performance and/or the performance of its offspring and/or other relatives. In addition to, or instead of, phenotypic performance, information about animals genotypes at certain genes or markers associated with the trait of interest can be used in breeding value estimation procedures. Use of such information can increase the reliability of the breeding values and make, for example, selection possible at a younger age. In one embodiment, the at least on genetic marker indicative of mastitis resistance is used to estimate the breeding value of a bovine subject.
The trait indicative of mastitis resistance may be recalculated into a breeding value for every bovine subject, for example every sire. Thus, the genetic markers of the methods and kits of the present invention may be used for selection of bovine subjects with increased breeding values, and detection of at least on genetic marker indicative of mastitis resistance according to the present invention is indicative of an increased breeding value of the bovine subject. For example the breeding value is increased by at least 0.5%, such as at least 1%, such as at least 2, 3, 4, 5, 6, 7, 8, 9, for example at least 10%.
The method according to the present invention includes analyzing a sample of a bovine subject, wherein said sample may be any suitable sample capable of providing the bovine genetic material for use in the method. The type of sample is not important, as long as the sample comprise genetic material specific for the bovine subject, which is analysed. Thus, any sample comprising genetic material from the bovine subject can be used. Preferably, the sample is a sample, which is easily obtained from the bovine subject, preferably a sample, which can be obtained without any invasive procedures.
Thus, mastitis resistance is determined by detecting the absence or presence of a genetic marker allele in a sample of any source comprising genetic material. The bovine genetic material may for example be extracted, isolated and/or purified if necessary. The samples may be fresh or frozen. Detection of a genetic marker may be performed on samples selected from the group consisting of blood, semen (sperm), urine, liver tissue, muscle, skin, hair, follicles, ear, tail, fat, testicular tissue, lung tissue, saliva, spinal cord biopsy and/or any other tissue.
In preferred embodiments the sample is selected from the group consisting of semen (sperm), blood, urine, skin, hair, ear, tail, and muscle. In another preferred embodiment the sample is selected from the group consisting of blood. In particularly preferred embodiments the sample is milk. In another particularly preferred embodiment the sample is skin tissue. In yet another particularly preferred embodiment the sample is muscle. In a most preferred embodiment the sample is semen (sperm).
For microsatellite or SNP genotyping, nucleic acid may be extracted from the samples by a variety of techniques. For example Genomic DNA may be isolated from the sample by treatment with proteinase K followed by extraction with phenol (see e.g. Sambrook et al. 1989). However, the sample may also be used directly.
The amount of the nucleic acid used for microsatellite or SNP genotyping for detection of a genetic marker according to the method of the present invention is in the range of nanograms to micrograms. It is appreciated by the person skilled in the art that in practical terms no upper limit for the amount of nucleic acid to be analysed exists. The problem that the skilled person encounters is that the amount of sample to be analysed is limited. Therefore, it is beneficial that the method of the present invention can be performed on a small amount of sample and thus a limited amount of nucleic acid in the sample is required. The amount of the nucleic acid to be analysed is thus at least 1 ng, such as at least 10 ng, for example at least 25 ng, such as at least 50 ng, for example at least 75 ng, such as at least 100 ng, for example at least 125 ng, such as at least 150 ng, for example at least 200 ng, such as at least 225 ng, for example at least 250 ng, such as at least 275 ng, for example at least 300 ng, 400 ng, for example at least 500 ng, such as at least 600 ng, for example at least 700 ng, such as at least 800, ng, for example at least 900 ng or such as at least 1000 ng.
In one preferred embodiment the amount of nucleic acid as the starting material for the method of the present invention is 20-50 ng. In a specifically preferred embodiment, the starting material for the method of the present invention is at 30-40 ng.
BTA is short for Bos taurus autosome.
One aspect of the present invention relates to a method for determining resistance to mastitis in a bovine subject, comprising detecting in a sample from said bovine subject the presence or absence of at least one genetic marker that is associated with at least one trait indicative of mastitis resistance of said bovine subject and/or off-spring therefrom, wherein said at least one genetic marker is located in a genetic region of the bovine genome selected from region 1-61, as specified in table 2.
In one embodiment, the genetic marker of the invention is selected from the group of markers set forth in table 2, column 9 or 10.
In another embodiment, the genetic marker is selected from the group consisting of the SNPs set forth in tables 10, 12, 13, 15, 16, 18, 19, 21, 23 and 24, cf. the examples herein below.
In another embodiment, the genetic marker is located in a gene selected from the group consisting of the genes set forth in tables 11, 14, 17, 20, 22 and 25, cf. the examples herein below.
In another embodiment, the genetic markers is selected from the group consisting of ss86284888, rs41649041, ss61565956, ss86341106, ss86317725, ss86328358, rs41812941, ss86327354, and rs41940571 (cf. table 3).
In another embodiment, the genetic markers is selected from the group consisting of ss86328743, rs41618669, ss86284888, rs41580905, rs41649041, rs43706944, rs42189699, rs42553026, rs41664497, rs41664497, ss86290235, ss86340493, ss86305923, ss86330005, ss86340725, rs29015635, rs42895750, ss117968104, rs29017739, rs29001782, rs41588957, ss86307579, ss86317213, rs41610991, ss117968170, ss117968764, ss117968030, ss117968525, rs29019575, ss117968738, ss86326721, ss86341106, ss86341106, rs29010419, rs29022799, ss86278591, ss86337596, rs43338539, ss86296213, rs42766480, rs41617692, ss117963883, rs43475842, rs29019286, ss86292503, ss86317725, ss86290731, ss86332750, ss86335834, ss86340346, ss105239139, ss117971362, ss86287919, ss86329615, ss86301882, ss86328358, ss117971370, ss117971325, ss86339873, ss117971671, ss117971176, rs41807595, rs41807595, rs29023167, ss86303613, ss86283374, ss86328473, ss86307986, rs41603818, rs41812941, ss105262977, ss105262977, rs42465037, ss86327354, ss86327432, ss61484557, rs42329877, ss86333005, ss86306906, ss117972835, rs41938511, rs42542144, rs41940571, rs41947330, rs29018751, rs41581087, ss105263178, rs41641052, rs41641055, ss86292111, rs41600165 and ss86306865 (cf. table 4).
Due to linkage disequilibrium as described herein, the present invention also relates to methods for determining the resistance to mastitis in a bovine subject, wherein the at least one genetic marker is linked or genetically coupled to genetic determinants of a bovine trait for resistance to mastitis. In order to determine resistance to mastitis in a bovine subject, it is appreciated that more than one genetic marker may be employed in the present invention. For example the at least one genetic marker may be a combination of at least two or more genetic markers such that the accuracy may be increased, such as at least three genetic markers, for example four genetic markers, such as at least five genetic markers, for example six genetic markers, such as at least seven genetic markers, for example eight genetic markers, such as at least nine genetic markers, for example ten genetic markers.
The at least one genetic marker may be located on at least one bovine chromosome, such as two chromosomes, for example three chromosomes, such as four chromosomes, for example five chromosomes, and/or such as six chromosomes. Thus, the at least one genetic marker may be a combination of markers located on different chromosomes. The at least one genetic marker is selected from any of the individual markers of the tables shown herein below.
In one embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA1 in a region delineated by BovineHD Genotyping BeadChip SNP#19479 and SNP#19481 and/or in a region between base nos. 76096755 and 76099500, for example, the marker is BovineHD0100021877 or is BovineHD Genotyping BeadChip SNP#76096755, or is linked to any of said markers
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA3 in a region delineated by BovineHD Genotyping BeadChip SNP#23488 and SNP#25665 and/or in a region between base nos. 92199528 and 101364920, for example, the marker is BovineHD0300028997 or is BovineHD Genotyping BeadChip SNP#101323866, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA5 in a region delineated by BovineHD Genotyping BeadChip SNP#20435 and SNP#27159 and/or in a region between base nos. 84539347 and 109948232, for example, the marker is BovineHD0500024659 or is BovineHD Genotyping BeadChip SNP#86998734, or is linked to any of said markers.
In one embodiment, the genetic marker is located on the bovine chromosome BTA5 in a region between 84-95 Mb, for example the marker is Chr5—92753829 and/or the trait is mastitis resistance, such as CM11. In one embodiment, the genetic marker is selected from the group consisting of Chr5—92753829, BovineHD0500024659, Chr5—87360522, BovineHD0500026657, Chr5—92753829, Chr5—87360522, Chr5—94040670, Chr5—89528205 and Chr5—87360522 (cf. table 10), and/or the genetic marker allele associated with increased mastitis resistance, and/or the specific trait is as indicated in table 10.
In one embodiment, the genetic marker is located in a gene selected from the group consisting of ENSBTAG00000022360, ENSBTAG00000005833, ENSBTAG00000001673, ENSBTAG00000013202, ENSBTAG00000047048, ENSBTAG00000046178, ENSBTAG00000020715, ENSBTAG00000030493, ENSBTAG00000013541, ENSBTAG00000008541 and ENSBTAG00000009444, cf. table 11.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA6 in a region delineated by BovineHD Genotyping BeadChip SNP#18708 and SNP#26792 and/or in a region between base nos. 71082832 and 102757841, for example, the marker is BovineHD0600024355 or is BovineHD Genotyping BeadChip SNP#88919352, or is linked to any of said markers.
However, in a particularly preferred embodiment, the at least one genetic marker is located on the bovine chromosome BTA6 in a region between base nos. 88000560 and 95999980. In a specifically preferred embodiment, the at least one genetic marker is BovineHD0600024355 located at 88,919,352 Bp on BTA6. In one embodiment, BovineHD0600024355 is a genetic marker associated with clinical mastitis, such as CM11.
For example, the at least one genetic marker is located in the region between base nos. 89,052,210 and 89,059,348 on BTA6. Thus, in one preferred embodiment, the genetic marker associated with at least one trait indicative of mastitis, such as clinical mastitis, for example CM11, is located in the neuropeptide FF receptor 2 (NPFFR2) gene, in particular in the coding region of NPFFR2. In one embodiment, the genetic marker associated with mastitis is the chr6—89059253 SNP, which is located at 89,059,253 Bp on BTA6. This SNP is a G-A substitution. However, as alternative SNPs located within the NPFFR2 gene are strongly coupled to the chr6—89059253 SNP, any genetic marker polymorphism located in the NPFFR2 gene is associated with a trait indicative of mastitis. Thus, the present invention relates to methods of determining mastitis and/or a breeding value as well as methods for selected cattle for breeding, and kits, wherein the at least one genetic marker is located in the NPFFR2 gene or is genetically coupled to the NPFFR2 gene, and in one preferred embodiment, the at least one genetic marker is the chr6—89059253 SNP and/or any genetic marker polymorphism genetically coupled thereto. Thus, in one embodiment, the genetic marker is the G/A SNP located at 89,059,253 Bp (UMD3.1), wherein the A allele is associated with mastitis and the G allele is associated with resistance to mastitis.
In one embodiment, the genetic marker is located on the bovine chromosome BTA6 in a region between 88-96 Mb, for example the marker is Chr6—88977023 and/or the trait is mastitis resistance, such as CM11. In one embodiment, the genetic marker is selected from the group consisting of Chr6—88977023, Chr6—88612186, Chr6—88610743, Chr6—88977023, Chr6—88977023, Chr6—88326504, Chr6—88326504, Chr6—88326504 and Chr6—88326504 (cf. table 12), and/or the genetic marker allele associated with increased mastitis resistance, and/or the specific trait is as indicated in table 12. In one embodiment, the marker is Chr6—89059253 and the allele associated with mastitis resistance is the G-allele.
In one embodiment, the genetic marker is located in a gene selected from the group consisting of ENSBTAG00000018531, ENSBTAG00000009310, ENSBTAG00000016795, ENSBTAG00000008577, ENSBTAG00000016290, ENSBTAG00000012397, ENSBTAG00000002348, ENSBTAG00000013718, ENSBTAG00000009070 and ENSBTAG00000006507, cf. table 14.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA7 in a region delineated by BovineHD Genotyping BeadChip SNP#2907 and SNP#4789 and/or in a region between base nos. 14485587 and 22681472, for example, the marker is BovineHD0700005054 or is BovineHD Genotyping BeadChip SNP#18032163, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA7 in a region delineated by BovineHD Genotyping BeadChip SNP#7174 and SNP#10157 and/or in a region between base nos. 31432538 and 41607314, for example, the marker is BovineHD4100005904 or is BovineHD Genotyping BeadChip SNP#33485418, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA12 in a region delineated by BovineHD Genotyping BeadChip SNP#787 and SNP#933 and/or in a region between base nos. 2569573 and 2991581, for example, the marker is BovineHD1200000926 or is BovineHD Genotyping BeadChip SNP#2917822, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA12 in a region delineated by BovineHD Genotyping BeadChip SNP#3217 and SNP#7626 and/or in a region between base nos. 11578657 and 27097379, for example, the marker is BovineHD1200006858 or is BovineHD Genotyping BeadChip SNP#22865273, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA12 in a region delineated by BovineHD Genotyping BeadChip SNP#15918 and SNP#17398 and/or in a region between base nos. 62561736 and 68494212, for example, the marker is BovineHD1200017277 or is BovineHD Genotyping BeadChip SNP#63068164, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA13 in a region delineated by BovineHD Genotyping BeadChip SNP#11798 and SNP#15089 and/or in a region between base nos. 53471793 and 70173150, for example, the marker is BovineHD1300017074 or is BovineHD Genotyping BeadChip SNP#59588546, or is linked to any of said markers.
In one embodiment, the genetic marker is located on the bovine chromosome BTA13 in a region between 57-63 Mb, for example the marker is Chr13—57608628 and/or the trait is mastitis resistance, such as CM. In one embodiment, the genetic marker is selected from the group consisting of Chr13—57608336, Chr13—57608354, Chr13—59584651, Chr13—59584651, Chr13—57608628, Chr13—57608354, Chr13—60621602, Chr13—60621602 and Chr13—60621602 (cf. table 15), and/or the genetic marker allele associated with increased mastitis resistance, and/or the specific trait is as indicated in table 15. In one embodiment, the marker is Chr13—57579568 and the allele associated with mastitis resistance is the T-allele, and/or the marker is Chr13—57579569 and the allele associated with mastitis resistance is the G-allele.
In one embodiment, the genetic marker is located in a gene selected from the group consisting of ENSBTAG00000020261, ENSBTAG00000012109, ENSBTAG00000018053, ENSBTAG00000018418, ENSBTAG00000013330, ENSBTAG00000048288, ENSBTAG00000003364, ENSBTAG00000048009, ENSBTAG00000027384, ENSBTAG00000027383, ENSBTAG00000020555, ENSBTAG00000031254, ENSBTAG00000016169, ENSBTAG00000016348, ENSBTAG00000019200, ENSBTAG00000010112, ENSBTAG00000038687 and ENSBTAG00000038412, cf. table 17.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA16 in a region delineated by BovineHD Genotyping BeadChip SNP#5299 and SNP#16175 and/or in a region between base nos. 21799660 and 64955150, for example, the marker is BovineHD1600014622 or is BovineHD Genotyping BeadChip SNP#52924145, or is linked to any of said markers.
In one embodiment, the genetic marker is located on the bovine chromosome BTA16 in a region between 48-55 Mb, for example the marker is Chr16—50529178 and/or the trait is mastitis resistance, such as CM11. In one embodiment, the genetic marker is selected from the group consisting of Chr16—50529178, Chr16—49054912, Chr16—49054912, Chr16—54246279, Chr16—50532600, Chr16—52097973, Chr16—53806663, Chr16—53806663 and Chr16—53998150 (cf. table 18), and/or the genetic marker allele associated with increased mastitis resistance, and/or the specific trait is as indicated in table 18. In one embodiment, the marker is Chr16—50529178 and the allele associated with mastitis resistance is the A-allele, and/or the marker is Chr16—50564280 and the allele associated with mastitis resistance is the T-allele.
In one embodiment, the genetic marker is located in a gene selected from the group consisting of ENSBTAG00000024663, ENSBTAG00000016057, ENSBTAG00000010732, ENSBTAG00000015635, ENSBTAG00000015632, ENSBTAG00000014707, ENSBTAG00000014537 and ENSBTAG00000037523, cf. table 20.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA18 in a region delineated by BovineHD Genotyping BeadChip SNP#11892 and SNP#13902 and/or in a region between base nos. 41653211 and 48570545, for example, the marker is BovineHD1800013234 or is BovineHD Genotyping BeadChip SNP#44778431, or is linked to any of said markers.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA19 in a region delineated by BovineHD Genotyping BeadChip SNP#12750 and SNP#16762 and/or in a region between base nos. 49013784 and 62339802, for example, the marker is BovineHD1900015719 or is BovineHD Genotyping BeadChip SNP#55615219, or is linked to any of said markers.
In one embodiment, the genetic marker is located on the bovine chromosome BTA19 in a region between 55-58 Mb, for example the marker is Chr19—55296191 and/or the trait is mastitis resistance, such as SCS3. In one embodiment, the genetic marker is selected from the group consisting of Chr19—57164311, Chr19—55461224, BovineHD1900015719, Chr19—57418222, BovineHD1900015719, Chr19—55296191, Chr19—55296191, Chr19—55296191 and Chr19—55296191 (cf. table 21), and/or the genetic marker allele associated with increased mastitis resistance, and/or the specific trait is as indicated in table 21.
In one embodiment, the genetic marker is located in a gene selected from the group consisting of ENSBTAG00000013677, ENSBTAG00000005104 and ENSBTAG00000044443; cf. table 22.
In another embodiment of the invention the at least one genetic marker is located on the bovine chromosome BTA20 in a region delineated by BovineHD Genotyping BeadChip SNP#7852 and SNP#14407 and/or in a region between base nos. 28291423 and 55744850, for example, the marker is BovineHD2000010279 or is BovineHD Genotyping BeadChip SNP#35981673, or is linked to any of said markers.
In one embodiment, the genetic marker is located on the bovine chromosome BTA20 in a region between 32-40 Mb, for example the marker is Chr20—35965955 and/or the trait is mastitis resistance, such as CM2. In one embodiment, the genetic marker is selected from the group consisting of Chr20—34269660, Chr20—35965955, Chr20—35965955, Chr20—35914181, Chr20—35965955, Chr20—35969130, Chr20—35865606, Chr20—35914086 and Chr20—35543794 (cf. table 23), and/or the genetic marker allele associated with increased mastitis resistance, and/or the specific trait is as indicated in table 23. In one embodiment, the marker is Chr20—35965955 and the allele associated with mastitis resistance is the A-allele.
In one embodiment, the genetic marker is located in a gene selected from the group consisting of ENSBTAG00000010423, ENSBTAG00000014972, ENSBTAG00000016149, ENSBTAG00000006697, ENSBTAG00000033107, ENSBTAG00000011766 and ENSBTAG00000014177, cf. table 25.
In one specific embodiment, the at least one genetic marker is located in the Caspase recruitment domain-containing protein 6 gene (CARD6) on BTA20. Thus, in one preferred embodiment, the genetic marker associated with at least one trait indicative of mastitis, such as clinical mastitis, for example CM11, is located in the CARD6 gene, in particular in the coding region of NPFFR2. In one embodiment, the genetic marker associated with one or more mastitis traits is the rs133218364 SNP, which is located in the CARD6 gene on BTA20; cf. SEQ ID NO: 2. This SNP is a T-C substitution. However, as alternative SNPs located within the CARD6 gene are strongly coupled to the rs133218364 SNP, any genetic marker polymorphism located in the CARD6 gene is associated with a trait indicative of mastitis. Thus, the present invention relates to methods of determining mastitis and/or a breeding value as well as methods for selected cattle for breeding, and kits, wherein the at least one genetic marker is located in the CARD6 gene or is genetically coupled to the CARD6 gene, and in one preferred embodiment, the at least one genetic marker is the rs133218364 SNP and/or any genetic marker polymorphism genetically coupled thereto. Thus, in one embodiment, the genetic marker is the T/C SNP located in the CARD6 gene, wherein the T allele is associated with mastitis and the C allele is associated with resistance to mastitis.
In another specific embodiment, the at least one genetic marker is located in the Leukemia inhibitory factor receptor gene (LIFR) on BTA20, or the flanking sequences thereof, such as 5000 bp upstream or downstream of the LIFR gene. In one preferred embodiment, the genetic marker associated with at least one trait indicative of mastitis, such as clinical mastitis, for example CM11, is located in the LIFR gene or the flanking sequences, in particular within 5000 bp downstream of the LIFR gene coding region. In one embodiment, the genetic marker associated with one or more mastitis traits is the rs133596506 SNP, which is located 3323 bp downstream of the LIFR gene on BTA20; cf. SEQ ID NO: 3. This SNP is a T-C substitution. However, as alternative SNPs located within the LIFR gene and its flanking regions are strongly coupled to the rs133596506 SNP, any genetic marker polymorphism located in the LIFR gene and its flanking regions is associated with a trait indicative of mastitis. Thus, the present invention relates to methods of determining mastitis and/or a breeding value as well as methods for selected cattle for breeding, and kits, wherein the at least one genetic marker is located in the LIFR gene or its flanking regions is genetically coupled to the LIFR gene, and in one preferred embodiment, the at least one genetic marker is the rs133596506 SNP and/or any genetic marker polymorphism genetically coupled thereto. Thus, in one embodiment, the genetic marker is the T/C SNP located in the LIFR gene or its flanking regions, wherein the C allele is associated with mastitis and the T allele is associated with resistance to mastitis.
The method according to the present invention for determining mastitis resistance of a bovine subject comprises detecting in a sample from said bovine subject the presence or absence of at least one genetic marker allele that is associated with at least one trait indicative of mastitis resistance of said bovine subject and/or off-spring therefrom. Specific genetic markers associated with mastitis resistance are provided elsewhere herein. The genetic markers, including microsatellite markers and/or SNPs, or a complementary sequence as well as transcriptional (mRNA) and translational products (polypeptides, proteins) therefrom may be identified by any method known to those of skill within the art.
It will be apparent to the person skilled in the art that there are a large number of analytical procedures which may be used to detect the presence or absence of variant nucleotides at one or more of positions mentioned herein in the specified region. Mutations or polymorphisms within or flanking the specified region can be detected by utilizing a number of techniques. Nucleic acid from any nucleated cell can be used as the starting point for such assay techniques, and may be isolated according to standard nucleic acid preparation procedures that are well known to those of skill in the art. In general, the detection of allelic variation requires a mutation discrimination technique, optionally an amplification reaction and a signal generation system.
A number of mutation detection techniques are listed below. Some of the methods listed are based on the polymerase chain reaction (PCR), wherein the method according to the present invention includes a step for amplification of the nucleotide sequence of interest in the presence of primers based on the nucleotide sequence of the variable nucleotide sequence. The methods may be used in combination with a number of signal generation systems, a selection of which is listed further below.
Further amplification techniques are found elsewhere herein. Many current methods for the detection of allelic variation are reviewed by Nollau et al., Clin. Chem. 43, 1114-1120, 1997; and in standard textbooks, for example “Laboratory Protocols for Mutation Detection”, Ed. by U. Landegren, Oxford University Press, 1996 and “PCR”, 2nd Edition by Newton & Graham, BIOS Scientific Publishers Limited, 1997.
The detection of genetic markers can according to one embodiment of the present invention be achieved by a number of techniques known to the skilled person, including typing of microsatellites or short tandem repeats (STR), restriction fragment length polymorphisms (RFLP), detection of deletions or insertions, random amplified polymorphic DNA (RAPIDs) or the typing of single nucleotide polymorphisms by methods such as restriction fragment length polymerase chain reaction, allele-specific oligomer hybridisation, oligomer-specific ligation assays, hybridisation with PNA or locked nucleic acids (LNA) probes.
In one embodiment, the methods of the invention comprise amplifying a genetic region comprised in the sample provided from the bovine subject. Thus, specific methods may include amplifying a genetic region comprising a genetic marker of the invention, and detecting that amplification product.
In another preferred embodiment, the genetic marker is detected by DNA array methods. It is, for example, possible to genotype large numbers of SNP markers simultaneously using commercially available SNP genotyping kits. Such kits are for example the bovineSNP50 beadchip SNP kit provided by Illumina Inc., and the BovineHD BeadChip from Illumina Inc. Both of these kits are preferred for SNP genotyping according to the present invention.
A primer of the present invention is a nucleic acid molecule sufficiently complementary to the sequence on which it is based and of sufficiently length to selectively hybridise to the corresponding region of a nucleic acid molecule intended to be amplified. The primer is able to prime the synthesis of the corresponding region of the intended nucleic acid molecule in the methods described above. Similarly, a probe of the present invention is a molecule for example a nucleic acid molecule of sufficient length and sufficiently complementary to the nucleic acid sequence of interest which selectively binds to the nucleic acid sequence of interest under high or low stringency conditions. The genetic marker associated with mastitis resistance according to the present invention can be detected by a number of methods known to those of skill within the art. For example, the genetic marker may be identified by genotyping using a method selected from the group consisting of single nucleotide polymorphisms (SNPs), microsatellite markers, restriction fragment length polymorphisms (RFLPs), DNA chips, amplified fragment length polymorphisms (AFLPs), randomly amplified polymorphic sequences (RAPDs), sequence characterised amplified regions (SCARs), cleaved amplified polymorphic sequences (CAPSs), nucleic acid sequencing, and microsatellite genotyping.
In a preferred embodiment, the genetic markers associated with mastitis resistance traits as disclosed in the present invention is detected by SNP or microsatellite genotyping. SNP or microsatellite genotyping may be performed by amplification of the SNP or microsatellite marker by sequence specific oligonucleotide primers, and subsequent analysis of the amplification product, in terms of for example length, quantity and/or sequence of the amplification product.
Specifically, the at least one genetic marker according to the present invention may be detected by use of at least one oligonucleotide comprising between 5 and 100 consecutive nucleotides, such as between 10 and 30 consecutive nucleotides, or at least 5, such as 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or at least 25 consecutive nucleotides of the NPFFR2 gene, such as SEQ ID NO: 1, or a nucleic acid sequence at least 70% identical thereto, such as at least 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, such as at least 99% thereto.
In one embodiment of the methods and kits of the present invention, the genetic marker is detected by using an oligonucleotide primer or probe capable of recognizing at least one SNP selected from the group of SNPs set forth in column 10 of table 2. The oligonucleotide may be used as a primer in a nucleic acid amplification reaction and/or the oligonucleotide may be used as a probe in a hybridization detection technique.
The primers of the present invention may be used individually or in combination with one or more primers or primer pairs, such as any primer of the present invention.
The design of such primers or probes will be apparent to the molecular biologist of ordinary skill. Such primers are of any convenient length such as up to 50 bases, up to 40 bases, more conveniently up to 30 bases in length, such as for example 8-25 or 8−15 bases in length. In general such primers will comprise base sequences entirely complementary to the corresponding wild type or variant locus in the region. However, if required one or more mismatches may be introduced, provided that the discriminatory power of the oligonucleotide probe is not unduly affected. The primers/probes of the invention may carry one or more labels to facilitate detection.
In one embodiment, the primers and/or probes are capable of hybridizing to and/or amplifying a subsequence hybridizing to a single nucleotide polymorphism containing the sequence delineated by the markers as shown herein.
The primer nucleotide sequences of the invention further include: (a) any nucleotide sequence that hybridizes to a nucleic acid molecule comprising a genetic marker sequence or its complementary sequence or RNA products under stringent conditions, e.g., hybridization to filter-bound DNA in 6× sodium chloride/sodium citrate (SSC) at about 45° C. followed by one or more washes in 0.2×SSC/0.1% Sodium Dodecyl Sulfate (SDS) at about 50-65° C., or (b) under highly stringent conditions, e.g., hybridization to filter-bound nucleic acid in 6×SSC at about 45° C. followed by one or more washes in 0.1×SSC/0.2% SDS at about 68° C., or under other hybridization conditions which are apparent to those of skill in the art (see, for example, Ausubel F. M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green Publishing Associates, Inc., and John Wiley & sons, Inc., New York, at pp. 6.3.1-6.3.6 and 2.10.3). Preferably the nucleic acid molecule that hybridizes to the nucleotide sequence of (a) and (b), above, is one that comprises the complement of a nucleic acid molecule of the genomic DNA comprising the genetic marker sequence or a complementary sequence or RNA product thereof.
Among the nucleic acid molecules of the invention are deoxyoligonucleotides (“oligos”) which hybridize under highly stringent or stringent conditions to the nucleic acid molecules described above. In general, for probes between 14 and 70 nucleotides in length the melting temperature (TM) is calculated using the formula:
Tm(° C.)=81.5+16.6(log [monovalent cations(molar)])+0.41(% G+C)−(500/N)
where N is the length of the probe. If the hybridization is carried out in a solution containing formamide, the melting temperature is calculated using the equation Tm(° C.)=81.5+16.6(log [monovalent cations (molar)])+0.41(% G+C)−(0.61% formamide)−(500/N) where N is the length of the probe. In general, hybridization is carried out at about 20-25 degrees below Tm (for DNA-DNA hybrids) or 10-15 degrees below Tm (for RNA-DNA hybrids).
Exemplary highly stringent conditions may refer, e.g., to washing in 6×SSC/0.05% sodium pyrophosphate at 37° C. (for about 14-base oligos), 48° C. (for about 17-base oligos), 55° C. (for about 20-base oligos), and 60° C. (for about 23-base oligos).
Accordingly, the invention further provides nucleotide primers or probes which detect the polymorphisms of the invention. The assessment may be conducted by means of at least one nucleic acid primer or probe, such as a primer or probe of DNA, RNA or a nucleic acid analogue such as peptide nucleic acid (PNA) or locked nucleic acid (LNA).
According to one aspect of the present invention there is provided an allele-specific oligonucleotide probe capable of detecting a polymorphism at one or more of positions in the delineated regions.
The allele-specific oligonucleotide probe is preferably 5-50 nucleotides, more preferably about 5-35 nucleotides, more preferably about 5-30 nucleotides, more preferably at least 9 nucleotides.
Determination of Association with Mastitis
In order to detect if a genetic marker is present in the genetic material, standard methods well known to persons skilled in the art may be applied, e.g. by the use of nucleic acid amplification. In order to determine if the genetic marker is genetically linked to mastitis resistance traits, a permutation test can be applied (Doerge and Churchill, 1996), or the Piepho-method can be applied (Piepho, 2001). The principle of the permutation test is well described by Doerge and Churchill (1996), whereas the Piepho-method is well described by Piepho (2001). Significant linkage in the within family analysis using the regression method, a 10000 permutations were made using the permutation test (Doerge and Churchill, 1996). A threshold at the 5% chromosome wide level was considered to be significant evidence for linkage between the genetic marker and the mastitis resistance and somatic cell count traits. In addition, the QTL was confirmed in different sire families. For the across family analysis and multi-trait analysis with the variance component method, the Piepho-method was used to determine the significance level (Piepho, 2001). A threshold at the 5% chromosome wide level was considered to be significant evidence for linkage between the genetic marker and the mastitis resistance and somatic cell count traits.
In one aspect, the present invention further relates to a method for selecting a bovine subject for breeding purposes. This method for selecting a bovine subject for breeding purposes comprises determining resistance to mastitis of said bovine subject and/or off-spring therefrom by any method as defined herein, such as determining resistance to mastitis in a bovine subject, by detecting in a sample from said bovine subject the presence or absence of at least one genetic marker as defined herein.
The purpose of the method is to select those bovine subjects with the best breeding value for breeding. For example, selection of bovine subjects for breeding according to the present invention serve to increase the mean breeding value of the next generation of bovine subjects, compared to the mean breeding value of the previous (parent) generation of bovine subjects.
In one embodiment, the method of the present invention for selecting a bovine subject for breeding purposes comprises estimating a breeding value of said selected bovine subject. For example, the breeding value is estimated on the basis of the presence or absence of a genetic marker of the present invention.
In one aspect, the present invention relates to a kit, such as a diagnostic kit, for detecting the presence or absence in a bovine subject of at least one genetic marker as described herein, such as a marker associated with resistance to mastitis. In one embodiment, the present invention relates to a diagnostic kit for detecting the presence or absence in a bovine subject of two or more genetic marker alleles as described elsewhere herein, said kit comprising at least one detection member. Specifically, the kit is suitable for detection of the presence or absence of at least one genetic marker allele, such as two or more genetic markers, which are associated with at least one trait indicative of mastitis resistance of said bovine subject and/or off-spring therefrom. Examples of specific traits which are indicative of mastitis resistance are disclosed elsewhere herein. Such traits include, SCS, SCC, and treated cases of clinical mastitis, for example CM11, CM12, CM2, CM3, CM, SCC3, SCC2, SCC1 and/or SCC.
The kit of the invention preferably comprise at least one detection member for determining a genetic marker located in a genomic region as defined herein above.
Detection members of the present invention include any entity, which is suitable for detecting a genetic marker on the genomic (including epigenomic), transcriptional or translational level. Detection members comprise oligonucleotide primers and/or probes, antibodies, aptamers, chemical substances etc.
In one embodiment, the diagnostic kit comprises at least one oligonucleotide for detecting said genetic marker allele in said bovine subject.
In one embodiment, the detection member is an oligonucleotide primer and/or an oligonucleotide probe. In a preferred embodiment, the detection member is an oligonucleotide primer as described elsewhere herein, or an oligonucleotide probe with a sequence corresponding to any oligonucleotide primer as defined herein. The at least one oligonucleotide of the kit preferably comprises or consists of between 5 and 100 consecutive nucleotides, such as between 10 and 30 consecutive nucleotides, or at least 5, such as 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or at least 25 consecutive nucleotides. In a preferred embodiment, the detection member is an oligonucleotide comprising at least 5 consecutive nucleotides specific for any one of the SNP markers set forth in columns 9 and 10 in the table identified in table 2.
In one aspect, the present invention relates to a kit for use in detecting the presence or absence in a bovine subject of at least one genetic marker associated with resistance to mastitis, comprising at least one detection member for determining a genetic marker located in a region of the bovine genome selected from the group consisting of regions 1-61 of table 2, wherein said regions are delineated by the SNP markers identified in columns 3 and 5, and/or delineated by the genomic position identified in columns 4 and 6.
The genetic markers to be detected by the detection members of the kit of the present invention are disclosed elsewhere herein. Thus, the genetic marker is for example any genetic marker as described herein, such as two or more genetic marker alleles located in a gene selected from the group consisting of the markers mentioned in columns 9 and 10 of table 2. In a preferred embodiment, the genetic marker is located in the NPFFR2 gene, as defined elsewhere herein. Thus, in one embodiment, the kit of the invention comprise at least one detected member capable of detecting a mutation in the NPFFR2 gene, in particular for detecting the chr6—89059253 SNP located at 89,059,253 Bp position on BTA6. The detection member, thus in a preferred embodiment is a nucleic acid sequence comprising between 5 and 100 consecutive nucleotides, such as between 10 and 30 consecutive nucleotides, or at least 5, such as 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or at least 25 consecutive nucleotides of the NPFFR2 gene, such as SEQ ID NO: 1, or a nucleic acid sequence at least 70% identical thereto, such as at least 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, such as at least 99% thereto. In a preferred embodiment, the nucleic acid sequence comprises the chr6—89059253 SNP, and/or any genetic marker polymorphism coupled thereto.
The kits of the present invention may further comprise at least one reference sample. In one embodiment, said reference sample comprises a nucleic acid sequence comprising a genetic marker associated with mastitis resistance, such as described herein, and in another embodiment, the reference sample comprises a nucleic acid sequence comprising a genetic marker associated with susceptibility to mastitis
The kits of the present invention further comprise in specific embodiments instructions for performance of the detection method of the kit and for the interpretation of the results.
Genotyping of a bovine subject in order to establish the genetic determinants of resistance to mastitis for that subject according to the present invention can be based on the analysis of DNA and/or RNA. One example is genomic DNA which can be provided using standard DNA extraction methods as described herein. The genomic DNA may be isolated and amplified using standard techniques such as the polymerase chain reaction using oligonucleotide primers corresponding (complementary) to the polymorphic marker regions. Additional steps of purifying the DNA prior to amplification reaction may be included. Thus, a diagnostic kit for establishing mastitis resistance and somatic cell count characteristics comprises, in a separate packing, at least one oligonucleotide sequence.
The invention also relates to the use of a kit of the invention for detecting the presence or absence in a bovine subject of at least one genetic marker associated with resistance to mastitis, in particular for detecting any one or more of the markers identified herein. Furthermore, the present invention relates to the use of a kit of the present invention for estimating breeding value in respect of susceptibility to mastitis in a bovine subject.
The present invention also relates to determination of estimated breeding values.
In a large randomly mated population, each individual should on average give birth to two offspring in order to maintain the size of the population. The distribution of the number of offspring in the population has a left skewed binominal distribution (Poisson distributed) with an average value of 2 and variance of 2. Which means that the number of offspring per individual can vary from 0 and upwards, the values 0, 1, 2, 3, 4 and 5 being the most frequent. An estimated breeding value is often called an index (I). The index can be estimated on the basis of information of phenotype values from all possible relatives. A simple regression line or multiple regression can be used. The higher the number of relatives is the better the estimation will be. Correlation between the true breeding value (A) and the index is given the name Accuracy and it has the symbol rAI. The estimated breeding value is based on a theory of linear regression and correlation.
In one aspect, the present invention relates to a method for estimating a breeding value in respect of susceptibility to mastitis in a bovine subject, comprising detecting in a sample from said bovine subject the presence or absence of at least one genetic marker that is associated with at least one trait indicative of mastitis resistance of said bovine subject and/or off-spring therefrom, wherein said at least one genetic marker is located in a region of the bovine genome selected from the group consisting of regions 1-61 of table 2, wherein said regions are delineated by the SNP markers identified in columns 3 and 5, and/or delineated by the genomic position identified in columns 4 and 6. The method preferably comprises detection of one or more of the specific markers associated with mastitis, which are identified elsewhere herein.
The breeding value is in one example determined using a multi-trait random regression model (mtRRM) combined longitudinal TDSCS and binary CM traits, for example having the general description of the model in matrix form:
y=Xb+H
h
h+K
k
k+Z
a
a+Z
p
p+e,
where: y is a vector with observations on the nine different traits explained above. Vectors b, h, k contain the environmental effects whilst vectors a, and p contain additive genetic and nongenetic animal regression coefficients, respectively.
Environmental effects in the model could be calving age, herd environment and stage of lactation. Both additive genetic and non-genetic animal effects can be modelled by a second order Legendre polynomial for TDSCS and intercept for the other traits leading to a 15×15 (co)variance matrix for each random effect to be estimated. Vector e contains the residuals of the 9 traits.
In order to facilitate accurate estimation, residual (co)variances between CM traits and TDSCS may be assumed to be zero and the residual variance of CM and udder type traits may be set to operationally low values so that part of this variance entered the permanent environmental component. This can facilitate estimation of permanent environmental correlation between CM and the longitudinal trait. The covariance components were estimated using DMU package.
In one embodiment, the breeding value is calculated using a marker-assisted single trait Best Linear Unbiased Prediction (MA-BLUP).
The specific mastitis resistances traits, genetic markers and marker alleles, samples, bovine subjects, detection methods etc. are defined elsewhere herein.
In one aspect, the present invention provides a method for selective breeding of bovine subjects. The method of the invention allows the identification of bovine subjects suitable for selective breeding.
In one embodiment these methods comprise the steps of
a. providing a bovine subject,
b. obtaining a biological sample from said subject,
c. determining the presence in that sample of at least one genetic marker located in a region of the bovine genome selected from the group consisting of regions 1-61 of table 2, wherein said regions are delineated by the SNP markers identified in columns 3 and 5, and/or delineated by the genomic position identified in columns 4 and 6,
d. selecting a bovine subject having in its genome said at least one genetic marker, and
e. using said bovine subject for breeding.
The biological sample could be any suitable sample comprising genetic material, and which is preferably easily obtainable. Sample types are described further elsewhere herein.
The bovine is preferably a male subject, i.e. a bull. For example, when the bovine subject is a bull, the use of the bovine subject for breeding would normally include collecting semen from said bull and using said semen for artificial insemination of one or more heifers or cows.
However, the presence of the relevant genetic marker(s) may also be determined in cows and heifers according to the method of the invention.
Genome-wide linkage analysis was until recently the method of choice for quantitative trait loci (QTL) genome scan in cattle due to availability of large half-sib family structure. Linkage analysis is the method traditionally used to identify genes for phenotypes exhibiting Mendelian inheritance. For complex phenotypes such as quantitative traits, linkage analysis has only had limited success. In linkage analysis there are a few opportunities for recombination to occur within families and pedigree with known ancestry, resulting in relatively low mapping resolution which limits the candidate polymorphism search. In the contrary, association mapping (linkage disequilibrium mapping) has emerged as a powerful tool to resolve complex trait variation down to the sequence level by exploiting historical recombination events at the population level for high resolution mapping. In this approach markers/haplotypes with predicting ability in the general population for a trait of interest are identified. Such markers and haplotypes could be used directly for marker-based selection. Typically genome scans are used to map QTL for which some test statistic exceeds a pre-defined threshold value. Although the threshold level can be chosen to be very conservative, a probability that the QTL in reality represents a type I error remains. Therefore, results from QTL studies should be confirmed in an independent analysis before being used in subsequent fine mapping experiments or in marker-assisted selection. If the results from linkage analysis can be confirmed by an association study, it will also provide credibility to the detected QTL.
Lund et al (2008) mapped QTL for clinical mastitis and somatic cell score in Danish Holstein cattle using linkage analysis. These authors used data on 356 microsatellite markers spread across all autosomes with an average marker spacing of 8.6 cM. Nonetheless, the QTL regions reported were quite long (more than 20 cM for some QTL). Such large QTL regions along with family-specific marker-QTL associations limit the usability of their result for practical animal breeding as well as for candidate polymorphism searches. Thus, a need to map QTL to narrower genomic regions remains because inclusion of QTL information in selection decisions requires fine-mapping of causal polymorphisms. In this example, association mapping was carried out for 6 mastitis traits in cattle using dense SNP markers.
A total of 2,531 Danish and Swedish Holstein bulls were genotyped using the bovineSNP50 beadchip (Illumina®). Only SNPs with minor allele frequency equal to or higher than 0.05 and average GC score of at least 0.65 were retained for the analysis. Thus total of 36,387 SNPs on 29 bovine autosomes (BTAs) were selected for association analyses. Individual SNP types with GC score less than 0.6 were dropped. The number of SNPs included for analysis varied from 675 on BTA28 to 2,320 on BTA1. The details on the genotyping platform and quality control for SNPs are described by Sahana et al. (2010a). The SNP positions within a chromosome were based on the Bos taurus genome assembly (Btau—4.0, Liu et al. 2009).
Single trait breeding values (STBV) were used as phenotypes in this analysis. Six mastitis related STBVs were analyzed for association with SNPs. Single-trait breeding values were calculated for each animal using best linear unbiased prediction (BLUP) procedures and a sire model by the Nordic Cattle Genetic Evaluation. For definitions and models used in breeding value prediction, see http://www.nordicebv.info, except that the correlation to other traits was set to 0 to avoid information from phenotypes of correlated traits to affect results of any particular trait. Also, only sire-son and sonoffspring relationships were included, effectively producing a sire model. The STBV were adjusted for the same systematic environmental effects as in the official routine evaluations. Clinical mastitis was defined as a binary trait, mastitis treatment (1) or not (0) within four time periods: the incidence of mastitis from −15 to 50 days in first lactation (CM11), 51 to 305 days in first lactation (CM12), −15 to 305 days in second lactation (CM2), −15 to 305 days in third lactation (CM3), all measure as binary trait. The STBVs for the four mastitis traits are weighted together by the following relative weights: CM=0.25*CM11+0.25*CM12+0.3*CM2+0.2*CM3 to form a mastitis resistance index (CM) (Johansson et al. 2007), standardized to a mean of 100 and a standard deviation of 10. Somatic cell score (SCS) is an important trait for the estimation of breeding values for udder health. SCS is an index of average log somatic cell count from 5 to 170 days from first three lactations with relative weights of 0.5, 0.3 and 0.2 for first, second and third lactation respectively (Johansson et al. 2007). The number of STBVs available for analysis among the genotyped animals were 1671 for CM11, CM12 1668, CM2 1669, CM3 1544, CM 2098 and SCS 1671.
Mixed model: The mixed model analysis as proposed by Yu et al. (2006) was used for association analyses. In this approach, a polygenic genetic effect was fitted as a random effect. Single SNPs were successively included as fixed effect in the model. The model was:
y=μ1+αs+Zu+e
where y is a vector of observed phenotypes (STBV), μ is a shared fixed effect, 1 is a vector of ones, α is allele substitution effect of the SNP, s is an incidence vector with elements 0, 1 or 2 relating α to the individuals, Z is a matrix relating records to individuals, u is a vector of additive polygenic effects and e is a vector of random residual effects. The random variables u, and e are assumed to be multivariate normally distributed. u has mean 0 and covariance matrix σg2A, where σg2 is the polygenic genetic variance and A is the additive relationship matrix derived from pedigree. e has mean 0 and covariance matrix □e2I, where □e2 is the residual variance and I is the identity matrix. The analysis was carried out using the software package DMU (http://gbiagrsci.dk/dmu/). Significance of each marker's effect was tested using a t-test against a null hypothesis of α=0.
Significance Test:
For control of the family-wise error rate (FWER), the Bonferroni correction was applied. The Bonferroni correction controls FWER (α)=1−(1−ai)m αim, where αi is the individual test rejection level and m is the number of tests. The 5% chromosome-wise significance thresholds ranged from the point wise p-value of 2.16×10−5 on BTA1 to 7.41×10−5 on BTA28, or 4.67 to 4.13 in the −log10 transformed scale. Bonferroni correction is very conservative (Han et al. 2009) as it does not take account of correlation (linkage disequilibrium) among SNPs. We used a liberal significance threshold of 10−4 for the QTL regions where QTL have previously been identified by Lund et al. (2008) who used linkage analyses with microsatellite markers for QTL mapping. In the following sections a significant association will mean chromosome-wise significant, and a suggestive association means a point wise p-value less than 10−4.
Marking the QTL Region:
Normally multiple SNPs in the vicinity of a QTL are expected to yield significant results in a single SNP analysis. This is because all SNPs that are physically located near the causal factor will tend to be in linkage disequilibrium. This effect declines with genetic distance and also depends on minor allele frequencies. In this study, QTL regions were demarcated subjectively. Starting at the most significant SNP, the QTL region was extended left and right until a region was reached where all markers had −log(p) values below 3. I.e. that the QTL thus demarcated may contain one or more non-significant markers. To compare results from the present study with the earlier ones, we took the maker positions from Btau—4.0. If the maker location was not available in Btau—4.0, we have reported marker and given the position in cM from MARC table [http://www.marc.usda.gov/genome/cattle/cattle.html].
The present genome-wide association study (GWAS) identified 9 chromosome-wise significant QTL for clinical mastitis and somatic cell score on 8 chromosomes in Danish and Swedish Holstein cattle (Table 3). We have presented 92 SNP×trait combinations which showed chromosome-wise significant association and out of then 24 combinations crossed genome-wide significance threshold (Supplementary Table 3). Most of the genome-wide significant associations were observed for CM and four SNP showed genome-wide significant association with CM2. Five SNPs showed significant association with more than one mastitis trait. The signal plots (
6
rs41664497
25195079
CM2
0.581
0.104
7.08
6
ss86290235
26706544
CM
0.885
0.176
5.90
6
rs29015635
81958670
CM
0.810
0.161
5.89
6
rs29017739
85040979
CM
−2.016
0.331
8.42
6
rs29001782
86128028
CM
0.949
0.154
8.55
6
rs41588957
86467725
CM
−0.837
0.156
6.62
6
ss86307579
87255541
CM
−0.883
0.160
6.98
6
ss86317213
87879378
CM
−0.788
0.156
5.90
6
rs41610991
87904281
CM
−0.880
0.154
7.44
6
ss117968738
88983536
CM
0.974
0.172
7.36
6
ss86341106
89212073
CM
−1.071
0.164
9.53
6
ss86278591
89668441
CM
−2.444
0.468
6.29
6
ss86337596
89774923
CM
−0.972
0.152
9.22
6
rs43338539
89838828
CM
0.917
0.179
6.05
6
ss86296213
90008100
CM
1.144
0.190
8.18
6
rs42766480
90075264
CM
0.916
0.167
6.89
6
ss117963883
94872475
CM
−0.987
0.166
8.02
13
ss86317725
57728100
CM
−0.858
0.148
7.65
13
ss86290731
57750019
CM
0.846
0.149
7.32
ss86287919
236533
CM2
−0.282
0.054
6.25
14
ss86329615
443936
CM2
−0.278
0.054
6.09
14
ss86328358
679601
CM2
−0.279
0.052
6.49
20
rs41938511
36232606
CM
−0.984
0.193
6.02
20
rs41940571
37740343
CM
1.127
0.193
7.79
The QTL intervals observed with association mapping were much narrower than those reported by Lund et al. (2008) who used linkage study with sparse map of microsatellite markers. Association mapping utilizes population level linkage disequilibrium. It therefore can map a QTL to a very small chromosomal region. The definitions of the mastitis traits were slightly different in Lund et al. (2008) and the present study. Thus, clinical mastitis for the first lactation (−10 to 305 d) was studied as one trait (CM1), while we have divided the first lactation mastitis into two sub-traits (CM11 and CM12). On BTA4, we detected a QTL affecting CM and SCS at 19.7 Mb. We further observed 3 SNPs between 66.46-66.61 Mb had −log10(p) values between 3.5-3.8. We also detected two suggestive QTL for CM at 40.3 and 97.0 Mb on BTA5.
The strongest association of SNP to mastitis traits in this study was observed on BTA6 at 89.2 Mb. This QTL affected CM, CM11 and SCS. The most significant SNP, ss86341106, is located within the gene Deoxycytidine kinase (DCK), which catalyzes the rate-determining step in the deoxyribonucleoside salvage pathway. The highest levels of DCK expression are found in thymus and bone marrow, which indicates a role of DCK in lymphopoiesis. Indeed, knockout mice lacking enzyme activity revealed a combined immune deficiency phenotype, i.e. they produce very low levels of both T and B lymphocytes (Toy et al., 2010). Another strong candidate gene in this region is the IGJ gene, which encodes the immunoglobulin J polypeptide. This protein serves a nucleating function in the formation of the immunoglobulin M (IgM) pentameric complex and in the assembly of IgA dimers and polymers. IgM is the first antibody produced in the primary immune response to microbial infections and therefore plays a crucial role in preventing systemic spread of the pathogen (Racine and Winslow, 2009). Also IgA is engaged in the defense against microorganisms, in particular those that invade the host through mucosal surfaces. Thus, IgA is the major antibody class found in mucosal secretions, where it combines with microbes to prevent them from attaching to or penetrating the mucosal membranes (Lamm, 1997).
We have also detected another QTL at 25.2 on BTA6 significantly associated with the SNP ss61565956. An interesting candidate gene in this region is DAPP1, also known as Bam32, which is expressed in B cell lymphocytes and has been implicated in B cell antigen receptor (BCR) signaling. Thus, antigen binding to BCR involves a chain of signaling processes that are critical for B cell-fate decisions such as proliferation and differentiation, and BCR-mediated antigen internalization, processing, and presentation to T cells (Pierce, 2002). Studies of Bam32 deficient mice have shown that Bam32 mediates BCR-induced proliferation of B cell but not survival (Han et al., 2003), it regulates B cell antigen receptor internalization (Niiro et al., 2004), and it promotes the formation of stable interactions between B cells and T cells needed for efficient T cell activation, most likely by promoting adhesion to integrin ligands expressed on T cells (Al-Alwan et al., 2010).
We also found suggestive evidence for a QTL affecting CM at 75.9 Mb on BTA9.
We observed a suggestive evidence for a QTL affecting CM at 69.2 Mb on BTA11. Also, a suggestive QTL for CM was observed in our analysis at 30.4 Mb.
On BTA13, we detected a genome-wide significant QTL for CM, CM11 and CM12 at 57.7 Mb on BTA13. There were two closely located SNPs, which showed genome-wide association, located very close to the endothelin 3 gene [http://www.ensembl.org/Bos_taurus/]. The endothelins ET-1, ET-2, and ET-3 constitute a family of 21-amino acid peptides that are produced by numerous cells and tissues such as macrophages, and endothelial and epithelial cells (Giaid et al., 1991). In addition to a vasoconstrictive effect, they also have an impact on many different cell types, including activation of neutrophils (Elferink and De Koster, 1998). Neutrophils are blood-borne leukocytes that combat bacterial and fungal infections by phagocytosis or release of antimicrobial peptides (Selsted and Ouellette, 2005). Another possible candidate gene located in this region is Phactr3 (phosphatase and actin regulator 3), which has been shown to stimulate cell spreading and migration through direct interaction with the actin cytoskeleton (Sagara et al., 2009). Cell mobility is critically important for cell-mediated immune response (Luster et al., 2005). Lund et al. (2008) detected QTL for SCS between microsatellite markers BM9248 (29.1 cM) and BL1071 (68.6 cM; 71.9 Mb on Btau—4.0). The QTL interval reported was very large (39.5 cM) in the linkage analysis. In contrast, the present GWAS was able to narrow the QTL to a 4 Mb region.
We have identified a genome-wide significant QTL at 37.7 Mb on BTA20. The most significant SNP, rs41940571 is linked to the gene RIPTOR independent companion of MTOR, complex 2. There are several other genes located in this QTL region in the Btau—4.0 assembly. Among these is the C9 gene, encoding the complement component C9 precursor. The complement system is part of the immune response against invading pathogens. Activation of the complement system through the classical, alternative, or mannan-binding lectin pathways ultimately leads to formation of the Membrane Attack Complex, which creates pores in bacterial membranes, resulting in cell lysis. Complement C9 is the pore-forming subunit of MAC and mutations in this gene are associated with increased risk of infections, for example meningococcal meningitis (Kira et al., 1998; Zoppi et al., 1990; Horiuchi et al., 1998). Lund et al. (2008) observed a QTL for UD between 31.3 and 48.2 Mb on BTA20. These two studies point probably to the same QTL.
On BTA23, Lund et al. (2008) observed a QTL for SCS between BMS466 (46.1 cM; 43.4 Mb on Btau—4.0) and INRA090 (53.2 cM) and a QTL for UD between 43.9-46.6 Mb. Ashwell et al. (1997) and Heyen et al. (1999) detected QTL for SCS on BTA3 at 39.9 and 48.6 Mb, respectively. Our study found suggestive evidence for a QTL affecting CM and SCS on this chromosome at 11.7 Mb, far away from the earlier reports. The QTL we found could be a different one than those reported earlier.
We also detected a genome-wide significant QTL affecting CM and CM2 at the proximal end of BTA14 (0.7 Mb) which was not detected in the same population by Lund et al. (2008). Three SNPs showed genome-wide significant association with MAS2. A region around 1.3 Mb with CYP11B1 harbors a QTL for SCS in German Holstein cattle (Kaupe et al. 2007) which could be the same QTL as detected in the present study. There are several genes located in the QTL region in Btau—4.0 including DGAT1 (Grisart et al. 2002) which has a large influence on phenotypic variance in milk fat content and other milk characteristics.
The genetic correlation between clinical mastitis and SCS is >0.70 (Lund et al. 1999, Carlen et al. 2004; Heringstad et al. 2006). Therefore, it was expected that many of the QTL affecting CM would also affect SCS. Out of nine significant QTL affecting clinical mastitis traits, five showed effect on SCS. This was as expected due to high genetic correlation between clinical mastitis and SCS. As we are analyzing both clinical mastitis and SCS in the present study, may help to indicate the extent of SCS QTL from the literature can be expected to affect clinical mastitis. Out of the six mastitis traits analyzed in the present study, maximum number of QTL was observed for mastitis index which was an index combing clinical mastitis from first three lactations.
The present study identified several mastitis QTLs. We used association study with dense SNP markers in a mixed model analyses which was observed to perform best for samples from complex pedigreed population like cattle (Sahana et al. 2010b). In the present study QTL positions were refined to much narrower genomic regions than has been possible by previous linkage analysis. This association mapping identified SNPs which are in linkage disequilibrium with the QTL, or which are causative mutations, and therefore, marker-based selection at the population level for mastitis resistance could be carried out. Some of the QTL regions were narrow enough to initiate further search for candidate genes underlying mastitis QTL.
Clinical mastitis and somatic cell score QTL in dairy cattle were fine-mapped using high-density SNP Chips comprising 777,962 SNP probes.
Association mapping identifies specific functional variants (i.e., loci, alleles) linked to phenotypic differences in a trait, to facilitate detection of trait causing DNA sequence polymorphisms and/or selection of genotypes that closely resemble the phenotype. Association mapping has been variously defined (Chakraborty and Weiss 1988; Kruglyak 1999), and has also been referred to as “association genetics,” “association studies,” and “linkage disequilibrium mapping”. Genome-wide association studies (GWAS) provide an important avenue for undertaking an agnostic evaluation of the association between common genetic variants and risk of disease or quantitative traits. Recent advances in our understanding of genetic variation and the technology to measure such variation have made GWAS feasible.
In the present example, association mapping has been used to identify single nucleotide polymorphisms (SNPs) which are associated with mastitis resistance in dairy cattle. Several Quantitative Trait Loci (QTL) were identified which can be usefully applied in selection of animals for improvement of resistance to mastitis.
The genome scan for mastitis resistance was carried out using Danish and Swedish Holstein cattle for nine mastitis phenotypes analysed. The phenotype used for mapping quantitative trait loci (QTL) for mastitis resistance was udder health index estimated for Nordic cattle genetic evaluation (NAV, Pedersen, 2008, www.nordicebv.info). The udder health traits currently evaluated in NAV included four clinical mastitis traits from three lactations, all measured as a binary trait (Table 5). These four mastitis traits are weighted together to form a mastitis resistance index (CM), standardized to a mean of 100 and a standard deviation of 10 (Johansson et al. 2007). There were 4200 progeny tested bulls from Danish, Swedish and Finnish Holstein dairy cattle with recode for these nine mastitis related phenotypes. The SNP genotype and phenotypes of these bulls were utilized for association mapping.
The Holstein bulls were genotyped using the Illumina Bovine SNP50 BeadChip. Genotyping was done by the Illumina Bovine SNP50 BeadChip (Illumina Inc., http://www.illumina.com/Documents/products/datasheets/datasheet_bovine_snp5O.pdf) at the Danish Institute of Agricultural Sciences, Research Center Foulum, Department of Molecular Biology and Genetics and at GenoSkan, AgroBusiness Park Foulum. The platform used was an Illumina® Infinium II Multisample assay device. SNP chips were scanned using iScan and analyzed using Beadstudio ver. 3.1 software. The quality parameters used for selection of SNPs were minimum call rates of 85% for individuals and of 95% for loci. Marker loci with minor allele frequencies (MAFs) below 5% were excluded. The minimal acceptable GC score was 0.60 for individual typings. Individuals with average GC scores below 0.65 were excluded. The number of SNPs after quality control was 43,415 in the 50 k dataset. A total of 557 Holstein bulls in the EuroGenomics project (Lund et al., 2011) were regenotyped using the BovineHD Genotyping BeadChip (http://www.illumina.com/Documents/products/datasheets/datasheet_bovineHD.pdf). There are a total of 777,962 SNPs on the BovineHD BeadChip that uniformly span over entire bovine genome with an average gap size of 3.43 kb and a median gap size of 2.68 kb. The quality control parameters set for HD data were similar as it was for 50K chip as described above. The 50 k genotypes were imputed to the HD genotypes using Beagle software package (Browning and Browning, 2009), based on the marker data of the HD genotyped bulls (Su et al 2011; interbull meeting presentation). The markers in the 50 k chip but not included in the HD chip were excluded in the imputation process. The number of SNPs after imputation to BovineHD chip was 648,219. The genome positions of the SNPs were taken from UMD3.1 assembly (http://www.ensembl.org/Bos_taurus/2011—09_cow_genebuild.pdf). The physical maps for the 648,219 SNPs located on 29 Bovine autosomes are available at www.illumina.com.
The details of the association mapping model are described by Yu et al. (2006) and Sahana et al. (2010). The statistical model used for association analyses was:
y
i
=μ+bx
i
+s
i
+e
i
Where yi was the single trait estimated breeding value of individual i, μ was the general mean, xi was a count in individual i of one of the two alleles (with an arbitrary labeling), b was the allele substitution effect, si was the random effect of the sire of individual i, assumed to have a normal distribution N(0, Aσs2), where A is the additive relationship matrix and σs2 is the sire variance, and ei was a random residual of individual i assumed to follow a normal distribution with mean zero and error variance, σe2. Testing was done using a Wald test against a null hypothesis of H0: b=0. The significance threshold was determined using a Bonferroni correction. The genome-wide significance threshold was calculated by dividing the nominal significance threshold of 0.05 by the total numbers of SNPs included in the analysis.
A total 61 QTL regions on 22 chromosomes associated with mastitis related traits were identified. The QTL regions along with the highest significantly associated SNP for each QTL are presented in Table 6; cf.
Table 6: cf.
The genomic region from 88-96 Mb on BTA6 was selected for targeted genome-wide association study with SNP variants identified from the whole genome sequence of 90 bulls. This genomic region was selected as it showed the strongest association with clinical mastitis in analyses of the Illumina Bovine SNP50 BeadChip (HD SNP chip). The most significant SNP association with clinical mastitis for HD SNP chip analyses was BovineHD0600024355 located at 88,919,352 Bp on BTA6.
The whole genome of ninety bulls from three breeds (˜30 each from Nordic Holstein, Danish Jersey and Nordic Red breed) was sequenced (˜10× coverage) at Beijing Genomic Institute (BGI), China. The whole genome sequences were analyzed and more than 24 million variants were observed. The variants were functionally annotated. The SNP polymorphisms for the targeted region (TR) on BTA6 harbouring mastitis QTL were extracted. There were a total of 41,993 SNP variants within the TR of 8 Mb. There were 5,193 Nordic Holstein bulls with the clinical mastitis phenotypes and the HD SNP chip genotypes. These animals were imputed for the 41,993 SNP variants identified in WGS using software Beagle (Browning and Browning, 2007). The association analyses were carried out for these 5,193 bulls' data using the mixed linear model analysis (Yu et al. 2006). The results showed an association with clinical mastitis of the neuropeptide FF receptor 2 (NPFFR2) gene (
Genome-wide association studies (GWAS) was carried out previously for nine mastitis traits in Nordic Holstein cattle. The genotyping was done using Bovine HD SNP chip. A linear mixed model analyses was carried out to identify the SNPs significantly associated with mastitis resistance. Based on this GWAS study, six genomic regions were selected for targeted GWAS with whole genome sequence data (Table 8).
A total of 90 bulls' (˜30 of each of Danish Red, Danish Jersey and Nordic Holstein) whole genomes were sequenced at BGI, China. The sequence data was analyzed at by the Quantitative Genetics and Genomic Centre (QGG), MBG, Aarhus University. The average genome coverage was more than 10×. Alignment of sequence reads to the cattle reference genome was done and the candidate sites or regions at which one or more samples differ from the reference sequence were identified. The quality control measures removed candidate sites that likely were false positives. The variants calls i.e. the estimation of the alleles present in each individual at variant sites was carried out using VCF tools (http://vcftools.sourceforge.net/). A total of more than 24 millions DNA level variants (single nucleotide polymorphism (SNP), insertion-deletions (indel), copy number variation (CNV) etc.) observed across three cattle breeds. All the variants were functionally annotated for search of candidate polymorphisms affecting mastitis related traits.
Six chromosomal regions (Table 8) were selected based on GWAS study with HD SNP chip on nine mastitis resistance traits in Nordic Holstein cattle. The length of the regions and the number of SNP variants (from whole genome sequence data) for each region selected after quality control are given in Table 8. The phenotypes (estimated breeding values) were available for 5193 Nordic Holstein bulls for nine mastitis related traits. The SNP chip genotypes (50 k and 777 k) of these bulls were imputed to the sequence level for the targeted regions using the software Beagle (Browing and Browing, 2006). All the SNP position mentioned here is as per the Bovine genome assembly (UMD3.1).
A SNP-by-SNP analysis where each SNP was fitted separately in a linear mixed model (LMM) following Yu et al. (2006). Complex familial relationship is the primary confounding factor in GWAS study in livestock population. LMM which include the relationship among individuals through a polygenic effect is able to control the false positives due to family structure (Yu et al., 2006).
For each SNP separately, the association between the SNP and the phenotype was assessed by a single-locus regression analysis using a linear mixed model. The model was as follows:
y=1μ+mg+Zu+e
where y is the vector phenotypes (EBV), 1 is a vector of 1 s with length equal to number of observations, p is the general mean, m is the genotypic score (obtained from Beagle output; values ranged between 0 and 2) associating records to the marker effect, g is a scalar of the associated additive effect of the SNP, Z is an incidence matrix relating phenotypes to the corresponding random polygenic effect, u is a vector of the random polygenic effect with the normal distribution N(0, Aσu2), where A is the additive relationship matrix and σu2 is the polygenic variance, and e is a vector of random environmental deviates with the normal distribution N(0, Aσe2), where σe2 is the error variance. The model was fitted by restricted maximum likelihood (REML) using the software DMU (Madsen and Jensen, 2011) and testing was done using a Wald test against a null hypothesis of g=0.
A SNP was considered to have significant association if the p-value crossed the region-wise significant threshold after Bonferroni correction for multiple testing.
Association Analyses with the Most Important SNP as Cofactor in the Model
A large number of SNPs crossed region-wide significant threshold. As the LD is expected to be high these significant effect of the SNPs could be due to linkage to only one casual variant segregating in the targeted region. However, as the regions were quite large (>5 Mb in some cases), it is also possible that the effect observed was due to multiple causative variants segregating in Nordic Holstein population. This analysis was done to see if any SNP shows significant association after the most important SNP from the LMM analyses and/or functional annotation was included in the model as cofactor (Table 9). The analysis was done using lme function of nlme of R-package (http://cran.r-project.org/). The model was as below.
Y
ij
=μ+S
i+fixSNP+SNPm+eij
where Yij is the residual phenotype obtained from an animal model (i.e. adjusted for the pedigree) for the jth animal of ith sire, Si is the random effect of the ith sire, fixSNP is the regression of genotype score for the highest significant SNP from the LMM (or the most important SNP based on functional annotation among a few top ones), SNPm is the regression of the genotype score of the mth SNP (m≠fixSNP) and eij is the random error.
The manhattan plots for the RWAS (both linear mixed model, and the linear model with the most important SNP as cofactor) are presented in
All the six targeted regions had wide picks of association. However, including the most significant associated SNP as cofactor (Table 9), the entire range of associated region collapses. This indicates the SNPs mentioned in Table 9 which were used as cofactor are either the real causal polymorphisms affecting mastitis resistance in Nordic Holstein or are in very high LD with the real causal polymorphism in the targeted regions. Therefore, these SNPs could be used as predictor of mastitis resistance on individual animals in Holstein cattle. Results from individual genomic regions are discussed in details below.
The most significant SNP for each of the nine mastitis related traits are presented in table 10 for the targeted region on BTA5. The total length of the targeted region on BTA5 was 9 Mb and there were two regions (at 86.99 and 92.75 Mb) where the highly significant SNPs were concentrated. The manhatton plot for this region is presented in
Candidate Polymorphism within the BTA5 Targeted Region:
BTA5 (86.99 Mb): There is a huge intron at 86.99 Mb. Upstream there is a non-synonymous polymorphism (allele frequency of the alternative allele for the polymorphisms (alt) 64%) at 86,948,388 which could be the candidate polymorphism. Downstream at 87,004,771 (alt 15%) and 87,004,957 (alt 3%), there are two polymorphisms in a non-coding gene in an intron. Further downstream there is a synonymous coding splice-site polymorphism at 87,023,448 (alt 31%).
BTA5 (92.75 Mb): The gene around 92,496,500 has three polymorphisms at 92,496,251 (alt 54%), 92,496,510 (alt 28%) and 92,496,586 (alt 3%). Downstream the next annotation starts around 93,688,996 (gene ENSBTAG00000013541). However, all polymorphisms in this gene are either intronic, upstream or downstream. The next downstream a candidate causative polymorphism could be at 93,939,231 (alt 7%) (gene ENSBTAG00000008541) which is non-synonymous coding. However, none of above candidate polymorphisms discussed within the targeted region of BTA5 showed strong association signal across the mastitis traits analyzed.
BTA5: Genes associated with mastitis according to the analysis are summarized in the table below. For clinical mastitis the top SNPs are concentrated around 92.7 Mb, whereas there are minor peaks at positions 87 Mb, (88.8 Mb, 90.9 MB) and 93.4 Mb. The following genes are located in the two major peak regions around 87 Mb and 92.7 Mb.
Among the candidate genes in this region we find RERGL encoding Ras-related and estrogenregulated growth inhibitor-like protein. There is little or no functional information about this specific gene in the literature. However, the Ras family of small GTPases is a group of more than 150 proteins that function in diverse biological processes including immunity and inflammation (Johnson and Chen, Current Opinion in Pharmacology 12, 458-463, 2012). Another good candidate gene which might be relevant in relation to mastitis is PIK3C2G, which codes for phosphoinositide-3-kinase class 2 gamma subunit. Many PI3K enzymes play an important role in the functioning of immune cells (Johnson and Chen, Current Opinion in Pharmacology, 2012; Koyasu, Immunology, 2003).
The most significant SNP for each of the nine mastitis related traits are presented in table 12 for the targeted region of BTA6. The targeted region on BTA6 was 8 Mb in length. The manhatton plot for this region is presented in the
SNP, Chr6—89059253, is a strong candidate polymorphism (alt 48%, gene ENSBTAG00000009070) for the targeted region of BTA6. This SNP showed very strong association with all the five clinical mastitis traits (CM11, CM12, CM2, CM3 and CM) (Table 13).
One associated gene is the IGJ gene, which encodes the immunoglobulin J polypeptide although it should be noted that the gene might be located too far away from the peak. This protein interacts with immunoglobulins IgM and IgA. IgM is the first antibody produced in the primary immune response to microbial infections whereas IgA is engaged in the defense against microorganisms in particular those invading the host through mucosal surfaces. Another associated gene is Deoxycytidine kinase (DCK gene), which catalyzes the rate-determining step in the deoxyribonucleoside salvage pathway. DCK is expressed in thymus and bone marrow, possibly indicating a role in lymphopoiesis. Mice lacking DCK enzyme activity revealed a combined immune deficiency phenotype, i.e. they produce very low levels of both T and B lymphocytes (Toy et al., PNAS, 2010). A relevant gene in this region is the GC gene, which belongs to the albumin family. The GC protein binds vitamin D and is involved in (inflammationprimed) activation of macrophages (Yamamoto and Naraparaju, Journal of Immunology, 1996; Kisker et al., Neoplasia, 2003). Another gene associated with mastitis in this region is the NPFFR2 gene (also known as GPR74), which encodes neuropeptide FF receptor 2. NPFFR2 show expression in several tissues including thymus, liver, spleen, brain, spinal cord and other. NPFF receptors have been implicated in hormonal modulation, regulation of food intake, thermoregulation and nociception through modulation of the opioid system (information from GeneCards). However, it is well documented that many neuropeptides participate in immune responses for example by acting as stimulators or inhibitors of macrophage activity (reviewed by Ganea and Delgado, Microbes and Infection, 2001). NPFFR2 also binds the prolactin-releasing-hormone, suggesting that NPFFR2 may play a role in prolactin secretion (Ma et al., European journal of neuroscience, 2009). Interestingly, in addition to regulating lactation, prolactin also acts as an important regulator of the immune system (Yu-lee, Recent Progress in Hormone Research, 2002).
The most significant SNP for each of the nine mastitis related traits for the targeted region of BTA13 are presented in table 15. The targeted region was 6 Mb in length. The manhatton plot for this region is presented in the
Two possible candidate polymorphisms based on functional annotation within the targeted region of BTA13 could be two consecutive SNPs located at 57579568 and 57579569 and both of them showed very high associations with mastitis traits.
The most significant SNP for each of the nine mastitis related traits for the targeted region of BTA16 are presented in table 18. The targeted region on BTA16 was 7 Mb. The manhatton plot for this region is presented in the
The candidate SNPs for the targeted region on BTA16 which showed strong association across several mastitis related traits are presented in Table 19. The SNP at 50,529,178 showed the strong association followed by two more SNPs (50,564,280 and 50,573,032) across several traits. Besides these three candidates, there is a non-synonymous polymorphism at 50,529,395 (alt 8%) located in the gene ENSBTAG00000020014. Downstream there are candidates in ENSBTAG00000004738 at 50,546,994 (alt 45%) (non-synonymous), and a splice-site polymorphism at 50,547,815 (alt 78%).
The most significant SNP for each of the nine mastitis related traits for the targeted region of BTA19 are presented in table 21. The targeted region on BTA19 was 3 Mb. The manhatton plot for this region is presented in the
Polymorphism Associated with Mastitis Resistance in the Targeted BTA19:
Downstream ENSBTAG00000013677 starts around 55,324,679 Bp (alt 72%). There are splice-site variants at 55,331,001 (alt 21%) and 55,338,316 (alt 64%). ENSBTAG00000044443 starts around 55,414,846 (not included in the association analyses). There is a variant in a non-coding gene at 55,419,720 (alt 29%). Upstream ENSBTAG00000002633 starts around 55,158,662 without any interesting polymorphisms. None of the above SNP selected from the functional annotation showed strong association signal across mastitis traits.
The most significant SNP for each of the nine mastitis related traits for the targeted region of BTA20 are presented in table 23. The targeted region on BTA20 was 8 Mb. The manhatton plot for this region is presented in the
Polymorphism Associated with Mastitis Resistance in the Targeted BTA20 Region:
There are two interesting candidate polymorphic variants at 35,965,955 and 35,965,956. The association results points toward the SNP at 35,965,955 which showed strong association with all the nine traits analyzed (Table 24). In ENSBTAG00000010423 there is a non-synonymous polymorphism at 35,966,158 (alt 52%). There are also candidate polymorphism at 35,942,954 (tri-alleleic indel+snp, polymorphic) and 35,942,739 (alt 52%) and a splice-site polymorphism at 35,938,178 (alt 2%). There is another non-synonymous one at 35,922,233 (alt 4%), Another gene (ENSBTAG00000019595) starts around 35,994,141. There are non-synonymous variants at, 36,011,203 (alt 84%) and 36,013,931 (alt 73%). There is a splice-site polymorphism at 36,011,211 (alt 83%). Combing the association results and functional annotation the SNP Chr20—35965955 emerges as the strongest candidate polymorphism located with the targeted region on BTA20 affecting mastitis traits.
The missense mutation, rs110326785 (G/A) in the neuropeptide FF receptor 2 gene (NPFFR2) is associated with a mastitis QTL on BTA6. This SNP located at 89,059,253 Bp (UMD3.1) causes an amino acid change 392 E to K (Glutamic acid to Lysine). The minor allele frequency of rs110326785 in Nordic Holstein is 48.3%. The allele substitution effects for nine mastitis traits in Holstein are given in the below table 26. This SNP (rs110326785) is also segregating in Nordic Red cattle population (MAF=41.2%) with allele substitution effect of −2.68 (se=0.26) for the breeding value for mastitis index and it explained 2.58% of the genetic variance. This confirms its effect in the same direction in both Holstein and Nordic Red, i.e. the allele A is reducing the resistance to mastitis in both the populations.
The SNP, rs133218364, is a synonymous variant within Caspase recruitment domain-containing protein 6 gene (CARD6) showed most significant association with clinical mastitis index in Holstein cattle. This SNP is located at 33,642,072 Bp on BTA20. Similarly, another SNP, rs133596506, (at 35969994 Bp) located 3323 Bp downstream to LIFR gene (Leukemia inhibitory factor receptor) also showed very high significant association with clinical mastitis index. These two variants were fitted as fixed effect in a haplotype-based analysis using 50K genotype. The variant rs133218364 was able to explain the total QTL variance for the targeted region on BTA20 (green line in the Figure below). However, rs133218364 being a synonymous variant does not change the amino acid composition of the protein. Therefore, rs133218364 is not likely the causative polymorphism underlying the QTL, but is in perfect linkage disequilibrium with the causative polymorphism. The rs133596506 located close to LIFR gene also when included in the haplotype model resulted in a substantial decrease in test statistic
Number | Date | Country | Kind |
---|---|---|---|
PA 2012 70508 | Aug 2012 | DK | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2013/067838 | 8/28/2013 | WO | 00 |