The present invention relates to a method for identification of ovine with a genotype indicative of one or more altered performance traits.
Marker assisted selection (MAS) is an approach that is often used to identify animals that possess alteration in a particular trait using a genetic marker, or markers, associated with that trait. The alteration in the trait may be desirable and be advantageously selected for, or non-desirable and advantageously selected against, in selective breeding programs. MAS allows breeders to identify and select animals at a young age and is particularly valuable for hard to measure and sex limited traits. The best markers for MAS are the causal mutations, but where these are not available, a haplotype that is in strong linkage disequilibrium with the causal mutation can also be used. Such information can be used to accelerate genetic gain, or reduce trait measurement costs, and thereby has utility in commercial breeding programs.
Often in MAS, a particular marker is used for identification of animals with alteration in a particular trait, and different markers are used for different traits. For example, in sheep, the Inverdale marker is used to identify sheep with altered prolificacy (Galloway et al. 2000) and a GDF8 marker haplotype can be used to identify sheep with a variant causing increased muscling (Johnson et al. 2005).
It would however be beneficial to have available individual markers that could be used to identify animals with alteration in one or multiple performance traits.
It is therefore an object of the invention to provide a method for identifying an ovine with a genotype indicative of one or more altered performance traits, and/or at least to provide the public with a useful choice.
In the first aspect the invention provides a method for identifying an ovine with a genotype indicative of at least two altered performance traits, the method including the step of detecting, in a sample derived from the ovine, the presence of at least one allele of the CP34 simple sequence repeat (SSR) marker, or at least one allele of a marker in linkage disequilibrium (LD) with CP34, wherein the presence of the allele is indicative of the altered performance traits in the ovine.
Preferably the performance trait is selected from the group comprising of: weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), hogget fibre diameter (FDIAM), and resistance to gastrointestinal parasitic nematode infection.
Preferably the performance trait is selected from the group consisting of: weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), hogget fibre diameter (FDIAM), and resistance to gastrointestinal parasitic nematode infection.
In one embodiment the performance trait is weaning weight (WWT).
Alternatively the performance trait is body weight at 8 months (LW8).
Alternatively the performance trait is body weight at 12 months (LW12).
Alternatively the performance trait is carcass weight (CW).
Alternatively the performance trait is adult ewe weight (EWT).
Alternatively the performance trait is eye muscle width (EMW).
Alternatively the performance trait is eye muscle depth (EMD).
Alternatively the performance trait is eye muscle area (EMA).
Alternatively the performance trait is fat depth (FD).
Alternatively the performance trait is carcass fat weight (FAT).
Alternatively the performance trait is carcass lean muscle weight (LEAN).
Alternatively the performance trait is number of lambs born (NLB).
Alternatively the performance trait is lamb fleece weight (LFW).
Alternatively the performance trait is hogget fleece weight (FW 12).
Alternatively the performance trait is ewe (adult) fleece weight (EFW).
Alternatively the performance trait is hogget fibre diameter (FDIAM).
Alternatively the performance trait is resistance to gastrointestinal parasitic nematode infection.
Preferably the ovine is altered for at least three, more preferably at least four and most preferably at least five performance traits.
In a further aspect the invention provides a method for identifying an ovine with a genotype indicative of at least one altered performance traits selected from the group consisting of: weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), hogget fibre diameter (FDIAM), and resistance to gastrointestinal parasitic nematode infection, the method including the step of detecting, in a sample derived from the ovine, the presence of at least one allele of the CP34 simple sequence repeat (SSR) marker, or at least one allele of a marker in linkage disequilibrium (LD) with CP34, wherein the presence of the allele is indicative of the altered performance traits in the ovine.
In one embodiment the performance trait is weaning weight (WWT).
Alternatively the performance trait is body weight at 8 months (LW8).
Alternatively the performance trait is body weight at 12 months (LW12).
Alternatively the performance trait is carcass weight (CW).
Alternatively the performance trait is adult ewe weight (EWT).
Alternatively the performance trait is eye muscle width (EMW)
Alternatively the performance trait is eye muscle depth (EMD).
Alternatively the performance trait is eye muscle area (EMA).
Alternatively the performance trait is fat depth (FD).
Alternatively the performance trait is carcass fat weight (FAT).
Alternatively the performance trait is carcass lean muscle weight (LEAN).
Alternatively the performance trait is number of lambs born (NLB).
Alternatively the performance trait is lamb fleece weight (LFW).
Alternatively the performance trait is hogget fleece weight (FW12).
Alternatively the performance trait is ewe (adult) fleece weight (EFW).
Alternatively the performance trait is hogget fibre diameter (FDIAM).
Alternatively the performance trait is resistance to gastrointestinal parasitic nematode infection.
Preferably the ovine is altered for at least two, more preferably at least three, more preferably at least four and most preferably at least five performance traits.
In a further aspect the invention provides a method for identifying an ovine with a genotype indicative of at least one altered performance traits selected from the group consisting of: weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), and hogget fibre diameter (FDIAM, the method including the step of detecting, in a sample derived from the ovine, the presence of at least one allele of the CP34 simple sequence repeat (SSR) marker, or at least one allele of a marker in linkage disequilibrium (LD) with CP34, wherein the presence of the allele is indicative of the altered performance traits in the ovine.
In one embodiment the performance trait is weaning weight (WWT).
Alternatively the performance trait is body weight at 8 months (LW8).
Alternatively the performance trait is body weight at 12 months (LW12).
Alternatively the performance trait is carcass weight (CW).
Alternatively the performance trait is adult ewe weight (EWT).
Alternatively the performance trait is eye muscle width (EMW)
Alternatively the performance trait is eye muscle depth (EMD).
Alternatively the performance trait is eye muscle area (EMA).
Alternatively the performance trait is fat depth (FD).
Alternatively the performance trait is carcass fat weight (FAT).
Alternatively the performance trait is carcass lean muscle weight (LEAN).
Alternatively the performance trait is number of lambs born (NLB).
Alternatively the performance trait is lamb fleece weight (LFW).
Alternatively the performance trait is hogget fleece weight (FW12).
Alternatively the performance trait is ewe (adult) fleece weight (EFW).
Alternatively the performance trait is hogget fibre diameter (FDIAM).
Preferably the ovine is altered for at least two, more preferably at least three, more preferably at least four and most preferably at least five performance traits.
Resistance to gastrointestinal parasitic nematode infection can be assessed by measuring fecal egg count—summer lamb challenge (FEC1), fecal egg count—autumn lamb challenge (FEC2), and adult fecal egg count (AFEC). Preferably the nematode is of the genus: Haemonchus, Nematodirus, Teladorsagia or Trichostrongylus. Preferably the nematode is of the species Haemonchus contortus, Nematodirus spathiger, Nematodirus filicollis, Teladorsagia circumcincta, Trichostrongylus colubriformis or Trichostrongylus vitrinus.
Preferably the marker in LD with CP34 is an SSR marker.
Preferably the SSR in LD with CP34 is selected from the group including but limited to BMS1084327, BMS1082942, BMS1082956, BMS1082961, BMS1083945, BMS1083008, BMS1082252, BMS1082669, BMS1082702, BMS1082722, BMS1082831, BMS1887400, BMS1887404, BMS1784528, BMS1600436, BMS1082043, BMS1082045, BMS1081952, BMS1081760, BMS1081860, BMS30480882, BMS30480889, BMS1081770, BMS1081774, RSAD2—1, BMS1081640, BMS1080704, and BMS1080870 as herein defined.
More preferably the SSR in LD with CP34 is selected from the group consisting of BMS1084327, BMS1082942, BMS1082956, BMS1082961, BMS1083945, BMS1083008, BMS1082252, BMS1082669, BMS1082702, BMS1082722, BMS1082831, BMS1887400, BMS1887404, BMS1784528, BMS1600436, BMS1082043, BMS1082045, BMS1081952, BMS1081760, BMS1081860, BMS30480882, BMS30480889, BMS1081770, BMS1081774, RSAD2—1, BMS1081640, BMS1080704, and BMS1080870 as herein defined.
In one embodiment the method, the allele of CP34 is selected from a group comprising: allele A, allele B, allele C, allele D, allele E, allele F, allele G and allele H as herein defined.
Preferably the allele of CP34 is allele A, G or H. More preferably the allele of CP34 is allele A. These alleles are particularly suitable to be selected for in sheep breeding programs.
Alternatively the allele of CP34 is allele C or E. Alternatively the allele of CP34 is allele E. These alleles are particularly suitable to be selected against in sheep breeding programs.
Preferably the allele of the marker in LD with CP34, is in LD with CP34 at a D′ value of at least 0.1, more preferably at least 0.2, more preferably at least 0.3, more preferably at least 0.4, more preferably at least 0.5.
Preferably the allele of the marker in LD with CP34, is in LD with CP34 at a V2 value of at least 0.05, more preferably at least 0.075, more preferably at least 0.1, more preferably at least 0.2, more preferably at least 0.3, more preferably at least 0.4, more preferably at least 0.5.
Preferably the allele of the marker is in LD with the traits at a D′ value of at least 0.1, more preferably at least 0.2, more preferably at least 0.3, more preferably at least 0.4, more preferably at least 0.5.
Preferably the allele of the marker is in LD with the traits at a V2 value of at least 0.05, more preferably at least 0.075, more preferably at least 0.1, more preferably at least 0.2, more preferably at least 0.3, more preferably at least 0.4, more preferably at least 0.5.
The allele may be detected by any suitable method. Preferably the allele is detected using a polymerase chain reaction (PCR) step. PCR methods are well known to those skilled in the art and are described for example in Mullis et al., Eds. 1994 The Polymerase Chain Reaction, Birkhauser, incorporated herein by reference. Preferably a PCR product is produced by amplifying the marker with primers comprising sequence complimentary to sequence of the ovine genome flanking the marker.
Any suitable primer pair may be used. Preferably the PCR is performed using at least one primer selected from those set forth in Table 2. Preferably the PCR is performed using at least one primer pair selected from those set forth in Table 2.
Preferably the allele is identified by the size of the PCR product amplified. Preferably size is estimated by running the PCR product through a gel. Preferably a size standard is also run in the gel for comparison with the PCR product.
Other methods for detecting the allele are also contemplated, such as but not limited to probe-based methods, which are well known to those skilled in the art as described in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987, incorporated herein by reference.
Beneficially in the method of the invention, the presence of a combination of more than one allele of the CP34 SSR marker, or more than one allele of a marker in linkage disequilibrium (LD) with CP34, may be detected to identify the ovine. Detection of various combinations of alleles of the CP34 SSR and/or alleles of a marker in LD with CP34, commonly known as haplotypes is contemplated.
In a further aspect the invention provides a method for selecting an ovine with at least two altered performance traits, the method comprising selecting an ovine identified by a method of the invention.
In a further aspect the invention provides a method for identifying an ovine with a genotype indicative of at least one altered performance traits selected from the group consisting of: weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), hogget fibre diameter (FDIAM), and resistance to gastrointestinal parasitic nematode infection, the method comprising selecting an ovine identified by a method of the invention.
In a further aspect the invention provides a method for identifying an ovine with a genotype indicative of at least one altered performance traits selected from the group consisting of: weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), and hogget fibre diameter (FDIAM), the method comprising selecting an ovine identified by a method of the invention.
In a further aspect the invention provides an isolated polynucleotide comprising an SSR marker selected from the group consisting of BMS1084327, BMS1082942, BMS1082956, BMS1082961, BMS1083945, BMS1083008, BMS1082252, BMS1082669, BMS1082702, BMS1082722, BMS1082831, BMS1887400, BMS1887404, BMS1784528, BMS1600436, BMS1082043, BMS1082045, BMS1081952, BMS1081760, BMS1081860, BMS30480882, BMS30480889, BMS1081770, BMS1081774, RSAD2—1, BMS1081640, BMS1080704, and BMS1080870 as herein defined.
In a further aspect the invention provides a primer suitable for amplifying a polynucleotide of the invention. Preferably the primer comprises sequence complimentary to sequence of the ovine genome flanking the marker. Preferably the primer comprises flanking sequence from the primers set forth in Table 2. Preferably the primer is selected from those set forth in Table 2.
In a further aspect combinations of the alleles of two or more of the above markers, commonly called a haplotype, could be used.
In this specification where reference has been made to patent specifications, other external documents, or other sources of information, this is generally for the purpose of providing a context for discussing the features of the invention. Unless specifically stated otherwise, reference to such external documents is not to be construed as an admission that such documents, or such sources of information, in any jurisdiction, are prior art, or form part of the common general knowledge in the art.
The term “comprising” as used in this specification means “consisting at least in part of”. When interpreting each statement in this specification that includes the term “comprising”, features other than that or those prefaced by the term may also be present. Related terms such as “comprise” and “comprises” are to be interpreted in the same manner.
The term “polynucleotide(s),” as used herein, means a single or double-stranded deoxyribonucleotide or ribonucleotide polymer of any length but preferably at least 15 nucleotides, and include as non-limiting examples, coding and non-coding sequences of a gene, sense and antisense sequences complements, exons, introns, genomic DNA, cDNA, pre-mRNA, mRNA, rRNA, siRNA, miRNA, tRNA, ribozymes, recombinant polynucleotides, isolated and purified naturally occurring DNA or RNA sequences, synthetic RNA and DNA sequences, nucleic acid probes or primers and fragments.
The term “primer” refers to a short polynucleotide, usually having a free 3′OH group, that is hybridized to a template and used for priming polymerization of a polynucleotide complementary to the target.
The term “probe” refers to a short polynucleotide that is used to detect a polynucleotide sequence, that is complementary to the probe, in a hybridization-based assay.
The abbreviation “SSR” stands for a “simple sequence repeat” and refers to any short sequence, for example, a mono-, di-, tri-, or tetra-nucleotide that is repeated at least once in a particular nucleotide sequence. These sequences are also known in the art as “microsatellites.” A SSR can be represented by the general formula (N1 N2 . . . Ni)n, wherein N represents nucleotides A, T, C or G, i represents the number of the nucleotides in the base repeat, and n represents the number of times the base is repeated in a particular DNA sequence. The base repeat, i.e., N1 N2 . . . Ni, is also referred to herein as an “SSR motif.” For example, (ATC)4, refers to a tri-nucleotide ATC motif that is repeated four times in a particular sequence. In other words, (ATC)4 is a shorthand version of “ATCATCATCATC.”
The term “complement of a SSR motif” refers to a complementary strand of the represented motif. For example, the complement of (ATG) motif is (TAC).
The term “SSR locus” refers to a location on a chromosome of a SSR motif; locus may be occupied by any one of the alleles of the repeated motif “Allele” is one of several alternative forms of the SSR motif occupying a given locus on the chromosome. For example, the (ATC)8 locus refers to the fragment of the chromosome containing this repeat, while (ATC)4 and (ATC)7 repeats represent two different alleles of the (ATC)8 locus. As used herein, the term locus refers to the repeated SSR motif and the flanking 5′ and 3′ non-repeated sequences. SSR loci of the invention are useful as genetic markers, such as for determination of polymorphism.
It will be appreciated by those skilled in the art that an SSR consists of repeats of a certain motif (e.g. ATC), and that different alleles of the SSR locus may have different numbers of repeats [e.g. (ATC)4 or (ATC)7]. Furthermore, the same motif (ATC) may be present, and repeated at a different and unrelated SSR locus. Therefore an SSR locus is defined by the non-repeated sequences flanking the repeated motif Primers complementary to the non-repeated flanking sequences may be used to amplify the repeated region by polymerase chain reaction (PCR). The PCR products may be separated, by methods described herein, to identify individually possessing different alleles of the SSR locus, with different numbers of repeats. Thus the PCR primer sequences (excluding the italicised M13 and PIGtail sequences) in Table 2, and/or sequences complementary to those primer sequences (excluding the italicised M13 and PIGtail sequences), define the SSR markers specified in that table.
“Polymorphism” is a condition in DNA in which the most frequent variant (or allele) has a population frequency which does not exceed 99%.
The term “an SSR in linkage disequilibrium (LD) with CP34” means that the alleles of the SSR are in LD with the CP34 SSR marker.
The term “linkage disequilibrium” or LD as used herein, refers to a derived statistical measure of the strength of the association or co-occurrence of two independent genetic markers. Various statistical methods can be used to summarize linkage disequilibrium (LD) between two markers but in practice only two, termed D′ and V2, are widely used.
“Altered” for any particular performance trait means altered relative to an animal of the same breed that does not possess the specified allele.
“Performance trait” means any trait of commercial significance in sheep breeding. Preferred performance traits include weaning weight (WWT), body weight at 8 months (LW8), body weight at 12 months (LW12), carcass weight (CW), adult ewe weight (EWT), eye muscle width (EMW), eye muscle depth (EMD), eye muscle area (EMA), fat depth (FD), carcass fat weight (FAT), carcass lean muscle weight (LEAN), number of lambs born (NLB), lamb fleece weight (LFW), hogget fleece weight (FW12), ewe (adult) fleece weight (EFW), hogget fibre diameter (FDIAM), and resistance to gastrointestinal parasitic nematode infection.
The applicants have identified several novel SSR markers that are in LD with the CP34 marker. The CP34 marker has previously been reported to be weakly associated with the resistance to parasitic nematode resistance. The applicants have now shown that, surprisingly, the CP34 marker, and several markers in LD with CP34, are strongly associated with several other performance traits in ovine, and strongly associated with parasitic nematode resistance. That is the CP34 marker and the markers in LD with CP34, are themselves in LD with these performance traits.
The invention therefore provides a method for identifying an ovine with a genotype indicative of at least one, and preferably two altered performance traits, the method including the step of detecting, in a sample derived from the ovine, the presence of an allele of the CP34 simple sequence repeat (SSR) marker or an allele of a marker in linkage disequilibrium (LD) with CP34, wherein the presence of the allele is indicative of the altered performance traits in the ovine.
Detecting specific polymorphic markers and/or haplotypes can be accomplished by methods known in the art for detecting sequences at polymorphic sites. For example, standard techniques for genotyping for the presence of single nucleotide polymorphisms (SNPs) and/or SSR markers can be used, such as fluorescence-based techniques (Chen, X. et al., Genome Res. 9(5): 492-98 (1999)), utilizing PCR, LCR, Nested PCR and other techniques for nucleic acid amplification. Specific methodologies available for SNP genotyping include, but are not limited to, TaqMan genotyping assays and SNPIex platforms (Applied Biosystems), mass spectrometry (e.g., MassARRAY system from Sequenom), minisequencing methods, real-time PCR, Bio-Plex system (BioRad), CEQ and SNPstream systems (Beckman), Molecular Inversion Probe array technology (e.g., Affymetrix GeneChip), BeadArray Technologies (e.g., Illumina GoldenGate and Infinium assays) and oligonucleotide ligation assay (OLA—Karim et al., 2000, Animal Genetics 31: 396-399). By these or other methods available to the person skilled in the art, one or more alleles of polymorphic markers, including SSRs, SNPs or other types of polymorphic markers, can be identified.
A number of methods are thus available for analysis of polymorphic markers. Assays for detection of markers fall into several categories, including, but not limited to direct sequencing assays, fragment polymorphism assays, hybridization assays, and computer based data analysis. Protocols and commercially available kits or services for performing multiple variations of these assays are available. In some embodiments, assays are performed in combination or in hybrid (e.g., different reagents or technologies from several assays are combined to yield one assay). The following are non-limiting examples of assays are useful in the present invention.
In some embodiments of the present invention, markers are detected using a direct sequencing technique. In these assays, DNA samples, such as those derived from for example blood, saliva or mouth swab samples, are first isolated from an ovine using any suitable method. In some embodiments, the region of interest is cloned into a suitable vector and amplified by growth in a host cell (e.g., a bacteria). In other embodiments, DNA in the region of interest is amplified using PCR. DNA in the region of interest (e.g., the region containing the marker of interest) is sequenced using any suitable method, including but not limited to manual sequencing using radioactive marker nucleotides, or automated sequencing. The results of the sequencing are displayed using any suitable method. The sequence is examined and the presence or absence of a given polymorphic marker is determined.
In some embodiments of the present invention, polymorphisms are detected using a PCR-based assay. In some embodiments, the PCR assay comprises the use of oligonucleotide primers to amplify a fragment containing the polymorphic marker of interest. Such methods are particularly suitable for detection of alleles of SSR markers. The presence of an additional repeats in such an SSR marker, results in the generation of a longer PCR product which can be detected by gel electrophoresis, and compared to the PCR products from individuals without that allele of the SSR marker.
In other embodiments, the PCR assay comprises the use of an oligonucleotide primer that distinguishes (by hybridisation or non-hybridisation) between an allele containing a specific marker, and alternative alleles. Thus in certain embodiments, if PCR results in a product, then the ovine has the marker, and if no PCR product is produced, the ovine does not have the marker.
In some embodiments of the present invention, presence of the marker is detected using a fragment length polymorphism assay. In a fragment length polymorphism assay, a unique DNA banding pattern based on cleaving the DNA at a series of positions is generated using an enzyme (e.g., a restriction endonuclease). DNA fragments from a sample containing the marker of interest will have a different banding pattern samples that do not contain the marker.
In some embodiments of the present invention, presence of the marker is detected using a restriction fragment length polymorphism assay (RFLP). The region of interest is first isolated using PCR. The PCR products are then cleaved with restriction enzymes known to give a unique length fragment for a given polymorphic marker. The restriction-enzyme digested PCR products may be separated by agarose gel electrophoresis and visualized by ethidium bromide staining. The length of the fragments is compared to molecular weight standards and fragments generated from test and control samples, to identify test samples containing the marker.
In other embodiments, presence of the polymorphic marker is detected using a CLEAVASE fragment length polymorphism assay (CFLP; Third Wave Technologies, Madison, Wis.; and U.S. Pat. No. 5,888,780).
In preferred embodiments of the present invention, presence of a marker is detected by hybridization assay. In a hybridization assay, the presence of absence of a given marker sequence is determined based on the ability of the DNA from the sample to hybridize to a complementary DNA molecule (e.g., a oligonucleotide probe). A variety of hybridization assays using a variety of technologies for hybridization and detection are available. A description of a selection of such assays is provided below.
In some embodiments, hybridization of a probe to the marker sequence of interest is detected directly by visualizing a bound probe (e.g., a Northern or Southern assay; See e.g., Ausabel et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, NY, 1991). In these assays, genomic DNA (Southern) or RNA (Northern) is isolated from a subject. The DNA or RNA is then cleaved with a series of restriction enzymes that cleave infrequently in the genome and not near any of the markers being assayed. The DNA or RNA is then separated (e.g., agarose gel electrophoresis) and transferred to a membrane. A labeled (e.g., by incorporating a radionucleotide) probe or probes specific for the marker sequence being detected is allowed to contact the membrane under a condition of low, medium, or high stringency conditions. Unbound probe is removed and the presence of binding is detected by visualizing the labeled probe.
In some embodiments of the present invention, the presence of the marker is detected using a DNA chip hybridization assay. In this assay, a series of oligonucleotide probes are affixed to a solid support. The oligonucleotide probes are designed to be unique to a given polymorphic maker sequence. The DNA sample of interest is contacted with the DNA “chip” and hybridization is detected.
In some embodiments, the DNA chip assay is a GeneChip (Affymetrix, Santa Clara, Calif.; See e.g., U.S. Pat. No. 6,045,996) assay. In other embodiments, a DNA microchip containing electronically captured probes (Nanogen, San Diego, Calif.) is utilized (See for example U.S. Pat. No. 6,068,818).
In still further embodiments, an array technology based upon the segregation of fluids on a flat surface (chip) by differences in surface tension (ProtoGene, Palo Alto, Calif.) is utilized (See for example U.S. Pat. No. 6,001,311).
In yet other embodiments, a “bead array” is used for the detection of polymorphic marker (Illumina, San Diego, Calif.; See for example PCT Publications WO 99/67641 and WO 00/39587, each of which is herein incorporated by reference).
In some embodiments of the present invention, genomic profiles are generated using a assay that detects hybridization by enzymatic cleavage of specific structures (INVADER assay, Third Wave Technologies; See e.g., U.S. Pat. No. 6,001,567). The INVADER assay detects specific DNA and RNA sequences by using structure-specific enzymes to cleave a complex formed by the hybridization of overlapping oligonucleotide probes.
In some embodiments, hybridization of a bound probe is detected using a TaqMan assay (PE Biosystems, Foster City, Calif.; See e.g., U.S. Pat. No. 5,962,233). The assay is performed during a PCR reaction. The TaqMan assay exploits the 5′-3′ exonuclease activity of the AMPLITAQ GOLD DNA polymerase. A probe, specific for a given allele or mutation, is included in the PCR reaction. The probe consists of an oligonucleotide with a 5′-reporter dye (e.g., a fluorescent dye) and a 3′-quencher dye. During PCR, if the probe is bound to its target, the 5′-3′ nucleolytic activity of the AMPLITAQ GOLD polymerase cleaves the probe between the reporter and the quencher dye. The separation of the reporter dye from the quencher dye results in an increase of fluorescence. The signal accumulates with each cycle of PCR and can be monitored with a fluorimeter.
In still further embodiments, presence of the marker sequence is detected using the SNP-IT primer extension assay (Orchid Biosciences, Princeton, N.J.; See e.g., U.S. Pat. No. 5,952,174).
In some embodiments, a MassARRAY system (Sequenom, San Diego, Calif.) is used to detect presence of the polymorphic marker (See e.g., U.S. Pat. No. 6,043,031.
It will be appreciated that if the marker linked to CP34 is in a protein coding region, presence of the marker may result in an amino acid change in the encoded protein. In such cases, any suitable method for detecting the presence of the characteristic amino acid in a protein or polypeptide may be applied. Typical methods involve the use of antibodies for detection of the protein polymorphism. Methods for producing and using antibodies are well known to those skilled in the art and are described for example in Antibodies, A Laboratory Manual, Harlow A Lane, Eds, Cold Spring Harbour Laboratory, 1998.
The polynucleotides, markers, primers and probes of the invention can be used to derive estimates for the association of each allele of the markers, in a reference population measured, for the traits of interest using a variety of statistical methods such as mixed models. These estimates coupled with a derived economic value for each trait can be used to rank individuals based solely on their genotype at a young age, or a mixture of their genotype estimates and selected subsets of the traits of interest. This approach is useful to rank individuals for their breeding worth.
Alternatively, the genotype information that can be generated using the polynucleotides, markers, primers and probes of the invention, may be considered as a fixed or random effect in an animal model Best Linear Unbiased Prediction (BLUP) or via mixed models (Mrode, 1996) where animals have parentage and various combinations of traits recorded. This approach would be useful for young animals that have not been recorded for the traits of primary interest, to rank individuals on their likely future performance.
The above approaches are not limited to detecting only CP34, or markers in LD with CP34, but also to situations where CP34, or markers in LD with CP34, which are included as part of a larger marker set from several additional markers to many thousands of markers, and the combined estimates of all markers are used to estimate the genetic worth of an individual or its likely individual performance.
The invention will now be illustrated with reference to the following non-limiting examples.
The only marker that had been previously described in the ovine chromosome 3p region in this work is CP34 (Ede et al. 1995). Beh et al., (2002) reported a small QTL present in one sire for resistance to parasitic nematode infection as assessed by fecal egg count (FEC) in the autumn at the 5% chromosome wide significance threshold located near CP34. Crawford et al., (2006) reported a similar result for 1 sire for FEC1, and another sire for abomasal adult Ostertagia numbers, but in contrast Davies et al., (2006) found no evidence of segregation in this region for FEC or antibody traits. To the applicant's knowledge no other trait associations have been reported in sheep in this region. In addition, work reported to date has been by linkage mapping which, at the marker density and animal numbers used, only defines a general region of perhaps 40 cM (˜40 Mbp) in length in which the QTL may reside, and has little or no predictive value in industry because no marker allele associations have been determined that can be used on independent groups of animals. In order to create such associations typically marker density has to be higher than several markers per cM and numerous pedigrees need to be tested.
The WormFEC resource (McEwan et al 2006) consists of 987 primarily male progeny tested sheep sourced from New Zealand recorded flocks and consisting of individuals of Coopworth, Romney, Perendale, Texel, Composite and other minor breeds. A subset of 847 sires, derived from 111 flocks, were used for this analysis. There were 126,004 progeny weaned from these sires with a median progeny group size of 117 (range 1-2017).
They consisted of:
The Romney host resistance parasite selection line was initially created in 1979 and has been recently described by Morris et al (2000). Over the selection period these animals have diverged in fecal egg count (FEC) after a standard challenge by 40 fold. They currently consist of 3 FEC selection lines a low, a high and a control line. For the current work DNA samples were collected from 50 susceptible (high) line, 50 resistant (low) line and 53 control line animals in 1997 and were genotyped for the markers described below.
The Perendale host resistance parasite selection line was initially created in 1985 and has recently been described by Morris et al (2005). Over the selection period these animals have diverged in FEC after a grazing challenge by 4.9 fold. They currently consist of 2 selection lines a low and high FEC line respectively. For the current work DNA samples were collected from 107 susceptible line animals and 128 resistant line animals in 1998 and were genotyped for the markers described below.
Performance recording and estimated breeding values (EBVs) are produced in New Zealand by Sheep Improvement Limited (SIL) a trading entity of Meat & Wool New Zealand. The underlying methodology and system used has been described in a number of papers (Geenty, 2000; Amer, 2000; Newman et al., 2000).
The traits recorded and described in this study and further background to the SIL system is at http://www.sil.co.nz/Technical%20Bullentins/Technical%20Notes/ and trait descriptions from this site are attached to this document including: Young and Walker, (2007a) describing trait measurements and breeding values, Young and Walker (2007b) describing SIL indices and sub indices and economic weights, McEwan (2006) and Young (2006) describing recording for host resistance selection in sheep, and Young (2005) describing New Zealand wide across flock and breed genetic evaluations to produce SIL Advanced Central Evaluation (ACE) EBVs and indices. In turn the SIL ACE EBVs are underpinned by the nationwide central progeny test (CPT) project recently described by McLean et al. (2006).
In brief the genetic evaluations use an across flock and breed multi-trait animal model BLUP analysis for all traits, except for NLB and host resistance traits which used across flock and breed multi-trait repeated measures animal model BLUP. Typically these analyses are done in goal trait group combinations. The outputs of these evaluations for individuals are breeding values corrected for flock, year and breed effects. These breeding values are “shrunken” based on the accuracy of the EBV, primarily in this case being affected by the number of measured progeny. This effect is particularly pronounced for situations where only low progeny numbers have been measured.
This analysis used the June 2007 across flock and breed SIL ACE evaluation EBVs for the sires. Values for the individual sires were downloaded from a direct database query as these estimates are not publicly available except to authorized personnel. Details of the overall evaluation description (Newman, 2007) are attached as an appendix. Only the subset of animals that were genetically linked, as described by the SIL ACE criteria were included in this work.
Novel ovine SSR markers were identified and validated by the following process. The region of interest from the orthologous section of the bovine genome was processed for suitable dinucleotide SSRs with more than 9 repeats using the program Sputnik and then primers designed using Primer 3 using an independent, but analogous approach to that described by Robinson et al. (2004). The bovine genome assembly used was version 3.1 and is available as ftp://ftp.hgsc.bcm.tmc.edu/pub/data/Btaurus/fasta/ as the Btau20040927-freeze and distances are reported on that basis. The primers had a M13 antisense and PIGtail sequence added to them and were then used to PCR amplify DNA samples in conjunction with a fluorescent M13 oligo as described by Boutin-Ganache et al. (2001) and Saito et al. (2005). The size of the resulting products were then measured using standard manufacturer procedures and protocols on a ABI 3730 sequencer. The primers were first screened over a panel of cattle, sheep and deer samples. Markers that passed the initial screen (i.e. were polymorphic in sheep and of reasonable quality see tables 1 and 2 for a list of primers and markers finally selected) were subsequently genotyped across the International Mapping Flock (IMF; Maddox et al. 2001). This allowed the markers to be mapped confirming their location to the region of interest and their suitability for genotyping. Distances and order reported here used markers available in the latest publicly available map v4.7 (http://rubens.its.unimelb.edu.au/˜jillm/jill.htm) and the genotypes obtained from the present study. CRI-Map was used to do the linkage mapping using the process described by Maddox et al. (2001). Markers that mapped to the appropriate location were then genotyped by the same method for the WormFEC sires and Parasite Selection Line (PSL) resources. Results from all the genotyping from the 3730 were measured as raw allele lengths using ABI GeneMapper 4.0 and reported as fragment lengths in base pair units relative to internal standards. These results were binned into alleles on the basis of a cluster analysis based on Ward's distance using SAS (http://www.sas.com/) to define mean allele lengths and reporting variability. The allele names and bins for each marker are shown in Table 3.
Because of uncertainty of marker order and the quality of the bovine assembly a number of markers were mapped using an ovine 5000 RAD ovine Radiation hybrid panel (Eng et al. 2004). This can more accurately position closely spaced markers and acts as a check on the bovine genome assembly. Panel cell lines were genotyped in duplicate using the primers described above for the RH panel and visualised by running on 2% Agarose gels as described by Band et al. (2000). Presence or absence of markers in each cell line was scored and the resulting data mapped using RHmapper (Slonim et al. 1997).
Linkage disequilibrium measures were calculated using the program LDMAX part of the GOLD package (Abecasis and Cookson, 2000) and the R statistical package (http://www.r-project.org/). The measures D′, the square of Cramer's V and the significance expressed as a probability of the association were calculated for all combinations of markers and plotted using a combination of the graphics facilities in SAS (http://www.sas.com/) and R. Each measure is useful for certain purposes and in this case we used a threshold of D′>0.3 or V2>0.1 and p<1×E-10 with CP34 to delimit the boundaries of the region containing significant linkage disequilibrium. Because of the nature of linkage disequilibrium between individual markers, not all markers within this region may be in significant LD.
The genotype results for each marker, for the two Parasite selection lines were tested for differences in allele frequency using the computer program Peddrift (Dodds and McEwan, 1997). This program estimates the likely distribution of allele frequencies between selection lines caused by random founder effects and genetic drift by simulation using the actual recorded pedigree structure. Significant divergence from the expected distribution is evidence of selection on a variant, near the genotyped marker, affecting the trait under selection: in this case fecal egg counts after a field challenge of gastrointestinal internal nematode parasites.
Breeding values for the genotyped individuals were analyzed in the following manner. The EBVs for traits were adjusted for breed (even though EBVs had already been adjusted for this effect) with each marker allele fitted independently as a covariate (0=none, 1=one allele, 2=2 alleles) in a least squares model after the method of Fan et al. (2006). Fitting breed reduces the bias of admixture i.e. when the marker is associated with breed and true differences exist between breeds.
The markers and their map positions are tabulated in Table 1. The BTA version 3.1 bovine genome assembly position given is 300 bp upstream of the actual dinucleotide repeat motif. The whole region is defined as including: the 300 bp upstream fragment, the dinucleotide repeat motif itself, and 300 bp downstream sequence. It was from this segment of DNA that the primers in Table 2 were designed. The markers are ordered in declining assembly order on bovine BTA11. The actual alleles observed and the length of their corresponding PCR products as measured by the ABI 3730 sequencer is tabulated in Table 3.
The IMF map positions are listed in centiMorgans (cM) defined from using a framework map starting from BMS1350 marker as 0 (not shown) and inserting the new BMS markers in their best location. Note some markers are unmapped and for some markers the order is not consistent with the bovine assembly order e.g. BMS1082722. The reasons for this are many. First the IMF resource can only reliably order markers greater than 5 cM apart. There also exists the possibility the bovine assembly is incorrect or that a genotyping error has occurred. Although in the latter case all apparent double recombinants have had their genotypes checked and where necessary eliminated. In other cases linkage mapping ordered markers that are not positioned in the current bovine genome assembly but can be ordered by linkage mapping.
The radiation hybrid map orders the markers in centiRays (cR) starting from zero, for BMS1082956. In theory this mapping technique should be more sensitive for ordering markers than linkage mapping in the IMF and appears to be able to discriminate and order markers where linkage mapping could not. However there were some apparent differences in order between the ovine and bovine genomes and it is not clear whether these are real or minor assembly and mapping discrepancies.
Table 4 lists the markers and the significance probability for an allele frequency difference between selection lines (resistant vs susceptible) after adjusting for founder effects and genetic drift. The assumption is that the allele frequencies have changed due to selection effects on a nearby locus in linkage disequilibrium with the measured marker. Four markers showed significant differences in allele frequency between selection lines in the Perendale flocks: BMS1784528, BMS1600436 BMS1081760 and BMS30480889. As shown later, all of these markers are located within the boundary defined by BMS1887400 to BMS1081770. In contrast no association was observed for any marker in the Romney selection lines.
WormFEC Sire Resource Association with Production and Host Resistance Traits
Marker Linkage Disequilibrium with CP34
In the current context, markers with D′ values with CP34 greater than 0.3 or V2 greater than 0.1 are considered to define the boundaries of useful LD, if they also have significant linkage disequilibrium with CP34. Based on the results presented in
The utility of the predictive ability is provided in the following example, but is not restricted solely to this approach. Selected CP34 trait estimated breeding value allele associations and their economic values (Young and Walker, 2007b) have been combined into an economic index in Table 5. The individual allele estimates are additive so an animal that has a genotype of AA will have, in the absence of other information, a predicted value of 66+66=132 cents versus an animal with a EE genotype of −81+−81=−162 cents. Used in this way individual animals can be ranked for breeding purposes. When estimated breeding values and their accuracies derived from trait measurements have been calculated, these marker based estimates can be blended to create an overall index using selection index theory and the relative accuracies of the two predictions. A further alternative is to fit the CP34 allele as either a fixed or random effect within the standard animal model BLUP evaluation.
Table 1 below, shows the map positions of the SSR markers identified.
Table 2, below shows the primer sequences used to amplify the SSR markers identified.
TGTAAAACGACGGCCAGTTTCCTTCCCCAGACAGTCAC
GTTTCTTTGTGTATTTGGGAGGGGTGT
TGTAAAACGACGGCCAGTCATGTGTGTCAACATCAATCCA
GTTTCTTTCCATCCAGACAATACAGCAA
TGTAAAACGACGGCCAGTACTGGTCAAGCAGACCATCT
GTTTCTTCCCATGTTCAGGCGTTATCT
TGTAAAACGACGGCCAGTGGCAGGTGAAAATACTTGCTG
GTTTCTTTGATGAGGCAGCTCATTGAC
TGTAAAACGACGGCCAGTCAAGATGAATGATCCCATGC
GTTTCTTTCAGCCCAGGAGTTAAACATT
TGTAAAACGACGGCCAGTGTCCTCTCAGATGGCAGAGC
GTTTCTTTGGAGACATTAGTGTGTGCTCAT
TGTAAAACGACGGCCAGTATGGTCACCACTGCACTGAC
GTTTCTTAAGGCAGGCAAGTATTTGGA
TGTAAAACGACGGCCAGTGGGGAGTATGCAATTCAGGA
GTTTCTTTACAGGCCAAAGGGAATTTG
TGTAAAACGACGGCCAGTGCGTGTGGATAGCGTGAGTA
GTTTCTTTTGAGACCCCAGTCCAGAAG
TGTAAAACGACGGCCAGTGGATATCAGGGAGTGGGATG
GTTTCTTTCCCCTGATGTTAGCAGCTT
TGTAAAACGACGGCCAGTCTGCTCCATATCACGACAGC
GTTTCTTTGGTCTTGGTGGTCTGTTTG
TGTAAAACGACGGCCAGTGAAAGGTGGTGGTCTCCTTG
GTTTCTTTGAGAGAAGACCTGGGGAGA
TGTAAAACGACGGCCAGTGGGTCGTAGAGAGTTGAACACA
GTTTCTTGCTGTCTCTTTCACTCCAAAATC
TGTAAAACGACGGCCAGTCTCTGAGCCATATGGGAAGC
GTTTCTTTTCCACAGTGTTTCAGATGTATAGC
TGTAAAACGACGGCCAGTTCAGGAAGTGGTAGGCAGAGA
GTTTCTTTACCACTGAGCCACCAGAGA
TGTAAAACGACGGCCAGTGCTGAACAATGTGATATGTTCAGG
GTTTCTTGGGACAATACTGTCTTAGATGCTGC
TGTAAAACGACGGCCAGTGGGAAACCCACACAACAGAG
GTTTCTTGGAGAATGGCATGGACAGAG
TGTAAAACGACGGCCAGTTTCTTCAGCACTCAGCCTTCT
GTTTCTTTCACTGCTGGATATGGTGGA
TGTAAAACGACGGCCAGTTTGCAAGGTTAGACTTTGGTGA
GTTTCTTTGTTCCCAGACCAGTATTTCAG
TGTAAAACGACGGCCAGTCTCAAAACGACAAAGCCACA
GTTTCTTAGGACCGGCTGTATAGCACA
TGTAAAACGACGGCCAGTGCAGGCTGGTTCTTTACCAC
GTTTCTTTTGTGGTAGGTTTCACCAAGG
TGTAAAACGACGGCCAGTGCAAATGGCCAAATGTCATC
GTTTCTTCATGCACCCCAATGTTCATA
TGTAAAACGACGGCCAGTTCTTGATCACTGAGCCACCA
GTTTCTTTCAGCAAAGAGGCTGGTACA
TGTAAAACGACGGCCAGTAAAGCGTTGCTATCTGTCACAA
GTTTCTTGCTGTCCTGAGCACATAGGG
TGTAAAACGACGGCCAGTTGGAATCCCTTGGACAGAAC
GTTTCTTCCCTGACTCCTAATGCCATC
TGTAAAACGACGGCCAGTTAGCAAACATGTGGGTGGTC
GTTTCTTTTTGCAGAGCCGTATTTGTG
TGTAAAACGACGGCCAGTTTTTAGGTGTACAGCAGAGTGATG
GTTTCTTGGAGGCTTGGTGTGCTACAG
TGTAAAACGACGGCCAGTACTCACCCTGAGTGCTCCAC
GTTTCTTCTCCGGGGTTTCTCTTCTCT
TGTAAAACGACGGCCAGTAATGGGGCAGCAAAGAGTT
GTTTCTTCTCCGGGGTTTCTCTTCTCT
Table 3, below shows a summary of the allele information for the SSR markers identified.
Table 4, below shows Peddrift results by selection line expressed as −log 10 (significance probability).
Allele effects by marker and significance are shown in
Table 5 below shows allele estimates for CP34 for the BV traits analyzed for the Romney, Coopworth, Perendale, Texel and Composite analysis in their standard SIL trait units coupled with combined standard SIL economic estimates in cents for: growth adjusted for meat value (Gm), Meat value adjusted for growth (Mg), wool, Number of lambs born/ewe wintered (NLB), and combined host resistance (FEC) plus their additive overall index sum. Significance values for each trait are listed at the bottom (* P<0.05, ** P<0.01, *** P<0.001)
The markers and associations described are useful for their predictive ability for a number of traits including host resistance. The industry utility of the invention is that young unmeasured progeny can be genotyped and their breeding worth predicted.
Saito, D. S., Saitoh, T., Nishium, I. 2005. Isolation and characterization of microsatellite markers in Ijima's leaf warbler, Phylloscopus ijimae (Ayes: Sylviidae). Molecular Ecology Notes 5:666-668
The above Examples illustrate practice of the invention. It will be appreciated by those skilled in the art that numerous variations and modifications may be made without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
556506 | Jul 2007 | NZ | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/NZ2008/000173 | 7/11/2008 | WO | 00 | 10/18/2010 |