This application includes a sequence listing in XML format titled “2023-01-17_900905.00047_WIPO Sequence listing.xml”, which is 33,063 bytes in size and was created on Jan. 17, 2023. The sequence listing is electronically submitted with this application via Patent Center and is incorporated herein by reference in its entirety.
The present disclosure relates to the fields of animal husbandry, animal breeding, and livestock management by using genetic analysis to identify haplotypes and mutations linked to undesirable traits.
Selective breeding (also called artificial selection) is the process by which humans use animal and plant breeding to develop particular phenotypic traits (characteristics) by choosing which specific animals or plants, males and females, will sexually reproduce to produce offspring. In animal breeding, techniques such inbreeding, linebreeding, and outcrossing are utilized. The deliberate exploitation of selective breeding to produce desired traits has become very common in agriculture and experimental biology.
Dairy cows are significant investments for dairy farmers, and enormous efforts, such as systematic animal breeding programs and artificial insemination, have been and continue to be invested in ensuring that the animals have high and sustained productivity, that the milk produced is of high quality or has desired composition, and that the cows are healthy and fertile.
Traditional breeding techniques involve the studying of sire progenies, and evaluating their milk production, health and fertility ratings (transmitting abilities) to guide further breeding. This standard technique requires years to evaluate the true genetic value by progeny testing each bull. Many cows must be bred and give birth to offspring. The females must be raised, bred, allowed to give birth and finally milked for a length of time to measure their phenotypic traits. Bulls with high performing daughters are then returned to active service and are at least five years old. Those with poor performing daughters are culled with substantial financial loss because there is no opportunity to sell semen and recover the investment of raising the bull.
Furthermore, selection based purely on phenotypic characteristics does not efficiently take into account genetic variability caused by complex gene action and interactions, and the effect of environmental and developmental variants. There is thus a need for a method of genetically evaluating cattle to enable breeders to more accurately select animals at both the phenotypic and the genetic level early in life.
Marker-assisted selection can lower the high cost and reduce the extended time commitment of progeny testing used to improve a herd, since young animals could be evaluated immediately after birth or even prior to birth for the presence/absence of the key markers, and young animals, such as bulls, that are determined by genetic testing to have undesirable markers would never be progeny tested and may never be bred. Therefore, there is a need in the art for faster, more accurate genetic testing of domestic animals such as dairy cattle.
Disclosed herein are methods, systems and compositions for identifying domestic livestock, such as cattle (e.g., Bos taurus, Bos indicus), having genetic traits associated with a motor impairment phenotype. In some embodiments, the method comprises: (a) obtaining a sample from a domestic bovine, wherein the sample comprises chromosomal DNA; (b) generating genetic data for the chromosomal DNA at position 78732954 to 80748266 bp, or a fragment thereof, on chromosome 16; (c) identifying a motor impairment (MI) haplotype in the genetic data, wherein the MI haplotype comprises: (i) one or more of the alleles in the shaded portion of
In some embodiments, the genetic data comprises the DNA sequence of at least one chromosome 16 of the bovine at position 78732954 to 80748266 bp, or a fragment thereof. In some embodiments, the genetic data comprises the DNA sequence of both chromosomes 16 of the bovine at position 78732954 to 80748266 bp, or fragments thereof. In some embodiments, the MI haplotype comprises one or more SNPs at one or more positions 78732954 to 80748266 bp, or a fragment thereof, of bovine chromosome 16 associated with an MI phenotype, and genetic variants in linkage disequilibrium (LD) with those SNPs.
In some embodiments, genetic data comprises RNA derived from chromosome 16 of a bovine at position 78732954 to 80748266 bp, or a fragment thereof. In some embodiments, the RNA is derived from the calcium voltage-gated channel subunit alphal S gene (CACNA1S).
In some embodiments, generating genetic data comprises sequencing the chromosomal DNA, such as by next generation sequencing. In some embodiments, generating genetic data comprises an amplification step, such as a polymerase chain reaction. In some embodiments, generating genetic data comprises a DNA microarray, beadchip, or oligonucleotide binding.
Also disclosed herein are methods of selective breeding of domestic animals, such as bovine. In some embodiments, the method comprising: obtaining a genomic DNA sample from a bull, detecting a homozygous negative MI haplotype in the sample, and using semen from the bull for fertilizing a female animal. In some embodiments, the female animal is in vitro fertilized. In some embodiments, the female is also homozygous negative for the MI haplotype. In some embodiments, the bull is a Holstein, Holstein crossbred, or bull of another breed with Holstein ancestry, and wherein the female animal is a Holstein, Holstein crossbred, or cow or heifer of another breed with Holstein ancestry.
Also disclosed herein are methods of selective breeding, the methods comprising: obtaining a genomic DNA sample from a female bovine, detecting a homozygous negative MI haplotype in the sample, and using oocytes from the female for in-vitro embryo production. In some embodiments, the female bovine is a Holstein, Holstein crossbred, or cow or heifer of another breed with Holstein ancestry.
Also disclosed herein are methods of corrective mating. In some embodiments, the method comprises: obtaining a nucleic acid sample (e.g., a genomic DNA sample or an RNA sample) from a bull and a cow, ascertaining the MI haplotype of each, and mating a homozygous negative female (cow) for the MI haplotype to a bull that is positive for the MI haplotype. In some embodiments, the method comprises obtaining a nucleic acid sample from a bull and a cow and ascertaining the MI haplotype and mating a homozygous negative bull for the MI haplotype to a cow that is positive for the MI haplotype.
Also disclosed herein are compositions, kits, and systems. In some embodiments, the compositions, kits, and systems comprise at least one nucleic acid molecule, wherein the nucleic acid molecule comprises SEQ ID NO: 1, SEQ ID NO: 2, or a fragment of SEQ ID NO: 1 or SEQ ID NO: 2, wherein the fragment includes the MI SNP. In some embodiments, the nucleic acid molecule is linked/attached to a solid support. In some embodiments, the compositions, kits, and systems further comprise an enzyme having polymerase activity, such as Taq polymerase. In some embodiments, the nucleic acid molecule comprises one or more of a detectable marker, a linker (e.g., to attach to a solid support), and/or a non-natural nucleotide base.
These and other embodiments, aspects, advantages, and features of the present invention will be set forth in part in the description which follows and will become apparent to those skilled in the art by reference to the following description of the invention and referenced drawings or by practice of the invention. The accompanying drawings illustrate one or more implementations, and these implementations do not necessarily represent the full scope of the invention.
The present invention will be better understood and features, aspects, and advantages other than those set forth above will become apparent when consideration is given to the following detailed description thereof. Such detailed description refers to the following drawings, where:
The present invention is described herein using several definitions, as set forth below and throughout the application. Additionally, information describing certain attributes and embodiments are found in the attached Appendix, which is incorporated herein by reference in its entirety for all purposes.
Unless otherwise specified or indicated by context, the terms “a”, “an”, and “the” mean “one or more.” Thus, the references “a”, “an”, and “the” are generally inclusive of the plurals of the respective terms. For example, reference to “a SNP”, “a method”, or “a trait” includes a plurality of such “SNPs”, “methods”, or “traits.” Reference herein, for example to “an association” includes a plurality of such associations, whereas reference to “chromosomes” includes a single chromosome where such’ interpretation is not precluded from the context. Similarly, the words “comprise”, “comprises”, and “comprising” are to be interpreted inclusively rather than exclusively. Likewise the terms “include”, “including” and “or” should all be construed to be inclusive, unless such a construction is clearly prohibited from the context. Where used herein the term “examples,” particularly when followed by a listing of terms is merely exemplary and illustrative, and should not be deemed to be exclusive or comprehensive.
As used herein, “about,” “approximately,” “substantially,” and “significantly” will be understood by persons of ordinary skill in the art and will vary to some extent on the context in which they are used. If there are uses of these terms which are not clear to persons of ordinary skill in the art given the context in which they are used, “about” and “approximately” will mean plus or minus ≤10% of the particular term and “substantially” and “significantly” will mean plus or minus >10% of the particular term.
As used throughout, ranges herein are stated in shorthand, so as to avoid having to set out at length and describe each and every value within the range. Any appropriate value within the range can be selected, where appropriate, as the upper value, lower value, or the terminus of the range. For example, a range of 0.1-1.0 represents the terminal values or 0.1 and 1.0, as well as the intermediate values of 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and all intermediate ranges encompassed within 0.1-1.0, such as 0.2-0.5, 0.2-0.8, 0.7-1.0, and so on.
As used herein, the terms “bovine” and “cattle” generally refer to members of the genus Bos, and include, without limitation, Bos taurus and Bos indicus (also termed Bos taurus indicus). The disclosed compositions, systems, methods, kits, and platforms are not intended to be limited to members of the Bos genus, and may be used with other members of the family Bovidae. By way of example, but not by way of limitation, in some embodiments, the disclosed compositions, systems, methods, kits, and platforms are used to detect an MI haplotype in a domestic bovine, such as a Holstein, Holstein crossbreed, or animal of another breed with Holstein ancestry. In the cattle industry, most breeds have a “grade-up” program, resulting, for example, in animals having Holstein genes present in a different breed and vice-versa.
The term “genotype” as used herein refers to the identity of the alleles present in an individual or a sample. In the context of the present invention, a genotype preferably refers to the description of the polymorphic alleles present in an individual or a sample. The term “genotyping” a sample or an individual for a polymorphic marker refers to determining the specific allele or the specific nucleotide carried by an individual at a polymorphic site.
As used herein, “locus”, or plural “loci”, refers to a physical site or location of a specific gene or marker on a chromosome. An exemplary locus comprises position 78732954 to 80748266bp on chromosome 16 of a bovine, such as a Holstein. In some embodiments a locus is a single base pair (bp, or BP), such as, for example, bp 79613592 on chromosome 16 of a Holstein.
As used herein, “linkage disequilibrium” (or “LD”) refers to the allelic association between specific alleles at two or more neighboring loci in the genome, e.g., within a population. LD can be determined for a single marker or locus, or multiple markers. Stated another way, LD is the correlation between nearby variant such that the alleles at neighboring polymorphisms (observed on the same chromosome) are associated within a population more often than if they were unlinked.
As used herein, “allele” refers to one or more alternative forms of a particular nucleic acid sequence, where the differences may include, without limitation, one or more single nucleotide polymorphisms (SNPs), an insertion, inversion, or deletion. The sequence may or may not be within a gene, and may be within a coding region or noncoding region of a gene, and may, for example, be within a promoter or regulatory regions, an exon, or an intron of a particular gene. By way of example, the sequence of the positive DNA strand from bp 79613564 to 79613623 of Holstein, with the alleles (in this case a SNP) indicated as [C] or [T] is:
GGGGCCCTCTGGCCCCTCACCTGCATGC[C/T]GATGACCGCGTAGAT
The term “haplotype” refers to a combination of alleles or DNA markers on one chromosome. At the DNA level, haplotype refers to a sequence of nucleotides found at two or more polymorphic sites in a locus on a single chromosome. There are multiple haplotypes over a given chromosome region within a breeding population. As used herein, haplotype includes a full-haplotype and/or a sub-haplotype. Full-haplotype is the 5′ to 3′ sequence of nucleotides found at all polymorphic sites examined in a locus on a single chromosome from a single individual, while sub-haplotype refers to the 5′ to 3′ sequence of nucleotides seen at a subset of the polymorphic sites examined in a locus on a single chromosome from a single individual. Relatedly, the term “haplotype pair” refers to the two haplotypes found for a locus in a single individual. “Haplotyping” is a term for a process for determining one or more haplotypes in an individual and includes use of family pedigrees, molecular techniques and/or statistical inference. By way of example but not by way of limitation, the MI haplotype is shown by the shaded region of
“Quantitative trait locus,” (or “QTL”), as used herein is a genomic sequence that is associated with a particular phenotypic trait. Multiple QTL may be identified for a particular trait, and they are frequently found on different chromosomes. The number of QTLs that associate significantly with a particular phenotypic trait may provide an indication of the genetic architecture of a trait, the number of genes that affect the trait, or the extent of the effect of one or more of those genes. One or more QTL that significantly associates with a trait may be candidate genes underlying that trait, which can be sequenced and identified. The significance of the degree of association of a given QTL with a particular trait can be assessed statistically, e.g. through QTL mapping of the alleles that occur in a locus and the phenotypes that they produce. Statistical analysis is preferred to demonstrate whether an observed association with a trait is significant. The presence of a QTL, and its location identify a particular region of the genome as potentially containing a gene that is associated, directly (e.g., structurally) or indirectly (e.g., regulatory) with the trait being analyzed. The probability of association can be plotted for various markers associated with the trait spaced across a chromosome, or throughout the genome. A QTL may be present in a haplotype that is associated with a phenotype.
A “polynucleotide” includes single-stranded or a multi-stranded nucleic acid molecules comprising two or more sequential bases, including any single strand or parallel and anti-parallel strands of a multi-stranded nucleic acid. Polynucleotide may be of any length, and thus, include very large nucleic acids, as well as short ones, such as oligonucleotides.
The term “oligonucleotide” typically refers to short polynucleotides, generally no greater than about 50 nucleotides. It will be understood that if a nucleotide sequence is denoted represented by a DNA sequence (i.e., A, T, G, C), the corresponding RNA sequence (i.e., A, U, G, C, wherein “U” replaces “T”) is also included unless otherwise specified, or unless context and common sense make clear the distinction.
As used herein, the term “motor impairment,” used interchangeably with the term “recumbent” or “recumbency” refers to a condition in which an animal is unable to coordinate, control, or respond normally with respect to voluntary muscle movement. By way of example, motor impairment may include the following traits noted, for example, in a calf: the inability to stand; the inability to stand without assistance; the inability to stand for the first 24 hours after birth; losing the ability to stand; the inability to stand, followed by recovery of the ability to stand, followed by poor growth and health. Motor impairment as used herein is not accompanied by or due to abnormal nutrition, injury, or metabolite levels.
As used herein, the term “motor impairment haplotype” (MI haplotype) refers to the haplotype of an animal, such as a Holstein bovine, at position 78732954 to 80748266 bp on chromosome 16. A positive MI haplotype is indicative of an animal that exhibits, or is likely to exhibit motor impairment as defined above, or that may pass motor impairments to its offspring. A negative MI haplotype is indicative of an animal that is unlikely to exhibit motor impairment, and that is unlikely to pass motor impairments to its offspring. By way of example, a positive MI haplotype is represented by
While the MI phenotype is associated with the MI haplotype, it is also associated with a SNP [C/T]: The sequence of the positive DNA strand from bp 79613564 to 79613623 with the SNP indicated as [C] or [T] (79613592 bp) is shown below:
GGGGCCCTCTGGCCCCTCACCTGCATGC[C/T]GATGACCGCGTAGATG
Underlined sequence represents intron.
The Forward Strand, Reverse Strand, Reverse Complement and amino acid sequence of this region is shown in
The MI SNP is present in the calcium voltage-gated channel subunit alphal S gene (CACNA1S) of Holstein cattle. The missense mutation alters a GGC codon to AGC which facilitates a glycine to serine amino acid substitution. The complete protein sequence of CACNA1S is shown below.
The amino acid sequence of CACNA1S (XP_024832342.1 CACNA1S [organism=Bos taurus] [GeneID=100337204] [isoform=X1]) is shown below (SEQ ID NO: 11 and 12):
The nucleic acid sequence encoding the CACNA1S amino acid sequences (XM_024976574.1 PREDICTED: Bos taurus calcium voltage-gated channel subunit alpha1 S (CACNA1S), transcript variant X1, mRNA) is shown below (SEQ ID NO: 13 and 14):
The exon including the MI SNP and both flanking introns (CACNA1S/XM_024976575.1/XP_024832343.1 79612962-79614080) are shown in the nucleic acid sequence below (SEQ ID NO: 15 and 16). The underlined sequence corresponds to intron:
CTGCAGGCCACGAAAGGCTGGACTGTGGACCTGGACAGGTCTCTCACCG
GCCCCAGCTCCCCCGACACCCTCCACCTCAGGTCACACAAACCGGGCTA
CAGCTGACCCCTGAGCAACGCCAGGGTATAGGTCTACAGTCAGCTCCCC
GCCCTGTATCCACGGATTCAACCAACCATGAATCCCGTGGAGCCCTAGT
ACAGTTCCCTGAAAAATTCACGGATACATGGACACCCGAAGTTCAAATC
TGCCTGTTCAAGGCTCACTTATGTTTCCAAGGTTGGGTGAAATTTTTCC
CCTCAAGCCAAATTAATCTGGGAGGGGGACACTGTTCCTAACCAAGAGC
TGACTTAGGGTCTCAATGTAGAGAGCAGAGTGCTTCAGGAGAGGGTAGC
TGAGACCACCAGAGCCTCCAACAGGGGCTTTGGAGACCAGTTCCGCCCT
TGTCCATCTGCCCACAAGGTTCCCCTGGGCTGCCTCCTCCTGACTGATG
CTGGGCCGGCTGGGAACCACCTCCTCCTCTAGGGCTGTGGCCAAGCCCC
GGGCTGCCAGGATGCAGGAGGCAGGCTGGTGAGAGGCCAGCCTGCAGGC
CGGCCGGCCTCCCGGGGGCCCTCTGGCCCCTCACCTGCATGC[C/T]GA
CGCCCGGTACGGGGCCCTTCCTTAGCTCACCGGGGACCCAAGGCCCATG
GGCCACACCCAGGGTGGCCCTGACGTCCCCGTAACCCATGCGGATACAG
TCTCGTGGGTGGGCGAGGGTTCCCTGAGTGGGTGGGCTTCCGGGGTGGA
CAGAGTCCCATGGGTGGGGGGGGTCACATGGGTAGGCGGGGTCCCGGGG
GGGGGTCCCGTGGGTGGGTGTGGTCCCCGATGGGCGGGGGTCCTGTGGG
CGTGGTCACAGGGGGGCGTGGTCCCGCGGTGGGCGGAGTCCCGTGGGTG
GGCGGGGTCCCGGGGTGGGTGCAAGGAGAGCGGTCCCAGGATGAGTGGG
GATCCCGGGGTAGGCGGGGCCCTGGGTGGGCGGGGCTGCGCAC
The haplotype shown in
Disclosed herein are methods and composition for identifying a bovine, including a young or adult bovine animal, an embryo, a semen sample, an egg, a fertilized egg, a zygote, fetus, or other cell or tissue sample therefrom, to determine whether said bovine possesses a motor impairment (MI) haplotype. Determination of the MI haplotype, located at position 78732954 to 80748266 bp on chromosome 16 of Holstein cattle, provides a quick, accurate, and effective means to identify whether an animal should be used for breeding stock, whether an animal is likely to develop motor impairment, and whether further progeny testing should be performed.
DNA markers have several advantages over phenotype analysis; segregation is easy to measure and is unambiguous, and DNA markers are co-dominant, i.e., heterozygous and homozygous animals can be distinctively identified. Once a marker system is established selection decisions can be made very easily, since DNA markers can be assayed any time after a DNA-containing sample, such as a blood or tissue sample, can be collected from the individual infant animal, or even earlier by testing embryos in vitro if very early embryos are collected. The use of marker assisted genetic selection will greatly facilitate and accelerate cattle breeding programs.
In one embodiment the present invention provides a breeding method whereby haplotype testing as described below is conducted on bovine embryos, young adults, or adult cattle, and based on the results, certain animals are either selected for or removed from the breeding program. Preferably, individuals negative for the MI haplotype are selected. Preferably, these individuals are homozygous with regard to the negative MI haplotype. For example, the individual is homozygous with regard to a negative MI haplotype.
The present disclosure relates to analyzing a nucleic acid sample from a bovine, including a young or adult bovine animal, an embryo, a semen sample, an egg, a fertilized egg, or a zygote, or other DNA-containing biological sample (e.g., blood, cell or tissue sample therefrom), to determine whether said bovine possesses one of the haplotypes disclosed herein, indicative of motor impairment. The nucleic acid sample is representative of DNA at position 78732954 to 80748266 bp on chromosome 16 (the “MI region”), or a fragment thereof. The nucleic acid can comprise genomic DNA, cDNA, or RNA.
In some embodiments, the method comprises haplotyping nucleic acid at bovine DNA at position 78732954 to 80748266 bp on chromosome 16 (the “MI region”) or fragment thereof, for at least one copy of the MI haplotype and optionally assigning to the individual a bovine MI haplotype score. The method may be used to identify the haplotype of both copies of the MI region in the animal, and assigning a haplotype pair to the animal.
In some embodiments, a haplotyping method comprises examining one copy of the MI region, or a fragment thereof, to identify the nucleotide at two or more polymorphic sites in that copy to assign a haplotype to the individual.
In some embodiments, a haplotyping method comprises examining one copy of the MI region, or a fragment thereof, to identify a single polymorphic site in that copy. In some embodiments, the single polymorphic site comprises the MI SNP, and is bp 79613592 of bovine chromosome 16 and is shown in SEQ ID NO: 1/2, where underline represents an intron:
GGGGCCCTCTGGCCCCTCACCTGCATGC[C/T]GATGACCGCGTAGATG
As will be readily appreciated by those skilled in the art, if an MI region is cloned and sequenced any individual clone will typically only provide haplotype information for a portion of one of the two MI region copies present in an individual. If haplotype information is desired for both copies of the individual's entire MI region, additional MI clones will need to be examined.
Further provided is a method for genotyping the bovine MI region, comprising determining for the two copies of the MI region present the identity of the nucleotide pair at one or more polymorphic sites, wherein the one or more polymorphic sites (PS) have the alternative alleles associated with the MI phenotype.
In some embodiments, a genotyping method comprises examining both copies of the MI region, or a fragment thereof, to identify the nucleotide pair at one or more polymorphic associated with the MI phenotype in the two copies to assign a genotype to the individual. In some embodiments, “examining a region” may include examining one or more of: DNA containing the region, mRNA transcripts thereof, or cDNA copies thereof. As will be readily understood by the skilled artisan, the two “copies” of a gene, mRNA or cDNA, or fragment thereof in an individual may be the same allele or may be different alleles. In another embodiment, a genotyping method of the invention comprises determining the identity of the nucleotide pair at each of the polymorphic sites associated with the MI phenotype.
In some embodiments, genotyping comprises an evaluation of one or more of the alleles in the shaded region of
The present disclosure further provides systems, platforms, and kits for genotyping and/or haplotyping a bovine sample, the system or kit comprising in a container one or more nucleic acid molecules designed for detecting the one or more polymorphisms associated with the MI phenotype, and optionally at least another component for carrying out such detection. In some embodiments, a kit comprises at least two oligonucleotides packaged in the same or separate containers. The kit may also contain other components such as hybridization buffer (where the oligonucleotides are to be used as a probe) packaged in a separate container. Additionally, or alternatively, where the oligonucleotides are to be used to amplify a target region, the kit may contain, preferably packaged in separate containers, a polymerase and a reaction buffer optimized for primer extension mediated by the polymerase, such as PCR.
In some embodiments, a system, platform, or kit may be configured to detect a mutant protein, such as a mutant calcium voltage-gated channel subunit alphal S gene (CACNA1S). In some embodiments, the mutant alters a GGC codon to AGC which facilitates a glycine to serine amino acid substitution.
Thus, in some embodiments, a system, platform or kit comprises immunoassay reagents for detecting a mutant protein, including one or more antibodies and buffers, wherein the primary or secondary antibody may have an attached enzyme that produces a color when a substrate for the enzyme is added and reacts with the enzyme. The enzyme that produces color may be linked to either the antigen or a primary or secondary antibody.
In some embodiments, a breeding method is provided whereby haplotyping and/or genotyping as described above is conducted on bovine embryos, and based on the results, certain cattle are either selected or dropped out of the breeding program. In some embodiments, individuals carrying an MI negative haplotype are selected for continued and future breeding.
Through use of the linked marker loci, the different haplotypes can be manipulated in genetic improvement programs by procedures termed “marker assisted selection” (MAS), for genetic improvement within a breeding nucleus; or “marker assisted introgression” for transferring useful alleles from a resource population to a breeding nucleus (Soller 1990; Soller 1994, herein incorporated by reference in their entireties).
In some embodiments, a nucleic acid sequence(s) comprising the MI SNP can be incorporated into a genotyping platform, comprising a panel or nucleic acid target array. As used herein, “panel” and “array” are used interchangeably and refer to a solid-state substrate, typically a glass, plastic or silicon slide, used to assay and detect single nucleotide polymorphisms from DNA samples in parallel, and genotypic data is output. Numerous panel/array platforms are commercially available and are well known in the art. By way of example, array genotyping platforms directed to cattle include, but are not limited to Neogen's BOVUHDV03 and GeneSeek Genomic Profiler (GGP) Bovine 150K; Illumina's bovine array beadchips, such as BovineSNP50 DNA Analysis BeadChip and BovineHD DNA Analysis Kit; and ThermoFisher's Axiom Bovine Genotyping 100K Array and Axiom Genome-Wide BOS 1 Bovine Array, Affymetrix platforms, among others. In some embodiments, nucleic acid sequence(s) comprising the MI SNP can be incorporated into such commercially available genotyping platforms. Alternatively, nucleic acid sequence(s) comprising the MI SNP can be incorporated into a custom designed array designed to assay specific pools of SNPs, some of which SNPs may overlap with a commercially available platform. Incorporation of nucleic acid sequence(s) comprising the MI SNPs depends on the chemistry of the array platform, but follows the principles of the array technology. For example, nucleic acid sequence(s) comprising the MI SNPs are used to design one or more complementary probe sequences, wherein the probe is affixed to the array.
By way of example, these array platforms represent a subset of a larger number of commercially available array platforms varying in SNP content, density (number), chemistry, throughput (number of samples that can be assayed simultaneously), or other properties. However, arrays generally rely on the hybridization of fragmented single-stranded sample DNA to short single-stranded probes, also known as oligonucleotides, which are typically 25-50 bp long. The probes are typically attached, printed, or synthesized directly onto the array surface in a grid-like fashion onto a two-dimensional solid-state surface, such as glass, plastic or silicon. The probe sequences are designed using apriori DNA sequence knowledge, which may have been collected from a panel of individuals representing genotypic diversity for that species, to artificially synthesize sequences that are complementary to the sample DNA fragment sequences. Hybridization depends on the binding of nucleotide bases with their complementary bases, where the probe and sample DNA are complementary in sequence, and thus hybridize. Even strands of DNA having less than a perfect match can hybridize. Typically, the DNA sample sequence matches the probe at 100%, in some embodiments, 99%, 98%, 97%, 96%, 95%, etc. down to 80%.
Array platforms typically utilize fluorescent dye to generate signals that are created when the dye is excited by a laser and the emission spectra detected by a camera, whereby the signals report specific nucleotide identities at the assayed basepair position within the probe sequence. The resulting raw images are normalized to subtract background signals and converted into genotypic data with confidence scores using computational algorithms (LaFramboise, Thomas, 2009). Variations of array technology exist, and the details provided herein are included by means of example but not by way of limitation.
In some embodiments, a nucleic acid sequences(s) comprising the MI SNP can be genotyped using probe oligonucleotides configured as molecular beacons. Molecular beacons are single-stranded oligonucleotide probes that are designed with complementary sequences on each end of the probe sequence, which creates a stem-loop structure. A fluorophore is added to one end of the probe and a fluorescence quencher to the other end. When in close proximity, such as is achieved when the molecule remains in the stem-loop state, no fluorescence is emitted. However, when the probe sequence hybridizes to its target sequence (the complementary sample DNA sequence), the stem-loop denatures, thereby separating the fluorophore and quencher, and allowing fluorescence. An individual's genotype at a particular position can be determined by designing one molecular beacon for one allele and another beacon for an alternate allele, where the probe wavelengths for each beacon differ. Different alleles can be distinguished by the different wavelengths each fluorophore emits.
In some embodiments, a nucleic acid sequence(s) comprising the MI SNP can be genotyped using DASH genotyping, which is based on detecting differences in melting temperatures of double-stranded DNA when there is a mismatched nucleotide. A fluorescent marker only emits a signal when the molecule is bound to dsDNA, which is formed between an allele-specific oligonucleotide and the sample DNA. As the temperature of the reaction is increased, the dsDNA denatures into ssDNA, and the marker no longer fluoresces.
A number of enzymes can be used to develop SNP genotyping assays, including DNA polymerase, DNA ligase and nucleases.
In some embodiments, a genotyping assay includes an amplification step, e.g., to amplify nucleic acid from a subject sample, wherein the nucleic acid comprises the MI SNP. The skilled person understands how to design primers and a polymerase chain reaction (PCR) protocol to amplify the nucleic acid target of interest, such as a sequence comprising the MI SNP. The primers may be designed to be complementary to the sequences flanking the MI SNP, such that the MI SNP and any sequence 3′ of the forward primer and 5′ of the reverse primer is amplified via PCR. The primer sequences may be variable in length, sequence composition or position relative to the MI SNP, and may amplify variable length amplicons of varying sequence compositions, wherein the amplicon contains the MI SNP. PCR-based genotyping technologies include but are not limited to realtime or quantitative PCR (qPCR) and end-point genotyping.
In some embodiments, a nucleic acid sequence(s) comprising the MI SNP can be genotyped using real-time qPCR. Real-time qPCR approaches are well known in the art, and generally rely on measuring PCR product accumulation over each PCR cycle, i.e., in real time, as opposed to after a specified number of PCR cycles as is done with PCR. qPCR typically relies on a reporter molecule/enzyme to determine the amount of PCR product present relative to a control in real time. SYBR green is one type of non-specific dye that can be used to report on PCR product accumulation by binding to dsDNA and fluorescing when bound. Hybridization probes may also be employed, wherein the probes consist of oligonucleotides complementary to sequences internal to the PCR product. The probes are tagged with different fluorescent dyes, one dye at the 3′ end of one probe and a different dye at the 5′ end of a second probe. As PCR products accumulate, the probes bind to the target DNA, which brings them close enough together to catalyze a fluorescence resonance energy transfer (FRET), wherein energy is transferred from the shorter wavelength dye (the donor) to the longer wavelength dye (the acceptor). The PCR product amount can be estimated based on the wavelength and intensity of the fluorescence detected.
In another embodiment, a nucleic acid sequence(s) comprising the MI SNP can be genotyped using end-point genotyping, which involves measuring a PCR amplification product after a specified number of PCR cycles. One example of an end-point PCR genotyping assay is LGC Biosearch's Kompetitive allele specific PCR, or KASP, which is based on allele-specific PCR and fluorescence resonant energy transfer (FRET). Two allele-specific forward primers, each labeled with a distinct fluorescent dye, and a common reverse primer allow for amplification of the sample DNA regardless of which allele it contains. The SNP genotype is determined by the emission spectra detected, where a homozygous individual is indicated by a signal consisting of one of two possible spectra, depending on the allele, and a heterozygous individual is indicated by a signal containing both possible spectra.
In some embodiments, nucleic acid sequences comprising the MI SNP can be genotyped using restriction fragment length polymorphisms (RFLPs). This assay comprises digesting a DNA sample into fragments by one or more restriction enzymes, where the resulting fragments can be separated by gel electrophoresis. The patterns of banding determine the composition of nuclease cleavage sites found within the DNA sample sequence, which ultimately reports on the original DNA sequence.
In some embodiments, a nucleic acid sequence(s) comprising the MI SNP can be genotyped using flap endonuclease (FEN). FEN is an endonuclease that recognizes and cleaves a specific molecular structure. One example of an assay that that utilizes a FEN called cleavase is the Invader assay. The Invader assay is well known in the art, and basically involves designing an oligonucleotide Invader probe that is complementary to the 3′ end of the variable SNP site. The last nucleotide of the Invader probe is a non-matching base that overlaps the variable SNP site. A second allele-specific probe is designed to be complementary to the 5′ end of the variable SNP site but extends past the 3′ side of the site. If the sample DNA and the second allele-specific probe are complementary in sequence, they will hybridize, and in the presence of the invader probe form a three-dimensional complex that is recognized and cleaved by FEN. Fluorescence resonance energy transfer (FRET) technology can be used to detect the cleavage reaction by triggering a secondary cleavage reaction. The secondary cleavage reaction occurs between a quencher molecule that is attached to the 3′ end of the second allele-specific probe, and a fluorophore that is attached to the 5′ end of the same probe. The primary cleavage by FEN separates the quencher from the fluorophore and thus causes a detectable signal. The invader assay is only one such genotyping assay and is meant to serve as an example. Variations of the invader assay include but are not limited to Serial Invasive Signal Amplification Reaction (SISAR), which allows both SNP alleles to be tested in a single reaction.
In some embodiments, nucleic acid sequences comprising the MI SNP can be genotyped using primer extension or single base extension, a mini-sequencing technique. A detection primer is designed to target a sequence directly upstream of the SNP. The 3′ end of the oligonucleotide is then extended by a single base using dideoxynucleotide triphosphates (ddNTPs), wherein each ddNPT base corresponds to a different fluorescent dye. The result is detection of up to four alleles, including the discrimination of homozygous and heterozygous genotypes. A number of detection platforms are able to detect the SNP, such as mass spectrometry (matrix-assisted laser desorption/ionization time of flight (MALDI-TOF), capillary electrophoresis, pyrosequencing, flow cytometry or fluorescence plate readers.
In some embodiments, nucleic acid sequences comprising the MI SNP can be genotyped using a 5′-nuclease. TaqMan probes utilize a variation of fluorescence resonance energy transfer (FRET) technology where a fluorophore is covalently attached to the 5′ end of an oligonucleotide probe, and a quencher is covalently attached to the 3′ end of the same probe. When PCR products accumulate, the probe binds to the PCR product during a denaturation cycle, and as Taq polymerase synthesizes a new strand, it encounters the hybridized probe. The exonuclease activity of Taq releases the fluorescently tagged nucleotide at the 3′ end of the probe. The fluor is now released into solution where it will fluoresce because it is no longer close enough to the quencher to carry out FRET.
In some embodiments, nucleic acid sequences comprising the MI SNP can be genotyped using an oligonucleotide ligation assay. This assay depends on the enzyme DNA ligase to ligate two DNA probes at the SNP site, wherein ligation occurs if both probes have perfect base pair complementarity to the target sequence. The first probe sequence is an allele-specific probe designed to be complementary to the target DNA so that its 3′ base corresponds to the target SNP, and the second probe sequence is designed to hybridize to the sequence directly adjacent to the SNP in the other direction so that its 5′ end is provided for the ligation reaction. Ligation will only occur if the probes match and hybridize with the target DNA. Any platform that can detect a difference in sequence length can be used to determine if the product is ligated or un-ligated, including gel electrophoresis, MALDI-TOF mass spectrometry, capillary electrophoresis. Alternatively, the oligonucleotides can be tagged and/or labeled so as to index samples and allow for high-throughput sequencing of the ligated products, wherein the genotypes of said products can be determined.
In some embodiments, the MI SNP may be detected by sequencing a nucleic acid sequence(s) comprising the MI SNP. Sequencing methodologies are broad, and a number of variations of the platform and chemistry exist. For example, first generation sequencing technologies utilize Sanger sequencing and rely on the electrophoretic separation of chain-termination products. Later generation sequencing technologies, including Next Generation Sequencing (NGS), are well-known in the art, and generally describe massively parallel sequencing via clonally amplified DNA templates in a flow cell. Sequencing methods that can be employed include but are not limited to whole genome sequencing (WGS), targeted resequencing, RNASeq, Exome Capture/Sequencing and genotype-by-sequencing (GBS), wherein the nucleic acid(s) sequenced can be DNA or RNA. These methods require use of a sequencing platform, where factors such as chemistry type, desired level of multiplexing, desired data output amount, and run-time should be considerations in choosing a platform. Examples of sequencing platforms include but are not limited to the Roche 454, Illlumina MiSeq, Illumina HiSeq, Life Technologies SOLiD4 and Pacific Biosciences SMRT platforms.
Mass spectrometry methods determine the mass of an ionized molecule by measuring the molecule's migration rate in an electric field, where smaller molecules travel faster. Some types of spectrometers are able to separate different sized fragments and further fragmentize them to sequence the oligonucleotide.
Detection of a haplotype can be achieved by detection of one or more polymorphisms comprising the haplotype, wherein the MI haplotype comprises 120 markers ranging from 78,732,954 to 80,748,266 bp, including the MI SNP, as shown in Table 2. MI haplotype marker genotypes can be determined using any of the genotyping methods above, or a combination thereof, e.g., one or more assays may be designed to detect the genotype(s) of all or a portion of the 120 markers comprising the MI haplotype. The MI SNP may be genotyped using the same assay as is used for detecting the genotype(s) of the MI haplotype marker(s), or it may be genotyped using a different assay. In one embodiment, all or a portion of the 120 markers comprising the haplotype, optionally including the MI SNP, and additional markers in linkage with the MI SNP may be added to a commercially available array platform, such as those described earlier.
An alternative method for detecting a mutant individual is through use of an immunoassay, such as ELISA (enzyme-linked immunosorbent assay), which can detect the presence of a protein of interest or a mutant protein of interest. Antibodies designed to bind the protein in question are added to a liquid sample, wherein said binding or the binding of a secondary antibody causes a detectable signal that can then be measured. In this way, a mutant protein can be detected by antibodies specific to the mutant versus the wild-type protein. Both the wild-type and mutant versions of the protein encoded by the CACNA1S gene could be detected using this type of assay.
In some embodiments, a nucleic acid of the present disclosure, comprising the MI SNP, is added to an existing genotyping platform, panel or array. In some embodiments, the nucleic acid is added to the platform, panel, or array in double-stranded form. In some embodiments, the nucleic acid is added to the platform, panel, or array in single-stranded form. In some embodiments, the nucleic acid is single-stranded and comprises a sense strand of DNA including the MI SNP. In some embodiments, the nucleic acid is single-stranded and comprises the antisense strand of DNA including the MI SNP. In some embodiments, the nucleic acid is single-stranded and comprises an RNA including the MI SNP.
In some embodiments, the nucleic acid comprises SEQ ID NO: 1 and/or SEQ ID NO: 2, the complements thereof, or reverse complements thereof. In some embodiments, the nucleic acid comprises a fragment of SEQ ID NO 1 and/or SEQ ID NO: 2, the complements thereof, or reverse complements thereof, wherein the fragment comprises the MI SNP. In some embodiments the fragment includes at least 10 nucleotides 5′ and at least 10 nucleotides 3′ of the MI SNP position. In some embodiments, the nucleic acid comprising the MI SNP includes at least about 10, at least about 15, at least about 20, at least about 25, at least about 30, at least about 35, at least about 40, at least about 45, at least about 50, at least about 55 at least about 60, at least about 65 at least about 70, at least about 75, at least about 80, at least about 85, at least about 90, at least about 95, or at least about 100 nucleotides 5′, 3′, or both 5′ and 3′ of the MI SNP, or any combination of lengths for the 5′ and 3′ sides of the MI SNP. In some embodiments, the nucleic acid or fragment thereof comprising the MI SNP has about 70%, about 75%, about 80% about 85%, about 90% about 91%, 92%, 93%, 94%, 95%, 96%, 97%, 97%, 99% identity to SEQ ID NO: 1, SEQ ID NO: 2, a complement thereof, reverse complement thereof, or a fragment of any of these. In some embodiments, a nucleic acid comprising the MI SNP deviates from SEQ ID NO: 1 or SEQ ID NO: 2 by virtue of variation in the genetic code. That is, the variant nucleic acid sequence (or fragment) encodes the same amino acid sequence, although the nucleic acid sequence may differ from that of SEQ ID NO: 1, SEQ ID NO: 2, a complement thereof, or a reverse complement thereof.
In some embodiments, the nucleic acid comprising the MI SNP also includes a detectable marker. The detectable marker may embody a number of formats, such as a dye that binds double stranded DNA of the nucleic acid comprising the MI SNP, or fluorescently labeled probes that hybridize with the nucleic acid comprising the MI SNP.
The following examples are intended to illustrate preferred embodiments of the invention and should not be interpreted to limit the scope of the invention as defined in the claims.
Holstein genomic selection programs screen for several genetic recessives associated with stillbirth and late term abortions (Agerholm et al., 2001; Charlier et al., 2012) and with neonate survival (Shuster et al., 1992; Schütz et al., 2016; Basiel et al., 2020). Genomic testing has also accelerated discovery of genetic recessives that result in embryonic loss (Fritz et al., 2013, 2018; McClure et al., 2014; Adams et al., 2016) by screening for regions where a given haplotype is not observed in a homozygous state when expected to be present based on haplotype frequency (VanRaden et al., 2011).
While genomic testing has facilitated rapid genetic improvement in addition to identification of genetic recessives, there has also been an acceleration of inbreeding when viewed on an annual and on a generational basis (Makanjuola et al., 2020). The current rise follows a period of contraction in male lineages due to the widespread adoption of artificial insemination (Dechow et al., 2020). Elevated inbreeding and small effective population sizes have increased the increased likelihood that new genetic recessive conditions will manifest in a significant number of animals (Fritz et al., 2013). Recently, motor impairment in newborn calves has been observed by veterinarians and farm staff in multiple states. Genetic defects are a prime contributor to neuromuscular defects in humans and other species (Laing, 2012), and could underlie the problem observed in calves. Therefore, the objectives of this study were to conduct a genome wide association study of affected calves and normal family members to determine if a genetic origin was likely and to establish plausibility of genetic inheritance based on shared ancestors.
Sample population. Table 1 provides details on the number of affected calves and the genotyped control population. Affected calves (34 total) were recorded on farms from New York state (farms 1 and 2), Florida (farm 3), and Pennsylvania (farm 4). The condition was first noted in December of 2019 with the latest observations from November of 2021. The affected calves were all unable to stand without assistance at one more time during the neonatal period with varying severity, recovery, and relapse patterns. Some were unable to stand for the first 24 hours after birth, whereas others were able to stand initially and then lost the ability to do so within the first month of life. There were 21 calves euthanized whereas 7 calves were considered recovered and 6 were unthrifty with poor growth at the time of data collection. Table 1 shows the number of calves with motor impairment and recovery status by farm, number of impaired calves that resulted from embryo transfer, number of genotyped impaired calves, number of genotyped control siblings, and number of parent or other relatives with genotypes.
Necropsy was performed on calves at the Cornell University College of Veterinary Medicine and the University of Florida College of Veterinary Medicine. Histological examination of muscle and peripheral nerves followed biopsy of the right bicep and right brachial plexus from one farm 3 calf. Blood samples were collected from four calves on farm 1 to determine if cholesterol deficiency, anemia, or selenium deficiency might contribute to the motor impairment, and multiple blood samples were collected on farm 3 to evaluate potential selenium deficiency.
Calves were from 17 sire×dam combinations and 22 calves resulted from in-vitro fertilization with full sibling families of 13 (farm 3), 3 (farm 3 and 4), and 2 (farm 2). All farm 3 calves were sired by a single bull mated to 3 dams.
Genomic and pedigree analysis. Tissue, hair or blood samples were available for genotyping from 18 affected calves ranging from 1 to 12 per farm with the remaining calves destroyed before sample collection could occur. For a control population, 22 healthy full siblings from two (17 and 5 calves, respectively) farm 3 dams were sampled. Additionally, the sire of the farm 3 calves and a full sister to the dam of three farm 4 calves were sampled. All samples were genotyped for 139,376 DNA markers (BOVUHDV03, Neogen, Lincoln, NE). The genotypes of two farm 3 dams were imputed with findhap.f90 (v3; Vanraden et al., 2011) and included in subsequent analyses.
Motor impairment was recorded as 1 for 26 unaffected animals including the control calves and genotyped relatives, or 2 for the 18 affected calves. Association analysis was conducted in PLINK (v1.9; Chang et al., 2015; www.cog-genomics.org/plink/1.9/). A marker call rate of ≥99% was imposed, leaving 101,907 makers for analysis. The initial association analysis was comprised of an allelic chi-square test (frequency of allele A versus frequency of allele B) assuming 1 degree of freedom with p-values generated using Fisher's exact test. This was followed with a dominance model (genotype AA and AB versus BB) with a 1 degree of freedom chi-square test. Multiple comparison testing was considered by implementing a false discovery-rate (FDR) p-value to declare genome-wide significance at FDR P≤0.05. Homozygosity screening was also performed to identify runs of homozygosity.
Thirty-three of the affected calves had a known sire and were included in a pedigree analysis. Two generations of sire identification (sire and maternal grand-sire) were available for nine calves from farm 1 whereas a minimum of six generations (calf's sire plus sires for 5 generations of dams) was available for all remaining calves. All known ancestors were traced separately for the sire and dam of each affected calf and the number of calves where an ancestor was present in the sire and dam lineage was determined to identify plausible common ancestors. Imputed genotypes of potential ancestors were derived using findhapf90 using the affected calve and control genotypes plus a population of 7622 Holsteins with genotype densities ranging from 3K to 150K.
Clinical findings. Necropsy failed to determine a cause for motor impairment for any of the calves; there was no evidence of cholesterol deficiency, selenium deficiency, or anemia based on serum testing results. A definitive disease diagnosis was not supported by the histological examination of muscle and peripheral nerves; however, the intramuscular nerve branches of the bicep biopsy showed moderate subperineural edema with scattered macrophage infiltration of the endoneurium, endomysium and perimysium. Similarly, moderately severe edema was noted in the endoneurium and subperineurium along with scattered mononuclear cell infiltration of nerve fascicles of the brachial plexus. The behavior and appetite of the calves was normal and there were no obvious clinical symptoms preventing proper motor function.
Genome wide association. A Manhattan plot demonstrating the significance level for each marker according to the dominance model is shown in
The association of a 120-marker haplotype ranging from 78,732,954 to 80,748,266 bp with the motor impairment phenotype is shown in Table 2. All affected calves were homozygous and one of 26 controls was homozygous and was located on farm 3; 9 controls were homozygous for the alternate haplotype and 16 were heterozygous. The sire of the farm 3 calves and both dams were heterozygous for markers in the region.
A second region ranging from 9,585,914 to 10,754,653 bp that contained three significant markers was present on chromosome 20 with the dominance model (
Pedigree analysis. The 33 affected calves with sire identification shared 63 known ancestors in their sire lineage born since 1990 with the most recent born in 2008 and 2010. For the maternal lineage, all 24 affected calves with at least 5 generations of known maternal sires traced to the bull born in 2008, versus 7 for the bull born in 2010. Additionally, the imputed genotypes suggested the 2008 sire was a carrier of the candidate region but not the 2010 sire. Therefore, the bull ROYLANE SOCRA ROBUST-ET (Robust) was considered the most likely ancestor of origin for these pedigrees. When expanded to include the remaining 9 calves with known maternal grand-sire, 30 of the calves could be traced to Robust. The pedigree with the most pathways from Robust to affected offspring is shown in
Calves homozygous for a haplotype on chromosome 16 had elevated risk of motor impairment characterized by an inability to stand. While some calves partly recovered, the condition of most deteriorated and were euthanized, often from development of secondary conditions including pneumonia. Curiously, the calves appeared healthy other than impaired motor function.
Despite strong evidence for the candidate region, there is a degree of uncertainty due to the inconsistent development of the motor impairment phenotype, existence of a homozygous calf that was able to stand without assistance, and an additional region with significant markers; this raises the possibility that the identified genomic region is spurious and that the condition is non-genetic. Evidence for the region harboring a recessive allele was supported by high statistical significance, the presence of heterozygous haplotypes in genotyped parents, a shared homozygous haplotype across multiple affected families, and a plausible path of inheritance.
Observations in humans and animal models support a genetic basis for the motor impairment phenotype. Humans with defects in the CACNA1S gene are known to have muscle weakness (Fialho et al., 2018). In some instances, individuals are affected with Hypokalemic periodic paralysis which is associated with muscle paralysis that can last for a few minutes to a few days and can be triggered by diet and exercise. Similar to observations in calves with MI, CACNA1S mutations in humans result in variation in the frequency and duration of muscle weakness, as well as the age when the condition first appears. Genetic conditions in genes other than CACNA1S can also cause MI. Most neuromuscular disorders have a genetic origin and mutations in a wide variety of genes can cause disease (Laing, 2012). Genetic recessive conditions and dominant de-novo mutations have been associated with genetic pediatric neuromuscular disease in humans (Herman et al., 2021). A heritable juvenile-onset motor polyneuropathy was recently described in cats that was characterized by mononuclear cell infiltration of intramuscular nerve branches (Crawford et al., 2020). Mononuclear cell infiltration in skeletal muscle is a consistent characteristic of inflammatory myopathies (Ascherman, 2012).
A majority of the calves in this study resulted from IVF and embryo transfer. Epigenetic aberrations and developmental abnormalities including large offspring syndrome have been associated with IVF in multiple species (Ventura-Juncá et al., 2015). However, 11 calves were affected that resulted from AI and one resulted from natural mating. That observation coupled with strong a genomic association suggests that IVF is not the determining factor for development of motor inability. Nevertheless, a genotype×environment interaction with IVF increasing the likelihood that the phenotype will be expressed when the genetic defect is inherited cannot be discounted.
The rate of inbreeding on both and annual and generational basis increased considerably following the introduction of genetic selection (Dechow et al., 2020; Makanjuola et al., 2020), which has implications for long term selection response and increases the potential for genetic recessives to emerge (Fritz et al., 2013). The pedigree in
Genomic testing has accelerated discovery of genetic recessives that result in embryonic mortality (VanRaden et al., 2011). Unfortunately, the haplotype screening method is not likely to identify conditions such as motor impairment because calves may appear normal at birth and be genotyped as part of a normal management routine. Genomic selection for heifer livability was introduced in national genetic selection programs in (Neupane et al., 2021); however, the heritability is low (<1%) resulting in modest levels of accuracy for genomic predictions. Moreover, heifers fail to survive for a variety of reasons such as death, injury, and reproductive failure. It is not clear that genomic evaluation of a quantitative trait with a large range of underlying conditions would help detect genetic recessives like that observed here unless a large proportion of calves were affected with a specific condition.
Current rates of inbreeding in the Holstein breed are favorable for rapid amplification of genetic defects. Support for a genetic defect on chromosome 16 that confers motor impairment includes genome wide significance that was shared across multiple mating combinations, heterozygous parent haplotypes, a plausible common ancestor for the affected pedigrees, and similar genetic conditions in other species. Despite strong evidence for the existence of a genetic condition, some caution is yet warranted because of the inconsistent phenotypic presentation of the calves. Future efforts to identify the exact location and type of mutation are needed to confirm to condition and facilitate genetic testing and selection.
One affected calf that was homozygous for the recumbency haplotype, the sire of the affected calf, and a male ancestor that was phenotypically normal but known to be homozygous for the recumbency haplotype were sequenced at a depth of 30× using an Illumina (San Diego, CA) HiSeq pair-ended platform. The homozygous ancestor was identified at the USDA Animal Genomic Improvement Laboratory by tracing the haplotype of genotyped Holstein sires.
The sequence was aligned (Bowtie 2; https://bowtie-bio.sourceforge.net/bowtie2/index.shtml) to bovine chromosome 16 (ARS-UCD1; https://www.ncbi.nlm.nih.gov/assembly/GCF_002263795.2). Variants from the reference genome were considered candidate mutations if they were 1) homozygous in the affected calf, 2) heterozygous in the sire, and 3) not homozygous in the male ancestor. A SNP at bp 79613592 (C to T on forward strand, or G to A on the reverse strand) was identified that met the genotype criteria. The mutation and surrounding variants were heterozygous in the sire. The mutation was heterozygous in the male ancestor who was otherwise homozygous over the recumbency haplotype region. See
The sequence of the positive DNA strand from bp 79613564 to 79613623 with the SNP indicated as [C] or [T] is:
The mutation resides in the calcium voltage-gated channel subunit alpha1 S gene (CACNA1S) that is encoded on the negative strand. The missense mutation alters a GGC codon to AGC which facilitates a glycine to serine amino acid substitution. Ensembl (https://useast.ensembl.org/index.html) predictions (Table 3) indicate 9 potential transcripts which span 39 to 44 exons; the mutation resides in the 30th to 32nd exon of the transcripts.
The amino acid sequence corresponding to the mutated exon is shown in Table 4 for cattle and 5 other species. The highlighted amino acid indicates that the amino acid sequence is conserved across many species.
CACNA1S enables calcium channel voltage gate activity and is highly expressed in skeletal muscle (https://www.ncbi.nlm.nih.gov/gene/682930). Mutations are known to cause periodic paralysis in humans (https://www.ncbi.nlm.nih.gov/gene/779) and mice (https://www.ncbi.nlm.nih.gov/gene/12292), which corresponds to the calf recumbency phenotype observed in Holstein calves.
The mutation was submitted to the Ensembl Variant Effect Predictor (https://useast.ensembl.org/Homo_sapiens/Tools/VEP). The SIFT (Sorting Intolerant From Tolerant) score confirmed that the mutation was deleterious (SIFT score range of 0 to 0.01) with a moderate projected impact, which corresponds to the partially penetrant nature of calf recumbency.
Genotypes of 16 animals for the C to T SNP at bp 79613592 in the CACNA1S gene (Table 5) were ascertained using a TaqMan assay. The genotyped animals included 6 recumbent calves that were homozygous for the MI haplotype (haplotype genotype=MI/MI); 4 normal animals that were previously determined to be carriers of the MI haplotype (haplotype genotype=+/MI); 4 recumbent calves that had not been genomically tested (haplotype genotype unknown), and two normal animals that were not genotyped and assumed to be free of the MI allele based on pedigree (haplotype genotype unknown). Two of the heterozygous animals were bulls that were ancestors to the recumbent calves and the remaining two were siblings of recumbent calves. All recumbent calves were homozygous for the MI allele regardless of previous genotype status. All animals heterozygous for the MI haplotype were heterozygous for the MI SNP. This demonstrated that directly testing for the SNP at bp 79613592 was able to ascertain the MI genotype correctly. The test was able to identify all expected genotypes (MI/MI, +/MI, and +/+) and a negative control demonstrated no detectable amplification.
In the foregoing description, it will be readily apparent to one skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention. The invention illustratively described herein suitably may be practiced in the absence of any element or elements, limitation or limitations that is not specifically disclosed herein. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention. Thus, it should be understood that although the present invention has been illustrated by specific embodiments and optional features, modification and/or variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention.
Citations to a number of patent and non-patent references are made herein. The cited references are incorporated by reference herein in their entireties. In the event that there is an inconsistency between a definition of a term in the specification as compared to a definition of the term in a cited reference, the term should be interpreted based on the definition in the specification.
This application claims the benefit of U.S. Provisional Application 63/300,489 filed Jan. 18, 2022, the entire content of which is incorporated herein by reference.
This invention was made with government support under Hatch Act Project No. PEN04672 awarded by the United States Department of Agriculture. The Government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/060834 | 1/18/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63300489 | Jan 2022 | US |