The present invention relates in general to the analysis of genetic markers in individuals of nonhuman origin, especially to the genotyping of outbred populations of rodents or fishes. The invention specifically relates to the simultaneous amplification of at least 5 different polymorphic autosomal markers and at least two polymorphic Y-chromosomal markers of mouse in a reaction mix with the aid of the polymerase chain reaction or other multiplex methods and the detection of the specific alleles for each marker of the multiplex method. The invention further relates to a kit for the genetic identification and/or for the distinguishing of two or more animals from DNA extracts of individuals from wild-type populations, from different inbred or outbred strains, of the same inbred or outbred strain or of substrains of the same inbred or outbred strain, especially of a rat or mouse.
Animal experiments are still an indispensable part of modern research. With the goal of ensuring reproducible experiments, what is indispensable is not only the standardization of breeding conditions in animal husbandry, but also the standardization of the genetic quality of the test animals used. For the breeder of such animals, this means the use of defined systems for the stabilization of genetic structures. Such structures are intended to ensure a standardization over the period of many generations and over geographical distance. The latter also concerns the provision of test strains with identical designation from different breeding sites and/or from different breeders.
Inbred strains are mouse and rat models in which the genetic diversity among the individuals has been lost and is thus negligible owing to consistent brother—sister mating over many generations. Animals of an inbred strain or substrain of an inbred strain are genetically identical to an extent of over 99.9% and moreover homozygous. Experiments with such animals allow the statistical validity of experimental data, since they can be collected simultaneously on a multiplicity of effectively identical twins. However, such animal models are of only limited informative value for the application of experimental findings to humans, since what are represented here are artificial (inbred) populations which do not occur in this way in nature.
In parallel to inbred strains, there is also the necessity of the availability of standardized strains in which the genetic diversity has been maintained. Examples of such strains are mosaic and outbred populations. They serve as a practical model for noninbred populations, as typically also occur in animals and humans. With the aid of outbred populations, the aim is to represent the genetic variation within a population and to utilize said variation in relevant experiments. The genotype of each outbred animal within an outbred population is unique and is not repeated in a second animal. The goal of outbred populations is to maintain a defined, highest possible genetic heterogeneity while observing certain limits of variation within a closed, genetically well-characterized population over the course of generations. Such outbred strains exist for mouse, rat and zebrafish.
A high genetic variability can, for example, be achieved by crossing various, little-related strains or inbred strains during the setup phase of an outbred population. Thereafter, the population is closed, and only matings within the population are then allowed. In contrast to inbred strains, parent animals related to one another are not mated with one another. This is to ensure a stable and standardized genetic variability. The major requirements for an outbreeding system have been summarized by Rapp (1972): (1) maximum maintenance of population-specific allele and genotype frequencies, (2) minimization of increase in homozygosity and degree of inbreeding, (3) avoidance of formation of sublines, (4) ease of use. Strictly speaking, what is concerned is a genetically well-characterized, but limited population of animals that is composed of individuals which originally originate from different strains. Therefore, the technically correct designation is “outbred population”. Proceeding from the designation “inbred strains” and to illustrate the diametrical difference between the principles of outbreeding and inbreeding, the literature frequently also utilizes the designation “outbred strain”, and this is therefore a synonym for “outbred populations”.
For the reliable reproducibility of test results with outbred animals, a representative cross section of the genotype distribution of the specific outbred population would have to be available for each experiment. If no genetic markers are used to describe the genetic diversity, this represents a huge challenge for the breeder. For the user, the situation is even more complicated. Frequently, only very limited animal numbers are ordered from the breeder for the individual test stages and it is not possible at all to check whether the individual animals originate from a single (e.g., a parent pair) breeding group or different breeding groups of the outbred population.
The original creation and structuring of outbred populations involves crossing different male founder animals which, for their part, each introduce their individual variant of the Y chromosome into the outbred population. If the outbreeding is conducted correctly, all the originally available variants remain in the population. In different outbred populations and also within a defined outbred population, there is likely to be a multiplicity of different Y chromosomes which may also exhibit considerable genetic differences. Although a major aspect for defining the genetic diversity of an outbred population, no scientific data at all are available to date for the number of different Y chromosomes.
In the case of the inbred strain currently utilized in research, the Y chromosome goes back to one of the two types Mus musculus musculus (M. m. musculus) and Mus musculus domesticus (M. m. domesticus), which separated from one another approx. 900 000 years ago. Genetic analyses have revealed that the mouse species M. m. musculus, M. m. castaneus and M. m. molossinus all bear a Y chromosome of the type M. m. musculus. By contrast, M. m. domesticus represents a separate, second basic type (Pertile et al., 2009).
It is known that the Y chromosome in the case of mouse acts as a global regulator of genome-wide expression and that the crossing of a strain-exogenous Y chromosome often leads to considerable phenotypic changes in male and female descendants (Nelson et al., 2010). Therefore, different variants of the Y-chromosome represent a crucial parameter of the genetic and phenotypic variation of an outbred population. Describing genetic diversity thus also requires knowledge of the number and the nature of different variants of the Y chromosome in the population. For outbred strains in the case of mouse, neither the number of different Y-chromosomal haplotypes nor their belonging to M. m. musculus or M. m. domesticus is known to date.
Whereas a large marker repertoire of DNA polymorphisms is utilized for inbreeding in the case of mouse, there are to date no standardized monitoring programs or marker sets for outbreeding of mouse (Kluge and Wedekind, 2016). Yalcin et al. (2010) describe the parallel utilization of multiple 100 000 SNPs for the genetic characterization of outbred populations of mouse.
Microsatellites, also called short tandem repeats (STRs), are used for the genotyping of inbred strains. These are short DNA sequences of 1 to 6 bases in length, the basic motif of which is repeated multiple times like beads strung together, for example [CA]n, [GAC]n or [GATA]n. Owing to the different number of such repeats in different individuals, it is possible to distinguish such individuals from one another. Witmer et al. (2003) describe the utilization of STRs having a dinucleotide repeat unit to distinguish the various inbred strains of mouse. US 2014/0066322 describes the use of a PCR multiplex to distinguish cell lines in the case of mouse, in which 9 different mouse STRs are combined with 2 further human STR markers. WO 2016/008894 describes genotyping to distinguish individuals within inbred strains. Both applications use STR markers, the repeat unit of which consists of 4 nucleotides.
All the available STR data collected for mouse are based on the analysis of inbred strains. The utilization of STRs for outbred populations of mouse is unknown. Biostatistical parameters for assessing the quality of individual mouse STR markers, such as polymorphism information content (PIC) or heterozygosity, are unavailable. Shang et al. (2014) describe 6 STRs which were utilized for genotyping in the case of rat for the outbred strains Wistar and Sprague Dawley.
The genetic monitoring of outbred populations in the case of mouse, rat or zebrafish is therefore an area which can still be distinctly optimized (Kluge and Wedekind, 2016). This requires meaningful biostatistical parameters, which are unavailable to date. It would be advantageous if a method based on autosomal and sex-specific STR markers were to be available, allowing the genotyping and comprehensive biostatistical characterization of outbred populations. A suitable method is the multiplex STR analysis of polymorphic tetranucleotide STR loci of mouse, and this is the subject matter of the present invention.
It is therefore an object of the invention to overcome the disadvantages of the prior art and to make reliable means and methods for genotyping outbred populations available to the breeder and user.
This object is achieved by providing particularly suitable STR markers, with the aid of which a set of markers for genotyping outbred populations can be made available to the breeder and user.
It is further an object of the invention to enable the breeder to genetically characterize outbred populations, to document the genetic diversity and to carry out a genetic monitoring of important biostatistical parameters during breeding. It is further an object of the invention to make available a method for assessing the genetic diversity within an outbred population on the basis of autosomal and Y-chromosomal markers. It is likewise an object of the invention to allow a comparative assessment of the quality of outbred populations with one another through the use of a uniform set of markers. Furthermore, it is intended to allow the user to be able to independently assess the genetic diversity of the outbred animals purchased from commercial breeders.
To achieve the abovementioned objects, the invention provides, in one aspect, a method for genetically identifying and/or for distinguishing two or more animals from different wild-type populations, from different outbred strains, of the same outbred strain or of substrains of the same outbred strain of the species mouse, rat, hamster and zebrafish, the method comprising the simultaneous amplification of at least 5, preferably at least 6, 7 or 8, particularly preferably at least 9 or 10, most preferably at least 11 different polymorphic autosomal markers and at least two polymorphic Y-chromosomal markers of the respective species in a reaction mix with the aid of the polymerase chain reaction or other multiplex methods and, optionally, the detection of the specific allele for each marker of the multiplex method.
The method likewise comprises providing relevant oligonucleotide primer pairs, with each marker being assigned a primer pair which specifically hybridizes on both sides of the marker region. The method further describes the simultaneous amplification of at least 7, preferably at least 8, 9 or 10, particularly preferably at least 11 or 12, most preferably at least 13 of the selected markers in a multiplex reaction mix and the formation of a mixture of alleles with reaction products for each of the participating markers.
The method further comprises the detection of the individual alleles in the allele mixture and the unambiguous assignment thereof to a marker and to a defined allele for the specific marker.
In a separate embodiment of the present invention, a method allowing the assignment of the Y chromosome of outbred populations to the type M. m. musculus or M. m. domesticus with the aid of STR markers is provided.
In a further embodiment of the present invention, a method allowing the identification of different variants of the Y chromosome within an outbred population of mouse with the aid of STR markers is provided.
In a separate embodiment of the present invention, a method allowing the identification of different variants of the Y chromosome of rat with the aid of STR markers is provided.
The marker set according to the invention can additionally be supplemented by those markers which allow an assignment to individual mouse strains. Suitable for this purpose are, for example, known strain-specific insertion-deletion polymorphisms (indels) or SNPs. As an example of such markers, the 25 bp deletion in the gene Disc1 for the strain 129, as described by Clapcote and Roder (2006), can be mentioned. These additional markers are particularly suitable for the use of the inventive STR multiplex assays in assignment of founder animals of an outbred population to known inbred strains, the characterization of transgenic lines of mouse and also the testing of the origin of cell lines.
With the present invention, what is made available in a further aspect is a kit which can be used to detect the alleles of a marker set that are present in a DNA sample. The kit contains all the necessary components for carrying out a simultaneous coamplification and for detecting alleles with the involvement of at least 5, preferably at least 6, 7 or 8, particularly preferably at least 9 or 10, most preferably at least 11 different autosomal STR markers and at least two, preferably three Y-chromosomal STR markers for at least one DNA sample.
The following detailed description of the invention and the attached FIGURE and exemplary embodiments are intended to more particularly elucidate the essence of the invention, to show further possible uses and to demonstrate the the advantages arising therefrom.
In a further aspect, the invention provides a kit for the genetic identification and/or for the distinguishing of two or more animals from different strains, of the same strain or of substrains of the same strain of the of the species mouse, rat, hamster and zebrafish from DNA extracts, the kit comprising:
In a preferred embodiment of the invention, the kit consists of parts (a) to (d).
Allele: Allele refers to various forms of a gene or a gene sequence in a defined DNA region of a chromosome.
Amplification: Amplification describes the reproduction of DNA segments. This can occur naturally or else be generated artificially (in vitro). The latter is, for example, realized in molecular biology via the PCR method.
Haplotype: Haplotype describes a haploid genotype, a unique variant of a nucleotide sequence on the same chromosome. The Y chromosome occurs in the genome only singly and cannot exchange DNA segments with another chromosome, meaning that the Y-chromosomal haplotype remains constant.
Multiplex: In contrast to a singleplex PCR, in which only one primer pair is utilized, a multiplex PCR mix contains a plurality of different primer pairs, with each of the primer pairs amplifying a different region of the chromosomal DNA.
PCR: The polymerase chain reaction is a method which is used to reproduce DNA in vitro with the aid of DNA polymerases (e.g., Taq polymerase). The starting point used for the new synthesis are specific primers which bind specifically to defined DNA segments.
Polymorphic: A marker is polymorphic if it has at least two or more different alleles.
Primer/oligonucleotide primer: In molecular biology, primer refers to an oligonucleotide which serves as the starting point for the amplification of DNA segments.
Particularly good results were achieved in the genotyping of outbred populations when the set of loci for the DNA sample, from which the autosomal STR markers are selected, comprises at least 11 autosomal STR markers which can be coamplified simultaneously in a reaction mix.
According to a first aspect, the invention therefore provides a method for genetically identifying and/or for distinguishing two or more animals from different wild-type populations, from different outbred strains, of the same outbred strain or of substrains of the same outbred strain of the species mouse, rat, hamster or zebrafish, comprising the following substeps
In a preferred embodiment of the invention, the method for genetically identifying and/or for distinguishing two or more animals from different wild-type populations, from different outbred strains, of the same outbred strain or of substrains of the same outbred strain of the species mouse, rat, hamster and zebrafish consists of substeps (a) to (f).
In a further preferred embodiment of the method according to the invention, the DNA sample originates from at least one animal of the species mouse, rat, hamster or zebrafish or from a cell line of the species mouse, rat, hamster or zebrafish.
One advantage of the invention is that the use of the method according to the invention is in no way limited only to outbred strains. It can be utilized in all cases in which a comparative DNA analysis of genetic markers is meaningful, such as, for example, for the authentication of cell lines, and the genetic characterization of captured wild animals (wild-type populations), of inbred strains and of genetically modified lines of mouse. For example, it is possible to utilize a multiplex, containing in each case one marker per chromosome, for monitoring the ploidy of cell lines and for assessing the need to carry out a karyotyping of a cell line.
The present invention concerns the requirement of monitoring the genetic quality of outbred populations, as already elucidated at the start. This requires meaningful biostatistical parameters, which can be collected both by the breeder and by the user as a result of a genotyping procedure. A particular advantage of the invention is that the inventive multiplex assay based on STR markers having tetranucleotide repeat units can be utilized for sufficient genetic characterization and comparison of any desired outbred strains of mouse, rat, hamster or zebrafish, preferably of mouse or rat, particularly preferably of mouse. As a result, the genetic monitoring of outbred populations in the case of mouse, rat, hamster or zebrafish, preferably mouse or rat, particularly preferably mouse, is considerably simplified.
In one embodiment, the present invention provides a method which allows the assignment of alleles of defined STR markers in a DNA sample. This method comprises, in a preferred embodiment, the selection of a DNA sample intended for the analysis and the selection of at least 11 autosomal STR markers from a group of polymorphic STR markers of mouse: D1Mmu121, D2Mmu008, D3Mmu158, D4Mmu155, D5Mmu108, D6Mmu120, D7Mmu003, D8Mmu127, D9Mmu100, D10Mmu043, D11Mmu030, D12Mmu056, D13Mmu096, D14Mmu074, D15Mmu084, D16Mmu030, D17Mmu041, D18Mmu069, D19Mmu008. The individual autosomal markers are, according to the invention, located on different chromosomes, and so they are not genetically coupled to one another. This has the advantage that the use of simple biostatistical parameters for describing a population is made possible as a result.
In a preferred embodiment of the method according to the invention, in step (b), at least two of the eleven autosomal STR loci are selected from a group of loci, the group comprising the loci D1Mmu121, D2Mmu008, D3Mmu158, D4Mmu155, D5Mmu108, D6Mmu120, D7Mmu003, D8Mmu127, D9Mmu100, D10Mmu043, D11Mmu030, D12Mmu056, D13Mmu096, D14Mmu074, D15Mmu084, D16Mmu030, D17Mmu041, D18Mmu069 and D19Mmu008.
In a preferred embodiment of the method according to the invention, in step (b), the set of eleven autosomal loci consists of the loci D2Mmu008, D3Mmu158, D4Mmu155, D6Mmu120, D7Mmu003, D8Mmu127, D10Mmu043, D13Mmu096, D14Mmu074, D16Mmu030 and D18Mmu069.
Furthermore, the method according to the invention comprises selecting at least two Y chromosome-located STR markers from a group of possible mouse STR markers: DYMmu001, DYMmu002 and DYMmu003. The marker DYM003 is utilized in order to allow an assignment to the type M. m. musculus or M. m. domesticus. At least one further polymorphic STR marker from DYMmu001 or DYMmu002 is used for distinguishing different haplotypes.
Preferably, the set of loci in step (c) of the method according to the invention is situated on the Y chromosome and can advantageously be utilized for sex identification in the DNA sample of the individual in step (a). A further advantage of the set of loci according to step (c) is that said set of loci for sex identification can distinguish between the Y chromosomes of the type Mus musculus musculus and of the type Mus musculus domesticus.
In a further preferred embodiment, the locus for sex determination is a polymorphic STR locus of mouse and is selected from the group comprising the loci DYMmu001, DYMmu002 and DYMmu003. Such a locus has the advantage that it can distinguish Y-chromosomal haplotypes of mouse.
In a further preferred embodiment, the locus for sex determination is a polymorphic STR locus of rat and is selected from the group comprising the loci DYRno004, DYRno165 and DYRno304. Such a locus has the advantage that it can distinguish Y-chromosomal haplotypes of rat.
This means that it is possible to perform, within a specific outbred population, an assignment of male individuals to different haplotypes and also the assignment to one of the two basic Y chromosome types M. m. musculus or M. m. domesticus. This approach for describing the genetic diversity of outbred populations via the Y chromosome is made available for the first time by the present invention and can be utilized in future for genetic monitoring.
In a particularly preferred embodiment, in step (e) of the method according to the invention, a multiplex amplification with at least thirteen pairs of oligonucleotide primers, which comprises at least eleven autosomal and two Y-chromosomal STR loci, is carried out.
The amplified alleles are preferably separated by means of an analytical or semipreparative method before the evaluation in step (f). A preferred separation method is gel electrophoresis. Particular preference is given to polyacrylamide gel electrophoresis or capillary gel electrophoresis.
Furthermore, it has been found to be advantageous when the primer pairs are labeled for the subsequent detection method. In a preferred embodiment of the method according to the invention, at least one oligonucleotide primer of a primer pair is therefore covalently coupled to a detection dye, preferably to a fluorescent dye.
It is particularly preferred when at least four different dye-coupled primers with four different fluorescent dyes are used. This simplifies the analysis of the allele mixture in step (f), in which the specific alleles are determined and assigned for each of the amplified loci of the set for the given DNA sample. A direct detection of the alleles is thus possible.
The fluorescent dyes can, for example, be selected from the groups of phycobilins, rhodamine and safranins. Suitable fluorescent dyes can, for example, be selected from the group consisting of acridine yellow, acridine orange, aequorin, aesculin, allophycocyanin, 7-aminoactinomycin, ATTO dyes, auramine O, berberine, 9,10-bis(phenylethynyl)anthracene, calcein, 6-carboxyfluorescein, quinine, coumarin dyes, cyanines, 4′,6-diamidino-2-phenylindole, 2,7-dichlorofluorescein, 9,10-diphenylanthracene, eosin B, eosin Y, epicocconone, ethidium bromide, 6-FAM-phosphoramidite, flavins, fluorene, fluorescein, fluorescein arsenic helix binders, fura-2, furaptra, green fluorescent protein (GFP), Hoechst 33342, IAEDANS, Indian yellow, indocyanine green, luciferins, merbromin, N-methylacridone, Nile blue, Nile red, phycocyanin, phycoerythrin, propidium iodide, pyranine, rhodamine B, rubrene, safranin T, stilbene, SYBR green I, SYPRO orange, SYPRO red, SYPRO ruby, Texas red, TMRM+, umbelliferone and xylenol orange.
The fluorescent dyes are preferably covalently bonded to the nucleic acids in the sample.
Lastly, it is advantageous when the assignment of the amplified alleles in step (f) is done on the basis of comparison with a size standard. In a further preferred embodiment of the method according to the invention, what is therefore done is the assignment of the amplified alleles in step (f) on the basis of comparison with a size standard, the size standard being a mixture of DNA fragments of known size and/or a locus-specific mixture of known alleles.
In a further aspect, the invention provides a kit for the genetic identification and/or for the distinguishing of two or more animals from different strains, of the same strain or of substrains of the same strain from DNA extracts, comprising:
In a particularly preferred embodiment, the kit further comprises
In a further preferred embodiment of the invention, the kit for the genetic identification and/or for the distinguishing of two or more animals from different strains, of the same strain or of substrains of the same strain of mouse from DNA extracts consists of parts a. to d.
The present invention thus provides a method and a kit for genotyping individuals in the case of animal species which are bred in the form of outbred populations, specifically of outbred populations in the case of mouse or rat. It allows the genotyping of individuals, the identification of different haplotypes of the Y chromosome and the description of the genetic diversity of a specific outbred population. The method disclosed here and the kit allow the simultaneous analysis of genomic DNA segments in the case of preferably mouse with the aid of STR markers based on tetranucleotide repeat units, with coamplification of at least eleven different autosomal STR markers and at least two further STR markers for the Y chromosome in a reaction mix.
Regions containing potential STR markers were identified with the aid of the software FastPCR (PrimerDigital Ltd, Kalendar et al. 2014) by importing into the software the GeneBank-deposited Fasta files of the chromosomes of mouse (version GRCm38.p4) and rat (version Rnor_6.0). Regions containing potential STR markers were identified using the module “SSR Search”. The selection criterion was the presence of a tetranucleotide repeat unit which had to be present repeatedly in tandem at least 10 times. By means of PrimerBLAST, primers suitable for the analysis of the markers in the singular approach were generated.
Genomic DNA was extracted from tail-tip biopsies or ear punches with the aid of the NucleoSpin Tissue Kit (Macherey-Nagel, Duren, Germany) according to the information specified by the manufacturer.
On the basis of the analysis of 150 individuals from various inbred strains and outbred populations and of substrains of inbred strains of mouse, the alleles determinable in the DNA samples were determined for 240 different STR markers. Not all candidate markers led to amplifiable DNA products in all the tested strains. Therefore, a further criterion for the selection of suitable markers was that they can be used in all outbred populations and in all inbred strains.
The suitability of the STR markers for describing the genetic diversity of outbred populations was tested on the basis of the genetic analysis of altogether 70 different male individuals of outbred strains CD1, SWISS and NMRI via the determination of the biostatistical parameters polymorphism information content (PIC) and heterozygote rate (Het). For each chromosome, what was selected was that STR marker which has the highest PIC value.
For all STR markers, oligonucleotide primers suitable for the simultaneous amplification of at least 11 different autosomal STR markers were identified. Surprisingly, many of the PrimerBLAST-generated primers were not optimally suitable for multiplex applications. The subsequent optimization of the oligonucleotide sequences with respect to multiplexing ability was done manually without software by testing a multiplicity of different primers. All primers are used at an annealing temperature of 58° C. Forward primers were coupled to one of the fluorescent dyes 6FAM™ (blue), DY530™ (green), DY510XL™ (red), ATTO550™ (yellow) (Eurogentec, the Netherlands). Primers were synthesized by biomers (biomers.net GmbH, Ulm, Germany) and Eurogentec (Eurogentec GmbH, Cologne, Germany).
Table 1 contains the list of the inventive autosomal and Y-chromosomal STR markers of mouse, and the oligonucleotide primers used for the amplification in the multiplex method. For each primer, what is specified is the chromosomal location thereof in base pairs, which was ascertained on the basis of reference DNA (NCBI, version GRCm38.p4). Designating the STR markers has the aim of including essential characterizing data simply on the basis of the name: “D” stands for DNA; the naming of the chromosome (indication directly after the “D”, 1 to 19 for the autosomes and X and Y for the sex chromosomes); identification of the origin of the DNA (e.g., Mmu—Mus musculus, Rno—Rattus norwegicus, Dre—Danius rerio); and the approximate position on the chromosome (in megabases). The STR marker D1Mmu121 is, for example, a DNA marker of of mouse that is situated on chromosome 1 in the region of approx. 121 megabases.
Furthermore, Table 1 shows the structure of the repeat unit for each marker. The indicated number of repeat units is based on the reference sequence for the strain C57BL/6J as deposited in the NCBI database. The alleles were named according to the guidelines for the standardization of nomenclature for STR markers in forensics (DNA Recommendations, 1997).
The primers belonging to a marker are mixed together in the ratio of 1:1 at a concentration of 5 pmol/μl. The PCR was carried out in a total volume of 25 μl using Snooplex® FastPrep PCR reagents (GVG Genetic Monitoring, Leipzig). The reaction mix for the PCR analysis of STR markers in the singular approach consisted of the following components: 16.2 μl of nuclease-free water, 5 μl of 5×PCR buffer, 1.0 μl of primer mixture, 0.8 μl of Taq DNA polymerase and 2 μl of DNA. In the case of the multiplex PCR approach according to the invention, primer mixtures of the following amounts were used: D2Mmu008 (0.8 μl), D3Mmu158 (1.2 μl), D4Mmu155 (0.8 μl), D6Mmu120 (1.6 μl), D7Mmu003 (1.2 μl), D8Mmu127 (1.2 μl), D10Mmu043 (0.4 μl), D13Mmu096 (0.7 μl), D14Mmu074 (1.6 μl), D16Mmu030 (1.6 μl), D18Mmu069 (1.2 μl), DYMmu001 (1.0 μl), DYMmu002 (0.8 μl), DYMmu003 (1.0 μl). Top-up to 25 μl final volume was done by addition of 2.1 μl of nuclease-free water, 5 μl of 5×PCR buffer, 0.8 μl of Taq DNA polymerase and 2 μl of DNA.
The PCR amplification was carried out using a Bio-Rad C1000 Touch thermal cycler (Bio-Rad Laboratories, Hercules, USA). The PCR parameters were 94° C. for 2 min, followed by 32 cycles at 94° C. for 30 s, 58° C. for 1 min, 68° C. for 2 min and a final elongation step at 68° C. for 10 min. The amplified PCR products were analyzed on an ABI 3500 Genetic Analyzer (Applied Biosystems, Foster City, USA). To this end, 1.8 μl of the PCR mixture and 0.2 μl of an internal size standard (MapMarker Custom, BioVentures Inc., Murfreesboro, USA; fragment sizes: 60, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 250, 260, 280, 300, 320, 340, 360, 380, 400, 425, 450, 475, 500, 525 and 550 bp labeled with the fluorescent dye Dy-632) were added to 12 μl of deionized formamide. The mixture was injected (15 s) and subsequently resolved at 15 kV and a running temperature of 60° C. in Performance Optimized Polymer 4 (POP4, Applied Biosystems). The analysis data were evaluated using GeneMapper v5 (Applied Biosystems).
The creation and structuring of outbred populations involves using only a limited number of founder animals. Therefore, not all STR-marker alleles which occur in nature in wild mice will also find their way into an outbred colony. Outbred populations thus have, in comparison with wild mice, a limited, defined number of different alleles. This number can vary between the various outbred populations, depending on which founder animals with which alleles were originally used. It is an object of the present invention to identify STR markers which exhibit, in different outbred populations, as many different alleles as possible with relatively uniform frequency distribution. Serving as reference value with respect to the possible number of alleles are genotyping results of over 150 different animals from altogether 17 different inbred strains of mouse.
Assessing an autosomal or X-chromosomal marker requires suitable criteria which describe the quality thereof with respect to genetic diversity and allow a comparison between various markers. Biostatistical parameters suitable for this purpose are “polymorphism information content” (PIC) and heterozygosity (H). Heterozygosity is calculated from the number of heterozygous DNA profiles (two different alleles detectable in the DNA sample) divided by the total number of analyzed DNA profiles. In the case of the calculation of the PIC value, the number of different alleles and also the percentage thereof in the total population is taken into consideration. Markers having a PIC value of 0.50 or more are particularly suitable for describing genetic diversity.
The methods described here and in the exemplary embodiments are merely to be understood as an example. A person skilled in the art is aware of various further possibilities, with the aid of which the goal of the invention can be achieved, such as DNA extraction from sample material, finding STR regions or generating and optimizing specific oligonucleotide primers.
Table 1: List of the inventive autosomal and Y-chromosomal STR markers in the case of mouse, the oligonucleotide primers used, the chromosomal location thereof in base pairs, and the structure of the repeat unit of the STR marker. The number of repeat units is based on the reference sequence for the strain C57BL/6J as deposited in the NCBI database. The alleles were named according to the guidelines (DNA Recommendations, 1997). The genomic DNA sequences of the inventive STR markers are listed in Table 2. (F— forward primer, R— reverse primer)
The invention is more particularly elucidated below on the basis of 1 drawing and 6 exemplary embodiments.
What is shown by:
Describing the genetic diversity of an outbred population requires knowledge of the number of different variants of the Y chromosome in the population. According to the invention, this object is achieved by first performing an assignment of the Y chromosomes of outbred populations to the two types M. m. domesticus and M. m. musculus. This requires markers which can assign a Y chromosome to one of the two basic variants. Surprisingly, it was possible to identify the STR marker DYMmu003, which has different alleles in different variants of M. m. musculus. By contrast, the PCR primers used do not have a homologous sequence for all variants of the Y chromosome of the type M. m. domesticus. The result of this is that, in the case of the presence of a chromosome of the type M. m. domesticus, no PCR product is generated. DYMmu003 is utilized to establish the belonging of Y chromosomes to one of the two types. To this end, between 5 and 25 male individuals of the outbred populations Hsd:ICR(CD1), Crl:CD1(ICR), RjOrl:SWISS, Crl:NMRI, Han:NMRI, RjHan:NMRI and HsdWin:NMRI were analyzed in each case. Altogether 100 male animals of 7 different outbred populations of mouse were analyzed using the STR marker DYMmu003 and assigned accordingly to M. m. musculus or M. m. domesticus (Table 3). In the case of type M. m. domesticus, DNA products are not obtained for either of the two markers; in the case of type M. m. musculus, alleles are always detected. Completely unexpectedly, only the outbred population Crl:NMRI has Y chromosomes of both types, whereas the outbred populations Hsd:ICR(CD1), Crl:CD1(ICR), RjOrl:SWISS and RjHan:NMRI have solely variants of the type M. m. domesticus and Han:NMRI and HsdWin:NMRI can always be assigned only to the type M. m. musculus.
M. m.
M. m.
domesticus
musculus
It is further an object of the present invention to make available STR markers for the Y chromosome that are polymorphic both in the case of the type M. m. musculus and in the case of the type M. m. domesticus and can be used for describing Y-chromosomal haplotypes. The inventive STR markers DYMmu001 and DYMmu002 have this property. On the basis of DNA samples from male animals of various inbred strains and outbred populations of mouse, the possible allele spectrum for the markers DYMmu001, DYMmu002 and DYM003 was determined.
Altogether 50 male animals of 15 different inbred strains and 85 male animals of 7 different outbred populations of mouse were analyzed using the markers DYMmu001, DYMmu002 and DYMmu003 (Table 4). The assignment to M. m. musculus or M. m. domesticus in the case of the outbred populations was done on the basis of the relevant investigation results for DYMmu003. Some alleles were detected solely in inbred strains.
M. m.
M. m.
domesticus
musculus
Since the Y chromosome does not recombine, the analysis results of different Y-STR markers can be combined with one another and presented in the form of haplotypes. Table 5 presents different Y-chromosomal haplotypes for 7 different outbred populations of CD1, SWISS and NMRI, which were obtained on the basis of the analysis results for the STR markers DYMmu001, DYMmu002 and DYMmu003. With the exception of the outbred population HsdWin:NMRI with 5 identical Y-STR haplotypes among 5 tested individuals, all the other outbred populations have more than one haplotype. According to the invention, it is possible to perform, within a specific outbred population, an assignment of male individuals to different haplotypes and also the assignment to one of the two basic Y-chromosome types M. m. musculus or M. m. domesticus. This approach to describing the genetic diversity of outbred populations via the Y chromosome is novel and can be used in future for genetic monitoring.
Altogether 100 male animals of 7 different outbred populations of mouse were analyzed using the STR markers DYMmu001, DYMmu002 and DYMmu003. The haplotypes were derived from the results of the analysis of the single samples, the haplotype results being presented in the order DYMmu001-DYMmu002-DYMmu003. The value “0” for DYMmu003 indicates that no PCR product is detectable and that a chromosome of the type M. m. domesticus is concerned. According to the invention, it is possible to detect different combinations of alleles within an outbred population, and this demonstrates the presence of various Y chromosomes within an outbred population that are distinguishable from one another.
The comparison of haplotypes between various outbred populations shows that there are in some cases haplotypes which are unique for a specific outbred population and were not able to be detected in another outbred population.
Male animals of 17 different strains of Rattus norwegicus were analyzed using the inventive Y-chromosomal markers DYRno004, DYRno165 and DYRno304. The alleles were designated on the basis of the measured lengths of the amplification products. This is sufficient for documenting the diversity of different alleles. Exact knowledge of the DNA sequences of the individual alleles is not absolutely necessary for this purpose. The haplotypes derived from the analysis results are presented in Table 7 in the order DYRno004, DYRno165 and DYRno304. All the strains are distinguishable from one another. These markers are thus suitable for genotyping the rat Y chromosome and for establishing the genetic diversity within outbred populations of rat. Table 6 brings together the inventive STR markers, the oligonucleotide primers used, the chromosomal location thereof and the structure of the repeat units. For each primer, what is specified is the chromosomal location thereof in base pairs, which was ascertained on the basis of reference DNA (NCBI, version Rnor_6.0). Since the rat Y chromosome is only very short, a detailed assignment was performed to characterize the chromosomal position of the marker. DYRno165 corresponds to the position at 1.65 megabases.
The oligonucleotide primer pair of the marker DYRno004 binds specifically to two different target regions of the Y chromosome and an allele product is obtained for each target region. The two alleles amplified in this connection usually differ from one another with respect to the number of repeat units, meaning that two different alleles can be detected simultaneously using one primer pair. Assigning the individual alleles to the specific target region is, nevertheless, not possible. The STR marker DYRno004 is thus a double marker which is highly polymorphic and is ideally suitable for creating a Y-specific haplotype. The smaller of the two alleles is listed first in the haplotype, then the second, larger allele. For the two markers DYRno165 and DYRno304, the same reverse primer is used; owing to the different forward primer, it is possible to perform an unambiguous assignment of alleles to the specific marker. According to the invention, the two specific forward primers can be provided with different fluorescent dyes, and this allows an unambiguous assignment of the amplification products.
Table 7 brings together the analysis results of the genotyping of 17 different strains of Rattus norwegicus using the STR markers DYRno004, DYRno165 and DYRno304. The designation of the alleles corresponds to the measured lengths of the amplification products. In principle, this is sufficient for demonstrating the diversity of different alleles without specific knowledge of the underlying DNA sequence. The presented haplotypes are presented in the order of the analysis results for DYRno004, DYRno165 and DYRno304. Each strain has its own haplotype. The combination of the three inventive STR markers is suitable for representing the genetic diversity of the Y chromosome in the case of rat.
The genotypes were determined for 20 male animals in each case of the outbred populations Hsd:ICR(CD1) and RjOrl:SWISS, and the PIC and Het values were calculated on the basis thereof. It is essential for the assessment of the quality of a marker that multiple different alleles are detectable and that none of the alleles should represent more than 50% of the alleles in a population. To be able to estimate this, the analysis of 20 animals per population was deemed sufficient. In the case of such a number of animals, there are results across altogether 40 alleles (2 per animal); an allele with a proportion of 10% in the population is, in this connection, detected four times from a statistical point of view.
The number of possible reference alleles as ascertained following the genotyping of a multiplicity of inbred strains and outbred populations varies, depending on the marker, between 7 and 11. For Hsd:ICR(CD1) and RjOrl:SWISS, STR markers were identified in which it was possible to detect the presence of at least 6 to 9 different alleles among the 20 genotyped animals. The PIC values vary between 0.34 and 0.84. The heterozygosity rate is between 0.50 and 0.95.
The PCR was carried out in a total volume of 25 μl using Snooplex® FastPrep PCR reagents (GVG Genetic Monitoring, Leipzig). The reaction mix for the PCR analysis consisted of the following components: 2.1 μl of nuclease-free water, 5 μl of 5×PCR buffer, 0.8 μl of Taq DNA polymerase, primer mixtures of the STR markers D2Mmu008 (0.8 μl), D3Mmu158 (1.2 μl), D4Mmu155 (0.8 μl), D6Mmu120 (1.6 μl), D7Mmu003 (1.2 μl), D8Mmu127 (1.2 μl), D10Mmu043 (0.4 μl), D13Mmu096 (0.7 μl), D14Mmu074 (1.6 μl), D16Mmu030 (1.6 μl), D18Mmu069 (1.2 μl), DYMmu001 (1.0 μl), DYMmu002 (0.8 μl), DYMmu003 (1.0 μl) and 2 μl of DNA.
The inventive multiplex PCR shown in exemplary embodiment 6 does not lead to any detectable alleles when using DNA of human origin or of rat, hamster or zebrafish.
Number | Date | Country | Kind |
---|---|---|---|
10 2017 005 000.7 | May 2017 | DE | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2018/063343 | 5/22/2018 | WO |