PREDICTIVE TEST FOR ADULT DOG BODY SIZE

Abstract
The invention provides a method of predicting the size of a dog that will be attained in adulthood, comprising typing the nucleotide(s) present for a single nucleotide polymorphic (SNP) marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions, and thereby predicting the size of the dog that will be attained in adulthood.
Description
FIELD OF THE INVENTION

The present invention relates to methods of predicting the size of a dog that will be attained in adulthood.


BACKGROUND OF THE INVENTION

Dogs have the widest variation in size of any mammalian species. The adult bodyweights of the largest breeds are up to 70 times more than those of the smallest breeds. According to the American Kennel Club, in 2006 the most popular large-breed dogs in the USA were the Labrador Retriever, the German Shepherd Dog and Golden Retrievers; the most popular small-breed dogs were Yorkshire Terriers, Dachshunds and Shih Tzus.


SUMMARY OF THE INVENTION

A genetic test for predicting the size of a dog that will be attained in adulthood has now been developed. The present inventors have discovered single nucleotide polymorphisms (SNPs) that are associated with the size of a dog. The identification of these polymorphisms provides the basis for a predictive test to predict the size that a dog will reach when it becomes an adult by screening for specific molecular markers. The predictive power of the test can be magnified using models that involve combining the results of typing one or more of the defined SNPs. Furthermore, the model can be refined for mixed breed dogs by determining the breed origin of the SNP markers in the dog. Once the size that a dog will become has been predicted, it is then possible to provide care recommendations to the dog owner or carer, such as appropriate diets, in order to achieve the best quality of life for the dog.


Accordingly, the invention provides a method of predicting the size of a dog that will be attained in adulthood, comprising typing the nucleotide(s) present for a single nucleotide polymorphic (SNP) marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions, and thereby predicting the size of the dog that will be attained in adulthood.


The invention further provides:


a method of preparing customised food for a dog that has had its future size predicted, the method comprising:


(a) predicting the size of a dog that will be attained in adulthood by a method according to the invention; and


(b) preparing food suitable for the dog, wherein the customised dog food comprises ingredients that are suitable for a dog of the predicted size, and/or does not include ingredients that are not suitable for a dog of the predicted size;


a method of providing care recommendations for a dog, the method comprising:

    • (a) predicting the size of the dog that will be attained in adulthood by a method according to the invention; and
    • (b) providing appropriate care recommendations to the dog's owner or carer;


a database comprising information relating to one or more polymorphisms identified in Table 1 or 2 and their association with size of a dog in adulthood;


a method of predicting the size of a dog that will be attained in adulthood, the method comprising:

    • (a) inputting data of the nucleotide(s), and optionally the breed origin of the nucleotide(s), present at one or more SNP marker positions in the dog's genome as defined herein to a computer system;
    • (b) comparing the data to a computer database, which database comprises information relating to one or more polymorphisms identified in Table 1 or 2 and their association with the size of a dog in adulthood; and
    • (c) predicting on the basis of the comparison the size of the dog that will be attained in adulthood;


a computer program encoded on a computer-readable medium and comprising program code means which, when executed, performs the method of the invention;


a computer storage medium comprising the computer program defined herein and the database defined herein;


a computer system arranged to perform a method according to the invention comprising:

    • (a) means for receiving data of the nucleotide(s) present at one or more SNP marker positions in the genome of a dog;
    • (b) a module for comparing the data with a database comprising one or more polymorphisms identified in Table 1 or 2 and their association with the size of a dog in adulthood; and
    • (c) means for predicting on the basis of said comparison the size of the dog that will be attained in adulthood;


a kit for carrying out the method of the invention comprising a probe or primer that is capable of detecting a polymorphism as defined herein;


a method of managing a disease condition influenced by the size of the dog, comprising predicting the size that the dog will attain in adulthood by a method according to the invention, wherein the dog has been determined to be susceptible to a condition influenced by size, and providing recommendations to the dog owner or dog carer to enable the management of the growth rate or size of the dog and to thereby reduce the likelihood of symptoms of the disease developing in the dog;

    • a method of determining whether the genome of a dog contains one or more SNP marker(s) predictive of the size that a dog will attain in adulthood, comprising typing the nucleotide(s) present for a SNP marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions, and optionally further comprising determining the breed origin of the nucleotide(s) present for a SNP marker; and


use of one or more SNP marker(s) present in the genome of a dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions for predicting the size that a dog will attain in adulthood.


BRIEF DESCRIPTION OF THE SEQUENCES

SEQ ID NO: 1 to 146 show the polynucleotide sequences encompassing the SNPs of the invention.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 illustrates schematically embodiments of functional components arranged to carry out the present invention.



FIG. 2 shows predicted log breed weight (BW) versus observed log BW by applying a model of the invention to 65 breeds using the average allele frequency for each SNP per breed and the average BW for each breed.



FIG. 3 shows a comparison of the predicted weight of 960 dogs calculated from the actual genotype of the dog compared with the average weight for the breed. Each time that average breed weight is referred to it in this document it should be understood to mean the mid-point in the weight range for that breed as determined by reference to “The Encyclopaedia of the dog” by Bruce Fogel, published by Dorling Kindersley, 2000.



FIG. 4 illustrates the testing of a model of the invention on mixed breed dogs. The information from Table 8 is plotted graphically. The actual weight (kg) is plotted against the predicted weight (kg) for the Mixed 48 set.



FIG. 5 is a graph of the same results as FIG. 4 except that the results for male (squares) and female (diamonds) dogs are distinguished.



FIG. 6 is a graph showing the effects of the modification matrix when applied to the Mixed 48 set. The arrows demonstrate the change in predicted weight after application of the modification matrix.





DETAILED DESCRIPTION OF THE INVENTION

The present inventors have discovered SNP markers in the dog genome that are determinative of the size of a dog. The present invention therefore provides a method of predicting the size of a dog that will be attained in adulthood using one or more of these SNP markers. The term “size” as used herein means the weight or height of the dog. The “predicted” size means that the result is in the form of an estimated, average or approximate size or in the form of a range of size values. The SNPs that have been discovered to be determinative of size are set out in Tables 1 and 2.


The present invention provides a method of predicting the size of a dog that will be attained in adulthood, comprising typing the nucleotide(s) present for a SNP marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions, and thereby predicting the size of the dog that will be attained in adulthood. The phrase “typing the nucleotide(s) present for a SNP marker” means genotyping the SNP marker. The presence or absence of a SNP marker is determined. Typically, the nucleotide present at the same position on both homologous chromosomes will be determined. A dog may therefore be determined to be homozygous for a first allele, heterozygous or homozygous for a second allele of the SNP. In discussions herein, a hypothetical first allele may be designated “A” and a hypothetical second allele may be designated “a”. In these discussions therefore, the following genotypes are possible for a SNP marker: AA (homozygous), Aa or aA (heterozygous) and aa (homozygous).


The present invention also provides a method of determining whether the genome of a dog contains one or more SNP marker(s) that are indicative of the size that a dog will attain in adulthood, comprising typing the nucleotide(s) for one or more SNP markers present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions. In other words, the invention provides a method of identifying whether or not one or more of the polymorphisms defined herein that are associated with dog size are present in the genome of the dog.


The invention further provides the use of one or more SNP marker(s) present in the genome of a dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions for predicting the size that a dog will attain in adulthood.


Any one of the polymorphic positions as defined herein may be typed directly, in other words by determining the nucleotide present at that position, or indirectly, for example by determining the nucleotide present at another polymorphic position that is in linkage disequilibrium with said polymorphic position. Examples of SNPs that are in linkage disequilibrium with the SNPs of Table 1 and can therefore be used to predict size are identified in Table 2.


Polymorphisms which are in linkage disequilibrium with each other in a population are typically found together on the same chromosome. Typically one is found at least 30% of the times, for example at least 40%, at least 50%, at least 70% or at least 90%, of the time the other is found on a particular chromosome in individuals in the population. Thus a polymorphism which is not a functional susceptibility polymorphism, but is in linkage disequilibrium with a functional polymorphism, may act as a marker indicating the presence of the functional polymorphism. A polymorphism that is in linkage disequilibrium with a polymorphism of the invention is indicative of the size a dog will attain in adulthood.


Polymorphisms which are in linkage disequilibrium with the polymorphisms mentioned herein are typically located within 9 mb, preferably within 5 mb, within 2 mb, within 1 mb, within 500 kb, within 400 kb, within 200 kb, within 100 kb, within 50 kb, within 10 kb, within 5 kb, within 1 kb, within 500 bp, within 100 bp, within 50 bp or within 10 bp of the polymorphism.


Any number and any combination of the SNP positions as described herein may be typed to carry out the invention. Preferably at least 2 SNP positions are typed, more preferably at least 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40 or 45 SNP positions are typed. The number of SNP positions typed may be from 1 to 50, from 2 to 40, from 5 to 30 or from 5 to 20. In a more preferred embodiment, the SNP positions are selected from those identified in Table 3. Accordingly, any of these 7 SNPs or any SNPs that are in linkage disequilibrium with any of these 7 SNPs may be typed. Preferably at least 2 of these 7 SNPs or SNPs in linkage disequilibrium are typed. More preferably at least 3, 4, 5, 6 or all 7 positions are typed. Preferably therefore, the nucleotide(s) that are typed are selected from positions equivalent to:

    • position 201 of SEQ ID NO: 7 (BICFPJ1149345, SNP 1);
    • position 201 of SEQ ID NO: 35 (BICF230J67378, SNP 2);
    • position 201 of SEQ ID NO: 58 (BICF235J47583, SNP 3);
    • position 201 of SEQ ID NO: 84 (BICFPJ401056, SNP 4);
    • position 201 of SEQ ID NO: 96 (BICF235J20169, SNP 5);
    • position 201 of SEQ ID NO: 111 (BICF235J29129, SNP 6); and
    • position 201 of SEQ ID NO: 146 (BICF235J47857, SNP 7), or one or more positions which are in linkage disequilibrium with any one of these positions.


Typing the nucleotide(s) present in the genome of the dog at a position equivalent to position 201 in a sequence identified in Table 1 or Table 2 may mean that the nucleotide present at this position in a sequence corresponding exactly with the sequence identified in Table 1 or Table 2 is typed. However, it will be understood that the exact sequences presented in SEQ ID NOs: 1 to 146 identified in Tables 1 or 2 will not necessarily be present in the dog to be tested. Typing the nucleotide present may therefore be at position 201 in a sequence identified in Table 1 or Table 2 or at an equivalent or corresponding position in the sequence. The term equivalent as used herein therefore means at or at a position corresponding to position 201. The sequence and thus the position of the SNP could for example vary because of deletions or additions of nucleotides in the genome of the dog. Those skilled in the art will be able to determine a position that corresponds to or is equivalent to position 201 in each of SEQ ID NOs: 1 to 146, using for example a computer program such as GAP, BESTFIT, COMPARE, ALIGN, PILEUP or BLAST. The UWGCG Package provides programs including GAP, BESTFIT, COMPARE, ALIGN and PILEUP that can be used to calculate homology or line up sequences (for example used on their default settings). The BLAST algorithm can also be used to compare or line up two sequences, typically on its default settings. Software for performing a BLAST comparison of two sequences is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm is further described below. Similar publicly available tools for the alignment and comparison of sequences may be found on the European Bioinformatics Institute website (http://www.ebi.ac.uk), for example the ALIGN and CLUSTALW programs.


A suitable model for predicting the size of a dog can be established by genotyping SNPs in Tables 1 or 2, or SNPs that are in linkage disequilibrium with those SNPs, in samples of dogs of different sizes and correlating the allele frequency for each SNP with the sizes of the dogs. One method of correlating the allele frequency is to determine the heterozygosity/homozygosity of each SNP for each dog sample in a panel of samples from dogs of different sizes, for example by giving an allele score for a homozygote (AA) as 0, for a homozygote for the other allele (aa) as 2 and for a heterozygote (Aa or aA) as 1. An average allele score can then be calculated for dogs of approximately the same size or breed.


An association measure can then be used to identify single SNPs associated with size. One example would be the Pearson's product moment correlation coefficient. The SNPs with the highest correlation coefficient are then suitable for incorporation into a model for predicting the particular size parameter of choice. Preferably, the correlation coefficient is greater than 0.3. More preferably, the correlation coefficient is greater than 0.4, 0.45, 0.5, 0.55, 0.6 or 0.65. Once the most significantly associated single SNPs have been identified then the best combination of these SNPs for predicting size can be identified using stepwise regression algorithms, for example by using a statistics package such as Stepwise. These SNPs can then be placed into a model which considers each SNP in series and in which the effect of each SNP is additive.


The inventors have established a model for using the SNPs of the invention to predict the weight of a dog. It will be appreciated that many different models with varying degrees of predictive power are possible, using any number or combination of the SNPs of the invention for predicting any size parameter such as height or weight.


An example of a model suitable for predicting the weight that a dog will attain is now described. This model utilises 7 of the SNPs from Table 2. These 7 SNPs are set out in Table 3 together. The model is as follows:






E(log(BW))=1.69202+0.25244X1−0.165X2+0.29516X3+0.51176X4−0.10618X5+0.26279X6−0.30707X7


where E(log(BW)) is “expected log-body weight in kg” and X1-7 represents the SNP score at SNPs 1 to 7. SNP scores are 0, 1, or 2, where 0 represents homozygotes for the first allele (AA), 1 represents heterozygotes (Aa and aA) and 2 represents homozygotes for the second allele (aa). The genotype allocated to each SNP score (0, 1 or 2) is set out in Table 3 for each of the 7 SNPs. In applying this equation to predict the average size for a breed the SNP score for all dogs in that breed are averaged.


The present invention provides a method of predicting the size of a dog that will be attained in adulthood, comprising (i) typing the nucleotide(s) present for a SNP marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions and (ii) inputting the results from (i) into a model that is predictive of the size of the dog. The genotyping results from step (i) may be provided in the form of genotypic values or SNP scores, where a homozygote for one allele is designated a first value (e.g. 0), a heterozygote is designated a second value (e.g. 1) and a homozygote for the other allele is designated a third value (e.g. 2). The predicted value of size may then be determined by multiplying the genotypic value for each SNP by a constant. The constant for each SNP may be different. The constant for each SNP may be the correlation coefficient for the particular SNP with size. The multiplication of the genotypic value for a SNP by a constant is then added to the multiplication of the genotypic value for a second SNP by a constant. This is repeated for each SNP so that the multiplications of each SNP by a constant are added together. Finally, a further constant may be added. The final result is the expected log-body weight in kg.


In one aspect of the invention therefore, the expected log-body weight in kg of a dog is determined by adding the multiplication of the genotypic value of a SNP marker defined herein by a constant and adding to a second constant.


The dog to be tested may be male or female. One general factor which influences how big a dog will be is its sex. Therefore, in one aspect of the method of the invention the sex of the dog may be determined. Determination may for example be by examination by a vet or by questioning the dog's owner. The results of one or more SNP genotypic values in the model are then multiplied by a sex specific multiplication factor. This multiplication factor may be applied to any number of the SNPs in the model. For example, it may be applied to 1, 2, 3, 4, 5, 6 or all 7 of the SNPs in Table 3.


In any of the methods of the invention described herein it is preferable to genotype all 7 of the SNPs in Table 3 or SNPs that are in linkage disequilibrium with those SNPs. More preferably, it is only the 7 SNPs in Table 3 that are genotyped.


A dog may be tested by a method of the invention at any age, for example from 0 to 12, 0 to 6, 0 to 5, 0 to 4, 0 to 3, 0 to 2 or 0 to 1 years old. Preferably the dog is tested at as young an age as possible, for example within the first year, first 6 months or first 3 months of its life. The dog is preferably tested before it is known how big the dog is going to grow. The aim is therefore to predict the size of the dog that will be attained in adulthood in order to provide care recommendations suitable for the size of the dog.


The dog to be tested by a method of the present invention may be of any breed. Typically the dog will have genetic inheritance of a breed selected from any of the breeds in Table 4 or 5. Popular breeds of dog may be selected from Boston Terrier, Boxer, Bulldog, Chihuahua, American Cocker Spaniel, Daschund, Dobermann Pinscher, German Shepherd Dog, Golden Retriever, Great Dane, Labrador Retriever, Maltese, Miniature Pinscher, Newfoundland, Parson Russell Terrier, Pekinese, Poodle, Poodle (Miniature), Pug, Rottweiler, Schnauzer (Miniature), Shih Tzu, Yorkshire Terrier. The dog may have genetic breed inheritance of a breed that is susceptible to a disease or condition that is affected by size such as canine hip dysplasia (CHD). Breeds of dog that are susceptible to CHD may be selected from Labrador, Golden retriever, German shepherd dog, Rottweiler and Newfoundland.


The dog may be a mixed or crossbred dog, or a mongrel or out-bred dog. The dog may have at least 25%, at least 50%, or at least 100% of its genome inherited from any pure breed or more preferably from any of the breeds described herein. The dog may be a pure-bred. In one embodiment of the invention, one or both parents of the dog to be tested are or were pure-bred dogs. In another embodiment, one or more grandparents are or were pure-bred dogs. One, two, three or all four of the grandparents of the dog that is tested may be or may have been pure-bred dogs.


The method of the invention is particularly useful for predicting the size of a mixed or crossbred dog, or a mongrel or out-bred dog, as information concerning the size of such a dog is less likely to be available to the dog owner or carer compared with a pure-bred dog.


Example 3 demonstrates the capability of a model of the invention to accurately predict the size of mixed-breed dogs. The model can be further refined as described in Example 4 by using information concerning the breed origin of the individual alleles of the SNP markers. This is useful because, in certain instances, the same SNP is not always associated with the same gene allele in all breeds. For example, for the IGF1 SNP (BICFPJ40156; SNP4; SEQ ID NO: 84) almost all large dog breeds are homozygous for the “a” allele (or “2”). However, despite being a large breed, Rottweilers almost always have the opposite “A” allele (or “0”) more commonly associated with small dog breeds. This may indicate that Rottweilers have the “small” version of IGF1. However it is possible that they have the “large” version of the gene but that sometime in their history the “large” version of the gene has become associated with the SNP that is usually associated with the “small” version of the gene. If this holds true, in circumstances when the IGF1 gene has come from a Rottweiler, the genotype of the IGF1 SNP would be misleading.


The invention therefore provides means of refining a model of the invention for mixed breed dogs by determining the breed origin of the SNP marker alleles for the dog. If the mixed breed dog contains genetic breed inheritance of a breed that has an atypical allele frequency for one or more of the SNP markers in the model, the model can be refined accordingly to take this into account.


In more detail, the breed calls of the individual chromosomes in the mixed breed dog can be used to inform the model. This involves, in certain circumstances, altering the SNP call that is applied to the model, based on the breed that that SNP is thought to have originated from. To follow the example already discussed, if the IGF1 SNP came from a chromosome determined to have come from a Rottweiler, the genotype result would be modified by substituting the SNP allele that is normally associated with the “large” version of IGF1. To achieve this modification, a conversion matrix can be used which takes the genotyped SNP output and translates it into the modified result for dog breeds where it is believed that the SNP may be associated with the wrong allele.


Table 9 provides an example of a conversion matrix for three atypical dog breeds for the IGF 1 SNP (BICFPJ40156; SNP 4; SEQ ID NO: 84). Each of these dog breeds show unusual IGF1 SNP results compared to their size. This is clear from the average allele frequency of the IGF1 SNP for the atypical breeds compared with similar sized breeds as shown in columns 2 and 3 of Table 9. Column 4 lists the possible genotypes that could be obtained from a sample from a dog. The genotypes are hypothetical, where “A” represents one allele and “a” represents the other allele. The genotype may be converted into a score, as for example in column 5, where 0 represents homozygous for a first allele, 1 represents heterozygous and 2 represents homozygous for the second allele. Column 6 lists the possible predicted alleles of a second breed (i.e. a non-atypical breed) that contributes to the genetic breed background of the dog. The allele of the “atypical” breed may then be determined by subtracting the predicted allele of the second breed from the overall genotyped result (column 7).


The predicted allele of the atypical breed can be modified, based on the average allele frequency of the SNP in similar sized breeds (column 8). The resulting modified genotype comprises the modified allele from the atypical breed and an unmodified allele from the second non-atypical breed (column 9). The modified genotype can then be converted into a genotype score (column 10), which can then be applied to the model formula for predicting size.


In order to apply such a conversion matrix, and in one aspect of the invention therefore, the breed origin of each allele for a SNP genotype is determined. This may be determined by (i) determining the breed origin of the chromosome which comprises each allele for a SNP genotype and (ii) predicting which breed contributed to which allele.


The breed origin of each allele for a SNP genotype may be determined by determining the genetic breed background of the dog, i.e. by determining which breeds contributed to the genetic make-up of the dog. The test may be a genetic test, such as a SNP-based or microsatellite-based marker test. An example of a SNP-based test is the commercially available WISDOM PANEL™ MX mixed-breed test. The test may therefore involve genotyping a sample from the dog with a panel of SNPs which allow the breed signatures of the dog to be determined.


A genome-wide panel of SNPs may be used to determine the genetic breed background of the dog. Alternatively, it is possible to focus on the chromosome containing the “size” SNP of interest to determine the breed origin of the chromosome, for example, by using a panel of SNPs located on the chromosome of interest.


Once the breeds that contribute to the genetic make-up of the dog have been determined it is then necessary to determine which breed contributed to which allele of the genotype. When the genotyped SNP is homozygous (0 or 2) it is self-evident what the allele that has come from the atypical breed must be. However, when the genotyped SNP is heterozygous it is not clear which of the two breeds that contributed to the formation of the chromosome supplied which allele of the gene. It is necessary therefore to predict the individual genotypes of the chromosomes that came from the problematic breed and the second breed. This can be achieved by reference to a matrix of average allele frequencies for each breed, for example Table 8.


If according to the matrix, all dogs in the second breed are homozygous for a SNP it is possible to confidently predict the allele of the second breed and, by subtraction, the allele of the problematic breed. If according to the matrix the second breed has an average allele frequency nearer to 1 then it is more difficult to determine which allele comes from which breed. In this instance the allele of the second breed can be assigned using a probability that reflects the average allele frequency in that breed. For example, if the allele frequency of the SNP in the breed is 0.8, then the allele can be assigned as follows: A random number between 0 and 2 is selected (to 3 decimal places), for example 1.654. If this number is smaller than the allele frequency of the SNP in the breed in question then the allele is assigned as 0. If it is larger or equal to the allele frequency, then the allele is assigned as 2. In this case, as 1.654 is larger than the average allele frequency for the breed (0.8), the allele of the second breed would be assigned as 2. The prediction of alleles can be carried out using any known statistical package.


Once the breed origin of each allele in the SNP genotype has been predicted, the genotype can be modified to take into account the origin of one or both alleles being from a breed that is known to be atypical for that SNP. After modifying the SNP results, for example by using a conversion matrix, the prediction model/algorithm of the invention can then be used to make a modified prediction of the size of the dog. To illustrate this principle, the conversion matrix described in Table 9 has been used to modify the results of the dogs that are present in a sample of mixed-breed dogs for the IGF1 SNP (Example 4).


According to one aspect of the invention therefore, the method of predicting the size of a dog that will be attained in adulthood further comprises determining the breed origin of the nucleotide(s) present for a SNP marker. The method may comprise determining the breed origin of both alleles of a SNP genotype for one or more SNP markers. The method may comprise determining the breed origin of both alleles of a SNP genotype for any of the SNP markers in Table 1 or 2.


The breed origin of the nucleotide(s) present for a SNP marker may be determined by genotyping a sample from the dog with a panel of genetic markers, such as SNP markers or microsatellites. The predictive test of the invention may therefore be carried out in conjunction with one or more tests for determining the genetic breed background of the dog. Once the genetic breed background has been determined, SNP genotypes can then be modified, if necessary, to take account of the contributions of one or more breeds that have atypical allele frequencies for the particular SNPs. Once the genotypes have been modified, the genotype scores can be applied to the size prediction model in the manner described above.


The predictive test of the invention may be carried out in conjunction with one or more other other predictive or diagnostic tests such as determining susceptibility to one or more diseases. The test may be used in conjunction with a disease susceptibility test to help improve the accuracy of the disease susceptibility prediction, for conditions where expression of the disease phenotype is influenced by the size of the dog. The aim is therefore to improve information about the likelihood of developing the condition.


The test may also be used in conjunction with a disease susceptibility test as part of a preventative or management regime for the condition. In this case, a positive disease susceptibility result for a condition that is influenced by size drives the use of the size predictive test to allow the management of the dogs growth rate/weight in order to reduce the likelihood of developing disease symptoms.


An example of a disease condition that is influenced by size is canine hip dysplasia (CHD). CHD is a congenital disease that causes the hip joints in affected dogs to grow abnormally. The larger the dog, the more likely the dog is to suffer from symptoms of this disease.


Detection of Polymorphisms

The detection of polymorphisms according to the invention may comprise contacting a polynucleotide or protein of the dog with a specific binding agent for a polymorphism and determining whether the agent binds to the polynucleotide or protein, wherein binding of the agent indicates the presence of the polymorphism, and lack of binding of the agent indicates the absence of the polymorphism.


The method is generally carried out in vitro on a sample from the dog, where the sample contains DNA from the dog. The sample typically comprises a body fluid and/or cells of the dog and may, for example, be obtained using a swab, such as a mouth swab. The sample may be a blood, urine, saliva, skin, cheek cell or hair root sample. The sample is typically processed before the method is carried out, for example DNA extraction may be carried out. The polynucleotide or protein in the sample may be cleaved either physically or chemically, for example using a suitable enzyme. In one embodiment the part of polynucleotide in the sample is copied or amplified, for example by cloning or using a PCR based method prior to detecting the polymorphism.


In the present invention, any one or more methods may comprise determining the presence or absence of one or more polymorphisms in the dog. The polymorphism is typically detected by directly determining the presence of the polymorphic sequence in a polynucleotide or protein of the dog. Such a polynucleotide is typically genomic DNA, mRNA or cDNA. The polymorphism may be detected by any suitable method such as those mentioned below.


A specific binding agent is an agent that binds with preferential or high affinity to the protein or polypeptide having the polymorphism but does not bind or binds with only low affinity to other polypeptides or proteins. The specific binding agent may be a probe or primer. The probe may be a protein (such as an antibody) or an oligonucleotide. The probe may be labelled or may be capable of being labelled indirectly. The binding of the probe to the polynucleotide or protein may be used to immobilise either the probe or the polynucleotide or protein.


Generally in the method, a polymorphism can be detected by determining the binding of the agent to the polymorphic polynucleotide or protein of the dog. However in one embodiment the agent is also able to bind the corresponding wild-type sequence, for example by binding the nucleotides or amino acids which flank the variant position, although the manner of binding to the wild-type sequence will be detectably different to the binding of a polynucleotide or protein containing the polymorphism.


The method may be based on an oligonucleotide ligation assay in which two oligonucleotide probes are used. These probes bind to adjacent areas on the polynucleotide that contains the polymorphism, allowing after binding the two probes to be ligated together by an appropriate ligase enzyme. However the presence of a single mismatch within one of the probes may disrupt binding and ligation. Thus ligated probes will only occur with a polynucleotide that contains the polymorphism, and therefore the detection of the ligated product may be used to determine the presence of the polymorphism.


In one embodiment the probe is used in a heteroduplex analysis based system. In such a system when the probe is bound to polynucleotide sequence containing the polymorphism it forms a heteroduplex at the site where the polymorphism occurs and hence does not form a double strand structure. Such a heteroduplex structure can be detected by the use of a single or double strand specific enzyme. Typically the probe is an RNA probe, the heteroduplex region is cleaved using RNAase H and the polymorphism is detected by detecting the cleavage products.


The method may be based on fluorescent chemical cleavage mismatch analysis which is described for example in PCR Methods and Applications 3, 268-71 (1994) and Proc. Natl. Acad. Sci. 85, 4397-4401 (1998).


In one embodiment a PCR primer is used that primes a PCR reaction only if it binds a polynucleotide containing the polymorphism, for example a sequence-specific PCR system, and the presence of the polymorphism may be determined by detecting the PCR product. Preferably the region of the primer that is complementary to the polymorphism is at or near the 3′ end of the primer. The presence of the polymorphism may be determined using a fluorescent dye and quenching agent-based PCR assay such as the Taqman PCR detection system.


The specific binding agent may be capable of specifically binding the amino acid sequence encoded by a polymorphic sequence. For example, the agent may be an antibody or antibody fragment. The detection method may be based on an ELISA system. The method may be an RFLP based system. This can be used if the presence of the polymorphism in the polynucleotide creates or destroys a restriction site that is recognised by a restriction enzyme.


The presence of the polymorphism may be determined based on the change that the presence of the polymorphism makes to the mobility of the polynucleotide or protein during gel electrophoresis. In the case of a polynucleotide, single-stranded conformation polymorphism (SSCP) or denaturing gradient gel electrophoresis (DDGE) analysis may be used. In another method of detecting the polymorphism, a polynucleotide comprising the polymorphic region is sequenced across the region that contains the polymorphism to determine the presence of the polymorphism.


The presence of the polymorphism may be detected by means of fluorescence resonance energy transfer (FRET). In particular, the polymorphism may be detected by means of a dual hybridisation probe system. This method involves the use of two oligonucleotide probes that are located close to each other and that are complementary to an internal segment of a target polynucleotide of interest, where each of the two probes is labelled with a fluorophore. Any suitable fluorescent label or dye may be used as the fluorophore, such that the emission wavelength of the fluorophore on one probe (the donor) overlaps the excitation wavelength of the fluorophore on the second probe (the acceptor). A typical donor fluorophore is fluorescein (FAM), and typical acceptor fluorophores include Texas red, rhodamine, LC-640, LC-705 and cyanine 5 (Cy5).


In order for fluorescence resonance energy transfer to take place, the two fluorophores need to come into close proximity on hybridisation of both probes to the target. When the donor fluorophore is excited with an appropriate wavelength of light, the emission spectrum energy is transferred to the fluorophore on the acceptor probe resulting in its fluorescence. Therefore, detection of this wavelength of light, during excitation at the wavelength appropriate for the donor fluorophore, indicates hybridisation and close association of the fluorophores on the two probes. Each probe may be labelled with a fluorophore at one end such that the probe located upstream (5′) is labelled at its 3′ end, and the probe located downstream (3′) is labelled at is 5′ end. The gap between the two probes when bound to the target sequence may be from 1 to 20 nucleotides, preferably from 1 to 17 nucleotides, more preferably from 1 to 10 nucleotides, such as a gap of 1, 2, 4, 6, 8 or 10 nucleotides.


The first of the two probes may be designed to bind to a conserved sequence of the gene adjacent to a polymorphism and the second probe may be designed to bind to a region including one or more polymorphisms. Polymorphisms within the sequence of the gene targeted by the second probe can be detected by measuring the change in melting temperature caused by the resulting base mismatches. The extent of the change in the melting temperature will be dependent on the number and base types involved in the nucleotide polymorphisms.


Polymorphism typing may also be performed using a primer extension technique. In this technique, the target region surrounding the polymorphic site is copied or amplified for example using PCR. A single base sequencing reaction is then performed using a primer that anneals one base away from the polymorphic site (allele-specific nucleotide incorporation). The primer extension product is then detected to determine the nucleotide present at the polymorphic site. There are several ways in which the extension product can be detected. In one detection method for example, fluorescently labelled dideoxynucleotide terminators are used to stop the extension reaction at the polymorphic site. Alternatively, mass-modified dideoxynucleotide terminators are used and the primer extension products are detected using mass spectrometry. By specifically labelling one or more of the terminators, the sequence of the extended primer, and hence the nucleotide present at the polymorphic site can be deduced. More than one reaction product can be analysed per reaction and consequently the nucleotide present on both homologous chromosomes can be determined if more than one terminator is specifically labelled.


The invention further provides primers or probes that may be used in the detection of any of the SNPs defined herein for use in the prediction of size. Polynucleotides of the invention may also be used as primers for primer extension reactions to detect the SNPs defined herein.


Such primers, probes and other polynucleotide fragments will preferably be at least 10, preferably at least 15 or at least 20, for example at least 25, at least 30 or at least 40 nucleotides in length. They will typically be up to 40, 50, 60, 70, 100 or 150 nucleotides in length. Probes and fragments can be longer than 150 nucleotides in length, for example up to 200, 300, 400, 500, 600, 700 nucleotides in length, or even up to a few nucleotides, such as five or ten nucleotides, short of a full length polynucleotide sequence of the invention.


Primers and probes for genotyping the SNPs of the invention may be designed using any suitable design software known in the art using the SNP sequences in Tables 1 and 2. Homologues of these polynucleotide sequences would also be suitable for designing primers and probes. Such homologues typically have at least 70% homology, preferably at least 80, 90%, 95%, 97% or 99% homology, for example over a region of at least 15, 20, 30, 100 more contiguous nucleotides. The homology may be calculated on the basis of nucleotide identity (sometimes referred to as “hard homology”).


For example the UWGCG Package provides the BESTFIT program that can be used to calculate homology (for example used on its default settings) (Devereux et al (1984) Nucleic Acids Research 12, p387-395). The PILEUP and BLAST algorithms can be used to calculate homology or line up sequences (such as identifying equivalent or corresponding sequences (typically on their default settings), for example as described in Altschul S. F. (1993) J Mol Evol 36:290-300; Altschul, S, F et al (1990) J Mol Biol 215:403-10.


Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence that either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Extensions for the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment. The BLAST program uses as default a word length (W) of 11, the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands.


The BLAST algorithm performs a statistical analysis of the similarity between two sequences; see e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5787. One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two polynucleotide sequences would occur by chance. For example, a sequence is considered similar to another sequence if the smallest sum probability in comparison of the first sequence to the second sequence is less than about 1, preferably less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.


The homologous sequence typically differs by at least 1, 2, 5, 10, 20 or more mutations, which may be substitutions, deletions or insertions of nucleotides


The polynucleotides of the invention such as primers or probes may be present in an isolated or substantially purified form. They may be mixed with carriers or diluents that will not interfere with their intended use and still be regarded as substantially isolated. They may also be in a substantially purified form, in which case they will generally comprise at least 90%, e.g. at least 95%, 98% or 99%, of polynucleotides of the preparation.


Detector Antibodies

A detector antibody is an antibody that is specific for one polymorphism but does not bind to any other polymorphism as described herein. Detector antibodies are for example useful in purification, isolation or screening methods involving immunoprecipitation techniques.


Antibodies may be raised against specific epitopes of the polypeptides of the invention. An antibody, or other compound, “specifically binds” to a polypeptide when it binds with preferential or high affinity to the protein for which it is specific but does substantially bind not bind or binds with only low affinity to other polypeptides. A variety of protocols for competitive binding or immunoradiometric assays to determine the specific binding capability of an antibody are well known in the art (see for example Maddox et al, J. Exp. Med. 158, 1211-1226, 1993). Such immunoassays typically involve the formation of complexes between the specific protein and its antibody and the measurement of complex formation.


For the purposes of this invention, the term “antibody”, unless specified to the contrary, includes fragments that bind a polypeptide of the invention. Such fragments include Fv, F(ab′) and F(ab′)2 fragments, as well as single chain antibodies. Furthermore, the antibodies and fragment thereof may be chimeric antibodies, CDR-grafted antibodies or humanized antibodies.


Antibodies may be used in a method for detecting polypeptides of the invention in a biological sample (such as any such sample mentioned herein), which method comprises:


I providing an antibody of the invention;


II incubating a biological sample with said antibody under conditions which allow for the formation of an antibody-antigen complex; and


III determining whether antibody-antigen complex comprising said antibody is formed.


Antibodies of the invention can be produced by any suitable method. Means for preparing and characterising antibodies are well known in the art, see for example Harlow and Lane (1988) “Antibodies: A Laboratory Manual”, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. For example, an antibody may be produced by raising an antibody in a host animal against the whole polypeptide or a fragment thereof, for example an antigenic epitope thereof, hereinafter the “immunogen”. The fragment may be any of the fragments mentioned herein (typically at least 10 or at least 15 amino acids long).


A method for producing a polyclonal antibody comprises immunizing a suitable host animal, for example an experimental animal, with the immunogen and isolating immunoglobulins from the animal's serum. The animal may therefore be inoculated with the immunogen, blood subsequently removed from the animal and the IgG fraction purified. A method for producing a monoclonal antibody comprises immortalizing cells which produce the desired antibody. Hybridoma cells may be produced by fusing spleen cells from an inoculated experimental animal with tumour cells (Kohler and Milstein (1975) Nature 256, 495-497).


An immortalized cell producing the desired antibody may be selected by a conventional procedure. The hybridomas may be grown in culture or injected intraperitoneally for formation of ascites fluid or into the blood stream of an allogenic host or immunocompromised host. Human antibody may be prepared by in vitro immunisation of human lymphocytes, followed by transformation of the lymphocytes with Epstein-Barr virus.


For the production of both monoclonal and polyclonal antibodies, the experimental animal is suitably a goat, rabbit, rat, mouse, guinea pig, chicken, sheep or horse. If desired, the immunogen may be administered as a conjugate in which the immunogen is coupled, for example via a side chain of one of the amino acid residues, to a suitable carrier. The carrier molecule is typically a physiologically acceptable carrier. The antibody obtained may be isolated and, if desired, purified.


Detection Kit

The invention also provides a kit that comprises means for typing one or more of the polymorphisms defined herein. In particular, such means may include a specific binding agent, probe, primer, pair or combination of primers, or antibody, including an antibody fragment, as defined herein which is capable of detecting or aiding detection of the polymorphisms defined herein. The primer or pair or combination of primers may be sequence specific primers that only cause PCR amplification of a polynucleotide sequence comprising the polymorphism to be detected, as discussed herein. The primer or pair of primers may alternatively not be specific for the polymorphic nucleotide, but may be specific for the region upstream (5′) and/or downstream (3′). These primers allow the region encompassing the polymorphic nucleotide to be copied. A kit suitable for use in the primer-extension technique may specifically include labelled dideoxynucleotide triphosphates (ddNTPs). These may for example be fluorescently labelled or mass modified to enable detection of the extension product and consequently determination of the nucleotide present at the polymorphic position.


The kit may also comprise a specific binding agent, probe, primer, pair or combination of primers, or antibody that is capable of detecting the absence of the polymorphism. The kit may further comprise buffers or aqueous solutions.


The kit may additionally comprise one or more other reagents or instruments that enable any of the embodiments of the method mentioned above to be carried out. Such reagents or instruments may include one or more of the following: a means to detect the binding of the agent to the polymorphism, a detectable label such as a fluorescent label, an enzyme able to act on a polynucleotide, typically a polymerase, restriction enzyme, ligase, RNAse H or an enzyme which can attach a label to a polynucleotide, suitable buffer(s) or aqueous solutions for enzyme reagents, PCR primers which bind to regions flanking the polymorphism as discussed herein, a positive and/or negative control, a gel electrophoresis apparatus, a means to isolate DNA from sample, a means to obtain a sample from the individual, such as swab or an instrument comprising a needle, or a support comprising wells on which detection reactions can be carried out. The kit may be, or include, an array such as a polynucleotide array comprising the specific binding agent, preferably a probe, of the invention. The kit typically includes a set of instructions for using the kit.


Care Recommendations and Customised Food

In one aspect, the invention relates to a customised diet for a dog that has been predicted to attain a particular size. Such a food may be in the form of, for example, wet pet foods, semi-moist pet foods, dry pet foods and pet treats. Wet pet food generally has a moisture content above 65%. Semi-moist pet food typically has a moisture content between 20-65% and can include humectants and other ingredients to prevent microbial growth. Dry pet food, also called kibble, generally has a moisture content below 20% and its processing typically includes extruding, drying and/or baking in heat. The ingredients of a dry pet food generally include cereal, grains, meats, poultry, fats, vitamins and minerals. The ingredients are typically mixed and put through an extruder/cooker. The product is then typically shaped and dried, and after drying, flavours and fats may be coated or sprayed onto the dry product.


The invention therefore provides a method of preparing customised food for a dog that has had its future size predicted, the method comprising:


(a) predicting the size of a dog that will be attained in adulthood by a method according to the invention; and


(b) preparing food suitable for the dog, wherein the customised dog food comprises ingredients that are suitable for a dog of the predicted size, and/or does not include ingredients that are not suitable for a dog of the predicted size.


Diets tailored specifically to the size of the dog are available commercially. For example, Royal Canin produces four diets called Mini, Medium, Maxi and Giant. Each diet has the appropriate nutritional and energy specification to ensure that the dog receives the correct balance of nutrients and sufficient (but not excessive) calories for its size. Versions of each diet are available to feed at puppy, junior and adult stages. The size predictive test of the invention can be used to ensure that a puppy is placed onto the correct diet (e.g. Mini, Medium, Maxi or Giant) to ensure that its energy and nutritional requirements are met.


The size of the dog also influences its risk from certain conditions which can be countered by the use of an appropriate diet. For example, small dogs are at greater risk from tooth decay which can be countered by the use of sodium polyphosphates in the diet helping to trap calcium and therefore reduce the build up of tartar that leads to tooth decay. Alternatively, large dogs are more prone to suffering joint problems because of their increased weight, therefore diets that include joint protecting substances such as chondroitin are advantageous for large dogs. Thus the use of the size prediction test allows the dog to be placed on a diet which both has the correct energy requirements but also contains additives to counter size specific risk factors for its size.


The invention also relates to providing care recommendations to a dog owner, veterinarian or dog carer to enable the management of the dog's weight. The predicted size of the dog established using the test of the invention acts as a guide to the dog owner, veterinarian or dog carer of the size that the dog should become and is therefore a useful tool in managing the weight of a dog and in combating obesity.


Furthermore, as explained above, the size prediction test may be used in conjunction with a disease susceptibility test. The size prediction test may improve the accuracy of disease susceptibility prediction for diseases where expression of the disease phenotype is influenced by the size of the dog. Alternatively, following a positive determination of susceptibility to a disease or condition that is influenced by size, the size prediction test may be useful to allow the management of the dog's growth rate or weight. This will reduce the likelihood of the dog developing disease symptoms. Care recommendations that are provided to the dog owner, veterinarian or carer may therefore relate to growth rate or weight management.


Bioinformatics

The sequences of the polymorphisms may be stored in an electronic format, for example in a computer database. Accordingly, the invention provides a database comprising information relating to one or more polymorphisms in Tables 1 or 2 and the association of the polymorphisms with size. The database may include further information about the polymorphism, for example the degree of association of the polymorphism with dog size or the breed origin of the alleles.


A database as described herein may be used to predict the size of a dog that will be attained in adulthood. Such a determination may be carried out by electronic means, for example by using a computer system (such as a PC). Typically, the determination will be carried out by inputting genetic data from the dog to a computer system; comparing the genetic data to a database comprising information relating to one or more polymorphisms in Tables 1 or 2 and the association of the polymorphisms with size; and on the basis of this comparison, predicting the size of a dog that will be attained in adulthood. Information concerning the breed origin of the alleles of the polymorphism may optionally be inputted to the computer system in order to aid size determination of a mixed-breed dog.


The invention also provides a computer program comprising program code means for performing all the steps of a method of the invention when said program is run on a computer. Also provided is a computer program product comprising program code means stored on a computer readable medium for performing a method of the invention when said program is run on a computer. A computer program product comprising program code means on a carrier wave that, when executed on a computer system, instruct the computer system to perform a method of the invention is additionally provided.


As illustrated in FIG. 1, the invention also provides an apparatus arranged to perform a method according to the invention. The apparatus typically comprises a computer system, such as a PC. In one embodiment, the computer system comprises: means 20 for receiving genetic data from the dog; a module 30 for comparing the data with a database 10 comprising information relating to polymorphisms; and means 40 for predicting on the basis of said comparison the size of a dog that will be attained in adulthood.


The invention is illustrated by the following Examples:


EXAMPLE 1
Methodology

An investigation was conducted to identify SNPs in the canine genome involved in size determination. SNPs were investigated in the 65 dog breeds set out in Tables 4 and 5.


Firstly, for each of the breeds, the average weight and height of the breed was determined using the mid point in the weight or height range for that breed taken from “The Encyclopaedia of the dog” by Bruce Fogel, published by Dorling Kindersley, 2000. Each SNP out of two collections of SNPs (described below) was genotyped in samples of dog genomic DNA for each breed. For each SNP, the genotype in each dog sample was given a designated allele score: a homozygote for one allele was designated as 0, a homozygote for the other allele was designated as 2 and a heterozygote was designated as 1. Then, for each SNP the average allele score per breed was calculated. SNPs that are near to genes that are important for determining breed characteristics tend towards homozygosity, i.e. the average is near to 0 or 2.


To then find which SNPs were best at discriminating size, we grouped the breeds into dogs of a similar size (either height or weight depending on the analysis being performed) and then took the average SNP score across the whole group. Finally, for each SNP we looked for the largest difference in allele score between two groups of dogs of differing average sizes. As an example at the extreme end that meant looking for the SNPs with the largest difference in SNP score between a group of tiny dogs including breeds like Chihuahuas and Yorkshire terriers and another group containing breeds like Great danes and Mastiffs. We then ranked all the SNPs for the difference in SNP score between size groups; those with the largest score were best at separating breed based on size.


Datasets

Two datasets of SNP genotyping were used for the project:


Dataset 1 comprised data from 3140 dogs genotyped from 87 different breeds at 4608 SNPs. These SNPs are spread out relatively evenly across the genome (with the exception of the sex chromosomes which are not represented).


Dataset 2 comprised data for SNPs selected from regions that were good at distinguishing between breeds in dataset 1. As a result dataset 2 had less SNPs (1536) and these are distributed at a much smaller number of locations on the genome but in greater numbers at each location. In addition the number of samples was increased (4140) and the number of breeds covered was increased dramatically to 163.


To reduce the problem of artificially enhanced homozygosity caused by low sample numbers per breed, dogs from breeds that were represented less than 7 times in the dataset were removed.


Analysing the Data.

Both datasets were analysed for height and weight. In determining which SNPs were the best to pursue the following criteria were taken into account:

    • 1 SNPs were ranked according to largest difference in allele score between groups.
    • 2 Extra importance was ascribed to locations where several nearby SNPs all scored highly.
    • 3 Extra importance was given to SNPs that were located near to genes known or suspected to be involved in size regulation.


Using these criteria, 102 potentially interesting loci were selected for the new round of genotyping that was to follow. Once we had identified potentially interesting regions of the genome the next step was to genotype further SNPs in these regions to both confirm whether they were significant and also to potentially identify new SNPs more tightly linked to the relevant alleles.


SNP Selection

Once the locations had been determined, and the number of SNPs chosen, new SNPs were selected from the Can Fam 1 SNP list downloaded from the Broad institute website (http://www.broad.mit.edu/mammals/dog/snp2/).


To select the SNPs the following criteria were applied:

    • 1 SNPs were chosen over a region of 600 KB which was centred about either the SNP identified in the data analysis or the middle of the candidate gene.
    • 2 SNPs were spread evenly over this region in close pairs.
    • 3 SNPs with more than 3 N's in the 300 base pairs before or after the SNP were not selected.
    • 4 SNPs were selected in decreasing priority order (depending on the source of the SNP) from the list below:
      • Priority 1—SNPs from Multiple breeds (other than Boxer, Poodle and wolf).
      • Priority 2—SNPs from one breed (other than Boxer, Poodle and wolf).
      • Priority 3—SNPs from Boxer and Poodle
      • Priority 4—SNPs from Boxer or Poodle only
      • Priority 5—SNPs from wolf breeds only.


The reason for this priority order is because it was observed that many of the SNPs identified were from multiple breeds other than boxer and poodle. Firstly this may be because the more breeds a SNP has been identified in the more likely it is actually to be a SNP. Secondly some of these other breeds differ in size from the boxer and therefore SNPs from these breeds could be more likely to be involved in size determination.


An “A” list of 2000 SNPs was selected and a corresponding “B” list was also selected. The A list was sent to Sequenom for analysis, the SNPs that failed design criteria were replaced by corresponding SNPs from the B list.


Selecting Samples

Selecting samples was a balance between cost and managing to cover all the different sizes of breeds. To avoid problems with low sample numbers giving artificial homozygosity, breeds where a minimum of 10 samples were available were selected. In total 960 samples were selected from 65 breeds. The list of breeds and sample number selected can be seen in Table 4.


A Model for Predicting Size.

Once the genotyping had been completed, to identify SNPs predictive of body size differences, a combination of single and multi-marker analysis were undertaken. The data set consisted of 1,579 SNPs genotyped in a total 960 dogs from n=65 AKC recognized breeds with an average sample size of 14.8 dogs per breed (range 6-25).


Two measures of association were used to identify single SNPs associated with log-transformed average male body weight [log(BW)]. The first is Pearson's product moment correlation coefficient, r, which follows a t-distribution with (n−2) degrees under the null hypothesis of no association between marker frequency and log(BW), assuming the data follow independent normal distributions (testing for a significant Pearson's product moment correlation is equivalent to testing for a significant regression of log(BW) on single marker (SNP) allele frequency). The second measure of association we considered was Kendall's τ statistic which tests for significant association using the joint distribution of ranks of the observations. That is, the observations themselves are not used, but rather their relative ordering (1st, 2nd, 3rd, etc.) for both log(BW) and allele frequency. We can think of this test as measuring whether breeds with high (or low body weight) tend to have high (or low) allele frequencies without regard to the actual values of the average body weight or allele frequencies.


Overall, we found that 302 SNPs out of 1,579 tested (19.2%) were significantly associated with log(BW) at the α=10% significance level. A measure of how well individual SNPs predict average body weight differences among breeds is the square of the Pearson's product moment correlation coefficient (equivalent to the R2 statistic in Ordinary Least Squares regression). The SNPs surveyed here showed a range of R2 from 0 to 63% when considered individually. Since body size is a typical quantitative trait (and, thus, likely to be influenced by many genes acting additively), we sought to improve upon single-marker analyses by searching for combinations of SNPs that yield high R2 when considered together.


The number of distinct regression models for K predictor variables is 2K-1, meaning that it is computationally impossible to consider all possible combinations of 1,579 SNPs when seeking to predict log(BW) (or even for the subset of SNPs that show strong association at the 10% level). Furthermore, since the number of independent observations available for the regression is the number of breeds in our analysis (n=65), models with more than 64 SNPs will perfectly fit the data. To overcome these hurdles, we used backwards and forwards stepwise regression algorithms implemented in the R statistics package Stepwise. Briefly, stepwise regression algorithms iteratively build regression models by successively adding or removing variables based on the t-statistic of estimated regression coefficients. The terms “backwards” and “forwards” describe whether one begins with the full model and removes terms or begins with the mean model and additively add terms. For our data, both approaches converged to the same solution when the full model contained the 60 most significant singly associated SNPs. The final solution we obtained contained 7 SNPs across 6 chromosomes as well as an intercept term (see Table 3 for the 7 SNPs). The adjusted R-squared for this model was 85.8% indicating a very high predictive ability for log(BW). The final prediction equation is:






E(log(BW))=1.69202+0.25244X1−0.165X2+0.29516X3+0.51176X4−0.10618X5+0.26279X6−0.30707X7


where E(log(BW)) is “expected log-body weight in kg” and X1-7 represents the SNP score at SNPs 1 to 7. SNP scores are either 0, 1, or 2, where 0 represents homozygotes for allele “A”, 1 represents heterozygotes and 2 represents homozygotes for allele “a”. The genotype allocated to each SNP score (0, 1 or 2) is set out in Table 3 for each of the 7 SNPs.


Applying the model to the 65 breeds used in the genotyping, using the average allele frequency per breed, gives the results in Table 5. This is plotted graphically in FIG. 2 (BW=Body weight).


EXAMPLE 2
Testing the Model

The model determined in Example 1 was tested by using the model to calculate the predicted size of the 960 dogs that went into the size genotyping. In this test we compared the predicted size of the dog calculated from the actual genotype of the dog with the average size for the breed. A good correlation between the predicted and actual weights can be seen from the graph in FIG. 3.


We now provide some worked examples that demonstrate how to use the model to predict an individual dog's size:


(1) A dog fixed for all the “small alleles” (i.e., a “0” at all loci with positive effects and a “2” at all loci with negative effects) yields a minimum size of 1.71 kg:





log(y)=1.69202+0(0.25244)+2(−0.165)+0(0.29516)+0(0.51176)+2(−0.10618)+0(0.26279)+2(−0.30707)





log(y)=1.69202−0.33−0.21236−0.61414






y=exp(0.53552)





y=1.708 Kg


(2) A dog fixed for all the “large alleles” (i.e., a “2” at all loci with positive effects and a “0” at all loci with negative effects) yields a maximum size of 76.43 kg





log(y)=1.69202+2(0.25244)+0(−0.165)+2(0.29516)+2(0.51176)+0(−0.10618)+2(0.26279)+0(−0.30707)





log(y)=1.69202+0.50488+0.59032+1.02352+0.52558






y=exp(4.33632)





y=76.426 Kg


(3) A dog heterozygous for all the size alleles (i.e., a “1” at all the QTLs) yields a size of 11.42633 kg





log(y)=1.69202+1(0.25244)+1(−0.165)+1(0.29516)+1(0.51176)+1(−0.10618)+1(0.26279)+1(−0.30707)





log(y)=1.69202+0.25244−0.165+0.29516+0.51176−0.10618+0.26278−0.30707





log(y)=2.43592





y=11.42633 Kg


(4) A hypothetical dog with some small and large alleles:





log(y)=1.69202+2(0.25244)+2(−0.165)+2(0.29516)+2(0.51176)+2(−0.10618)+2(0.26279)+2(−0.30707)





log(y)=1.69202+0.50488−0.33+0.59032+1.02352−0.21236+0.52558−0.61414





log(y)=3.17982





y=24.04243 Kg


EXAMPLE 3
Validating the Model in Mixed Breed Dogs

The model described in Example 2 was generated using data from pure-bred dogs. This Example details the testing of the model on a population of mixed breed dogs.


Selecting the Panel of Mixed Breed Dogs

The samples used for testing the size model came from a collection of dogs that performed at the “All about dogs” show in the UK. The breakdown of the different types of samples collected is provided in Table 6.


Dogs were initially genotyped using the WISDOM PANEL™ MX mixed breed analysis test to confirm they were mixed breed. They were selected for genotyping if their owner considered them mixed breed or if a visual inspection of their photograph suggested they may not be purebred. Of the dogs genotyped, 14 were excluded from the mixed breed set based on the WISDOM PANEL™ MX breed calls. Twelve of these were called as purebreds. Two more, were thought by their owners to be purebreds of breeds outside of the panel (Spanish water dog and American bulldog) and the WISDOM PANEL™ MX result did not contradict this. Once the genotyping with the SNPS from the size model had been performed, a further 4 samples were excluded because they did not genotype for all of the 7 SNPs in the model. Finally another 14 were excluded because they were not fully grown (i.e. were <1 year old). This left a set of 48 dogs (called the Mixed 48 set from here on) of which 24 were female and 24 male. Dogs in this set contain no more than 75% of one breed (as determined by WISDOM PANEL™ MX). In all but three cases the dogs contain no more than 50% of one breed. The Mixed 48 set contains 4 dogs that potentially could be Jack Russell terriers. This breed is not in the WISDOM PANEL™ MX test and is also very varied in nature. Photographs of the 4 dogs were studied carefully and although the owners believed them to be Jack Russells, they showed sufficient variability in appearance that they were considered as mixed breed dogs.


Testing the Size Prediction Model

Using the 7 SNP model and the results of the above genotyping analysis, a predicted weight was generated for each dog. Table 7 shows the genotypes for each of the mixed breed dogs along with the predicted weight of the dog and the actual weight of the dog. This information is plotted graphically in FIG. 4.


The correlation between the predicted weights and the actual weights of the dogs in this panel is 64% (using the correl function in Excel). This masks the fact that the model is better at predicting the weight of male dogs than female dogs. The predicted weights for the males show a correlation of 78% to the actual weights compared to 64% for the females. This difference in performance between male and female dogs is more obvious when the two sets of data are plotted on the same graph as depicted in FIG. 5. The graph shows that the model tends to over predict the weight of some female dogs. This is not surprising given that the model was developed using the weights of male dogs. To refine the model further, it is therefore possible to use information about the sex of the animal to inform the model.


EXAMPLE 4
Refining the Model Based on the Breeds that Make Up the Mixed Breed Dog

To demonstrate the effect of taking into account the breed origin of the SNP alleles on the model, the IGF1 SNP was studied. As discussed elsewhere herein, for the IGF1 model SNP (BICFPJ401056) almost all large dog breeds are homozygous for the “2” allele (Table 8). Despite being a large breed, Rottweilers almost all have the opposite allele (0) more commonly associated with small dog breeds. Thus, when the IGF1 gene has come from a Rottweiler, the genotype of the IGF1 SNP would be misleading. The same is true for two other dog breeds considered in this example, namely Bull Terriers and Whippets. In both of these cases, the allele commonly found in these breeds is also opposite to the allele usually found in breeds of a similar size (Table 9).


To take account of this information it was first necessary to develop a modification matrix for each of these three breeds. This can also be seen in Table 9. Once the modification matrix had been developed, the breed origins of chromosome 15 (which contains IGF1) in each dog were then predicted. This was achieved using the WISDOM PANEL™ MX test. The list of chromosome outputs for chromosome 15 for the Mixed 48 set is shown in Table 10. In some cases it was not possible to unambiguously determine the origins of chromosome 15.


To decide on the correct chromosome outputs, both the predicted best pair of breeds per chromosome and the overall distribution of breed calls for each chromosome was considered (for the selected family tree only). When the predicted best pair of breeds was significantly more likely than the next best pair (i.e. >3 times more likely) then this result was chosen. When the predicted best pair was similar in probability to other pairs of breeds then the overall distribution of breed calls on Chromosome 15 and the other chromosomes was considered in choosing the correct pair. In these cases the choice is somewhat subjective but generally if the probabilities for different pairs of breeds were not very different, preference was given to breeds that appear regularly both on the same chromosome and also on other chromosomes in the same dog. Finally, if applying these criteria had not aided a decision, reference was made to the photograph of the dog and also to the likely probability of that breed being present based on the incidence of the breed.


From Table 10, four dogs were unambiguously determined to contain chromosomes that originate from breeds with atypical IGF1 allele frequencies. These four dogs are highlighted. The results for the IGF1 SNP were then modified according to the matrix in Table 9. The SNP results, both before and after modification, are shown in Table 11.


Following this the modified SNP results were then applied to the size precition model. The application of the matrix modifications improves the weight prediction in three of the four cases and in the fourth, applying the modifications has no effect on the result. This is plotted graphically in FIG. 6. It is envisaged that a similar procedure could be applied to each of the other 7 SNPs in the model.















TABLE 1






SEQ ID
Gene


Correlation
Sequence


SNP
NO:
Nearby
Chr
Location
coefficient
SNP = [wildtype base/alternative base]





















BICFPJ401056
84
IGF1
15
44263980
0.67387
CAAGGAAAAGAAGTTATAAACTGGCCCTCTCT








AACTTGTACCTGCCTTGCTGTAGGTTGAGGTC








TTTCTGAACAATCGTGTCCTTTAGATATCTGG








ACCTTCATTAACAGGTTCAGGCTTGGGAACTT








GCCAAATTCCAGAAAGGGTCTAGTGAAGGCAT








TCAACTGGGGAGCCAGCTGCCTCTTTGGAAAG








TGGTTTTA[G/A]TTTACCCTTCATCTTCCAA








TAAGAGACAGAATCCCAATTTTCTTAGCTCAA








AACCATTTCTTTTAGATTCNAATAGCAAACCT








AATGGAACTAATCAACTCAGAGTCCTAAGAAA








TAATATTAGAAACTGGCTAAGCATGACAAGGG








AAGCAATTTGATATGAGTAAAACACACATTTG








TCCCACTCAATGCAATTAGAAA


BICF235J47583
58
HMGA2
10
11451490
0.607791
AAAAGCANCATATCCAACATTTGTAGTTTGTT








ACAATAACACATTGAAAAGATTTATAGACTGT








TTTGGGTGTGATTTTTGGATTAATTCCCTACT








TTGAAACCATTTGTGAGGCTCTGTTTATTTAA








AGGAGGGAATGAATAGACCTGAAAACACCTAA








TTTTCATTTTCATCTCAGACTGGAAGCCAGTA








CATCTGTA[G/T]GGTTTGTTTTTTGGGTTTT








GTTTTGTTTTGTTTTTTTGGTTTTGTTTTGTT








TTGTTTAGAATTGAAAACTAGATCACAGAACA








CACAATGCTATATTTATCATTTTGATCATCGG








TTATTAGATGCTTGTTTGCATGTGCTTAAGCC








TCTAGCCAAGATAAAAAAAAATTTTNAAAAAC








TATTGTGGTAATAGAGTCTAG


BICFPJ1148955
138
Glypican 3
X
107354447
0.53213
CATAAGTATTCTGGGAAGAAAATTCTGGAAGG








GGAGGGGAAGGAGAGTTTGTTGTCTTTAGCCA








TTTCCTCTGGAGGAGGCCAGTTGTTGCTATGA








TGACATCCTACACCAGCCTTCTAGCAGAAGAA








CTGAATCCAGAGATGCCCCTGTCAGGTTGAGG








GCTTGTGGCATTTTGAACCAAGTGATCCCAGG








ACCCTGGG[G/A]TCATTCGCAATCCAAGGGG








ACCAGAAGCCCATCAATAGGAACTTCTGGAAT








GCCTGCCAGGGGGGTGAGACTGTCCAGTGCAC








AGATCCTGCTGGGTTAGTCGTCTGGAGATCCT








CCGAGGGGACTCAAAAGAGCTTTTTGTTCCAC








TCACTGTTTGCTTTTCTTTTCCTCTTTCTAGC








TAGGTTGAACATGAGATCTGG


BICFPJ1149345
7
Growth
 4
70324248
0.504142
ATTGCAATGAATTTGTTTTAATTTGGTGTCTT




Hormone



CACATCCCTGGTTCACCTAGTTACTAACCTGG




receptor



GGATGTTGTCTCACTCCTCTTGACATAGTGTG








TGCCACACAGCAAATGCTCAGTAAGCACTCAC








TGAACTGAACTGACTTGCCCAGTACGACTACC








AGGGTCAGATTCAACTCACTATAGACTCACTT








GCTGACTT[G/T]GATCAAATTTAATTTTATT








AAAAATACAAGAACTAGCAGATAGAGGTTGTT








GTTGTTGTTTCTAAATCAAACTTATCCTCAGA








ACAGTCATTGTAAAAATGATAAATATAGAAGT








GTCTCATTTAATAAAAGTTTATGCTATAAAAT








CAGTTCTATCGTTAAAAACACCTTAAACATTA








GCATCCTCTTTTCCACAGTTT


BICF235J29129
111
Chr 25
25
39552390
0.472837
CCACTCATTATGTTCCCTGCAGTATGGAAGTT








CTGTGGCCAAGGTTCATATAACTGAGAGTGTA








TTTATGGCGGTCCATACTCTTTCTTAGGAAAA








TATTGATTTTCTAACAGCAGAATGACTGTAGA








GCCGTTAAATCAGACTAGACTATCATAAACTC








CAGGATTAACCAAAGAGTACTTTCACCTTTTC








TTTTAGTT[A/T]CTCATGAGCCATCGGGAGT








AGATACATCCACTTAAGCAGGACAGGATCACA








GCATTTATTACTTGATTTGAACAAACCACCAC








TATTCCCCACCCTTATTGCCGGATAAGTAATT








AAACATTCTGCTCTTATTTTAAAGATTGACTG








ACAGGAATGAAAGAGGCCAAGTTGTATTTAAA








AAAAAAAAATACAAAGGCTTC


BICF233J61597
109

22
10305141
0.461521
TTGAGATTCTCTCTCTCCCTCTCCCTTTGCCC








CTCCCCCCATTCACTCGCTCTCGCGCTCTCTC








TAAAATAAATAAAATCTTAAAATAAAGAAAGC








ACATCCTAGAAATATATTGTAATATGTAATAT








GTAGAGCTCTCTTTCTCAAATTTTCTTTTAAA








AGGCTCTGATTTCTTGAGACATTTACCGTAAT








AGAGGGAC[A/C]TTTCCATAGAAAAATAAAT








TCTCATTCACTANGATTTTTTTTAATTTAGCA








TAAGAAATCATTGAATTCCCTACTACAGAGGT








TACTTATTAACGAAATGAGAATTCATCACTTA








CAGATATAATTCTAAGTAGGAGTATCTGGGTT








GTTATAATAGATGATACTTAATAAATATCTGC








CTTAGCTTCTATAAAATACAC


BICF230J37720
12

 6
10897023
0.45149
TTTTAAAATCCAGCAGAGGAAAAAAAAACCAA








GTCAATAAAACATTATAAGACACCTTCCCCAA








AAATAATGGTAAAAAGAAAAGCGTATAATATT








GCACAAGTCCTAAAATAACCATAATTACCCTA








AATGTATGCCAATTAAATTCACTAATCAAAGA








CAGACTCTTAACCTGGGTTTAAAAAACAGAAT








CCTACATC[G/A]TATATCTAAAATAAACACA








TCCAAAGCAAAAGGATATGGAAAGACTGAAAA








TAATGTTAGAGGAAAAAAAAAAGTGTTATCGG








GCACACAGAAACTACAAAGAAAAATAAACAGA








AATCTTATAGCAACAATACAAACAGAATTTAA








GGTGAAAAACATCATAAAGGAGGAAGAAAGAG








TCTACATATACACCAGAAAAA


BICFG630J8331
1

 1
32136268
0.401824
GGCAAGCTCTCTCCCCACCCCAATTCTGAGTT








GATGAGTCACTTCCTCTTTCTGAATTGGAATT








CTACCCCGGTTATTTATATTGTGAGGAGGAAA








TAGGCCACAGGGAGTAAACAGATTAGGAACTC








ATAGTTCAAATGGGAAGTGATTAAGGTATGAG








TAAGGATCACAAAGAGGTTGGGCAAAAAAAAA








AAAAAAAA[T/A]TTTTTTCTAAAATCTTGTC








AGCCCTTAGAGTGGTTATAGCCAAGTAAGAGA








GAAGTAATGACATGAAGGGGAAAACAAATAAC








AAACTAAGAGTTAACAGAAAAACATCTCAGTG








GCATTAAGCAGGAGACTGAGCAAATCATAGAA








ATACGTGATGAGAAAAGAGCCAGATCAAATGC








ATAACATTTTTCAATAAGCAT


BICF233J3303
76

13
19168116
0.397663
TTTATGAATTCTGACCAATAATTTCTTCTAAA








TGCCAAGATGAAGATAAGGAAGGGAGGGGGCT








ATCTTTTAAGACAAGTAAGAGCTCTGAAAACA








GGAAATCAGGAGGGGTTTTTTTTTTTTTTTTG








AGGTCTTTATGTGTCTCTAAAAAGTCTTTTAT








GAAATAAACTGGACTCTTTACAGAAAATAACA








TGTACATC[T/C]TGTACAACCAAATCATGAA








ACACAACCAAGAGATCTTATTTCTTTGAGGTC








ATGAAATTTAAAATGTATATACATTTATGCCC








TTGGTCATGAAAACACATGCAGGTAACTGGAT








GACAGAGAGAGCAACTAAGAAGTTAACTATAT








GTCATCTGAGATCTGTTTATACAAAGTGAATT








CACCTGAATGAGACAAAGGCT


BICF230J25861
135

38
16264182
0.393715
TAAAATCTGAGACCTCCTTATGCAAATCATTT








TGCCTTTAGCCATTTCAAAAAGAAATGAAGGA








CCTAGAGGATTTTACAGTTTTACATAACACTG








GTGAGATGGTTGTCAACTTTGATCTTACATTA








ATTAGTTAAGATCTTGACTGATCATAGCAAAA








GCAAACTAAAAAATCTGGTCCCCAGTTAAAAT








GAAATACA[G/C]CTACGACCTATAATGATGA








AAATTTCTGCTTTATCTGTGATATTCTCCAAT








ATTTGGCATATTATTGAAGGGCATATGATAAC








ATAATTCATTGTCTAGTAAAGTGATTCACATG








ATCTAAGTACATTTTTAAACCTTATTATATAG








ACATCAATCTCAATATTAGGTTGTTGTATACT








TAAGCCATTGGGGGTATAAAT


BICF236J34682
11

 5
87922545
0.392595
TTTTTAATAATAGCAGTTTAACTTTGAAACAT








TTATAGAATCTATAATAATAATAATGATAAAA








TATTTTAGGTAAAAGGACAAACGAGTAAAACA








AGCTTGTAAGATTTTTAGTACTTTTATGCCTT








TTATGCCTTTAGTTTGTTTTATAGAGACTCAT








GCTGCATTGTTTGCTGGTGTTTATTTAATAAA








TATGTGTT[C/T]GTTCTATTTTTGTAGCCGG








AATTGTGCTAGATAGCAAAGATTTAAAGATGC








AGTTGAAAGTTTCTGTTCTCATGGAGTTCATA








GTCCGATGAGTAAGACAGAAGTGAATAATAAT








TCCAATTAAATAAAATTCAGTGATTTTGGACA








AAGGACAGCCAGTTTGGATTGGGGGCTTCATG








GAAGACATAGTATGAAAGTTT


BICF230J63373
112

26
13241060
0.386944
GGGTCTCCAGGATCACGCCCTGGGCTGCAGGC








GGCGCTAAACCACCAGGGCTACCCTAAGGCAA








CTACTTGTGTTGTATGCTCACTAAAGATGGAT








CTAATTTTGGGTATGGCTACATCCAGAAGCTC








TAAAAAAGTTACTAAAGATTTATCTCTTGAGT








CGGTGTTGGCTTTATTTAGGCTGTTTCTTTCT








TCATGGTG[A/G]CAAGATGGCTACCAGTATA








TCTCCAGGCTTAACTCCTGCCCCCTAGGCAGC








TCCTATGAAGAGAGAGTACCTCTTTCCTAACA








GTTCTTATAAAAATTAAGGGATTGGTTTTAAT








TAGAGCACTATAGGTCATATGNGCATTGCTGA








GCCAATCCTTATGGCCAGCGGATGGAATTGGT








CATTGGTCAGGCCTGGGCCAG


BICF235J20169
96

20
35391970
0.382896
GTTTCCGAGCAGAGATGGAGAAGCAGGGCTTG








TAAAATGAACGCCGCCTTCCCCGTTGCATCTT








TGCTCCAGGGTGGGGGCCGCCTCGGTTGTAAT








TTTACACCGATGTCCACACCCTGCTAGGGAGC








AAGAGAGGCGAACTGTAAGTGAGAATATTTGC








TCTGCCTCCACCCCCTGGAGGAAGAGGAGCTG








GTTCTCTC[G/A]GCAGCCTGCGAGCAGAAGT








GGGAGGGCTCCCCCCACCCCAGCCCCTGCGGC








CAAGGGCCTGGGGCCATGTGGGTGGGTCCCGA








GGAGCAGGTCTTCCCCCCAAAGAGGTGACAAA








GACAATGGCAGTTTGAAGGCGCAGCCAGCCCT








GCCTTGAGGTAAGGTTGGGGGTGCCGGTAAGC








AGGCTGCTCCGAGAAGGCACC


BICF229J57386
14

 7
54659539
0.369131
GGCTCCCTGCTCAGCAGGGAGTCTGCTTCTCC








CTCTCTCTTTCCCTCTGCCGCCCCTCCTCCCC








CACTCATGCTCTGTCTCAAATAAATAAAATCT








TTTAAAAAGCAGCATGCAAAAGTCCCCAAGAG








TCTCTGTATATTAATGTTGTTATCTTTTACAT








TTGAGGTTAGTTTATTAAAAAGAGAAAGAGAA








ATATAGAG[G/T]GCACACCCAAACATAAAGT








CTGGTGAGAATAGGTAGTGGATCTGGATACCC








AGAAGAGTGGCTCAGCACAGAATTTGGGGGTA








CACCAGTGATATAAGGTAGTAAGAAAGTGCCA








AAGATGAATTTCCCATCCTTTACAGTTGCTAA








AGATGAGTCTTTGCAGAACTGTGTACTGACAG








AGGCTCCATAAAGTCCTTTCG


BICF231J12866
74

13
18853457
0.36732
ACCTTCTATGAACCTAGCACCTTAATATTGTT








TGTGTTAATAATGGTTGATATTTATGGTGGAC








CAATAGCTCTGGAAAAGTTCCAGGGCTAAGAA








CTATCCATGAACTATTACATTTTATCCTCACA








ACCCCAAGATATGGGGCAGAGAGTTGGAGACT








GGCGCCAACATCATACACAGTTGACAAAGCAG








CTGAACTG[G/A]GATTTGAACACAGAATGTT








CAGCTTGAGGACTTGCTGTTTTGTGATTTAGT








GACCAAAGCAACCTTGGTACATAGAAATCATT








TCTTTAATTTTATGAATGTAGGAACAATAGCA








CAGAGAGGTTAAGTAAGTCTTTCAAAGCCACA








TAGCCAACAAGTGGCAAAATGAAAGCCAGCTC








AGATTTGTCCCATATCAAAGG


BICF237J26004
121

34
21417087
0.367118
CCTCTCTCTCTCTCTCTGTGACTATCATAAAT








AAATAAATTGAAAAAAATNAAAAAAAAATAGG








GGTATGACACCAGTTTGACAGATTATTGGTAA








CTTTAAGAAAAGCGGTTTCTATCAGCAGCAAT








AAGGACTAGGTGGGGGCTTCATGGCTTCTATT








TCTTTAGCATTCATTAATTTAGCATTCAGTAG








ATATTCAC[C/T]GAATGCCTTGTGTCCTAGA








TCCTGTACTAGGATACAATGGTGAAAGGATGT








AATCTCTGTTTTCATGGAATTTAAAGTTTAGT








GTGGGATGTAGACATTAAACAAATAATGACAC








CAATAATTAATCCAGTGGTCCAGACATGATTA








AAGGAAAAGTGTAGTACCAGAGAGGGTATGTG








TCACAAGAGAGCTAAATCCAC


BICF230J27652
119

32
7408543
0.362135
GCTCCCACGATTCCATTTATTTTTAAAAGGCG








GCGGGGGGCGGCGGGGGGAGTATTCCTCAGTT








GGCATTTTCAAAATATGCCAGATTTAATCTGC








CACTGGCTTTATTTTTGCAAAAAGTAGGCAAA








TTCAAGAAAAATAATGTCTAATAGTTGAAATG








TTCTGCTTGGATTCATAGAGGCAAAAGGAGTA








TAAACAAG[G/T]AGTAATATAAGTTGTTTCC








TTGTCCTGTGTATCTGTCACCAGTGATGGAGG








ATTCAGGCATCCAGCGAGGCATCTGGGATGGA








GATGCCAAGGCTGTCCAGCAATGCCTGACAGA








TATTTTTACCAGTGTTTACACCACCTGCGACA








TCCCTGAGAATGCTATATTCGGTCCCTGCATC








CTGAGCCANACTTCCCTGTAT


BICF237J62215
98

20
44783441
0.357059
TCTCTGGTTAAAGTGCCACCGTGGAGGTTGTG








TGTCACACATTAACTGGTAGCACCCCAGTGCC








TAGCAGAGCCAGCCTGCCCTCTTTGTCAGGCA








ATCCCCGTGGGGCCCCAAGGGTCAGTTTCTGG








TTAGTTTTAGGTCAGTTTCAGTGGCATTTGAA








AGGCTTGGTTGGGGGCAGGGAGTCCCCTTTGG








TGACTCCC[G/A]TCTCTGATGGGGTCCTTGG








AGGAANAACCAGGGTAGTCACTAGAGCTCAGA








ACTGGAGCAGGGTCTGGACTCTGGCCCAGGGG








CCCTAAACTGGGCTCTGCTGCCATGAGTAGGG








CTGTGGCCAAGCTCTATAGACCCTAGGGCCAG








GGTGGGCAGCAAACTCAAAAAGAAAAGACAGA








GGCTCAGCTCTCAGCTCTGCT


BICF245J13607
114

27
22519619
0.352491
TGCATATTAAAACAAATGCAGGTACAAATCTA








CTAAATAGATCCACATCTACTAGAAGGTACAG








AAACATCTACTGAAATGCCTAAAATTAAAAAG








ACCTAAAATATCAACTGATGGCAAAGATAATT








GTTGGCATCTGGAACTTTCAAATGCTGCTGCT








GAGAATACAAAATGGTACAGTGACTTAGGAAA








ACAGGTTG[A/G]TAGTAGTATATAAAATTAA








ACATATGATTTGTTACATAACCCAGGAATCCT








ACTCCTAACTATTTACTTCTGGAGAAATGAAG








ATATATGTCCACATAAAAACCTATCAAAGAAT








GTTCATGGTAGTCTTATTCATAATAGTAAAAA








AAAAAAAAAAAATTAAATAAAGAACAAAAAAA








AAACTGGAATGTCTATCAGCT


BICF236J9894
77

14
38955880
0.346263
CACAACAAAGAAAACACAATTTGACCAAGTTT








TCTACCACTAGATAGCAAGGATAACCTTTGCT








CCCTTTTCTGATAAATGTCCCTCATTTCCTTT








TGAGGTCTTGCCAGAAGCACCTTTAATGTCCA








TTTTCCTAACAGTTTTCTATACATGGCAATAT








ATGTATCCACGAAGACCACAGATGCTTTCTGC








ACCGTGCC[A/G]CTCATGTCCTGGTGAGTTT








CTCACCAGAATCTACACTTTGGTTATAATGAA








CTTCACAGTTCATTTAGCCTCTAACCATTACC








CAGTTCCACAGCCATTCCCTCTGTTTTAGGTA








TTTGGTAGAGCATCATCCTACTTCCGGGTAGC








AAAATCTGTATTAGTCTCCCAGGGCTGTCTTA








ACAAGTAACACAAAATTAGTG


BICF234J31015
3

 3
44198567
0.344778
CANTCTGAAAGGTCTGTAGGTTTTTCCTTTTT








GTTGGTGGAATGAGGAACCTTAGAANCACCCC








AAGTGGTTACCCCAAGGCCAAGCGTGAAGGGA








GGTTCAGAATTGAAGTTTCTTTTCAGACCCAG








TGAGTCTGGTCATTCTGTCCCATCATGGAAAT








GGGGACAAGTGAAAACAACTTCCCCAGGCAGG








TTCCCCCA[G/A]TCTCCCAGCAAGGATCTAT








GAAATTTCTCAGGAAGCTTCTTCTGTTCAGAT








TTGCATATTAGGCGCTCACACTTGAGTTCGAA








TGATTTTGAGAACAAATTTTGGGCTCTTCTGC








TCTATGGTGGTGGGAGTGGGAACCCCGAGGGA








ACTCAAAGATGAGAAGGCCTCAGAAAGAGGGC








ACTGAGGATGCCAAGGATAAG


BICF230J38817
15

 8
62091061
0.338515
GATTACAGATTGGAGTGGACTTTTTTCTTTGT








CTTGCCCCATCTTGGTCACCGAATGACTTTGT








CTGATGTCAATCTTCCATGAAAATGTTTATAT








TTAATAGAAAAAAAAAAAAAAGGACAGCCTAT








CCATGAGTACAGGACAGTTCCTAGCAACACCA








AGGTGTAGCTGATAATGCCTCTTTCACAGGGG








AATAAACT[G/T]TAAGACGCTTGATCACTTC








TGGTTTTGCTTAGGTTAAGCCAGCCACCTACA








TAGGAAGTCCAACTACTTTGAGACTTCCACGT








TTTTGTTTTTGAAATAAGGGAGGCTGCCCTCC








TACCTTTTCCTTCCTGCATCCAACATGCCTCA








TGCAACACCTGTGCTCTCCACTGGCCTGTGAC








AGCAACTTGTTATTCTGGGGC


BICF233J46097
134

34
39797181
0.336551
AATCATCAGGGGTTGAGATTGCCGTATCAACT








CAGAAAAAAAGGAATAGCACTGCCCAGTTATT








CTTTAACTTTTATTCTCCTCCCACAAGGCAAA








TAGCTTGAAAGCATGAGCTCTNCTTTTGAAGC








AGATTCCTCTTAGGCTCTTTCTCTGACCCGGC








ATAGCAGACACTGCTGACCACCTACTTTGAAG








CCATTCTA[T/C]CCACTAATCTTCCCTTTGA








TGAAAAACTTGATTTTGTTCAGTTATCAGGAG








ACCACATAGTTTAGAAAAGGGTGGACCTTTCC








CCAGCCCTATGGAGGATGATAATTCATCTAAT








CCAATCATGGAAATTCCATTTTCCTTGCCAGC








GAAAAATTTAGGAGTGGGCATATTTATAATTC








TGGATAGCGAGTGGGGAAGAG


BICFPJ1436705
4

 4
62267382
0.335782
AAATGTGGTATATCTATACAGTGGAACATTAT








TCAGTCATAAAGAGGAATGGAGTTCTGATACA








TGTAACATGGTTGAACCTTGAAAACATTACGC








TATATGAAAGTAGTCAGACACAAAGGGCCACA








TATTGAATAATTCCATTTCTATAAAATGTCCA








GAAGAGGCAAACCATTAGAGAGAGAAAATAGA








TTAGNGGT[A/C]ACCAGGGGATGAAAAAGTA








GATCATTGGTGGCATATCATAACAAATATACT








AAACAAAAAAACAAAAATGCTGAATTGTACAA








TTTAAAATGGTGGATTTTATGTTATGTGAATC








AGTTCTCAATTTAAAAAATTTTTAAGTCACCT








ATGATTTGAGTCATCTAACTTTTCAAAACTTT








TCACTTTTTTCACCCATGAAA


BICFG630J426502
99

22
9735062
0.334417
GTCAGGGGAATTGGCTCTCAATATACAGGGAA








ATTTCAGAGAAACATTAATGAGCTCCCTCTTC








GTTGAAAATTAAATCTGTCAAGGATATGAATC








AGGTGTCATGTGAAAGAGCCTGATCAACTCTT








TCAAAGCAATTTCCTATTAGAACTCCAATCCT








GGAAGATGCCATTTCCCTTGCTCCAAGGTAGT








TGAGATCC[C/T]GTTGGCAAGTTGTTTTGCA








ATCCTTCCCATGAGAAAGAATACAGTAAAGAT








GACAGCCCAGTTAATTCACATCCAGAAAAATG








AAATGTATATTCATGGTCATTTCTCTTTTTCT








CGGCATTGATCAGTAACCTTGGGAGAGCATAT








CAAGCCCTTTTTTCAACACATTTTTCCTCTCC








TTCTTCCTCATGTCGTTTAAT


BICFG630J163689
16

 9
13329359
0.330827
CCCTCCCTTCCTTCCTTTTTGTTCTCATTTTT








CCAAGTAGTTGCTACAGAGACCAGCAGACCCT








GTGGCCCAAATATAAATAAAATAGTTGTGACT








CTCCATCAGTTTATCTGGAAGGATAGGGAGAA








AGAGAGAGAACTGAACTGAAGGAGGAAATATC








CCTAATTTTTTTTTTTTTTAAGGGTGGAAGTA








ACTTGTTC[G/A]GAGAAAGAAAAGAAATACA








CTGTGAGATCTGATGCGTAAAGAAAGAGAGGG








AGAACCTTGGAGAATGATTTGAAAAACAACCC








ATGCAATTTAGATTTCAAGTAGAATCTTAATC








TGTGGACTACTTTAGAGCCTCTTAAACAGAAC








ATTAAACATATGGCCATAAAGGTAGAAACTTG








AGGTTTTTTTTTTTTTTTTTA


BICF234J44301
72

12
75211352
0.329537
CGCCTGCGAGCCACGCCCCGGTCACTCAGGAG








GGGCCCCTGGGAAGCGGGGGCTGCCCTGGGAC








CCGAGGCCTCTGCGGCCTGCACGGATCGGCCG








AAGCCTGACTGGGCTGGGACCGGCCGGATCAG








CCGGCGCTCTGGTCACCCAACACTCGACAGCT








GCTCTCCTGGGCACTGGCGTCTGCCTTTGATC








CGCGCGAC[A/T]GTAAAACCGATCAAAGCGG








AAGTGCACACAGGCTCCGTGCAGAAAATGAGA








GGGGCCCTCGGAGAGGAAAAGCTGGAGCATCG








CGTGGTTGAGGGGCCTCGCACGGCTAAGGGGC








GGCTCGTTGTGTACGACACCACACGCTCGCCA








GCAAAGCACCCGGTGCTCCAGGCAAAGGTGAG








AGGAAAGTCGCGACTCCCGTG


BICFG630J610801
118

30
34498508
0.321396
CCTTTGATGGTTCATGGAAGTGACAAACTTTC








AGTGCCTTTCTCAACTCAATACAGGAGCGTGA








TCATTTTTGTAAGCCTGTAAACAAATTCTCAC








AAAGCTCAGAGTAGCCAAACTTCATGATTAAA








TGTAGCAATAAAAATATGGTGGGCATTTCAAA








CCTTGTTTTTTGGATAAGCAGCCACATACTTC








GGTGTTTT[G/T]TTTGTTTGTTTGTTTGTTT








GGTCTCCTAGTTCTGGCTGGCGTGGTAAACTC








CCTCTTAGGCTGAATAAGTGTTGGAATAGGCT








AGTCTCAATAATTGAACATTCAGGATAACCAG








GAGGTGGTCTGGCTCTTCAGGGTTCTGTAGCC








CAGACACATCAAGGTCACTAGAGGGGAGCCAT








GGGAAATCACTTTTGCTCTCC


BICF229J41242
78

15
36683521
0.318983
TCCTTAAACCATTATTCATCAGCATTTGCTTA








ATTTTCTCAGTGTCCAGCAATCAAGCCTAATC








TTCAAAGATAAAGATTTACTACCACAAATTTT








TTAAAAAATAATGGCTCACAAGGCTTAAGAAT








ATTTCTTAAATGTTCTAAAGCAATGCAGGTTT








CAAATGTGACTTAATTTTGAATAACTGATAGA








TTATCTAA[T/C]AAAACTACAGCTTTGTTTC








ATTCACCATTGTCATTTCTTAAGTTACTTACT








CAGAAAACTTCAGCTAAAAACTTTAAAAGGGA








ATAAAGTATTAATACATGCTATAATGTGAACC








TTGGAAGACATGCTAATTGAAAGAAGCCAGAT








ACAAAAAGCCCTGTATTATATGATTCCATTTA








TATGAAAAGTCCAAAATAGAC


BICF232J58180
71

11
71402215
0.318268
CAGACCGACACAGAATGAAGGAGGGTTCAGGG








CAAAATGATGCTCCTAGCCTCCGCCTAGCCAC








GTTGTAGCCATCCAGTCTTGGGCAAGTCTCAT








AATCTCCTTGAGCCTCCATTTCCACATCTAGG








GAAAGGGAATAATAATAATATCTGACCGCCTG








GATCACGTGATCGCCTTGAGGGTCACATAAAA








TAATATCC[C/T]AGCAAGAGTTCTGGGAAGA








GTTAACCAGCACACAGATGAAAGAGGGCTTTG








TTATTTGTTCAGGAACTTTGTTCATTCTTTTC








CAGTAATCGTGTAAGAGAAATTGCTTGGAATT








TTATAATCAGAATATCAGAGTTTATTTAACGT








GACTAATATTCATTAAAGCAATAACAGGAGTC








AGGCCTGGTATAGTGGAAAAG


BICF234J24531
113

27
17672045
0.314687
GAGTACAGGCTTGGACCAGAATATATAGGTAT








TTTTAGTATTTGAATTTTATCACAAACACAGT








GAGAAAAAGCATGGTTTTTGTTCAGAGAGGTT








CTTTCACTTCTGTGTGCAGAATAATTGTGGGT








AGTTAACAGAAAGATTAGTAAATTAATTGCTG








TTGAAATAATCTGGTTCAGAGAAGATGGTAGT








TTGGACTA[C/A]GAAAATGAAGAGGAGTAAG








CTGATTAAAATATGTTTTTAAGATTCATTTCA








CAAGGATTAATCAAGGCTGATAGTCTTGATTA








AAAAGGATTTCAAGGAAGANCTTCAGATCTCT








TGTNCAAGTAACTGAATGAATGGATGTATCAT








TTTCTGACAAGGGGAACATCATCCATTTCTGG








GCTTTCCATAAGTTAACAATG


BICF230J17282
13

 7
47321194
0.314311
CCTGTGTGTGCACACAGGGGAGCGGAGGCTGG








AAAATGAAAAGGACAACTTTGAGAGGGGAGCT








GAGGACAAGTTCACACTGGATGCTCCAGATCT








GGGGCAGCTGATGAAGATCAACATTGGCCACA








ACAACAAGGGCGGGTCTGCAGGTTGGTTCCTG








TCCAAGGTAGGTCCAACTGCCCGACTCTGGTC








CCTGTGGC[T/C]CGGGACGGGGAGTTCTGCT








TCTCAGAGGCCACATTTCAGCCCTAACCTCCT








TCTCTTGGGCTGTCTGCCTCCGCTCTTCTACC








TACACCTGTTGGCCAACCATGCTGCCTGACTC








ATAGCCTCCCAGACCCCATGCACGTTACCACC








TTCCCAGAAAGCCCAGGGCTCTCATCTAAACG








TAGCTGGGGGTAGGAGGTGGT


BICF233J9971
93

20
29901111
0.314272
TGGACCTGGAGTGCCCTGAGCTTCACTCACTC








TGAATGTAAAATGAAGAGGCTTATACCCAAGT








CTGAAGTAGCTGTGACTACAGACTAATTCAGT








TTCTCTCTAAAGACCCTAAAAGATGCTGACCC








ATAGTTGATCCTTATTGAGTGGCAGCTACTGT








CATCACTAAGGTCATTATTTGGGTCTTCAAGA








TGTTGCAG[G/C]AGAGTTAGTTGCCTGTGGA








TAATAACAGGGAATGAGCCCCAGAAGCTATTC








CCTTCCATCTCCAGCATCTTGCCCCTGACCCA








CGTCTCTTGAATATGGCCTGAATACAACAAGG








CTGTTTGTTTGTTTGTTTTAAANTTTTATTTA








TTTATTCATGAGAGACAGAGAAAGAGAGGCAG








AGGCATAGGCAGAGGGAGAAG


BICF233J31513
115

29
30317809
0.313567
ATATGAACCAGACTCAGATATTTGAAATCTGT








ATGCATAAAATCTGTTCATGTAGCACAACTTT








TTAATTTTTGTTCAAAGCTCTAAACCAAAGTG








GTGAAACACCATTACTCAGAAATCCTGGGGTG








GCGGTAGAGATGAGGAGTTGGGTGTGAAGACT








GGAAGACAGGAAGAGAGAAATGGGAGGTCATT








TAGGAGAT[C/T]TGGGCTTATCTCATTGCTA








AAGACGTCTGCTTTCTACCTGAGGCAGCAGAA








TTGCAGAACAATTAATCTTTCTCTTACTGACA








GATAATCTTTTGTAATTATGGCCGCTGGATCA








AGCAAATTACTCCCAACAAATATTGATGAATA








TTTTCTATGTGTTGGACACTGTTGGGCACAGA








AGATACAAAAATGAGTAAAAA


BICFPJ350145
2

 1
103363294
0.310439
GGGCAGGAGTAGGGGATAGGCATGAGCTCCTT








CGTGGTTGTTTGTCTCTCACAATCTTCCTCTT








CTTACAGGTTCGGATGAACTCGGAAGTAATTA








TGGGCCCTGCACAGGTGAGGGAAGGGCACCTG








TGCTCTCATCCATCCCACTCCCACCCTCACCC








CCAATGGTCTCCAGGATTGTGGGATCATACAA








CACTCTAC[G/A]TATACTCTCCCAGCTCTTG








ATTCTGAGGAACCTGGAGAGAGCCTCAGCCCA








GGGTTTTGAGTTCAGGCCAGGGGTTATATGGA








AGGATTCCCGTTTTGGGTAACTGCTGGAGTAT








GATGTGGACAGTTTTTCCAGGTACAGGAACCA








ACATGTGCAAATACTTGGAGGAGAGACTGAGA








TAGAAGTTTTCAAGAAACAAC


BICF230J33141
117

29
38575425
0.310291
TTTCCCCCAGTTTGTAGCAACTCCTATTAAAA








TGAACAGAGTCTAAAGATGACTTATACTCCTT








AGTTATGAATTATACTGTCTTTTAAATTTTGT








GCTAATATAATGGGTAAAAATAGGTTATTATT








TCCCTTAATTTGCATACAGTATTCTTAAAATT








TACCTTCTTTTTCTTCTAAGGTATAAAAATTC








CTCTCTTG[C/T]ACTGGCAAGCGCTTGTTCT








CTAAATGTACAGAATTTTCTTTGATAGCAGAA








GTATAATTCCATAGATAATATTTTTCCTCAGG








ACTATTATTGGTATATTGTCACAGATTTTCAC








TTCAAAGGAATATCTCTTCTCAGACTATTTTC








AGCCATTTTAGATTAAATTCTATTTTATGATA








ACANTAAATGAGTATATATTC


BICF234J35168
120

33
28805876
0.309937
GGAGTGGATTTTGGAAGTGATGACAAGTGGCT








TTGGTGGGCAAGAACTGCATGAAAAAAAAAAA








AACTTGTGTCAGGTTTTGGTCATGGTTCTACA








CACTGTGATGATTTTATGTTCTTAGGAAGGTT








TCTATCTTTCTCTTTACAGCTGCAGCTTATGG








AAAAGGAACCTATTTTGCTGTTGATGCCAGAT








ATTCTGCA[A/G]ATGATATATATTCCAGACC








AGACAGCAATGGGAGAAAACATATTTATGTTG








TACGAGTACTTACGGGAGTCTACACACTGGGA








CATGCAGGATTAGTTACCCCTCCATCAAAGAA








CCCTCACAATCCCACAGATCTGTTTGACTCTG








TCACAAACGATACACAACATCCAAACCTGTTT








GTGGTATTCTCTGATAATCAA


BICF231J52887
110

23
18195511
0.308913
TAAATCCAAATAAAATACAAAAAGTGCTTTGT








GAGCTCTTAACCTGCAATGCAAACATAGCATG








TTACTCTATTTTATCAGCGAGTGCGTGGCTGA








TGTTTTTGTATTTAATTCTAGTAAATTACAGG








ATTTCCAGAGCATTACCTGGTCACAACTCCTC








ATTTGCAAAGGGCTAAATGAGACCCACGAGTG








ACTTGTCT[A/G]AGGACACACGGCTAGTGAT








AAACAGAACCGGTCTTCTGTTTGCCATGCCTC








CTTCCTAAAATTAATCTTTGCAACTTCATGAG








AGTGGAAACTGCACCTGCTGTTCTTTTGCACC








ACCAGCCTGAGCAACTGTGCTNTATGTACTCT








GCAGCATTATTCAAACCTGAGGTGGATGATGG








TCCCTATCTCTTTAAAAAGAA


BICFG630J367539
92

18
56642845
0.305915
AGAAAGATGGCAGGGTTTCCATTGGGATTTAG








ATGCCTGCACCGTACTGTGACTACCTATCCAA








AAATGACCTACTTCTGCTAACTCTCTAGAGCC








CTGGGGTAGTTGTTTGTTTGTTGTCCAGGGTT








TGAGGTTGCTATCTGCAGGGGGGNCAGTTTGT








CAGAACACCTCGTGCAGGAAACACACTCCATA








CTTGGAAC[A/G]AAGAGGGATTTCAGGGTAG








GGGTGAGAGTGGGCTCAGCAGTAGCCAATGCT








CCCATCGGGCCACATTCAATGATGAAAAAAGT








GTCTATGGAAGTCCACGTCAGGGATGCTCACG








GCTGATGAGGAGGTCATCTCAGAAGGGGACAG








GCAGTGGGCTGGGACAGAGTGTGGCAGGGCAA








GCAGTAAGTATTCTCTCACGT


BICF235J47857
146
GLYPICAN 3
X
107955905
0.5263239
CTTCACTAGAGTATGAGAACCATGAAGACAGG








GACTTTGTTTTGTTCATACTGATTCTCTAGCA








CTAAGAGAGCACCTGGCACATGATGCTCAGTA








AACATTCCTGGAAGGGGGGAGGGAGGAGGAAG








TTTACTATTTCTATATACTAAACACTATGATT








TCTGAGTTTGTCTTTTGCCTTTTAAGATTTTT








TTTTATTT[A/G]CGTATTTGAGGGGCTCCTG








GGTGGCTGGCTCTGTGGTTAGGCGTCTGCCTT








CGGCTCAGCATGTGATCCCAGGCCCAGGGATC








GAGTCCCGTATTGGGCTCCCTGAGAGGGGCCT








GCTTTTCTCTCTGTGTCTCTGCCTCTTTCTGT








GTGTCTTTCATGAATAAATTTTTGTTTTAAAA








AAGGATTTATTTATTTGAGAG


BICF230J67378
35
HMGA2
10
8445140
0.48067147
CATTACTGGTAATTGTGACCCACTTTTATTTA








TCCATTCATTTCACCATTTTTCATAATATAAG








TAGGAACCATGAATCTCCTCACCCAAAAGAAG








TCAGAACACTCTGATCACAGCTCACATTCAGC








TACGTGGTTACTTCCTAGGACATCCCTTTTGA








TTCCAGACCTGAGACAATAACCACATTGCCTT








CTACATTC[G/A]TAATTCCCTTGATAATCTC








GTTATACAGGATTACATCTCCCTATCATTAAG








AAATATTTTAGTCATTTTTAACTTTATAAAAA








TGGCGTTGCAAATTATTTTTCAGAACTTGTTT








TTTACTTAGTATTGTATTGCTAATACTCATTC








ATATTTATAAATGCTGTACTTCATTCAACTAC








TGTGTCATATTTTATTACTGA





SNPs useful for predicting dog size based on correlation of the SNP allele homozygosity score with size (height or weight):


For Tables 1, 2 and 3 the correlation coefficient was calculated using the formula:







Correl
(

X
,




Y

)

=





(

x
-

x
_


)



(

y
-

y
_


)









(

x
-

x
_


)

2






(

y
-

y
_


)

2









Where X refers to size and Y refers to the SNP score.















TABLE 2







SNPs in LD with the SNPs in Table 1 (demonstrating the same pattern of allele



frequency distribution across breeds as the SNPs in Table 1):














SEQ ID


Correlation
SNP Sequence



SNP
NO:
Chr
Location
coefficient
SNP = [wild type base/alternative base]
















BICFG630J8331
1
1
32136268
0.420
GGCAAGCTCTCTCCCCACCCCAATTCTGAGTTGATGAGTCACT








TCCTCTTTCTGAATTGGAATTCTACCCCGGTTATTTATATTGT







GAGGAGGAAATAGGCCACAGGGAGTAAACAGATTAGGAACTCA







TAGTTCAAATGGGAAGTGATTAAGGTATGAGTAAGGATCACAA







AGAGGTTGGGCAAAAAAAAAAAAAAAAA[T/A]TTTTTTCTAA







AATCTTGTCAGCCCTTAGAGTGGTTATAGCCAAGTAAGAGAGA







AGTAATGACATGAAGGGGAAAACAAATAACAAACTAAGAGTTA







ACAGAAAAACATCTCAGTGGCATTAAGCAGGAGACTGAGCAAA







TCATAGAAATACGTGATGAGAAAAGAGCCAGATCAAATGCATA







ACATTTTTCAATAAGCAT





BICFPJ350145
2
1
103363294
0.397
GGGCAGGAGTAGGGGATAGGCATGAGCTCCTTCGTGGTTGTTT







GTCTCTCACAATCTTCCTCTTCTTACAGGTTCGGATGAACTCG







GAAGTAATTATGGGCCCTGCACAGGTGAGGGAAGGGCACCTGT







GCTCTCATCCATCCCACTCCCACCCTCACCCCCAATGGTCTCC







AGGATTGTGGGATCATACAACACTCTAC[G/A]TATACTCTCC







CAGCTCTTGATTCTGAGGAACCTGGAGAGAGCCTCAGCCCAGG







GTTTTGAGTTCAGGCCAGGGGTTATATGGAAGGATTCCCGTTT







TGGGTAACTGCTGGAGTATGATGTGGACAGTTTTTCCAGGTAC







AGGAACCAACATGTGCAAATACTTGGAGGAGAGACTGAGATAG







AAGTTTTCAAGAAACAAC





BICF234J31015
3
3
44198567
0.246
CANTCTGAAAGGTCTGTAGGTTTTTCCTTTTTGTTGGTGGAAT







GAGGAACCTTAGAANCACCCCAAGTGGTTACCCCAAGGCCAAG







CGTGAAGGGAGGTTCAGAATTGAAGTTTCTTTTCAGACCCAGT







GAGTCTGGTCATTCTGTCCCATCATGGAAATGGGGACAAGTGA







AAACAACTTCCCCAGGCAGGTTCCCCCA[G/A]TCTCCCAGCA







AGGATCTATGAAATTTCTCAGGAAGCTTCTTCTGTTCAGATTT







GCATATTAGGCGCTCACACTTGAGTTCGAATGATTTTGAGAAC







AAATTTTGGGCTCTTCTGCTCTATGGTGGTGGGAGTGGGAACC







CCGAGGGAACTCAAAGATGAGAAGGCCTCAGAAAGAGGGCACT







GAGGATGCCAAGGATAAG





BICFPJ1436705
4
4
62267382
0.336
AAATGTGGTATATCTATACAGTGGAACATTATTCAGTCATAAA







GAGGAATGGAGTTCTGATACATGTAACATGGTTGAACCTTGAA







AACATTACGCTATATGAAAGTAGTCAGACACAAAGGGCCACAT







ATTGAATAATTCCATTTCTATAAAATGTCCAGAAGAGGCAAAC







CATTAGAGAGAGAAAATAGATTAGNGGT[A/C]ACCAGGGGAT







GAAAAAGTAGATCATTGGTGGCATATCATAACAAATATACTAA







ACAAAAAAACAAAAATGCTGAATTGTACAATTTAAAATGGTGG







ATTTTATGTTATGTGAATCAGTTCTCAATTTAAAAAATTTTTA







AGTCACCTATGATTTGAGTCATCTAACTTTTCAAAACTTTTCA







CTTTTTTCACCCATGAAA





BICFPJ706168
5
4
70224288
0.422
CATGAAAAAGTCTCTAGTTCAGGTGAACGGCACTTGGTGAACT







TAGGCTTCTCAGAAGAATCTGCAGAACAAACAGGAAAGACAAA







GCATGAAATCAAGAGGGTAATGTATGGTCCTAATAAAACACAT







ATTTACAGTGGTAAAGAAGAACCGTTTCTTTTAGAAACACTGT







TTTCCCAGGTCACAGTTTCAAAGCTCAT[T/C]GTATGTTAGT







TATTTCTCTGTCAGTTCTTGAATTCAGGGGAAAACTGTTTTGT







CTTTATTTCAAACATTATTTTTAGGGCCTGTTTAAGAAACCAG







AACTGGATGTAAGGTTTATTTGGAAGGTTGACCCCAAAAACCG







GGAGTGAGGGCAAGGGAGAGTGGGGGGAGGGAGGACAGCTAAG







TGTGCTGTACGGCAGGTT





BICFPJ159894
6
4
70228384
0.422
GTGCCCTTAGAAGCTTGTTTTATACTTCATTAGAAAGACAACA







AATGGCTTTCTTAGGTCCTCTTTGAGCCACAAACAATAATTTT







TGAACTAGTAATAATTCCTAGTCTTTCATGATTTTTAGCCTAC







CTCTAGAGAGAGTATTGTTCAATATCCATTTACAGCATCTCTG







AAAAGAATGTTTCACTTGACAAGACAGT[T/C]TGCCTTTGGC







CCTAAATTAATTCTCACTAGGAGAAACGAATGTTCTAATTCAA







GTGGGTCCCACATTCTGAAAAAAATGTTAGTATTAACTCTGAT







CAGCATTCAAAATCTTGACAACAAATCCTTATTTTCTCTTTAG







AGTCTTCACTTTGAAAATGAGGATTTTGACTCACAACACCATA







TTTCTTTTAGTTATTATT





BICFPJ1149345
7
4
70324248
0.504
ATTGCAATGAATTTGTTTTAATTTGGTGTCTTCACATCCCTGG







TTCACCTAGTTACTAACCTGGGGATGTTGTCTCACTCCTCTTG







ACATAGTGTGTGCCACACAGCAAATGCTCAGTAAGCACTCACT







GAACTGAACTGACTTGCCCAGTACGACTACCAGGGTCAGATTC







AACTCACTATAGACTCACTTGCTGACTT[G/T]GATCAAATTT







AATTTTATTAAAAATACAAGAACTAGCAGATAGAGGTTGTTGT







TGTTGTTTCTAAATCAAACTTATCCTCAGAACAGTCATTGTAA







AAATGATAAATATAGAAGTGTCTCATTTAATAAAAGTTTATGC







TATAAAATCAGTTCTATCGTTAAAAACACCTTAAACATTAGCA







TCCTCTTTTCCACAGTTT





BICF233J33542
8
4
70332822
0.360
CCAAAAGAATTTTGGTTTGCCCACTAACTTAAAAAAAAGGAGA







AGGAAATTTTTTCTTTTGCTAATTACGTGTGTAAGTACTGGTA







GGCTACATGATTTGTTTACTGTAAGCCACTGCATTTTCTTATC







TCTACTACCCCAAACAGACCAAGCAAACAAAAAAATGATCAAA







AAGCAAACATAAAACCAGCTAATGGTGT[C/T]GTTCTCTCCA







CTCTCCTTACGAAGTTAAAGGTTTATGTCAAAGCTAAAATGTC







AAAACAGCTGAAATAGATGCTCTCTGCAGACTCTCTAGGTCTA







AATGTAACTTGAGCTCTCGTCTTTCAGCTGGGGATACATTTTT







AATTTTCTTTAAAGATTTCCTTTGCCATACTTTTCATTTCACA







GGTTTTTTAAATTCCTCA





BICFPJ868850
9
4
70339136
0.241
AAATACACACAGAACCAGAATTGTTTATATTGCAAAACCAAGA







CTCTAGTTAGATAACCTAATAAATGTGTACTTTCTGATGCAGG







TGATCATTAGTAATGAATTATTGCAAAACTGAATCTAGTTGAT







TTCCTGGTGAAGCTGTTCTAAAAACTCAACATCAAAAATGGCT







AATCCAAAAGGGTATTTGAAATGATCCA[T/C]TCTAGTTAAT







ACCAGGAATTTATTTGACTAATTTGCATTTGTTCATGTTATTA







CCTACTTTTTATGGTGCTGGTCATTTGAAAATTGGTCAGATCT







GAACTGCCTTAGGTCAATTTCTTTCTTTCCCATTAGTGAGAGA







GAAAAAGTTTACAGACATTACTGGCATATGTACTTTTGTTGAG







ATTTTTCTCTTCTCAGGA





BICF237J63288
10
4
70344838
0.347
TTACTAATCAACTGATTTTAAAATAGGGAGATTATCCTCTATT







ACTGAGGCAACTAAATGTAATTAGCCTATAACAGCAGAAGTAA







CTGTGAGATGTGCCAGAAGAGGAAGGCAGAGAGATGAGAAATC







TAAGAGGGATTTGATGTGCTGTTCCTGGAGGGGGTGGCACAGA







AAACATGAGATGCCAGCCAGGCAGACTC[T/G]GGTAGCAAAC







CTGGCTCCTAGCAGATAGCCAGGAAGGAAACAGAGAAGACTGT







CCTCCAATAGCAAGGAATTGAACTGTATTAGCAACCTCAAGGA







GCTTGGAAGCAAATTCATCCCCAGAGTCCCCAGAAGATAACAA







AGTCCTACAAAGTATTTTGATTTCAGTGTTCTCTAAGCAGAGA







ATCAGCTGAGCCAGGCTG





BICF236J34682
11
5
87922545
0.376
TTTTTAATAATAGCAGTTTAACTTTGAAACATTTATAGAATCT







ATAATAATAATAATGATAAAATATTTTAGGTAAAAGGACAAAC







GAGTAAAACAAGCTTGTAAGATTTTTAGTACTTTTATGCCTTT







TATGCCTTTAGTTTGTTTTATAGAGACTCATGCTGCATTGTTT







GCTGGTGTTTATTTAATAAATATGTGTT[C/T]GTTCTATTTT







TGTAGCCGGAATTGTGCTAGATAGCAAAGATTTAAAGATGCAG







TTGAAAGTTTCTGTTCTCATGGAGTTCATAGTCCGATGAGTAA







GACAGAAGTGAATAATAATTCCAATTAAATAAAATTCAGTGAT







TTTGGACAAAGGACAGCCAGTTTGGATTGGGGGCTTCATGGAA







GACATAGTATGAAAGTTT





BICF230J37720
12
6
10897023
0.430
TTTTAAAATCCAGCAGAGGAAAAAAAAACCAAGTCAATAAAAC







ATTATAAGACACCTTCCCCAAAAATAATGGTAAAAAGAAAAGC







GTATAATATTGCACAAGTCCTAAAATAACCATAATTACCCTAA







ATGTATGCCAATTAAATTCACTAATCAAAGACAGACTCTTAAC







CTGGGTTTAAAAAACAGAATCCTACATC[G/A]TATATCTAAA







ATAAACACATCCAAAGCAAAAGGATATGGAAAGACTGAAAATA







ATGTTAGAGGAAAAAAAAAAGTGTTATCGGGCACACAGAAACT







ACAAAGAAAAATAAACAGAAATCTTATAGCAACAATACAAACA







GAATTTAAGGTGAAAAACATCATAAAGGAGGAAGAAAGAGTCT







ACATATACACCAGAAAAA





BICF230J17282
13
7
47321194
0.296
CCTGTGTGTGCACACAGGGGAGCGGAGGCTGGAAAATGAAAAG







GACAACTTTGAGAGGGGAGCTGAGGACAAGTTCACACTGGATG







CTCCAGATCTGGGGCAGCTGATGAAGATCAACATTGGCCACAA







CAACAAGGGCGGGTCTGCAGGTTGGTTCCTGTCCAAGGTAGGT







CCAACTGCCCGACTCTGGTCCCTGTGGC[T/C]CGGGACGGGG







AGTTCTGCTTCTCAGAGGCCACATTTCAGCCCTAACCTCCTTC







TCTTGGGCTGTCTGCCTCCGCTCTTCTACCTACACCTGTTGGC







CAACCATGCTGCCTGACTCATAGCCTCCCAGACCCCATGCACG







TTACCACCTTCCCAGAAAGCCCAGGGCTCTCATCTAAACGTAG







CTGGGGGTAGGAGGTGGT





BICF229J57386
14
7
54659539
0.369
GGCTCCCTGCTCAGCAGGGAGTCTGCTTCTCCCTCTCTCTTTC







CCTCTGCCGCCCCTCCTCCCCCACTCATGCTCTGTCTCAAATA







AATAAAATCTTTTAAAAAGCAGCATGCAAAAGTCCCCAAGAGT







CTCTGTATATTAATGTTGTTATCTTTTACATTTGAGGTTAGTT







TATTAAAAAGAGAAAGAGAAATATAGAG[G/T]GCACACCCAA







ACATAAAGTCTGGTGAGAATAGGTAGTGGATCTGGATACCCAG







AAGAGTGGCTCAGCACAGAATTTGGGGGTACACCAGTGATATA







AGGTAGTAAGAAAGTGCCAAAGATGAATTTCCCATCCTTTACA







GTTGCTAAAGATGAGTCTTTGCAGAACTGTGTACTGACAGAGG







CTCCATAAAGTCCTTTCG





BICF230J38817
15
8
62091061
0.339
GATTACAGATTGGAGTGGACTTTTTTCTTTGTCTTGCCCCATC







TTGGTCACCGAATGACTTTGTCTGATGTCAATCTTCCATGAAA







ATGTTTATATTTAATAGAAAAAAAAAAAAAAGGACAGCCTATC







CATGAGTACAGGACAGTTCCTAGCAACACCAAGGTGTAGCTGA







TAATGCCTCTTTCACAGGGGAATAAACT[G/T]TAAGACGCTT







GATCACTTCTGGTTTTGCTTAGGTTAAGCCAGCCACCTACATA







GGAAGTCCAACTACTTTGAGACTTCCACGTTTTTGTTTTTGAA







ATAAGGGAGGCTGCCCTCCTACCTTTTCCTTCCTGCATCCAAC







ATGCCTCATGCAACACCTGTGCTCTCCACTGGCCTGTGACAGC







AACTTGTTATTCTGGGGC





BICFG630J163689
16
9
13329359
0.316
CCCTCCCTTCCTTCCTTTTTGTTCTCATTTTTCCAAGTAGTTG







CTACAGAGACCAGCAGACCCTGTGGCCCAAATATAAATAAAAT







AGTTGTGACTCTCCATCAGTTTATCTGGAAGGATAGGGAGAAA







GAGAGAGAACTGAACTGAAGGAGGAAATATCCCTAATTTTTTT







TTTTTTTAAGGGTGGAAGTAACTTGTTC[G/A]GAGAAAGAAA







AGAAATACACTGTGAGATCTGATGCGTAAAGAAAGAGAGGGAG







AACCTTGGAGAATGATTTGAAAAACAACCCATGCAATTTAGAT







TTCAAGTAGAATCTTAATCTGTGGACTACTTTAGAGCCTCTTA







AACAGAACATTAAACATATGGCCATAAAGGTAGAAACTTGAGG







TTTTTTTTTTTTTTTTTA





BICF232J28587
17
10
7116121
0.318
ATGATTGTCTGNTAGGTAATGTATGTACCCAGAACAGGAGCTG







NGTATTGGTTATCTTGATGGGAGACCCAGATGAGGTGGTATCT







TGGAACAGAGTAACTGCACCAGTAAGAGGTTNCAGATTGTGTA







ATTGGGTGGGGACAGACAAGGGCNGAGGAGGTTGAAAGAGTTT







AGCCAGGAAGTGTATCACAGAGTCTGAA[G/A]GCTGGACTTT







TCAGACAGTCATCCTCCAGGGCCAACTAATGACCCACACATGA







CCTAAGAGAGAGCCTGAGACCAGGCCGCATGTGACTATGGAAA







AATAGAGCCACTCGGTAAAAATGTTCTTGTTCCAATTTATGTA







GTTGTTTTCTGTACGTACTCTGAATTCAGCCCACTCAGAAAGA







TTATACTAGATCTTGATC





BICF232J33774
18
10
7119531
0.383
CCACCAGGTTCTCATGGAGACCAGAGAAAAGTCTCTTGTGCTT







GGCCAAGAGGAAGGAAAAAGGTACCATTTTGACATATACCTAG







AGGAAATGTATTGTGAAAAAAGCCTCTAGAAACTGCTACAATG







AATGTCTACCAGACAAAAGAAAAGACTGCACTAAAGTCTGACT







TGCAGGAGAGGAAATGACCTAATTCCAT[C/T]CCCTTCTAGA







CTTCCTGCCTCAACCAAGGGAGGAAAAATGCTGAGAAGTTCTT







GTGAAGTCATAACCCAGGTCCACTATAAGACTAAAACTTAAGC







CCAGAAGTAGAGAATGCTCCCTCTCCCACACACTAACTACACT







TTGCTACCACATTAACAGGCCTCCTAAGATATTTACTCAATTG







AGATGAAAACTTGTATTC





BICF230J39560
19
10
7191026
0.130
TGAGGGAGGAAAATCTGATACTAAAAGATTTTATTCCTAGAGA







AAATATGAAGACTAGTTCTGTATATTCTCATCCTAAGTAGAAC







TCTTTTACTCTTCTCAGTGCTTTGGTTTTGTGTCCTTATTTCA







GTTTATATATGTGACTAAGTTAAAGAGATTTAGAATAAGTGAA







TTTTATTCCTCCAGCAGGAATACAGAGG[A/G]CATTTTGTCA







AAACAACAGCCCTCTTGGCAGCTTGGTTTTCTGGGACACAGGC







TTCACTAGCTTCTCTCACTTCACCCCACCCATGGAAGATAAAG







CTTTTGAGTGGGAAAGCTTTACCCCAAGGTCAGGAATAGCATA







ATGGGGGCAGCAAAGCCTAGATTGGTTCACAGTCCCCGGTAAT







CTGAAAGCCAGATATCTG





BICF235J44776
20
10
7198354
0.399
AGCCATTGAGTTATGAAAGAGAATAATAGTTATAGGAAGGTGG







AAGTCTGTTAGGAAAGGCTATTGTAATACAAGGCGCTTACTGG







TGTGTGAGAGGGACAGAAGGCTACATTGGAAATGTAGGAAAGA







ACCATAATATACAAAGATAGAATATCACCTCAAAAAGTTTAGA







ATTTATCCTAGATCAATGGAAACTCACC[A/G]AATACTTTCC







TGCTTAATCAGGAAAAATGATATGATCAAAATTTAGTTTTAAC







TTAGTTCCGTGTAAATTAGGGACTGTAAGTGATTAGAGTGGAT







GTATGTTGATGAGTAGAGCTATAGCAATATGTCAGGGAGAAAT







GATATATTAAAATGAGCAGTGATTGTAGGGTACAGAGGAAGGG







AAGGAAATAAATACTTAT





BICF232J56993
21
10
7252690
0.407
TAAAGAAAGTAACAAAAGGTTATTTTGAAAACTATTTGGCTAT







TCAATTGTGTGATTTACTTTTCCTATTTATAATGGACACTGTA







CTGACAATTGTAGATACTCATAAATCTGTGAAAAAAGAAAGTA







TTCACACTTTGATTTTAGTAGAAATTTATTTAAATTAACTTAA







GTAATATTTCGTTGGCATTTAGCTTCAG[C/T]AATGGATAAA







ACTATCAAATTCAAGAAGGTAACTGGAATTTTAAAGAAGAAAG







ATTATTTTATGGCCAAAGTCATTTTCAGTTGTCTTTGTGAAGA







ATGAGAGTGATTAAACTCTGTAATATCAAGTTATTTCTAAAAC







TCATGGAAGCTAGGAAAAAATGTTTTCTACTGTGTTTTGCAGA







ATTTCTCAAATGACTTAT





BICF237J13241
22
10
7345063
0.267
TTACTATAGGACAAATGATGCCTCATTTGGAGCTTATAAATTG







TCTTTTTGATTACTTGTGTTCTTTCTCTGTGCCATGGCACCCC







TGAAAGTAGTTTGCTCTTCTCTACAGAAATCCCAAGTTGATTT







ATGCATTCCTTTTTTTTTTTGAAGGATCAATAGCAAAACTATT







TTTAAACTATTACAGCATGAACTGTATT[T/G]CATCTAAGAG







AGTATATTATTTTTTTAGAAATTAAAAAGTTAGTAATTTGGTC







TAATTACTATAGTCACCTTGCAATTTGAATGTTAATATTACAC







TGTAATGCATCAATATTGGCAGCGACTGAACAATTTGCCCATT







TAATGAACAGAAAGAAGCAATATTAAATATATTTAAACAAATA







TATATTTCCAAGACAGAG





BICF233J7609
23
10
7394805
0.058
ATCAACAGTTGTTGAACATTCTTTTAATTAGCAAATTAATTAC







ACATTGTAAGTATGGTTTACTATCCCCGTAATCACAGTTATAT







TTTGCCAAATGGCAGCTCTCAAATACTCCCATGGAAAAGTGTG







GTTTATTGGGGTGTCTTATTACAATACGGATTTCAAAACTATT







TTGGCTGAACTGATGTTACCATTTTATG[T/G]CTTACTTTTT







TTCAGGCATTAATTTCAACAAAATCAGCTAGATAAAAAATAAA







CTCCAAATAAAATGTCATATATAAAATGTGACTGGAATATCAA







TTTCCAGGTAGTTATTGTATTTCTGTTTCAAAATCTGTCCATT







ACCTTCCATTTACATTTAATTTCATCATTCATTTAACAATTAA







TTGCACACTAATTTATGA





BICF236J47678
24
10
7396563
0.114
TTATTACTCAGGTTCTCCAAAGATACAACCAATATATATGTAA







AGAGATTACAAGGCATTGGCTCATGTAAATATGGAAGGTTGAG







AAATCCCAAAATCTGAAGTTGCTGGGCTGGAGACCCAGTACAG







CTAATAGTATAAATTCCAGGCTGAAAGCTGGAAGACTCAAAAT







CTAAGAAGAGCTGACTTTTCAGTTTGAG[A/T]CCAAAGACAG







AAAAATACCAATGTCCCACCTAAAGCAAGCAGGCAGAAATTTC







CTCTTACTCAATTTTTTTGTTCTATCCCAGTTTTCCATTAATT







GAATGAGACTTATTCATATTAGGAAGAGCAATTTGCTTCCTTC







AGTCTGTAAATGTTAATCTTATCTAGAAGTACCTTTACAGCAC







ACCTAAAATTATGTTTGA





BICF231J59264
25
10
7754196
0.240
TCTGGATTCTGATTTGATGGGTTCCTGTATATTCTGGGATTTG







GTATTTCTGTTTTAATTCTCCAGTCGATTGATATTTGAATGCA







AGGTTATGAATTATTGGATTAGATATCAGCCATTATTTAGACA







CACATTCAGTATACATCAGAGTCACCTGGAAAGCAGGCTATTA







AAACACACATTGTTGAGCCTACCGTCAG[A/C]AGTTTTTATT







TGCTAGGTCTGAAGTGGGGACCAAAAAGTCACATTTCTAACAT







GGTCCCAGGTGTGCTTATGCTGGTTTGGGGATGATACTTTGAA







AAGCACAGAACTGAGTCAGTTATGCTTAAATTTTTGAAAGTGA







TAGAACATTTTTCAAAAGTTATAAATAGGATTAACATACCCTA







CTCCCAAAATGTATATAC





BICF233J61217
26
10
7758444
0.188
AATGCCTAAAAGCAAGTGCTATTCAGATATTTGTCTGAGAACA







TAGAATTTAAGCTTATACCATTAGAAAAAAATACCATAAACCT







TTTTTTGAAAACCTCTGTATTTTCAATAGATAGAAAAGAAAGA







TAAATGAAGTGATTTAATAGCAAATGGGAAACTATATTTTAAT







AAACCACTCAAAATATCAGAAATCAACA[T/C]GTTAGGGCTA







AAAGTGTTTTTATTTTCCAAAAAGCATAGGAAGATTTCTGGGA







AAACGTGGTAACCTAAGTAGACACATAAATATTTGATTCTCCA







CTCTGCTCCTACCCCAGAATCATCCTCCAAAGGAAATGACCTA







CTAAATTTTAAAATAGAAGAAACTCTGCACTAGCTATGGAAAT







TGACAAAGATATCAAAAT





BICF236J3335
27
10
7904371
0.301
CATCTCAATTTAAATCCCTGAATATAATTTGAGAGCCAAAGGA







CATAAATAGATGACCCAAACTATCGCAATTAATAAACTACATA







CAATTGATAAAACAGTAAATAATTAACAAAGAACCTCTTAATT







TTGTGGTAAATCTTAAAATTGCTGTAAAAAATGAAATCTTTTG







TAAAAAACAATAGTTAACAAAGGTGTAT[G/A]CATCATCAGG







TATTATGTGAACAGTGACAATAAAATAGTGCTTCCAGTTTAAA







CAATTCTTAGGCTCCAATCTGAATTTTTTAGCTACAAAAGACA







AGCCATATTTTTTACTTGAATTATTATGTTATAACACAAATCT







ATAATTAGTTGTAAACCTAATGTATGAAAAGCTATTCCAAATT







TATAAAGTCTTTAAATAA





BICF229J43178
28
10
7914000
0.235
AGATGGCAAGCCCAATGGAAGACCAGTTAAAAGGGATGACGAG







CCTACAGGGAGAAAATATGAAGGTCTGCTGTATGGTTAAGGGG







TTAGAGAAGATGAATTTAAGCAATTCTTAGAAATCCAAATCCA







TTTTTTAAGCCAAGATTTTGGTGGTTCCCCCAGGATTCGGTGA







CATCACAAGGGAAGGACACAGAAAAGGA[T/G]AAAGGGATCT







GAAAGGCAAGATTAGAAGTTCACGACTTGCAATGTGACTTAGT







TGAGATACTTACTATACTTTACAATTCACCAGCGAAGGTACAT







CTTCCCTCAATCCTTCCACCTTCCTGCTAAGAACATTTCAGAG







TGAGCACAGTTCTCAGAACAACACCCGAAGTCAAAATAATGCC







ACTGAAGCGGAATTCCCC





BICF233J33223
29
10
7926382
0.207
GCATCTTTACATGTGTCTGTTGGCCATTTGTAGGTCTTCTTTG







GAAAAAATATCTGTGTAGGTGTTCTGTCCATTTTTAATCGGGT







TATTTGGGGTGGGGGTTTTTTGGTGTTGAGTTGTAGAAGTTTT







TTAATATATTTTGTATATTAACCCCTTATCAGATAGATCATTT







GCAAATATCTGCTCCCACTCAGTAGGTC[T/G]CCTTGTCATT







TTGCTGAGGATTTCCTTAAATATGCAAAAGCTTTGTATTTTGG







TCTAGGCCAATAGTTTATTTTTGCTTTTGTTTCCTTTGTGCCT







GAGGAGACATATCTAGAAAAATGTTACCAAAGCTGTTCAAATG







TTATCAAAGAAATTACGACCTATATTTTCTTCTAGGAGTTTTA







TGGTTTCAGGTCTTCTTA





BICF231J47172
30
10
8033689
0.025
TGAACATAGGGTAGCACATCATTGAATAACATAGTACATAAAA







ATATAGCACACCAAATATCAGTAATAAAATGTGCTGGGGAATA







TGGAGAAAAGGGAACTCTTGCACCCTGTTGGTGGGAATTTAAA







TTAATAGAGCCATTACAGAAAATACTATGAGGGCTCCTCAAAA







AATTAAAAATAGGTATAATACAGCAATT[G/C]CACTTCTGGG







CATATATCCAAAGGAAACGACATCAGTATCTCAAAGAGACACC







TCTGTGCTTGCATGTTCATTTGAGCATTTTTCACAATAGCCAA







GGTATGGATATAACCCATGTTTACGGACAAATGAATGTCTAGA







GAAGATGCGATACACATACACACACGTACACACACACGCGCAC







ACACACACACACACACAT





BICF230J68893
31
10
8047339
0.042
CCCTTATGTTAGGCACATAAATCTTTATAAATGTTACATCCTC







TTGTTGGTTTCACCCTTTTATTATTGTATAAAAACCTTCTTTG







TTTCTTATTACACAGTCTTTGTATTAAGGTACCATTTGTCTGG







TATAAGTATAGTTACCTCAGCTCTCTTTTGGTATCTGCTTACA







CAAAATATGTTTTTCCTTCCCATCACTT[A/T]CATATTCTTT







GAGTCTATTCTAGGCCAGGTATAGTTGGGTCTTATATTTTTTA







TCCATTCAGCCATTCTTTGTCTTTTGATTGGAGATTTAGTCTT







TTTACATTTAAAAGATATGTACTAATTATCAATAGATGTGTCT







TTTTTGCTATTTTGTTAATTGTTTTCTGACTGTTTTATAATTC







CTTTGTTTTAGTCCCCTG





BICF231J5699
32
10
8154363
0.417
GTGTGTCATTTGGACTGTGACTCATATTAACATAAATTTGCAA







GTAAGAACTCTAATAAGAACATATTTATCTTCCATCCTTAAAA







TTCCACCAGCACCTCTCCAAGAGAATGTTAATTATAACAATGT







ATAACTAACTTTATGGGATTTAAATTCAGAGAAAAAAAATTTT







TTTAATTAACTCATTTTCTGCCATGCTT[C/T]AAAATTTGTT







TCCAAAACTTAAGAAATGAAAGTTCTGAACTCATATTTCTAAA







TTTTGAACGGTTTTCATTTCTGTAGAGCTTTTAATTTGTAGGG







AATTTTCCTAGCCATGAGCTTCTTTGAGTTTCACAATCCACCT







GCGGGGTAAATTGGAACTGGTAGCCCCTTTTACAGATAATCAT







AGTTATAAGTAATTAAAT





BICF230J68738
33
10
8261043
0.252
GATATAACAGAGAAGGGAACAGAAAAACCCAAGACCCCATTGA







ACTTATGTTCAAGAGAAGAACAGACAATAAACCAAATAAATGA







GTAAAATATATAGTCTCTCAGATGATGGCAGTGCCGGCTCCAT







GGGCACAGAGTCTGTGCAAGTTCACAAGGCCCTGAGCTCAGGT







TGATCTAATGCTCTGCTGTTGCCATCTT[A/G]AAATTTTTAA







TGTTTTTGAACAAGGGGCCCCATATTTCCATTTTCTGGATGGT







GGTAGATACTATGGGGAAAAAAATAGTGCAAAGGGGTTAGGGA







GATGAAAGGCAGGAGGACTGGTCAACATAACAATGCACATTTG







GAACACTATACAAAGGTGCCTAGCAGAAGGGACTGGGGGGTGC







TGAAACCACCTAGAGGAA





BICF237J66279
34
10
8428391
0.440
TATATTTACACATTAAATGTTTGCTTAATAAATCTAATAAACC







ATAAGATCACTTGAGGTTACAAAGCAGATGGCATATTTTAGAT







AAATTTAAAGGAATTAGGCAGTGCTTATAAAACTCCTGCATCC







AGTACACGTCATGAAAAACTAAAAATTCTGTAATAGAAAGGGA







CCCATAGGTCTTTTCAATTTGATTCTAG[T/G]GGTAATGATG







ATAAAAAGAATCAAAAACTGGTAAATCACAAAATATAGAAACA







CTCTGTCTAATCAATAGCGTAAGTCTCTAGAACATCCTCCAAA







TCAATAAGATATAGAATAATGACAAGAACCACCTAAGAGTAAG







TGAAAAAAAAAATGAATTGTAACACAGCCCAGAAACATTATCC







AAACTATATTAAAAGTGC





BICF230J67378
35
10
8445140
0.481
CATTACTGGTAATTGTGACCCACTTTTATTTATCCATTCATTT







CACCATTTTTCATAATATAAGTAGGAACCATGAATCTCCTCAC







CCAAAAGAAGTCAGAACACTCTGATCACAGCTCACATTCAGCT







ACGTGGTTACTTCCTAGGACATCCCTTTTGATTCCAGACCTGA







GACAATAACCACATTGCCTTCTACATTC[G/A]TAATTCCCTT







GATAATCTCGTTATACAGGATTACATCTCCCTATCATTAAGAA







ATATTTTAGTCATTTTTAACTTTATAAAAATGGCGTTGCAAAT







TATTTTTCAGAACTTGTTTTTTACTTAGTATTGTATTGCTAAT







ACTCATTCATATTTATAAATGCTGTACTTCATTCAACTACTGT







GTCATATTTTATTACTGA





BICF234J4350
36
10
8464927
0.380
TATCATTCCAATATTCAAAAAATATAAATGGCAGAAAACACAN







CTTTTCAGAAGATAATTCTTATCAACATAGGTTTGGGAGGAAC







ACTGCCCAGGAAAAAAATAAGCTATTGTCATAACTCATGACAA







TAAGCATAGGTTAGAGAGACTCCTAAGCTTTTTCTGTAAACAG







CATAAACGCACAGAAGTTTATAATTAGC[C/G]GTGGTTGGGG







AGTCAAAGACAGAAGATATTGATGGGCTGGGAAATATAAGAAA







CCAGGATAACATCTTCCAGCCACAAAGTGTTTTGTGGTTAAAT







ATCTGTTAACAGAATTAGCAAGAGTAGGTAACAATTTCTTTTT







TTTGGGGGGGGGAGTGGGGGTAGGTGGTAATTTAGAACACAGC







CATCGCTAAGTAAAGTTT





BICF229J64181
37
10
8482034
0.470
AGTACAAATGAAAAGGAAAGGCACTGAAAACATTCGATTCCTT







CTTCAGTTTACATAAATGGAAGTGTTCACCCTTAATTCAAATA







CTCCAGTTTGGAAGTAAAAGGTAATAACTTCACATACTACGCA







GGTAGGACAACCAACTTCTTAATTCAGTATGGAGTTCTAAGAA







AATTATTAGAGTTCAGGCTGTCAAAAGA[C/T]TCATTATAAT







TTCTTCCTACCCCATGGTGCAACTTGACCTGGCACTGAAGGGC







TCTTCTGTTTCTGTAAATGCGTCAGATGTGGCTAAAAGACATT







TTTTAAACACGAAAGTAGTTTCCTGCTCATGCAGTGATATAGT







CAATCCCTTGCCAAAAGGTGGGTTCGGGGAGAATAAAAATAAA







ACCAAAAAATAACAAAGG





BICF234J19872
38
10
8623480
0.216
CCATTCTCATTTGACTCTGCTACATCAGACATTCAGACCTACC







ACTCCCTTGGCACTGCTTCTATGATGGTCTCTGACTACTTCTA







CATTGATAAATCTAATGGTAATTTCAGGCTTCCATTTTCTTAA







CCTGCCAGCAAAAATTAACATATTTATTCACTTTTCCATCTAG







AAATAAATTCTTTTTACATGGCTTCCAT[G/T]GTACAGATTT







TCACTGCCTGCATTTTCTCAGAGAATCTGAAAACATCCTCTTC







ATGTTTCTGATCTCAAAACATCAGGGTGTTGCCATGCTTAGTC







CTTGAATTTCCTCTTTTCTTTGGTGATTTCCTTCCATGTCCTA







GCTTTTTAAATACTTTCTGCATATTGATATCTTTTGAATTTAT







ATCTGTTGCGCAGAACTC





BICF230J19440
39
10
9101316
0.224
GGAGGGGTGCCCATGCTTTAAATTTTTATCACCTCTCTCAGTG







AAGCTGAGCATCTGATCAAAAGTGGGGAATTGTTAAAAATAAA







GTCACTTGTGCTATGTGCATGCCACCAAACCAAGTTACAACCT







CTCCCAAGAGTGGAAGAGTGGAATCTTAAACCAGTCAATCTCA







AATCACCTGATCAGCACAAGTGAAGTAA[G/T]CCGCTTGACA







GGACCCCTGCCACCTCTGTGAAGGAAGGTGGCCTTGCCTGCAA







CAATTCACTATTTGACAGTAACTTCCTTGTCTGGCCCCCTTCT







ACCCATAAAAGTCTTCCATTTTATACAGCTCTTTGGGGCTCCT







TTCTATCTGCCAGGTAGGAGGCTGCCAATTCATGAATCACTGA







ATAAATCCCAAAAGATCT





BICF231J12788
40
10
9224910
0.126
TATTGGTGGTCTTCTACATTTTTATAAAAAAAAGAGCATATCT







TTTTTGATTGGTCCTGTATGGCTTCACTTGCCCTGACCTTCAA







CTCTGGTGCTAGCTGTGGCTCCCCACTCTGCTGTCTCTCTAGG







GCTGCTTCATTCTTGGTCAGTCAGGAATGATACAAAGCACGTA







TATTTTTCCACAGAAGAGCTTCCGTTCC[G/A]TTTTGTGGGT







TTGTTTGTTTTTGACTATGTCTTTAGATGTGGGGTTTTTTATC







TTGGTTTGACATCTTTACCTTTCTGGGTACATTTATTTCTTAG







AGCTAGCATTTATTAAGCACTTGCTCTATACTAGACACCCAGT







ACTCAAAATATGCTTATGATAATGTGGGTAGTCTGCTGTCTTA







TTTATGGAGAAACCAAAG





BICF233J16820
41
10
9347714
0.153
CAGCATTACCAAACACCACAAACCAGGCCAGGACAGACACTCA







CACATGTGTGCACGCACACACACACAAAAAGGAAGAAAGAAAA







TTATAGGCCAATATTCCTGATAACCATAGATGCAAAAATCCTC







AACAAAATATTAACAAAATAAATTCAACAGTATATAAGAAATA







TGTACACATTAAATATGTACACTTATTC[C/A]ATGGATATAA







GGATGGTTCAACATTCACAAGTCTATCAATGTGATATACCACA







TTAACAGGATGAAGGGTAAAATCATATGATCATTTTAATAGAT







GCAGAAAAAGCATTTAAGAAAATTCAACATGCATTTATGATAA







AAATCCTCAATAAAAGTAGATATAGAGAGACTGTTCTTCAGTA







TAATAAAGGCCATATATG





BICF233J39337
42
10
9648006
0.090
AAACAGAAAGATCCATAAAAAGACAGCACTGTATAAGGCAATG







GGTAACAAGCCCCAAATGAGTAGTAAAACAAAACTTGAGAAGG







CGGTGCTTTATTGTGGTAAAAATATGGACTTTTGAGCCCAACA







CTGTGTGCCTGAGTCCCAACCCTGCCACTTTCTATCTGTGATG







GGCACATTTCTTAGATTCCGCCCCCCTT[C/T]CTCCTGGGGC







AGCAAATAAGAAAGCAGATTTTCAAACGATCTGTAACCTGCCA







GGCACCCTGGATATATTAATTACTAACGATCGCTGTGAGGTAG







GCATGGTCAGCAGTTTCACGGGGTGACCCATGCTATATAAAAG







CAAATATTCCCTGCAGACCTAATATACACCCAGCATTGTGCTA







GATATACAATTTGAATCT





BICF234J8664
43
10
9891080
0.061
TCCATTTAGTCAGCGGTAGAAAATCCTTTTACCCAGGATGCCT







GGGTGGCTCAGTGGTTGAGCATCTGCCTTTGGCTCAGGATCCC







CGGGATCCTGGGATCAAGTCCCACATCGGGCTCCCTGCAGGGA







GCCTACTTCTCCCTCAGCCCATTTCTCTGCCCCTCTCTGTGTC







TTGTGAATAAAAACATAAAACCTTTTTT[A/T]AAAAAACAAC







CAAACTGTTCTTTCACATAAAGAGTTTGAAGCAGATAATTCTA







GGAATGTTTTTGTGCCCAAATAGACATAATAATTGGAAAGACA







CATAAAAATACATAATTCCACACATAGGTGTAGGGAACAGCCT







TAAAGCTTAAATTTTAATCACATCAGAACTAGTTAGAGTTTTC







ATCAGGTAGAAAAAGTAA





BICF230J39580
44
10
9947523
0.267
GGAGATGTACCCTGCTGCATTCTGTCCACCAATGAAGTTTGAG







TTTCAGTCTCACATGGGTGATGAGGTCAGTAATTCATTGTTTT







AAAACTAATGAGTTGTCTTTTCACTTACATTAGTTACTCTCCA







GAACTAAAAGTTGTCAGAATTCCCAGTTGTGTTTTCTCTCCTG







CCTTTGTATGAGATCTTTGCTAGTTACA[T/A]CCTAGGATCA







GCCACTACCACCTTCCCCTGCCCCATCCACTGCCCCCAAGCCA







TAACCGCCTCTTAAAGCCAAGGGTCCAGTATTCAAATGCCTGC







AGAGGCCAAACCTTTGCACAAACTCGGTTGTAAGCACTACACT







GTCTCAGGATAGTTCTTACTTCACCTCTAGACATTCAGATTTA







AATTCAAAGTGAGACCCT





BICF237J61418
45
10
10022361
0.072
CTCAGAGCAGTTAGGGGGCTAAGGAGTGTGGACAGAGGCCAGA







GTTTGGAGGACCATGGAGGCCAAAGTAAGAAGGGTAGTTAGAT







GGCACTTGAAGGTGAAGGAGAGCCATTGGAAGGACCTAAGCTG







GAGCGCAACATCACTCCTGCTTTTAAAAGATCGCTCCACCAAA







CCCCAAGGTGGGATCACCGCACACCTGT[T/C]AGGACGGCCG







AGGAAAGAAAGGGAGGGAGGGAACTGTTGGAAGGAATAAAATG







ATACAGTCGCTGTGGAAAACAGTCTGGAAGGGCCTCAGAAAAC







TAAAAATAAAACTACTGGGGTACCTGGGTGGCTCAGTGGTTGA







GCGTCTGCCTTTGGCTCAGTTCATGATCCTGGAGTCCTGGGAT







CGAGTCCCGCATCAGGCT





BICF233J22431
46
10
10263857
0.252
CTTACCCCCGTTCTTGCAAACATCCTGGAGGCAGATCAGGAAA







AGTGTTGGGGTTTTGACCAGTTTTTTGCAGAAACTAGTGATAT







ACTTCATCGAATGATAATTCATGTTTTTTCACTGCAACAAATG







ACAGCTCATAAGATTTATATTCATGGCTATAATACGTAAGTAC







CTCTTTATGTTTTCATCCTATATCATAA[T/C]GTGTTCTATA







ATTATCTTCCTAAAAATAGAGGAAATGGGATCCCTGGGTGGCG







CAGCGGTTTGGCGCCTGCCTTTGGCCCAGGGCGCGATCCTGGA







GACCCGGGATCGAATCCCACATTGGGCTCCCGGTGCATGGAGC







CTGCTTCTCCCTCTGCCTGTGTCTCTGCCTCTCTCTCTCTCTT







TCTCTCTCTCTCTCTCTC





BICF237J14551
47
10
10567931
0.121
AAGTCTCTGGCTTAGGAAGGAGAGCTGAGTTCACGGCGGGGAG







GGAGATGGGTTCGTATGGAGACAGGTTGCATTGGAGATACATT







CTGGCCGCCCTGTGTGTCACCTCTCCAAGATGTTCCCCACCAC







CACACAGCTGGAACCTCCAGCATGGGTCACCCATGGGCTGTCC







CTTCCCTCCCCCATCCCCCCCGCAACCT[C/T]CTTCGTCTCA







GCCCTCAGCGAGGCCTCCTCTGTGTTCCTTCTGCGATCCAGAG







TTTCAACAACCGACACAATGCTAATTGTAGACAGTTTTACATT







ATTTGGTATTTTTTCAAGTATGTTAATCTTAATTGTCTAATGA







AGTGGTAAGTTCCCTAATGACAGGGGCCTCATTCTACTCAGCA







TCCTCATAACACCTAGAG





BICF232J42790
48
10
10603901
0.324
CTGTCTTGCAGAGGTGGAGAGCTCTGTGGCCTTGGTCACATCA







CATACATCTCCAAGTCTGTTTCCTTATCTATTAGAGAAAACAG







GGTTTAAACCACTTGCCCCTGAGGGTAGAGAGAATGTATGTAA







AGCACGGAGCTCAGTACTGGGCATAGAATAGGTCCTTGATATA







TGTTCGCTATTATTGAATAATGCCAGAG[T/C]TTAAATATTT







ATTTATTAATTTTAGAAGGTGTTGAGGGGGGTAGAGGGAAAGG







TAGAGAGAGAACTCTCAAGCAGACTTCATGGTGAGTATGGAGC







CTGAGGTGGGGCTCGGTCCTAGAACCCTGAGATCATGACCTGA







GCCAAAATCAGGAGTCGGACACTTAACCAACTGAGTCATCCAG







GCACCACAGGGCTAAAGT





BICF235J33471
49
10
10676776
0.226
GCTATAATTGAAAAAGTAACCTACGAGCTGTGTGTTCTTGGTC







GCGATACTTAACTATGGTGTTTCATAAATTACTGAATGCTTTA







ATTTTACATTGATTCTCAAACTTAGATCTTCAGTCTCTCTCCC







TTTCACACAAAACTCTCTCATTTCAAAGGAATATATAAGTGTT







TCCCTTCCAAGGAGTCAGCTTCCCAGGG[C/T]ACTAGATCAA







AAGCTGCTTATAACCTGGGGCTTCATCTGTTTTGATGACTGCA







ATATCCCCAGCACCTAGAAGAGAGACTCTCTAATAGAAAGTGT







GTCCAATAAATATTTATTGAGTAAGTGAGTAAATGAGCAGCTC







ATGTCTTGGATGGAAACACTTTCTCTCTTGATTCTGTGATAAA







TTATTTGACTCAATGTGA





BICF236J49836
50
10
10793797
0.145
ATGTTGGTGTTTGTTTCTAGAAGCCTGAAGACTCAGAGTCATA







TCAAAACAGCAAGCTTAGGCACACTGCTCATCTTTTTATTTTT







ATTTATTTATTTATTTGCTCACCTTTTCCTTTTGGGAATCTTT







ATGTGACACGTTGGAATAAACGCGAGTAAATTCCTTATTTGCT







TTGGAGATGATCCCATGAGAAATCACTC[A/T]AAAAAAGTCG







TAAAAGTGTAGCAGCCTCCACACTACACAGTATTCTCTCTACC







TGCCCATCAGGCAATGGAAATATTACAGGCTCATTTTCTCCAC







CCTTCCTCGTTATCAAAGCTTACCTGCCCTGCAGCTTGCCAGG







TGAAATTCATGGAATGGATATTGACGGGAATGGCTGGCATTCT







CTGCTGCGCTTTCCTGAA





BICF234J57518
51
10
10946643
0.230
TTCAGAGTGGGTTTATCCTATTTCAGTTCCCAGTGTGGATGCG







CAGTTTCCTTTATATTCTCTTTTAAAGGGTATTTATTTGTATT







GCCACCTTTGTGTGTTTGTGTGGTGTGTACGTGTATGCATGTG







TACAGTCTCTCTTTATCTCTGCTTTTTAGAGGTTCCCTCTATG







CCATTTCTTTGTTTTGTAACTCTCTCTA[C/T]AGGAGAGTCC







TTTTTTTAGTAGGTCCATTTAAATCACTTATATTTTGTGTTAT







CATGGACGAACTTGATGACATTCCTTGTCCTGTTTTATCCTCT







CTTTGTGATCTTTCCCCCTTAGTTATCAATCGTTTGCCGTATC







ATTTCTTTGTTTTATTTACTTTTATAGTTCATAGGGATTTGAA







AGGTTAACATTTTAGGTT





BICF237J16497
52
10
11086313
0.237
TGGGTGATAGGGCTACGTGGTTTGTACAGAAGAAATTACTAAT







AATCAGTTTAAAGAAATGCACTATTTCTGATAGGCAGGAAAAG







AAACATTATATTAAATTTGATATTTGTAAAAAAATTGCTTGAA







GCAAAATAGTAGAAATTCTTTGCCTATCTTTTAAAAATTCCTT







TGATCGGCCTAGTTATCAGTGTATCTAA[A/G]CTAAGTGATA







ATCTTCTAATGAAATGCAGTTCTATTGTTGCATAATGAAATAT







GATATATTCTTTATTGATTAATTCAGTGACGTGTTGAAGATAT







AGTAGTAAGCAAAGCCATCATGGTCTTGGGGAGCGTAGGGAAA







GAAGCAGACATTACTCAAAAGTTTCTAAATAAGAAACTACAAA







CTGTGATACGTGCTTTGA





BICF233J362
53
10
11295778
0.427
AGGCTTCATTGCTTGAGACTAGGTATGATCTCAGTAACAGAGT







CCAATATATCTATTGTCCTGGCTATTGTGTGAATTACTAATAA







AAATTAATATTTACTGAGTACTCATGCTAGGAACTGTGTGAGT







GCTTTTACATGGGTCATCTCATTTGTATTATATGTTGTTAACA







CAACACTATGAGTAGATACTATCACTGC[T/C]CCAATTCTTG







AGGAAACTGTAACTTAGGAAGACTAAGAAACTAGACCAAGGTC







ACACAGCTAGTCGGTGGTAGAGCCACAATTCAAACTCAGAGCT







TGAAATGAATATTACATGATAATGTCACCCCCAGAGAGTCCCA







GTGTTTTGAACTTGAAGCAGGTCCTCTGTAGGCTTTGTTCAAA







AACTAAGATAAAATGGTA





BICF235J39827
54
10
11315630
0.424
GTTAACACACATCTTATATGTAATATGATTATATATTGTATTT







CTATAATAAAGTAAACCAGAGGAAAGAAAATGTTATTAAGAAA







ATCATAAGAAAGAGAAAAATACATTTACAGTACTATAGTGAGT







GCGTTTGTTTTTTAAAAATCCATCTAGAAGCAGACCTGTGGGC







TTCATAACTATGTTGTACAAGAGTCAAC[C/T]GTAGCTCAGA







GGGAGTCATCATGAGAGAATGAAGTTACTGTGATGGTCTTCTC







AAGACACTGTGCAGAGAGGGCCCATAGACAGTTGGAAAGTTCT







GGAACTTGATTTTAACTCATTCAAGTAATGAATGAGTTCCCAT







GAATGGTAGGAAGATTTTATCTCCACCCCTGAATCCCTCATAA







AGAACCCTTCAATGTGCT





BICF232J59874
55
10
11354733
0.227
TCAGATGCTTCATTAACTGAGCTACACAGCCACCCCTCTCTTT







ACTTTCAAATGTACTTTAAAATAAGTATATAGACACCTTTTAA







AGGAACGCAGCCTATTCTAGTCTCCCTATAACTGATCATCAAA







ATTGTCCCGCTTGCAATTAAAAATTACTAAGTACTCAAAGGAA







GGATGCAAAGAAGTGGGAAAATTTGACT[A/C]CTAGTCATGA







AAAAAACAGTCAGTGGAAATAAATGATGGAATTGTGTGGCAAG







GACTTTAAAACATCATTAGAAATCAGTGAATCAATGGGAATTA







TCAGCAGAGAACTAGTAACAAAAAATGAAAATTCTAGAACTGA







AAACTATAATATGTGTACTGAAAAAGTCAGTAAATGGGCTTAA







CAGCACATTGGCATCTGG





BICF236J6812
56
10
11396331
0.266
TTTCACCCACAAGTATCTGTTGAACTGGACAGTGTTAGATATT







AGTCATCACAACAACAACCACGAATCATTTTTTACCATGTACA







AAAGGAAGACAAGGCAGCCAGCAAATCCATAGGCGATGGAAAC







TCAGCAGTGTTCCGATCACCCAAGCTGTTTGTTTATGCCTGTT







ACCAGAGCTCTAAAACTAACCTGTACTG[T/C]TCTTTGTTGT







GGCATCCTATAAGCAACAAAGATCAAATGTGATAGTAAATTAT







ATGTTGTTTTCGATATATTGGTTGGCTGCAAATTATTTTTAAA







AACACGGATTAGCAAGCTAGAAATGTTTCTCTTATGTTCAAAA







ATCTGAAGACAATTGATCTTGGTTGGAATAGCATTCCTGGGTG







GCAGGGACCACCTCCTTC





BICF236J12925
57
10
11407741
0.269
TAACCCTGTGGTGAAAATCTGCTTCAGGAAAAGTTTGGTTTAT







TTGTCATTTTCTTCATCACTTTGTTGTCTAGGATATGGATGTG







AAACTAGAGGCAAAGGAGACCATACAGATGAAAGCCATGTGCT







AAGGATTGGAGTGCACGGTGGAAGAAGGAACATGGGACACTGA







TACCATCCTGGAGTTCTCCCTACCTGGC[T/C]TGGCCAGTTC







TGGGATTTTTGTTATCTCACTGTGTTGCTGGCATCAGTATCTC







TACATTAACTACCAAATGATTGAGAAACTTTTTAAAGTGGTAA







TTAAGATTCTAGAGTCTGGGGCACCTAGGTGGCTCAGTGGTTG







AGTGTCTGCCTTCGGCTCAGGTCGTGATCCCAGGGTCCCTGGA







TCAAATCCCCAATCAGAC





BICF235J47583
58
10
11451490
0.608
AAAAGCANCATATCCAACATTTGTAGTTTGTTACAATAACACA







TTGAAAAGATTTATAGACTGTTTTGGGTGTGATTTTTGGATTA







ATTCCCTACTTTGAAACCATTTGTGAGGCTCTGTTTATTTAAA







GGAGGGAATGAATAGACCTGAAAACACCTAATTTTCATTTTCA







TCTCAGACTGGAAGCCAGTACATCTGTA[G/T]GGTTTGTTTT







TTGGGTTTTGTTTTGTTTTGTTTTTTTGGTTTTGTTTTGTTTT







GTTTAGAATTGAAAACTAGATCACAGAACACACAATGCTATAT







TTATCATTTTGATCATCGGTTATTAGATGCTTGTTTGCATGTG







CTTAAGCCTCTAGCCAAGATAAAAAAAAATTTTNAAAAACTAT







TGTGGTAATAGAGTCTAG





BICF235J49835
59
10
11455320
0.516
ACCCCATCAACAGGACATTTTCATACCTTCGCTTTTCCCTACA







GTTTCTTTCTCCCACTTTCAACTACGTAACATCTTCTTTGACC







TTCCAAGTCTGGTTTTGACACCAGCTCTTTTTACGTAGTAGCT







CCCAACTCCCTGAGTCAGATGCAGGCTGGCTCACCCCTGGAAC







ATGAACACGTATTATTAGTGCCTTGCTC[G/A]CAGCATCATC







CATTTTCTCTTGTCCTGGTAATTTCTGAACACTTTGTATCTTT







GTTTACCAGTTTTGTAAGCTCCTTGAGAAGAGATACCTATGTG







TTTTGCATTTCCTTCAAAATTGAGGATAGTGGTATACAGTAGT







TGCTTAATACANGTTTGTTGAATGAAGGGAAAAAAGCATATGG







TTAGGCTACATAGGTCAT





BICF239J15170
60
10
11461814
0.301
GTTTTTTTTTTTTTTTTTTTTTTTTTTTTCACTTACGGTGCAG







ATAAGCGCCTGATCTTGCAGTATCATGTTTCTGACGTCTTTGT







TCTGGCTTCATTTCCTTCTTTTTTTCCCCCCCTAAATGCCGTT







TTCATTTGTTCTTAGGGCTTAGAACATGTCAAAGAGCTTCTCT







GAGCAGTAGGTGGTTTTACAGAGCGCTC[A/G]GAGAATGAAA







ACTAAATGTCATCCTGGAAGCAGTCCACTACGAGCGGGGAGGG







GTCGGATTACTTTCCAGTCTTGGCTGCATTTGAACATGAGACA







GGAAAGAGAGAACTGAGGCCATGAGTCACCTCATTTGGACCCA







AGGCACCATTTACCTTGAATATGGAGAAAAATCGAAGCCAGAG







TTTCTTTGCTATTTTCTC





BICF232J39707
61
10
11523326
0.502
GTCTGTTGTTCTGAAATGGTTCTTCACGGAAAGAATAAGTATA







AAAGATGGATGGAACCATCCACCAAACGTAGCTTTATCTTTTT







TAACATCAGCTGGCTTTCTGTAAGCAATAAGCCAAATCGAAGA







GGTTTTCTAATCACTAAGTTTTTGGTTGCTATCTCTAAGAAGA







CTCCGCCTGTCTCTAAGTTGCTTAAACC[C/T]ACGGAGACGG







TGAATCTCTAACATCAGAAGAGAACGTGGAAAGCAGCGCCAGC







ACGGACCCATGTTATCAGCAGCATAAAACTGTTCTGGGACTAG







GGCCTGTGCGGAAAGCAGTTCTCAGAAGCAGCCACAGAAATCT







TTTGTAAACACCTGTGTCAGCCATACCTTATCTCCTTTTGGGC







CTCACTCTGCCACAGCAT





BICF229J53462
62
10
11549691
0.406
CACAGAGAGAGAGAGAGAGGCAGAGACACAGGCAGACGGAGAA







GCATGCTCCATGCAGGGAGCCCGATGTGGGACTCAATCCCGGG







ACTCCAGGATCACGCCCTGGGCCGAAGGAAGGNGCTAAACTGC







TGAGCCACCCAGGGATCCCCTACTTCTCTATTCTTTATGCCTT







ATGTGCCTTGTCCTAAATGTAGTCGATA[T/C]ATAAAAACAT







AAGAATCATATTTATTGAGGAAATACATCTTGAAGGAAAAAAT







CTGAAGAATAAATTAACCATGATTGCCAAGTATGTGAGCTGAT







ATTAAAATATAAGAATGTCCCACTCTTCTCCCCTTCTGGGTTT







AAAAAGATGCCCACGAGGGGGTTCCTGGGTGGGGCAGTCAGTT







GAGCGTCTGACTCTTAGT





BICF231J57981
63
10
11566268
0.149
GATTCTCAGAGTTAGGTAGGGTCACCCTTGAGCAATTACTCCT







CAGATCTAGGAAACCTGGCCTCAAGGGGGAAGGGAGGTTGAAA







AGCATCAAAATTACTTGTGAAATATGGAATGCAGATGGGTAAC







GTCCAGACAGGCCAAGGCAACAACAGAGGTCCACACCCATAGT







CTGAGGTTAGCCATCTGTGAAACTGGGC[G/A]GTCTGCGCTC







TGCCGGAAGTATGGGATATGACCCTCTGCCTGGCTCGCTGCTC







CTGTGGCCTCGTCCTCTTCATTCGGGACTCCACTCATGCTAAG







CCAGCCACGCCTCTGCCCTCATACCAGGTACAGCCCGTGGGAC







ATACTTCAGGTCAGTGACCACTGCCCCACTGTGCATCCGGTTT







ACAGTAGTTGGGTTCTCA





BICF233J47253
64
10
11633608
0.059
AGGGAAAAGTCACCCCAAGCTTCAGAAAGACCCGAGTTCTGGG







CTTGGCTCTGCCACAAGCCTGTTTATGGCTGTGCTTATTGCCC







TGGGCAGTCATAAAAATAAGTGACTTTATGGAGTGCCTGTTAC







CTGCCCAGTACTCCCCTTAATGTTTTGTGTCTATTCATTTAGG







TGATCATCACAATGGTAGCAGGTATGGT[C/T]TTCACACCAC







CTAATAGTTGAGACAGCAATGCCCAAGGTCAGGCAGCTTGGAA







GTGGAGGACCTGGGATTTGGATCCAAGTGCTCTGGCTCCAGAG







CACAAGTTCCTAATGACTACCTTTCACCTGTACGAGTCTTCTC







TTCATAACGAAGGAGGGTTATAAACCTCACAGATTTATCATGT







CAGTGAAATGAGATATGG





BICF229J21022
65
10
11773283
0.168
TCATTTATCACAGTTCAGGGGGGCTAGAAAGAGGAGGCCTCAA







GTAGGAAGAGCAAGGAAATGAGCAGAGTAGGAGTTTCTGAGAA







AGCTGAGTGGCTGCAGCCAGTAAAGCAGAAAGGTATTGAGGAG







TCAAACTCAGGAGTCAGGAGGATACTGGGGTGCACACATAACA







CCCTAAAGAGCTGCTCTTCATGTTCTGC[G/T]TTGTTAGTCA







ACAGTCAGTAAACATTTATTAAACTCTCGCTGGGCTGGGTGCT







GAGCAAGGATGTATAGGGTGGAATTCTATCCTCAGGGAGCTCA







GTCTTAGGGAGCAAACAATCAAGTAAAGAAAAGATTAACCTGT







GGAGTGAGAAGTCCTTGAGCAGAGATTTGTGTAAAAGGAGGCC







ACAGGAGCATAAACAAGA





BICF236J4590
66
10
11785152
0.248
GGGAGCACTGGTGCTGGGTGGCCCTTCGAGAACTTGCCAAAGT







GGAGCAGGCGCTTGGGCTTCTGCACCCCTGCGTCTGACTAGCC







CTGCGTGTGGGGCTGTGACTATGACTTGGACGAGGCAGTTCCC







TCCTACCAAGGGCAGGTTCCAGGTAGGGGTTAGTGCGCTGTCA







ACAGCCACCCCTCCCTGGCTGGGGACCT[G/C]TGGTGTCCCT







GGGTCCTGAAGGAGGGATCTGAGCCACACCCCTGCGGCCACTG







CCCTCACGGGGACAGATTGACTGCCGTGTGCTCTGCAAAACGG







AACCAAATCATTTTTCTTTTTCATCTACTTTCCTCCCCACCTT







CTTCTCACCCCTTCCTCTTAGCTCACTATCTCCTGGTGTCTCC







GAGACATCTGATTTAAGA





BICF231J44912
67
10
11920925
0.118
GTAAAATATACCTACCAATGCCACTGAATGCCCTAGAGTAATA







GATCTCTCTTTCTAGTACCTCTGTGCATTTCTGTGCCTGCTAA







CAATTGAGTTATAAGCTTAAAAAGTTAAAGTCCATTTTGTTTA







TTCCCAAGCCTGTTTACATCAATTTTACTAGTTTTTGTAACTA







TGTGATTTAATGAATCTTACATAACTTA[C/T]GAAATGTGAG







TACAGTGGTTCTACGAAGAAAACAAAATCAAAGGCTTTGGGAA







GATTAAATAAGGAGAGTCGTTAGGCAAAAATCTCTTAATTTTG







TAGAGAAAAAAACTTGAAAACATTGGGCAAATAAAAATCTAAA







TGGAGTAAACATTCTGATTACTTTAAATTCTCACTGTATTAAA







ACAAAAACAAAACTATAT





BICF235J3897
68
10
11954383
0.141
AATCCACCACAAACCCTGCAGGTTTATATGGAAGCCTCAAAAC







CTATATGGAAGCCAAAGTAACCACTGTGAAAAATGCGTCCCTA







TAAGAACACTGTCAACAGATTTTCACACATGACAATTTAGATT







CAAGGCATAATAACTAATAAACTGTTTAAACGGCTAATAGTTC







AATAGTGTTTACCATTTACCAGGAGTTC[C/T]AGTATATATA







TGTTAATATGGAATATATTTCATTGGGTAATTAGGAAAATGAT







ATGTATACAATATATACATGTATGTATGTGCAGCTGACATATA







ATTTTATTATAATATGCATATTTTGTATGTATGTATATATATA







ATTATTGGAGTTCGATTTGCCAATATATAGTATAACACCCAGT







GCTCATCCTGTCAAGTGA





BICF235J18100
69
10
11995316
0.319
ATGATAATATCCAAAGTTGACAAGCTCTGGTGGGATAGGTACT







CTTAAAACATTGCTGGTAGCAATATAAAATAGTATACCCTTTC







TGGAGGACAAGTTAGAATGATGTGCCAAATGTTTAATTAAGAA







CTACATACAGTCAAAGATTTGGGTAAAAGGATATTCATTCCAG







TTATTTGTAATAGCAGTTTATTGGGAAA[T/C]GGTTATCCAA







ACACAGCATTTATTTTTGAGCAGAGCACAAAGTAGGAATTTCA







TCGAATTTTCCCCGGGTGGCTAACTAACTGACCAAGCATCACG







TACTGACTAGGTCAACCTTTCTCCACTTATATAAGACTACACC







TCTCTTTCCACATCCACAGTAGTTTGTTTATAGGTATTTGATT







CTCTTCCATTGATCTCTT





BICF229J8324
70
10
11999323
0.313
AAATCTCCTATTTTCCTCCGTCTCTGTGTAGTAAAAGACAAAG







TACCAGCCCTTAAGTCAATAACACTATTTTTTGTGATTTGGAG







GGGAAAACACTTTGCTGCCAAAGGGTAATCCTCATAGAAGAAC







TGAGAGCTAGGTGAAACACTCTGACAGATGAGGACCTTATTAT







AGAAGCTATTGACACAAGAACTCTAAAT[T/C]GCTGAAAAGT







ACCACTCACCTCTCCCTGAACCCAACAGAGTAATGATATAGTA







TTGATGAAACTGTGTCTGCTGCCAGAGGACATAACAGTCCATT







CTTAATCCAAAGTGGTTTATCAGGGATGGACAATGACATGACC







TAGGGCTTAGAAAGCACCATCTCAGGTCTCTGACATACCCATC







ACTGAGACATCATATTCT





BICF232J58180
71
11
71402215
0.338
CAGACCGACACAGAATGAAGGAGGGTTCAGGGCAAAATGATGC







TCCTAGCCTCCGCCTAGCCACGTTGTAGCCATCCAGTCTTGGG







CAAGTCTCATAATCTCCTTGAGCCTCCATTTCCACATCTAGGG







AAAGGGAATAATAATAATATCTGACCGCCTGGATCACGTGATC







GCCTTGAGGGTCACATAAAATAATATCC[C/T]AGCAAGAGTT







CTGGGAAGAGTTAACCAGCACACAGATGAAAGAGGGCTTTGTT







ATTTGTTCAGGAACTTTGTTCATTCTTTTCCAGTAATCGTGTA







AGAGAAATTGCTTGGAATTTTATAATCAGAATATCAGAGTTTA







TTTAACGTGACTAATATTCATTAAAGCAATAACAGGAGTCAGG







CCTGGTATAGTGGAAAAG





BICF234J44301
72
12
75211352
0.253
CGCCTGCGAGCCACGCCCCGGTCACTCAGGAGGGGCCCCTGGG







AAGCGGGGGCTGCCCTGGGACCCGAGGCCTCTGCGGCCTGCAC







GGATCGGCCGAAGCCTGACTGGGCTGGGACCGGCCGGATCAGC







CGGCGCTCTGGTCACCCAACACTCGACAGCTGCTCTCCTGGGC







ACTGGCGTCTGCCTTTGATCCGCGCGAC[A/T]GTAAAACCGA







TCAAAGCGGAAGTGCACACAGGCTCCGTGCAGAAAATGAGAGG







GGCCCTCGGAGAGGAAAAGCTGGAGCATCGCGTGGTTGAGGGG







CCTCGCACGGCTAAGGGGCGGCTCGTTGTGTACGACACCACAC







GCTCGCCAGCAAAGCACCCGGTGCTCCAGGCAAAGGTGAGAGG







AAAGTCGCGACTCCCGTG





BICF231J47571
73
13
17833004
0.243
TGTTGAGATTCCATTTCTTTTAATGGCCATCAAGTGGCAGTAT







TTTGTATACTCAAGAGGATAAATGTGAAAGAAGGTGACCTATG







GTTGTAACTTTGTAAAAATCACAGGACACACTTCCTAAAACTG







AGAAATTGAAAGTTTGAGCCAGATCTATCCGATCACCAAGGAC







AGGAAAATCCTCATATAATACCTGAGTA[C/T]CACATGTAGC







CCACATCTCCCAAGTTTTAGCTAAGTTATATTCAGTCAGATTG







CTTGACCCCAATATCCTAAATAATATAAAATGATAAATTTTTA







CAGATTAATAATTAAGATAATCTCTTATATGTCCTCTTAAAGC







CCTTTTGTTTATTATTAACCCATATGCAGGACCATAGCATTTA







TAAAGTAGAAAACTGAAA





BICF231J12866
74
13
18853457
0.269
ACCTTCTATGAACCTAGCACCTTAATATTGTTTGTGTTAATAA







TGGTTGATATTTATGGTGGACCAATAGCTCTGGAAAAGTTCCA







GGGCTAAGAACTATCCATGAACTATTACATTTTATCCTCACAA







CCCCAAGATATGGGGCAGAGAGTTGGAGACTGGCGCCAACATC







ATACACAGTTGACAAAGCAGCTGAACTG[G/A]GATTTGAACA







CAGAATGTTCAGCTTGAGGACTTGCTGTTTTGTGATTTAGTGA







CCAAAGCAACCTTGGTACATAGAAATCATTTCTTTAATTTTAT







GAATGTAGGAACAATAGCACAGAGAGGTTAAGTAAGTCTTTCA







AAGCCACATAGCCAACAAGTGGCAAAATGAAAGCCAGCTCAGA







TTTGTCCCATATCAAAGG





BICF232J26139
75
13
19113938
0.170
CAGCTCTATTTCCTTATAGGAGTATATATACATATATAAATAT







GAAATAGGATGGACACAAATGAGCTCTCCAGAGAGGCACCAAC







ATCTTCTTTGAGAGTGCAGAGGACGAGGAGATTAACTTTGCCT







GTGATTAGGGTGCAGAGACATAGGTACAAGGTCATCAGGCAGG







GACGGGGGGCTGGAATGGGAGCCTCCAC[A/G]TGGTAAGAAA







ATCCTGCTTAAAAACCAGGAAGTAGGAAAATAGTAAGTAATCT







GGAACATTTCTCCAGTCTTGCCTCCATCTCTCGTTAAGCCAAT







CTACCCTCCCCCAGAAAACTCCTTCAGGAGACCTGAGCTTAGA







TTTCACATTTGCCAGGAAACCTCTGCTATGGACTGAATTGCAT







CCCCTCAAAAGGCCTATG





BICF233J3303
76
13
19168116
0.339
TTTATGAATTCTGACCAATAATTTCTTCTAAATGCCAAGATGA







AGATAAGGAAGGGAGGGGGCTATCTTTTAAGACAAGTAAGAGC







TCTGAAAACAGGAAATCAGGAGGGGTTTTTTTTTTTTTTTTGA







GGTCTTTATGTGTCTCTAAAAAGTCTTTTATGAAATAAACTGG







ACTCTTTACAGAAAATAACATGTACATC[T/C]TGTACAACCA







AATCATGAAACACAACCAAGAGATCTTATTTCTTTGAGGTCAT







GAAATTTAAAATGTATATACATTTATGCCCTTGGTCATGAAAA







CACATGCAGGTAACTGGATGACAGAGAGAGCAACTAAGAAGTT







AACTATATGTCATCTGAGATCTGTTTATACAAAGTGAATTCAC







CTGAATGAGACAAAGGCT





BICF236J9894
77
14
38955880
0.363
CACAACAAAGAAAACACAATTTGACCAAGTTTTCTACCACTAG







ATAGCAAGGATAACCTTTGCTCCCTTTTCTGATAAATGTCCCT







CATTTCCTTTTGAGGTCTTGCCAGAAGCACCTTTAATGTCCAT







TTTCCTAACAGTTTTCTATACATGGCAATATATGTATCCACGA







AGACCACAGATGCTTTCTGCACCGTGCC[A/G]CTCATGTCCT







GGTGAGTTTCTCACCAGAATCTACACTTTGGTTATAATGAACT







TCACAGTTCATTTAGCCTCTAACCATTACCCAGTTCCACAGCC







ATTCCCTCTGTTTTAGGTATTTGGTAGAGCATCATCCTACTTC







CGGGTAGCAAAATCTGTATTAGTCTCCCAGGGCTGTCTTAACA







AGTAACACAAAATTAGTG





BICF229J41242
78
15
36683521
0.366
TCCTTAAACCATTATTCATCAGCATTTGCTTAATTTTCTCAGT







GTCCAGCAATCAAGCCTAATCTTCAAAGATAAAGATTTACTAC







CACAAATTTTTTAAAAAATAATGGCTCACAAGGCTTAAGAATA







TTTCTTAAATGTTCTAAAGCAATGCAGGTTTCAAATGTGACTT







AATTTTGAATAACTGATAGATTATCTAA[T/C]AAAACTACAG







CTTTGTTTCATTCACCATTGTCATTTCTTAAGTTACTTACTCA







GAAAACTTCAGCTAAAAACTTTAAAAGGGAATAAAGTATTAAT







ACATGCTATAATGTGAACCTTGGAAGACATGCTAATTGAAAGA







AGCCAGATACAAAAAGCCCTGTATTATATGATTCCATTTATAT







GAAAAGTCCAAAATAGAC





BICF230J149
79
15
44212792
0.597
GATATGCAATGGATCCTGCAGATGGTATGTGGCTCATCAAAAC







CCCCACCAATGGCTATTATTAAGGATAATCAAACTATCAAACT







ACTAATCAAATCAAATAACAGATCTAAAAAATATGCTAATCTG







AGTTTGCCTTCTTTTCGCTTTAACAGGCATACTAGCCTAGAAA







AAGAAGACTTAAGAATCCTAATGATGCT[T/C]ACACTTGGAA







AATGCTCAAGAATGAAATAGTAACAAACTAAAAGATGAAGTAT







TTAATCTCTAGTCTTTCTCTCAAGCCAAATTAATCCCAGATTT







ACAGAAGAAAAAAACTCAGGTTATATTTATTCTATATATCCCT







CTCCAAATTCATGAGGGCTGTAAGTCAAAGGACTGAAGTCAGA







TTGGTCAAGCGCATGGAT





BICFPJ509072
80
15
44226324
0.525
GAGTGGAAAGGAGAATCTACAATGTGGAGTGACCAGAGGGATG







GAGCTCTGAAGTGAGGACGGTGTCTTTCCGACCTGTGAGCCAG







GATATTGGCCGTAATTCCTATTTGGCCTCCCCACAGATAGAAG







ACACTTCAATGTCTTGAGGATGGTGGATCCTTCCCCGCCAGCT







AGCTTACCTTCTGAGCCTTGGGCATGTC[A/G]GTGTGGCGCT







GGGCACGGACCGAGCGGGCAGACTTGGCAGGCTTGAGGGGTGC







ACAGTACATCTCTAGCCTCCTCAGATCACAGCTCCGGAAGCAG







CACTCATCCACGATGCCTGTCTGAGGTGCCCTCCGACTGCTGG







AGCCGTACCCTGTGGGCTTGTCTGTGCAAATCAAACACAAGGT







GGCCTGGCTTTAGAGGGG





BICFPJ509073
81
15
44226684
0.636
TCTGTGCAAATCAAACACAAGGTGGCCTGGCTTTAGAGGGGAA







TATGCTACCTCCACAGTTCCACCCAACCCCGGAAGCCTCTCTT







CTCCCCACAGCTGGTCAACCTACACATCCCAATCTCTTCCTTG







AAGCTTCCCCAACAATTCCTGGCCCCACAAATCCTTCTGGATC







CCAGTGTGGCTCTGGACTCTGCCCCACA[T/C]GCCTTAGCAC







TAACTTGTGCACTGGGGATTTTGTGTTAATTCTTGAGTTCTCA







GAAACCTGTTGTCAAGGATCTCGTCTACACCCAAACTGAGTCT







TTCCTTATATATTCGCTTAGTTAATTAAATGAAGATCATGTGT







TAAGTTTTTTTCATCTTTTTAGCAAATCTGCCTAGTTTTTATG







CTCAGGATGGCTCAAGTG





BICFPJ1100923
82
15
44228468
0.664
GGGCAGTAATTCAGTGAGGGTTCATTTGTGGTCTCTTTGGGTG







GGGTCTACTTTCTCGCTTAGGGGCAAACCCTGTGGGTGCCTCA







TAGTTGAGGGGTTTGGGAGGGACTAGGCAGCTGGCCCCAATTG







AAGACTTAGTAGTGTTTTCACTTGCCATTGGAGGATCCACGTG







CAGAAGACTCTCGTTCTGTTCGCCAGCC[G/A]GGCCCTGGCA







AGCTGAGACTTGGCCAGTCCCTTGGGCAATGTAAACAATGTTT







TTTTGTTTTTATAGTCTTTTCCTGGGACTATAAATTAGAGGAA







AGGCACAAATAGGCTTACATGGTTCTTTGTAATCCACAAAGGA







CTTCTACATACTTTTTGCTAAGTGGTTATTTCAAGAGGTTGAA







GGGGTCAGCCAGTTACGA





BICFPJ1235295
83
15
44260949
0.632
CCTGCTAGGGCAAACAAGAATGAGAGTGCCTTTTTGGGTGCTT







GGTAAGTCTGGAAATGGATACCTTTGAATGCAGTGTGCAGAGT







TCTTTCTGTTTCATTTTTTTCCCAGCATATTTGTGCTCTTCCT







GGCACTGGCTCAACTCACTGTTTCAGCTGGACCCCCAGGAATA







GACTGACTCCTGTAATTCTGACAAAGTC[G/A]AGCATACTAA







AGGGTTTGCTGATGCTCCTTGTGAGACATGCAATAGGGATTTA







ATCGGAAGATCACAGCCGGCTTCCAACAAAAAGGATCATGTTC







TGTTTACCTAAATTCCCTCAGTAGTTTCCTTTAGATACAGCAT







TTCTCACTCTCTACTTGAAAATGCTTAGAAGTCCATGGGGACC







TTCCGACTCAGTCATGTC





BICFPJ401056
84
15
44263980
0.635
CAAGGAAAAGAAGTTATAAACTGGCCCTCTCTAACTTGTACCT







GCCTTGCTGTAGGTTGAGGTCTTTCTGAACAATCGTGTCCTTT







AGATATCTGGACCTTCATTAACAGGTTCAGGCTTGGGAACTTG







CCAAATTCCAGAAAGGGTCTAGTGAAGGCATTCAACTGGGGAG







CCAGCTGCCTCTTTGGAAAGTGGTTTTA[G/A]TTTACCCTTC







ATCTTCCAATAAGAGACAGAATCCCAATTTTCTTAGCTCAAAA







CCATTTCTTTTAGATTCNAATAGCAAACCTAATGGAACTAATC







AACTCAGAGTCCTAAGAAATAATATTAGAAACTGGCTAAGCAT







GACAAGGGAAGCAATTTGATATGAGTAAAACACACATTTGTCC







ACTCAATGCAATTAGAAA





BICFPJ401057
85
15
44264051
0.540
ACAATCGTGTCCTTTAGATATCTGGACCTTCATTAACAGGTTC







AGGCTTGGGAACTTGCCAAATTCCAGAAAGGGTCTAGTGAAGG







CATTCAACTGGGGAGCCAGCTGCCTCTTTGGAAAGTGGTTTTA







NTTTACCCTTCATCTTCCAATAAGAGACAGAATCCCAATTTTC







TTAGCTCAAAACCATTTCTTTTAGATTC[C/T]AATAGCAAAC







CTAATGGAACTAATCAACTCAGAGTCCTAAGAAATAATATTAG







AAACTGGCTAAGCATGACAAGGGAAGCAATTTGATATGAGTAA







AACACACATTTGTCCACTCAATGCAATTAGAAATATTTTTTTT







AAAGGACTCTTGCTTGGCTGTTCTTATAAAATGTAACTATTGA







AAAAGAAGCTGGCAAGAT





BICF231J34186
86
15
44279290
0.530
CTGAAAGCTAGATTTCTGTGGCTCGGGGCCCCATCCTCTCTCA







AAGTAGTAGAAACAATGCCAAGGTGATCACTGACTCTCCCACA







TCGCTACTTATGGCACAAAGACGGGGTTTCTTCTAGGAAGCCT







CCCAAGGCGAGTGGCTGCAGTGGCCTTGGAAGAGTCACCGGCA







AGATACAAGTGAGGGGCTCTATCCAAAG[A/G]GCCTTGGAGT







TACACTGAAAGCGGGCACTGTAAGGAGAGGCTTCTCCAGTTCT







TGGGGTCATGGCCAGCCCGTCCAGCCCCCATCCCTTTTGGAGA







AACAGACTCAGTCATTTGCCTTCCTTGCCCTCCATGAACTCTC







AACTTAATGGCTCTTTACCTCTGTGGCAGAGTTTGCTCCATCG







TTTTAATTAAGATTTCTA





BICF232J62306
87
15
44281633
0.620
GTCTAAGTAGGGAGACAAAAACCTTAGCTGCCGGACCACATAG







TATATTACAGACACGGTACAGGTCGTATGTGTTTATATATGGA







CTTCGCTGCTTGTAAAAGCAAGGCAGCCAGCATGTGTCTCGGC







ACCTGGAGGGCAGCAGAATCAGGTGCCTGACAAACATGTATTC







TTAACTCCTAAGCCACTTATTTGACTAA[T/G]AAAGTCCTAG







TGGTGGAGTATTTGAAAGCAGCGAATGAATCCTGTACTGAGTG







GCAGGGAGGTTTTATGAATCGAGCTCTTAGGCAATTGCATCGA







GGAGCCAGTGACAGCGGCTTAAAAATGCTTGCAAATTGGAAAA







ACACAAGTCTTGAAGGATATAATTTTGCTGAGAATGGAAACTT







GAACTAGGGCCAGTGATT





BICFPJ1434769
88
15
44282162
0.534
CAGAGGAAAAAAATTGGACATGCAGTTGCTAGAAAGTTTCCGC







TTTCCGTATATACAGCATGCATTTGTTAAATTATCCAAATATC







TTGCAGGGACAGCAAGAGTTGTCAAGTTCTTTTTCAGAAGAAC







AAATTAAACTTCGCTCTAACTTTGACTCCCTGAGCATAAGTGC







AGAGAGTTCTGGGACCTTAGCTCCAGAG[T/A]TCATATTTAA







AAGCTTTAATATCATCTCAAAGGTAACTCTTCATATGTGGCTT







NCCTTATAATAAAAAGTCCTGACAAGTTACACACACACACACA







TGCACGCACAGACACAGACACACACACAAACATGGTCTTAAAA







ATAAAACATCTGCACCTGCAAAAAAAAAATTTGAAAGTTCATA







TCCACGCTCTATAGCCCT





BICFPJ1072107
89
15
44349363
0.332
TTTCCAGGTCCTTTTCCCACTGCTGCGCATTAACGGTCTCTCT







ATTCTTTCTCTTCACTCCAGCTGCTTACAGCTGTCAGCCACCC







ACTCCAGCTCTACCTTTCTAGTGTTTTTATCTTCCAGCATTAT







AAGATTTAATTTATAACAACAAAAATGTTCCCCAAGTCTCCTT







TCTTGATGAATTTCTGGTTTCCTTCACC[A/G]TTTAGGATTC







TCCCCATTTCTCTGGGCACCTGTTCCTACAGATGTCCTTTTAA







AATGTTGCCTTTTCTCTGCCCACGTTCATGTTCCTAGTGGACA







TGGGAAAGAGCACAAAAGCTTTGGAAGGTGATTTGTACTCAGG







AGNTAGGGTGGTGCTAGGTGGAGAGTATGGTGGTTTGGANGCT







TGGTTCTGAATCATTGTG





BICFPJ1072108
90
15
44349505
0.455
TAACAACAAAAATGTTCCCCAAGTCTCCTTTCTTGATGAATTT







CTGGTTTCCTTCACCNTTTAGGATTCTCCCCATTTCTCTGGGC







ACCTGTTCCTACAGATGTCCTTTTAAAATGTTGCCTTTTCTCT







GCCCACGTTCATGTTCCTAGTGGACATGGGAAAGAGCACAAAA







GCTTTGGAAGGTGATTTGTACTCAGGAG[C/T]TAGGGTGGTG







CTAGGTGGAGAGTATGGTGGTTTGGANGCTTGGTTCTGAATCA







TTGTGCGACACATCTCCAGGGACCACTATTCACATAGAAAATC







AAGTGAATGGTGACTTCTGCAGCTCTAAGATAATGAGGCTGAA







TGAGGCTGCCCCCTGAAGTTGTGCAAAGCAATGGCCCTCTCAC







CAATGATCTGTGAGACTT





BICFPJ1296884
91
15
44350759
0.400
AGGCAGGTAATATTTGTAAAGTAGATTCTCTAAGTTTATATTA







CTCTCCAGGCTTATTACAAAGACAGAGAAAGGGAGATTCAGCA







GAATTATTCTTAGAAATGCACTGTATCTATGTGGAGGGANTTC







TTTGATGAATTTCTAGGGCAGGAGGGTGAGATTCCAAGGGTAT







ATTTAAATAGCAGAGGATGTTTGCTAAA[C/T]TGCTAAGACA







AGGAAAGAAGCGTATATGGCTTCTCAGAGGGTTAAGGGTTGAG







AGACCACAAGAAGAAAGGAGAGATGAGACATGAGTGGAGACTG







GGCATAGCTAAGGAGATGGCTACTGGCTCCAACATAGGTAGGG







AGCTTTGGGATAACAGTGTTCAAGAAGAGGTTTTCCCTCTGTT







TGGTATATTTGATCTAAA





BICFG630J367539
92
18
56642845
0.237
AGAAAGATGGCAGGGTTTCCATTGGGATTTAGATGCCTGCACC







GTACTGTGACTACCTATCCAAAAATGACCTACTTCTGCTAACT







CTCTAGAGCCCTGGGGTAGTTGTTTGTTTGTTGTCCAGGGTTT







GAGGTTGCTATCTGCAGGGGGGNCAGTTTGTCAGAACACCTCG







TGCAGGAAACACACTCCATACTTGGAAC[A/G]AAGAGGGATT







TCAGGGTAGGGGTGAGAGTGGGCTCAGCAGTAGCCAATGCTCC







CATCGGGCCACATTCAATGATGAAAAAAGTGTCTATGGAAGTC







CACGTCAGGGATGCTCACGGCTGATGAGGAGGTCATCTCAGAA







GGGGACAGGCAGTGGGCTGGGACAGAGTGTGGCAGGGCAAGCA







GTAAGTATTCTCTCACGT





BICF233J9971
93
20
29901111
0.335
TGGACCTGGAGTGCCCTGAGCTTCACTCACTCTGAATGTAAAA







TGAAGAGGCTTATACCCAAGTCTGAAGTAGCTGTGACTACAGA







CTAATTCAGTTTCTCTCTAAAGACCCTAAAAGATGCTGACCCA







TAGTTGATCCTTATTGAGTGGCAGCTACTGTCATCACTAAGGT







CATTATTTGGGTCTTCAAGATGTTGCAG[G/C]AGAGTTAGTT







GCCTGTGGATAATAACAGGGAATGAGCCCCAGAAGCTATTCCC







TTCCATCTCCAGCATCTTGCCCCTGACCCACGTCTCTTGAATA







TGGCCTGAATACAACAAGGCTGTTTGTTTGTTTGTTTTAAANT







TTTATTTATTTATTCATGAGAGACAGAGAAAGAGAGGCAGAGG







CATAGGCAGAGGGAGAAG





BICF231J2898
94
20
35381149
0.261
AGCTACTTAGTGGCAAGTGGGATTTGAACCAAGAATCTTTGCT







CTTACCTACTACAAGAGCCACATCTCCGACTGTGTTATTTTGT







GGCAAATGGCCCAGAGCTCATGGATTCTGACAAGGAAAGCATG







CTACTTGGCTACTTTTAAAGCATCTTCTACTACCTTTGCCTAA







AAGCCACACTGGCATTTGCTAGATCAGG[G/A]AAGAACGGGA







TTTAGGATTATNGTATTTGTGTGTTATTTTGGTGGGGGTGGGA







CAATAGGAAGGAAGAGGGGGCCAAGCCCCGTGGTGACCCACCA







CCCTGGAAACCCCACCCAGTGACCCAAGGTCTTGGGAATACAC







ACCCACCTGCTCTGGGGCACTGGCCTGGCACAGGCAACTTAAA







TCAACATTAGGGTCAGAC





BICF231J25080
95
20
35390983
0.265
GTTAATAAGCTGGAGACCCCTAATGCTCTCACCAGAGTCCTGA







CCTTGGGACCCCAGCCTATATGTGAAACCCACCTCCACACTTC







AGGAACGCCAGCTCTGTGAAAGGGGTTCNAGGTGAACCTGCAT







TCTGTGGTTTCTTGGCATCCTGGGTCCACTGGATGACAGCATG







TCAGGATATTCATGACACTAAAAGTAGT[A/T]AAATAATAAA







TAATAACAGAAGCTTCCATTTCTCAACCATTTTCTCATTTAAT







CATCAGAATAACCCGAGGTGATGTTATTAATAATTTTTTAAGG







ATTTTATTTATGGGGCGCCTTCGTGGCTCAGTCAGTTAAGCGG







CTGCCTTGGGCTCAGGTCATGATCTCAGAATGCTGGGATCGAG







CCCCACATCGGGCTCCCT





BICF235J20169
96
20
35391970
0.455
GTTTCCGAGCAGAGATGGAGAAGCAGGGCTTGTAAAATGAACG







CCGCCTTCCCCGTTGCATCTTTGCTCCAGGGTGGGGGCCGCCT







CGGTTGTAATTTTACACCGATGTCCACACCCTGCTAGGGAGCA







AGAGAGGCGAACTGTAAGTGAGAATATTTGCTCTGCCTCCACC







CCCTGGAGGAAGAGGAGCTGGTTCTCTC[G/A]GCAGCCTGCG







AGCAGAAGTGGGAGGGCTCCCCCCACCCCAGCCCCTGCGGCCA







AGGGCCTGGGGCCATGTGGGTGGGTCCCGAGGAGCAGGTCTTC







CCCCCAAAGAGGTGACAAAGACAATGGCAGTTTGAAGGCGCAG







CCAGCCCTGCCTTGAGGTAAGGTTGGGGGTGCCGGTAAGCAGG







CTGCTCCGAGAAGGCACC





BICF229J60744
97
20
35401421
0.202
GAGGGCTGGAGACCTGGGGGCCTGCTCTGGCACCGGGCAGGAG







ATAAGCAGTTTGAGGAACTGGAAGCACTTCGTGCTGCCAGAGG







ATGGGTCACGGGAGGAGTCATTTGGCATCAAGCCTCAGATCTG







AGTCAGTTTCTGTGTCTTTAAAATGGAGATAACAGTCCCTTCC







TCACAGCCCCGGCTGTGGGGGGATTGGA[G/T]AAGTCAAGAC







GTGTGAGGTATGAGCAAGGTGCGTGGCACAGAGAAAGTGCTCA







GTAAGTGTTGACAGTTAACAAATGTCTTAGTTGGGTTTCCTCA







GAAGCAGACTGAGTCCAGAATACAAATGCAAGACGTTCTTTTG







GGAAATGATCCTGGAAAGCCCTGGCAGAGGGTGGGAGGAGATG







AGACAGGCAAGGAAGGAA





BICF237J62215
98
20
44783441
0.319
TCTCTGGTTAAAGTGCCACCGTGGAGGTTGTGTGTCACACATT







AACTGGTAGCACCCCAGTGCCTAGCAGAGCCAGCCTGCCCTCT







TTGTCAGGCAATCCCCGTGGGGCCCCAAGGGTCAGTTTCTGGT







TAGTTTTAGGTCAGTTTCAGTGGCATTTGAAAGGCTTGGTTGG







GGGCAGGGAGTCCCCTTTGGTGACTCCC[G/A]TCTCTGATGG







GGTCCTTGGAGGAANAACCAGGGTAGTCACTAGAGCTCAGAAC







TGGAGCAGGGTCTGGACTCTGGCCCAGGGGCCCTAAACTGGGC







TCTGCTGCCATGAGTAGGGCTGTGGCCAAGCTCTATAGACCCT







AGGGCCAGGGTGGGCAGCAAACTCAAAAAGAAAAGACAGAGGC







TCAGCTCTCAGCTCTGCT





BICFG630J426502
99
22
9735062
0.268
GTCAGGGGAATTGGCTCTCAATATACAGGGAAATTTCAGAGAA







ACATTAATGAGCTCCCTCTTCGTTGAAAATTAAATCTGTCAAG







GATATGAATCAGGTGTCATGTGAAAGAGCCTGATCAACTCTTT







CAAAGCAATTTCCTATTAGAACTCCAATCCTGGAAGATGCCAT







TTCCCTTGCTCCAAGGTAGTTGAGATCC[C/T]GTTGGCAAGT







TGTTTTGCAATCCTTCCCATGAGAAAGAATACAGTAAAGATGA







CAGCCCAGTTAATTCACATCCAGAAAAATGAAATGTATATTCA







TGGTCATTTCTCTTTTTCTCGGCATTGATCAGTAACCTTGGGA







GAGCATATCAAGCCCTTTTTTCAACACATTTTTCCTCTCCTTC







TTCCTCATGTCGTTTAAT





BICFG630J426600
100
22
9909920
0.275
ACTCACCCAACACAGGCAAATATTCACCTCCGAATCTTCATGT







AAGTTATTTAATCACATGGTCGCTGAGTTTATTCATCTGTAAA







ACAAAGTGGTTGGAATAAATGATCTTTATTGTCTTTCTCTAGA







TCTAAAAGTCTCTGGTTTTACATTACCAGGAATTCATCATACC







TGTCTTAATTAAGGTTTTATATTCATTG[A/C]GGATGCTCCC







TCTTTCTGATCACAACTGTGATGTCCCGATAATCTTGTTTCTT







ACTTAACGGAATTTCTCATCCTGTAGAACACNTTTTTTTTTTT







TTTTAAGTACATTGGACTTTAGGTCAGTTTCAACCCCTATCTA







TGCCTGAGAACTGAGAACTGAAGTATTATAAAGAGAAAAACCC







AAACAGGAAGGGTTGTAT





BICFPJ646763
101
22
9981417
0.352
TGGCTCATTATGGCTAACACTTCAAATGGCTTTGCTGTAACAC







AATCAGCTGTGGTGCATGACTCAGAAATACACTTACAAATAAA







AATGGTTAAACCTGCATAGAGAGTTCCCACTATGCTCAGCTGA







AGACAAGACTGAAAGTTTTAAAGGCTGATCATATAAAAATACT







TCAATTTACTTTGCATACTTAAAAGGCT[A/G]TTCTTATAAC







TATGCTTCATAATTTTATAGTTTATAAAAGAAAAATTCTACTT







TTCTAAACTCTCGATAAGTTTATAACAGTTTTCAAAACATTTG







TATGTGATAGCTAACACCTTGAGTTATATTCNTAAAAGTTATT







TAAGATATAATGAAAATAGGTTTAATCTGCTTAAGATAATATG







CTCTATTTGAGAAAGTAA





BICF237J56401
102
22
10003324
0.393
TGAGCCTACTAAAATCAGTTTTGCCATTTTTATTCTGCTATGT







TCTGTTGTCTATGATTAGGAGGCACAGCAAAAGATGGGTTGGG







AATTGGATTAAGTAAATTCAGTACATTGAACACATTTTAACTT







TTCTCTCTTAATAATTAATTTTTACCTCAGGGTCACATTTTAA







ATAGCTGTTCTCTAAATATGGTAGTATC[T/C]TGCTAAATAT







ATACGTTTCTGAAGATAAAGCTGACCTCAAAACATAGCTGGGA







AGTAGTTTGTTTGGTTTGTTGGTTTATATTAAAACTCAATTCA







GTATCTCAGAGTACAAGTAGAATGTTAATGCTTATTGAACACC







TAAGAAATACTAGACATTTGCTTGGGTACCACAAAGAAATGTA







ACTTGATTTAATTAACTA





BICF233J58345
103
22
10028791
0.355
GGAGAGACCCAGACGGGCGNTCCCGGGGAGGTGGGCGGGTCCT







GCTGAAGAAGGCTTTGTCCCAGAAAGGGGTAGTTTTGGAGGAT







GTTTCGTTCCTACCAGCACACTTTGTTTCTGTTTTGCTGGCGC







TGTCCTTTTGATACGAGACTTTATAATAATAGTGAAATAATGT







CAACCCATGTAACGGGGTTTTTGCTTAT[T/G]CCGGTGTGCT







GTGATATATAGATTTGCAAGTGGCAATGAATTTCAAAAAGTGA







ATTTTAAAAGTTCCTTTAGAGGGATGCGCTAAATAGTTAAGGC







AAAGTGCTTTTGAAATATTGTGAAAGAAGACGGATGAGCATTG







CATAAAATCCAAAGATTGTTCTGGCGTTATTTAGGTCCTTTGC







AGAAATATATATGTATGC





BICFPJ1583908
104
22
10143323
0.294
GGTCATGATCCGGGGTTCTGGGATCGAGTCCTGCATCTGGCTC







CCCACAGGGAGCCTGCTTCTCCCTCTGCCTGTGTCTCTGACTC







TCTCTTTGTGTCTCTCATGAATAAATACATAAAACTTTAAAAC







AAAAACCCAAAATCCTATGTGAAATCATAATCCAGTATTTTTT







CTAAGTGAGATAGTTGACTCTCGTTGCT[T/G]GTCTAGGTAC







TTGCCTAACCTAACCCTACACTCAATGTGCCTTAGGTAGGCTA







CTTTTTGAAAAATCTTTCATCTTTTTATTTTTTTCAAAGCTGA







GTTCAGGAAGTCTTCCTAGGATTCCAGTGTGCACTTGTACCCA







CCTCTGCAACCTGGTTTCCTGGGCTGACCTTTCCATGTTAGAC







CTACAATTCAGTTTAAAA





BICF236J32937
105
22
10227231
0.403
CTCAGAAGACAATCTACTTCTGTTTTTCATCCTGTCTCTTGGT







TAGTGTCATTACCATTCACTTTGCCTCCTAAACCGAAAACCTT







GAAGGTATCCTCAGTTATATTCAATGTGTCCTAAACTGTGTTG







GGTACTAAGAAAACAGGGATAAATTAAGACAAAGTTCCTGCTC







TCAACCAGCTCTCAGATATTAATTTAAA[T/C]GCATTAACAA







AGCACGTGACTGGTACAATAATGAAATATTTATGGGAGAGGTA







AAGAGCTAACCTAAAGAAGGGAATGACTAGCTTTTTCCAGGGA







GTGGTGTAGGATAATTTGCAAACTGGGTTTCTGGAGCCAAGCT







GGAGTTTGTCAAGCAGGCAAACAAGAAAAAAACAAAACAAAAC







AAAACAAAAAAACAAAAA





BICF233J26298
106
22
10280714
0.366
AAAATGTCAGGGAGTCTGTTTAGCATGTACATAATCTCAAAAA







TGTAAGCTGCACAAGATGGGGAAGCTAATATTCTATGTTTGGG







GCTGCTTTGCTTGTTTGAGGAATTTAATTATGACTGTGGCTTT







CTGAGCATGTAAGAAATTCTTACCCTAAGACCGTAGAACATTT







TGCAACTATCCTGTTTGATTTACTGGGC[C/T]ACTTAGAGAG







ACAAGTTAGATAACACACCAGCTTGACTTTTCTAGAAGCAGGA







ACTGAAGGACCAAAGTATTGCATTTGCATTGTTCCTTATGTTC







TTCTTGTGTGAAAGAATTTTGTTTTCCCCAGATGGCTTCAATG







GTCATTGCAGAGCAGGGAAACTTGATCCTTCTTCAGTAGTCAG







GAGAGCAGAGCTCTCAAA





BICF230J60654
107
22
10285559
0.395
AAGAAAATTTCGGATGATGTCCTCTTGGATCTCTTAGAACAAA







CCTGGAAGATTATTAAAGTCTGAAGCAACAGCTGCTGCTGACA







GTTTTCCCTTACAAGGAGGTGAAATGATGCACACCAATGGTCT







AACATTTCCCAAAGCGCTATAAAAGGACAAACAAGAATCAGTG







AAATGGAATATTGCTTTATGCTTTATTT[T/C]GGGAATTTTA







GAATTTGTTTGGAAAACTACTTATTTTTTCCCCCAGAAGAACC







CCTTGACCTAATTTAGAAGTTATAATGGANAGGTCTTACTCAC







AGCACAGAGTACGGTTCAGAAATAGTGTGTTGNTTTGTTTGGG







GCCTTTCCTTTTAAATGCTCAAGAAAAATTCAGGTCGATCTGA







TATATTTACATTGAAGTA





BICF233J61494
108
22
10301664
0.424
CCTGAAGCCTTCGGGCTATGACAGACCCGGGTTCTAAGCCTGA







CCTCAGGAGTCTGTACTTGAGTTTTCTCACCTGTGAAATGGAG







CCAGGACACTCCAATTGCAGCCCTGTGGTTCAGATTAAAAAGC







CTGTATGTATATATATATACACACACACACACAGCCCCTTGCA







CAGCCTCTTGCTGTGCTAACAGGCACTC[G/C]GTGAATCCAG







CATCGCTGCCTCTGTGCTCATTCTCCACGCACAGGTCTTCAGT







CCCACCCTGCTTAGATGATGGGAAAAGTAGAGTTAGTAAAATA







CTCTTTTATTCTTCAAAGGCTTTTAGATACAGGCCTGAGAGAA







TACTGTCCCACCCTGCCATTTATGAATAAGATTTTGAGAGTAT







AAGGTAGCAGAAACTAAT





BICF233J61597
109
22
10305141
0.462
TTGAGATTCTCTCTCTCCCTCTCCCTTTGCCCCTCCCCCCATT







CACTCGCTCTCGCGCTCTCTCTAAAATAAATAAAATCTTAAAA







TAAAGAAAGCACATCCTAGAAATATATTGTAATATGTAATATG







TAGAGCTCTCTTTCTCAAATTTTCTTTTAAAAGGCTCTGATTT







CTTGAGACATTTACCGTAATAGAGGGAC[A/C]TTTCCATAGA







AAAATAAATTCTCATTCACTANGATTTTTTTTAATTTAGCATA







AGAAATCATTGAATTCCCTACTACAGAGGTTACTTATTAACGA







AATGAGAATTCATCACTTACAGATATAATTCTAAGTAGGAGTA







TCTGGGTTGTTATAATAGATGATACTTAATAAATATCTGCCTT







AGCTTCTATAAAATACAC





BICF231J52887
110
25
18195511
0.229
TAAATCCAAATAAAATACAAAAAGTGCTTTGTGAGCTCTTAAC







CTGCAATGCAAACATAGCATGTTACTCTATTTTATCAGCGAGT







GCGTGGCTGATGTTTTTGTATTTAATTCTAGTAAATTACAGGA







TTTCCAGAGCATTACCTGGTCACAACTCCTCATTTGCAAAGGG







CTAAATGAGACCCACGAGTGACTTGTCT[A/G]AGGACACACG







GCTAGTGATAAACAGAACCGGTCTTCTGTTTGCCATGCCTCCT







TCCTAAAATTAATCTTTGCAACTTCATGAGAGTGGAAACTGCA







CCTGCTGTTCTTTTGCACCACCAGCCTGAGCAACTGTGCTNTA







TGTACTCTGCAGCATTATTCAAACCTGAGGTGGATGATGGTCC







CTATCTCTTTAAAAAGAA





BICF235J29129
111
25
39552390
0.412
CCACTCATTATGTTCCCTGCAGTATGGAAGTTCTGTGGCCAAG







GTTCATATAACTGAGAGTGTATTTATGGCGGTCCATACTCTTT







CTTAGGAAAATATTGATTTTCTAACAGCAGAATGACTGTAGAG







CCGTTAAATCAGACTAGACTATCATAAACTCCAGGATTAACCA







AAGAGTACTTTCACCTTTTCTTTTAGTT[A/T]CTCATGAGCC







ATCGGGAGTAGATACATCCACTTAAGCAGGACAGGATCACAGC







ATTTATTACTTGATTTGAACAAACCACCACTATTCCCCACCCT







TATTGCCGGATAAGTAATTAAACATTCTGCTCTTATTTTAAAG







ATTGACTGACAGGAATGAAAGAGGCCAAGTTGTATTTAAAAAA







AAAAAATACAAAGGCTTC





BICF230J63373
112
26
13241060
0.377
GGGTCTCCAGGATCACGCCCTGGGCTGCAGGCGGCGCTAAACC







ACCAGGGCTACCCTAAGGCAACTACTTGTGTTGTATGCTCACT







AAAGATGGATCTAATTTTGGGTATGGCTACATCCAGAAGCTCT







AAAAAAGTTACTAAAGATTTATCTCTTGAGTCGGTGTTGGCTT







TATTTAGGCTGTTTCTTTCTTCATGGTG[A/G]CAAGATGGCT







ACCAGTATATCTCCAGGCTTAACTCCTGCCCCCTAGGCAGCTC







CTATGAAGAGAGAGTACCTCTTTCCTAACAGTTCTTATAAAAA







TTAAGGGATTGGTTTTAATTAGAGCACTATAGGTCATATGNGC







ATTGCTGAGCCAATCCTTATGGCCAGCGGATGGAATTGGTCAT







TGGTCAGGCCTGGGCCAG





BICF234J24531
113
27
17672045
0.315
GAGTACAGGCTTGGACCAGAATATATAGGTATTTTTAGTATTT







GAATTTTATCACAAACACAGTGAGAAAAAGCATGGTTTTTGTT







CAGAGAGGTTCTTTCACTTCTGTGTGCAGAATAATTGTGGGTA







GTTAACAGAAAGATTAGTAAATTAATTGCTGTTGAAATAATCT







GGTTCAGAGAAGATGGTAGTTTGGACTA[C/A]GAAAATGAAG







AGGAGTAAGCTGATTAAAATATGTTTTTAAGATTCATTTCACA







AGGATTAATCAAGGCTGATAGTCTTGATTAAAAAGGATTTCAA







GGAAGANCTTCAGATCTCTTGTNCAAGTAACTGAATGAATGGA







TGTATCATTTTCTGACAAGGGGAACATCATCCATTTCTGGGCT







TTCCATAAGTTAACAATG





BICF245J13607
114
27
22519619
0.389
TGCATATTAAAACAAATGCAGGTACAAATCTACTAAATAGATC







CACATCTACTAGAAGGTACAGAAACATCTACTGAAATGCCTAA







AATTAAAAAGACCTAAAATATCAACTGATGGCAAAGATAATTG







TTGGCATCTGGAACTTTCAAATGCTGCTGCTGAGAATACAAAA







TGGTACAGTGACTTAGGAAAACAGGTTG[A/G]TAGTAGTATA







TAAAATTAAACATATGATTTGTTACATAACCCAGGAATCCTAC







TCCTAACTATTTACTTCTGGAGAAATGAAGATATATGTCCACA







TAAAAACCTATCAAAGAATGTTCATGGTAGTCTTATTCATAAT







AGTAAAAAAAAAAAAAAAAATTAAATAAAGAACAAAAAAAAAA







CTGGAATGTCTATCAGCT





BICF233J31513
115
29
30317809
0.337
ATATGAACCAGACTCAGATATTTGAAATCTGTATGCATAAAAT







CTGTTCATGTAGCACAACTTTTTAATTTTTGTTCAAAGCTCTA







AACCAAAGTGGTGAAACACCATTACTCAGAAATCCTGGGGTGG







CGGTAGAGATGAGGAGTTGGGTGTGAAGACTGGAAGACAGGAA







GAGAGAAATGGGAGGTCATTTAGGAGAT[C/T]TGGGCTTATC







TCATTGCTAAAGACGTCTGCTTTCTACCTGAGGCAGCAGAATT







GCAGAACAATTAATCTTTCTCTTACTGACAGATAATCTTTTGT







AATTATGGCCGCTGGATCAAGCAAATTACTCCCAACAAATATT







GATGAATATTTTCTATGTGTTGGACACTGTTGGGCACAGAAGA







TACAAAAATGAGTAAAAA





BICFG630J590374
116
29
31447893
0.283
AAGAAACTGAAAAATCTTTGAAATCAGGAACCTGTGCAGGTTT







TTATTAGTTACCATATATCTTGGTCCTTTGGTCCCTTGCTGCA







TTATGCCTACTGTTCCAGTGTAATTATTAATAGTATTTCTCTC







ACTCTCAAAAGCATCCTTGTTTGGATGATAAATTATAGTCACT







CTAGTTATTATTAACTTCCCCAAACACC[A/G]CGATAGTACT







TAGTGTAGCTGAGATAGCTTTCTGGACTTTCAGAGAAAAGTTG







GGCTTTCAAAATTAGAATATTCACAATTAGAATTAAAATAGAG







TAGGAGACTTAAAGAATAGTTATTGCAATTTATTACAGGAAGA







TAATAATAATAAATGTACTTCTAATGAAAACATATATCACAAT







TAGAATTTTTTAAACTTG





BICF230J33141
117
29
38575425
0.310
TTTCCCCCAGTTTGTAGCAACTCCTATTAAAATGAACAGAGTC







TAAAGATGACTTATACTCCTTAGTTATGAATTATACTGTCTTT







TAAATTTTGTGCTAATATAATGGGTAAAAATAGGTTATTATTT







CCCTTAATTTGCATACAGTATTCTTAAAATTTACCTTCTTTTT







CTTCTAAGGTATAAAAATTCCTCTCTTG[C/T]ACTGGCAAGC







GCTTGTTCTCTAAATGTACAGAATTTTCTTTGATAGCAGAAGT







ATAATTCCATAGATAATATTTTTCCTCAGGACTATTATTGGTA







TATTGTCACAGATTTTCACTTCAAAGGAATATCTCTTCTCAGA







CTATTTTCAGCCATTTTAGATTAAATTCTATTTTATGATAACA







NTAAATGAGTATATATTC





BICFG630J610801
118
30
34498508
0.321
CCTTTGATGGTTCATGGAAGTGACAAACTTTCAGTGCCTTTCT







CAACTCAATACAGGAGCGTGATCATTTTTGTAAGCCTGTAAAC







AAATTCTCACAAAGCTCAGAGTAGCCAAACTTCATGATTAAAT







GTAGCAATAAAAATATGGTGGGCATTTCAAACCTTGTTTTTTG







GATAAGCAGCCACATACTTCGGTGTTTT[G/T]TTTGTTTGTT







TGTTTGTTTGGTCTCCTAGTTCTGGCTGGCGTGGTAAACTCCC







TCTTAGGCTGAATAAGTGTTGGAATAGGCTAGTCTCAATAATT







GAACATTCAGGATAACCAGGAGGTGGTCTGGCTCTTCAGGGTT







CTGTAGCCCAGACACATCAAGGTCACTAGAGGGGAGCCATGGG







AAATCACTTTTGCTCTCC





BICF230J27652
119
32
7408543
0.362
GCTCCCACGATTCCATTTATTTTTAAAAGGCGGCGGGGGGCGG







CGGGGGGAGTATTCCTCAGTTGGCATTTTCAAAATATGCCAGA







TTTAATCTGCCACTGGCTTTATTTTTGCAAAAAGTAGGCAAAT







TCAAGAAAAATAATGTCTAATAGTTGAAATGTTCTGCTTGGAT







TCATAGAGGCAAAAGGAGTATAAACAAG[G/T]AGTAATATAA







GTTGTTTCCTTGTCCTGTGTATCTGTCACCAGTGATGGAGGAT







TCAGGCATCCAGCGAGGCATCTGGGATGGAGATGCCAAGGCTG







TCCAGCAATGCCTGACAGATATTTTTACCAGTGTTTACACCAC







CTGCGACATCCCTGAGAATGCTATATTCGGTCCCTGCATCCTG







AGCCANACTTCCCTGTAT





BICF234J35168
120
33
28805876
0.298
GGAGTGGATTTTGGAAGTGATGACAAGTGGCTTTGGTGGGCAA







GAACTGCATGAAAAAAAAAAAAACTTGTGTCAGGTTTTGGTCA







TGGTTCTACACACTGTGATGATTTTATGTTCTTAGGAAGGTTT







CTATCTTTCTCTTTACAGCTGCAGCTTATGGAAAAGGAACCTA







TTTTGCTGTTGATGCCAGATATTCTGCA[A/G]ATGATATATA







TTCCAGACCAGACAGCAATGGGAGAAAACATATTTATGTTGTA







CGAGTACTTACGGGAGTCTACACACTGGGACATGCAGGATTAG







TTACCCCTCCATCAAAGAACCCTCACAATCCCACAGATCTGTT







TGACTCTGTCACAAACGATACACAACATCCAAACCTGTTTGTG







GTATTCTCTGATAATCAA





BICF237J26004
121
34
21417087
0.354
CCTCTCTCTCTCTCTCTGTGACTATCATAAATAAATAAATTGA







AAAAAATNAAAAAAAAATAGGGGTATGACACCAGTTTGACAGA







TTATTGGTAACTTTAAGAAAAGCGGTTTCTATCAGCAGCAATA







AGGACTAGGTGGGGGCTTCATGGCTTCTATTTCTTTAGCATTC







ATTAATTTAGCATTCAGTAGATATTCAC[C/T]GAATGCCTTG







TGTCCTAGATCCTGTACTAGGATACAATGGTGAAAGGATGTAA







TCTCTGTTTTCATGGAATTTAAAGTTTAGTGTGGGATGTAGAC







ATTAAACAAATAATGACACCAATAATTAATCCAGTGGTCCAGA







CATGATTAAAGGAAAAGTGTAGTACCAGAGAGGGTATGTGTCA







CAAGAGAGCTAAATCCAC





BICF237J30138
122
34
21421213
0.343
ACATCCTAGTGAAAGATGGTATACCAAACTTTAGAGCATTGTC







AGACCCCAGGGCTTTGACTTGGGTTTATCCAGTACAAGTAGTT







TAGTAAAAAACTGTTCAATTCCTAGCTTCTACTTAGCAATATT







TTGTGAGCCTAAAATTTCATTTCTTAATATTTATTTTGTTAAT







TTCTTTATATTTCACCACTAGCTGTTTA[T/C]TAAATGGCAT







TAAANGATAAGTGAATGTCTTGTTACTTTGGAAACTATGTAAG







TTGAAATTCTAGCTATATGATTGATTAATAAAGGAAACATAAA







GTCTTTTCTTTATCATCTTCACAGATAGAGTTGTTGAAACAAG







GGGACCGCTATGCTCAGTGAGAAGTGAGAAGAGGTACATGGTT







CAGTTCATTCTAAGTTTT





BICF237J30137
133
34
21421228
0.358
ATGGTATACCAAACTTTAGAGCATTGTCAGACCCCAGGGCTTT







GACTTGGGTTTATCCAGTACAAGTAGTTTAGTAAAAAACTGTT







CAATTCCTAGCTTCTACTTAGCAATATTTTGTGAGCCTAAAAT







TTCATTTCTTAATATTTATTTTGTTAATTTCTTTATATTTCAC







CACTAGCTGTTTANTAAATGGCATTAAA[A/T]GATAAGTGAA







TGTCTTGTTACTTTGGAAACTATGTAAGTTGAAATTCTAGCTA







TATGATTGATTAATAAAGGAAACATAAAGTCTTTTCTTTATCA







TCTTCACAGATAGAGTTGTTGAAACAAGGGGACCGCTATGCTC







AGTGAGAAGTGAGAAGAGGTACATGGTTCAGTTCATTCTAAGT







TTTCTTTTGATATATTAT





BICF233J46097
134
34
39797181
0.489
AATCATCAGGGGTTGAGATTGCCGTATCAACTCAGAAAAAAAG







GAATAGCACTGCCCAGTTATTCTTTAACTTTTATTCTCCTCCC







ACAAGGCAAATAGCTTGAAAGCATGAGCTCTNCTTTTGAAGCA







GATTCCTCTTAGGCTCTTTCTCTGACCCGGCATAGCAGACACT







GCTGACCACCTACTTTGAAGCCATTCTA[T/C]CCACTAATCT







TCCCTTTGATGAAAAACTTGATTTTGTTCAGTTATCAGGAGAC







CACATAGTTTAGAAAAGGGTGGACCTTTCCCCAGCCCTATGGA







GGATGATAATTCATCTAATCCAATCATGGAAATTCCATTTTCC







TTGCCAGCGAAAAATTTAGGAGTGGGCATATTTATAATTCTGG







ATAGCGAGTGGGGAAGAG





BICF230J25861
135
38
16264182
0.343
TAAAATCTGAGACCTCCTTATGCAAATCATTTTGCCTTTAGCC







ATTTCAAAAAGAAATGAAGGACCTAGAGGATTTTACAGTTTTA







CATAACACTGGTGAGATGGTTGTCAACTTTGATCTTACATTAA







TTAGTTAAGATCTTGACTGATCATAGCAAAAGCAAACTAAAAA







ATCTGGTCCCCAGTTAAAATGAAATACA[G/C]CTACGACCTA







TAATGATGAAAATTTCTGCTTTATCTGTGATATTCTCCAATAT







TTGGCATATTATTGAAGGGCATATGATAACATAATTCATTGTC







TAGTAAAGTGATTCACATGATCTAAGTACATTTTTAAACCTTA







TTATATAGACATCAATCTCAATATTAGGTTGTTGTATACTTAA







GCCATTGGGGGTATAAAT





BICF229J19422
136
38
16280473
0.098
GCTACTTCCAGCACAACCAAGGGACAAGCAATTTTCAATAGAA







AAATAGAAAAATCTGTTTCAAGAGGGAAAGAACTCCCTTGCCA







AGAATCTGTAGGTCAATGATCAGGAATTTGTGATATTTTAGAG







TTTGAATATTTACCAAAGATGATTGTAAATTCACTAAATACAA







AAACAGGCCAACAAGCAGAACTCACTAC[T/C]GTATTTGTTC







CAATTAGCTGGAATTATTGACATTTTATGTTTTAAAGATCAAC







AATAAACTGTTTTATGCTAAAAATAAAAAATAAACAAAATAAA







TTCACTAAATACATACTTTTACCACTCTACTTGGTTTTGGGTA







ACGTTAACCTATCTTCTGTTTGAACTAATTAATTATTCACTGA







AAAATCTGTTTTTACAGT





BICF236J58292
137
38
16334172
0.329
AATAGTTTTTTTTTTCTGCAGGTATTCAATGATGACAAATGAT







TTTTCAAAGGGTAAACAATTAGTATATAGGATTCAGCACATTA







AGCAGACTTCATGGTCTCTTTATTAATCTTGAACTTGACAATT







TTTGAAAATTTTGAATCAATGGGTTTCTGGTTACTTTCCAATG







ACATTTAACAGTTAGACTTAAGAATACA[A/G]AAGAAAGCAT







TATAAGTTTTATCCAAAGTGATTATGGCCATCATTTGAATAAA







ATACAGATTTTGCAAGCTGTAAAGCATATCATTACTACACACT







GGCCTAAGTAAGGTTTGGTTCACAAACTACAGAGCTCGAACCA







AGTCATTAAATCTATCTAAAAGGGCCTCATTTGAGAAGCAATA







AAATTTTTAATCATTTTA





BICFPJ1148955
138
X
107354447
0.482
CATAAGTATTCTGGGAAGAAAATTCTGGAAGGGGAGGGGAAGG







AGAGTTTGTTGTCTTTAGCCATTTCCTCTGGAGGAGGCCAGTT







GTTGCTATGATGACATCCTACACCAGCCTTCTAGCAGAAGAAC







TGAATCCAGAGATGCCCCTGTCAGGTTGAGGGCTTGTGGCATT







TTGAACCAAGTGATCCCAGGACCCTGGG[G/A]TCATTCGCAA







TCCAAGGGGACCAGAAGCCCATCAATAGGAACTTCTGGAATGC







CTGCCAGGGGGGTGAGACTGTCCAGTGCACAGATCCTGCTGGG







TTAGTCGTCTGGAGATCCTCCGAGGGGACTCAAAAGAGCTTTT







TGTTCCACTCACTGTTTGCTTTTCTTTTCCTCTTTCTAGCTAG







GTTGAACATGAGATCTGG





BICFG630J751770
139
X
107745838
0.484
TAAAACTAAATATGGCTTTCATAAACTCAACAAAGAGAGGCCA







ACCTTGGTTGTTTCTTAAATCTCCATGGCTTCAATATAATGCC







ACTCAATTCTAGGAACTGAAGAGAGGGGAAAGAAACAGAATGA







AAAACTTAAGAATGTACAAAGATTGGTGAAGAGCCTTCTCTGT







TTGGTGGCCTCCATAGAGAGCTTTTGTT[T/C]TCAAAATTCC







CAGACTCTCCAAGAGTCTTAAAGGAGAGTAACTCAATCAAGCC







TCCTCAGGTTAAAGGGGAGAGGGGGAAAAAAAAAGCTTGTCTA







TTAACTCCAAACCAATCACCTCCTGCTTCCCTTGTATTACACA







AAGTCCCTAGTGACTTTGTCTTCAGTGAGTAAATAAATAACCC







CCCTAGAAGCAGAATAGT





BICFPJ818033
140
X
107749150
0.392
CTTCGATTGTGTTAATGAGCACCTAAAGCTAAAGGATCTGTCC







TGTGCATTTGATTACTGCCATTGGTTCTTCCAAAAGTGGTGTC







ATATTTATCCTGCAGAGAAAAAGGGAAGGCATTTCCAAGTGCA







ACAACAGTAATAATTATTAATAATAATAAAACCTCCAACCCTT







CCTCTTCCTTTTCCCCCATCCCTCTCTC[A/G]AATACAGCTG







GTTTATCATTCTGTAACTCACAATTTCCAAAAAATAAGCTGAA







AAATGTGATGGTATTAAATATGAATAACCATCTGCTCCATCTC







TTTGAGGAGAGCTGGAGCTCCAAACTCCAAATATAGCACACCA







AAAAAGCCCGCCCTCTGGCATACTTAAGCCAAGGGTTCTTCTT







TCAGCCTACACACTCAAA





BICF231J51880
141
X
107793509
0.367
AATAAATACCAAGAGCTGTCAACTCAGAATCCACAGCACACTT







TGTATCTCTAACACTGCATTTTTCAATCTAGTATCATAGTCAT







TAGTATATCAGCGGTGTTCCTACTACTAAATGTAAGTTTCTGG







AGGCAGGGACTATATCTTGGCCATCCTTGTATTACCTGACATA







AATGGACCATATGTTGGTCCTTCTATGA[G/A]AAGCAAGTGT







GGCAGTGTTTCTTCAGGGAGTGTCTCAACTATGACAACTACCT







TTTTATCACTTTGCATGCAAGTTGTCCATGAAACTCGGTATAC







CTGAACCTAAAACAGCAGTGTTTGGGTTCGCAGCTGTTTCTGG







CTTCCAAAGCTCCCTGTAGCTGATATTTCAGCAAACCATGAAG







ACTCGGTAGAAAGCTACA





BICF229J272
142
X
107804940
0.391
GCTGTTCTCCAACCTATCCAAACTAGTCCAACAAATGTGTCAT







TTTTGTCTCTATGTGCCTTTACATGACAGTCTACTTAAGGCAA







CTAGGCACTCCTCCTCCATAGCTCAATGACCCCCCAGATCCCT







TCATGGTGCCCGGCTCCATGGCAGTTCTCATGGTGCTGTCTTG







ATATCCATGGTTTTCCTTATCTGGCTCC[C/T]TCCCAAGGCA







GTGATTGTATCTTATTCATCAGCTCCTAGCTCAGTACCTGGCA







TAGAATCAGTGCTCAATGAATGTTGGCTGAAGTAGAGAATGAA







TGAAAAACTTTTAAATATTTATGCAGAGATGGCATATAAAGTT







CATTATTAGTAGGTTCCTTTCCTGAGAATTTCCATCTGTTTTC







CAAAACAACTAAATGTGG





BICF235J49607
143
X
107811584
0.379
AACTGCTTCAAACTAATTGGGGAGCCACAGCTGGCAACCCAGA







TACAGTTGCTAGGTCTCACGCTTTAAAACAACAACGACGACAG







ATACCAAACTCTCTCTTTGTTTCAAGAACACTGGTCTTTATTT







TTACGATGTTTCATCTATTTTGAGACATTCCAAGTCGATCTTG







GCTCTTCAGGGTTGCCTTTTACCCATAC[G/A]GGAGTTTGGC







CTTCTGGATTGGTTCCTGCACTTTCCAGGGCCCTCTAAGGCCA







GATTCCTCAATCTTTGGGGAGTTGCAACACTACCACGGTTTGT







TTGATTTTCTCATTCTTGTCTGTCCCCTTCACCCCCCNCACAC







ACACACACAAAAGCGGTGTATGGCATTCATAAAGTGAATTACA







CTTGATTTTTGTTTAGAG





BICFPJ1116830
144
X
107818542
0.365
AAACTGGCCCTGGGGTTTGATGCAATATGTCTTTGGCCTTAAA







GCATCTGCTCCTCCTCCAAGCAGCTTCTTCAAGGCTTTAAGCC







AATCTAAGGGAGGAGTTGAATAAAAGGTGCAAGGCTTAAGAAG







GAGTTCCATTTTTATAAGGGCCTCTTAAAGACTTTCTGGCCTT







CTAGCCCCCAAGCGACAGCTCTGGCTGC[G/A]GAATTTCAGG







CACTGTCCAGACCCTGAGGAAGAACAGGCTTTGGGGCAGGAAC







GCCTGCAGGAACAGCACGGGTGGCTGTGGATCCGGTACGCTCT







GCCTATCCGGTGACACAGAATGTTAACTGACAAACATCTGGTT







CCTTTACCGAAGTATAAAATGTGTTGCATTATATGCTCTCTAC







AGCCCAGATGAGTGACTA





BICFPJ1327162
145
X
107875456
0.442
TCTGCTTAGTCGGCTTAGTAGTCAGTTATCCATTGAACAGGAA







TTTCTTTAAAATTCTGAAACCAAGAAGTCTAACAATCTTTGCT







GAGGGGCTCTGTATGTGTGTTAGGGCCCATGTTCAATGCTGCA







GCAAGCAATTTAAAACTTTACCTTACTGTTCACTTTCTGCTTT







TGCAGAGCCTCAAGTCAACCAGTGGTGA[C/G]AGTTTAGGAC







TCAGGTCCTTCCTGAGTATGTGAACAACTCTGAGGTTGCACAC







GGCCTTCTAGATTCCCAGGAATACATCAGACAGCTTTTCAAAA







CCTCTATGAACATCTCATTCCCTAGCTTTATTTTAAGGGTTTT







GGTTAACTTGTTGTTTTCTACAACTGCTATCACTTCCCTAGAC







AGCCAGGAAGTTAAACAA





BICF235J47857
146
X
107955905
0.526
CTTCACTAGAGTATGAGAACCATGAAGACAGGGACTTTGTTTT







GTTCATACTGATTCTCTAGCACTAAGAGAGCACCTGGCACATG







ATGCTCAGTAAACATTCCTGGAAGGGGGGAGGGAGGAGGAAGT







TTACTATTTCTATATACTAAACACTATGATTTCTGAGTTTGTC







TTTTGCCTTTTAAGATTTTTTTTTATTT[A/G]CGTATTTGAG







GGGCTCCTGGGTGGCTGGCTCTGTGGTTAGGCGTCTGCCTTCG







GCTCAGCATGTGATCCCAGGCCCAGGGATCGAGTCCCGTATTG







GGCTCCCTGAGAGGGGCCTGCTTTTCTCTCTGTGTCTCTGCCT







CTTTCTGTGTGTCTTTCATGAATAAATTTTTGTTTTAAAAAAG







GATTTATTTATTTGAGAG
















TABLE 3







The 7 SNPs used in a model of predicting dog size:



















SNP
SNP
SNP
Sequence



SNP
Chr
Location
Gene
score 0
score 1
score 2
SNP = [wildtype base/alternative base]


















BICFPJ1149345
4
70324248
Growth
T
TG
G
ATTGCAATGAATTTGTTTTAATTTGGTGTC



(SNP 1; SEQ ID


Hormone



TTCACATCCCTGGTTCACCTAGTTACTAAC


NO: 7)


receptor



CTGGGGATGTTGTCTCACTCCTCTTGACAT









AGTGTGTGCCACACAGCAAATGCTCAGTAA









GCACTCACTGAACTGAACTGACTTGCCCAG









TACGACTACCAGGGTCAGATTCAACTCACT









ATAGACTCACTTGCTGACTT[G/T]GATCA









AATTTAATTTTATTAAAAATACAAGAACTA









GCAGATAGAGGTTGTTGTTGTTGTTTCTAA









ATCAAACTTATCCTCAGAACAGTCATTGTA









AAAATGATAAATATAGAAGTGTCTCATTTA









ATAAAAGTTTATGCTATAAAATCAGTTCTA









TCGTTAAAAACACCTTAAACATTAGCATCC









TCTTTTCCACAGTTT





BICF230J67378
10
8445140
HMGA2
A
AG
G
CATTACTGGTAATTGTGACCCACTTTTATT


(SNP 2; SEQ ID






TATCCATTCATTTCACCATTTTTCATAATA


NO: 35)






TAAGTAGGAACCATGAATCTCCTCACCCAA









AAGAAGTCAGAACACTCTGATCACAGCTCA









CATTCAGCTACGTGGTTACTTCCTAGGACA









TCCCTTTTGATTCCAGACCTGAGACAATAA









CCACATTGCCTTCTACATTC[G/A]TAATT









CCCTTGATAATCTCGTTATACAGGATTACA









TCTCCCTATCATTAAGAAATATTTTAGTCA









TTTTTAACTTTATAAAAATGGCGTTGCAAA









TTATTTTTCAGAACTTGTTTTTTACTTAGT









ATTGTATTGCTAATACTCATTCATATTTAT









AAATGCTGTACTTCATTCAACTACTGTGTC









ATATTTTATTACTGA





BICF235J47583
10
11451490
HMGA2
T
TG
G
AAAAGCANCATATCCAACATTTGTAGTTTG


(SNP 3; SEQ ID






TTACAATAACACATTGAAAAGATTTATAGA


NO: 58)






CTGTTTTGGGTGTGATTTTTGGATTAATTC









CCTACTTTGAAACCATTTGTGAGGCTCTGT









TTATTTAAAGGAGGGAATGAATAGACCTGA









AAACACCTAATTTTCATTTTCATCTCAGAC









TGGAAGCCAGTACATCTGTA[G/T]GGTTT









GTTTTTTGGGTTTTGTTTTGTTTTGTTTTT









TTGGTTTTGTTTTGTTTTGTTTAGAATTGA









AAACTAGATCACAGAACACACAATGCTATA









TTTATCATTTTGATCATCGGTTATTAGATG









CTTGTTTGCATGTGCTTAAGCCTCTAGCCA









AGATAAAAAAAAATTTTNAAAAACTATTGT









GGTAATAGAGTCTAG





BICFPJ401056
15
44263980
IGF1
A
AG
G
CAAGGAAAAGAAGTTATAAACTGGCCCTCT


(SNP 4; SEQ ID






CTAACTTGTACCTGCCTTGCTGTAGGTTGA


NO: 84)






GGTCTTTCTGAACAATCGTGTCCTTTAGAT









ATCTGGACCTTCATTAACAGGTTCAGGCTT









GGGAACTTGCCAAATTCCAGAAAGGGTCTA









GTGAAGGCATTCAACTGGGGAGCCAGCTGC









CTCTTTGGAAAGTGGTTTTA[G/A]TTTAC









CCTTCATCTTCCAATAAGAGACAGAATCCC









AATTTTCTTAGCTCAAAACCATTTCTTTTA









GATTCNAATAGCAAACCTAATGGAACTAAT









CAACTCAGAGTCCTAAGAAATAATATTAGA









AACTGGCTAAGCATGACAAGGGAAGCAATT









TGATATGAGTAAAACACACATTTGTCCACT









CAATGCAATTAGAAA





BICF235J20169
20
35391970
?
A
AG
G
GTTTCCGAGCAGAGATGGAGAAGCAGGGCT


(SNP 5; SEQ ID






TGTAAAATGAACGCCGCCTTCCCCGTTGCA


NO: 96)






TCTTTGCTCCAGGGTGGGGGCCGCCTCGGT









TGTAATTTTACACCGATGTCCACACCCTGC









TAGGGAGCAAGAGAGGCGAACTGTAAGTGA









GAATATTTGCTCTGCCTCCACCCCCTGGAG









GAAGAGGAGCTGGTTCTCTC[G/A]GCAGC









CTGCGAGCAGAAGTGGGAGGGCTCCCCCCA









CCCCAGCCCCTGCGGCCAAGGGCCTGGGGC









CATGTGGGTGGGTCCCGAGGAGCAGGTCTT









CCCCCCAAAGAGGTGACAAAGACAATGGCA









GTTTGAAGGCGCAGCCAGCCCTGCCTTGAG









GTAAGGTTGGGGGTGCCGGTAAGCAGGCTG









CTCCGAGAAGGCACC





BICF235J29129
25
39552390
?
A
AT
T
CCACTCATTATGTTCCCTGCAGTATGGAAG


(SNP 6; SEQ ID






TTCTGTGGCCAAGGTTCATATAACTGAGAG


NO: 111)






TGTATTTATGGCGGTCCATACTCTTTCTTA









GGAAAATATTGATTTTCTAACAGCAGAATG









ACTGTAGAGCCGTTAAATCAGACTAGACTA









TCATAAACTCCAGGATTAACCAAAGAGTAC









TTTCACCTTTTCTTTTAGTT[A/T]CTCAT









GAGCCATCGGGAGTAGATACATCCACTTAA









GCAGGACAGGATCACAGCATTTATTACTTG









ATTTGAACAAACCACCACTATTCCCCACCC









TTATTGCCGGATAAGTAATTAAACATTCTG









CTCTTATTTTAAAGATTGACTGACAGGAAT









GAAAGAGGCCAAGTTGTATTTAAAAAAAAA









AAATACAAAGGCTTC





BICF235J47857
X
107955905
Glypican 3
A
AG
G
CTTCACTAGAGTATGAGAACCATGAAGACA


(SNP 7; SEQ ID






GGGACTTTGTTTTGTTCATACTGATTCTCT


NO: 146)






AGCACTAAGAGAGCACCTGGCACATGATGC









TCAGTAAACATTCCTGGAAGGGGGGAGGGA









GGAGGAAGTTTACTATTTCTATATACTAAA









CACTATGATTTCTGAGTTTGTCTTTTGCCT









TTTAAGATTTTTTTTTATTT[A/G]CGTAT









TTGAGGGGCTCCTGGGTGGCTGGCTCTGTG









GTTAGGCGTCTGCCTTCGGCTCAGCATGTG









ATCCCAGGCCCAGGGATCGAGTCCCGTATT









GGGCTCCCTGAGAGGGGCCTGCTTTTCTCT









CTGTGTCTCTGCCTCTTTCTGTGTGTCTTT









CATGAATAAATTTTTGTTTTAAAAAAGGAT









TTATTTATTTGAGAG
















TABLE 4







Breeds of dog and number of samples genotyped










Breed
Number














Afghan Hound
14



Airedale Terrier
15



Akita
15



Alaskan Malamute
14



American Cocker
15



Spaniel



Basset Hound
17



Bassett Griff Von Petit
13



Beagle
15



Belgian Sheepdog
13



Bernese Mountain
24



Dog



Bloodhound
15



Borzoi
13



Boston Terrier
14



Boxer
15



Bull Terrier
13



Bulldog
15



Bullmastiff
20



Chihuahua
10



Chihuahua (long coat)
6



Clumber Spaniel
20



Collie (rough)
10



Collie (smooth)
9



Dachshund LH
14



Dachshund SH
16



Dachshund WH
13



Dandie Dinmont
6



Terrier



Dobermann Pinscher
15



English Springer
15



Spaniel



French Bulldog
15



German Shepherd Dog
15



Golden Retriever
13



Great Dane
19



Irish Wolfhound
20



Italian Greyhound
12



Italian Spinone
15



Japanese Chin
10



Labrador Retriever
10



Maltese
15



Manchester Terrier
11



Manchester Terrier
15



(toy)



Mastiff
17



Miniature Pinscher
21



Newfoundland
25



Norfolk Terrier
12



Norwich Terrier
12



Papillon
20



Parson Russell Terrier
15



Pekingese
7



Pembroke Welsh
19



Corgi



Poodle (Standard)
17



Poodle Miniature
10



Portuguese Water Dog
17



Pug
15



Rhodesian Ridgeback
15



Rottweiler
25



Saint Bernard
15



Saluki
20



Samoyed
15



Schnauzer (Giant)
17



Schnauzer (Miniature)
16



Schnauzer (Standard)
12



Shih Tzu
10



Siberian Huskey
19



West Highland white
11



Yorkshire Terrier
13

















TABLE 5







Applying the model to the 65 breeds











Breed average


Breed
Predicted weight (kg)
(kg)












Afghan Hound
20.55
25


Airedale terrier
24.85
21.5


Akita
44.42
42


Alaskan Malamute
35.03
47.5


American Cocker Spaniel
13.88
12


Basset Hound
29.67
22.5


Bassett Griffon ven deen (Petit)
12.98
16


Beagle
12.89
11


Belgian Sheepdog
22.12
28


Bernese Mountain Dog
32.63
42


Bloodhound
55.54
43


Borzoi
27.06
41.5


Boston Terrier
10.55
8


Boxer
37.81
28.5


Bull Terrier
27.55
26


Bulldog
22.60
24


Bullmastiff
52.53
50


Chihuahua
4.60
2


Chihuahua (long coat)
5.16
2


Clumber Spaniel
31.42
32.5


Collie (rough)
27.60
24


Collie (smooth)
26.32
24


Dachshund LH
10.47
9


Dachshund SH
10.02
9


Dachshund WH
12.68
9


Dandie Dinmont Terrier
12.18
9.5


Dobermann Pinscher
24.71
35


English Springer Spaniel
22.96
23


French Bulldog
19.75
11.5


German Shepherd Dog
34.22
38.5


Golden Retriever
30.74
31.5


Great Dane
51.36
50


Irish Wolfhound
37.73
47.5


Italian Greyhound
5.62
3.3


Italian Spinone
39.06
32.5


Japanese Chin
3.74
3.5


Labrador Retriever
33.96
29.5


Maltese
3.19
2.5


Manchester Terrier
5.43
7.5


Manchester Terrier (toy)
3.22
4


Mastiff
70.40
82.5


Miniature Pinscher
7.19
4.5


Newfoundland
53.09
59


Norfolk Terrier
4.78
5.3


Norwich Terrier
3.04
5.3


Papillon
6.61
4.3


Parson Russell Terrier
5.56
6.5


Pekingese
3.89
4.5


Pembroke Welsh Corgi
12.62
11


Poodle (Standard)
17.34
26


Poodle Miniature
5.80
13


Portuguese Water Dog
19.88
20.5


Pug
2.78
7


Rhodesian Ridgeback
28.31
34.5


Rottweiler
24.31
45.5


Saint Bernard
73.31
70.5


Saluki
30.27
19.5


Samoyed
18.00
26.5


Schnauzer (Giant)
31.90
33.5


Schnauzer (Miniature)
6.68
6.5


Schnauzer (Standard)
17.27
15


Shih Tzu
4.37
6


Siberian Huskey
14.61
21.5


West Highland white terrier
6.09
8.5


Yorkshire Terrier
5.96
3
















TABLE 6







A breakdown of the samples used for the


size model validation (Example 3)









No. of samples














Genotyped
80



Called as Mixed breed dogs
66



Mixed breed with all 7 Model SNPs
62



Mixed mature dogs
48



Male Mature
24



Female Mature
24

















TABLE 7







Genotype and size prediction results for the Mixed 48 set (Example 3)










Chr



















4
10
10
15
20
25
X
Predicted
Actual



















SNP ID
1
2
3
4
5
6
7
(kg)
(kg)
Sex
Age (years)





















40051462
2
2
0
0
1
1
2
4.11
3.96
F
3


40047491
1
2
0
0
2
0
2
2.20
4.54
F
3


40049974
2
2
0
0
2
2
2
4.81
4.72
F
4


40043711
1
2
0
0
1
2
0
7.65
5.96
M
2


40060406
2
0
1
1
1
0
1
13.39
6.3
F
4


40053199
2
2
0
0
1
0
2
3.16
6.98
M
3


40050815
2
2
0
1
2
1
2
6.16
7.04
M
4


40036652
1
2
0
0
1
1
0
5.89
7.64
M
4


40036518
1
1
1
0
2
2
2
5.90
7.8
M
1


40056463
1
2
0
0
1
2
0
7.66
7.84
M
11.75


40055628
1
2
0
0
2
1
0
5.30
8.08
M
3


40055160
2
2
0
0
1
0
2
3.16
8.14
M
4.60


40048130
2
2
1
0
1
0
0
7.84
8.58
M
1.08


40049353
2
1
0
0
0
0
2
4.14
8.72
F
1


40055850
1
2
0
0
1
1
0
5.89
8.84
F
8


40043243
2
2
1
0
1
2
2
7.18
10.1
M
2


40054384
1
2
1
1
0
0
1
8.30
10.46
F
1.6


40050208
2
2
1
0
1
2
2
7.18
10.6
M
2


40049961
2
0
2
2
0
2
2
41.52
10.96
F
1.88


40049466
2
1
0
0
1
1
2
4.85
12.64
M
4


40047766
2
2
1
0
2
2
2
6.46
12.88
F
1


40048252
2
2
1
2
2
1
1
18.78
13.5
F
5


40059930
2
2
1
0
0
2
2
7.98
14.84
M
4


40046359
2
1
1
1
2
1
0
18.05
15.1
M
9


40052662
2
2
2
2
0
2
2
29.85
16.54
F
4


40047873
2
1
2
1
0
1
0
29.98
16.62
F
2


40058848
2
1
2
1
1
0
2
11.22
16.96
M
2


40059059
2
2
1
1
1
1
1
12.52
18.42
F
6


40040572
1
0
2
1
1
1
1
18.14
18.64
F
6


40056389
2
2
2
1
0
1
1
18.70
20.05
F
1.6


40046997
2
1
2
2
1
2
2
31.66
21
M
10


40048234
1
1
2
1
0
0
2
9.67
21.5
F
7


40045270
2
1
2
2
1
1
2
24.34
21.7
F
6


40047342
0
1
2
0
2
1
0
8.74
22.35
M
12


40042348
1
1
2
2
1
2
2
24.55
23.25
M
2.5


40054426
2
1
1
2
0
0
0
28.63
23.9
F
4.75


40048696
0
1
2
2
2
0
0
18.70
24.2
F
2


40059748
2
1
2
1
0
0
1
16.96
24.3
F
5


40059949
2
1
2
2
1
2
0
58.50
24.4
F
2


40052268
2
2
2
1
0
1
2
13.76
24.85
M
2.6


40037555
2
1
2
2
2
2
0
52.61
25.85
F
2.5


40059825
2
2
2
2
1
2
2
26.84
26.1
F
2


40050617
2
1
2
2
0
2
2
35.20
26.15
M
8


40057568
2
2
2
1
0
1
2
13.76
30.35
M
11


40049706
2
1
2
1
0
0
0
23.06
30.5
M
3.6


40056320
2
1
2
2
0
1
1
36.8
31.3
F
5


40046852
2
0
2
0
0
2
2
14.92
35
M
3.5


40036517
1
0
2
2
1
1
0
41.14
43.3
M
4
















TABLE 8







Average allele frequencies for 65 breeds across model SNPs









Chr
















Average
4
10
10
15
20
25
X


Location
weight
70324248
8445140
11451490
44263980
35391970
39552390
107955905


















Mastiff
82.5
0.12
0.13
0.00
1.76
0.00
1.88
0.00


Saint Bernard
70.5
0.40
1.47
0.00
2.00
0.13
1.87
0.00


Newfoundland
59.0
0.52
0.16
0.00
2.00
0.16
1.52
0.00


Great Dane
50.0
0.05
0.74
0.00
2.00
1.05
2.00
0.00


Bullmastiff
50.0
0.05
0.60
0.00
1.90
0.00
2.00
0.00


Irish Wolfhound
47.5
0.58
0.00
0.00
2.00
1.70
2.00
0.00


Alaskan Malamute
47.5
0.00
0.00
0.07
2.00
0.31
0.57
0.00


Rottweiler
45.5
0.00
1.42
0.00
0.08
0.56
1.20
0.00


Bloodhound
43.0
0.07
2.00
0.00
2.00
1.29
1.73
0.00


Akita
42.0
0.00
1.73
0.00
2.00
0.14
1.43
0.53


Bernese Mountain Dog
42.0
0.54
0.08
0.00
1.58
0.25
2.00
0.00


Borzoi
41.5
0.23
1.23
0.00
2.00
1.38
1.85
2.00


German Shepherd Dog
38.5
0.47
2.00
0.00
2.00
0.57
2.00
1.07


Dobermann Pinscher
35.0
0.00
1.87
0.00
2.00
1.87
2.00
2.00


Rhodesian Ridgeback
34.5
0.27
1.43
0.00
1.87
0.67
1.47
0.27


Schnauzer (Giant)
33.5
0.76
1.65
0.00
1.88
0.24
1.06
0.12


Italian Spinone
32.5
0.00
1.07
0.00
1.47
1.73
1.73
0.27


Clumber Spaniel
32.5
0.58
1.50
0.00
2.00
2.00
0.40
0.00


Golden Retriever
31.5
0.08
0.77
0.00
1.69
1.38
1.38
0.00


Labrador Retriever
29.5
0.20
0.60
0.00
1.80
1.00
1.60
0.00


Boxer
28.5
0.07
2.00
0.00
2.00
1.71
1.20
0.00


Belgian Sheepdog
28.0
0.00
0.46
0.00
2.00
1.69
1.38
2.00


Samoyed
26.5
0.00
1.73
0.00
2.00
1.87
1.87
1.73


Poodle (Standard)
26.0
0.35
0.47
0.00
1.41
1.29
1.18
1.29


Bull Terrier
26.0
0.00
0.00
0.00
0.46
0.00
2.00
0.00


Afghan Hound
25.0
0.00
1.07
0.00
2.00
1.20
0.13
1.87


Collie (rough)
24.0
0.00
2.00
0.00
2.00
0.80
1.80
2.00


Collie (smooth)
24.0
0.11
2.00
0.00
2.00
1.11
2.00
2.00


Bulldog
24.0
0.00
1.20
0.00
0.53
1.85
1.20
0.00


English Springer Spaniel
23.0
0.07
0.00
0.00
0.67
0.53
0.93
0.00


Basset Hound
22.5
0.44
2.00
0.00
2.00
1.29
0.88
0.50


Airedale terrier
21.5
0.00
2.00
0.00
1.47
2.00
2.00
0.27


Siberian Huskey
21.5
1.21
1.16
0.16
2.00
0.42
0.00
0.32


Portuguese Water Dog
20.5
0.13
1.41
0.00
1.50
2.00
0.50
0.25


Saluki
19.5
0.40
1.00
0.00
2.00
0.30
1.10
1.30


Bassett Griffon ven deen
16.0
0.31
2.00
0.00
1.85
1.08
0.46
1.38


(Petit)


Schnauzer (Standard)
15.0
0.00
1.33
0.00
0.83
0.17
0.00
0.67


Poodle Miniature
13.0
0.80
1.80
1.80
0.00
1.80
1.60
0.20


American Cocker
12.0
1.60
1.60
0.00
0.40
1.33
2.00
0.27


Spaniel


French Bulldog
11.5
0.07
0.40
0.00
0.00
0.29
1.07
0.00


Beagle
11.0
0.27
1.47
0.27
0.13
0.46
0.14
0.29


Pembroke Welsh Corgi
11.0
1.44
1.47
0.00
1.89
1.89
1.47
1.89


Dandie Dinmont Terrier
9.5
1.67
2.00
0.00
2.00
2.00
0.67
0.33


Dachshund LH
9.0
1.79
1.29
0.00
1.71
1.86
1.43
2.00


Dachshund SH
9.0
0.88
0.88
0.06
0.63
1.25
1.25
1.63


Dachshund WH
9.0
0.38
0.92
0.00
0.67
1.54
0.92
1.08


West Highland white
8.5
1.55
1.82
1.36
1.09
2.00
0.55
2.00


terrier


Boston Terrier
8.0
1.71
1.00
0.00
0.31
0.71
0.71
0.71


Manchester Terrier
7.5
0.36
2.00
1.64
0.00
1.82
2.00
2.00


Pug
7.0
1.27
1.47
2.00
0.00
1.20
0.67
2.00


Parson Russell Terrier
6.5
0.40
2.00
1.93
0.00
0.80
1.47
1.20


Schnauzer (Miniature)
6.5
1.44
1.86
0.50
0.00
1.87
0.13
0.13


Shih Tzu
6.0
0.70
1.00
2.00
0.00
1.00
1.60
1.40


Norwich Terrier
5.3
0.50
2.00
2.00
0.33
1.17
0.00
2.00


Norfolk Terrier
5.3
0.92
2.00
2.00
2.00
1.17
0.00
2.00


Miniature Pinscher
4.5
0.85
1.90
1.67
0.00
1.90
1.24
0.67


Pekingese
4.5
1.43
2.00
2.00
0.57
1.71
0.86
0.86


Papillon
4.3
0.80
1.70
0.40
0.20
1.80
1.10
2.00


Manchester Terrier (toy)
4.0
1.67
2.00
2.00
0.00
2.00
1.60
0.93


Japanese Chin
3.5
1.20
2.00
2.00
0.20
1.00
1.80
2.00


Italian Greyhound
3.3
0.25
1.83
1.75
0.83
1.00
0.17
1.83


Yorkshire Terrier
3.0
0.92
2.00
1.92
0.33
0.62
1.23
1.69


Maltese
2.5
0.93
1.07
1.93
0.27
2.00
0.53
1.73


Chihuahua
2.0
0.80
2.00
1.90
0.00
1.40
1.20
1.40


Chihuahua (long coat)
2.0
0.17
2.00
2.00
0.00
1.67
1.67
2.00
















TABLE 9







A conversion matrix for atypical breeds at IGF1 SNP


















IGF1











SNP




Average



IGF1
allele



SNP
frequency

Genotyped
Predicted
Predicted
Modified

Modified



Average
of similar

SNP result
allele of
allele of
allele of

genotype


“Atypical”
allele
sized
Genotyped
as a score
second
“atypical”
“atypical”
Modified
as a score


Breed
frequency
breeds
SNP result
(0, 1 or 2)
breed
breed
breed
genotype
(0, 1 or 2)





Rottweiler
0.08
2
AA
0
A
A
a
Aa
1





Aa
1
A
a
a
Aa
1





aA
1
a
A
a
aa
2





aa
2
a
a
a
aa
2


Bull
0.46
2
AA
0
A
A
a
Aa
1


terrier


Aa
1
A
a
a
Aa
1





aA
1
a
A
a
aa
2





aa
2
a
a
a
aa
2


Whippet
1.89
0
AA
0
A
A
A
AA
0





Aa
1
A
a
A
AA
0





aA
1
a
A
A
aA
1





aa
2
a
a
A
aA
1
















TABLE 10







Chromosome 15 breed calls for Mixed 48 Set










Chr15 (breed 1)
Chr15 (breed 2)













40036517
Fox Terrier (Wire)
Labrador Retriever{circumflex over ( )}2


40036518
Shih Tzu
Cavalier King Charles Spaniel


40036652
Manchester Terrier
Staffordshire Bull Terrier


40037555
Irish Setter{circumflex over ( )}UK
German Shepherd Dog


40040572
?
?


40042348
?
border collie


40043243
?
?


40043711
?
?


40045270
border collie
greyhound


40046359
shetland sheepdog
kerry blue terrier


40046852
Staffordshire Bull Terrier
bullterrier


40046997
Border collie
English Cocker Spaniel


40047342
eng cocker spaniel
Poodle (Miniature)


40047491
Yorkshire terrier
Yorkshire terrier


40047766
shetland sheep dog
border collie


40047873
chinese shar pei
shetlanf sheep dog




(or stafford shire bull terrier)


40048130
cavalier king charles spaniel
Shetland Sheepdog


40048234
german shepherd
English setter


40048252
german shepherd dog
german shepherd dog


40048696
English Springer
Labrador Retriever{circumflex over ( )}2


40049353
Australian Cattle dog
Parson russell terrier


40049466
welsh terrier
parson russell terrier


40049706
cavalier king charles spaniel
Portugese water dog


40049961
Whippet
Whippet


40049974
yorkshire terrier
west highland white


40050208
yorkshire terrier
whippet


40050617
samoyed
bearded or border collie


40050815
chinese crested
Yorkshire terrier


40051462
?
?


40052268
rottweller
old english sheepdog


40052662
german shepherd dog
old english sheep dog


40053199
Parson Jack russell
japenese chin


40054384
English Springer Spaniel
Border Terrier


40054426
Boxer
Am staff


40055160
?
?


40055628
Parson Russel terrier
Toy fox terrier


40055850
?
?


40056320
german shepherd dog
german shepherd dog


40056389
Border Collie
Labrador Retriever


40056463
?
?


40057568
Border collie
papillion


40058848
?
?


40059059
cocker spaniel
?


40059748
greyhound
border collie


40059825
german shepherd dog
german shepherd dog


40059930
shetland sheep dog
yorkshire terrier


40059949
poodle
boxer


40060406
Ihasa apso
king charles cavalier spaniel
















TABLE 11





Table showing the effects of the modification matrix when applied to the Mixed 48 set























4
10
10
15
20
25
X



BICFPJ1149345
BICF230J67378
BICF235J47583
BICFPJ401056
BICF235J20169
BICF235J29129
BICF235J47857











Before modification














40046852
2
0
2
0
0
2
2


40049961
2
0
2
2
0
2
2


40050208
2
2
1
0
1
2
2


40052268
2
2
2
1
0
1
2







After modification














40046852
2
0
2
1
0
2
2


40049961
2
0
2
0
0
2
2


40050208
2
2
1
0
1
2
2


40052268
2
2
2
2
0
1
2
















Standard predicted
Modified predicted
Actual wt







40046852
14.92
24.89
35



40049961
41.52
14.92
10.96



40050208
7.18
7.18
10.6



40052268
13.76
22.95
24.85









Claims
  • 1. A method of predicting the size of a dog that will be attained in adulthood, comprising typing the nucleotide(s) present for a single nucleotide polymorphic (SNP) marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions, and thereby predicting the size of the dog that will be attained in adulthood.
  • 2. The method according to claim 1, wherein the one or more positions which are in linkage disequilibrium are identified in Table 2.
  • 3. The method according to claim 1, wherein the one or more positions are selected from positions equivalent to: position 201 of SEQ ID NO: 7 (BICFPJ1149345, SNP 1);position 201 of SEQ ID NO: 35 (BICF230J67378, SNP 2);position 201 of SEQ ID NO: 58 (BICF235J47583, SNP 3);position 201 of SEQ ID NO: 84 (BICFPJ401056, SNP 4);position 201 of SEQ ID NO: 96 (BICF235J20169, SNP 5);position 201 of SEQ ID NO: 111 (BICF235J29129, SNP 6); andposition 201 of SEQ ID NO: 146 (BICF235J47857, SNP 7).
  • 4. The method according to claim 1, wherein the size predicted is selected from the weight or height of the dog.
  • 5. The method according to claim 1, comprising: a) typing the nucleotide(s) for a SNP marker present in the genome of the dog at a position equivalent to position 201 in a sequence selected from the sequences identified in Table 1 or 2;b) typing the nucleotide(s) for a SNP marker present at a position equivalent to position 201 in one or more further sequences selected from the sequences identified in Table 1 or 2; andc) predicting the size of the dog that will be attained in adulthood by combining the results from step a) and b).
  • 6. The method according to claim 5 comprising combining the results from typing the nucleotide(s) for a SNP marker at a position equivalent to position 201 of the following: SEQ ID NO: 7 (BICFPJ1149345, SNP 1);SEQ ID NO: 35 (BICF230J67378, SNP 2);SEQ ID NO: 58 (BICF235J47583, SNP 3);SEQ ID NO: 84 (BICFPJ401056, SNP 4);SEQ ID NO: 96 (BICF235J20169, SNP 5);SEQ ID NO: 111 (BICF235J29129, SNP 6); andSEQ ID NO: 146 (BICF235J47857, SNP 7).
  • 7. The method according to claim 1, comprising typing the nucleotide(s) present for one or more SNP markers present in the genome of the dog and inputting the results into a model predictive of the size of the dog.
  • 8. The method according to claim 7, wherein the size of the dog is predicted by adding a multiplication of the value of one SNP marker by a constant and adding a second constant, wherein the result is the log-body weight.
  • 9. The method according to claim 8, further comprising adding a multiplication of the value of a different SNP marker by a constant.
  • 10. The method according to claim 9, comprising adding a multiplication of the value of a SNP marker by a constant for each of the SNP markers defined in Table 3 and adding a second constant.
  • 11. The method according to claim 1, further comprising determining the genetic breed inheritance of the dog; and/or wherein one or both parents of the dog is or was a pure-bred dog; and/or wherein one or more grandparents of the dog is or was a pure-bred dog; and/or wherein the dog has or is suspected of having genetic inheritance of a breed selected from a breed identified in Table 4 or 5.
  • 12. The method according to claim 1, wherein the dog is a mixed-breed dog, further comprising determining the breed origin of the nucleotide(s) present for the SNP marker.
  • 13. The method according to claim 12, wherein the breed origin of the nucleotide(s) is determined by genotyping a sample from the dog for a panel of breed-specific SNP markers.
  • 14. The method according to claim 1, wherein the typing is performed by contacting a polynucleotide or protein in the sample from the dog with a specific binding agent and determining whether the agent binds to the polynucleotide or protein.
  • 15. The method according to claim 14, wherein the agent is a polynucleotide.
  • 16. The method according to claim 1, wherein the nucleotide present at a polymorphic position is detected by measuring the mobility of a polynucleotide during gel electrophoresis.
  • 17. The method according to claim 1, wherein the nucleotide present at a polymorphic position is determined by hybridising at least one oligonucleotide primer and contacting the sample with a polymerase under conditions suitable for generation of a primer extension product, wherein determining the nucleotide comprises detecting the presence of the primer extension product.
  • 18. The method according to claim 1, further comprising providing the dog's owner or carer with a report of the predicted size of the dog that will be attained at adulthood.
  • 19. The method according to claim 1, further comprising determining the susceptibility of the dog to a disease, wherein the phenotype of the disease is influenced by the size of the dog.
  • 20. The method according to claim 19, wherein the disease is canine hip dysplasia (CHD).
  • 21. The method according to claim 19, further comprising providing care recommendations to the dog's owner or carer to control the weight or growth rate of the dog.
  • 22. A method of preparing customised food for a dog that has had its future size predicted, the method comprising: (a) predicting the size of a dog that will be attained in adulthood by a method according to any one of the preceding claims; and(b) preparing food suitable for the dog, wherein the customised dog food comprises ingredients that are suitable for a dog of the predicted size, and/or does not include ingredients that are not suitable for a dog of the predicted size.
  • 23. The method according to claim 22, further comprising providing the food to the dog's owner or the person responsible for feeding the dog.
  • 24. A method of providing care recommendations for a dog, the method comprising: (a) predicting the size of the dog that will be attained in adulthood by a method according to claim 1; and(b) providing appropriate care recommendations to the dog's owner or carer.
  • 25. The method according to claim 24, wherein the care recommendations comprise advising the type and/or amount of food that is suitable for the size of the dog that will be attained in adulthood.
  • 26. A database comprising information relating to one or more polymorphisms identified in Table 1 or 2 and their association with size of a dog in adulthood.
  • 27. A method of predicting the size of a dog that will be attained in adulthood, the method comprising: (a) inputting data of the nucleotide(s), and optionally the breed origin of the nucleotide(s), present at one or more SNP marker positions in the dog's genome as defined in any one of claims 1 to 3 to a computer system;(b) comparing the data to a computer database, which database comprises information relating to one or more polymorphisms identified in Table 1 or 2 and their association with the size of a dog in adulthood; and(c) predicting on the basis of the comparison the size of the dog that will be attained in adulthood.
  • 28. A computer program encoded on a computer-readable medium and comprising program code means which, when executed, performs all the steps of claim 27, or a computer system arranged to perform a method according to claim 27 comprising: (a) means for receiving data of the nucleotide(s) present at one or more SNP marker positions in the genome of a dog;(b) a module for comparing the data with a database comprising one or more polymorphisms identified in Table 1 or 2 and their association with the size of a dog in adulthood; and(c) means for predicting on the basis of said comparison the size of the dog that will be attained in adulthood.
  • 29. A kit for carrying out the method of claim 1, comprising a probe or primer that is capable of detecting a polymorphism as defined in any one of claims 1 to 3.
  • 30. A method of managing a disease condition influenced by the size of the dog, comprising predicting the size that the dog will attain in adulthood by the method of claim 1, wherein the dog has been determined to be susceptible to a condition influenced by size, and providing recommendations to the dog owner or dog carer to enable the management of the growth rate or size of the dog and to thereby reduce the likelihood of symptoms of the disease developing in the dog.
  • 31. The method according to claim 30, wherein the disease is canine hip dysplasia (CHD).
  • 32. A method of determining whether the genome of a dog contains one or more SNP marker(s) predictive of the size that a dog will attain in adulthood, comprising typing the nucleotide(s) present for a SNP marker present in the genome of the dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions, and optionally further comprising determining the breed origin of the nucleotide(s) present for a SNP marker.
  • 33. Use of one or more SNP marker(s) present in the genome of a dog at a position equivalent to position 201 in one or more of the sequences identified in Table 1, and/or at one or more positions which are in linkage disequilibrium with any one of these positions for predicting the size that a dog will attain in adulthood.
Priority Claims (1)
Number Date Country Kind
0722068.4 Nov 2007 GB national
PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/GB08/03760 11/7/2008 WO 00 8/2/2010
Provisional Applications (1)
Number Date Country
61042341 Apr 2008 US