PROCESSES AND KITS FOR IDENTIFYING ANEUPLOIDY

Information

  • Patent Application
  • 20220098644
  • Publication Number
    20220098644
  • Date Filed
    October 15, 2021
    4 years ago
  • Date Published
    March 31, 2022
    4 years ago
Abstract
Provided are methods for identifying the presence or absence of a chromosome abnormality by which a cell-free sample nucleic acid from a subject is analyzed. In certain embodiments, provided are methods for identifying the presence or absence of a fetal chromosome abnormality in a nucleic acid from cell-free maternal blood.
Description
SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jun. 26, 2014, is named SEQ-6027-US_SL.txt and is 5,172,775 bytes in size.


FIELD

The technology in part relates to methods and compositions for identifying a chromosome abnormality, which include, without limitation, prenatal tests for detecting an aneuploidy (e.g., trisomy 21 (Down syndrome), trisomy 18 (Edward syndrome), trisomy 13 (Patau syndrome)).


BACKGROUND

A chromosome is an organized structure of deoxyribonucleic acid (DNA) and protein found in cells. A chromosome generally includes a single piece of DNA that contains many genes, regulatory elements and other nucleotide sequences. Most cells in humans and other mammals typically include two copies of each chromosome.


Different organisms include different numbers of chromosomes. Most feline cells include nineteen (19) pairs of chromosomes and most canine cells include thirty-nine (39) pairs of chromosomes. Most human cells include twenty-three (23) pairs of chromosomes. One copy of each pair is inherited from the mother and the other copy is inherited from the father. The first twenty-two (22) pairs of chromosomes (referred to as autosomes) are numbered from 1 to 22, and are arranged from largest to smallest in a karyotype. The twenty-third (23rd) pair of chromosomes is a pair of sex chromosomes. Females typically have two X chromosomes, while males typically have one X chromosome and one Y chromosome.


Chromosome abnormalities can occur in different forms. Aneuploidy is an abnormal number of certain chromosomes in cells of an organism. There are multiple mechanisms that can give rise to aneuploidy, and aneuploidy can occur within cancerous cells or fetal cells, for example. Many fetuses with aneuploid cells do not survive to term. Where a fetus having aneuploid cells does survive to term, the affected individual is at risk of certain diseases and syndromes, including cancer and others described herein.


An extra or missing chromosome is associated with a number of diseases and syndromes, including Down syndrome (trisomy 21), Edward syndrome (trisomy 18) and Patau syndrome (trisomy 13), for example. Incidence of trisomy 21 is estimated at 1 in 600 births and increases to 1 in 350 in women over the age of 35. Down syndrome presents as multiple dysmorphic features, including physical phenotype, mental retardation and congenital heart defects (e.g., in about 40% of cases). Incidence of trisomy 18 is estimated at 1 in 80,000 births, increasing to 1 in 2,500 births in women over the age of 35. Edward syndrome also presents as multiple dysmorphic features and profound mental deficiency. Open neural tube defects or open ventral wall defects present in about 25% of cases and there is a 90% fatality rate in the first year. Incidence of trisomy 13 is estimated in 1 in 10,000 live births, and presents heart defects, brain defects, cleft lip and cleft palate, visual abnormalities (e.g., omphalocele, proboscis and holoprosencephaly) for example. More than 80% of children with trisomy 13 die in the first month of life.


Aneuploidy in gestating fetuses can be diagnosed with relative accuracy by karyotyping and fluorescent in situ hybridization (FISH) procedures. Such procedures generally involve amniocentesis and chorionic villus sampling (CVS), both relatively invasive procedures, followed by several days of cell culture and a subjective analysis of metaphase chromosomes. There also is a non-trivial risk of miscarriage associated with these procedures. As these procedures are highly labor intensive, certain procedures that are less labor intensive have been proposed as replacements. Examples of potentially less labor intensive procedures include detection using short tandem repeats, PCR-based quantification of chromosomes using synthetic competitor template and hybridization-based methods.


SUMMARY

Current methods of screening for trisomies include serum testing and may also include a Nuchal Translucency (NT) Ultrasound. If the calculated risk analysis is high, the patient may be referred for an amniocentesis or CVS for confirmation. However, the standard of care in the United States and Europe typically can achieve an 80-85% detection rate with a 4-7% false positive rate. As a result, many patients are being unnecessarily referred to invasive amniocentesis or CVS procedures. Amniocentesis involves puncturing the uterus and the amniotic sac and increases risk of miscarriage, and fetal cells obtained by amniocentesis often are cultured for a period of time to obtain sufficient fetal cells for analysis.


Technology described herein provides non-invasive methods for detecting the presence or absence of a chromosome abnormality by analyzing extracellular nucleic acid (e.g., nucleic acid obtained from an acellular sample). Methods described herein also offer increased sensitivity and specificity as compared to current non-invasive procedures (e.g., serum screening).


Determining whether there is a chromosome abnormality when analyzing cell-free nucleic acid can present challenges because there is non-target nucleic acid mixed with target nucleic acid. For example, extracellular nucleic acid obtained from a pregnant female for prenatal testing includes maternal nucleic acid background along with the target fetal nucleic acid. Technology described herein provides methods for accurately analyzing extracellular nucleic acid for chromosome abnormalities when a background of non-target nucleic acid is present.


Thus, provided herein are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise: (a) preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and (b) determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets.


Also provided herein are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise: (a) preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set by a single set of amplification primers, (v) and each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and (b) determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets. In another embodiment, amplification primers are modified or otherwise different from each other and yield amplification products at reproducible levels relative to each other.


Also provided herein are methods for identifying the presence or absence of an abnormality of a target chromosome in a subject, which comprise: (a) preparing three or more sets of amplified nucleic acid species by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences of each nucleotide sequence in a set in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and (b) determining the amount of each amplified nucleic acid species in each set; (c) detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; whereby the presence or absence of the chromosome abnormality is identified based on a decrease or increase of the target chromosome relative to the one or more reference chromosomes. In a related embodiment, the three or more sets of amplified nucleic acid species are amplified in a single, multiplexed reaction. In another embodiment, the amount of each amplified nucleic acid species in each set is determined in a single, multiplexed reaction. In another embodiment, the amount of each amplified nucleic acid species in each set is determined in two or more replicated multiplexed reactions. In yet another embodiment, detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; whereby the presence or absence of the chromosome abnormality is identified based on a decrease or increase of the target chromosome relative to the one or more reference chromosomes.


Provided also herein are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise: (a) preparing a set of amplified nucleic acid species by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in the set is present on three or more different chromosomes, (iii) each nucleotide sequence in the set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in the set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in the set comprises a nucleotide sequence having the one or more mismatch nucleotides; and (b) determining the amount of each amplified nucleic acid species in the set; whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise: (a) preparing a set of amplified nucleic acid species by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in the set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in the set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in the set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in the set comprises a nucleotide sequence having the one or more mismatch nucleotides; and (b) determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species in the set. In certain embodiments, two or more sets of nucleotide sequence species, and amplified nucleic acid species generated there from, are utilized.


In some embodiments, the chromosome abnormality is aneuploidy of a target chromosome, and in certain embodiments, the target chromosome is chromosome 21, chromosome 18, chromosome 13, chromosome X and/or chromosome Y. In some embodiments each nucleotide sequence in a set is not present in any chromosome other than in each and every target chromosome.


The template nucleic acid is from blood, in some embodiments, and sometimes the blood is blood plasma, blood serum or a combination thereof. The extracellular nucleic acid sometimes comprises a mixture of nucleic acid from cancer cells and nucleic acid from non-cancer cells. In some embodiments, the extracellular nucleic acid comprises a mixture of fetal nucleic acid and maternal nucleic acid. Sometimes the blood is from a pregnant female subject is in the first trimester of pregnancy, the second trimester of pregnancy, or the third trimester of pregnancy. In some embodiments, the nucleic acid template comprises a mixture of maternal nucleic acid and fetal nucleic acid, and the fetal nucleic acid sometimes is about 5% to about 40% of the nucleic acid. In some embodiments the fetal nucleic acid is about 0.5% to about 4.99% of the nucleic acid. In certain embodiments the fetal nucleic acid is about 40.01% to about 99% of the nucleic acid. In some embodiments, a method described herein comprises determining the fetal nucleic acid concentration in the nucleic acid, and in some embodiments, the amount of fetal nucleic acid is determined based on a marker specific for the fetus (e.g., specific for male fetuses). The amount of fetal nucleic acid in the extracellular nucleic acid can be utilized for the identification of the presence or absence of a chromosome abnormality in certain embodiments. In some embodiments, fetal nucleic acid of the extracellular nucleic acid is enriched, by use of various enrichment methods, relative to maternal nucleic acid.


Each nucleotide sequence in a set is substantially identical to each other nucleotide sequence in the set, in some embodiments. In certain embodiments, each nucleotide sequence in a set is a paralog sequence, and sometimes each nucleotide sequence in each set shares about 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with another nucleotide sequence in the set. In some embodiments, each nucleotide sequence in a set differs by one or more nucleotide base mismatches (e.g., about 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 mismatch differences). In certain embodiments, the one or more nucleotide base mismatches are polymorphisms (e.g., SNPs, insertions or deletions) with a low heterozygosity rate (e.g., less than 5%, 4%, 3%, 2%, 1% or less). One or more of the nucleotide sequences are non-exonic in some embodiments, and sometimes one or more of the nucleotide sequences are intergenic, intronic, partially exonic or partially non-exonic. In certain embodiments, a nucleotide sequence in a set comprises an exonic nucleotide sequence, intergenic sequence or a non-exonic nucleotide sequence. In some embodiments, one or more nucleotide sequence species are selected from the group consisting of those listed in Table 4B herein. In certain embodiments, the entire length of a nucleotide sequence species provided in Table 4B is amplified, and in some embodiments a nucleic acid is amplified that is shorter or longer than a nucleotide sequence species provided in Table 4B. In certain embodiments, the entire length of a nucleotide sequence species provided in Table 4B is detected, and in some embodiments a nucleic acid is detected that is shorter or longer than a nucleotide sequence species provided in Table 4B.


In some embodiments, one or more synthetic competitor templates that contain a mismatch are introduced at a known concentration, whereby the competitor can facilitate determining the amount of each amplified nucleic acid species in each set. The synthetic competitor template should amplify at a substantially reproducible level relative to each other nucleotide sequence in a set.


One or more of the sets comprises two nucleotide sequences in some embodiments, and sometimes one or more sets comprise three nucleotide sequences. In some embodiments, in about 50%, 60%, 70%, 80%, 90% or 100% of sets, two nucleotide sequences are in a set, and sometimes in about 50%, 60%, 70%, 80%, 90% or 100% of sets, three nucleotide sequences are in a set. In a set, nucleotide sequence species sometimes are on chromosome 21 and chromosome 18, or are on chromosome 21 and chromosome 13, or are on chromosome 13 and chromosome 18, or are on chromosome 21, and chromosome 18 and chromosome 13, and in about 50%, 60%, 70%, 80%, 90% or 100% of sets, the nucleotide species are on such designated chromosomes. In certain embodiments, each nucleotide sequence in all sets is present on chromosome 21, chromosome 18 and chromosome 13.


In some embodiments, the amplification species of the sets are generated in one reaction vessel. The amplified nucleic acid species in a set sometimes are prepared by a process that comprises contacting the extracellular nucleic acid with one reverse primer and one forward primer, and in some embodiments, nucleotide sequences in a set are amplified using two or more primer pairs. In certain embodiments, the amounts of the amplified nucleic acid species in each set vary by about 50%, 40%, 30%, 20%, 10% or less, and in some embodiments, the amounts of the amplified nucleic acid species in each set vary by up to a value that permits detection of the chromosome abnormality with a confidence level of about 95% or more. The length of each of the amplified nucleic acid species independently is about 30 to about 500 base pairs (e.g., about 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100 base pairs in length) in some embodiments.


The amount of amplified nucleic acid species means the absolute copy number of a nucleic acid species or the relative quantities of nucleic acid species compared to each other or some standard. The amount of each amplified nucleic acid species, in certain embodiments, is determined by any detection method known, including, without limitation, primer extension, sequencing, digital polymerase chain reaction (dPCR), quantitative PCR (Q-PCR) and mass spectrometry. In some embodiments, the amplified nucleic acid species are detected by: (i) contacting the amplified nucleic acid species with extension primers, (ii) preparing extended extension primers, and (iii) determining the relative amount of the one or more mismatch nucleotides by analyzing the extended extension primers. The one or more mismatch nucleotides are analyzed by mass spectrometry in some embodiments.


For multiplex methods described herein, there are about 4 to about 100 sets of nucleotide sequences, or amplification nucleic acids, in certain embodiments (e.g., about 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 sets). In some embodiments, a plurality of specific sets is in a group, and an aneuploidy determination method comprises assessing the same group multiple times (e.g., two or more times; 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more times). For example, a group may include sets A, B and C, and this same group of sets can be assessed multiple times (e.g., three times).


In certain embodiments, an aneuploidy determination method comprises assessing different groups, where each group has different sets of nucleotide sequences. In some embodiments, one or more sets may overlap, or not overlap, between one or more groups. For example, one group including sets A, B and C and a second group including sets D, E and F can be assessed, where each group is assessed one time or multiple times, for an aneuploidy determination.


In certain embodiments, a nucleotide sequence species designated by an asterisk in Table 4 herein, and/or an associated amplification primer nucleic acid or extension nucleic acid, is not included in a method or composition described herein. In some embodiments, nucleotide sequence species in a set of nucleic acids are not from chromosome 13 or chromosome 18.


In some embodiments, the presence or absence of the chromosome abnormality is based on the amounts of the nucleic acid species in 80% or more of the sets. The number of sets provides a 70% to 99.99%, and sometimes 85% to 99.99%, sensitivity for determining the absence of the chromosome abnormality in some embodiments (e.g., about 82, 84, 86, 88, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 99.5% sensitivity), and in certain embodiments, the number of sets provides a 70% to 99.99%, and sometimes 85% to 99.99%, specificity for determining the presence of the chromosome abnormality (e.g., about 82, 84, 86, 88, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 99.5% specificity). In certain embodiments, the number of sets is determined based on (i) a 80% to 99.99% sensitivity for determining the absence of the chromosome abnormality, and (ii) a 80% to 99.99% specificity for determining the presence of the chromosome abnormality. In higher risk pregnancies (e.g., those assessed as such by a health care provider or those of females over 35 or 40 years of age), it can be assumed there will be a higher frequency of the presence of a chromosome abnormality, and select (i) number of sets, and/or (ii) types of nucleotide sequences that provide a (a) relatively lower specificity and (b) relatively higher sensitivity, in some embodiments. In certain embodiments, a method herein comprises determining a ratio between the relative amount of (i) an amplified nucleic acid species and (ii) another amplified nucleic acid species, in each set; and determining the presence or absence of the chromosome abnormality is identified by the ratio. In some embodiments, the presence or absence of the chromosome abnormality is based on nine or fewer replicates (e.g., about 8, 7, 6, 5, 4, 3 or 2 replicates) or on no replicates, but just a single result from a sample. In a related embodiment, the amplification reaction is done in nine or fewer replicates (e.g., about 8, 7, 6, 5, 4, 3 or 2 replicates).


Also provided herein are kits for identifying presence or absence of chromosome abnormality. In certain embodiments, the kits comprise one or more of (i) one or more amplification primers for amplifying a nucleotide sequence species of a set, (ii) one or more extension primers for discriminating between amplified nucleic acid species or nucleotide sequence species of each set, (iii) a solid support for multiplex detection of amplified nucleic acid species or nucleotide sequence species of each set (e.g., a solid support that includes matrix for matrix-assisted laser desorption ionization (MALDI) mass spectrometry; (iv) reagents for detecting amplified nucleic acid species or nucleotide sequence species of each set; (vi) a detector for detecting the amplified nucleic acid species or nucleotide sequence species of each set (e.g., mass spectrometer); (vii) reagents and/or equipment for quantifying fetal nucleic acid in extracellular nucleic acid from a pregnant female; (viii) reagents and/or equipment for enriching fetal nucleic acid from extracellular nucleic acid from a pregnant female; (ix) software and/or a machine for analyzing signals resulting from a process for detecting the amplified nucleic acid species or nucleotide sequence species of the sets; (x) information for identifying presence or absence of a chromosome abnormality (e.g., tables that convert signal information or ratios into outcomes), (xi) container and/or reagents for procuring extracellular nucleic acid (e.g., equipment for drawing blood; equipment for generating cell-free blood; reagents for isolating nucleic acid (e.g., DNA) from plasma or serum; reagents for stabilizing serum or plasma or nucleic acid for shipment and/or processing).


Certain embodiments are described further in the following description, claims and drawings.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 provides an overview for using paralogs to detect chromosomal imbalances from a sample comprising a hetergenous mixture of extracellular nucleic acid. FIG. 1 discloses SEQ ID NOS 5178-5179, respectively, in order of appearance.



FIG. 2 shows more marker sets (e.g., multiplexed assays) increases discernibility between euploids and aneuploids.



FIG. 3 shows simulations where fetal concentration (10% vs 20%) versus decreasing coefficient of variation (CV) versus sensitivity and specificity are graphed.



FIG. 4 shows different levels of variance for different steps of detection and quantification by Sequenom MassARRAY, which includes amplification (PCR), dephosphorylation using Shrimp Alkaline Phosphatase (SAP), primer extension (EXT) and identification and quantification of each nucleotide mismatch by MALDI-TOF mass spectrometry (MAL).



FIG. 5 shows an example of a working assay from the model system DNA Set 1: no ethnic bias (p>0.05); Large, significant (p<0.001) difference between N and T21; Low CVs.



FIG. 6 shows an example of two poor assays from the model system DNA Set 1: Ethnic bias (p<0.001) and large variance.



FIG. 7 shows an example of a working assay and a poor assay based on DNA set 2. For the working assay, the observed results (darker crosses and corresponding light-colored line) show a linear response that match the expected results (lighter crosses and corresponding dark-colored line); whereas, the poor assay does not show a linear response and does not match the expected results.



FIG. 8 shows an example of a working assay and a poor assay based on DNA set 3.



FIG. 9 shows results from Experiment I, Tier IV. The chart is based on a Simple Principle Component Analysis, and shows the two main components can separate euploid samples from aneuploid samples. Euploid samples are designated by diamonds and aneuploid samples are designated by circles in FIG. 9.





DETAILED DESCRIPTION

Provided herein are improved processes and kits for identifying presence or absence of a chromosome abnormality. Such processes and kits impart advantages of (i) decreasing risk of pregnancy complications as they are non-invasive; (ii) providing rapid results; and (iii) providing results with a high degree of one or more of confidence, specificity and sensitivity, for example. Processes and kits described herein can be applied to identifying presence or absence of a variety of chromosome abnormalities, such as trisomy 21, trisomy 18 and/or trisomy 13, and aneuploid states associated with particular cancers, for example. Further, such processes and kits are useful for applications including, but not limited to, non-invasive prenatal screening and diagnostics, cancer detection, copy number variation detection, and as quality control tools for molecular biology methods relating to cellular replication (e.g., stem cells).


Chromosome Abnormalities


Chromosome abnormalities include, without limitation, a gain or loss of an entire chromosome or a region of a chromosome comprising one or more genes. Chromosome abnormalities include monosomies, trisomies, polysomies, loss of heterozygosity, deletions and/or duplications of one or more nucleotide sequences (e.g., one or more genes), including deletions and duplications caused by unbalanced translocations. The terms “aneuploidy” and “aneuploid” as used herein refer to an abnormal number of chromosomes in cells of an organism. As different organisms have widely varying chromosome complements, the term “aneuploidy” does not refer to a particular number of chromosomes, but rather to the situation in which the chromosome content within a given cell or cells of an organism is abnormal.


The term “monosomy” as used herein refers to lack of one chromosome of the normal complement. Partial monosomy can occur in unbalanced translocations or deletions, in which only a portion of the chromosome is present in a single copy (see deletion (genetics)). Monosomy of sex chromosomes (45, X) causes Turner syndrome.


The term “disomy” refers to the presence of two copies of a chromosome. For organisms such as humans that have two copies of each chromosome (those that are diploid or “euploid”), it is the normal condition. For organisms that normally have three or more copies of each chromosome (those that are triploid or above), disomy is an aneuploid chromosome complement. In uniparental disomy, both copies of a chromosome come from the same parent (with no contribution from the other parent).


The term “trisomy” refers to the presence of three copies, instead of the normal two, of a particular chromosome. The presence of an extra chromosome 21, which is found in Down syndrome, is called trisomy 21. Trisomy 18 and Trisomy 13 are the two other autosomal trisomies recognized in live-born humans. Trisomy of sex chromosomes can be seen in females (47, XXX) or males (47, XXY which is found in Klinefelter's syndrome; or 47,XYY).


The terms “tetrasomy” and “pentasomy” as used herein refer to the presence of four or five copies of a chromosome, respectively. Although rarely seen with autosomes, sex chromosome tetrasomy and pentasomy have been reported in humans, including)(XXX, XXXY, XXYY, XYYY, XXXXX, XXXXY, XXXYY, XXYYY and XYYYY.


Chromosome abnormalities can be caused by a variety of mechanisms. Mechanisms include, but are not limited to (i) nondisjunction occurring as the result of a weakened mitotic checkpoint, (ii) inactive mitotic checkpoints causing non-disjunction at multiple chromosomes, (iii) merotelic attachment occurring when one kinetochore is attached to both mitotic spindle poles, (iv) a multipolar spindle forming when more than two spindle poles form, (v) a monopolar spindle forming when only a single spindle pole forms, and (vi) a tetraploid intermediate occurring as an end result of the monopolar spindle mechanism.


The terms “partial monosomy” and “partial trisomy” as used herein refer to an imbalance of genetic material caused by loss or gain of part of a chromosome. A partial monosomy or partial trisomy can result from an unbalanced translocation, where an individual carries a derivative chromosome formed through the breakage and fusion of two different chromosomes. In this situation, the individual would have three copies of part of one chromosome (two normal copies and the portion that exists on the derivative chromosome) and only one copy of part of the other chromosome involved in the derivative chromosome.


The term “mosaicism” as used herein refers to aneuploidy in some cells, but not all cells, of an organism. Certain chromosome abnormalities can exist as mosaic and non-mosaic chromosome abnormalities. For example, certain trisomy 21 individals have mosaic Down syndrome and some have non-mosaic Down syndrome. Different mechanisms can lead to mosaicism. For example, (i) an initial zygote may have three 21st chromosomes, which normally would result in simple trisomy 21, but during the course of cell division one or more cell lines lost one of the 21st chromosomes; and (ii) an initial zygote may have two 21st chromosomes, but during the course of cell division one of the 21st chromosomes were duplicated. Somatic mosaicism most likely occurs through mechanisms distinct from those typically associated with genetic syndromes involving complete or mosaic aneuploidy. Somatic mosaicism has been identified in certain types of cancers and in neurons, for example. In certain instances, trisomy 12 has been identified in chronic lymphocytic leukemia (CLL) and trisomy 8 has been identified in acute myeloid leukemia (AML). Also, genetic syndromes in which an individual is predisposed to breakage of chromosomes (chromosome instability syndromes) are frequently associated with increased risk for various types of cancer, thus highlighting the role of somatic aneuploidy in carcinogenesis. Methods and kits described herein can identify presence or absence of non-mosaic and mosaic chromosome abnormalities.


Following is a non-limiting list of chromosome abnormalities that can be potentially identified by methods and kits described herein.














Chromosome
Abnormality
Disease Association







X
XO
Turner’s Syndrome


Y
XXY
Klinefelter syndrome


Y
XYY
Double Y syndrome


Y
XXX
Trisomy X syndrome


Y
XXXX
Four X syndrome


Y
Xp21 deletion
Duchenne’s/Becker syndrome, congenital adrenal




hypoplasia, chronic granulomatus disease


Y
Xp22 deletion
steroid sulfatase deficiency


Y
Xq26 deletion
X-linked lymphproliferative disease


 1
1p (somatic)
neuroblastoma



monosomy trisomy



 2
monosomy trisomy
growth retardation, developmental and mental



2q
delay, and minor physical abnormalities


 3
monosomy trisomy
Non-Hodgkin’s lymphoma



(somatic)



 4
monosomy trsiomy
Acute non lymphocytic leukaemia (ANLL)



(somatic)



 5
5p
Cri du chat; Lejeune syndrome


 5
5q
myelodysplastic syndrome



(somatic) monosomy




trisomy



 6
monosmy trisomy
clear-cell sarcoma



(somatic)



 7
7q11.23 deletion
William’s syndrome


 7
monosomy trisomy
monosomy 7 syndrome of childhood; somatic:




renal cortical adenomas; myelodysplastic syndrome


 8
8q24.1 deletion
Langer-Giedon syndrome


 8
monosomy trisomy
myelodysplastic syndrome; Warkany syndrome;




somatic: chronic myelogenous leukemia


 9
monosomy 9p
Alfi’s syndrome


 9
monosomy 9p partial
Rethore syndrome



trisomy



 9
trisomy
complete trisomy 9 syndrome;




mosaic trisomy 9 syndrome


10
Monosomy trisomy
ALL or ANLL



(somatic)



11
11p-
Aniridia; Wilms tumor


11
11q-
Jacobson Syndrome


11
monosomy (somatic)
myeloid lineages affected (ANLL, MDS)



trisomy



12
monosomy trisomy
CLL, Juvenile granulosa cell tumor (JGCT)



(somatic)



13
13q-
13q-syndrome; Orbeli syndrome


13
13q14 deletion
retinoblastoma


13
monosomy trisomy
Patau’s syndrome


14
monsomy trisomy
myeloid disorders (MDS, ANLL, atypical CML)



(somatic)



15
15q11-q13 deletion
Prader-Willi, Angelman’s syndrome



monosomy



15
trisomy (somatic)
myeloid and lymphoid lineages affected,




e.g., MDS, ANLL, ALL, CLL)


16
16q13.3 deletion
Rubenstein-Taybi



monosomy trisomy
papillary renal cell carcinomas (malignant)



(somatic)



17
17p-(somatic)
17p syndrome in myeloid malignancies


17
17q11.2 deletion
Smith-Magenis


17
17q13.3
Miller-Dieker


17
monosomy trisomy
renal cortical adenomas



(somatic)



17
17p11.2-12 trisomy
Charcot-Marie Tooth Syndrome type 1; HNPP


18
18p-
18p partial monosomy syndrome or




Grouchy Lamy Thieffry syndrome


18
18q-
Grouchy Lamy Salmon Landry Syndrome


18
monosomy trisomy
Edwards Syndrome


19
monosomy trisomy



20
20p-
trisomy 20p syndrome


20
20p11.2-12 deletion
Alagille


20
20q-
somatic: MDS, ANLL, polycythemia vera, chronic




neutrophilic leukemia


20
monosomy trisomy
papillary renal cell carcinomas (malignant)



(somatic)



21
monosomy trisomy
Down’s syndrome


22
22q11.2 deletion
DiGeorge’s syndrome, velocardiofacial syndrome,




conotruncal anomaly face syndrome, autosomal dominant




Opitz G/BBB syndrome, Caylor cardiofacial syndrome


22
monosomy trisomy
complete trisomy 22 syndrome









In certain embodiments, presence or absence of a fetal chromosome abnormality is identified (e.g., trisomy 21, trisomy 18 and/or trisomy 13). In some embodiments, presence or absence of a chromosome abnormality related to a cell proliferation condition or cancer is identified. Presence or absence of one or more of the chromosome abnormalities described in the table above may be identified in some embodiments.


Template Nucleic Acid


Template nucleic acid utilized in methods and kits described herein often is obtained and isolated from a subject. A subject can be any living or non-living source, including but not limited to a human, an animal, a plant, a bacterium, a fungus, a protist. Any human or animal can be selected, including but not limited, non-human, mammal, reptile, cattle, cat, dog, goat, swine, pig, monkey, ape, gorilla, bull, cow, bear, horse, sheep, poultry, mouse, rat, fish, dolphin, whale, and shark, or any animal or organism that may have a detectable chromosome abnormality.


Template nucleic acid may be isolated from any type of fluid or tissue from a subject, including, without limitation, umbilical cord blood, chorionic villi, amniotic fluid, cerbrospinal fluid, spinal fluid, lavage fluid (e.g., bronchoalveolar, gastric, peritoneal, ductal, ear, athroscopic), biopsy sample (e.g., from pre-implantation embryo), celocentesis sample, fetal nucleated cells or fetal cellular remnants, washings of female reproductive tract, urine, feces, sputum, saliva, nasal mucous, prostate fluid, lavage, semen, lymphatic fluid, bile, tears, sweat, breast milk, breast fluid, embryonic cells and fetal cells. In some embodiments, a biological sample may be blood, and sometimes plasma. As used herein, the term “blood” encompasses whole blood or any fractions of blood, such as serum and plasma as conventionally defined. Blood plasma refers to the fraction of whole blood resulting from centrifugation of blood treated with anticoagulants. Blood serum refers to the watery portion of fluid remaining after a blood sample has coagulated. Fluid or tissue samples often are collected in accordance with standard protocols hospitals or clinics generally follow. For blood, an appropriate amount of peripheral blood (e.g., between 3-40 milliliters) often is collected and can be stored according to standard procedures prior to further preparation in such embodiments. A fluid or tissue sample from which template nucleic acid is extracted may be acellular. In some embodiments, a fluid or tissue sample may contain cellular elements or cellular remnants. In some embodiments fetal cells or cancer cells may comprise the sample.


The sample may be heterogeneous, by which is meant that more than one type of nucleic acid species is present in the sample. For example, heterogeneous nucleic acid can include, but is not limited to, (i) fetally derived and maternally derived nucleic acid, (ii) cancer and non-cancer nucleic acid, and (iii) more generally, mutated and wild-type nucleic acid. A sample may be heterogeneous because more than one cell type is present, such as a fetal cell and a maternal cell or a cancer and non-cancer cell.


For prenatal applications of technology described herein, fluid or tissue sample may be collected from a female at a gestational age suitable for testing, or from a female who is being tested for possible pregnancy. Suitable gestational age may vary depending on the chromosome abnormality tested. In certain embodiments, a pregnant female subject sometimes is in the first trimester of pregnancy, at times in the second trimester of pregnancy, or sometimes in the third trimester of pregnancy. In certain embodiments, a fluid or tissue is collected from a pregnant woman at 1-4, 4-8, 8-12, 12-16, 16-20, 20-24, 24-28, 28-32, 32-36, 36-40, or 40-44 weeks of fetal gestation, and sometimes between 5-28 weeks of fetal gestation.


Template nucleic acid can be extracellular nucleic acid in certain embodiments. The term “extracellular template nucleic acid” as used herein refers to nucleic acid isolated from a source having substantially no cells (e.g., no detectable cells; may contain cellular elements or cellular remnants). Examples of acellular sources for extracellular nucleic acid are blood plasma, blood serum and urine. Without being limited by theory, extracellular nucleic acid may be a product of cell apoptosis and cell breakdown, which provides basis for extracellular nucleic acid often having a series of lengths across a large spectrum (e.g., a “ladder”).


Extracellular template nucleic acid can include different nucleic acid species, and therefore is referred to herein as “heterogeneous” in certain embodiments. For example, blood serum or plasma from a person having cancer can include nucleic acid from cancer cells and nucleic acid from non-cancer cells. In another example, blood serum or plasma from a pregnant female can include maternal nucleic acid and fetal nucleic acid. In some instances, fetal nucleic acid sometimes is about 5% to about 40% of the overall template nucleic acid (e.g., about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38 or 39% of the template nucleic acid is fetal nucleic acid). In some embodiments, the majority of fetal nucleic acid in template nucleic acid is of a length of about 500 base pairs or less (e.g., about 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% of fetal nucleic acid is of a length of about 500 base pairs or less).


The terms “nucleic acid” and “nucleic acid molecule” may be used interchangeably throughout the disclosure. The terms refer to nucleic acids of any composition from, such as deoxyribonucleic acid (DNA, e.g., complementary DNA (cDNA), genomic DNA (gDNA) and the like), ribonucleic acid (RNA, e.g., message RNA (mRNA), short inhibitory RNA (siRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), microRNA, RNA highly expressed by the fetus or placenta, and the like), and/or DNA or RNA analogs (e.g., containing base analogs, sugar analogs and/or a non-native backbone and the like), RNA/DNA hybrids and polyamide nucleic acids (PNAs), all of which can be in single- or double-stranded form, and unless otherwise limited, can encompass known analogs of natural nucleotides that can function in a similar manner as naturally occurring nucleotides. A nucleic acid can be in any form useful for conducting processes herein (e.g., linear, circular, supercoiled, single-stranded, double-stranded and the like). A nucleic acid may be, or may be from, a plasmid, phage, autonomously replicating sequence (ARS), centromere, artificial chromosome, chromosome, or other nucleic acid able to replicate or be replicated in vitro or in a host cell, a cell, a cell nucleus or cytoplasm of a cell in certain embodiments. A template nucleic acid in some embodiments can be from a single chromosome (e.g., a nucleic acid sample may be from one chromosome of a sample obtained from a diploid organism). The term also may include, as equivalents, derivatives, variants and analogs of RNA or DNA synthesized from nucleotide analogs, single-stranded (“sense” or “antisense”, “plus” strand or “minus” strand, “forward” reading frame or “reverse” reading frame) and double-stranded polynucleotides. Deoxyribonucleotides include deoxyadenosine, deoxycytidine, deoxyguanosine and deoxythymidine. For RNA, the base cytosine is replaced with uracil. A template nucleic acid may be prepared using a nucleic acid obtained from a subject as a template.


Template nucleic acid may be derived from one or more sources (e.g., cells, soil, etc.) by methods known to the person of ordinary skill in the art. Cell lysis procedures and reagents are commonly known in the art and may generally be performed by chemical, physical, or electrolytic lysis methods. For example, chemical methods generally employ lysing agents to disrupt the cells and extract the nucleic acids from the cells, followed by treatment with chaotropic salts. Physical methods such as freeze/thaw followed by grinding, the use of cell presses and the like are also useful. High salt lysis procedures are also commonly used. For example, an alkaline lysis procedure may be utilized. The latter procedure traditionally incorporates the use of phenol-chloroform solutions, and an alternative phenol-chloroform-free procedure involving three solutions can be utilized. In the latter procedures, solution 1 can contain 15 mM Tris, pH 8.0; 10 mM EDTA and 100 ug/ml Rnase A; solution 2 can contain 0.2N NaOH and 1% SDS; and solution 3 can contain 3M KOAc, pH 5.5. These procedures can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 6.3.1-6.3.6 (1989), incorporated herein in its entirety.


Template nucleic acid also may be isolated at a different time point as compared to another template nucleic acid, where each of the samples are from the same or a different source. A template nucleic acid may be from a nucleic acid library, such as a cDNA or RNA library, for example. A template nucleic acid may be a result of nucleic acid purification or isolation and/or amplification of nucleic acid molecules from the sample. Template nucleic acid provided for processes described herein may contain nucleic acid from one sample or from two or more samples (e.g., from 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or more, 13 or more, 14 or more, 15 or more, 16 or more, 17 or more, 18 or more, 19 or more, or 20 or more samples).


Template nucleic acid may be provided for conducting methods described herein without processing of the sample(s) containing the nucleic acid in certain embodiments. In some embodiments, template nucleic acid is provided for conducting methods described herein after processing of the sample(s) containing the nucleic acid. For example, a template nucleic acid may be extracted, isolated, purified or amplified from the sample(s). The term “isolated” as used herein refers to nucleic acid removed from its original environment (e.g., the natural environment if it is naturally occurring, or a host cell if expressed exogenously), and thus is altered by human intervention (e.g., “by the hand of man”) from its original environment. An isolated nucleic acid generally is provided with fewer non-nucleic acid components (e.g., protein, lipid) than the amount of components present in a source sample. A composition comprising isolated template nucleic acid can be substantially isolated (e.g., about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater than 99% free of non-nucleic acid components). The term “purified” as used herein refers to template nucleic acid provided that contains fewer nucleic acid species than in the sample source from which the template nucleic acid is derived. A composition comprising template nucleic acid may be substantially purified (e.g., about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater than 99% free of other nucleic acid species). The term “amplified” as used herein refers to subjecting nucleic acid of a sample to a process that linearly or exponentially generates amplicon nucleic acids having the same or substantially the same nucleotide sequence as the nucleotide sequence of the nucleic acid in the sample, or portion thereof.


Template nucleic acid also may be processed by subjecting nucleic acid to a method that generates nucleic acid fragments, in certain embodiments, before providing template nucleic acid for a process described herein. In some embodiments, template nucleic acid subjected to fragmentation or cleavage may have a nominal, average or mean length of about 5 to about 10,000 base pairs, about 100 to about 1,000 base pairs, about 100 to about 500 base pairs, or about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000 or 9000 base pairs. Fragments can be generated by any suitable method known in the art, and the average, mean or nominal length of nucleic acid fragments can be controlled by selecting an appropriate fragment-generating procedure by the person of ordinary skill. In certain embodiments, template nucleic acid of a relatively shorter length can be utilized to analyze sequences that contain little sequence variation and/or contain relatively large amounts of known nucleotide sequence information. In some embodiments, template nucleic acid of a relatively longer length can be utilized to analyze sequences that contain greater sequence variation and/or contain relatively small amounts of unknown nucleotide sequence information.


Template nucleic acid fragments may contain overlapping nucleotide sequences, and such overlapping sequences can facilitate construction of a nucleotide sequence of the previously non-fragmented template nucleic acid, or a portion thereof. For example, one fragment may have subsequences x and y and another fragment may have subsequences y and z, where x, y and z are nucleotide sequences that can be 5 nucleotides in length or greater. Overlap sequence y can be utilized to facilitate construction of the x-y-z nucleotide sequence in nucleic acid from a sample in certain embodiments. Template nucleic acid may be partially fragmented (e.g., from an incomplete or terminated specific cleavage reaction) or fully fragmented in certain embodiments.


Template nucleic acid can be fragmented by various methods known to the person of ordinary skill, which include without limitation, physical, chemical and enzymatic processes. Examples of such processes are described in U.S. Patent Application Publication No. 20050112590 (published on May 26, 2005, entitled “Fragmentation-based methods and systems for sequence variation detection and discovery,” naming Van Den Boom et al.). Certain processes can be selected by the person of ordinary skill to generate non-specifically cleaved fragments or specifically cleaved fragments. Examples of processes that can generate non-specifically cleaved fragment template nucleic acid include, without limitation, contacting template nucleic acid with apparatus that expose nucleic acid to shearing force (e.g., passing nucleic acid through a syringe needle; use of a French press); exposing template nucleic acid to irradiation (e.g., gamma, x-ray, UV irradiation; fragment sizes can be controlled by irradiation intensity); boiling nucleic acid in water (e.g., yields about 500 base pair fragments) and exposing nucleic acid to an acid and base hydrolysis process.


Template nucleic acid may be specifically cleaved by contacting the nucleic acid with one or more specific cleavage agents. The term “specific cleavage agent” as used herein refers to an agent, sometimes a chemical or an enzyme that can cleave a nucleic acid at one or more specific sites. Specific cleavage agents often cleave specifically according to a particular nucleotide sequence at a particular site.


Examples of enzymatic specific cleavage agents include without limitation endonucleases (e.g., DNase (e.g., DNase I, II); RNase (e.g., RNase E, F, H, P); CleavaseT™ enzyme; Taq DNA polymerase; E. coli DNA polymerase I and eukaryotic structure-specific endonucleases; murine FEN-1 endonucleases; type I, II or III restriction endonucleases such as Acc I, Afl III, Alu I, AIw44 I, Apa I, Asn I, Ava I, Ava II, BamH I, Ban II, Bcl I, Bgl I. Bgl II, Bln I, Bsm I, BssH II, BstE II, Cfo I, Cla I, Dde I, Dpn I, Dra I, EcIX I, EcoR I, EcoR I, EcoR II, EcoR V, Hae II, Hae II, Hind II, Hind III, Hpa I, Hpa II, Kpn I, Ksp I, Mlu I, MluN I, Msp I, Nci I, Nco I, Nde I, Nde II, Nhe I, Not I, Nru I, Nsi I, Pst I, Pvu I, Pvu II, Rsa I, Sac I, Sal I, Sau3A I, Sca I, ScrF I, Sfi I, Sma I, Spe I, Sph I, Ssp I, Stu I, Sty I, Swa I, Taq I, Xba I, Xho I.); glycosylases (e.g., uracil-DNA glycolsylase (UDG), 3-methyladenine DNA glycosylase, 3-methyladenine DNA glycosylase II, pyrimidine hydrate-DNA glycosylase, FaPy-DNA glycosylase, thymine mismatch-DNA glycosylase, hypoxanthine-DNA glycosylase, 5-Hydroxymethyluracil DNA glycosylase (HmUDG), 5-Hydroxymethylcytosine DNA glycosylase, or 1,N6-etheno-adenine DNA glycosylase); exonucleases (e.g., exonuclease III); ribozymes, and DNAzymes. Template nucleic acid may be treated with a chemical agent, and the modified nucleic acid may be cleaved. In non-limiting examples, template nucleic acid may be treated with (i) alkylating agents such as methylnitrosourea that generate several alkylated bases, including N3-methyladenine and N3-methylguanine, which are recognized and cleaved by alkyl purine DNA-glycosylase; (ii) sodium bisulfite, which causes deamination of cytosine residues in DNA to form uracil residues that can be cleaved by uracil N-glycosylase; and (iii) a chemical agent that converts guanine to its oxidized form, 8-hydroxyguanine, which can be cleaved by formamidopyrimidine DNA N-glycosylase. Examples of chemical cleavage processes include without limitation alkylation, (e.g., alkylation of phosphorothioate-modified nucleic acid); cleavage of acid lability of P3′-N5′-phosphoroamidate-containing nucleic acid; and osmium tetroxide and piperidine treatment of nucleic acid.


As used herein, “fragmentation” or “cleavage” refers to a procedure or conditions in which a nucleic acid molecule, such as a nucleic acid template gene molecule or amplified product thereof, may be severed into two or more smaller nucleic acid molecules. Such fragmentation or cleavage can be sequence specific, base specific, or nonspecific, and can be accomplished by any of a variety of methods, reagents or conditions, including, for example, chemical, enzymatic, physical fragmentation.


As used herein, “fragments”, “cleavage products”, “cleaved products” or grammatical variants thereof, refers to nucleic acid molecules resultant from a fragmentation or cleavage of a nucleic acid template gene molecule or amplified product thereof. While such fragments or cleaved products can refer to all nucleic acid molecules resultant from a cleavage reaction, typically such fragments or cleaved products refer only to nucleic acid molecules resultant from a fragmentation or cleavage of a nucleic acid template gene molecule or the portion of an amplified product thereof containing the corresponding nucleotide sequence of a nucleic acid template gene molecule. For example, it is within the scope of the present methods, compounds and compositions, that an amplified product can contain one or more nucleotides more than the amplified nucleotide region of the nucleic acid template gene sequence (e.g., a primer can contain “extra” nucleotides such as a transcriptional initiation sequence, in addition to nucleotides complementary to a nucleic acid template gene molecule, resulting in an amplified product containing “extra” nucleotides or nucleotides not corresponding to the amplified nucleotide region of the nucleic acid template gene molecule). In such an example, the fragments or cleaved products corresponding to the nucleotides not arising from the nucleic acid template molecule will typically not provide any information regarding methylation in the nucleic acid template molecule. One skilled in the art can therefore understand that the fragments of an amplified product used to provide methylation information in the methods provided herein may be fragments containing one or more nucleotides arising from the nucleic acid template molecule, and not fragments containing nucleotides arising solely from a sequence other than that in the nucleic acid target molecule. Accordingly, one skilled in the art will understand the fragments arising from methods, compounds and compositions provided herein to include fragments arising from portions of amplified nucleic acid molecules containing, at least in part, nucleotide sequence information from or based on the representative nucleic acid template molecule.


As used herein, the term “complementary cleavage reactions” refers to cleavage reactions that are carried out on the same template nucleic acid using different cleavage reagents or by altering the cleavage specificity of the same cleavage reagent such that alternate cleavage patterns of the same target or reference nucleic acid or protein are generated. In certain embodiments, template nucleic acid may be treated with one or more specific cleavage agents (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more specific cleavage agents) in one or more reaction vessels (e.g., template nucleic acid is treated with each specific cleavage agent in a separate vessel).


Template nucleic acid also may be exposed to a process that modifies certain nucleotides in the nucleic acid before providing template nucleic acid for a method described herein. A process that selectively modifies nucleic acid based upon the methylation state of nucleotides therein can be applied to template nucleic acid, for example. The term “methylation state” as used herein refers to whether a particular nucleotide in a polynucleotide sequence is methylated or not methylated. Methods for modifying a template nucleic acid molecule in a manner that reflects the methylation pattern of the template nucleic acid molecule are known in the art, as exemplified in U.S. Pat. No. 5,786,146 and U.S. patent publications 20030180779 and 20030082600. For example, non-methylated cytosine nucleotides in a nucleic acid can be converted to uracil by bisulfite treatment, which does not modify methylated cytosine. Non-limiting examples of agents that can modify a nucleotide sequence of a nucleic acid include methylmethane sulfonate, ethylmethane sulfonate, diethylsulfate, nitrosoguanidine (N-methyl-N′-nitro-N-nitrosoguanidine), nitrous acid, di-(2-chloroethyl)sulfide, di-(2-chloroethyl)methylamine, 2-aminopurine, t-bromouracil, hydroxylamine, sodium bisulfite, hydrazine, formic acid, sodium nitrite, and 5-methylcytosine DNA glycosylase. In addition, conditions such as high temperature, ultraviolet radiation, x-radiation, can induce changes in the sequence of a nucleic acid molecule. Template nucleic acid may be provided in any form useful for conducting a sequence analysis or manufacture process described herein, such as solid or liquid form, for example. In certain embodiments, template nucleic acid may be provided in a liquid form optionally comprising one or more other components, including without limitation one or more buffers or salts selected by the person of ordinary skill.


Determination of Fetal Nucleic Acid Content and Fetal Nucleic Acid Enrichment


The amount of fetal nucleic acid (e.g., concentration) in template nucleic acid is determined in some embodiments. In certain embodiments, the amount of fetal nucleic acid is determined according to markers specific to a male fetus (e.g., Y-chromosome STR markers (e.g., DYS 19, DYS 385, DYS 392 markers); RhD marker in RhD-negative females), or according to one or more markers specific to fetal nucleic acid and not maternal nucleic acid (e.g., differential methylation between mother and fetus, or fetal RNA markers in maternal blood plasma; Lo, 2005, Journal of Histochemistry and Cytochemistry 53 (3): 293-296). Methylation-based fetal quantifier compositions and processes are described in U.S. application Ser. No. 12/561,241, filed Sep. 16, 2009, which is hereby incorporated by reference. The amount of fetal nucleic acid in extracellular template nucleic acid can be quantified and used in conjunction with the aneuploidy detection methods provided herein. Thus, in certain embodiments, methods of the technology comprise the additional step of determining the amount of fetal nucleic acid. The amount of fetal nucleic acid can be determined in a nucleic acid sample from a subject before or after processing to prepare sample template nucleic acid. In certain embodiments, the amount of fetal nucleic acid is determined in a sample after sample template nucleic acid is processed and prepared, which amount is utilized for further assessment. The determination step can be performed before, during or after aneuploidy detection methods described herein. For example, to achieve an aneuploidy detection method with a given sensitivity or specificity, a fetal nucleic acid quantification method may be implemented prior to, during or after aneuploidy detection to identify those samples with greater than about 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%,15%,16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25% or more fetal nucleic acid. In some embodiments, samples determined as having a certain threshold amount of fetal nucleic acid (e.g., about 15% or more fetal nucleic acid) are further analyzed for the presence or absence of aneuploidy. In certain embodiments, determinations of the presence or absence of aneuploidy are selected (e.g., selected and communicated to a patient) only for samples having a certain threshold amount of fetal nucleic acid (e.g., about 15% or more fetal nucleic acid).


In some embodiments, extracellular nucleic acid is enriched or relatively enriched for fetal nucleic acid. Methods for enriching a sample for a particular species of nucleic acid are described in U.S. Pat. No. 6,927,028, filed August 31, 2001, PCT Patent Application Number PCT/US07/69991, filed May 30, 2007, PCT Patent Application Number PCT/US2007/071232, filed Jun. 15, 2007, U.S. Provisional Application Nos. 60/968,876 and 60/968,878, and PCT Patent Application Number PCT/EP05/012707, filed Nov. 28, 2005. In certain embodiments, maternal nucleic acid is selectively removed (partially, substantially, almost completely or completely) from the sample. In certain embodiments, fetal nucleic acid is differentiated and separated from maternal nucleic acid based on methylation differences. Enriching for a particular low copy number species nucleic acid may also improve quantitative sensitivity. For example, the most sensitive peak ratio detection area is within 10% from center point. See FIG. 1.


Nucleotide Sequence Species in a Set


In methods described herein, particular nucleotide sequence species located in a particular target chromosome and in one or more reference chromosomes are analyzed. The term “target chromosome” as used herein is utilized in two contexts, as the term refers to (i) a particular chromosome (e.g., chromosome 21, 18 or 13) and sometimes (ii) a chromosome from a particular target source (e.g., chromosome from a fetus, chromosome from a cancer cell). When the term refers to a particular chromosome, the term “target chromosome” is utilized (e.g., “target chromosome 21”) and when the term refers to a particular target chromosome from a particular source, the source of the target chromosome is included (e.g., “fetal target chromosome,” “cancer cell target chromosome”).


A “set” includes nucleotide sequence species located in a target chromosome and one or more reference chromosomes. Nucleotide sequence species in a set are located in the target chromosome and in the one or more reference chromosomes. The term “reference chromosome” refers to a chromosome that includes a nucleotide sequence species as a subsequence, and sometimes is a chromosome not associated with a particular chromosome abnormality being screened. For example, in a prenatal screening method for Down syndrome (i.e., trisomy 21), chromosome 21 is the target chromosome and another chromosome (e.g., chromosome 5) is the reference chromosome. In certain embodiments, a reference chromosome can be associated with a chromosome abnormality. For example, chromosome 21 can be the target chromosome and chromosome 18 can be the reference chromosome when screening for Down syndrome, and chromosome 18 can the target chromosome and chromosome 21 can be the reference chromosome when screening for Edward syndrome.


The terms “nucleotide sequence species in a set,” a “set of nucleotide sequence species” and grammatical variants thereof, as used herein, refer to nucleotide sequence species in a target chromosome and a reference chromosome. Nucleotide sequence species in a set generally share a significant level of sequence identity. One nucleotide sequence species in a set is located in one chromosome and another nucleotide sequence species in a set is located in another chromosome. A nucleotide sequence species in a set located in a target chromosome can be referred to as a “target nucleotide sequence species” and a nucleotide sequence species in a set located in a reference chromosome can be referred to as a “reference nucleotide sequence species.”


Nucleotide sequence species in a set share about 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93% or 94%, and all intermediate values thereof, identity to one another in some embodiments. Nucleotide sequence species in a set are “substantially identical” to one another to one another in some embodiments, which refers to nucleotide sequence species that share 95%, 96%, 97%, 98% or 99% identity, or greater than 99% identity, with one another, in certain embodiments. For highly identical nucleotide sequence species in a set, the nucleotide sequence species may be identical to one another with the exception of a one base pair mismatch, in certain embodiments. For example, nucleotide sequence species in a set may be identical to one another with the exception of a one base pair mismatch for a nucleotide sequence species length of about 100 base pairs (e.g., about 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118 or 120 base pair sequence length). Thus, nucleotide sequence species in a set may be “paralog sequences” or “paralogous sequences,” which as used herein refer to nucleotide sequence species that include only one or two base pair mismatches. Paralogous sequences sometimes have a common evolutionary origin and sometimes are duplicated over time in a genome of interest. Paralogous sequences sometimes conserve sequence and gene structure (e.g., number and relative position of introns and exons and often transcript length). In some embodiments, nucleotide sequence species in a set may differ by two or more base pair mismatches (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 base pair mismatches), where the mismatched base pairs are sequential or non-sequential (e.g., base pair mismatches may be sequential for about 2, 3, 4, 5, 6, 7, 8, 9 or 10 bases).


Alignment techniques and sequence identity assessment methodology are known. Such analyses can be performed by visual inspection or by using a mathematical algorithm. For example, the algorithm of Meyers & Miller, CABIOS 4: 11-17 (1989), which has been incorporated into the ALIGN program (version 2.0) can be utilized. Utilizing the former algorithm, a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4 may be used for determining sequence identity.


Base pair mismatches between nucleotide sequence species in a set are not significantly polymorphic in certain embodiments, and the nucleotides that give rise to the mismatches are present at a rate of over 95% of subjects and chromosomes in a given population (e.g., the same nucleotides that give rise to the mismatches are present in about 98%, 99% or over 99% of subjects and chromosomes in a population) in some embodiments. Each nucleotide sequence species in a set, in its entirety, often is present in a significant portion of a population without modification (e.g., present without modification in about 97%, 98%, 99%, or over 99% of subjects and chromosomes in a population).


Nucleotide sequence species in a set may be of any convenient length. For example, a nucleotide sequence species in a set can be about 5 to about 10,000 base pairs in length, about 100 to about 1,000 base pairs in length, about 100 to about 500 base pairs in length, or about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000 or 9000 base pairs in length. In some embodiments, a nucleotide sequence species in a set is about 100 base pairs in length (e.g., about 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118 or 120 base pairs in length). In certain embodiments, nucleotide sequence species in a set are of identical length, and sometimes the nucleotide sequence species in a set are of a different length (e.g., one nucleotide sequence species is longer by about 1 to about 100 nucleotides (e.g., about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides longer).


Nucleotide sequence species in a set are non-exonic in some embodiments, and sometimes one or more of the nucleotide sequence species in a set are intronic, partially intronic, partially exonic or partially non-exonic. In certain embodiments, a nucleotide sequence in a set comprises an exonic nucleotide sequence.


In some embodiments, one or more nucleotide sequence species are selected from those shown in tables herein (e.g., Table 4A, Table 4B and Table 14).


Each set can include two or more nucleotide sequence species (e.g., 2, 3, 4 or 5 nucleotide sequence species). In some embodiments, the number of target and reference chromosomes equals the number of nucleotide sequence species in a set, and sometimes each of the nucleotide sequence species in a set are present only in one chromosome. In certain embodiments, a nucleotide sequence species is located in more than one chromosome (e.g., 2 or 3 chromosomes).


Methods described herein can be conducted using one set of nucleotide sequence species, and sometimes two or three sets of nucleotide sequence species are utilized. For multiplex methods described herein, about 4 to about 100 sets of nucleotide sequence species can be utilized (e.g., about 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 sets).


One or more of the sets consist of two nucleotide sequence species in some embodiments, and sometimes one or more sets consist of three nucleotide sequence species. Some embodiments are directed to mixtures of sets in which some sets consist of two nucleotide sequence species and other sets consist of three nucleotide sequence species can be used. In some embodiments, about 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of sets consist of two nucleotide sequence species, and in certain embodiments about 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of sets consist of three nucleotide sequences. In a set, nucleotide sequence species sometimes are in: chromosome 21 and chromosome 18, or are in chromosome 21 and chromosome 13, or are in chromosome 13 and chromosome 18, or are in chromosome 21, and chromosome 18 and chromosome 13, or are in chromosome X, or are in chromosome Y, or are in chromosome X and Y, or are in chromosome 21, chromosome 18 and chromosome 13 and chromosome X or Y, and in about 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of sets, the nucleotide sequence species sometimes are in such designated chromosomes. In certain embodiments, the set utilized, or every set when more than one set is utilized, consists of nucleotide sequence species located in chromosome 21, chromosome 18 and chromosome 13.


In some embodiments, nucleotide sequence species are amplified and base pair mismatches are detected in the resulting amplified nucleic acid species. In other embodiments, the nucleotide sequence species are not amplified prior to detection (e.g., if the detection system is sufficiently sensitive or a sufficient amount of chromosome nucleic acid is available or generated), and nucleotide sequence species are detected directly in chromosome nucleic acid or fragments thereof.


Identification of Nucleotide Sequence Species


In one aspect, the technology in part comprises identifying nucleotide sequence species that amplify in a stable, reproducible manner relative to each other and are thereby useful in conjunction with the methods of the technology. The identification of nucleotide sequence species may be done computationally by identifying sequences which comprises at least about 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identity over an amplifiable sequence region. In another embodiment, the primer hybridization sequences in the nucleotide sequence species are substantially identical. Often, the nucleotide sequence species comprise a substantially identical GC content (for example, the sequences sometimes have less than about 5% and often, less than about 1% difference in GC content).


Sequence search programs are well known in the art, and include, but are not limited to, BLAST (see, Altschul et al., 1990, J. Mol. Biol. 215: 403-410), BLAT (Kent, W. J. 2002. BLAT—The BLAST-Like Alignment Tool. Genome Research 4: 656-664), FASTA, and SSAHA (see, e.g., Pearson, 1988, Proc. Natl. Acad. Sci. USA 85(5): 2444-2448; Lung et al., 1991, J. Mol. Biol. 221(4): 1367-1378). Further, methods of determining the significance of sequence alignments are known in the art and are described in Needleman and Wunsch, 1970, J. of Mol. Biol. 48: 444; Waterman et al., 1980, J. Mol. Biol. 147: 195-197; Karlin et al., 1990, Proc. Natl. Acad. Sci. USA 87: 2264-2268; and Dembo et al., 1994, Ann. Prob. 22: 2022-2039. While in one aspect, a single query sequence is searched against the database, in another aspect, a plurality of sequences are searched against the database (e.g., using the MEGABLAST program, accessible through NCBI).


A number of human genomic sequence databases exist, including, but not limited to, the NCBI GenBank database and the Genetic Information Research Institute (GIRI) database. Expressed sequence databases include, but are not limited to, the NCBI EST database, the random cDNA sequence database from Human Genome Sciences, and the EMEST8 database (EMBL, Heidelberg, Germany).


While computational methods of identifying suitable nucleotide sequence sets often are utilized, any method of detecting sequences which are capable of significant base pairing can be used to identify or validate nucleotide sequences of the technology. For example, nucleotide sequence sets can be validated using a combination of hybridization-based methods and computational methods to identify sequences which hybridize to multiple chromosomes. The technology is not limited to nucleotide sequences that appear exclusively on target and reference chromosomes. For example, the amplification primers may co-amplify nucleotide sequences from 2, 3, 4, 5, 6 or more chromosomes as long as the amplified nucleic acid species are produced at a reproducible rate and the majority (for example, greater than 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99%) of the target species comes from the target chromosome, thereby allowing for the accurate detection of target chromosomal abnormalities. As used herein, the terms “target” and “reference” may have a degree of ambiguity since the “target” may be any chromosome that is susceptible to chromosomal abnormalities. For example, a set that consists of nucleotide sequence species from chromosomes 13, 18 and 21 has the power to simultaneously detect a chromosomal abnormality originating from any of the three chromosomes. In the case of a Down Syndrome (trisomy 21) sample, chromosome 21 is the “target chromosome” and chromosomes 13 and 18 are the “reference chromosomes”.


Tables 3 and 4 provide examples of non-limiting candidate nucleotide sequence sets, where at least one species of the set is located on chromosome 21, 18 or 13.


Amplification


In some embodiments, nucleotide sequence species are amplified using a suitable amplification process. It may be desirable to amplify nucleotide sequence species particularly if one or more of the nucleotide sequence species exist at low copy number. In some embodiments amplification of sequences or regions of interest may aid in detection of gene dosage imbalances, as might be seen in genetic disorders involving chromosomal aneuploidy, for example. An amplification product (amplicon) of a particular nucleotide sequence species is referred to herein as an “amplified nucleic acid species.”


Nucleic acid amplification often involves enzymatic synthesis of nucleic acid amplicons (copies), which contain a sequence complementary to a nucleotide sequence species being amplified. Amplifying nucleotide sequence species and detecting the amplicons synthesized, can improve the sensitivity of an assay, since fewer target sequences are needed at the beginning of the assay, and can improve detection of nucleotide sequence species.


Any suitable amplification technique can be utilized. Amplification of polynucleotides include, but are not limited to, polymerase chain reaction (PCR); ligation amplification (or ligase chain reaction (LCR)); amplification methods based on the use of Q-beta replicase or template-dependent polymerase (see US Patent Publication Number US20050287592); helicase-dependant isothermal amplification (Vincent et al., “Helicase-dependent isothermal DNA amplification”. EMBO reports 5 (8): 795-800 (2004)); strand displacement amplification (SDA); thermophilic SDA nucleic acid sequence based amplification (3SR or NASBA) and transcription-associated amplification (TAA). Non-limiting examples of PCR amplification methods include standard PCR, AFLP-PCR, Allele-specific PCR, Alu-PCR, Asymmetric PCR, Colony PCR, Hot start PCR, Inverse PCR (IPCR), In situ PCR (ISH), Intersequence-specific PCR (ISSR-PCR), Long PCR, Multiplex PCR, Nested PCR, Quantitative PCR, Reverse Transcriptase PCR (RT-PCR), Real Time PCR, Single cell PCR, Solid phase PCR, combinations thereof, and the like. Reagents and hardware for conducting PCR are commercially available.


The terms “amplify”, “amplification”, “amplification reaction”, or “amplifying” refers to any in vitro processes for multiplying the copies of a target sequence of nucleic acid. Amplification sometimes refers to an “exponential” increase in target nucleic acid. However, “amplifying” as used herein can also refer to linear increases in the numbers of a select target sequence of nucleic acid, but is different than a one-time, single primer extension step. In some embodiments a limited amplification reaction, also known as pre-amplification, can be performed. Pre-amplification is a method in which a limited amount of amplification occurs due to a small number of cycles, for example 10 cycles, being performed. Pre-amplification can allow some amplification, but stops amplification prior to the exponential phase, and typically produces about 500 copies of the desired nucleotide sequence(s). Use of pre-amplification may also limit inaccuracies associated with depleted reactants in standard PCR reactions, and also may reduce amplification biases due to nucleotide sequence or species abundance of the target. In some embodiments a one-time primer extension may be used may be performed as a prelude to linear or exponential amplification.


A generalized description of an amplification process is presented herein. Primers and target nucleic acid are contacted, and complementary sequences anneal to one another, for example. Primers can anneal to a target nucleic acid, at or near (e.g., adjacent to, abutting, and the like) a sequence of interest. A reaction mixture, containing components necessary for enzymatic functionality, is added to the primer—target nucleic acid hybrid, and amplification can occur under suitable conditions. Components of an amplification reaction may include, but are not limited to, e.g., primers (e.g., individual primers, primer pairs, primer sets and the like) a polynucleotide template (e.g., target nucleic acid), polymerase, nucleotides, dNTPs and the like. In some embodiments, non-naturally occurring nucleotides or nucleotide analogs, such as analogs containing a detectable label (e.g., fluorescent or colorimetric label), may be used for example. Polymerases can be selected by a person of ordinary skill and include polymerases for thermocycle amplification (e.g., Taq DNA Polymerase; Q-Bio™ Taq DNA Polymerase (recombinant truncated form of Taq DNA Polymerase lacking 5′-3′exo activity); SurePrime™ Polymerase (chemically modified Taq DNA polymerase for “hot start” PCR); Arrow™ Taq DNA Polymerase (high sensitivity and long template amplification)) and polymerases for thermostable amplification (e.g., RNA polymerase for transcription-mediated amplification (TMA). Other enzyme components can be added, such as reverse transcriptase for transcription mediated amplification (TMA) reactions, for example.


The terms “near” or “adjacent to” when referring to a nucleotide sequence of interest refers to a distance or region between the end of the primer and the nucleotide or nucleotides of interest. As used herein adjacent is in the range of about 5 nucleotides to about 500 nucleotides (e.g., about 5 nucleotides away from nucleotide of interest, about 10, about 20, about 30, about 40, about 50, about 60, about 70, about 80, about 90, about 100, about 150, about 200, about 250, about 300, abut 350, about 400, about 450 or about 500 nucleotides from a nucleotide of interest). In some embodiments the primers in a set hybridize within about 10 to 30 nucleotides from a nucleic acid sequence of interest and produce amplified products.


Each amplified nucleic acid species independently is about 10 to about 500 base pairs in length in some embodiments. In certain embodiments, an amplified nucleic acid species is about 20 to about 250 base pairs in length, sometimes is about 50 to about 150 base pairs in length and sometimes is about 100 base pairs in length. Thus, in some embodiments, the length of each of the amplified nucleic acid species products independently is about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 125, 130, 135, 140, 145, 150, 175, 200, 250, 300, 350, 400, 450, or 500 base pairs (bp) in length.


An amplification product may include naturally occurring nucleotides, non-naturally occurring nucleotides, nucleotide analogs and the like and combinations of the foregoing. An amplification product often has a nucleotide sequence that is identical to or substantially identical to a sample nucleic acid nucleotide sequence or complement thereof. A “substantially identical” nucleotide sequence in an amplification product will generally have a high degree of sequence identity to the nucleotide sequence species being amplified or complement thereof (e.g., about 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater than 99% sequence identity), and variations sometimes are a result of infidelity of the polymerase used for extension and/or amplification, or additional nucleotide sequence(s) added to the primers used for amplification.


PCR conditions can be dependent upon primer sequences, target abundance, and the desired amount of amplification, and therefore, one of skill in the art may choose from a number of PCR protocols available (see, e.g., U.S. Pat. Nos. 4,683,195 and 4,683,202; and PCR Protocols: A Guide to Methods and Applications, Innis et al., eds, 1990. Digital PCR is also known to those of skill in the art; see, e.g., US Patent Application Publication Number 20070202525, filed Feb. 2, 2007, which is hereby incorporated by reference). PCR often is carried out as an automated process with a thermostable enzyme. In this process, the temperature of the reaction mixture is cycled through a denaturing region, a primer-annealing region, and an extension reaction region automatically. Machines specifically adapted for this purpose are commercially available. A non-limiting example of a PCR protocol that may be suitable for embodiments described herein is, treating the sample at 95° C. for 5 minutes; repeating forty-five cycles of 95° C. for 1 minute, 59° C. for 1 minute, 10 seconds, and 72° C. for 1 minute 30 seconds; and then treating the sample at 72° C. for 5 minutes. Multiple cycles frequently are performed using a commercially available thermal cycler. Suitable isothermal amplification processes known and selected by the person of ordinary skill in the art also may be applied, in certain embodiments.


In some embodiments, multiplex amplification processes may be used to amplify target nucleic acids, such that multiple amplicons are simultaneously amplified in a single, homogenous reaction. As used herein “multiplex amplification” refers to a variant of PCR where simultaneous amplification of many targets of interest in one reaction vessel may be accomplished by using more than one pair of primers (e.g., more than one primer set). Multiplex amplification may be useful for analysis of deletions, mutations, and polymorphisms, or quantitative assays, in some embodiments. In certain embodiments multiplex amplification may be used for detecting paralog sequence imbalance, genotyping applications where simultaneous analysis of multiple markers is required, detection of pathogens or genetically modified organisms, or for microsatellite analyses.


In some embodiments multiplex amplification may be combined with another amplification (e.g., PCR) method (e.g., nested PCR or hot start PCR, for example) to increase amplification specificity and reproducibility. In other embodiments multiplex amplification may be done in replicates, for example, to reduce the variance introduced by said amplification.


In some embodiments amplification nucleic acid species of the primer sets are generated in one reaction vessel. In some embodiments amplification of paralogous sequences may be performed in a single reaction vessel. In certain embodiments, paralogous sequences (on the same or different chromosomes) may be amplified by a single primer pair or set. In some embodiments nucleotide sequence species may be amplified by a single primer pair or set. In some embodiments nucleotide sequence species in a set may be amplified with two or more primer pairs.


In certain embodiments, nucleic acid amplification can generate additional nucleic acid species of different or substantially similar nucleic acid sequence. In certain embodiments described herein, contaminating or additional nucleic acid species, which may contain sequences substantially complementary to, or may be substantially identical to, the sequence of interest, can be useful for sequence quantification, with the proviso that the level of contaminating or additional sequences remains constant and therefore can be a reliable marker whose level can be substantially reproduced. Additional considerations that may affect sequence amplification reproducibility are; PCR conditions (number of cycles, volume of reactions, melting temperature difference between primers pairs, and the like), concentration of target nucleic acid in sample (e.g. fetal nucleic acid in maternal nucleic acid background, viral nucleic acid in host background), the number of chromosomes on which the nucleotide species of interest resides (e.g., paralogous sequence), variations in quality of prepared sample, and the like. The terms “substantially reproduced” or “substantially reproducible” as used herein refer to a result (e.g., quantifiable amount of nucleic acid) that under substantially similar conditions would occur in substantially the same way about 75% of the time or greater, about 80%, about 85%, about 90%, about 95%, or about 99% of the time or greater.


In some embodiments where a target nucleic acid is RNA, prior to the amplification step, a DNA copy (cDNA) of the RNA transcript of interest may be synthesized. A cDNA can be sytnesized by reverse transcription, which can be carried out as a separate step, or in a homogeneous reverse transcription-polymerase chain reaction (RT-PCR), a modification of the polymerase chain reaction for amplifying RNA. Methods suitable for PCR amplification of ribonucleic acids are described by Romero and Rotbart in Diagnostic Molecular Biology: Principles and Applications pp. 401-406; Persing et al., eds., Mayo Foundation, Rochester, Minn., 1993; Egger et al., J. Clin. Microbiol. 33:1442-1447, 1995; and U.S. Pat. No. 5,075,212. Branched-DNA technology may be used to amplify the signal of RNA markers in maternal blood. For a review of branched-DNA (bDNA) signal amplification for direct quantification of nucleic acid sequences in clinical samples, see Nolte, Adv. Clin. Chem. 33:201-235, 1998.


Amplification also can be accomplished using digital PCR, in certain embodiments (e.g., Kalinina and colleagues (Kalinina et al., “Nanoliter scale PCR with TaqMan detection.” Nucleic Acids Research. 25; 1999-2004, (1997); Vogelstein and Kinzler (Digital PCR. Proc Natl Acad Sci USA. 96; 9236-41, (1999); PCT Patent Publication No. WO05023091A2; US Patent Publication No. US 20070202525). Digital PCR takes advantage of nucleic acid (DNA, cDNA or RNA) amplification on a single molecule level, and offers a highly sensitive method for quantifying low copy number nucleic acid. Systems for digital amplification and analysis of nucleic acids are available (e.g., Fluidigm® Corporation).


Use of a primer extension reaction also can be applied in methods of the technology. A primer extension reaction operates, for example, by discriminating nucleic acid sequences at a single nucleotide mismatch (e.g., a mismatch between paralogous sequences). The mismatch is detected by the incorporation of one or more deoxynucleotides and/or dideoxynucleotides to an extension oligonucleotide, which hybridizes to a region adjacent to the mismatch site. The extension oligonucleotide generally is extended with a polymerase. In some embodiments, a detectable tag or detectable label is incorporated into the extension oligonucleotide or into the nucleotides added on to the extension oligonucleotide (e.g., biotin or streptavidin). The extended oligonucleotide can be detected by any known suitable detection process (e.g., mass spectrometry; sequencing processes). In some embodiments, the mismatch site is extended only by one or two complementary deoxynucleotides or dideoxynucleotides that are tagged by a specific label or generate a primer extension product with a specific mass, and the mismatch can be discriminated and quantified.


In some embodiments, amplification may be performed on a solid support. In some embodiments, primers may be associated with a solid support. In certain embodiments, target nucleic acid (e.g., template nucleic acid) may be associated with a solid support. A nucleic acid (primer or target) in association with a solid support often is referred to as a solid phase nucleic acid.


In some embodiments, nucleic acid molecules provided for amplification and in a “microreactor”. As used herein, the term “microreactor” refers to a partitioned space in which a nucleic acid molecule can hybridize to a solid support nucleic acid molecule. Examples of microreactors include, without limitation, an emulsion globule (described hereafter) and a void in a substrate. A void in a substrate can be a pit, a pore or a well (e.g., microwell, nanowell, picowell, micropore, or nanopore) in a substrate constructed from a solid material useful for containing fluids (e.g., plastic (e.g., polypropylene, polyethylene, polystyrene) or silicon) in certain embodiments. Emulsion globules are partitioned by an immiscible phase as described in greater detail hereafter. In some embodiments, the microreactor volume is large enough to accommodate one solid support (e.g., bead) in the microreactor and small enough to exclude the presence of two or more solid supports in the microreactor.


The term “emulsion” as used herein refers to a mixture of two immiscible and unblendable substances, in which one substance (the dispersed phase) often is dispersed in the other substance (the continuous phase). The dispersed phase can be an aqueous solution (i.e., a solution comprising water) in certain embodiments. In some embodiments, the dispersed phase is composed predominantly of water (e.g., greater than 70%, greater than 75%, greater than 80%, greater than 85%, greater than 90%, greater than 95%, greater than 97%, greater than 98% and greater than 99% water (by weight)). Each discrete portion of a dispersed phase, such as an aqueous dispersed phase, is referred to herein as a “globule” or “microreactor.” A globule sometimes may be spheroidal, substantially spheroidal or semi-spheroidal in shape, in certain embodiments.


The terms “emulsion apparatus” and “emulsion component(s)” as used herein refer to apparatus and components that can be used to prepare an emulsion. Non-limiting examples of emulsion apparatus include without limitation counter-flow, cross-current, rotating drum and membrane apparatus suitable for use by a person of ordinary skill to prepare an emulsion. An emulsion component forms the continuous phase of an emulsion in certain embodiments, and includes without limitation a substance immiscible with water, such as a component comprising or consisting essentially of an oil (e.g., a heat-stable, biocompatible oil (e.g., light mineral oil)). A biocompatible emulsion stabilizer can be utilized as an emulsion component. Emulsion stabilizers include without limitation Atlox 4912, Span 80 and other biocompatible surfactants.


In some embodiments, components useful for biological reactions can be included in the dispersed phase. Globules of the emulsion can include (i) a solid support unit (e.g., one bead or one particle); (ii) sample nucleic acid molecule; and (iii) a sufficient amount of extension agents to elongate solid phase nucleic acid and amplify the elongated solid phase nucleic acid (e.g., extension nucleotides, polymerase, primer). Inactive globules in the emulsion may include a subset of these components (e.g., solid support and extension reagents and no sample nucleic acid) and some can be empty (i.e., some globules will include no solid support, no sample nucleic acid and no extension agents).


Emulsions may be prepared using known suitable methods (e.g., Nakano et al. “Single-molecule PCR using water-in-oil emulsion;” Journal of Biotechnology 102 (2003) 117-124). Emulsification methods include without limitation adjuvant methods, counter-flow methods, cross-current methods, rotating drum methods, membrane methods, and the like. In certain embodiments, an aqueous reaction mixture containing a solid support (hereafter the “reaction mixture”) is prepared and then added to a biocompatible oil. In certain embodiments, the reaction mixture may be added dropwise into a spinning mixture of biocompatible oil (e.g., light mineral oil (Sigma)) and allowed to emulsify. In some embodiments, the reaction mixture may be added dropwise into a cross-flow of biocompatible oil. The size of aqueous globules in the emulsion can be adjusted, such as by varying the flow rate and speed at which the components are added to one another, for example.


The size of emulsion globules can be selected by the person of ordinary skill in certain embodiments based on two competing factors: (i) globules are sufficiently large to encompass one solid support molecule, one sample nucleic acid molecule, and sufficient extension agents for the degree of elongation and amplification required; and (ii) globules are sufficiently small so that a population of globules can be amplified by conventional laboratory equipment (e.g., thermocycling equipment, test tubes, incubators and the like). Globules in the emulsion can have a nominal, mean or average diameter of about 5 microns to about 500 microns, about 10 microns to about 350 microns, about 50 to 250 microns, about 100 microns to about 200 microns, or about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400 or 500 microns in certain embodiments.


In certain embodiments, amplified nucleic acid species in a set are of identical length, and sometimes the amplified nucleic acid species in a set are of a different length. For example, one amplified nucleic acid species may be longer than one or more other amplified nucleic acid species in the set by about 1 to about 100 nucleotides (e.g., about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides longer).


In some embodiments, a ratio can be determined for the amount of one amplified nucleic acid species in a set to the amount of another amplified nucleic acid species in the set (hereafter a “set ratio”). In some embodiments, the amount of one amplified nucleic acid species in a set is about equal to the amount of another amplified nucleic acid species in the set (i.e., amounts of amplified nucleic acid species in a set are about 1:1), which generally is the case when the number of chromosomes in a sample bearing each nucleotide sequence species amplified is about equal. The term “amount” as used herein with respect to amplified nucleic acid species refers to any suitable measurement, including, but not limited to, copy number, weight (e.g., grams) and concentration (e.g., grams per unit volume (e.g., milliliter); molar units). In certain embodiments, the amount of one amplified nucleic acid species in a set can differ from the amount of another amplified nucleic acid species in a set, even when the number of chromosomes in a sample bearing each nucleotide sequence species amplified is about equal. In some embodiments, amounts of amplified nucleic acid species within a set may vary up to a threshold level at which a chromosome abnormality can be detected with a confidence level of about 95% (e.g., about 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or greater than 99%). In certain embodiments, the amounts of the amplified nucleic acid species in a set vary by about 50% or less (e.g., about 45, 40, 35, 30, 25, 20, 15, 10, 5, 4, 3, 2 or 1%, or less than 1%). Thus, in certain embodiments amounts of amplified nucleic acid species in a set may vary from about 1:1 to about 1:1.5. Without being limited by theory, certain factors can lead to the observation that the amount of one amplified nucleic acid species in a set can differ from the amount of another amplified nucleic acid species in a set, even when the number of chromosomes in a sample bearing each nucleotide sequence species amplified is about equal. Such factors may include different amplification efficiency rates and/or amplification from a chromosome not intended in the assay design.


Each amplified nucleic acid species in a set generally is amplified under conditions that amplify that species at a substantially reproducible level. The term “substantially reproducible level” as used herein refers to consistency of amplification levels for a particular amplified nucleic acid species per unit template nucleic acid (e.g., per unit template nucleic acid that contains the particular nucleotide sequence species amplified). A substantially reproducible level varies by about 1% or less in certain embodiments, after factoring the amount of template nucleic acid giving rise to a particular amplification nucleic acid species (e.g., normalized for the amount of template nucleic acid). In some embodiments, a substantially reproducible level varies by 10%, 5%, 4%, 3%, 2%, 1.5%, 1%, 0.5%, 0.1%, 0.05%, 0.01%, 0.005% or 0.001% after factoring the amount of template nucleic acid giving rise to a particular amplification nucleic acid species. Alternatively, substantially reproducible means that any two or more measurements of an amplification level are within a particular coefficient of variation (“CV”) from a given mean. Such CV may be 20% or less, sometimes 10% or less and at times 5% or less. The two or more measurements of an amplification level may be determined between two or more reactions and/or two or more of the same sample types (for example, two normal samples or two trisomy samples)


Primers


Primers useful for detection, quantification, amplification, sequencing and analysis of nucleotide sequence species are provided. In some embodiments primers are used in sets, where a set contains at least a pair. In some embodiments a set of primers may include a third or a fourth nucleic acid (e.g., two pairs of primers or nested sets of primers, for example). A plurality of primer pairs may constitute a primer set in certain embodiments (e.g., about 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100 pairs). In some embodiments a plurality of primer sets, each set comprising pair(s) of primers, may be used. The term “primer” as used herein refers to a nucleic acid that comprises a nucleotide sequence capable of hybridizing or annealing to a target nucleic acid, at or near (e.g., adjacent to) a specific region of interest. Primers can allow for specific determination of a target nucleic acid nucleotide sequence or detection of the target nucleic acid (e.g., presence or absence of a sequence or copy number of a sequence), or feature thereof, for example. A primer may be naturally occurring or synthetic. The term “specific” or “specificity”, as used herein, refers to the binding or hybridization of one molecule to another molecule, such as a primer for a target polynucleotide. That is, “specific” or “specificity” refers to the recognition, contact, and formation of a stable complex between two molecules, as compared to substantially less recognition, contact, or complex formation of either of those two molecules with other molecules. As used herein, the term “anneal” refers to the formation of a stable complex between two molecules. The terms “primer”, “oligo”, or “oligonucleotide” may be used interchangeably throughout the document, when referring to primers.


A primer nucleic acid can be designed and synthesized using suitable processes, and may be of any length suitable for hybridizing to a nucleotide sequence of interest (e.g., where the nucleic acid is in liquid phase or bound to a solid support) and performing analysis processes described herein. Primers may be designed based upon a target nucleotide sequence. A primer in some embodiments may be about 10 to about 100 nucleotides, about 10 to about 70 nucleotides, about 10 to about 50 nucleotides, about 15 to about 30 nucleotides, or about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100 nucleotides in length. A primer may be composed of naturally occurring and/or non-naturally occurring nucleotides (e.g., labeled nucleotides), or a mixture thereof. Primers suitable for use with embodiments described herein, may be synthesized and labeled using known techniques. Oligonucleotides (e.g., primers) may be chemically synthesized according to the solid phase phosphoramidite triester method first described by Beaucage and Caruthers, Tetrahedron Letts., 22:1859-1862, 1981, using an automated synthesizer, as described in Needham-VanDevanter et al., Nucleic Acids Res. 12:6159-6168, 1984. Purification of oligonucleotides can be effected by native acrylamide gel electrophoresis or by anion-exchange high-performance liquid chromatography (HPLC), for example, as described in Pearson and Regnier, J. Chrom., 255:137-149, 1983.


All or a portion of a primer nucleic acid sequence (naturally occurring or synthetic) may be substantially complementary to a target nucleic acid, in some embodiments. As referred to herein, “substantially complementary” with respect to sequences refers to nucleotide sequences that will hybridize with each other. The stringency of the hybridization conditions can be altered to tolerate varying amounts of sequence mismatch. Included are regions of counterpart, target and capture nucleotide sequences 55% or more, 56% or more, 57% or more, 58% or more, 59% or more, 60% or more, 61% or more, 62% or more, 63% or more, 64% or more, 65% or more, 66% or more, 67% or more, 68% or more, 69% or more, 70% or more, 71% or more, 72% or more, 73% or more, 74% or more, 75% or more, 76% or more, 77% or more, 78% or more, 79% or more, 80% or more, 81% or more, 82% or more, 83% or more, 84% or more, 85% or more, 86% or more, 87% or more, 88% or more, 89% or more, 90% or more, 91% or more, 92% or more, 93% or more, 94% or more, 95% or more, 96% or more, 97% or more, 98% or more or 99% or more complementary to each other.


Primers that are substantially complimentary to a target nucleic acid sequence are also substantially identical to the compliment of the target nucleic acid sequence. That is, primers are substantially identical to the anti-sense strand of the nucleic acid. As referred to herein, “substantially identical” with respect to sequences refers to nucleotide sequences that are 55% or more, 56% or more, 57% or more, 58% or more, 59% or more, 60% or more, 61% or more, 62% or more, 63% or more, 64% or more, 65% or more, 66% or more, 67% or more, 68% or more, 69% or more, 70% or more, 71% or more, 72% or more, 73% or more, 74% or more, 75% or more, 76% or more, 77% or more, 78% or more, 79% or more, 80% or more, 81% or more, 82% or more, 83% or more, 84% or more, 85% or more, 86% or more, 87% or more, 88% or more, 89% or more, 90% or more, 91% or more, 92% or more, 93% or more, 94% or more, 95% or more, 96% or more, 97% or more, 98% or more or 99% or more identical to each other. One test for determining whether two nucleotide sequences are substantially identical is to determine the percent of identical nucleotide sequences shared.


Primer sequences and length may affect hybridization to target nucleic acid sequences. Depending on the degree of mismatch between the primer and target nucleic acid, low, medium or high stringency conditions may be used to effect primer/target annealing. As used herein, the term “stringent conditions” refers to conditions for hybridization and washing. Methods for hybridization reaction temperature condition optimization are known to those of skill in the art, and may be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 6.3.1-6.3.6 (1989). Aqueous and non-aqueous methods are described in that reference and either can be used. Non-limiting examples of stringent hybridization conditions are hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2× SSC, 0.1% SDS at 50° C.


Another example of stringent hybridization conditions are hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2× SSC, 0.1% SDS at 55° C. A further example of stringent hybridization conditions is hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2× SSC, 0.1% SDS at 60° C. Often, stringent hybridization conditions are hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2× SSC, 0.1% SDS at 65° C. More often, stringency conditions are 0.5M sodium phosphate, 7% SDS at 65° C., followed by one or more washes at 0.2× SSC, 1% SDS at 65° C. Stringent hybridization temperatures can also be altered (i.e. lowered) with the addition of certain organic solvents, formamide for example. Organic solvents, like formamide, reduce the thermal stability of double-stranded polynucleotides, so that hybridization can be performed at lower temperatures, while still maintaining stringent conditions and extending the useful life of nucleic acids that may be heat labile.


As used herein, the phrase “hybridizing” or grammatical variations thereof, refers to binding of a first nucleic acid molecule to a second nucleic acid molecule under low, medium or high stringency conditions, or under nucleic acid synthesis conditions. Hybridizing can include instances where a first nucleic acid molecule binds to a second nucleic acid molecule, where the first and second nucleic acid molecules are complementary. As used herein, “specifically hybridizes” refers to preferential hybridization under nucleic acid synthesis conditions of a primer, to a nucleic acid molecule having a sequence complementary to the primer compared to hybridization to a nucleic acid molecule not having a complementary sequence. For example, specific hybridization includes the hybridization of a primer to a target nucleic acid sequence that is complementary to the primer.


In some embodiments primers can include a nucleotide subsequence that may be complementary to a solid phase nucleic acid primer hybridization sequence or substantially complementary to a solid phase nucleic acid primer hybridization sequence (e.g., about 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater than 99% identical to the primer hybridization sequence complement when aligned). A primer may contain a nucleotide subsequence not complementary to or not substantially complementary to a solid phase nucleic acid primer hybridization sequence (e.g., at the 3′ or 5′ end of the nucleotide subsequence in the primer complementary to or substantially complementary to the solid phase primer hybridization sequence).


A primer, in certain embodiments, may contain a modification such as inosines, abasic sites, locked nucleic acids, minor groove binders, duplex stabilizers (e.g., acridine, spermidine), Tm modifiers or any modifier that changes the binding properties of the primers or probes.


A primer, in certain embodiments, may contain a detectable molecule or entity (e.g., a fluorophore, radioisotope, colorimetric agent, particle, enzyme and the like). When desired, the nucleic acid can be modified to include a detectable label using any method known to one of skill in the art. The label may be incorporated as part of the synthesis, or added on prior to using the primer in any of the processes described herein. Incorporation of label may be performed either in liquid phase or on solid phase. In some embodiments the detectable label may be useful for detection of targets. In some embodiments the detectable label may be useful for the quantification target nucleic acids (e.g., determining copy number of a particular sequence or species of nucleic acid). Any detectable label suitable for detection of an interaction or biological activity in a system can be appropriately selected and utilized by the artisan. Examples of detectable labels are fluorescent labels such as fluorescein, rhodamine, and others (e.g., Anantha, et al., Biochemistry (1998) 37:2709 2714; and Qu & Chaires, Methods Enzymol. (2000) 321:353 369); radioactive isotopes (e.g., 125I, 131I, 35S, 31P, 32P, 33P, 14C, 3H, 7Be, 28Mg, 57Co, 65Zn, 67Cu, 68Ge, 82Sr, 83Rb, 95Tc, 96Tc, 103Pd, 109Cd, and 127Xe); light scattering labels (e.g., U.S. Pat. No. 6,214,560, and commercially available from Genicon Sciences Corporation, CA); chemiluminescent labels and enzyme substrates (e.g., dioxetanes and acridinium esters), enzymic or protein labels (e.g., green fluorescence protein (GFP) or color variant thereof, luciferase, peroxidase); other chromogenic labels or dyes (e.g., cyanine), and other cofactors or biomolecules such as digoxigenin, strepdavidin, biotin (e.g., members of a binding pair such as biotin and avidin for example), affinity capture moieties and the like. In some embodiments a primer may be labeled with an affinity capture moiety. Also included in detectable labels are those labels useful for mass modification for detection with mass spectrometry (e.g., matrix-assisted laser desorption ionization (MALDI) mass spectrometry and electrospray (ES) mass spectrometry).


A primer also may refer to a polynucleotide sequence that hybridizes to a subsequence of a target nucleic acid or another primer and facilitates the detection of a primer, a target nucleic acid or both, as with molecular beacons, for example. The term “molecular beacon” as used herein refers to detectable molecule, where the detectable property of the molecule is detectable only under certain specific conditions, thereby enabling it to function as a specific and informative signal. Non-limiting examples of detectable properties are, optical properties, electrical properties, magnetic properties, chemical properties and time or speed through an opening of known size.


In some embodiments a molecular beacon can be a single-stranded oligonucleotide capable of forming a stem-loop structure, where the loop sequence may be complementary to a target nucleic acid sequence of interest and is flanked by short complementary arms that can form a stem. The oligonucleotide may be labeled at one end with a fluorophore and at the other end with a quencher molecule. In the stem-loop conformation, energy from the excited fluorophore is transferred to the quencher, through long-range dipole-dipole coupling similar to that seen in fluorescence resonance energy transfer, or FRET, and released as heat instead of light. When the loop sequence is hybridized to a specific target sequence, the two ends of the molecule are separated and the energy from the excited fluorophore is emitted as light, generating a detectable signal. Molecular beacons offer the added advantage that removal of excess probe is unnecessary due to the self-quenching nature of the unhybridized probe. In some embodiments molecular beacon probes can be designed to either discriminate or tolerate mismatches between the loop and target sequences by modulating the relative strengths of the loop-target hybridization and stem formation. As referred to herein, the term “mismatched nucleotide” or a “mismatch” refers to a nucleotide that is not complementary to the target sequence at that position or positions. A probe may have at least one mismatch, but can also have 2, 3, 4, 5, 6 or 7 or more mismatched nucleotides.


Detection


Nucleotide sequence species, or amplified nucleic acid species, or detectable products prepared from the foregoing, can be detected by a suitable detection process. Non-limiting examples of methods of detection, quantification, sequencing and the like include mass detection of mass modified amplicons (e.g., matrix-assisted laser desorption ionization (MALDI) mass spectrometry and electrospray (ES) mass spectrometry), a primer extension method (e.g., iPLEX™; Sequenom, Inc.), direct DNA sequencing, Molecular Inversion Probe (MIP) technology from Affymetrix, restriction fragment length polymorphism (RFLP analysis), allele specific oligonucleotide (ASO) analysis, methylation-specific PCR (MSPCR), pyrosequencing analysis, acycloprime analysis, Reverse dot blot, GeneChip microarrays, Dynamic allele-specific hybridization (DASH), Peptide nucleic acid (PNA) and locked nucleic acids (LNA) probes, TaqMan, Molecular Beacons, Intercalating dye, FRET primers, AlphaScreen, SNPstream, genetic bit analysis (GBA), Multiplex minisequencing, SNaPshot, GOOD assay, Microarray miniseq, arrayed primer extension (APEX), Microarray primer extension, Tag arrays, Coded microspheres, Template-directed incorporation (TDI), fluorescence polarization, Colorimetric oligonucleotide ligation assay (OLA), Sequence-coded OLA, Microarray ligation, Ligase chain reaction, Padlock probes, Invader assay, hybridization using at least one probe, hybridization using at least one fluorescently labeled probe, cloning and sequencing, electrophoresis, the use of hybridization probes and quantitative real time polymerase chain reaction (QRT-PCR), digital PCR, nanopore sequencing, chips and combinations thereof. The detection and quantification of alleles or paralogs can be carried out using the “closed-tube” methods described in U.S. patent application Ser. No. 11/950,395, which was filed Dec. 4, 2007. In some embodiments the amount of each amplified nucleic acid species is determined by mass spectrometry, primer extension, sequencing (e.g., any suitable method, for example nanopore or pyrosequencing), Quantitative PCR (Q-PCR or QRT-PCR), digital PCR, combinations thereof, and the like.


A target nucleic acid can be detected by detecting a detectable label or “signal-generating moiety” in some embodiments. The term “signal-generating” as used herein refers to any atom or molecule that can provide a detectable or quantifiable effect, and that can be attached to a nucleic acid. In certain embodiments, a detectable label generates a unique light signal, a fluorescent signal, a luminescent signal, an electrical property, a chemical property, a magnetic property and the like.


Detectable labels include, but are not limited to, nucleotides (labeled or unlabelled), compomers, sugars, peptides, proteins, antibodies, chemical compounds, conducting polymers, binding moieties such as biotin, mass tags, colorimetric agents, light emitting agents, chemiluminescent agents, light scattering agents, fluorescent tags, radioactive tags, charge tags (electrical or magnetic charge), volatile tags and hydrophobic tags, biomolecules (e.g., members of a binding pair antibody/antigen, antibody/antibody, antibody/antibody fragment, antibody/antibody receptor, antibody/protein A or protein G, hapten/anti-hapten, biotin/avidin, biotin/streptavidin, folic acid/folate binding protein, vitamin B12/intrinsic factor, chemical reactive group/complementary chemical reactive group (e.g., sulfhydryl/maleimide, sulfhydryl/haloacetyl derivative, amine/isotriocyanate, amine/succinimidyl ester, and amine/sulfonyl halides) and the like, some of which are further described below. In some embodiments a probe may contain a signal-generating moiety that hybridizes to a target and alters the passage of the target nucleic acid through a nanopore, and can generate a signal when released from the target nucleic acid when it passes through the nanopore (e.g., alters the speed or time through a pore of known size).


In certain embodiments, sample tags are introduced to distinguish between samples (e.g., from different patients), thereby allowing for the simultaneous testing of multiple samples. For example, sample tags may introduced as part of the extend primers such that extended primers can be associated with a particular sample.


A solution containing amplicons produced by an amplification process, or a solution containing extension products produced by an extension process, can be subjected to further processing. For example, a solution can be contacted with an agent that removes phosphate moieties from free nucleotides that have not been incorporated into an amplicon or extension product. An example of such an agent is a phosphatase (e.g., alkaline phosphatase). Amplicons and extension products also may be associated with a solid phase, may be washed, may be contacted with an agent that removes a terminal phosphate (e.g., exposure to a phosphatase), may be contacted with an agent that removes a terminal nucleotide (e.g., exonuclease), may be contacted with an agent that cleaves (e.g., endonuclease, ribonuclease), and the like.


The term “solid support” or “solid phase” as used herein refers to an insoluble material with which nucleic acid can be associated. Examples of solid supports for use with processes described herein include, without limitation, arrays, beads (e.g., paramagnetic beads, magnetic beads, microbeads, nanobeads) and particles (e.g., microparticles, nanoparticles). Particles or beads having a nominal, average or mean diameter of about 1 nanometer to about 500 micrometers can be utilized, such as those having a nominal, mean or average diameter, for example, of about 10 nanometers to about 100 micrometers; about 100 nanometers to about 100 micrometers; about 1 micrometer to about 100 micrometers; about 10 micrometers to about 50 micrometers; about 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800 or 900 nanometers; or about 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500 micrometers.


A solid support can comprise virtually any insoluble or solid material, and often a solid support composition is selected that is insoluble in water. For example, a solid support can comprise or consist essentially of silica gel, glass (e.g. controlled-pore glass (CPG)), nylon, Sephadex®, Sepharose®, cellulose, a metal surface (e.g. steel, gold, silver, aluminum, silicon and copper), a magnetic material, a plastic material (e.g., polyethylene, polypropylene, polyamide, polyester, polyvinylidenedifluoride (PVDF)) and the like. Beads or particles may be swellable (e.g., polymeric beads such as Wang resin) or non-swellable (e.g., CPG). Commercially available examples of beads include without limitation Wang resin, Merrifield resin and Dynabeads® and SoluLink.


A solid support may be provided in a collection of solid supports. A solid support collection comprises two or more different solid support species. The term “solid support species” as used herein refers to a solid support in association with one particular solid phase nucleic acid species or a particular combination of different solid phase nucleic acid species. In certain embodiments, a solid support collection comprises 2 to 10,000 solid support species, 10 to 1,000 solid support species or about 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10000 unique solid support species. The solid supports (e.g., beads) in the collection of solid supports may be homogeneous (e.g., all are Wang resin beads) or heterogeneous (e.g., some are Wang resin beads and some are magnetic beads). Each solid support species in a collection of solid supports sometimes is labeled with a specific identification tag. An identification tag for a particular solid support species sometimes is a nucleic acid (e.g., “solid phase nucleic acid”) having a unique sequence in certain embodiments. An identification tag can be any molecule that is detectable and distinguishable from identification tags on other solid support species.


Nucleotide sequence species, amplified nucleic acid species, or detectable products generated from the foregoing may be subject to sequence analysis. The term “sequence analysis” as used herein refers to determining a nucleotide sequence of an amplification product. The entire sequence or a partial sequence of an amplification product can be determined, and the determined nucleotide sequence is referred to herein as a “read.” For example, linear amplification products may be analyzed directly without further amplification in some embodiments (e.g., by using single-molecule sequencing methodology (described in greater detail hereafter)). In certain embodiments, linear amplification products may be subject to further amplification and then analyzed (e.g., using sequencing by ligation or pyrosequencing methodology (described in greater detail hereafter)). Reads may be subject to different types of sequence analysis. Any suitable sequencing method can be utilized to detect, and determine the amount of, nucleotide sequence species, amplified nucleic acid species, or detectable products generated from the foregoing. In one embodiment, a heterogeneous sample is subjected to targeted sequencing (or partial targeted sequencing) where one or more sets of nucleic acid species are sequenced, and the amount of each sequenced nucleic acid species in the set is determined, whereby the presence or absence of a chromosome abnormality is identified based on the amount of the sequenced nucleic acid species Examples of certain sequencing methods are described hereafter.


The terms “sequence analysis apparatus” and “sequence analysis component(s)” used herein refer to apparatus, and one or more components used in conjunction with such apparatus, that can be used by a person of ordinary skill to determine a nucleotide sequence from amplification products resulting from processes described herein (e.g., linear and/or exponential amplification products). Examples of sequencing platforms include, without limitation, the 454 platform (Roche) (Margulies, M. et al. 2005 Nature 437, 376-380), IIlumina Genomic Analyzer (or Solexa platform) or SOLID System (Applied Biosystems) or the Helicos True Single Molecule DNA sequencing technology (Harris T D et al. 2008 Science, 320, 106-109), the single molecule, real-time (SMRTTM) technology of Pacific Biosciences, and nanopore sequencing (Soni G V and Meller A. 2007 Clin Chem 53: 1996-2001). Such platforms allow sequencing of many nucleic acid molecules isolated from a specimen at high orders of multiplexing in a parallel manner (Dear Brief Funct Genomic Proteomic 2003; 1: 397-416). Each of these platforms allow sequencing of clonally expanded or non-amplified single molecules of nucleic acid fragments. Certain platforms involve, for example, (i) sequencing by ligation of dye-modified probes (including cyclic ligation and cleavage), (ii) pyrosequencing, and (iii) single-molecule sequencing. Nucleotide sequence species, amplification nucleic acid species and detectable products generated there from can be considered a “study nucleic acid” for purposes of analyzing a nucleotide sequence by such sequence analysis platforms.


Sequencing by ligation is a nucleic acid sequencing method that relies on the sensitivity of DNA ligase to base-pairing mismatch. DNA ligase joins together ends of DNA that are correctly base paired. Combining the ability of DNA ligase to join together only correctly base paired DNA ends, with mixed pools of fluorescently labeled oligonucleotides or primers, enables sequence determination by fluorescence detection. Longer sequence reads may be obtained by including primers containing cleavable linkages that can be cleaved after label identification. Cleavage at the linker removes the label and regenerates the 5′ phosphate on the end of the ligated primer, preparing the primer for another round of ligation. In some embodiments primers may be labeled with more than one fluorescent label (e.g., 1 fluorescent label, 2, 3, or 4 fluorescent labels).


An example of a system that can be used by a person of ordinary skill based on sequencing by ligation generally involves the following steps. Clonal bead populations can be prepared in emulsion microreactors containing study nucleic acid (“template”), amplification reaction components, beads and primers. After amplification, templates are denatured and bead enrichment is performed to separate beads with extended templates from undesired beads (e.g., beads with no extended templates). The template on the selected beads undergoes a 3′ modification to allow covalent bonding to the slide, and modified beads can be deposited onto a glass slide. Deposition chambers offer the ability to segment a slide into one, four or eight chambers during the bead loading process. For sequence analysis, primers hybridize to the adapter sequence. A set of four color dye-labeled probes competes for ligation to the sequencing primer. Specificity of probe ligation is achieved by interrogating every 4th and 5th base during the ligation series. Five to seven rounds of ligation, detection and cleavage record the color at every 5th position with the number of rounds determined by the type of library used. Following each round of ligation, a new complimentary primer offset by one base in the 5′ direction is laid down for another series of ligations. Primer reset and ligation rounds (5-7 ligation cycles per round) are repeated sequentially five times to generate 25-35 base pairs of sequence for a single tag. With mate-paired sequencing, this process is repeated for a second tag. Such a system can be used to exponentially amplify amplification products generated by a process described herein, e.g., by ligating a heterologous nucleic acid to the first amplification product generated by a process described herein and performing emulsion amplification using the same or a different solid support originally used to generate the first amplification product. Such a system also may be used to analyze amplification products directly generated by a process described herein by bypassing an exponential amplification process and directly sorting the solid supports described herein on the glass slide.


Pyrosequencing is a nucleic acid sequencing method based on sequencing by synthesis, which relies on detection of a pyrophosphate released on nucleotide incorporation. Generally, sequencing by synthesis involves synthesizing, one nucleotide at a time, a DNA strand complimentary to the strand whose sequence is being sought. Study nucleic acids may be immobilized to a solid support, hybridized with a sequencing primer, incubated with DNA polymerase, ATP sulfurylase, luciferase, apyrase, adenosine 5′ phosphsulfate and luciferin. Nucleotide solutions are sequentially added and removed. Correct incorporation of a nucleotide releases a pyrophosphate, which interacts with ATP sulfurylase and produces ATP in the presence of adenosine 5′ phosphsulfate, fueling the luciferin reaction, which produces a chemiluminescent signal allowing sequence determination.


An example of a system that can be used by a person of ordinary skill based on pyrosequencing generally involves the following steps: ligating an adaptor nucleic acid to a study nucleic acid and hybridizing the study nucleic acid to a bead; amplifying a nucleotide sequence in the study nucleic acid in an emulsion; sorting beads using a picoliter multiwell solid support; and sequencing amplified nucleotide sequences by pyrosequencing methodology (e.g., Nakano et al., “Single-molecule PCR using water-in-oil emulsion;” Journal of Biotechnology 102: 117-124 (2003)). Such a system can be used to exponentially amplify amplification products generated by a process described herein, e.g., by ligating a heterologous nucleic acid to the first amplification product generated by a process described herein.


Certain single-molecule sequencing embodiments are based on the principal of sequencing by synthesis, and utilize single-pair Fluorescence Resonance Energy Transfer (single pair FRET) as a mechanism by which photons are emitted as a result of successful nucleotide incorporation. The emitted photons often are detected using intensified or high sensitivity cooled charge-couple-devices in conjunction with total internal reflection microscopy (TIRM). Photons are only emitted when the introduced reaction solution contains the correct nucleotide for incorporation into the growing nucleic acid chain that is synthesized as a result of the sequencing process. In FRET based single-molecule sequencing, energy is transferred between two fluorescent dyes, sometimes polymethine cyanine dyes Cy3 and Cy5, through long-range dipole interactions. The donor is excited at its specific excitation wavelength and the excited state energy is transferred, non-radiatively to the acceptor dye, which in turn becomes excited. The acceptor dye eventually returns to the ground state by radiative emission of a photon. The two dyes used in the energy transfer process represent the “single pair”, in single pair FRET. Cy3 often is used as the donor fluorophore and often is incorporated as the first labeled nucleotide. Cy5 often is used as the acceptor fluorophore and is used as the nucleotide label for successive nucleotide additions after incorporation of a first Cy3 labeled nucleotide. The fluorophores generally are within 10 nanometers of each for energy transfer to occur successfully.


An example of a system that can be used based on single-molecule sequencing generally involves hybridizing a primer to a study nucleic acid to generate a complex; associating the complex with a solid phase; iteratively extending the primer by a nucleotide tagged with a fluorescent molecule; and capturing an image of fluorescence resonance energy transfer signals after each iteration (e.g., U.S. Pat. No. 7,169,314; Braslaysky et al., PNAS 100(7): 3960-3964 (2003)). Such a system can be used to directly sequence amplification products generated by processes described herein. In some embodiments the released linear amplification product can be hybridized to a primer that contains sequences complementary to immobilized capture sequences present on a solid support, a bead or glass slide for example. Hybridization of the primer—released linear amplification product complexes with the immobilized capture sequences, immobilizes released linear amplification products to solid supports for single pair FRET based sequencing by synthesis. The primer often is fluorescent, so that an initial reference image of the surface of the slide with immobilized nucleic acids can be generated. The initial reference image is useful for determining locations at which true nucleotide incorporation is occurring. Fluorescence signals detected in array locations not initially identified in the “primer only” reference image are discarded as non-specific fluorescence. Following immobilization of the primer—released linear amplification product complexes, the bound nucleic acids often are sequenced in parallel by the iterative steps of, a) polymerase extension in the presence of one fluorescently labeled nucleotide, b) detection of fluorescence using appropriate microscopy, TIRM for example, c) removal of fluorescent nucleotide, and d) return to step a with a different fluorescently labeled nucleotide.


In some embodiments, nucleotide sequencing may be by solid phase single nucleotide sequencing methods and processes. Solid phase single nucleotide sequencing methods involve contacting sample nucleic acid and solid support under conditions in which a single molecule of sample nucleic acid hybridizes to a single molecule of a solid support. Such conditions can include providing the solid support molecules and a single molecule of sample nucleic acid in a “microreactor.” Such conditions also can include providing a mixture in which the sample nucleic acid molecule can hybridize to solid phase nucleic acid on the solid support. Single nucleotide sequencing methods useful in the embodiments described herein are described in United States Provisional Patent Application Serial Number 61/021,871 filed January 17, 2008.


In certain embodiments, nanopore sequencing detection methods include (a) contacting a nucleic acid for sequencing (“base nucleic acid,” e.g., linked probe molecule) with sequence-specific detectors, under conditions in which the detectors specifically hybridize to substantially complementary subsequences of the base nucleic acid; (b) detecting signals from the detectors and (c) determining the sequence of the base nucleic acid according to the signals detected. In certain embodiments, the detectors hybridized to the base nucleic acid are disassociated from the base nucleic acid (e.g., sequentially dissociated) when the detectors interfere with a nanopore structure as the base nucleic acid passes through a pore, and the detectors disassociated from the base sequence are detected. In some embodiments, a detector disassociated from a base nucleic acid emits a detectable signal, and the detector hybridized to the base nucleic acid emits a different detectable signal or no detectable signal. In certain embodiments, nucleotides in a nucleic acid (e.g., linked probe molecule) are substituted with specific nucleotide sequences corresponding to specific nucleotides (“nucleotide representatives”), thereby giving rise to an expanded nucleic acid (e.g., U.S. Pat. No. 6,723,513), and the detectors hybridize to the nucleotide representatives in the expanded nucleic acid, which serves as a base nucleic acid. In such embodiments, nucleotide representatives may be arranged in a binary or higher order arrangement (e.g., Soni and Meller, Clinical Chemistry 53(11): 1996-2001 (2007)). In some embodiments, a nucleic acid is not expanded, does not give rise to an expanded nucleic acid, and directly serves a base nucleic acid (e.g., a linked probe molecule serves as a non-expanded base nucleic acid), and detectors are directly contacted with the base nucleic acid. For example, a first detector may hybridize to a first subsequence and a second detector may hybridize to a second subsequence, where the first detector and second detector each have detectable labels that can be distinguished from one another, and where the signals from the first detector and second detector can be distinguished from one another when the detectors are disassociated from the base nucleic acid. In certain embodiments, detectors include a region that hybridizes to the base nucleic acid (e.g., two regions), which can be about 3 to about 100 nucleotides in length (e.g., about 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50, 55, 60, 65, 70, 75, 80, 85, 90, or 95 nucleotides in length). A detector also may include one or more regions of nucleotides that do not hybridize to the base nucleic acid. In some embodiments, a detector is a molecular beacon. A detector often comprises one or more detectable labels independently selected from those described herein. Each detectable label can be detected by any convenient detection process capable of detecting a signal generated by each label (e.g., magnetic, electric, chemical, optical and the like). For example, a CD camera can be used to detect signals from one or more distinguishable quantum dots linked to a detector.


In certain sequence analysis embodiments, reads may be used to construct a larger nucleotide sequence, which can be facilitated by identifying overlapping sequences in different reads and by using identification sequences in the reads. Such sequence analysis methods and software for constructing larger sequences from reads are known to the person of ordinary skill (e.g., Venter et al., Science 291: 1304-1351 (2001)). Specific reads, partial nucleotide sequence constructs, and full nucleotide sequence constructs may be compared between nucleotide sequences within a sample nucleic acid (i.e., internal comparison) or may be compared with a reference sequence (i.e., reference comparison) in certain sequence analysis embodiments. Internal comparisons sometimes are performed in situations where a sample nucleic acid is prepared from multiple samples or from a single sample source that contains sequence variations. Reference comparisons sometimes are performed when a reference nucleotide sequence is known and an objective is to determine whether a sample nucleic acid contains a nucleotide sequence that is substantially similar or the same, or different, than a reference nucleotide sequence. Sequence analysis is facilitated by sequence analysis apparatus and components known to the person of ordinary skill in the art.


Mass spectrometry is a particularly effective method for the detection of a nucleic acids (e.g., PCR amplicon, primer extension product, detector probe cleaved from a target nucleic acid). Presence of a target nucleic acid is verified by comparing the mass of the detected signal with the expected mass of the target nucleic acid. The relative signal strength, e.g., mass peak on a spectra, for a particular target nucleic acid indicates the relative population of the target nucleic acid amongst other nucleic acids, thus enabling calculation of a ratio of target to other nucleic acid or sequence copy number directly from the data. For a review of genotyping methods using Sequenom® standard iPLEX™ assay and MassARRAY® technology, see Jurinke, C., Oeth, P., van den Boom, D., “MALDI-TOF mass spectrometry: a versatile tool for high-performance DNA analysis.” Mol. Biotechnol. 26, 147-164 (2004);. For a review of detecting and quantifying target nucleic using cleavable detector probes that are cleaved during the amplification process and detected by mass spectrometry, see U.S. patent application Ser. No. 11/950,395, which was filed Dec. 4, 2007, and is hereby incorporated by reference. Such approaches may be adapted to detection of chromosome abnormalities by methods described herein.


In some embodiments, amplified nucleic acid species may be detected by (a) contacting the amplified nucleic acid species (e.g., amplicons) with extension primers (e.g., detection or detector primers), (b) preparing extended extension primers, and (c) determining the relative amount of the one or more mismatch nucleotides (e.g., SNP that exist between paralogous sequences) by analyzing the extended detection primers (e.g., extension primers). In certain embodiments one or more mismatch nucleotides may be analyzed by mass spectrometry. In some embodiments amplification, using methods described herein, may generate between about 1 to about 100 amplicon sets, about 2 to about 80 amplicon sets, about 4 to about 60 amplicon sets, about 6 to about 40 amplicon sets, and about 8 to about 20 amplicon sets (e.g., about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or about 100 amplicon sets).


An example using mass spectrometry for detection of amplicon sets is presented herein. Amplicons may be contacted (in solution or on solid phase) with a set of oligonucleotides (the same primers used for amplification or different primers representative of subsequences in the primer or target nucleic acid) under hybridization conditions, where: (1) each oligonucleotide in the set comprises a hybridization sequence capable of specifically hybridizing to one amplicon under the hybridization conditions when the amplicon is present in the solution, (2) each oligonucleotide in the set comprises a distinguishable tag located 5′ of the hybridization sequence, (3) a feature of the distinguishable tag of one oligonucleotide detectably differs from the features of distinguishable tags of other oligonucleotides in the set; and (4) each distinguishable tag specifically corresponds to a specific amplicon and thereby specifically corresponds to a specific target nucleic acid. The hybridized amplicon and “detection” primer are subjected to nucleotide synthesis conditions that allow extension of the detection primer by one or more nucleotides (labeled with a detectable entity or moiety, or unlabeled), where one of the one of more nucleotides can be a terminating nucleotide. In some embodiments one or more of the nucleotides added to the primer may comprises a capture agent. In embodiments where hybridization occurred in solution, capture of the primer/amplicon to solid support may be desirable. The detectable moieties or entities can be released from the extended detection primer, and detection of the moiety determines the presence, absence or copy number of the nucleotide sequence of interest. In certain embodiments, the extension may be performed once yielding one extended oligonucleotide. In some embodiments, the extension may be performed multiple times (e.g., under amplification conditions) yielding multiple copies of the extended oligonucleotide. In some embodiments performing the extension multiple times can produce a sufficient number of copies such that interpretation of signals, representing copy number of a particular sequence, can be made with a confidence level of 95% or more (e.g., confidence level of 95% or more, 96% or more, 97% or more, 98% or more, 99% or more, or a confidence level of 99.5% or more).


Methods provided herein allow for high-throughput detection of nucleic acid species in a plurality of nucleic acids (e.g., nucleotide sequence species, amplified nucleic acid species and detectable products generated from the foregoing). Multiplexing refers to the simultaneous detection of more than one nucleic acid species. General methods for performing multiplexed reactions in conjunction with mass spectrometry, are known (see, e.g., U.S. Pat. Nos. 6,043,031, 5,547,835 and International PCT application No. WO 97/37041). Multiplexing provides an advantage that a plurality of nucleic acid species (e.g., some having different sequence variations) can be identified in as few as a single mass spectrum, as compared to having to perform a separate mass spectrometry analysis for each individual target nucleic acid species. Methods provided herein lend themselves to high-throughput, highly-automated processes for analyzing sequence variations with high speed and accuracy, in some embodiments. In some embodiments, methods herein may be multiplexed at high levels in a single reaction.


In certain embodiments, the number of nucleic acid species multiplexed include, without limitation, about 1 to about 500 (e.g., about 1-3, 3-5, 5-7, 7-9, 9-11, 11-13, 13-15, 15-17, 17-19, 19-21, 21-23, 23-25, 25-27, 27-29, 29-31, 31-33, 33-35, 35-37, 37-39, 39-41, 41-43, 43-45, 45-47, 47-49, 49-51, 51-53, 53-55, 55-57, 57-59, 59-61, 61-63, 63-65, 65-67, 67-69, 69-71, 71-73, 73-75, 75-77, 77-79, 79-81, 81-83, 83-85, 85-87, 87-89, 89-91, 91-93, 93-95, 95-97, 97-101, 101-103, 103-105, 105-107, 107-109, 109-111, 111-113, 113-115, 115-117, 117-119, 121-123, 123-125, 125-127, 127-129, 129-131, 131-133, 133-135, 135-137, 137-139, 139-141, 141-143, 143-145, 145-147, 147-149, 149-151, 151-153, 153-155, 155-157, 157-159, 159-161, 161-163, 163-165, 165-167, 167-169, 169-171, 171-173, 173-175, 175-177, 177-179, 179-181, 181-183, 183-185, 185-187, 187-189, 189-191, 191-193, 193-195, 195-197, 197-199, 199-201, 201-203, 203-205, 205-207, 207-209, 209-211, 211-213, 213-215, 215-217, 217-219, 219-221, 221-223, 223-225, 225-227, 227-229, 229-231, 231-233, 233-235, 235-237, 237-239, 239-241, 241-243, 243-245, 245-247, 247-249, 249-251, 251-253, 253-255, 255-257, 257-259, 259-261, 261-263, 263-265, 265-267, 267-269, 269-271, 271-273, 273-275, 275-277, 277-279, 279-281, 281-283, 283-285, 285-287, 287-289, 289-291, 291-293, 293-295, 295-297, 297-299, 299-301, 301-303, 303-305, 305-307, 307-309, 309-311, 311-313, 313-315, 315-317, 317-319, 319-321, 321-323, 323-325, 325-327, 327-329, 329-331, 331-333, 333-335, 335-337, 337-339, 339-341, 341-343, 343-345, 345-347, 347-349, 349-351, 351-353, 353-355, 355-357, 357-359, 359-361, 361-363, 363-365, 365-367, 367-369, 369-371, 371-373, 373-375, 375-377, 377-379, 379-381, 381-383, 383-385, 385-387, 387-389, 389-391, 391-393, 393-395, 395-397, 397-401, 401-403, 403-405, 405-407, 407-409, 409-411, 411-413, 413-415, 415-417, 417-419, 419-421, 421-423, 423-425, 425-427, 427-429, 429-431, 431-433, 433-435, 435-437, 437-439, 439-441, 441-443, 443-445, 445-447, 447-449, 449-451, 451-453, 453-455, 455-457, 457-459, 459-461, 461-463, 463-465, 465-467, 467-469, 469-471, 471-473, 473-475, 475-477, 477-479, 479-481, 481-483, 483-485, 485-487, 487-489, 489-491, 491-493, 493-495, 495-497, 497-501).


Design methods for achieving resolved mass spectra with multiplexed assays can include primer and oligonucleotide design methods and reaction design methods. For primer and oligonucleotide design in multiplexed assays, the same general guidelines for primer design applies for uniplexed reactions, such as avoiding false priming and primer dimers, only more primers are involved for multiplex reactions. For mass spectrometry applications, analyte peaks in the mass spectra for one assay are sufficiently resolved from a product of any assay with which that assay is multiplexed, including pausing peaks and any other by-product peaks. Also, analyte peaks optimally fall within a user-specified mass window, for example, within a range of 5,000-8,500 Da. In some embodiments multiplex analysis may be adapted to mass spectrometric detection of chromosome abnormalities, for example. In certain embodiments multiplex analysis may be adapted to various single nucleotide or nanopore based sequencing methods described herein. Commercially produced micro-reaction chambers or devices or arrays or chips may be used to facilitate multiplex analysis, and are commercially available.


Data Processing and Identifying Presence or Absence of a Chromosome Abnormality


The term “detection” of a chromosome abnormality as used herein refers to identification of an imbalance of chromosomes by processing data arising from detecting sets of amplified nucleic acid species, nucleotide sequence species, or a detectable product generated from the foregoing (collectively “detectable product”). Any suitable detection device and method can be used to distinguish one or more sets of detectable products, as addressed herein. An outcome pertaining to the presence or absence of a chromosome abnormality can be expressed in any suitable form, including, without limitation, probability (e.g., odds ratio, p-value), likelihood, percentage, value over a threshold, or risk factor, associated with the presence of a chromosome abnormality for a subject or sample. An outcome may be provided with one or more of sensitivity, specificity, standard deviation, coefficient of variation (CV) and/or confidence level, or combinations of the foregoing, in certain embodiments.


Detection of a chromosome abnormality based on one or more sets of detectable products may be identified based on one or more calculated variables, including, but not limited to, sensitivity, specificity, standard deviation, coefficient of variation (CV), a threshold, confidence level, score, probability and/or a combination thereof. In some embodiments, (i) the number of sets selected for a diagnostic method, and/or (ii) the particular nucleotide sequence species of each set selected for a diagnostic method, is determined in part or in full according to one or more of such calculated variables.


In certain embodiments, one or more of sensitivity, specificity and/or confidence level are expressed as a percentage. In some embodiments, the percentage, independently for each variable, is greater than about 90% (e.g., about 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99%, or greater than 99% (e.g., about 99.5%, or greater, about 99.9% or greater, about 99.95% or greater, about 99.99% or greater)). Coefficient of variation (CV) in some embodiments is expressed as a percentage, and sometimes the percentage is about 10% or less (e.g., about 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1%, or less than 1% (e.g., about 0.5% or less, about 0.1% or less, about 0.05% or less, about 0.01% or less)). A probability (e.g., that a particular outcome determined by an algorithm is not due to chance) in certain embodiments is expressed as a p-value, and sometimes the p-value is about 0.05 or less (e.g., about 0.05, 0.04, 0.03, 0.02 or 0.01, or less than 0.01 (e.g., about 0.001 or less, about 0.0001 or less, about 0.00001 or less, about 0.000001 or less)).


Scoring or a score refers to calculating the probability that a particular chromosome abnormality is actually present or absent in a subject/sample, in some embodimentse. The value of a score may be used to determine for example the variation, difference, or ratio of amplified nucleic detectable product that may correspond to the actual chromosome abnormality. For example, calculating a positive score from detectable products can lead to an identification of a chromosome abnormality, which is particularly relevant to analysis of single samples.


In certain embodiments, simulated (or simulation) data can aid data processing for example by training an algorithm or testing an algorithm. Simulated data may for instance involve hypothetical various samples of different concentrations of fetal and maternal nucleic acid in serum, plasma and the like. Simulated data may be based on what might be expected from a real population or may be skewed to test an algorithm and/or to assign a correct classification based on a simulated data set. Simulated data also is referred to herein as “virtual” data. Fetal/maternal contributions within a sample can be simulated as a table or array of numbers (for example, as a list of peaks corresponding to the mass signals of cleavage products of a reference biomolecule or amplified nucleic acid sequence), as a mass spectrum, as a pattern of bands on a gel, or as a representation of any technique that measures mass distribution. Simulations can be performed in most instances by a computer program. One possible step in using a simulated data set is to evaluate the confidence of the identified results, i.e. how well the selected positives/negatives match the sample and whether there are additional variations. A common approach is to calculate the probability value (p-value) which estimates the probability of a random sample having better score than the selected one. As p-value calculations can be prohibitive in certain circumstances, an empirical model may be assessed, in which it is assumed that at least one sample matches a reference sample (with or without resolved variations). Alternatively other distributions such as Poisson distribution can be used to describe the probability distribution.


In certain embodiments, an algorithm can assign a confidence value to the true positives, true negatives, false positives and false negatives calculated. The assignment of a likelihood of the occurrence of a chromosome abnormality can also be based on a certain probability model.


Simulated data often is generated in an in silico process. As used herein, the term “in silico” refers to research and experiments performed using a computer. In silico methods include, but are not limited to, molecular modeling studies, karyotyping, genetic calculations, biomolecular docking experiments, and virtual representations of molecular structures and/or processes, such as molecular interactions.


As used herein, a “data processing routine” refers to a process, that can be embodied in software, that determines the biological significance of acquired data (i.e., the ultimate results of an assay). For example, a data processing routine can determine the amount of each nucleotide sequence species based upon the data collected. A data processing routine also may control an instrument and/or a data collection routine based upon results determined. A data processing routine and a data collection routine often are integrated and provide feedback to operate data acquisition by the instrument, and hence provide assay-based judging methods provided herein.


As used herein, software refers to computer readable program instructions that, when executed by a computer, perform computer operations. Typically, software is provided on a program product containing program instructions recorded on a computer readable medium, including, but not limited to, magnetic media including floppy disks, hard disks, and magnetic tape; and optical media including CD-ROM discs, DVD discs, magneto-optical discs, and other such media on which the program instructions can be recorded.


Different methods of predicting abnormality or normality can produce different types of results. For any given prediction, there are four possible types of outcomes: true positive, true negative, false positive, or false negative. The term “true positive” as used herein refers to a subject correctly diagnosed as having a chromosome abnormality. The term “false positive” as used herein refers to a subject wrongly identified as having a chromosome abnormality. The term “true negative” as used herein refers to a subject correctly identified as not having a chromosome abnormality. The term “false negative” as used herein refers to a subject wrongly identified as not having a chromosome abnormality. Two measures of performance for any given method can be calculated based on the ratios of these occurrences: (i) a sensitivity value, the fraction of predicted positives that are correctly identified as being positives (e.g., the fraction of nucleotide sequence sets correctly identified by level comparison detection/determination as indicative of chromosome abnormality, relative to all nucleotide sequence sets identified as such, correctly or incorrectly), thereby reflecting the accuracy of the results in detecting the chromosome abnormality; and (ii) a specificity value, the fraction of predicted negatives correctly identified as being negative (the fraction of nucleotide sequence sets correctly identified by level comparison detection/determination as indicative of chromosomal normality, relative to all nucleotide sequence sets identified as such, correctly or incorrectly), thereby reflecting accuracy of the results in detecting the chromosome abnormality.


The term “sensitivity” as used herein refers to the number of true positives divided by the number of true positives plus the number of false negatives, where sensitivity (sens) may be within the range of 0≤sens≤1. Ideally, method embodiments herein have the number of false negatives equaling zero or close to equaling zero, so that no subject is wrongly identified as not having at least one chromosome abnormality when they indeed have at least one chromosome abnormality. Conversely, an assessment often is made of the ability of a prediction algorithm to classify negatives correctly, a complementary measurement to sensitivity. The term “specificity” as used herein refers to the number of true negatives divided by the number of true negatives plus the number of false positives, where sensitivity (spec) may be within the range of 0 spec 1. Ideally, methods embodiments herein have the number of false positives equaling zero or close to equaling zero, so that no subject wrongly identified as having at least one chromosome abnormality when they do not have the chromosome abnormality being assessed. Hence, a method that has sensitivity and specificity equaling one, or 100%, sometimes is selected.


One or more prediction algorithms may be used to determine significance or give meaning to the detection data collected under variable conditions that may be weighed independently of or dependently on each other. The term “variable” as used herein refers to a factor, quantity, or function of an algorithm that has a value or set of values. For example, a variable may be the design of a set of amplified nucleic acid species, the number of sets of amplified nucleic acid species, percent fetal genetic contribution tested, percent maternal genetic contribution tested, type of chromosome abnormality assayed, type of sex-linked abnormalities assayed, the age of the mother and the like. The term “independent” as used herein refers to not being influenced or not being controlled by another. The term “dependent” as used herein refers to being influenced or controlled by another. For example, a particular chromosome and a trisomy event occurring for that particular chromosome that results in a viable being are variables that are dependent upon each other.


One of skill in the art may use any type of method or prediction algorithm to give significance to the data of the present technology within an acceptable sensitivity and/or specificity. For example, prediction algorithms such as Chi-squared test, z-test, t-test, ANOVA (analysis of variance), regression analysis, neural nets, fuzzy logic, Hidden Markov Models, multiple model state estimation, and the like may be used. One or more methods or prediction algorithms may be determined to give significance to the data having different independent and/or dependent variables of the present technology. And one or more methods or prediction algorithms may be determined not to give significance to the data having different independent and/or dependent variables of the present technology. One may design or change parameters of the different variables of methods described herein based on results of one or more prediction algorithms (e.g., number of sets analyzed, types of nucleotide species in each set). For example, applying the Chi-squared test to detection data may suggest that specific ranges of maternal age are correlated to a higher likelihood of having an offspring with a specific chromosome abnormality, hence the variable of maternal age may be weighed differently verses being weighed the same as other variables.


In certain embodiments, several algorithms may be chosen to be tested. These algorithms are then can be trained with raw data. For each new raw data sample, the trained algorithms will assign a classification to that sample (i.e. trisomy or normal). Based on the classifications of the new raw data samples, the trained algorithms' performance may be assessed based on sensitivity and specificity. Finally, an algorithm with the highest sensitivity and/or specificity or combination thereof may be identified.


In some embodiments a ratio of nucleotide sequence species in a set is expected to be about 1.0:1.0, which can indicate the nucleotide sequence species in the set are in different chromosomes present in the same number in the subject. When nucleotide sequence species in a set are on chromosomes present in different numbers in the subject (for example, in trisomy 21) the set ratio which is detected is lower or higher than about 1.0:1.0. Where extracellular nucleic acid is utilized as template nucleic acid, the measured set ratio often is not 1.0:1.0 (euploid) or 1.0:1.5 (e.g., trisomy 21) , due to a variety of factors. Although, the expected measured ratio can vary, so long as such variation is substantially reproducible and detectable. For example, a particular set might provide a reproducible measured ratio (for example of peaks in a mass spectrograph) of 1.0:1.2 in a euploid measurement. The aneuploid measurement for such a set might then be, for example, 1.0:1.3. The, for example, 1.3 versus 1.2 measurement is the result of measuring the fetal nucleic acid against a background of maternal nucleic acid, which decreases the signal that would otherwise be provided by a “pure” fetal sample, such as from amniotic fluid or from a fetal cell.


As noted above, algorithms, software, processors and/or machines, for example, can be utilized to (i) process detection data pertaining to nucleotide sequence species and/or amplified nucleic acid species of sets, and/or (ii) identify the presence or absence of a chromosome abnormality.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (c) receiving, by the logic processing module, the signal information; (d) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; c) receiving, by the logic processing module, the signal information; (d) detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; (e) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on a decrease or increase of the target chromosome relative to the one or more reference chromosomes based on the amount of the amplified nucleic acid species from two or more sets; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (c) receiving, by the logic processing module, the signal information; (d) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (c) receiving, by the logic processing module, the signal information; (d) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise obtaining a plurality of sets of amplified nucleic acid species prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; parsing a configuration file into definition data that specifies: the amount of each amplified nucleic acid species;receiving, by the logic processing module, the definition data; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, comprising preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; parsing a configuration file into definition data that specifies: the amount of each amplified nucleic acid species; receiving, by the logic processing module, the definition data; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise providing signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, the signal information; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprises providing signal information indicating the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, the signal information; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprises providing signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, the signal information; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprises providing signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, the signal information; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (please have someone review which modules are needed, or if we need more steps/description) receiving, by the logic processing module, signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprises providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, signal information indicating the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprises providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprises providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving, by the logic processing module, signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


By “providing signal information” is meant any manner of providing the information, including, for example, computer communication means from a local, or remote site, human data entry, or any other method of transmitting signal information. The signal information may generated in one location and provided to another location.


By “obtaining” or “receiving” signal information is meant receiving the signal information by computer communication means from a local, or remote site, human data entry, or any other method of receiving signal information. The signal information may be generated in the same location at which it is received, or it may be generated in a different location and transmitted to the receiving location.


By “indicating” or “representing” the amount is meant that the signal information is related to, or correlates with, the amount of, for example, amplified nucleic acid species. The information may be, for example, the calculated data associated with the amount of amplified nucleic acid as obtained, for example, after converting raw data obtained by mass spectrometry of the amplified nucleic acid. The signal information may be, for example, the raw data obtained from analysis of the amplified nucleic acid by methods such as, for example, mass spectrometry.


Also provided are computer program products, such as, for example, a computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, the method comprising: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (c) receiving, by the logic processing module, the signal information; (d) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, the method comprising: multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; c) receiving, by the logic processing module, the signal information; (d) detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; (e) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on a decrease or increase of the target chromosome relative to the one or more reference chromosomes based on the amount of the amplified nucleic acid species from two or more sets; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (c) receiving, by the logic processing module, the signal information; (d) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement methods for identifying the presence or absence of a chromosome abnormality in a subject, that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (c) receiving, by the logic processing module, the signal information; (d) calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (e) organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also is a computer program product, comprising a computer usable medium having a computer readable program code embodied therein, said computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, said method comprising: providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; parsing a configuration file into definition data that specifies: the amount of each amplified nucleic acid species in each set receiving, by the logic processing module, the definition data; calling the presence or absence of a chromosomal abnormality by the logic processing module; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also is a computer program product, comprising a computer usable medium having a computer readable program code embodied therein, said computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, the method comprising providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; receiving signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, the method comprising: multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) receiving signal information indicating the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Also provided are computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) receiving signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; calling the presence or absence of a chromosomal abnormality by the logic processing module; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are computer program products comprising a computer usable medium having a computer readable program code embodied therein, the computer readable program code adapted to be executed to implement methods for identifying the presence or absence of a chromosome abnormality in a subject, that comprise: (a) providing a system, where the system comprises distinct software modules, and where the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (b) receiving signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and calling the presence or absence of a chromosomal abnormality by the logic processing module; and organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (b) transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (c) displaying the identification data.


Signal information may be, for example, mass spectrometry data obtained from mass spectrometry of amplified nucleic acid. The mass spectrometry data may be raw data, such as, for example, a set of numbers, or, for example, a two dimensional display of the mass spectrum. The signal information may be converted or transformed to any form of data that may be provided to, or received by, a computer system. The signal information may also, for example, be converted, or transformed to identification data or information representing the chromosome number in cells. Where the chromosome number is greater or less than in euploid cells, the presence of a chromosome abnormality may be identified.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (b) transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (c) displaying the identification data.


Provided also are multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprise: (a) detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (b) detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; (c) based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (c) displaying the identification data.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: (a) detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (b) transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and (c) displaying the identification data.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, that comprise: (a) detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; (b) transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and (c) displaying the identification data.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject, comprising preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and obtaining a data set of values representing the amount of each amplified nucleic acid species in each set; transforming the data set of values representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identified data.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise providing signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information indicating the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identification data.


Provided also are multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprise: providing signal information indicating the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identification data.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: providing signal information indicating amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and displaying the identification data.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, that comprise: providing signal information indicating detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identification data.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, which comprise receiving signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information indicating the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identification data.


Provided also are multiplex methods for identifying the presence or absence of an abnormality of a target chromosome in a subject that comprise: receiving signal information indicating the amount of each amplified nucleic acid species in each of three or more sets of amplified nucleic acid species, where the three or more sets are prepared by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identification data.


Also provided are methods for identifying the presence or absence of a chromosome abnormality in a subject that comprise: receiving signal information indicating amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and displaying the identification data.


Provided also are methods for identifying the presence or absence of a chromosome abnormality in a subject, that comprise: receiving signal information indicating detecting signal information, where the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, where the sets of amplified nucleic acid species are prepared by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and displaying the identification data.


For purposes of these, and similar embodiments, the term “signal information” indicates information readable by any electronic media, including, for example, computers that represent data derived using the present methods. For example, “signal information” can represent the amount of amplified nucleic acid species in a set of amplified nucleic acid species. Or, for example, it can represent the presence or absence of a decrease or an increase of one or more amplified nucleic acid species. Signal information, such as in these examples, that represents physical substances may be transformed into identification data, such as a visual display, that represents other physical substances, such as, for example, a chromosome abnormality. Identification data may be displayed in any appropriate manner, including, but not limited to, in a computer visual display, by encoding the identification data into computer readable media that may, for example, be transferred to another electronic device, or by creating a hard copy of the display, such as a print out of information. The information may also be displayed by auditory signal or any other means of information communication.


In some embodiments, the signal information may be detection data obtained using methods to detect the amplified nucleic acid species of the present technology, such as, for example, without limitation, data obtained from primer extension, sequencing, digital polymerase chain reaction (PCR), quantitative PCR (Q-PCR) and mass spectrometry. In some embodiments, the amplified nucleic acid species are detected by: (i) contacting the amplified nucleic acid species with extension primers, (ii) preparing extended extension primers, and (iii) determining the relative amount of the one or more mismatch nucleotides by analyzing the extended extension primers. The one or more mismatch nucleotides are analyzed by mass spectrometry in some embodiments. Where the signal information is detection data, the amount of the amplified nucleic acid species in a set of amplified nucleic acid species, or the presence or absence of a decrease or an increase of one or more amplified nucleic acid species may be determined by the logic processing module.


Once the signal information is detected, it may be forwarded to the logic processing module. The logic processing module may “call” or “identify” the presence or absence of a chromosome abnormality by analyzing the amount of amplified nucleic acid in two, or three, sets. Or, the chromosome abnormality may be called or identified by the logic processing module based on a decrease or increase of the target chromosome relative to the one or more reference chromosomes based on the amount of the amplified nucleic acid species from two or more sets.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and transmitting the presence or absence of the chromosomal abnormality to the pregnant female subject.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by a multiplex method by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; transmitting the presence or absence of the chromosomal abnormality to the pregnant female subject.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and transmitting the presence or absence of the chromosomal abnormality to the pregnant female subject.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and transmitting the presence or absence of the chromosomal abnormality to the pregnant female subject.


Also provided are methods for transmitting prenatal genetic information to a human pregnant female subject, comprising identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and transmitting prenatal genetic information representing the chromosome number in cells in the fetus to the pregnant female subject.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by a multiplex method by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; transmitting prenatal genetic information representing the chromosome number in cells in the fetus to the pregnant female subject.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and transmitting prenatal genetic information representing the chromosome number in cells in the fetus to the pregnant female subject.


Provided also are methods for transmitting prenatal genetic information to a human pregnant female subject, which comprises identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and transmitting prenatal genetic information representing the chromosome number in cells in the fetus to the pregnant female subject.


The term “identifying the presence or absence of a chromosomal abnormality” as used herein refers to any method for obtaining such information, including, without limitation, obtaining the information from a laboratory file. A laboratory file can be generated by a laboratory that carried out an assay to determine the presence or absence of the chromosomal abnormality. The laboratory may be in the same location or different location (e.g., in another country) as the personnel identifying the presence or absence of the chromosomal abnormality from the laboratory file. For example, the laboratory file can be generated in one location and transmitted to another location in which the information therein will be transmitted to the pregnant female subject. The laboratory file may be in tangible form or electronic form (e.g., computer readable form), in certain embodiments.


The term “transmitting the presence or absence of the chromosomal abnormality to the pregnant female subject” as used herein refers to communicating the information to the female subject, or family member, guardian or designee thereof, in a suitable medium, including, without limitation, in verbal, document, or file form.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by a multiplex method by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


The term “providing a medical prescription based on prenatal genetic information” refers to communicating the prescription to the female subject, or family member, guardian or designee thereof, in a suitable medium, including, without limitation, in verbal, document or file form.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise reporting to a pregnant female subject the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and providing a medical prescription based on the presence or absence of the chromosome abnormality to the pregnant female subject.


Also included herein are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise reporting to a pregnant female subject the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise reporting to a pregnant female subject the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species; and providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


Also provided are methods for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprise reporting to a pregnant female subject the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, where the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


The medical prescription may be for any course of action determined by, for example, a medical professional upon reviewing the prenatal genetic information. For example, the prescription may be for the pregnant female subject to undergo an amniocentesis procedure. Or, in another example, the medical prescription may be for the pregnant female subject to undergo another genetic test. In yet another example, the medical prescription may be medical advice to not undergo further genetic testing.


Also provided are files, such as, for example, a file comprising the presence or absence of a chromosome abnormality in the fetus of a pregnant female subject, where the presence or absence of the chromosome abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets.


Also provided are files, such as, for example, a file comprising the presence or absence of a chromosome abnormality in the fetus of a pregnant female subject, where the presence or absence of the chromosome abnormality has been determined by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets; based on the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets, transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets.


Also provided are files, such as, for example, a file comprising the presence or absence of a chromosome abnormality in the fetus of a pregnant female subject, where the presence or absence of the chromosome abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on three or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species.


Also provided are files, such as, for example, a file comprising the presence or absence of a chromosome abnormality in the fetus of a pregnant female subject, where the presence or absence of the chromosome abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, where: (i) the extracellular nucleic acid template is heterogeneous, (ii) each nucleotide sequence in a set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, where the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets.


The file may be, for example, but not limited to, a computer readable file, a paper file, or a medical record file.


Computer program products include, for example, any electronic storage medium that may be used to provide instructions to a computer, such as, for example, a removable storage device, CD-ROMS, a hard disk installed in hard disk drive, signals, magnetic tape, DVDs, optical disks, flash drives, RAM or floppy disk, and the like.


The systems discussed herein may further comprise general components of computer systems, such as, for example, network servers, laptop systems, desktop systems, handheld systems, personal digital assistants, computing kiosks, and the like. The computer system may comprise one or more input means such as a keyboard, touch screen, mouse, voice recognition or other means to allow the user to enter data into the system. The system may further comprise one or more output means such as a CRT or LCD display screen, speaker, FAX machine, impact printer, inkjet printer, black and white or color laser printer or other means of providing visual, auditory or hardcopy output of information. In certain embodiments, a system includes one or more machines.


The input and output means may be connected to a central processing unit which may comprise among other components, a microprocessor for executing program instructions and memory for storing program code and data. In some embodiments the methods may be implemented as a single user system located in a single geographical site. In other embodiments methods may be implemented as a multi-user system. In the case of a multi-user implementation, multiple central processing units may be connected by means of a network. The network may be local, encompassing a single department in one portion of a building, an entire building, span multiple buildings, span a region, span an entire country or be worldwide. The network may be private, being owned and controlled by the provider or it may be implemented as an internet based service where the user accesses a web page to enter and retrieve information.


The various software modules associated with the implementation of the present products and methods can be suitably loaded into the a computer system as desired, or the software code can be stored on a computer-readable medium such as a floppy disk, magnetic tape, or an optical disk, or the like. In an online implementation, a server and web site maintained by an organization can be configured to provide software downloads to remote users. As used herein, “module,” including grammatical variations thereof, means, a self-contained functional unit which is used with a larger system. For example, a software module is a part of a program that performs a particular task.


The present methods may be implemented using hardware, software or a combination thereof and may be implemented in a computer system or other processing system. An example computer system may include one or more processors. A processor can be connected to a communication bus. The computer system may include a main memory, oftenf random access memory (RAM), and can also include a secondary memory. The secondary memory can include, for example, a hard disk drive and/or a removable storage drive, representing a floppy disk drive, a magnetic tape drive, an optical disk drive, memory card etc. The removable storage drive reads from and/or writes to a removable storage unit in a well-known manner. A removable storage unit includes, but is not limited to, a floppy disk, magnetic tape, optical disk, etc. which is read by and written to by, for example, a removable storage drive. As will be appreciated, the removable storage unit includes a computer usable storage medium having stored therein computer software and/or data.


In alternative embodiments, secondary memory may include other similar means for allowing computer programs or other instructions to be loaded into a computer system. Such means can include, for example, a removable storage unit and an interface device. Examples of such can include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and other removable storage units and interfaces which allow software and data to be transferred from the removable storage unit to a computer system.


The computer system may also include a communications interface. A communications interface allows software and data to be transferred between the computer system and external devices. Examples of communications interface can include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, etc. Software and data transferred via communications interface are in the form of signals, which can be electronic, electromagnetic, optical or other signals capable of being received by communications interface. These signals are provided to communications interface via a channel. This channel carries signals and can be implemented using wire or cable, fiber optics, a phone line, a cellular phone link, an RF link and other communications channels. Thus, in one example, a communications interface may be used to receive signal information to be detected by the signal detection module.


In a related aspect, the signal information may be input by a variety of means, including but not limited to, manual input devices or direct data entry devices (DDEs). For example, manual devices may include, keyboards, concept keyboards, touch sensitive screens, light pens, mouse, tracker balls, joysticks, graphic tablets, scanners, digital cameras, video digitizers and voice recognition devices. DDEs may include, for example, bar code readers, magnetic strip codes, smart cards, magnetic ink character recognition, optical character recognition, optical mark recognition, and turnaround documents. In one embodiment, an output from a gene or chip reader my serve as an input signal.


Combination Diagnostic Assays


Results from nucleotide species assays described in sections above can be combined with results from one or more other assays, referred to herein as “secondary assays,” and results from the combination of the assays can be utilized to identify the presence or absence of aneuploidy. Results from a non-invasive nucleotide species assay described above may be combined with results from one or more other non-invasive assays and/or one or more invasive assays. In certain embodiments, results from a secondary assay are combined with results from a nucleotide species assay described above when a sample contains an amount of fetal nucleic acid below a certain threshold amount. A threshold amount of fetal nucleic acid sometimes is about 15% in certain embodiments.


In some embodiments, a nucleotide species assay described in sections above may be combined with a secondary nucleic acid-based allele counting assay. Allele-based methods for diagnosing, monitoring, or predicting chromosomal abnormalities rely on determining the ratio of the alleles found in maternal sample comprising free, fetal nucleic acid. The ratio of alleles refers to the ratio of the population of one allele and the population of the other allele in a biological sample. In some cases, it is possible that in trisomies a fetus may be tri-allelic for a particular locus, and these tri-allelic events may be detected to diagnose aneuploidy. In some embodiments, a secondary assay detects a paternal allele, and in certain embodiments, the mother is homozygous at the polymorphic site and the fetus is heterozygous at the polymorphic site detected in the secondary assay. In a related embodiment, the mother is first genotyped (for example, using peripheral blood mononuclear cells (PBMC) from a maternal whole blood sample) to determine the non-target allele that will be targeted by the cleavage agent in a secondary assay.


In certain embodiments, a nucleotide species assay described above may be combined with a secondary RNA-based diagnostic method. RNA-based methods for diagnosing, monitoring, or predicting chromosomal abnormalities often rely on the use of pregnancy-specificity of fetal-expressed transcripts to develop a method which allows the genetic determination of fetal chromosomal aneuploidy and thus the establishment of its diagnosis non-invasively. In one embodiment, the fetal-expressed transcripts are those expressed in the placenta. Specifically, a secondary assay may detect one or more single nucleotide polymorphisms (SNPs) from RNA transcripts with tissue-specific expression patterns that are encoded by genes on the aneuploid chromosome. Other polymorphisms also may be detected by a secondary assay, such as an insertion/deletion polymorphism and a simple tandem repeat polymorphism, for example. The status of the locus may be determined through the assessment of the ratio between informative SNPs on the RNA transcribed from the genetic loci of interest in a secondary assay. Genetic loci of interest may include, but are not limited to, COL6A1, SOD1, COL6A2, ATPSO, BTG3, ADAMTS1, BACE2, ITSN1, APP, ATPSJ, DSCRS, PLAC4, LOC90625, RPL17, SERPINB2 or COL4A2, in a secondary assay.


In some embodiments, a nucleotide species assay described in sections above may be combined with a secondary methylation-based assay. Methylation-based tests sometimes are directed to detecting a fetal-specific DNA methylation marker for detection in maternal plasma. It has been demonstrated that fetal and maternal DNA can be distinguished by differences in methylation status (see U.S. Pat. No. 6,927,028, issued Aug. 9, 2005). Methylation is an epigenetic phenomenon, which refers to processes that alter a phenotype without involving changes in the DNA sequence. Poon et al. further showed that epigenetic markers can be used to detect fetal-derived maternally-inherited DNA sequence from maternal plasma (Clin. Chem. 48:35-41, 2002). Epigenetic markers may be used for non-invasive prenatal diagnosis by determining the methylation status of at least a portion of a differentially methylated gene in a blood sample, where the portion of the differentially methylated gene from the fetus and the portion from the pregnant female are differentially methylated, thereby distinguishing the gene from the female and the gene from the fetus in the blood sample; determining the level of the fetal gene; and comparing the level of the fetal gene with a standard control. In some cases, an increase from the standard control indicates the presence or progression of a pregnancy-associated disorder. In other cases, a decrease from the standard control indicates the presence or progression of a pregnancy-associated disorder.


In certain embodiments, a nucleotide species assay described in sections above may be combined with another secondary molecular assay. Other molecular methods for the diagnosis of aneuploidies are also known (Hulten et al., 2003, Reproduction, 126(3):279-97; Armour et al., 2002, Human Mutation 20(5):325-37; Eiben and Glaubitz, J Histochem Cytochem. 2005 March; 53(3):281-3); and Nicolaides et al., J Matern Fetal Neonatal Med. 2002 July; 12(1):9-18)). Alternative molecular methods include PCR based methods such as QF-PCR (Verma et al., 1998, Lancet 352(9121):9-12; Pertl et al., 1994, Lancet 343(8907):1197-8; Mann et al., 2001, Lancet 358(9287):1057-61; Adinolfi et al., 1997, Prenatal Diagnosis 17(13):1299-311), multiple amplifiable probe hybridization (MAPH) (Armour et al., 2000, Nucleic Acids Res 28(2):605-9), multiplex probe ligation assay (MPLA) (Slater et al., 2003, J Med Genet 40(12)907-12; Schouten et al., 2002 30(12:e57), all of which are hereby incorporated by reference. Non PCR-based technologies such as comparative genome hybridization (CGH) offer another approach to aneuploidy detection (Veltman et al., 2002, Am J Hum Genet 70(5):1269-76; Snijders et al., 2001 Nat Genet 29(3):263-4).


In some embodiments, a nucleotide species assay described in sections above may be combined with a secondary non-nucleic acid-based chromosome test. Non-limiting examples of non-nucleic acid-based tests include, but are not limited to, invasive amniocentesis or chorionic villus sampling-based test, a maternal age-based test, a biomarker screening test, and an ultrasonography-based test. A biomarker screening test may be performed where nucleic acid (e.g., fetal or maternal) is detected. However, as used herein “biomarker tests” are considered a non-nucleic acid-based test.


Amniocentesis and chorionic villus sampling (CVS)-based tests offer relatively definitive prenatal diagnosis of fetal aneuploidies, but require invasive sampling by amniocentesis or Chorionic Villus Sampling (CVS). These sampling methods are associated with a 0.5% to 1% procedure-related risk of pregnancy loss (D'Alton, M. E., Semin Perinatol 18(3):140-62 (1994)).


While different approaches have been employed in connection with specific aneuploidies, in the case of Down's syndrome, screening initially was based entirely on maternal age, with an arbitrary cut-off of 35 years used to define a population of women at sufficiently high risk to warrant offering invasive fetal testing.


Maternal biomarkers offer another strategy for testing of fetal Down's syndrome and other chromosomal aneuploidies, based upon the proteomic profile of a maternal biological fluid.


“Maternal biomarkers” as used herein refer to biomarkers present in a pregnant female whose level of a transcribed mRNA or level of a translated protein is detected and can be correlated with presence or absence of a chromosomal abnormality.


Second-trimester serum screening techniques were introduced to improve detection rate and to reduce invasive testing rate. One type of screening for Down's syndrome requires offering patients a triple-marker serum test between 15 and 18 weeks gestation, which, together with maternal age (MA), is used for risk calculation. This test assays alpha-fetoprotein (AFP), human chorionic gonadotropin (beta-hCG), and unconjugated estriol (uE3). This “triple screen” for Down's syndrome has been modified as a “quad test”, in which the serum marker inhibin-A is tested in combination with the other three analytes. First-trimester concentrations of a variety of pregnancy-associated proteins and hormones have been identified as differing in chromosomally normal and abnormal pregnancies. Two first-trimester serum markers that can be tested for Down's syndrome and Edwards syndrome are PAPP-A and free .beta.hCG (Wapner, R., et al., N Engl J Med 349(15):1405-1413 (2003)). It has been reported that first-trimester serum levels of PAPP-A are significantly lower in Down's syndrome, and this decrease is independent of nuchal translucency (NT) thickness (Brizot, M. L., et al., Obstet Gynecol 84(6):918-22 (1994)). In addition, it has been shown that first-trimester serum levels of both total and free .beta.-hCG are higher in fetal Down's syndrome, and this increase is also independent of NT thickness (Brizot, M. L., Br J Obstet Gynaecol 102(2):127-32 (1995)).


Ultrasonography-based tests provide a non-molecular-based approach for diagnosing chromosomal abnormalities. Certain fetal structural abnormalities are associated with significant increases in the risk of Down's syndrome and other aneuploidies. Further work has been performed evaluating the role of sonographic markers of aneuploidy, which are not structural abnormalities per se. Such sonographic markers employed in Down's syndrome screening include choroid plexus cysts, echogenic bowel, short femur, short humerus, minimal hydronephrosis, and thickened nuchal fold. An 80% detection rate for Down's syndrome has been reported by a combination of screening MA and first-trimester ultrasound evaluation of the fetus (Pandya, P. P. et al., Br J Obstet Gyneacol 102(12):957-62 (1995); Snijders, R. J., et al., Lancet 352(9125):343-6 (1998)). This evaluation relies on the measurement of the translucent space between the back of the fetal neck and overlying skin, which has been reported as increased in fetuses with Down's syndrome and other aneuploidies. This nuchal translucency (NT) measurement is reportedly obtained by transabdominal or transvaginal ultrasonography between 10 and 14 weeks gestation (Snijders, R. J., et al., Ultrasound Obstet Gynecol 7(3):216-26 (1996)).


Kits


Kits often comprise one or more containers that contain one or more components described herein. A kit comprises one or more components in any number of separate containers, packets, tubes, vials, multiwell plates and the like, or components may be combined in various combinations in such containers. One or more of the following components, for example, may be included in a kit: (i) one or more amplification primers for amplifying a nucleotide sequence species of a set, (ii) one or more extension primers for discriminating between amplified nucleic acid species or nucleotide sequence species of each set, (iii) a solid support for multiplex detection of amplified nucleic acid species or nucleotide sequence species of each set (e.g., a solid support that includes matrix for matrix-assisted laser desorption ionization (MALDI) mass spectrometry; (iv) reagents for detecting amplified nucleic acid species or nucleotide sequence species of each set; (vi) a detector for detecting the amplified nucleic acid species or nucleotide sequence species of each set (e.g., mass spectrometer); (vii) reagents and/or equipment for quantifying fetal nucleic acid in extracellular nucleic acid from a pregnant female; (viii) reagents and/or equipment for enriching fetal nucleic acid from extracellular nucleic acid from a pregnant female; (ix) software and/or a machine for analyzing signals resulting from a process for detecting the amplified nucleic acid species or nucleotide sequence species of the sets; (x) information for identifying presence or absence of a chromosome abnormality (e.g., a table or file thats convert signal information or ratios into outcomes), (xi) equipment for drawing blood); (xii) equipment for generating cell-free blood; (xiii) reagents for isolating nucleic acid (e.g., DNA, RNA) from plasma, serum or urine; (xiv) reagents for stabilizing serum, plasma, urine or nucleic acid for shipment and/or processing.


A kit sometimes is utilized in conjunction with a process, and can include instructions for performing one or more processes and/or a description of one or more compositions. A kit may be utilized to carry out a process (e.g., using a solid support) described herein. Instructions and/or descriptions may be in tangible form (e.g., paper and the like) or electronic form (e.g., computer readable file on a tangle medium (e.g., compact disc) and the like) and may be included in a kit insert. A kit also may include a written description of an internet location that provides such instructions or descriptions (e.g., a URL for the World-Wide Web).


Thus, provided herein is a kit that comprises one or more amplification primers for amplifying a nucleotide sequence species of one or more sets. In some embodiments, one or more primers in the kit are selected from those described herein. The kit also comprises a conversion table, software, executable instructions, and/or an internet location that provides the foregoing, in certain embodiments, where a conversion table, software and/or executable instructions can be utilized to convert data resulting from detection of amplified nucleic acid species or nucleotide sequence species into ratios and/or outcomes (e.g., likelihood or risk of a chromosome abnormality), for example. A kit also may comprise one or more extension primers for discriminating between amplified nucleic acid species or nucleotide sequence species of each set, in certain embodiments. In some embodiments, a kit comprises reagents and/or components for performing an amplification reaction (e.g., polymerase, nucleotides, buffer solution, thermocycler, oil for generating an emulsion).


EXAMPLES

The following Examples are provided for illustration only and are not limiting. Those of skill in the art will readily recognize a variety of non-critical parameters that can be changed or modified to yield essentially similar results.


Example 1: Use of Paralogs and the Problem of Variance with Samples that Comprise Heterogenous Extracellular Nucleic Acid Template

Aneuploidies such as Down syndrome (DS) are chromosomal disorders genotypically associated with severe or complete duplication of a chromosome resulting in three (3) copies of the chromosome. In the case of trisomy 21, determining the number of genomic DNA copies of chromosome 21 is the primary step in the diagnosis of T21. The compositions and methods described herein provide a PCR-based chromosome counting technique that utilizes highly homologous genomic nucleotide sequences found in at least two different chromosomes.


Highly homologous sequences often are a type of genomic segmental duplication ranging between one to hundreds of kilobases that exhibit a high degree of sequence homology between multiple genomic regions. These sequences can be classified as either intrachromosomal, within the same chromosome, or interchromosomal, within different chromosomes. In certain portions of highly homologous interchromosomal regions, there can be instances were only two regions of high homology exist on two different chromosomes, such as chromosome 21 and chromosome 14 as depicted in FIG. 1.


Thus, provided are highly homologous species of nucleotide sequences that share a degree of sequence similarity that allows for co-amplification of the species. More specifically, the primer hybridization sequences in the nucleotide sequence template generally are substantially identical and a single pair of amplification primers reproducibly amplify the species of a set. Each species of the set comprises one or more sequence differences or mismatches (herein also referred to as “markers”) that are identifiable, and the relative amounts of each mismatch (or marker) can be quantified. Detection methods that are highly quantitative can accurately determine the ratio between the chromosomes. Thus, the ratio of the first and second nucleotide sequence is proportional to the dose of the first (target) and second (reference) sequences in the sample. In the case of more than two species in a set, the ratio of the two or more nucleotide sequences is proportional to the dose of the two or more target and reference sequences in the sample. Because of their high degree of primer hybridization sequence similarity, the nucleotide sequences provided often are useful templates for amplification reactions useful for determining relative doses of the chromosome and/or chromosome region on which these sequences are located.


Variance


Before initiating the marker feasibility experiments, a series of investigative experiments and simulations were performed to help gauge and evaluate the scope and design of this marker feasibility plan. The theoretical and actual experiments that were used to shape the marker feasibility plan included:

    • 1) Simulations of the relationship between fetal percent and marker quality/quantity on the sensitivity and selectivity of T21
    • 2) Experiments investigating how 96-well and 384-well format affects marker assay variance
    • 3) Experiments investigating how marker assay variance propagated through a standard TypePLEX® protocol
    • 4) Experiments investigating how experimental processes (e.g. day-to-day, plate-to-plate) affect variance in marker assays
    • 5) Experiments investigating how multiplex level affects marker assay variance
    • 6) Experiments investigating how whole genome amplification techniques affect marker assay variance


Obiective


A series of simulations was initiated to ascertain the interplay between the signal from CCF fetal DNA in the maternal background and the number and quality of interrogating markers as well as the impact of both on the sensitivity and selectivity of T21 classification.


Experimental Outline


Using a given range of maternal background DNA and fetal DNA contribution of 1500 copies of total DNA and 15% fetal contribution and a standard TypePLEX assay variation of 3% (CV=3%), simulations were run to determine the effect of increasing the number of markers on the classification of euploid and T21 aneuploid fetal samples. Holding these values constant allowed for a general assessment of the number and quality of markers needed to achieve various classification points using sensitivity and selectivity metrics.


Conclusions


Simulations resulted in a series of observations:

    • 1) A single or a few markers is insufficient to classify T21 aneuploid samples at an acceptable level (See FIG. 2)
    • 2) Increasing the number of markers improves the classification of T21 aneuploid samples (See FIG. 2)
    • 3) Quality markers, those that exhibit the lowest CV, have a larger impact than increasing the number of markers (See FIG. 3)
    • 4) An increase in fetal DNA percent from 10 to 20% has a large impact on the sensitivity and selectivity of the markers (see FIG. 3)


These simulations indicated a few axioms that will be carried throughout the feasibility study: First, the marker feasibility must generate a very large pool of markers so that enough quality markers are identified. Specifically this means that markers from all other chromosomes, with the exception of the sex determination chromosomes X and Y, will be include in the screening process. Additionally, quality metrics of the markers including CV will be central in the marker selection process during the FH feasibility study.


Propagation of Process Variance Using Sequenom® TypePLEX® Biochemistry


Objectives


Since the highly homologous DNA approach requires discriminating between small differences between T21 and normal samples, it is imperative to minimize the measurement variability to have a successful assay. The purpose of this first experiment was to empirically determine the contribution of each step in the TypePLEX process (PCR, SAP, primer extension, MALDI-TOF MS) to the overall measurement variability. TypePLEX biochemistry is further described in Example 3 below.


Experimental Outline


A 96 well PCR plate consisting of replicates of a single gDNA sample and a single multiplex was created. Wells were pooled and re-aliquotted at various stages of the post-PCR process in order to measure the variance of each step sequentially.


Results Overview


The boxplots in FIG. 4 show the allele frequency of two different sets of markers with variance isolated at different steps in the measurement process. In both cases, the variances of the post-PCR steps are all very similar and all markedly smaller than the PCR variance.


Conclusions


The PCR step contributes the most to the overall measurement variability. This preliminary study on process variance, coupled with the 96 vs 384-well study on variance, indicate that minimizing marker variance is best achieved at the PCR step. As a result, in this feasibility PCR will be performed on a larger aliquot of sample, minimizing sampling variance, and the 96-well 50 μL PCR reaction volume reducing reaction variance. Also, methods that reduce amplification variability (e.g., amplification is done in many replicates) or do not have an amplification step (e.g., sequencing and counting of highly homologous sequence sets) may be employed.


Variance In Experimental Procedures


Objectives


Measure the day-to-day process variability of the same data set and, in a separate experiment, determine the variability of measuring the same analyte over several days and several weeks.


Experimental Outline


Over the course of four consecutive days, the same 96 well PCR plate consisting of a single sample and single multiplex was created, one plate per day. The four plates underwent post-PCR processing using the same procedures and reagents, but each plate was processed on a different day.


For the second experiment, a single PCR plate was generated and processed following PCR. Once it was ready to be spotted for MALDI measurement, it was spotted for four days per week over four consecutive weeks, with the extension products stored at 4C in between each measurement.


Results Overview


The frequency of two assays was determined from the day-to-day variability experiment. The median frequency over four consecutive days was essentially the same for assay 21_13_2FH_13_E3, while assay 21_13_2FH_2_E3 shows significant differences over the same time frame. In another experiment, the reproducibility from spotting from the same plate repeatedly over four weeks was determined. Assay 18_13_2FH_28bB_E3 shows low frequency variance during the experiment while a different assay on the same plate, 21_13_2FH_2_E3, shows high variability throughout.


Conclusions


Both the day-to-day variability and spotting reproducibility experiments show that measurements from some assays are stable over time while measurements from others vary quite significantly, depending on the day the analytes are measured. With regards to the feasibility study, process variability is shown to be correlated with the inherent properties of specific markers; therefore, those markers displaying high variability will be removed during the marker screening process.


Example 2: Identification of Nucleotide Sequence Species Useful for Detecting Chromosomal Abnormalities

Methods


After identifying the sources of variability in the process, suitable markers were identified, screened (in silico) and multiplexed. First, a set of programs and scripts were developed to search for all the paralogous (highly homologous) sequences from the target chromosome (e.g., Chr 21) and reference chromosomes (e.g., all other, non-target autosomal chromosomes). Genome sequences from the Human March 2006 Assembly (hg18, NCBI build 36) were screened. To identify polymorphic base(s) in the sequences, dbSNP build 129 (followed by dbSNP build 130 when it became available) was used.


Next, chromosome 21 (Chr 21) was divided into smaller fragments as probes. Since the desired assays typically target sequence lengths of 80-120 base pairs (bp), Chr 21 was divided into 150 bp fragments with 50 bp overlaps between adjacent fragments. This setting worked well for manual assay screening where more than 100 additional base pairs from each end were added to each stretch of homologous regions found. To capture the possible paralogous sequences near the edge of each search region in the automatic assay screening, 150 bp fragments with 75 bp overlaps, 100 bp fragments with 50 bp overlaps, and finally 100 bp fragments with 75 bp overlaps were all used. Based on these different screening strategies and an optimal amplicon length of 100 for TypePLEX assays, the best strategy appeared to be breaking up Chr 21 into 100 bp fragments with 75 bp overlaps.


Repeat sequences in each chromosome were masked by lower case in the genome and unknown sequences were denoted by N's. Fragments containing only repeat sequences or N's will not generate useful paralogous sequences; therefore, they were identified and omitted.


Unique, paralogous regions of chromosome 21 were identified in other chromosomes by aligning fragments of Chr21 with all the chromosomes in the genome (including Chr21) using BLAT (the BLAST-Like Alignment Tool). All fragments having paralogs with a homology score more than 85% and alignment length greater than 75 were pooled. Target fragments matching a single reference chromosome were selected. Fragments with multiple (more than 1) matches were not included.


Next markers from the paralogous sequences were identified using Biostrings package in R. Some paralogous sequences derived from above analysis contained large insertions in the high homology regions on the reference chromosome. These kinds of sequences were thus filtered with the span limit of 500 bp on the reference chromosome. The paralogous segments were then merged into single sequence if they were overlapping or close to each other (<=100 bp) on both Chr 21 (target) and the 2nd (reference) chromosome. RepeatMask regions and SNPs from dbSNP 130 were identified in the chromosome sequences and masked as “N” before the alignment. The paralolgous sequences from chromosome 21 and the reference chromosome were then pairwise-aligned to locate the exact mismatch locations. Several mismatches might be found from single paralogous region. Each mismatch was prepared as a mock SNP (or mismatch nucleotide) on the sequence for proper input format of the Assay Design program, and all the other mismatch positions on the same paralogous region were masked as “N” to prevent or reduce the occurrence of PCR primers or extension primer being designed over it.


Unsuitable sequences were filtered out and the remaining sequences were grouped into SNP sets. The initial markers contained all the potential mismatch sites within the paralogous regions, regardless of the sequence context. Most of the sequences could not be used due to lack of suitable PCR primers or extend primer locations. They were filtered out using Sequenom's Assay Designer with standard iPLEX® parameters for uniplex. Those assays successful for uniplex designs were then run through additional programs (Sequenom's RealSNP PIeXTEND) to ensure PCR and extend primers had high specificity for the target and reference sequences. Sequences were then sorted first by the second chromosome and then by sequence variation position on Chr 21. Sequence IDs were generated by the following convention: 2FH[version letter]_21_[2nd chr number]_[sequence index], where [version letter] is a letter indicating the version for the screening effort, [2nd chr number] is the second chromosome number in two digits and [sequence index] is the sequence index restarted for each chromosome in 0 padded three or four digits format.


In a further considereation, markers that were in close proximity to each other were not plexed to the same well due to cross amplification. All sequences were first sorted by marker position on chromosome 21. Each sequence was assigned a SNP set ID, and markers within a distance of less than 1000 bp were assigned the same SNP set ID. The SNP set IDs could be checked by Assay Designer to ensure that assays with same SNP set ID would be placed into different wells. It is possible that markers more than 1000 bp apart on chromosome 21 map to another chromosome with distance less than 1000 bp. However, if they happen to be designed into the same well, running the assays through PIeXTEND will be able to successfully identify them.


Results


Table 3 summarizes the results of marker screening for chromosome 21. Initially probes of 150 bp fragments with 50 bp overlaps from chromosome 21 were used. This strategy yielded 3057 homologous regions, from which 7278 markers (nucleotide mismatch sequences or “mock SNPs) were found for chromosome 21 versus another autosomal chromosome. Uniplex assay design considerations for these sequences showed that 1903 sequences could be designed while 5375 failed (73.9%), mostly due to lack of suitable PCR primers or extension primer.


Next, screening was performed with 150 bp probes with 75 bp overlaps, 100 bp probes with 50 bp overlaps and finally 100 bp probes with 75 bp overlaps. The 100 bp probes with 75 bp overlaps provided nearly complete coverage of all the homologous regions of chromosome 21 against the entire genome. With these probes, 2738 sequences were found successful for uniplex design with SNPs from dbSNP 129 annotated into the sequences. Since dbSNP 130 contains more SN Ps than dbSNP 129, only 2648 sequences were found successful for uniplex design with this new database. The 2648 uniplex assays were run through realSNP PIeXTEND. Three assays were found to have false extensions (invalid target for the extend primer from amplicons produced by the primer pair), and 216 assays have 3 or more hits by the PCR primer pair. 2429 assays have intended 2 hits in the genome (one on chromosome 21 and one on another autosomal chromosome)


Shorter probes and longer overlaps resulted in more successful assay targets. See Table 3. However, longer probes and shorter overlaps did produce some additional successful sequences that were not present in the final screen with 100 bp probes and 75 bp overlaps. These sequences were added to the final sequence set. The final number of unique markers for chromosome 21 and the reference autosomal chromosome was 2785. Excluding false hits and 3+hits, there were 1877 markers available for T21 assay screen. These 1877 markers were carried forward for further Sequenom MassEXTEND assay design.


In Table 3, the different versions (A, B, C, etc.) refer to the different probe to overlap lengths. The number of sequences that met the criteria for each version as well as the number that fell out are provided.









TABLE 3







Nucleotide Sequence Species Identification Results














Marker screen
version
A
B
C
E
F
2FH21F






Chr21 fragment
150/50
150/75
100/50
100/75 Repeat
100/75 Repeat
Final Sequences



Length/overlap



dbSNP 129
dbSNP 130
(100/75 repeat plus









additionals from









earlier screen)



input region
3057
3697
6096
12606
12606




output mockSNPseq
7278
8082
9150
12650
12533



Designable assay
Failed by Assay
5375
6060
6922
 9912
 9885



screen
Designer









% failed
73.9%
75.0%
75.7%
78.4%
78.9%




Uniplex Designed
1903
2022
2228
 2738
 2648
2785



Additionals
 76
 48
 13





PleXTEND
Number of false hits
  1
  1
  1
  3
  3




Number of 0 hits
  0
  0
  0
  0
  0




Number of 1 hits
 44
 66
 69
  0
  0




Number of 2 hits
1788
1875
2047
 2519
 2429
1877 (excl









H.PCR > 300)



Number of 3+ hits
 70
 80
 111
 216
 216









Example 3: Assay Design for Nucleotide Sequence Species Useful for Detecting Chromosomal Abnormalities

Introduction


Below is a detailed account of the process used to design MassEXTEND® assays to test for (fetal) chromosome 21 trisomy, as performed on the Sequenom MassARRAY® platform.


The Background section will first discuss general assay design problems and their semi-automated solutions using software developed at Sequenom. It will then discuss the similarity and differences in application of these solutions with respect to quantifying marker signals for highly homologous (paralogous) regions. The Methods section will first discuss the general design process, as it was developed for the initial test panel using ‘mix-1’ assays, and how analysis of the experimental results prompted some further parameterization. It will then detail the specific methods of the design process used to generate TypePLEX assays. The Results section presents a summary of the T21 2FH TypePLEX assay designs.


Background


Typical MassEXTEND assays are designed and run to analyze single nucleotide polymorphisms (SNPs) in DNA samples. With respect to assay design, the first task is amplification of a short region flanking the SNP site using PCR. A specific probe primer (a.k.a. extend primer) then hybridizes to the amplified sequence adjacent to the SNP site and is extended by incorporation of a nucleotide species that reads (complements) the specific nucleotide at that site. The resulting extended probe primers (analytes) are subsequently identified by the intensity of their expected mass signals (peaks) in a mass spectrum of the crystallized MassEXTEND reaction products. A typical genotyping assay will look for one of two alternative nucleotides (alleles) in diploid DNA so that either a single peak is identified, for a homozygous sample, or two equal-intensity peaks are identified, for a heterozygous sample. More generally, the signal intensities may be used as a measure of the relative frequency of the alleles, e.g. when considering pooled samples, and the sequence variation may be more complex, e.g. a tri-allelic SNP, INDEL (insertion/deletion) or MNP (multiple nucleotide polymorphism), so long as the individual alleles may be uniquely distinguished by a single base extension (SBE) of the probe. For the remainder of this report the term ‘SNP’ will be used more generally to refer any specific sequence variation between homologous sequences.


For a single MassEXTEND assay design the main concern is with oligo primer design. Each primer sequence must hybridize to its target specifically and with sufficient strength, as estimated by its predicted temperature of hybridization (Tm). In particular, there should be little chance for false extension, i.e. that the primers could target an alternative extension site or extend against themselves through relatively stable primer-dimer or hairpin substructures. However, it is relatively inefficient and uneconomical to analyze multiple SNPs in separate wells of a MassARRAY plate, and so the more general problem for assay design is to create sets of SNP assays that can be run in parallel in the same reaction space. This process is referred to as multiplexed assay design.


The first challenge for multiplexed assay design is ensuring that all expected mass signals from individual assays in a well, including those for analytes, un-extended probes and anticipated by-products such as salt adducts, are sufficiently well resolved in a limited mass range of an individual mass spectrum. Since the probe primer must hybridize adjacent to the SNP site, the freedom to design assays for mass multiplexing is restricted to adjusting the primer lengths and, in most cases, design in either the forward or reverse sense of the given SNP sequence. Additional design options, such as adding variable 5′ mass tags, may be used to increase this freedom. An equally important consideration is the additional potential for false extension of the individual assay primers with respect to targeting any other primers or amplification products of assays they are multiplexed with. Such issues may be avoided or minimized by considering alternative combinations of SNP sequences to assay in the same well. Other factors used to evaluate (i.e. score) alternative multiplexed assay designs help to avoid competitive effects that could adversely bias the performance of some assays over others, e.g. favoring multiplexes where amplicon length and PCR primer Tm values have the least variation between assays. Hence, given larger numbers of SNPs, the typical goal for multiplexed assay design is to create as few wells containing as many assays as possible, while also ensuring that each well is a high-scoring alternative with respect to individual and multiplexed assay design features.


Automated multiplexed assay design for SNP sequences has been routinely performed using the MassARRAY Assay Design Software since 2001. To date, a great many assay designs produced by the software have been validated experimentally. Enhancements to the software, chemistry, and all aspects of experimental procedure and data analysis, today allow the Sequenom MassARRAY platform to measure allele ratios to high accuracy at relatively high assay multiplexing levels. Using a computer program to design assays removes all potential for human error and ensures many suspected and observed issues of multiplexed MassEXTEND assay design are avoided. However, it is still quite common for a fraction of assays to exhibit relatively poor performance in application. Individual assays may show highly skewed heterozygous allele signals, unexpected loss of heterozygosity or even fail to produce any extension products. In most cases the reason for poor assay performance is believed to be biological in nature, i.e. due to the general validity of the given SNP sequences rather than a limitation in their subsequent assay design. For example, a given sequence may be inaccurate when compared to the current genome assembly or the region of interest may contain other SNPs that were not demarked, thereby preventing the Assay Design Software from inadvertently designing primers over these locations. Either or both PCR primers may be designed for regions that are non-specific to the genome because, for example, they overlap with an alu sequence, are subject to copy number polymorphism or are paralogous to other regions in the genome.


The assay design procedure is assisted by additional bioinformatic validation; in particular the use of the eXTEND Tool suite at the Sequenom RealSNP website to prepare input SNP sequences and validate multiplexed assay design against the human genome (Oeth P et al., Methods Mol Biol. 2009; 578:307-43). The first stage of input SNP sequence validation uses the ProxSNP application to BLAST the sequences against the current golden path (consensus human genome assembly) sequence. Those sequences that have high homology to exactly one region of the genome are reformatted to include IUPAC character codes at sites where other (proximal) SNPs are registered or ‘N’s to indicate mismatches to the genomic sequence or unknown bases. It is recommended that the reformatted SNP sequences are then given to the PreXTEND application for further validation and PCR primer design against the genome. This application first uses the same procedure for selecting pairs of PCR primers as the Assay Design Software but generates, by default, 200 of the best scoring amplicon designs rather than just the top scoring design. These are then tested using the eXTEND tool that searches for primer triplets; two PCR primers and either the forward or reverse sequence adjacent to the assay SNP. If a primer triplet matches the genome exactly once with the expected sense orientations and relative positions, the input SNP sequence is reformatted so that the aligned PCR primer sequences are demarked for subsequent constricted assay design. In this case, typically, all or most of the alternative PCR primer choices also align against the same region of the genome, and so the highest scoring PCR primer pair is selected. The scoring criterion is dominated by the consideration of the number and types of alterative matches found for the individual PCR primers. Typically, SNP sequences that have issues for PreXTEND primer design are removed from the input SNP group. The remaining reformatted sequences are processed by the assay design software using an option that ensures PCR primer design is taken directly from the annotated sequences. In this manner the specificity of MassEXTEND assay designs is assured with respect to targeting a single region of the genome, although copy number polymorphism, which is not represented in the golden path by repeated sequence, might remain an issue for the targeted regions. The assay designs produced may be further validated against the human genome using the PIeXTEND application, which uses the same eXTEND tool that tests for specific primer triplets. For assays that were processed through PreXTEND validation the individual primer triplet alignments to the genome should be identical. However, PIeXTEND also validates all combinations of primer triplets possible in each multiplex of assays to ensure that unintended amplification products or probe primer targets are not a significant issue.


Assay design to detect nucleotide differences in paralog DNA sequences is functionally equivalent to assay design for SNPs in a unique region of DNA. That is, the (common) sequence is unique with respect to targeted primer design and the variation at the equivalent position in this sequence is represented by the Sequenom SNP format. Rather than amplifying a single region of (diploid) DNA containing the probe-targeted SNP, two paralogous regions on different chromosomes are equivalently amplified by the same PCR primers and the probe primer equivalently targets the specific site of variation (nucleotide mismatch sequences) in each of the amplified regions. For the paralogous regions assayed, the site of variation is a specific marker to particular chromosome amplified, with one target region always being on chromosome 21 for the current study. Hence, in contrast to traditional SNP assays, these assays are always expected to give heterozygous results and are termed ‘fixed heterozygous’, or ‘2FH’ assays, where the ‘2’ refers to the targeting of exactly two paralogous regions that are unique to (two) different chromosomes. The paralogous regions do not have to be completely homologous in the regions flanking the targeted variation so long as the primers designed are specific to these regions, and amplification occurs in a substantially reproducible manner with substantially equal efficiency using a single pair of primers for all members of the set. Other sites of variation between paralog sequences, and any known SNPs within either region, must be denoted as proximal SNPs so that primers are not designed over these locations. In fact the paralogous regions typically have several sites suitable for such markers, and the corresponding SNP sequences provided for each chromosome 21 paralogous region are identical except for the particular marker site formatted as the assay SNP.


Because the targeted regions are not unique to the genome, the current eXTEND tool set (ProxSNP and PreXTEND) cannot be used annotate 2FH ‘SNP’ sequences. Instead, these sequences are prepared as described above in Example 2. However, the PIeXTEND eXTEND tool is of greater importance for validating such that the multiplexed assays designed by the software specifically target exactly the two paralogous regions intended and that potential cross-amplification issues due to multiplexing the PCR primers are detected. The PIeXTEND application, in combination with the assay design software, was also used in selection of the set of paralog SNP sequences used for assay design, as described in the Methods section below.


As with detecting a heterozygous SNP instance in an autosomal pair of chromosomes, it is assumed that regions containing the marker variation are co-amplified and produce mass signals of identical intensities, admitting some statistical variation due to experimental procedure. In practice, the same issues that cause variations from the 1:1 signal intensity ratios observed for SNP assays of heterozygous samples apply to 2FH assays, with the additional possibility of chromosome-specific biasing. For T21 (chromosome 21 trisomy) 2FH assay design, the requirements for the sensitivity and specificity are greater than for a standard MassEXTEND allelotyping experiment. In particular, the measurement of allele ratios must be accurate enough to detect aneuploid (trisomic) heterozygous allele contribution from fetal DNA superimposed on the 2FH allele signals of the mother's DNA. Hence, the design criteria for effects that could possibly result in (sample-specific) allele skewing are set to be more stringent than for standard multiplexed assay design. The use of more stringent assay design restrictions is viable because the number of paralog SNP sequences provided for initial assay design (˜2,000) is considerably greater than the number required for initial experimental validation (˜250).


Additionally, it is anticipated that some (the majority) of run assays may still not meet the sensitivity and specificity requirements or be otherwise less suitable. Hence, from an initial test of a larger number of TypePLEX assays (e.g. 10×25plexes) the ‘best’ assays will be selected and re-designed by the software using a ‘replexing’ option to create the targeted number of assays. The ultimate goal is to create 50 to 60 validated assays in three wells to test for chromosome 21 trisomy. This number of assays is to increase the sensitivity of detecting fractional allele variations over a background of experimental, and perhaps biological, variations.


Methods


The current procedure for T21 2FH paralog sequence selection, assay design and assay validation was devised over a series of iterations that culminated in the testing of 250 assays against sample DNA and a 56-assay panel against euploid and aneuploid plasma samples. These tests employed a slightly different SBE (single base extension) terminator mix to the ultimate panel based on Sequenom TypePLEX assays. The viability of these assays were analyzed and subsequent assay rankings considered for correlations to addressable assay design criteria. As a result, some additional assay design restrictions were specified for the TypePLEX assay design. A summary of the general methods used to create the original “mix-1” assay panel and relevant conclusions from this study are presented here, followed by a more detailed account of the methods used for the TypePLEX assay design.


Summary of 2FH “mix-1” Test Panel Design and Evaluation The original 2FH assay designs were created using a modified version of the most recent version of the Assay Design software (v4.0.0.4). This modified version of the software (v4.0.50.4) permitted assay design for the “mix-1” SBE chemistry, which uses a mix of standard deoxy-nucleotide-triphosphates (dNTPs) and acyclo-nucleotide-triphosphates (acNTPs). Further, this version was modified to allow only A/G and C/T SNP assay design. This was to ensure that a pair of alleles did not require both dNTP and acNTP probe extensions, which would be a likely source of allelic skewing. The imposed restriction also disallowed a small number of the input 2FH sequences that were INDEL or MNP paralog variations.


Initial attempts at assay design for the selected 2FH markers resulted in multiplexed assays that did not give the expected specificity to the human genome when validated using the PIeXTEND web tool. Some of the assays targeted more or fewer regions than the two expected for 2FH sequences. As a result, the initial screening for suitable paralog sequences involved an additional filtering step that employed the modified version of the software to design uniplex assays that were further screened using PIeXTEND. All sequences that had assays that did not map exactly to the expected chromosome targets were discarded from the set of 2FH markers. Similarly discarded were markers for assays that gave NULL hits to the genome, i.e. assays that would amplify a region that did contain a suitable probe target sequence. To ensure PCR primer specificity to the genome, the selected markers were further reduced to those that only had both PCR primers that individually gave 300 or less matches to the genome. The default settings for a PIeXTEND test uses quite loose criteria for PCR primer alignment: A match is recorded for a given primer using the 16 most 3° bases, containing up to one base mismatch after the first 3 most 3° ′ bases. Running PIeXTEND using the 18 most 3° bases of the PCR primers (with no mismatches) confirmed that PCR primers designed for the remaining 2FH sequences were quite specific to the amplified regions, with few assays returning more than 2 hits for both PCR primers.


A total of 1,877 paralog SNP sequences were provided for assay design composed of the ultimate 2FH21F screen plus 56 sequences from earlier screens (see Example 2). Five sequences, all from the earlier screens, were subsequently removed as a result of scanning for assays that could preferentially target one paralog region of the genome due to sequence variations, depending on the assay design direction selected. Of the 1,872 paralog sequences used for assay design, only 1,015 were designable to mix-1 assays. Most 2FH sequences that failed assay design (817 of 857) did so because of the restriction the input sequence to either [A/G] or [C/T] SNPs.


The objective for this part of the initial assay design process was to create as many 25-plex assays as possible using standard designs settings with extra restrictions, as used and described in detail for the creation of TypePLEX assays in the next section. In particular, the option to extend probe sequences using non-templated bases was disabled to prevent the possibility of a non-templated base addition that happened to actually match a SNP or paralog variation at one target site, as was previously identified as a rare exception for early designs that resulted in unexpected PIeXTEND hits (<2). Despite the increased restrictions on assay design, a relatively high yield of 25-plex mix-1 assays were created for the designable sequences because of the small mass difference between the A/G and C/T analyte masses (15 Da and 16 Da respectively).


An important criterion for 2FH assay design is that no multiplex well design should have more than one assay that targets a particular chromosome 21 paralog region. For each pair of paralog regions there are typically multiple sites of sequence variation that are suitable for MassEXTEND assay design. If two assays were designed in the same well for the same region then there could be a competition between PCR primers trying to amplify within these small regions of the genome. To avoid this, each chromosome 21 paralogous region is denoted a unique SNP_SET value. The SNP group file provided includes a SNP_SET field and is such that each paralog variation for the same SNP_SET value is given a unique SNP_ID and targets just one paralog sequence variation. Each specific variation site is denoted by the assay SNP format, with all other variations demarked as proximal SNPs (‘N’). Exclusion of assays in multiplexes based on their SNP_SET value is then achieved using the 4.0 Assay Design software feature SNP Representation: Once per well.


An initial secondary concern was to ensure that some multiplex designs give as much paralog chromosome coverage as possible. To achieve this, a copy of the SNP group file is edited to use the paralog chromosome ID as the SNP_SET values. This input was used to produce well designs at up to 21-plex where each member assay targets a paralog region in a different chromosome (1-20, 22). The first 10 wells were retained in a copy of the result assay group design and then ‘superplexed’ up to the 25-plex level in a second assay design run against the original SNP group file, containing the chr21 indices as the SNP_SET values. Superplexed assay design is the software option to design new input SNP sequences to add to existing assay designs, as possible, or create additional new well designs. Since the definition of the SNP_SET grouping is only specified by the SNP group file, the net result is a set of well designs containing 25 (or less) assays, that must each target a different chromosome 21 paralog region (SNP_SET) and where the first 10 multiplexes have the maximum number of assays targeting regions in different paralog chromosomes.


The two-pass design strategy allows for a greater choice when picking a limited number of well designs to test. For the mix-1 designs thirty one 25-plex wells were created, of which 10 were selected including the first four wells that contained at least one assay that targeted each of the 21 paralog chromosomes (1-21, 22). Analysis of the experimental results for these ten 25-plexes for euploid samples led to a quality ranking of the individual assays. Three wells were chosen to run against the plasma tissue samples, including the first 25-plex and 19-plex designed by employing the re-multiplex replex design option of the Assay Design software the assays for the top 50 ranked model assays.


Simple RMS analysis using plots of model assay rankings against various assay design features showed some very general expected trends but no significant correlation based on R2 values. Considered design features included predicted probe hybridization Tm; probe length; percentage GC sequence content in both probe and amplicon sequences; the number and severity of individual assay design warnings; amplicon length and paralog amplicon length variation; the number of paralog variations in both the amplicons and SNP_SET region; and the probe mass. The lack of correlation of assay performance to assay design features indicated that no further restrictions on future 2FH assay design with respect to these features was necessary. In particular, it was not necessary to reduce the upper mass limit (8,500 Da) for assay analyte design, which would entail a reduction in the multiplexing levels achievable.


A lack of correlation to assay performance was also noted when considering the (excess) numbers of hits of the PCR primers to the genome, as reported for PIeXTEND analysis at various PCR primer and probe matching settings. Most of this data was collected for all thirty one 25-plex designs and provided to assist in selection of the initial model set assays. However, this information did not provide a clear metric to choose between different multiplexes and was therefore not considered in selection of the 10 model wells. The subsequent lack of correlation to the relative specificity of the PCR p,/sds3fdrimer sequences indicates that the initial filtering of 2FH sequences for assay design does not require further restrictions based on the number PCR primer alignments to the genome. The PIeXTEND analysis of the candidate well designs revealed that three 25-plex wells had potential for cross-amplification issues between pairs of assays. Cross-amplification may occur when the PCR primers from two different assays in the same well could amplify an unintended region that may or may not contain a target for a probe in either assay. The assays that had this issue were from SNP_SETs that were close in index value. Although the spacing between these paralog regions is relatively far on chromosome 21 (well in excess of 1,000 bases), the paralog regions on the second chromosomes turned out to be considerably less (only 100-500 bases) so that an overlap of intended amplicon designs was detected by PIeXTEND. None of the three wells containing these assays were selected for the model run. However, a similar issue that occurred in the replexed assays that targeted the same SNP_SET appeared to show evidence that cross-amplification is a concern.


The highest correlation of assay performance rank to design features was noted for the PCR confidence score (UP_CONF) and the minimum predicted Tm (for target hybridization) for either of the PCR primers of an assay, which is a key component of the UP_CONF calculation. This correlation was greater when the minimum predicted Tm for PCR primers were plotted against the probe extension yield and call rate for the assays. That some PCR primers were designed with Tm's as much as 20° C. below the optimum target value of 60° C. was not anticipated and was a result of limited choice for primer design in some input strands due to a relatively high density of proximal SNP demarcations. In consequence, the settings for the minimum PCR primer design Tm was set to 50° C. for TypePLEX assay design.


Another apparent correlation of assay performance rank was observed with respect to SNP_SET index. Assays of SNP_SET index of 1 to 44 appeared to have more consistently moderate or poor rankings. These regions were closest to the 5′ telomeric end of chromosome 21 and included all paralog regions to chromosome 22. Model set assays that targeted chromosome 22, and also possibly chromosomes 20, 17 and 16, appeared to have more consistently moderate or poor rankings, and may be an indication of chromosome-specific degradation. However, 25% of 2FH paralog sequences were members of SNP_SETs of index 1 to 44, and a test design without these sequences in the input set resulted in a corresponding loss of approximately 25% of the assay designs. For the TypePLEX assay designs it was decided to retain these 2FH marker sequences for design and note this observation when considering the ultimate set of assays selected for the TypePLEX T21-2FH panel.


2FH TypePLEX Assay Design


The TypePLEX assays were created using the most recent version of the Sequenom Assay Design software (4.0.0.4), employing standard TypePLEX (formally iPLEX) termination nucleotides without restriction on the particular SNPs. The same procedure of assay design and validation was followed as used for the mix-1 test run but with the modification of three design settings in the Assay Design software prompted from analysis of the mix-1 test results, as described below.


The same input set of 1,872 2FH sequences were initially used to create TypePLEX assay designs. However, PIeXTEND analysis showed that four assays had 3-hits to the genome. The corresponding 2FH sequences were removed from the SNP group to leave 1,868 input sequences. Despite the additional TypePLEX design restrictions, the lack of restriction on the allowed SNPs meant more of the input 2FH sequences are designable to assays (1,749 cf. 1,015). (In fact, all input sequences are designable to TypePLEX assays at standard design settings.) However, since individual TypePLEX assays may have allele mass differences as high as 79.9 Da, fewer high-multiplex designs may be created (25 vs. 31). With the addition of the 10Da minimum mass separation of un-extended probe signals, less than half as many TypePLEX 25-plex wells were created compared to the mix-1 designs (15 vs. 31). Hence for the initial set of candidate assay designs, all TypePLEX well designs containing 20 or more assays were considered for testing. These assay designs were validated against using the PIeXTEND web tool on Genome Build 36 (March, 2006) at the Sequenom RealSNP website, as detailed in the Results section below.


TypePLEX assay design was again performed in two steps to control which sequences of sets of 2FH were allowed to be multiplexed together in the same well. The first pass designed multiplexed assays using a Max. Multiplex Level setting of 21 and the SNP Set Restriction option set to Once per well to create wells in which each assay targeted a different paralog chromosome (1-20, 22). All assays in wells below a certain size were discarded to allow the corresponding 2FH sequences to be re-designed. The remaining assays were superplexed with the original 2FH sequences, with the chromosome 21 region as the SNP_SET value, using a using a Max. Multiplex Level setting of 25. Apart from the changes to the settings of Max. Multiplex Level and Assay Type (iPLEX then Superplex), all assay designer settings were the same for both design passes. The most important settings governing assay design features are detailed below with respect to the three primary components of assay design; amplicon (PCR primer) design, extend (probe) primer design and multiplexed assay design. Some settings relating to design options that are not relevant to standard TypePLEX assay design, or more algorithmic in nature, are not detailed here.


In the following sections, the numbers of assays or multiplexes affected by changing a particular design setting are provided. These are in respect to all other design settings being at their final values but these numbers should only be regarded as an approximate quantification of the individual design restraints, since the combination of multiple feature restraints is not represented as sum effect of applying individual restraints.


Amplicon Design Settings


The term ‘amplicon’ refers to the double-stranded DNA sequence that is the amplified region targeted by a PCR reaction. Amplicon design is a process of choosing the most suitable pair of PCR primers against the input sequences such that it contains the sequence variation (SNP) of interest and is within specified length requirements. For 2FH assay designs the standard settings for the minimum, optimum and maximum amplicon lengths were used; at values 80, 100 and 120 respectively. This length includes the non-targeted PCR primer 5′ 10-mer hME-10 tags used in standard MassEXTEND assay design, as specified in Assay Designer Amplicons Settings dialog window. The use of universal PCR primer tags, and a small variation in small amplicon lengths, is known to enhance and assist balance of amplification rates in multiplexed PCR reactions. An exemplary universal 10 mer tag used with the assay designs provided in Table 4 is the following: ACGTTGGATG (SEQ ID NO: 1). The Sequence Annotation option is set to its default setting of Scan and Restrict. This option affects how primers are preferentially chosen if the SNP sequence is annotated using character type casing. The particular option chosen is not effective for the 2FH sequences since they are provided as all uppercase characters. This option allows any 10-mer sequence repeats affecting PCR primer design to be avoided, although it is assumed that such repeats are unlikely due to the preparation the 2FH sequence set provided.


PCR primer design consists of evaluating targeted sequences on either side of the assay SNP then choosing the suitable pair of sequences that best meet amplicon length requirements. Primer sequence must be specific and may not target a region containing demarked sequence variations, e.g. other assay SNPs, proximal SNPs denoted by IUPAC codes or otherwise masked by ‘N’ characters. The masking of proximal variations for 2FH sequence design contributed to the majority (95%) of design failures in combination with restraints on PCR and probe primer design.


Restrictions on primer design and weightings on individual design features, affecting how the best pair of primers is ultimately selected, are configurable to the assay design software. These are typically left at their standard default values for assay design since they have proved to be effective. The length of targeted PCR primer is constricted to between 18 and 24 bases, with an optimum length target of 20 bases. The optimum fractional G.0 base content for the targeted sequence is set to 50% and the optimal predicted hybridization Tm for the sequence, using the 4+2 rule, is set to 60° C. Typical SNP sequences have sufficient scope for primer sequence selection that often all three of these optimum conditions are met, resulting in a specific and thermodynamically suitable primer design. However, this may not be the case where sequences have a high A.T base content or are restricted due to the presence of non-specific base codes. To address an observation of a possible correlation between assay performance and PCR primer predicted Tm's for the mix-1 2FH assay designs, the minimum Tm for primer design was set to 50° C., with the maximum retained at its standard value of 80° C. The application of this minimum Tm constraint resulted in the loss of 58 2FH assay designs. The score weighting settings that adjust how effectively primer design meets the optimum values for these restraints were not altered from their default values (1.0).


Other relevant settings for PCR primer design include considerations for the numbers of sequential G bases, false priming of the PCR primers to the same amplicon region and false extension of the primers against themselves due to strong dimer or hairpin substructure formation. Moderate potential for false extension of PCR primers, resulting in them becoming useless for amplification, is typically considered as only having a minor effect on PCR performance and these settings are left at their default values. However, as a result of observing a possible correlation between mix-1 assay performance and PCR design confidence score (UP_CONF), the option to include the hME-10 tags in the hairpin/homodimer analysis was enabled. This has the effect of debarring some primer designs that might have a strong potential for 3° extension against the full 5° sequence and resulted in the loss of 11 2FH TypePLEX assay designs.


Other assay design settings available for controlling single-assay amplicon design, such as score weightings for optimum amplicon length and heterodimer potential between the pair of PCR primers, were kept at their default values.


Extend Probe Design Settings


Restrictions on probe (extend) primer design are similar to those for PCR primers but length and composition is ultimately chosen based on mass and other multiplexed assay design concerns. Again, most available design settings were kept at their default values for moderate level multiplexing SBE (iPLEX) assay design, as have proved to be highly successful for multiplexed assay design in practice.


Probe primer length is controlled by the Oligo Length settings, which were set at minimum and maximum values of 17 and 30 bases respectively. The minimum value limits the size of the smallest extend primers designed and may be effectively set as low as 15 bases, since these sequences need only be specific to short strands of DNA (the amplicons resulting from PCR amplification). The higher value of 17 is used to ensure specificity, extension rates and because far more iPLEX chemistry has been performed at this setting. The maximum value governs the maximum extended length of the probes, i.e. for the allele analytes anticipated. Oligo length is the primary degree of freedom for MassEXTEND assay design, along with the freedom to design either forward or reverse sense assays to target the corresponding strand of the amplicon.


The constraints on the predicted targeted Tm for probe primer design are set to a minimum of 45° C. and a maximum of 100° C., as calculated by the Nearest Neighbor method, which is the default option. The values predicted by the Assay Design software using this method are known to be about 10° C. too low because the calculation does not consider effect of Mg ions on DNA duplex stabilization. The default minimum value was initially chosen as to give approximately the same probe designs as those created by the earliest versions of the software using the 4+2 (G.0 content) rule, where a 60° C. minimum temperature requirement had been recommended based on findings from an early hME assay design experiments. The findings did not indicate the necessity of an upper limit to probe primer Tm and the default value of 100° C. is chosen to be significantly larger than the predicted Tm for any probes typically designable by the software. These limits have since been validated over many assay runs and used for all iPLEX assay designs. Subsequent selection of probe sequences for assay design are not dependent of the predicted Tm value, although a component of internal probe design scoring does consider the fractional G.0 content relative to an optimum value of 50%. This is only a minor consideration for (alternative) probe design and the weighting factor for this component was left at its default value (1.0).


Standard assay design allows probe sequences to be extended at the 5° end with a small number bases that do not match the target DNA sequence, for the sake of mass multiplexing. This option was disabled for 2FH assay design by setting the Non-templated 5° Base Addition: Maximum Allowed value to 0. This restriction was primarily chosen so that the non-templated sequence was not designed over a proximal variation, thereby leading to differential primer hybridization to the two amplified paralog regions. Disallowing non-templated probe base extensions restricts probe design to just the specific sequence flanking the assay SNP. For the 2FH TypePLEX assays changing this setting from the default value reduced the number of 25-plexes designed by 67%.


The potential for false extension of the probe primer is given more internal weighting than for PCR primer design. Such extensions lead directly to false-positive genotyping results or significantly skewed allele frequencies. The potential for false extension is estimated by matching primer sequence to a sliding target such that the primer is able to extend (at the 3° end). Alternative extension targets include a primer molecule's own 5° tail (hairpin), another molecule of primer (homodimer) or either amplicon strand (false priming). The algorithm considers single-base mismatches, multiple-base mismatch loops and alternative choices of open and clamped loops. The largest ΔG value (most negative) for tested hybridization alignments is used to estimate the potential for extension. This estimate also includes a contribution based the number of bases in the 3° clamp of the hybridized structure, to account for a lack of general correlation of AG predictions with assumed instances of false extension. Settings available in the software related to Nearest Neighbor thermodynamics and extend hybridization potential were not changed from their default values.


The potential for false priming of a probe to its targeted amplicon is scored such that a relatively high ΔG prediction for partial 3° sequence hybridization exists at an alternative binding site relative to that for binding to the target site. This is typically a rare occurrence, requiring an exact complementary match of 8 to 10 bases primer at the 3° end. For the 2FH assay designs the score weighting for the probe False Primer Potential was set to 1.2. Using a feature score weighting value of 1.2 ensures that the particular feature is more heavily penalized during selection of alternative probe designs and debars assay design that would otherwise produce a high-moderate warning for the measured feature at standard settings (feature potential >0.416). For 2FH TypePLEX assays, no sequence failed design due to changing this value from the default value (1.0).


Extension of a probe primer through homodimer or hairpin hybridization is similarly analyzed. The potential for hairpin extension is typically considered moderately strong for a complementary alignment of four or more 3° bases, with a hairpin loop of 3 or more bases. The potential for dimer extension is typically considered moderately strong for a complementary alignment of five or more 3° bases, or longer alignments including one or more base-pair mismatches. For the 2FH assay designs the score weighting for the probe Hairpin/Dimer Extension Potential was also set to 1.2, to prevent extend probe designs that would a moderate warning at the default value (1.0). For 2FH TypePLEX assays, changing this value from the default value resulted in 51 sequences failing TypePLEX assay design.


Multiplexing Design Settings


Because of technical variance a single marker often is not sufficient for classification of disease state; therefore, multiple markers are required to reduce the variance and improve the accuracy. Thus, the invention provides, in part, multiplexed assays for the detection of chromosomal abnormalities from maternal samples comprising fetal nucleic acid—preferably procured through non-invasive means. A typical maternal plasma sample from a pregnant female has between 4-32% (+−2%) cell-free fetal nucleic acid. In order to reliably and accurately detect a fetal chromosomal abnormality, with sufficient specificity and/or sensitivity suitable for a high degree of clinical utility, in a background of maternal nucleic acid, sensitive quantitative methods are needed that can take advantage of the increased power provided by using multiple markers (e.g., multiple sets (from 2-1000's) of nucleotide species). By increasing both the number of sets and the number of species per set, the specificity and sensitivity of the method can be high enough for robust clinical utility as a screening test or diagnostic test—even in a sample that comprises a mixture of fetal and maternal nucleic acid. Further, the sex determination assay may be used to determine the amount of fetal nucleic acid present in the sample. Likewise, other assays to determine the amount or concentration of fetal nucleic acid present in a sample may be incorporated into the aneuploidy detection assay.


When designing multiplexed MassEXTEND assays, the primary concern of is that analyte signals from extended primers are well-resolved in the resulting mass spectrum. The molecular masses of probe primers and their extension products are easily calculated and constrained to the more conservative mass window recommended. The Lower Limit and Upper Limit values for the mass range were set to 4,500 Da and 8,500 Da respectively. This upper mass limit effectively limits maximum length for analyte sequences to 28 bases and prohibits the overlap of mass signals for singly charged (low mass) species and those for possible double and triple charged (high mass) species. The Min Peak Separation setting for analyte mass peaks was kept at its default value (30 Da). This value ensures that analyte sequences of any assay in a multiplex design do not overlap with any anticipated peaks from any other assay they are multiplexed with. It also ensures that analyte peaks are at least 8 Da separated from sodium and potassium ion adduct peaks, which are the most frequently observed salt adduct peaks in TypePLEX mass spectra. Specific additional by-product and fixed-mass contaminant signals may be specified to be avoided in multiplexed assay design but are not used for the 2FH assay designs. The Min Peak Separation setting for mass extend primers (probes) was set to 10 Da, the recommend setting for low multiplexing. This prevents un-extended probe signals in the mass spectrum from overlapping, thereby ensuring that the measurement of extension rate may be accurately estimated for all assays. (The default value of 0 was used for the mix-1 assay designs.) Adding this multiplexing restriction on the TypePLEX 2FH assay designs reduced the number of 25-plex wells created from 26 to 15 wells.


The False Priming Potential score weighting value for multiplexed primer design was set to 1.2 for the 2FH sequence designs. This reduces the likelihood that probe or PCR primers of one assay extend at an alternative site in any single-stranded amplicon sequence from another assay it is multiplexed with. This is a very low frequency occurrence at standard design settings and using a higher weighting here ensures that even moderate potentials for false priming between assays are disfavored. For 2FH TypePLEX assays, changing this value from the default value (1.0) had no significant effect on the assay designs.


The Primer-Dimer Potential score weighting value for multiplexed primer design was set to 1.2 for the 2FH sequence designs. This reduces the likelihood that a probe primer from one assay could extend off a probe primer from another assay it is multiplexed via heterodimer hybridization. As with probe homodimers and hairpins, apparent false extension has been observed at Sequenom for 3° base hybridizations with as few as 4 bases matched and is the primary reason why small sets of input sequences may fail to be multiplexed design to the same well. When the set of input sequences is large compared to the multiplexing level, as with the 2FH designs, it is usually possible to distribute probe sequences to allow for a greater number of high level multiplexes, but warnings for moderate primer-dimer extension potential are more common. Using a higher weighting here ensures that even moderate potentials for false probe extension are avoided. For 2FH TypePLEX assays, changing this value from the default value (1.0) removed 465 such warnings but reduced the number of 25-plex wells designed from 28 to 15.


Other design settings relating to multiplexing were kept at their default values. These design options are not used for standard TypePLEX assay design or not considered of particular significance for 2FH assay design. In particular, the option to use exchange replexing for de novo assay design was used and the Superplex with new SNPs option retained for superplexed assay design. The Minimum Multiplexing Level setting was set at its default value of 1, since there was no reason to restrict the wells to a minimum size at the design stage.


Results


The input set of 1,868 2FH sequences were initially designed to 1,749 assays processed in 347 wells using chromosome ID as the SNP_SET grouping. The four 21-plex, two 20-plex and five 19-plex assay design were retained for superplex assay design. These were superplexed with the original 1,868 2FH sequences at a maximum multiplexing level of 25, using chromosome region (index) as the SNP_SET grouping, to create 1,749 assays in 95 wells. From these designs, the fifteen 25-plex, thirteen 24-plex, nine 23-plex, seven 22-plex, four 21-plex and six 20-plex wells were retained as potential assay designs. The first 11 wells listed are original 21, 20 and 19 assay wells superplexed with additional 2FH sequences to well sizes of 25, 23, 23, 25, 24, 24, 22, 23, 22, 21 and 25 assays respectively.


The 54 wells, containing 1,252 assays in wells of size 20 to 25 assays, were validated by the PIeXTEND tool as all giving exactly 2 triplets of assay primer alignments to the human genome, for the expected chromosome 21 and paralog chromosome regions. PIeXTEND analysis also revealed that two wells (W27 and W53) contained pairs of assays that produced cross-amplification hits to the genome. Assays 2FH21F_01-046 and 2FH21F_01_071 were removed to avoid potential cross-amplification issues in the corresponding wells, leaving well W27 as a 23-plex and well W53 as a 19-plex. The remaining 54 wells, containing 1,250 assays, were provided for initial 2FH TypePLEX assay development. These assays are provided below in Table 4A.


In Table 4A, each “Marker ID” represents an assay of a set of nucleotide sequence species, where the set includes a first nucleotide sequence species and a second nucleotide sequence species. Table 4 provides assay details for each of the 1252 nucleotide sequence sets. As described herein, sequence sets comprise highly homologous sequences (e.g., paralogs) from a target chromosome (e.g., Ch21) and a reference chromosome (e.g., all other, non-target autosomal chromosomes). Each sequence set has a Marker ID, which provides the target and reference chromosome numbers. For the target chromosome, the chromosome number (CHR_1), the genomic nucleotide mismatch position (Marker_POS1), the genomic strand specificity (SENSE1—F (forward) or R (reverse)), the genomic nucleotide mismatch base (Marker 1), and the amplicon length (AMP_LEN1) are provided. Corresponding information is provided for the corresponding reference chromosome: the chromosome number (CHR_2), the genomic nucleotide mismatch position (Marker_POS2), the genomic strand specificity (SENSE2—F (forward) or R (reverse)), the genomic nucleotide mismatch base (Marker_2), and the amplicon length (AMP_LEN2). Marker positions are based on Human Genome 19 from The University of California Santa Cruz (Assembly GRCh37). The PCR1 and PCR2 primer sequences amplify both the target and reference nucleotide sequences of the set, and the marker nucleotide bases are interrogated at the marker positions by the Extend primer sequence. The PCR1 and PCR2 primer sequences may also comprise a 5′ universal primer sequence (e.g., the following 10-mer sequence was used in the Examples provided herein: ACGTTGGATG (SEQ ID NO: 1)). In certain embodiments, the nucleotide variant in the “Marker_1” and “Marker_2” column for an assay is the first nucleotide extended from the 3′ end of an extension primer shown.

























TABLE 4A









Mark-




Mark-


SEQ

SEQ

SEQ



CHR_
Marker_

er_
AMP_
CHR_
Marker_

er_
AMP_

ID

ID

ID


Marker_ID
1
POS1
SENSE1
1
LEN1
2
POS2
SENSE2
2
LEN2
PCR1
NO:
PCR2
NO:
Extension
NO:































2FH21F_01_003
21
17601200
F
G
90
1
9110229
R
T
 90
GGTTTGGATGATGTGTTGC
   2
CCTTGAGAAACTAAGTGACC
1254
TAAGTGACCTGCTTCT
2506

















CAGCTGT






FH21F_01_006
21
17811372
R
A
91
1
52326378
F
C
 91
TGATGATGGGCCAGGAAATG
   3
GCTGTCTAATAGAAGCTTAC
1255
TGTTACAGCCAATATT
2507

















TAAGGA






2FH21F_01_007
21
17811413
R
C
90
1
52326337
F
T
 90
ATTGGCTGTAACAAATGCTG
   4
CACTCAAGTTTCCCTCTTGC
1256
CTCTTGCTGTCTAATA
2508

















GAAGCTTAC






2FH21F_01_009
21
17811526
F
C
119
1
52326224
R
A
119
ATACCCTCCTGCATGCTTAG
   5
TCCAAGTCCTCTTAAAGGAG
1257
TTTTACCAGTGCTCCC
2509

















C






2FH21F_01_010
21
17811675
F
T
119
1
52326075
R
G
121
CAGCAAGGTTGAAATTGGGA
   6
GGGCCAGTACCATTTCATAG
1258
ATAGAATGCCCATTTG
2510

















TG






2FH21F_01_011
21
17811688
R
C
117
1
52326060
F
T
119
GGGCCAGTACCATTTCATAG
   7
GCAAGGTTGAAATTGGGAAT
1259
TCAGAAGAAAATAGGC
2511















G

CA






2FH21F_01_012
21
17811715
R
C
107
1
52326033
F
C
109
TCATAGAATGCCCATTTGTG
   8
TTCAGCAAGGTTGAAATTGG
1260
CAAGGTTGAAATTGGG
2512

















AATGT






2FH21F_01_013
21
17811745
F
G
98
1
52326003
R
G
 98
GCCTTATCCTGTATCCTAGC
   9
CATTCCCAATTTCAACCTTG
1261
TTCAACCTTGCTGAAA
2513















C

AA






2FH21F_01_014
21
17811765
R
A
100
1
52325983
F
C
100
TCCCAATTTCAACCTTGCTG
  10
TGCCAGCCTTATCCTGTATC
1262
TGTATCCTAGCTGTTC
2514

















TTAA






2FH21F_01_015
21
17811858
F
A
118
1
52325890
R
G
121
TGTAAGATTTTGTTCCCTC
  11
GCTAGCTATTCCAGTTTGAA
1263
TTGAAATCTACCAAAC
2515

















TGTAA






2FH21F_01_017
21
17811925
R
T
100
1
52325820
F
T
100
CACCTAGCTTGAGAAGGATG
  12
TGAGGGAACAAAATCTTAC
1264
GGAACAAAATCTTACA
2516

















AAAGG






2FH21F_01_018
21
17811943
F
T
100
1
52325802
R
T
100
CACCTAGCTTGAGAAGGATG
  13
TGAGGGAACAAAATCTTAC
1265
GGGATTAGGCACTCGC
2517

















T






2FH21F_01_020
21
17812111
R
G
103
1
52325634
F
G
103
AAGAAGTTCTTCTGGGTCTG
  14
CTTCATGCTGGAGTAATGGG
1266
GGGTAACATATCTTTG
2518

















GTATGGTT






2FH21F_01_021
21
17812175
F
C
91
1
52325570
R
A
 91
TTTTCATACACTTCTCTGG
  15
CCCATTACTCCAGCATGAAG
1267
GTGGCAAAATACCTCA
2519

















AGA






2FH21F_01_022
21
17812184
R
C
118
1
52325561
F
A
118
CAGTGGCAAAATACCTCAAG
  16
TTTTACCATTAGTGGTTTG
1268
ATTTTTCATACACTTC
2520

















TCTGG






2FH21F_01_023
21
17812224
R
G
118
1
52325521
F
T
118
CAGTGGCAAAATACCTCAAG
  17
TTTTACCATTAGTGGTTTG
1269
ACCATTAGTGGTTTGA
2521

















TTTTAAT






2FH21F_01_025
21
17812302
F
T
116
1
52325443
R
G
116
CTCCCTCCCCAGTAGAAATA
  18
ATCCAAGATACTCACTTTCC
1270
ACTCACTTTCCATTAA
2522

















TTCTGTGT






2FH21F_01_026
21
17812307
R
A
116
1
52325438
F
C
116
ATCCAAGATACTCACTTTCC
  19
CTCCCTCCCCAGTAGAAATA
1271
TTTGTTACTTTTCTTT
2523

















TCCCCC






2FH21F_01_027
21
21493445
R
A
115
1
47924051
R
G
116
CTTTCATTGCAAAATGTTTC
  20
CATTTCAAAATCTCTGGCCC
1272
GTTTATTAATGCAGAG
2524













C



CTCTC






2FH21F_01_029
21
22448020
F
A
84
1
33174864
F
G
 85
AGATTCTCTGGTCACAGG
  21
TATCTGGTAAGAAATTGTG
1273
TCTCAGAATTTCCCTG
2525

















G






2FH21F_01_030
21
27518134
F
T
97
1
95697485
F
C
 97
GAGGCAACTAGGACTTAAGG
  22
GTACTCAAATCAAATTGGC
1274
TACTCAAATCAAATTG
2526

















GCTTACTTGC






2FH21F_01_031
21
27518141
R
T
97
1
95697492
R
C
 97
GTACTCAAATCAAATTGGC
  23
GAGGCAACTAGGACTTAAGG
1275
GCCAACATCCATGAAA
2527

















AACAA






2FH21F_01_033
21
29350581
F
A
116
1
145141386
F
C
117
GGTGAAGGCTGTATTTGTAG
  24
CCAGCCAAGAATACAAACAC
1276
CCAGCCAAGAATACAA
2528

















ACACAAAATA






2FH21F_01_034
21
29350590
R
T
116
1
145141395
R
G
117
CCAGCCAAGAATACAAACAC
  25
GGTGAAGGCTGTATTTGTAG
1277
TGATGTTTTCTTATTC
2529

















TCCTTA






2FH21F_01_036
21
29350625
R
G
119
1
145141431
R
A
120
CCAGCCAAGAATACAAACAC
  26
GTAGGTGAAGGCTGTATTTG
1278
GGTGAAGGCTGTATTT
2530

















GTAGTAGTA






2FH21F_01_037
21
29355542
F
G
93
1
145141768
F
C
106
ATTAAGAAGTTTGCTGAGGC
  27
CATTGGCCTTAACTCCAGAG
1279
GCCTTAACTCCAGAGT
2531

















TTTCT






2FH21F_01_038
21
29355550
R
G
96
1
145141789
R
A
109
CATTGGCCTTAACTCCAGAG
  28
GCTATTAAGAAGTTTGCTGA
1280
TTTGAAGCTATTCCCC
2532















G

G






2FH21F_01_039
21
29356359
R
G
90
1
145141960
R
A
 90
AGAACTTTGAAAGTATTAAC
  29
GCTCTACAGACAATCTGATG
1281
CATAGAAAGGGCAGTA
2533

















GA






2FH21F_01_040
21
29357621
R
G
87
1
145142269
R
A
 87
GCTATTGCTGATACTGGTGC
  30
AATGAAGAGCCATGTCTGCC
1282
GTCTGCCACTTTGCCA
2534

















CCTGTTACTAC






2FH21F_01_041
21
29357656
F
G
120
1
145142304
F
A
120
GGAACAGTGTTGATAAAGAC
  31
CACCAGTATCAGCAATAGCT
1283
ACCAGTATCAGCAATA
2535













T

T

GCTTTGACTT






2FH21F_01_043
21
29361150
R
G
91
1
145142637
R
A
 91
AGCTTGGCCAGAAATACTTC
  32
GAAGTCTCATCTCTACTTCG
1284
CATCTCTACTTCGTAC
2536

















CTC






2FH21F_01_044
21
29361182
F
T
106
1
145142669
F
C
106
GCAGAAAAGCTCATGAGATT
  33
GTACGAAGTAGAGATGAGAC
1285
GAAGTAGAGATGAGAC
2537













C



TTCATCAA






2FH21F_01_045
21
29361209
R
A
106
1
145142696
R
G
106
GCAGAAAAGCTCATGAGATT
  34
GTACGAAGTAGAGATGAGAC
1286
AGGTTTTTTGCAGAAC
2538













C



AAC






2FH21F_01_046
21
29361246
R
G
109
1
145142733
R
A
109
ATCTCGAAGGTTTTTTGCAG
  35
AGGTCATAGAAGGTTATG
1287
GGTCATAGAAGGTTAT
2539

















GAAATAGC






2FH21F_01_049
21
31679773
R
T
120
1
9351912
F
G
134
CATTCATCAGAATGTGACCC
  36
CATTACCCCCTTATTATTTT
1288
AAGATTTTCCTCCCTC
2540















G

CT






2FH21F_01_050
21
31679795
F
G
120
1
9351890
R
T
134
CATTACCCCCTTATTATTTT
  37
CATTCATCAGAATGTGACCC
1289
AGGAGGGAGGAAAATC
2541













G



TTTAA






2FH21F_01_057
21
33849236
R
A
86
1
155945466
R
T
 86
AGTCGGAGTCATACTCCAAG
  38
GCTAAAGCTCCTTCTTCTAC
1290
CTCCTTCTTCTACCCA
2542

















CAGA






2FH21F_01_058
21
33849456
R
G
109
1
155945581
R
A
109
CTGTGGTAAGAAGACGAAGC
  39
GGATGGGAGATCTGCTAAAC
1291
TTGATCGCCTTAATCT
2543

















GA






2FH21F_01_059
21
33849485
R
A
113
1
155945610
R
G
113
GGATGGGAGATCTGCTAAAC
  40
CTGTGGTAAGAAGACGAAGC
1292
CGATCAAGAACACCCT
2544

















T






2FH21F_01_060
21
33851363
F
C
116
1
155945724
F
T
116
AGGTGCAGGCTTTAGGTTTG
  41
GATAAGGCTCAATTACTTG
1293
AGGCTCAATTACTTGA
2545

















AATAGC






2FH21F_01_062
21
33851411
R
A
96
1
155945772
R
G
 96
TAATGCAGCTGCCATGTGTG
  42
TATAGTAGGTGGAGGTGCAG
1294
GGTGGAGGTGCAGGCT
2546

















TTAGGTTTGG






2FH21F_01_063
21
33851469
F
G
105
1
155945830
F
A
105
CTCAGTTAGTTCTTCTATAG
  43
AAACCTAAAGCCTGCACCTC
1295
GAGAAAGTTGCTAAAA
2547













T



AGTCA






2FH21F_01_064
21
33853810
R
A
96
1
155946048
R
C
 96
ATTGCTGCAGCAAAACCA
  44
GAGATCCAGATGATACAGGG
1296
TGATACAGGGAATTCT
2548

















TTTGTTAA






2FH21F_01_065
21
33853850
F
C
85
1
155946088
F
T
 85
CATTCTCCATAAACACTATC
  45
GAATTCCCTGTATCATCTGG
1297
TATCATCTGGATCTCA
2549

















ACAT






2FH21F_01_067
21
33861377
R
T
92
1
155946234
R
C
 92
CTCTACAGCAATGAGTGAAC
  46
CCTGAGCTCTATTTAACATG
1298
TGCATTCTCACTGAGT
2550















C

CTTTTCTGAGC






2FH21F_01_068
21
33861410
F
T
112
1
155946267
F
C
112
CCTGAGCTCTATTTAACATG
  47
TACAGCAATGAGTGAACGGG
1299
AGACTCAGTGAGAATG
2551













C



CATTTGA






2FH21F_01_071
21
33869988
F
G
113
1
155946671
F
A
113
TCAGGGCCACTATCATGGAC
  48
AGGCAAACATCCTGTGTCTG
1300
GTGTCTGCTTTGATGG
2552

















A






2FH21F_01_072
21
33870000
R
A
104
1
155946683
R
G
104
TCCTGTGTCTGCTTTGATGG
  49
TCAGGGCCACTATCATGGAC
1301
CAGGTGGTTGCCACCT
2553

















TCT






2FH21F_01_073
21
33870731
F
T
103
1
155946943
F
C
103
TTATAAAACCTCAATCTATC
  50
CAATGGGCCTTGTACCAAAG
1302
CTCATGGCTAATGCCA
2554

















C






2FH21F_01_077
21
33870871
R
A
85
1
155947085
R
G
 85
GGTACAAAAATCAAAGCCTG
  51
GGCAATTTAAGACATTGTG
1303
AGACATTGTGTAAAAA
2555

















GCAATCTGTA






2FH21F_01_078
21
33870951
F
C
96
1
155947165
F
T
 96
TCGTTTGGATGTTAGCCAC
  52
AACCATACAGGGTTTTGGTA
1304
GGTTTTGGTATGTTTA
2556

















TATTGTTTA






2FH21F_01_080
21
33871006
R
C
82
1
155947220
R
T
 82
AGTGGCTAACATCCAAACGA
  53
TTAACATTCCACACTGAAG
1305
CATTCCACACTGAAGA
2557

















TTACTCT






2FH21F_01_081
21
33871091
R
G
93
1
155947305
R
C
 93
GTACTATGATGTAACTCCCC
  54
CACAGCCCTTCACTGATTAC
1306
TTACAGGCAAGTGTTA
2558

















CAGTAG






2FH21F_01_082
21
33871149
R
A
105
1
155947363
R
G
105
GTAATCAGTGAAGGGCTGTG
  55
GATCACCTCAATAACACTGG
1307
ATCTGTCCAGCAGAAC
2559

















CCA






2FH21F_01_083
21
33871170
F
A
108
1
155947384
F
C
108
CTTGATCACCTCAATAACAC
  56
GTAATCAGTGAAGGGCTGTG
1308
GTTCTGCTGGACAGAT
2560

















A






2FH21F_01_084
21
33871198
F
A
113
1
155947412
F
C
117
CAAAATTTTGAGGGGAGATG
  57
TGGGTTCTGCTGGACAGATA
1309
AGTGTTATTGAGGTGA
2561













G



TCAAG






2FH21F_01_086
21
33871220
R
C
119
1
155947438
R
A
123
TGGGTTCTGCTGGACAGATA
  58
CCTCTACAAAATTTTGAGGG
1310
CAAAATTTTGAGGGGA
2562

















GATGGT






2FH21F_01_088
21
33871351
F
C
116
1
155947568
F
G
120
GTAAAACTATATCACAACTC
  59
GGGTCATAAGAAGGGAGTAA
1311
AGGGAGTAAAAAATGA
2563

















AGTCTGA






2FH21F_01_090
21
33871453
F
G
105
1
155947674
F
A
105
GTGGCTGGTTGCCAATTTTA
  60
TGAATTTCAGCTACACCTAG
1312
CAGCTACACCTAGATA
2564

















GAC






2FH21F_01_093
21
33871568
R
A
118
1
155947788
R
G
117
ATTGGCAACCAGCCACTATT
  61
TACCACTGTAATACACATG
1313
CCACTGTAATACACAT
2565

















GAAATAT






2FH21F_01_094
21
33871608
F
C
91
1
155947828
F
T
 91
ATTTGGGCCTTAAGCTTTTG
  62
TTCATGTGTATTACAGTGG
1314
ATTACAGTGGTATTCA
2566

















TATGCTATGT






2FH21F_01_099
21
34436974
F
C
130
1
51085302
F
T
121
CTGTTGTAAGGGGAAAAGTC
  63
ACTGCTCACTGACAGCTTCT
1315
CTGACAGCTTCTCTGT
2567

















AA






2FH21F_01_101
21
39590986
F
C
120
1
13755946
R
C
120
GAGGCTCAGTAGAGGTTTAG
  64
CAGAACATAGGTTTGAAGC
1316
GGTTTGAAGCAGTCAC
2568

















A






2FH21F_01_102
21
39591032
R
G
115
1
13755900
F
T
115
CATAGGTTTGAAGCAGTCAC
  65
GAGGCTCAGTAGAGGTTTAG
1317
CTCAGTAGAGGTTTAG
2569

















TATGATG






2FH21F_01_104
21
39591411
R
C
98
1
13755518
F
A
 98
ACAGTGTCCTGATTAGTGCC
  66
TGCCAGACTGGTTTGTTAGC
1318
TTGTTTCTTAGTGCTC
2570

















TAGCCAT






2FH21F_02_003
21
13535069
F
A
111
2
132391742
R
A
115
AATTTTATAGAGAAGCCTG
  67
GTGTCTCATAGTCACTGGTC
1319
CATAGTCACTGGTCCA
2571

















TAGTAAGTAT






2FH21F_02_007
21
13543483
F
C
112
2
132383343
R
C
112
CACCTTACCCTGCCATCAAG
  68
CCATTCTTGCAACAGTTCCC
1320
AGTTCCCAGAAAAGAA
2572

















GAGGAATGTG






2FH21F_02_015
21
14091492
F
A
111
2
138411388
F
G
112
CATAGGTGAGAAAAGTTTGG
  69
GGGAAAAAAAGTGCACCT
1321
AAAAAGTGCACCTTTT
2573













G



CTTA






2FH21F_02_017
21
14091523
R
C
85
2
138411420
R
T
 86
CTCTTCCAGAGTGTTCTCTA
  70
CATAGGTGAGAAAAGTTTGG
1322
TGGGGAAAGAACTTGA
2574















G

A






2FH21F_02_018
21
14091561
F
T
112
2
138411458
F
G
113
CCCTACACTCCTTCTTCTTT
  71
TTCCCCAAACTTTTCTCACC
1323
CCAAACTTTTCTCACC
2575

















TATGTTT






2FH21F_02_019
21
14091590
R
G
112
2
138411488
R
A
113
TTCCCCAAACTTTTCTCACC
  72
CCCTACACTCCTTCTTCTTT
1324
CTTCTTCTTTATAGGA
2576

















ACACATTGC






2FH21F_02_020
21
14091662
F
A
120
2
138411560
F
G
120
CTCACTGTACATCCATCCTC
  73
AAAGAAGAAGGAGTGTAGGG
1325
TTTAGCTCTAGAGGAT
2577

















GAG






2FH21F_02_021
21
14091679
R
T
120
2
138411577
R
C
120
AAAGAAGAAGGAGTGTAGGG
  74
CTCACTGTACATCCATCCTC
1326
ACATCCATCCTCAAAC
2578

















TG






2FH21F_02_022
21
14091732
F
T
115
2
138411630
F
C
115
GCAGAGATATCATGCACA
  75
TAGTGAGGGGCTTTTTCCAC
1327
GCTTTTTCCACCTTGA
2579

















A






2FH21F_02_023
21
14091983
F
T
91
2
138411876
F
G
 97
GGCATGGGGCTTTCTTGCT
  76
ACCCCATGTAAACCTTGAGC
1328
TTGAGCACACTGCAAA
2580

















GTCAT






2FH21F_02_027
21
14092079
F
T
105
2
138411979
F
A
105
GCCTCTCAGGCACCATTCT
  77
TTATCACGTGACTTCAGTGG
1329
CAGCTCCCCTACATAC
2581

















C






2FH21F_02_034
21
14092568
R
T
84
2
138412473
R
G
 84
CCATTGCCAAAGTTGTGGTT
  78
GTGGAATTCTCCTTGGACTC
1330
GGAATTCTCCTTGGAC
2582

















TCTTTTGTCTC






2FH21F_02_035
21
14092619
R
T
92
2
138412524
R
C
 92
GAGTCCAAGGAGAATTCCAC
  79
ATACTCTTATCCAGTTCAGC
1331
CTCTTATCCAGTTCAG
2583

















CTTTGTTTGTC






2FH21F_02_036
21
14092764
R
A
98
2
138412667
R
C
 98
TGGTGACAAGGTGAAAAGGG
  80
GGAGGAGATATGGTGCAGAG
1332
GAGGAGATATGGTGCA
2584

















GAGCTCTCAG






2FH21F_02_037
21
14380512
F
C
93
2
38777773
F
A
 93
CATAAGCCACTTTTTCAGA
  81
CTCTTCAAATGCACCTAGTG
1333
TTCAAATGCACCTAGT
2585

















GTCACAAGAA






2FH21F_02_038
21
14390371
F
C
120
2
38790295
F
G
121
AAGCACCTTGGGAATTTTT
  82
GGAAAGGGAAAAAAACCTGC
1334
GGAAAGGGAAAAAAAC
2586

















CTGCAGCATA






2FH21F_02_040
21
14396267
F
T
85
2
38796979
F
C
 85
ACACAGATTCCTCCCATAGC
  83
TCCAGAAGGAGGCCCTGGT
1335
CCAGAAGGAGGCCCTG
2587

















GTGTACTA






2FH21F_02_041
21
14437193
F
C
110
2
208014410
F
T
110
TTGTGGAGTAGGCATATTTC
  84
TTTTAATCAGAATCATAGAG
1336
CAGAATCATAGAGTAA
2588

















AAATTGC






2FH21F_02_043
21
14437253
F
T
99
2
208014470
F
C
 99
GGGATTCCATTATCTGGTC
  85
GAAACTCTAGAAAAACCCAG
1337
AAATATGCCTACTCCA
2589

















CAA






2FH21F_02_045
21
16149874
R
A
86
2
225225486
F
A
 86
CCTGAGTTTTAAGTGCCACA
  86
ACAAGTCTGAGAGCCTAAAG
1338
GAGCCTAAAGGCAGGA
2590













T



TGTG






2FH21F_02_050
21
18127404
F
T
93
2
208185957
R
G
 93
AGACTTTTTGTACAGTAAG
  87
GAGTGTGTCACTTAAGGTC
1339
TGTCACTTAAGGTCTT
2591

















AGACTG






2FH21F_02_055
21
18128107
R
C
85
2
208185567
F
A
 87
GTTTTCTAATTTTCTGGATG
  88
ATAGCACTAACAGCTCAAGG
1340
CTAACAGCTCAAGGAA
2592

















TGTAT






2FH21F_02_057
21
18433865
R
A
113
2
95536170
F
A
113
CAGTGGAATCCTGGGAAATT
  89
GCCATTACCTGCAACCATGT
1341
CTGCAACCATGTTGTT
2593

















TTATT






2FH21F_02_058
21
18433901
F
C
102
2
95536134
R
A
102
AAACTAACAGCCTGGAATAC
  90
AACATGGTTGCAGGTAATGG
1342
ACATGGTTGCAGGTAA
2594

















TGGCAACAAG






2FH21F_02_061
21
18434055
R
G
103
2
95535979
F
T
104
CTAATTTTTAGAAAGAGTAC
  91
ATTTGTACAGTTTCCCATTC
1343
CCATTCCCATTCCCAC
2595















C

CTTT






2FH21F_02_062
21
18434167
F
C
113
2
95535867
R
A
114
AGTGGCAGAAGATGGAATAG
  92
TATGGTGCTAAAAAGGACTG
1344
TGCTAAAAAGGACTGT
2596

















TATCTAA






2FH21F_02_063
21
18434195
R
T
113
2
95535838
F
T
114
TATGGTGCTAAAAAGGACTG
  93
AGTGGCAGAAGATGGAATAG
1345
GGAATAGTACAATAAG
2597

















ATAAGGA






2FH21F_02_065
21
18434275
R
T
110
2
95535758
F
G
110
ACTATTCCATCTTCTGCCAC
  94
TTTATTAAATCAGTCTGGG
1346
AATCAGTCTGGGAAGG
2598

















CA






2FH21F_02_066
21
18434542
F
T
82
2
95536686
F
C
 82
ACATCATATAGAAAGGGCAG
  95
GTATAACATTATACAGAGAG
1347
TATACAGAGAGGACAG
2599















G

TGGTAAACT






2FH21F_02_067
21
18434573
F
T
99
2
95536717
F
A
 99
CAAACTGTAAACAGTGGTCC
  96
ACTGCTGCCCTTTCTATATG
1348
GCTGCCCTTTCTATAT
2600

















GATGTAAT






2FH21F_02_072
21
18435016
R
A
94
2
95537160
R
G
 94
TTTAGAGCTCTTGCATCTTG
  97
TCAAATGTGAGGAAAGTGCC
1349
ACATAAAATGTTACCA
2601

















AACAGATGGG






2FH21F_02_073
21
18435097
R
G
111
2
95537238
R
A
108
TGGCACTTTCCTCACATTTG
  98
GTGCCAGAACATTCTGAATC
1350
GAATCTTAGTGTGGAA
2602

















AAAAAAA






2FH21F_02_074
21
20848805
F
A
102
2
33521214
F
T
102
GAAAAAAGTGCATGTCTTTG
  99
GGAAAAGATTATGATGCAC
1351
GAAAAGATTATGATGC
2603

















ACTGGCCTG






2FH21F_02_075
21
20848810
R
A
97
2
33521219
R
T
 97
AGATTATGATGCACTGGCCT
 100
GAAAAAAGTGCATGTCTTTG
1352
GATGAATGCAGTGAAG
2604

















TC






2FH21F_02_076
21
20848832
F
C
96
2
33521241
F
A
 96
GAAAAAAGTGCATGTCTTTG
 101
GATTATGATGCACTGGCCTG
1353
ACTTCACTGCATTCAT
2605

















CAGC






2FH21F_02_077
21
20848839
R
G
101
2
33521248
R
A
101
GATTATGATGCACTGGCCTG
 102
ATTATGAAAAAAGTGCATGT
1354
AAAAAAGTGCATGTCT
2606

















TTGT






2FH21F_02_088
21
28215571
F
G
99
2
132405073
R
A
 99
ATTAATACAAGGGGGTGTTC
 103
CTTAAAATTAGGGATCAGA
1355
TAGGGATCAGAATCTC
2607

















AAC






2FH21F_02_089
21
28215882
R
T
95
2
132404762
F
T
 95
GTCTACCAAACTACAATTAG
 104
CTGAAGAAGTGTAAAAATGG
1356
GGCAACATGCATATAG
2608















C

AG






2FH21F_02_090
21
28215945
R
C
102
2
132404699
F
A
102
GCATGTTGCCATTTTTACAC
 105
TTGTCCTTAGGCACAAATGG
1357
TTAGGCACAAATGGAA
2609

















ATAGT






2FH21F_02_091
21
28215990
F
C
116
2
132404654
R
A
117
CCAAATTTTCAAGCAAAGC
 106
GTGCCTAAGGACAACTTTTT
1358
GACAACTTTTTCTTTT
2610















C

TCTTCT






2FH21F_02_103
21
28234536
R
A
92
2
132386044
F
C
 95
GGAGTTGACAATTACATCT
 107
AAACAATGGGTTCTAGAAA
1359
AAACAATGGGTTCTAG
2611

















AAAAAAAAA






2FH21F_02_107
21
28264424
R
A
112
2
132366224
F
C
112
GGAAAGTTAGAAGGCCACAC
 108
CCCAGATGAAGGGGTTTTAG
1360
TTTAGTATTGAATTTA
2612

















GTGCTTAG






2FH21F_02_108
21
28264470
F
G
111
2
132366178
R
A
116
GATTGTGGGTTTTTGGAAAG
 109
ACTAAAACCCCTTCATCTGG
1361
CCCCTTCATCTGGGAC
2613

















TCAA






2FH21F_02_111
21
28264552
R
A
85
2
132366091
F
C
 85
CTTTCCAAAAACCCACAATC
 110
CTGCTAACTCAGATACCTGC
1362
CTCAGATACCTGCATG
2614

















TCA






2FH21F_02_113
21
28264816
F
T
116
2
132365833
R
G
116
TGTCTCTGGCATTCCCTATC
 111
CTTCTATCAGCAAGTTAG
1363
TTTTGTTTCATTTTTG
2615

















TCACAT






2FH21F_02_116
21
28278126
R
C
119
2
132352487
F
A
118
AGGGCTGCAGGGACAGTAG
 112
GTCTCACATCCCATTTACAG
1364
ATTTACAGTTTATGTG
2616

















TCAGCTAC






2FH21F_02_127
21
31597156
F
C
86
2
231393191
R
A
 86
GTTTGCCAGTTCAAATTCAG
 113
CTAGCAAAGAATAATCATAT
1365
TAGCAAAGAATAATCA
2617













C

C

TATCAATTTC






2FH21F_02_129
21
31597201
F
G
85
2
231393146
R
A
 85
TAGTGATATGAAGATCACA
 114
CCATGCTGAATTTGAACTGG
1366
AACTGGCAAACTCTGA
2618

















T






2FH21F_02_132
21
31597387
F
G
94
2
231392961
R
T
 94
TAGTCATAGGTGTCCTATGG
 115
AATACTGATAATTTGCAGC
1367
AGGAACAGGACATTAA
2619

















AAAAA






2FH21F_02_134
21
31597421
F
C
81
2
231392927
R
C
 86
CTGAATAATTAAAACTTTGG
 116
CCCATAGGACACCTATGAC
1368
AGGACACCTATGACTA
2620













C



GGAA






2FH21F_02_139
21
31597560
R
G
116
2
231392784
F
T
116
GAAAGAAAAGGTGCTCTACA
 117
AATGAATCTGCCAGATCTGT
1369
AATGAATCTGCCAGAT
2621













G



CTGTGAATGA






2FH21F_02_143
21
32833444
F
A
93
2
286903
R
C
 93
GGCAATGAGTTCCATAAGTT
 118
TCTGATTTATACTGAGGAC
1370
AGGACAAATTAAAGAA
2622

















AGTAATTTAT






2FH21F_02_144
21
32833448
R
C
93
2
286899
F
A
 93
TCTGATTTATACTGAGGAC
 119
GGCAATGAGTTCCATAAGTT
1371
GAGTTCCATAAGTTTA
2623

















CTCTTC






2FH21F_02_145
21
32833749
F
T
90
2
286607
R
G
 90
TCTCCTCACTGTGCACAGG
 120
ACAAACGCTGCACCTTGCAC
1372
CACACCTGGGTCCCTG
2624

















C






2FH21F_02_146
21
32834036
R
G
120
2
286312
F
T
120
CACACCTGGTTGTCAGCAC
 121
GGAGCTGAGAATGACAGTTG
1373
CAGTTGTTAAGCCAGA
2625

















C






2FH21F_02_148
21
34197714
R
T
118
2
206809213
R
C
118
TTGTTGCTCCAAGTTTAAG
 122
AAGACCAAGATTCAGAAGC
1374
GCAGGGCTATGCGGGA
2626

















G






2FH21F_02_150
21
36183313
F
C
120
2
106922404
R
A
119
GATTATTTTGGTACTAACAA
 123
GAAATGAAGTGCAGGAAAGC
1375
AAATGAAGTGCAGGAA
2627

















AGCCCTGTG






2FH21F_02_151
21
36424390
R
A
101
2
32089093
F
G
101
GGCCGGGGCCAGGGCTTT
 124
CAATCACCACAAACTCCGGC
1376
CCCACGGCGGCCTCAC
2628

















C






2FH21F_02_155
21
43701408
R
G
115
2
112908101
R
T
114
TGCCAAACAGCAGACGCAG
 125
CAGCATCGCTGCCTTCTTG
1377
AGCTCGGGCGCCCCAC
2629

















C






2FH21F_02_156
21
43701502
R
T
115
2
112908195
R
A
115
TGACAGAGAAGGGCTGCAAG
 126
AAGAAGGCAGCGATGCTGG
1378
CATCTGCCCATCCCAT
2630

















CTGC






2FH21F_02_157
21
43701520
R
C
96
2
112908213
R
G
 96
GGAGAAACTGACAGAGAAGG
 127
TCCATCTGCCCATCCCATCT
1379
TCCACACACCGCCCTG
2631

















C






2FH21F_02_158
21
43701558
F
C
86
2
112908251
F
T
 89
CCCGATGGGAACTCTCATTT
 128
CAGCCCTTCTCTGTCAGTTT
1380
TCTCTGTCAGTTTCTC
2632

















CAT






2FH21F_02_159
21
43701561
R
T
81
2
112908257
R
C
 84
AGCCCTTCTCTGTCAGTTTC
 129
CCCGATGGGAACTCTCATTT
1381
GGAACTCTCATTTATC
2633

















ACCAAACCA






2FH21F_02_163
21
43701756
R
G
96
2
112908452
R
C
 96
ATGGCTAGGATGCCCCAGAC
 130
TCTGAACCCTTAGTTAGGAC
1382
GGGCCCTCCTTTCCAC
2634

















TTC






2FH21F_02_168
21
43702318
R
A
101
2
112909018
R
G
101
GGTGGTGGGCAGCATCTGG
 131
ACTTCACCGGATGATCTGGG
1383
CGCGCGAGTGTGGAAG
2635

















AAA






2FH21F_02_170
21
43702512
R
A
103
2
112909212
R
G
103
AAGGATAGAACAAGGTCCCG
 132
ATCCAGCCATCCACGCTCAG
1384
CCTCCTCCCTCGCTCT
2636

















C






2FH21F_02_172
21
43702610
F
T
109
2
112909310
F
C
109
GGGACATTATTAGCAAGGAG
 133
AAAAGTCCTCAGGACCTGCC
1385
AAAACGCCCTGTGAGC
2637

















TCTCC






2FH21F_02_173
21
43702645
F
T
115
2
112909345
F
C
115
CAGGGTCCTTTTCTTTTGGG
 134
CAAAACGCCCTGTGAGCTCT
1386
CTCTCCTTGCTAATAA
2638

















TGTCCCACA






2FH21F_02_174
21
43702740
R
A
117
2
112909440
R
G
117
TCAGGAAGAAACAGTCAGGC
 135
ATGAAAGTGGCCCCCTGCTC
1387
GCTCCACCTGCCGAGT
2639

















C






2FH21F_02_175
21
43702782
F
A
96
2
112909482
F
G
 96
TTCCAGCCTGAGGCTGTTTC
 136
TCCTCAGACTCTCCCCTTG
1388
GGGCAGGGAAACCTGC
2640

















CG






2FH21F_02_177
21
43702889
R
C
112
2
112909589
R
T
112
CCATTGAAGCATTCAGCAGG
 137
AAGGGAGGCTGCCCAGGAC
1389
CTGTGGGGCGGGGCTG
2641

















GTC






2FH21F_02_178
21
43702910
F
T
115
2
112909610
F
C
115
AGCAAGGGAGGCTGCCCAG
 138
CCATTGAAGCATTCAGCAGG
1390
GACCAGCCCCGCCCCA
2642

















CAGG






2FH21F_02_181
21
43702989
F
A
100
2
112909689
F
C
101
AGTGTCTGCAGTTTTCTGGG
 139
GGATGAGCAGCTCGCAATAG
1391
GCTCGCAATAGGCCCC
2643

















C






2FH21F_02_182
21
43703008
R
A
99
2
112909709
R
G
100
GATGAGCAGCTCGCAATAGG
 140
AGTGTCTGCAGTTTTCTGGG
1392
CTGGGGTGCCCCCGTC
2644

















CTC






2FH21F_02_184
21
43703202
R
G
108
2
112909903
R
C
108
CTCTCCGGCCAGGCCTCTC
 141
TGACCCAGATTCCTGAAGAG
1393
GGGCCTGGATGCTGGG
2645

















TG






2FH21F_02_185
21
43703225
R
A
108
2
112909926
R
G
108
CTCTCCGGCCAGGCCTCTC
 142
TGACCCAGATTCCTGAAGAG
1394
CCAGATTCCTGAAGAG
2646

















GGGATGACTA






2FH21F_02_189
21
43704043
F
G
104
2
112910745
F
A
104
TCTTAAGCCCTTGCCCCCTG
 143
GGAAGAGCGTGGAGCAAGA
1395
GAGCAAGAGGAGGAGG
2647

















CTCGGCCCAG






2FH21F_02_190
21
43704153
F
C
117
2
112910855
F
T
117
GATCCCTATCTCTGTCTGCG
 144
GTCTCAATCTTGTTGGCCAG
1396
GGCCAGTTTATGAAAG
2648

















TCAAGCCTA






2FH21F_02_191
21
43704243
R
C
105
2
112910945
R
T
105
AGAGATAGGGATCGCTCCAG
 145
TGGGTGTTCTGCAGGCTGG
1397
GGGTGGAGGTGCTCCA
2649

















GGACT






2FH21F_02_193
21
43704508
F
G
108
2
112911210
F
A
108
TCATGTGGGGCTGGTGTAG
 146
CCACCCCCACCCCGTCAC
1398
CCCACCCCGTCACGCG
2650

















CAT






2FH21F_02_194
21
43704539
F
C
107
2
112911241
F
T
107
AGGAGGAGGAGCCCACACTG
 147
AGACACTGACCCCCAGAGAC
1399
CTGGTGTAGGCGTGGG
2651

















GTGGAC






2FH21F_02_195
21
43704601
F
C
111
2
112911303
F
T
111
GGTCAAAGGTCCTGCACAC
 148
TACACCAGCCCCACATGAG
1400
GTCTCTGGGGGTCAGT
2652

















GTCTG






2FH21F_02_200
21
43704890
F
A
118
2
112911592
F
G
118
CAAGAGTTCAGATGAGTGGC
 149
TCCTCCAGGACTGGCCAAGT
1401
CCCCAGGCTCCTCCCC
2653

















C






2FH21F_02_204
21
44919978
F
T
108
2
86224659
F
C
109
GGAGTGCTTTCTTTGCAACT
 150
CAAACATTATTTTGATTGGC
1402
TTTTGATTGGCCTCAC
2654

















AAG






2FH21F_02_206
21
44920113
F
G
118
2
86224795
F
A
118
AAGGAAATCAGCAGTGATA
 151
GGTGTTAACATTTAGAACAG
1403
AACATTTAGAACAGTA
2655

















CTTGTAA






2FH21F_02_207
21
44920284
F
T
89
2
86224967
F
C
 89
TGGCTGAAGGAAGCCCGAAT
 152
GCTGGCATATGCTGTCAGGA
1404
TGCTGTCAGGATTTCC
2656

















A






2FH21F_02_208
21
44920330
F
T
107
2
86225013
F
C
107
TTTGTCAATCAGCTGAAGGG
 153
TATCTGTTTCGTTTCTAGGG
1405
GCTTCCTTCAGCCAGT
2657

















C






2FH21F_02_211
21
44920379
R
A
119
2
86225062
R
G
119
CCCTTCAGCTGATTGACAAA
 154
TCCTATTGCATTGAGCATGG
1406
GCATGGTGATCTGGAG
2658

















CTAG






2FH21F_02_212
21
44920544
R
A
90
2
86225231
R
C
 90
GAAGTACTGGTACAAGCTAT
 155
TGCTGTTCAAAAACTGGCCC
1407
TGGCCCGAAGGGTAGC
2659

















AATGATTGAT






2FH21F_02_213
21
44920587
F
A
88
2
86225274
F
G
 88
CAGTGAAGAGACCCTTAGAG
 156
CAATCATTGCTACCCTTCGG
1408
GCCAGTTTTTGAACAG
2660

















CATA






2FH21F_02_214
21
44920594
R
A
94
2
86225281
R
G
 94
CAATCATTGCTACCCTTCGG
 157
GGGTGTACAGTGAAGAGAC
1409
GTGAAGAGACCCTTAG
2661

















AG






2FH21F_02_215
21
44920624
F
T
108
2
86225311
F
C
108
CAGCTATCCCTCCAGAGTC
 158
TCGGGCCAGTTTTTGAACAG
1410
CTTCACTGTACACCCC
2662

















A






2FH21F_02_216
21
44920652
F
T
118
2
86225339
F
C
118
GCCATCAAAGCCAACTGTTC
 159
GTCTCTTCACTGTACACCCC
1411
AGGGACTCTGGAGGGA
2663

















TAGCTG






2FH21F_02_217
21
44920732
R
A
92
2
86225419
R
G
 92
CAGAACAGTTGGCTTTGATG
 160
CAGCATGAAGACCTCATCTG
1412
AAGACCTCATCTGCAG
2664

















AAA






2FH21F_02_218
21
44920793
R
A
81
2
86225480
R
G
 81
TAATGCCTCCACTGAAAGCC
 161
TGCAGTTGCTGAAGAGGAAG
1413
AAGAGGAAGCCAGAAA
2665

















AGCC






2FH21F_02_219
21
44921280
R
C
91
2
86232120
R
G
 91
AGCTCTCTGTTCAGCTGATC
 162
CTCTCTACTGATGATCTGAA
1414
TACTGATGATCTGAAC
2666

















TCCCT






2FH21F_02_220
21
44921506
F
T
87
2
86240216
F
C
103
CCTTTTTGACCACATTATCC
 163
AAGAGGTTGCTGGGGCCAAG
1415
GGCCAAGCCTCATATA
2667

















A






2FH21F_02_223
21
44921778
R
T
110
2
86247236
R
C
110
GTTGGAGTGTGCATTGACAG
 164
GAAGATGCTCTGAGGCAAA
1416
TCTGAGGCAAACTGCA
2668

















A






2FH21F_02_226
21
44922084
R
G
94
2
86254352
R
A
 94
TGTTTTTGGAGTTGTGAGGC
 165
GGTCCACTAAAAATCTCTAG
1417
AAATCTCTAGTGTATC
2669

















AGAAGTAA






2FH21F_02_227
21
44922157
F
G
84
2
86260096
F
C
 84
ACTCAGACAAACTCTTCGAG
 166
TTCTTTGGCAATGGAACAT
1418
TTTGGCAATGGAACAT
2670

















TATAAG






2FH21F_02_228
21
44922175
R
C
92
2
86260114
R
T
 92
GGCAATGGAACATTATAAG
 167
GAAAACCATACCTTACTCAG
1419
ATACCTTACTCAGACA
2671

















AACTCTTCGAG






2FH21F_02_230
21
46917919
R
A
87
2
92420
F
A
 87
GTATAAATAATGTTCAGTTA
 168
ACTGGTCTTTTACCTAGATG
1420
TACCTAGATGATTGCT
2672













TC



TCTAAAT






2FH21F_02_232
21
46918360
R
A
92
2
91979
F
C
 92
GTAAAATCTTGTAAGTTGCA
 169
TTATGCCACTTGAGTGGGAG
1421
ACATTGTTGGTCCAAT
2673













G



ACTAAT






2FH21F_02_234
21
46918645
F
A
115
2
91692
R
C
115
AGGTGCAACTCCAAAAAAGC
 170
AATCTTGAACCAGTGGTTCT
1422
ACCAGTGGTTCTGGCT
2674

















CC






2FH21F_02_235
21
46918651
R
T
112
2
91686
F
C
112
CTTGAACCAGTGGTTCTGGC
 171
AGGTGCAACTCCAAAAAAGC
1423
TGAGTTACAAAGATTA
2675

















TGACAAG






2FH21F_02_236
21
46918748
R
G
107
2
91589
F
T
107
GGAGTTGCACCTGTTCCTTG
 172
GGAATGACAAATTGCCAAAT
1424
TGACAAATTGCCAAAT
2676















C

CATGTCTTA






2FH21F_02_239
21
46918867
F
C
85
2
91470
R
T
 85
TTGTGGAGGATTATTTCTGC
 173
TCCTTCTTATAACAGTGGGC
1425
ATAACAGTGGGCTTTC
2677

















ACAAT






2FH21F_02_241
21
46919142
F
T
112
2
91213
R
G
113
AGAATCTCCTCACACCTTGC
 174
GCAGGGACTCCCCAAGTGT
1426
ACTCCCCAAGTGTCCG
2678

















CACCCC






2FH21F_02_243
21
46919207
F
G
93
2
91147
R
T
 95
AGGACTCTGCAACCCAGG
 175
TGCTGGGCTGCCCTCCCTGT
1427
GGTGTGAGGAGATTCT
2679

















T






2FH21F_02_248
21
46920267
R
T
92
2
90118
F
C
 95
CTATAGAAATTACTGGACT
 176
GGAAGGAATCATTCTGAG
1428
AAGGAATCATTCTGAG
2680

















TGAAAA






2FH21F_02_249
21
46920298
R
C
86
2
90087
F
T
 86
CACTCAGAATGATTCCTTCC
 177
TTAAAGGGCTAGACAATGGG
1429
AGGGAGGAGACTCAGA
2681

















A






2FH21F_02_250
21
46920352
F
A
98
2
90033
R
C
 98
ACATGTCCAAATATGTCTG
 178
TCCCTACCCCATTGTCTAGC
1430
TCTAGCCCTTTAAATA
2682

















CATTTGACAAT






2FH21F_02_254
21
46920612
R
T
95
2
89503
F
G
 95
TATTTTTATTTCCAATGTAG
 179
CAATTAGAAATCTAGTGCAA
1431
AATTAGAAATCTAGTG
2683













T



CAAAAGAAT






2FH21F_03_005
21
15894129
F
C
121
3
50774887
F
T
119
TCATCCCCATTTCTCAACTC
 180
TATATAATACTTAGTTTTGG
1432
ATAATACTTAGTTTTG
2684















T

GTCATCAA






2FH21F_03_007
21
15894317
R
G
95
3
50774127
F
T
 97
ATCAAAGCCATTAGCCTA
 181
CTTCTTTTGGATCTTCACCT
1433
CTTCACCTGATAATTT
2685















G

TTCACCATTTT






2FH21F_03_008
21
15894382
F
G
108
3
50774062
R
T
108
TCAAAAGTGCTGGCCAGGTC
 182
GATTAAAGTGCAGAAAAGTG
1434
GTGCAGAAAAGTGAAT
2686

















CCA






2FH21F_03_011
21
15894444
F
T
102
3
50774000
R
G
102
CTTTGGTGTCTTTATCCCTG
 183
GGTAATTTTTCCCTTGGG
1435
CTGGCCAGCACTTTTG
2687

















A






2FH21F_03_012
21
15894451
R
T
99
3
50773993
F
G
 99
GACCTGGCCAGCACTTTTGA
 184
CCCAAGCTTAAAATGTGGGC
1436
ACCTTTGGTGTCTTTA
2688

















TCCCTG






2FH21F_03_013
21
15894476
R
C
99
3
50773968
F
T
 99
GACCTGGCCAGCACTTTTGA
 185
CCCAAGCTTAAAATGTGGGC
1437
CAAGCTTAAAATGTGG
2689

















GCCTAGAT






2FH21F_03_014
21
15894647
R
G
113
3
50773797
F
T
113
GTTAAGGTGTTCTAAGGCTA
 186
GTGTCCAGTAGAAGGAAAAC
1438
AAAACTTAGCTGAAAG
2690













C



GAACATGAAA






2FH21F_03_015
21
15894746
F
T
120
3
50773698
R
G
120
TTCCTCTAAATTCCTTAGC
 187
GAGAAAAGATATTCATGAGA
1439
GAGACTATTAAGGAAA
2691















C

TATAAAATGA






2FH21F_03_017
21
18755793
R
T
120
3
107588227
R
C
120
TCAATATCTTACAGTACAG
 188
GAGGTTCAATTTTATTTCAT
1440
CATAAAATGTGTAGTA
2692

















TTTCTTAGA






2FH21F_03_018
21
18755822
F
T
120
3
107588256
F
C
120
GAGGTTCAATTTTATTTCAT
 189
TCAATATCTTACAGTACAG
1441
AAGAAATACTACACAT
2693

















TTTATGTTA






2FH21F_03_021
21
18756063
R
A
95
3
107588491
R
G
 95
TAGTTGCCCTGAGTTCAA
 190
TAGAAAGAAACTCCTCCTCC
1442
CTCCTCCCATAAAGGA
2694

















AGA






2FH21F_03_022
21
18756109
F
C
91
3
107588537
F
T
 91
GCTGATCAAGGCAGTTTTTC
 191
TTCCTTTATGGGAGGAGGAG
1443
AGTTTCTTTCTATGTC
2695

















TTTGGTTAT






2FH21F_03_025
21
19539204
F
A
109
3
14464204
F
T
109
CATGGTGTCCTCCATGCAG
 192
ACTACCTGTTCCAGTCCTTC
1444
CTTCCAGAAGGAGCTG
2696

















CCC






2FH21F_03_026
21
19539233
F
G
103
3
14464233
F
A
103
GAGCTGATGGTGATCCAGAC
 193
GGCACACTGCAACCACAGC
1445
AACCACAGCTGGAACA
2697

















C






2FH21F_03_027
21
19539238
R
C
98
3
14464238
R
A
 98
ACTGCAACCACAGCTGGAAC
 194
GAGCTGATGGTGATCCAGAC
1446
ATGGTGTCCTCCATGC
2698

















AG






2FH21F_03_028
21
19539267
R
G
106
3
14464267
R
A
106
TGCAACCACAGCTGGAACAC
 195
TTGGTGGAGCTGATGGTGAT
1447
GGTGATCCAGACACTC
2699

















T






2FH21F_03_030
21
19775552
F
G
89
3
14950732
R
G
 89
ATTCCTGGTCTTGGCAGATG
 196
AGAACAGCCTCAGGCCACGA
1448
ACAGCCTCAGGCCACG
2700

















ACTTCTGTGCT






2FH21F_03_031
21
19775569
R
A
83
3
14950715
F
C
 83
TCAGGCCACGACTTCTGTGC
 197
TGAATTCCTGGTCTTGGC
1449
CTGGTCTTGGCAGATG
2701

















G






2FH21F_03_039
21
25654993
R
C
100
3
116610381
R
G
100
AGCCCATGAAGGCTTCCAAA
 198
CAAGTTGTCTCTGACCTAGC
1450
TCTGACCTAGCTCCCT
2702

















T






2FH21F_03_040
21
25655024
F
G
95
3
116610412
F
T
 95
CTTGTTGCCTGGTTTTCATT
 199
GAGCTAGGTCAGAGACAACT
1451
TCAGAGACAACTTGAA
2703

















CA






2FH21F_03_043
21
27438037
F
C
81
3
49370600
F
A
 81
TGTGAGCCTGGGCTCCCTG
 200
TGTAGTCCCGGACCGTGGTG
1452
GCCACATTCTCGATAA
2704

















GTAGT






2FH21F_03_053
21
32740757
F
A
86
3
131271948
R
G
 86
GTAGGCAAGCTCATGCATTC
 201
ACCAAGGTGTGGGAAGTT
1453
TGTGGGAAGTTCAGTG
2705

















GC






2FH21F_03_058
21
33872005
R
T
113
3
137256165
R
A
113
CTATGTGGAATACAAAATGC
 202
CCTACTGATTTATAATTCC
1454
AATTCCTTTATTTTCA
2706













C



CATATACTAAA






2FH21F_03_061
21
33872582
F
G
101
3
137257230
R
G
101
TAAAGATGATTTCCCAAGT
 203
AAGGAGCTTACTAACTGTGG
1455
ACTGTGGTTTGCACCC
2707

















TAA






2FH21F_03_062
21
33873563
R
A
94
3
137257154
F
C
 94
TATCAAGTACTTTGTCCAT
 204
CTCTGCAGTACTGTATCCAC
1456
CCAACTGCTGTATTTA
2708

















ACA






2FH21F_03_063
21
33873613
F
T
101
3
137257104
R
G
101
GCCTCATTCTCTGCATTCAC
 205
TCGTGTGGATACAGTACTGC
1457
CAGTACTGCAGAGAAA
2709

















GA






2FH21F_03_064
21
33873616
R
A
101
3
137257101
F
C
101
TCGTGTGGATACAGTACTGC
 206
GCCTCATTCTCTGCATTCAC
1458
ACCATGCTGCTCAAAT
2710

















CTTCACAGAG






2FH21F_03_065
21
33873672
R
G
100
3
137257045
F
T
100
CATGGTCAGTGAATGCAGAG
 207
CTCTTTCTGGATACAGAGAC
1459
AGTTTGGAGATTACAG
2711

















GT






2FH21F_03_071
21
39487857
R
T
97
3
6496443
R
C
 97
TGCTTTTAAAGACATCAGG
 208
AGAAGTGGTATTTTGGTTT
1460
AGTGGTATTTTGGTTT
2712

















TTAATC






2FH21F_03_073
21
39487887
R
G
98
3
6496473
R
A
 98
CTTCTGATGAAACCAAATC
 209
CTTTCAGTCCAAAATAGTTA
1461
CCAAAATAGTTAGACC
2713















G

CTTG






2FH21F_03_079
21
39488200
R
A
94
3
6496780
R
G
 94
AAATTAATGGATTTGACATC
 210
CTGAAAAGACTAATGGGATG
1462
TGGGATGCCTTTTACT
2714















C

T






2FH21F_03_080
21
39488320
F
G
101
3
6496902
F
A
102
AACTGAGATAGGTGGGAAAC
 211
GAGAAGAAAAGCATCATAG
1463
AGAAAAGCATCATAGT
2715

















TCTGAAATG






2FH21F_03_081
21
39488330
R
T
100
3
6496912
R
C
101
GAAAAGCATCATAGTTCTG
 212
TATCAACTGAGATAGGTGGG
1464
CCTCTCATTTGTGGCT
2716

















TAG






2FH21F_03_083
21
39488395
F
C
119
3
6496978
F
T
119
CTATTCCATTTGACATAGTA
 213
AGGTTTCCCACCTATCTCAG
1465
TGTCCAAAAACATCCT
2717













G



TC






2FH21F_03_084
21
39488417
F
A
119
3
6497000
F
G
119
CTATTCCATTTGACATAGTA
 214
AGGTTTCCCACCTATCTCAG
1466
CATGCATCAGAGTAGA
2718













G



AAGA






2FH21F_03_085
21
39488427
R
T
118
3
6497010
R
C
118
CCCACCTATCTCAGTTGATA
 215
GTTATCTATTCCATTTGACA
1467
GTTATCTATTCCATTT
2719

















GACATAGTAG






2FH21F_03_087
21
39488728
F
C
108
3
6497201
F
T
108
GGACTTGATTCAAATGGTT
 216
CACAATTAGGGCTAATAAA
1468
GTGGGGTACTGTAACA
2720

















TAT






2FH21F_03_088
21
39488868
F
C
119
3
6497341
F
G
119
GTCCAAATATAAGAAACTGT
 217
GGTTAGAAAATAAGTGTACT
1469
AAGTGTACTATTTGTG
2721













C

A

TGATAAA






2FH21F_03_089
21
39488934
F
A
120
3
6497407
F
G
119
AGTTTACTGCTTCCATGTGC
 218
ACATGACAGTTTCTTATATT
1470
ACATGACAGTTTCTTA
2722

















TATTTGGACT






2FH21F_03_091
21
39488983
R
G
118
3
6497455
R
A
117
GACAGTTTCTTATATTTGGA
 219
TTAGTTTACTGCTTCCATG
1471
TGCTTCCATGTGCAAT
2723













C



C






2FH21F_03_093
21
39489193
R
A
109
3
6497664
R
G
109
TCTTTTAGCCCTGTACACTC
 220
CTTCCATAATCTTACTCTGT
1472
TTACTCTGTGAAATAG
2724















G

AGGAAT






2FH21F_03_094
21
39489227
F
A
105
3
6497698
F
G
106
CTTCTGTCCAAGATCTCCTG
 221
CCTCTATTTCACAGAGTAAG
1473
TCACAGAGTAAGATTA
2725

















TGGAAG






2FH21F_03_095
21
39489346
R
C
106
3
6497817
R
T
105
TATATAGCATTTTGTTAGTG
 222
GATTTGAGTGCATGTTTTA
1474
TGAGTGCATGTTTTAA
2726

















ACCTCTA






2FH21F_03_097
21
40695570
F
C
116
3
141989208
F
A
121
AGGTCAGCAGCCTCCAGAG
 223
ACAGCCATGTTCCCACCAGG
1475
CACCAGGGTCAAGAGA
2727

















A






2FH21F_03_098
21
40695618
R
T
120
3
141989261
R
G
125
TCCCACCAGGGTCAAGAGAA
 224
CAGGTCTCCAGGTCAGCAG
1476
CTCCAGGTCAGCAGCC
2728

















TCCAGAGGGG






2FH21F_03_100
21
40695660
R
G
106
3
141989303
R
A
106
TGCTGACCTGGAGACCTGC
 225
ATATAGCTAGCAAGGCTGGG
1477
AAGGAGAGCTGGCAAG
2729

















A






2FH21F_03_101
21
40695692
F
A
106
3
141989335
F
G
106
ATATAGCTAGCAAGGCTGG
 226
TGCTGACCTGGAGACCTGC
1478
CTCCTTCCTCTTTCTC
2730

















CAGA






2FH21F_04_006
21
17963704
R
A
80
4
94858511
R
G
 80
TCTAGAATTCTATCAGAAG
 227
TCTCAGAGGTATGACTGAGC
1479
ACTGAGCAGTTGCTCA
2731

















AG






2FH21F_04_008
21
22395232
F
G
119
4
110832709
R
G
115
GATTCTGTTGTAGCATTAT
 228
TATGATTTGAAATCATTCAG
1480
ATTTGAAATCATTCAG
2732

















GACTTT






2FH21F_04_010
21
23867805
F
G
106
4
83204416
F
A
107
TATAACACATCCCCACATGC
 229
TTAGTCTTTCTTGCTGGGA
1481
TTAGTCTTTCTTGCTG
2733

















GGAATCAAA






2FH21F_04_011
21
23867842
R
G
107
4
83204454
R
T
108
AGTCTTTCTTGCTGGGAATC
 230
TATAACACATCCCCACATGC
1482
TCCCCACATGCATCCT
2734

















T






2FH21F_04_014
21
31962966
R
G
85
4
164801285
R
A
 85
TGATCACTTGGAAGATTTG
 231
ACAGGTCATTGAAACAGACA
1483
GGTCATTGAAACAGAC
2735

















ATTTTAA






2FH21F_04_015
21
31962996
F
T
93
4
164801315
F
C
 92
AAGAAATTCTGACAAGTTTA
 232
AATGTCTGTTTCAATGACC
1484
CTGTTTCAATGACCTG
2736

















TATT






2FH21F_04_017
21
33092540
F
T
98
4
185473899
R
G
 98
AAGAAGCCATCCAGAGAGAC
 233
GGACACAAGTGCAGGTTCAG
1485
TGCAGGTTCAGGGCAA
2737

















GGTGTG






2FH21F_04_018
21
33092610
F
G
115
4
185473829
R
A
115
GTAAGAATTGGGGTTAGGTC
 234
TCTCTCTGGATGGCTTCTTG
1486
GGTGACTGACAGAGGG
2738

















A






2FH21F_04_019
21
33092642
R
T
119
4
185473797
F
G
119
TCTCTCTGGATGGCTTCTTG
 235
TGGAGTAAGAATTGGGGT
1487
AAGAATTGGGGTTAGG
2739

















TC






2FH21F_04_021
21
33092683
R
T
111
4
185473756
F
G
111
CTAACCCCAATTCTTACTCC
 236
GTACTTGAGAGAAACTAGGG
1488
GACACAGTCTCCAGCA
2740

















GAAT






2FH21F_04_022
21
33092713
F
C
100
4
185473726
R
A
100
AAGCCCAGTGAAATCACAGC
 237
TCTGCTGGAGACTGTGTCTT
1489
GGAGACTGTGTCTTAA
2741

















AACTT






2FH21F_04_023
21
44291397
F
G
92
4
101090391
F
A
 92
GAAGGAGTAGGTGGTGGGAT
 238
CTGAAGCTCAAGCAAGCAAG
1490
CAAGCAAGGCAGAGAA
2742

















A






2FH21F_04_024
21
44291416
R
C
93
4
101090410
R
T
 93
CTGAAGCTCAAGCAAGCAAG
 239
CGAAGGAGTAGGTGGTGG
1491
GAGTAGGTGGTGGGAT
2743

















CTC






2FH21F_05_003
21
15812473
F
C
114
5
157490943
R
C
114
GAAGTGGCCTATCAGGTCT
 240
AACCATGGTTTGGGTTTAC
1492
CACTGTTCTATTACAG
2744

















TGTTCTTC






2FH21F_05_005
21
15812543
F
T
101
5
157490873
R
G
101
GGTGGTAATTGAGATGACTG
 241
TTGTAAACCCAAACCATG
1493
CCCAAACCATGGTTCT
2745

















T






2FH21F_05_006
21
18426542
F
A
93
5
160998928
R
A
 91
GTTTTCCCATATCTAGATGT
 242
GTGAATTCTTCCCACTTCTC
1494
CACTTCTCACTTATCA
2746













C



TCTG






2FH21F_05_007
21
18426561
R
T
99
5
160998911
F
C
 97
GTGAATTCTTCCCACTTCTC
 243
TCTTATGTTTTCCCATATC
1495
CTTATGTTTTCCCATA
2747

















TCTAGATGTC






2FH21F_05_008
21
18426592
F
A
87
5
160998880
R
A
 87
TTCCAAGGATTGGAGGACAC
 244
GACATCTAGATATGGGAAAA
1496
AGATATGGGAAAACAT
2748















C

AAGAAAA






2FH21F_05_013
21
18426958
F
A
89
5
160998513
R
C
 88
GTGCAACAAATGCCTTTAA
 245
TTAACATGTTTTCTCTCAC
1497
TTAACATGTTTTCTCT
2749

















CACTGTACT






2FH21F_05_015
21
18427206
R
A
115
5
160998262
F
G
115
AAACAAGCACTGTAGAGTA
 246
CTTTCTTACAACCTATGACT
1498
AACTATTGGCAATTCT
2750















C

GTAATTC






2FH21F_05_016
21
18427235
F
A
97
5
160998233
R
C
 97
ATTTAATAGAACAAACCCC
 247
CTATTGGCAATTCTGTAATT
1499
TACTCTACAGTGCTTG
2751















C

TTTA






2FH21F_05_018
21
20033996
F
T
99
5
64072748
R
G
 99
ACTTTTGAATGCCGCAAT
 248
CTTCACTACTTGTACTGCTG
1500
CCCTTTTAGGGTCTAC
2752

















TC






2FH21F_05_019
21
20034055
R
A
104
5
64072689
F
G
104
GAGTAGACCCTAAAAGGGAC
 249
TATTCAGTTCTTCATTCTC
1501
ATTCAGTTCTTCATTC
2753

















TCTTCATC






2FH21F_05_025
21
27040842
F
T
105
5
35308773
F
A
105
TATTTGTAATGTGAATTTGC
 250
GGACACTAAACAAAGACAGG
1502
AAACAAAGACAGGTTC
2754

















AAAAATAC






2FH21F_05_026
21
27040864
F
G
105
5
35308795
F
A
105
TATTTGTAATGTGAATTTGC
 251
GGACACTAAACAAAGACAGG
1503
GGATGTTTCTGGAACA
2755

















AT






2FH21F_05_027
21
31316723
F
T
111
5
23151508
F
G
111
TTTAGCATTCCCAGACTCAG
 252
ATTGGCCAACATCTCAACAG
1504
ACATCTCAACAGAGTT
2756

















ACA






2FH21F_05_028
21
31316765
R
T
114
5
23151550
R
A
114
TGGCCAACATCTCAACAGAG
 253
TTTCATTTAGCATTCCCAG
1505
GCATTCCCAGACTCAG
2757

















A






2FH21F_05_032
21
31918345
R
A
118
5
171221502
R
G
118
GAATTAGACTATCCCAGTGC
 254
TTCCCAGCCATACTCTGGAC
1506
TCTGGACTTTATTTTG
2758

















CTAACCATAA






2FH21F_05_033
21
31918387
F
T
95
5
171221544
F
A
 94
GGACTTTGGCACCCAAGGA
 255
AATAAAGTCCAGAGTATGGC
1507
GAGTATGGCTGGGAAT
2759

















T






2FH21F_05_034
21
31918647
R
A
108
5
171221804
R
G
108
CTTCCCCCTGGGCTTTCCT
 256
TGATGGTGGTTGTGAAAGTG
1508
ATGGTGGTTGTGAAAG
2760

















TGATTTAG






2FH21F_05_035
21
31918687
F
T
83
5
171221844
F
C
 83
GTAAACAATAAACCTCCATT
 257
CTTTCACAACCACCATCAAG
1509
CACCATCAAGCTTACA
2761













C



ACATC






2FH21F_05_040
21
31918896
F
C
119
5
171222065
F
T
118
CCAATAAACAGCCTCCTATA
 258
CTCAATGCAAAGGACAAATC
1510
CCTTCCCTTTAGTAGT
2762

















AGAG






2FH21F_05_041
21
31918920
R
A
91
5
171222089
R
C
 91
CCTTCCCTTTAGTAGTAGAG
 259
AGGACCAATAAACAGCCTCC
1511
ACCAATAAACAGCCTC
2763

















CTATAAA






2FH21F_05_044
21
31919409
F
C
82
5
171222232
F
T
 82
CACAGCCCAAATGTGTAAAT
 260
GATGCCAACGTCCTTTCC
1512
ATGCCAACGTCCTTTC
2764













G



CATGCAC






2FH21F_05_045
21
31919418
R
G
82
5
171222241
R
A
 82
GATGCCAACGTCCTTTCCAT
 261
CACAGCCCAAATGTGTAAAT
1513
AAATGTGTAAATGGCA
2765















G

CTGT






2FH21F_05_047
21
31919498
R
G
118
5
171222321
R
C
118
CCATTTACACATTTGGGCTG
 262
CCACCCCAGTCATCTCTG
1514
CCAGTCATCTCTGGTG
2766

















TCA






2FH21F_05_051
21
31919696
R
A
112
5
171222519
R
C
112
GATGCATGAATTCCAGAGCC
 263
CAAAAATCATTATTCTGTGC
1515
TGGCCCTGGGAAGGGG
2767

















AAATAA






2FH21F_05_054
21
31919824
F
T
90
5
171222647
F
A
 91
TATATTATACAATAGAGAGG
 264
ACTCAGGAGTACTTATGAGA
1516
TGAGAAAAAGAATAAG
2768

















AACAAAAA






2FH21F_05_058
21
31920049
R
C
104
5
171222880
R
T
104
AGGTAATCCACATCAACC
 265
CTTGAGACACTAATACAGAG
1517
ACTAATACAGAGTGTG
2769

















TTCGC






2FH21F_05_061
21
31920141
R
T
81
5
171222972
R
C
 81
ACTGTTATGTACATTATATC
 266
GTGTGCTTGCCTCCTAATTT
1518
CCTCCTAATTTAAAAT
2770

















ACTGTATTC






2FH21F_05_064
21
31920848
F
T
101
5
171223266
F
C
101
TTTTGGGTGCCAAACACCTA
 267
TGACTTGGACGGTCAAAAGG
1519
TTGGACGGTCAAAAGG
2771

















AGAATG






2FH21F_05_066
21
31920882
R
A
102
5
171223300
R
G
102
GGACGGTCAAAAGGAGAATG
 268
GTGAAATTTTGGGTGCCAAA
1520
GGGTGCCAAACACCTA
2772















C

C






2FH21F_05_067
21
31920932
R
G
99
5
171223350
R
A
 99
TGGCACCCAAAATTTCACTG
 269
GGCCTCTAATTTATATTGC
1521
TATTGCTTTGCACTTT
2773

















GGTTTGATA






2FH21F_05_069
21
31920989
R
A
112
5
171223408
R
G
113
ATCAAACCAAAGTGCAAAGC
 270
GAAAAGGAACATAGAATCTG
1522
GAATCTGTTTTACAGA
2774

















AGTAAAT






2FH21F_05_072
21
31921065
F
A
116
5
171223484
F
G
115
TTTGAGAAGGAGACCTTAGC
 271
ACATTTGAAACATTAGATTT
1523
CATTTGAAACATTAGA
2775















T

TTTTTTCACT






2FH21F_05_073
21
31921138
R
T
100
5
171223556
R
C
100
GAAGCTAAGGTCTCCTTCTC
 272
GCAAAGCAGCCTAACTCTTC
1524
TTTCTCACCTCTGATT
2776

















CC






2FH21F_05_074
21
31921163
F
G
103
5
171223581
F
T
103
GATGCAAAGCAGCCTAACTC
 273
GAAGCTAAGGTCTCCTTCTC
1525
GAATCAGAGGTGAGAA
2777

















ATGTCGG






2FH21F_05_076
21
31921354
R
T
101
5
171228281
R
C
101
GTGCAGACTGTTATCTAGAG
 274
TAAATGTGCCTCCCAGTGCC
1526
TGCCTCCCAGTGCCCA
2778

















GAATGAGACCC






2FH21F_05_080
21
31921952
F
C
113
5
171236063
F
T
113
ACACGGGTGAAGTTCTTAAC
 275
TCCTTGGAACAGGTCACCAT
1527
AGGTCACCATCAGTCC
2779

















A






2FH21F_05_083
21
31922417
F
T
84
5
171259565
F
C
 84
GAATGCTTTGGAAGAAGCTG
 276
GAAAGTCCTTTCCATAGGGG
1528
TCCATAGGGGATCAGT
2780

















G






2FH21F_05_088
21
31922614
F
G
94
5
171270233
F
A
 94
GTGGAACATCTTATTTCACG
 277
TGCAACATGGGCTTCAGGTA
1529
GGCTTCAGGTAAGAGT
2781

















T






2FH21F_05_091
21
34117690
R
G
83
5
10718223
F
G
 83
AGAATTTATTGCCATGTAC
 278
CCTTGCTGAAAGGTTAAATC
1530
TCTCCTTGCTCAGAAC
2782

















TCT






2FH21F_05_092
21
34117728
R
G
106
5
10718185
F
T
105
CAAGGAGATTTAACCTTTC
 279
TTGTCGCCCACTGTTCCTGT
1531
TTCTTGGTAACCAAAA
2783

















TCACATC






2FH21F_05_094
21
34117762
R
T
111
5
10718152
F
G
110
CTGAGCAAGGAGATTTAACC
 280
TTGTCGCCCACTGTTCCTG
1532
TCGCCCACTGTTCCTG
2784

















TCCACC






2FH21F_05_096
21
34130664
F
C
92
5
10717750
R
A
 92
TGATGATCTGGCCCTTGTTG
 281
AGGTGATTGGGATGTACGAC
1533
ACGACTACACCGCGCA
2785

















GAATGA






2FH21F_05_097
21
34130701
F
G
98
5
10717713
R
T
 98
TGACTTCTCCTTTCCACCAG
 282
ATGAGCTGGCCTTCAACAAG
1534
AAGGGCCAGATCATCA
2786

















AC






2FH21F_05_098
21
34130721
F
T
99
5
10717693
R
G
 99
ATGAGCTGGCCTTCAACAAG
 283
CCCACTTGTCCATTGACTTC
1535
TCTCCTTTCCACCAGT
2787

















C






2FH21F_05_099
21
34131201
R
A
91
5
10717567
F
C
 91
TCATATGTTGTCCATCCCCC
 284
TGGGCAGTGATATGGGATAG
1536
GGGTCTCTTTGAGGAC
2788

















TT






2FH21F_05_101
21
34131361
F
C
104
5
10717407
R
A
104
TTTGCTCCTATCTCTGCAAG
 285
AGAAGAACTCACTGCAGAGC
1537
TACCTTAGTTGCATGT
2789

















GAT






2FH21F_05_102
21
34131411
F
C
110
5
10717357
R
A
110
GGGAAAGTCAATTTGAGTAA
 286
TTACTTGCAGAGATAGGAGC
1538
AGAGATAGGAGCAAAA
2790













C



ATTACAAAAA






2FH21F_05_109
21
39372630
F
G
82
5
21021038
R
T
 82
CTCTTCTTAATGGGAAGCAG
 287
TCCCAAACTTGGGCAAAG
1539
CTTGGGCAAAGTTGAC
2791

















A






2FH21F_05_110
21
39372638
R
C
80
5
21021030
F
C
 80
CCAAACTTGGGCAAAGTTGA
 288
TCCTCTTCTTAATGGGAAGC
1540
ATGGGAAGCAGCTCCT
2792

















TA






2FH21F_06_001
21
17888275
F
A
81
6
139639257
F
G
 79
CATGTTAGCACCTCACTA
 289
TACCTTTTTCTCAACATGA
1541
CTCAACATGACACCAA
2793

















CACA






2FH21F_06_004
21
26521837
R
G
98
6
114291260
F
A
 98
GGAATTGGATCAAATGATT
 290
TTGGCAGTATGTATAATGGC
1542
TAATGGCATTTGCTGT
2794

















GGTT






2FH21F_06_005
21
26521929
F
G
110
6
114291168
R
T
110
GGAAAAAAATGTTAATATGG
 291
CAATACTGAACTGTACAAGA
1543
AAGAGTTATTTATTTT
2795













C

G

TCCTTAATCTC






2FH21F_06_006
21
26521974
R
C
91
6
114291124
F
A
 90
CATCCAAAGTTTTGTACATC
 292
TTTAGTAATACAAAAAAGCC
1544
AAAAAAGCCATATTAA
2796













A



CATTTTTTTCC






2FH21F_06_007
21
26522028
R
G
89
6
114291070
F
T
 89
CATGATGTACAAAACTTTGG
 293
GGTGGATTTTCCTCCAAGTG
1545
GGTGGATTTTCCTCCA
2797

















AGTGATTAAA






2FH21F_06_011
21
26527970
R
C
116
6
114290746
F
A
117
GTTAAGATAGGAAAGACCC
 294
TTTTAGTTAGGGTTTCTTG
1546
TAGTTAGGGTTTCTTG
2798

















ATCTTGG






2FH21F_06_012
21
26528056
F
G
101
6
114290660
R
T
101
GGAATAATGGATCAAAAATA
 295
CCCTTCTAAGTGTTATTTG
1547
CAAGGGTGTTTGGTAA
2799













G



GGTC






2FH21F_06_013
21
26528063
R
T
82
6
114290653
F
T
 82
TTAGTAGCAAGGGTGTTTGG
 296
TTAATTGGAATAATGGATCA
1548
ATTGGAATAATGGATC
2800

















AAAAATAG






2FH21F_06_015
21
26528520
R
G
117
6
114290188
F
A
117
GACATCATCCATTCAACACC
 297
GCTTAGTGCTTGGCTAATTT
1549
TTGGCTAATTTCCAAA
2801















C

TTATTGC






2FH21F_06_018
21
26528680
R
G
95
6
114290028
F
T
 95
TCTATAGACTCTCACTCAG
 298
GAGAAAATTTCATAAAGCC
1550
GAGAAAATTTCATAAA
2802

















GCCATTCTC






2FH21F_06_023
21
26528889
R
A
111
6
114289819
F
C
111
TGGTAACAGATTTGACATGG
 299
TCTGAAGTTTTCAAGCTCTG
1551
TCAAGCTCTGAAATTC
2803

















ATAATC






2FH21F_06_025
21
26528957
R
A
118
6
114289751
F
C
118
TCAGAGCTTGAAAACTTCAG
 300
TGAGACTTCTAGGTCTTAGG
1552
GGTTAATTTTTAGGAA
2804

















GATCTTG






2FH21F_06_026
21
26529017
F
G
118
6
114289691
R
T
119
TTCTGTGAGCACACTAAAA
 301
TAAGACCTAGAAGTCTCAG
1553
AGTCTCAGTATTATTA
2805

















GAACATAAA






2FH21F_06_028
21
26529096
R
T
97
6
114289611
F
G
 97
GTGTGCTCACAGAAAATTAG
 302
GAGATGGAATGTAACTTTGC
1554
CTTACAAAAATTGCTA
2806

















TTAAACTCCT






2FH21F_06_029
21
26529157
F
G
118
6
114289550
R
T
118
TCAGATGCAATGGTTTTGTG
 303
GCAAAGTTACATTCCATCTC
1555
TTCCATCTCTAAGTCA
2807

















AATTGGTC






2FH21F_06_031
21
26529316
R
G
104
6
114289392
F
T
105
CCACAGTATAAACAGTAAC
 304
CTGCAGTCATCTTGGACCTT
1556
AAACTCAACCAAGCTG
2808

















TGATAAG






2FH21F_06_034
21
26529525
F
C
94
6
114289182
R
T
 94
TGTACCAGTCAGTGATTAAG
 305
ATTAAGGTCATAAACCAGC
1557
GTCATAAACCAGCAAT
2809

















AAACAATA






2FH21F_06_035
21
26529569
F
C
105
6
114289138
R
C
105
GTTCTACTTAATCACTGAC
 306
GATCATAGTCTTAGGAGTTC
1558
GAACTTTTCACTTATC
2810

















TCATGTTAG






2FH21F_06_037
21
26529646
R
A
119
6
114289061
F
G
120
GAACTCCTAAGACTATGAT
 307
ACAACACTACAAGTCTTGA
1559
GAAAAAACACCAATAC
2811

















CCA






2FH21F_06_038
21
26529744
R
T
94
6
114288954
F
T
102
GAAATGGTGTAAAGGCTGTC
 308
GTGTTGTAAACCTGCCTCAC
1560
AAATACATGGTAATAA
2812

















CTTTTCTT






2FH21F_06_045
21
29875665
F
T
86
6
102479244
R
G
 86
ACTCAGACGTGGTGGAAAAC
 309
TGAGAGCTCCAACTCCAAAC
1561
TCCAAACCAGAAACTA
2813

















TTTAG






2FH21F_06_046
21
29875668
R
A
86
6
102479241
F
C
 86
TGAGAGCTCCAACTCCAAAC
 310
ACTCAGACGTGGTGGAAAAC
1562
GTGGTGGAAAACAATT
2814

















TTAC






2FH21F_06_047
21
30050650
R
G
112
6
6413565
R
A
112
AACGTGGCATTGTCCCCAAG
 311
GTCAGCTAATGCCACATGGT
1563
TAATGCCACATGGTAA
2815

















TTGCTGC






2FH21F_06_051
21
31747020
F
A
86
6
154912719
F
G
 85
CCAGGTCTTGATAGTCTTTG
 312
AGATGAGTGAGCAGGAAGAG
1564
AGAGGAGCTTGAGGAT
2816

















G






2FH21F_06_052
21
31747021
F
C
101
6
107468032
F
T
101
ACTGCTTTTTCCAGGTCTT
 313
TGATGAGATGAGTGAGCAGG
1565
AGAGGAGCTTGAGGAT
2817

















GA






2FH21F_06_053
21
31747168
F
G
116
6
154912866
F
A
116
TGTATCTCCCACTTTGACC
 314
AGAAACAAAGTGGAAGATGC
1566
AGGCTGAATGGGGAAA
2818

















A






2FH21F_06_060
21
32835972
R
A
117
6
156609546
R
T
116
GGTAGAGTTGCAAATAATT
 315
CCACCCACATTTTTCTCAGC
1567
ATACCTCCATCTGCAC
2819

















C






2FH21F_06_061
21
32835996
F
T
111
6
156609570
F
C
110
CCACCCACATTTTTCTCAGC
 316
GTTGCAAATAATTTGGTGAG
1568
GCAGATGGAGGTATCT
2820

















CTTA






2FH21F_06_062
21
32836018
R
A
94
6
156609591
R
T
 93
GTGCAGATGGAGGTATCTCT
 317
TTCTCCCACCCACATTTTTC
1569
CTCCCACCCACATTTT
2821

















TCTCAGCAATT






2FH21F_06_064
21
32836229
F
A
108
6
156609801
F
G
111
GGGAAAGGACATCCCTTC
 318
TGTAGTGATGGGAGGGATTC
1570
GATTCAAATCCTCCTC
2822

















TTCAGCAAAAG






2FH21F_06_065
21
32836400
F
G
92
6
156609975
F
C
 92
CCTGTTTTGAGTAAACAGT
 319
GTCTCATGGGCTGCAAAC
1571
GGGCTGCAAACCACCA
2823

















A






2FH21F_06_068
21
32836499
R
A
116
6
156610074
R
G
116
ACTGTTTACTCAAAACAGG
 320
GATACCTACTGAATTATTG
1572
GATACCTACTGAATTA
2824

















TTGAGGATA






2FH21F_06_073
21
32836931
R
G
95
6
156610505
R
A
 95
AATCACTGGGAAACAAAGAC
 321
GAAAATGCCAACTTTCTGGG
1573
TTACCATTTGTGGTTT
2825

















ATTTGCTCT






2FH21F_06_075
21
32837154
F
T
106
6
156610726
F
C
106
TTCATTTGTCCCTGGTACAC
 322
GACTGGAAACTGTTGAAAG
1574
ACTGGAAACTGTTGAA
2826

















AGTTAAAAA






2FH21F_06_076
21
32837191
R
G
113
6
156610763
R
A
113
GACTGGAAACTGTTGAAAG
 323
GGATACTTTCATTTGTCCCT
1575
TTGTCCCTGGTACACA
2827















G

T






2FH21F_06_077
21
32837231
F
C
86
6
156610803
F
T
 86
AGAAAGGCTTGACAATAAT
 324
ATGTGTACCAGGGACAAATG
1576
ATGAAAGTATCCTTCC
2828

















AAAATA






2FH21F_06_079
21
32837258
F
G
107
6
156610830
F
A
107
TGGATTTGCTGTTGATCACC
 325
CCCAAATTATTGTCAAGC
1577
AATTATTGTCAAGCCT
2829

















TTCT






2FH21F_06_082
21
32838067
F
A
90
6
156611192
F
G
 89
TCAGACACTGCATATTCTGG
 326
AATCTCCAGTAAACTCTAGG
1578
GTAAACTCTAGGATAT
2830

















CCAAAGGTGT






2FH21F_06_083
21
32838110
F
G
87
6
156611234
F
T
 87
GTTTTGCTGACATTAGTTG
 327
CAGAATATGCAGTGTCTGAG
1579
GAATATGCAGTGTCTG
2831

















AGAAACTT






2FH21F_06_084
21
32838463
F
G
84
6
156611587
F
A
 84
GCTAGAGAAAAAGCCAGG
 328
TCAGGGTACAAGCAGCTGTC
1580
CAGCTGTCTGACTCCA
2832

















AACCCTTTAT






2FH21F_06_088
21
32838640
F
C
82
6
156611764
F
T
 82
GAAAATATGTGCTTTTATCT
 329
TTATCTATAGAAACACTCC
1581
AGAAACACTCCCAAAG
2833













G



C






2FH21F_06_092
21
32838763
R
G
88
6
156611887
R
C
 88
CCTTGATAGTATTTGCCACT
 330
CATCATTCCCTATTTGACTG
1582
TGACTGATTTTTAACC
2834













C



TATCAT






2FH21F_06_093
21
32838962
F
C
97
6
156612095
F
G
 97
TCCTGAAGTTCAGAAACAG
 331
TTTCTTAACCAGAGAGCTTC
1583
TAACCAGAGAGCTTCC
2835

















TGGCCCACA






2FH21F_06_095
21
32839594
F
G
94
6
156612730
F
A
 97
AGACCCTTATTCCAAGGGTA
 332
TTCCCAGGGCCCAAAGCAAG
1584
TTCCCAGGGCCCAAAG
2836

















CAAGAAAATG






2FH21F_06_099
21
32839825
F
C
89
6
156612965
F
T
 89
GACTTGAGCAACACAAATG
 333
CTAAGTAAATCAGGCTTTGG
1585
AGGCTTTGGACAGGCT
2837

















C






2FH21F_06_102
21
32839931
F
T
108
6
156613068
F
C
108
CCTTTTCTGACAGAAAGGTA
 334
GATGGAATTTCTCTTTGCAC
1586
AATTTCTCTTTGCACC
2838















C

TGAACAA






2FH21F_06_107
21
32840060
R
T
108
6
156613197
R
C
108
CTTAGATTCACACTCAAGCC
 335
TCTGTGCTAGGAGAAGGAG
1587
AGGAGAAGGAGAATTT
2839

















GGG






2FH21F_06_110
21
32840630
R
T
116
6
156613770
R
C
116
GACTCATCAACTTCTCAT
 336
GGAAAACTCAAACATGGACT
1588
AACATGGACTGGAGTG
2840















G

G






2FH21F_06_111
21
32840668
F
G
105
6
156613808
F
A
108
GTCTGTTGATTTCAAAACAC
 337
CACTCCAGTCCATGTTTGAG
1589
GAGTTTTCCAAATCCA
2841

















CAT






2FH21F_06_112
21
32840695
R
T
118
6
156613838
R
G
121
CACTCCAGTCCATGTTTGAG
 338
GGATTAAGTATATGTCTGTT
1590
TCTGTTGATTTCAAAA
2842















G

CACA






2FH21F_06_113
21
32840740
F
G
120
6
156613883
F
A
119
GAGAATTAAAATGAACTGAG
 339
GTGTTTTGAAATCAACAGAC
1591
CATATACTTAATCCTT
2843













G



TTGCCTCA






2FH21F_06_114
21
32840770
R
A
97
6
156613912
R
C
 96
TACTTAATCCTTTTGCCTC
 340
GAGAATTAAAATGAACTGAG
1592
GAGAATTAAAATGAAC
2844

















TGAGGATTTC






2FH21F_06_117
21
32840889
F
G
111
6
156614032
F
A
107
CTGCATATATCTTCTGCCTC
 341
CTGGTTTTGAATTACATTGG
1593
ATTACATTGGCTAACT
2845















C

TCAGAAAA






2FH21F_06_118
21
32840915
R
A
112
6
156614054
R
T
108
CTGGTTTTGAATTACATTGG
 342
ACTGCATATATCTTCTGCC
1594
CTTCTGCCTCAATTAC
2846













C



TTTC






2FH21F_06_119
21
32841051
R
C
95
6
156614190
R
A
 95
AAGCCTATTTATCATACAG
 343
AGAATGACAACTGACATTT
1595
GAGGCTTATAAAATGA
2847

















TTAAAGG






2FH21F_06_127
21
32844567
F
T
91
6
156617501
F
C
 91
GGGCTGCGAGTTCAAATTC
 344
CTGCCCTTTTCAATTCTG
1596
CCCTTTTCAATTCTGT
2848

















CTGAG






2FH21F_06_128
21
32844629
R
C
120
6
156617563
R
T
120
GAATTTGAACTCGCAGCCCC
 345
CTGTGAAACCATGGGAAGTT
1597
AAGTATACAATCAGGC
2849

















AGAAAAAGG






2FH21F_06_129
21
32844655
R
G
120
6
156617589
R
A
120
GAATTTGAACTCGCAGCCCC
 346
CTGTGAAACCATGGGAAGTT
1598
TGACTTTACAGGCACT
2850

















T






2FH21F_06_130
21
32844700
F
T
119
6
156617634
F
C
119
AGAGGATTCAGCCTGCTCA
 347
ATAACTTCCCATGGTTTCAC
1599
CCCATGGTTTCACAGC
2851

















AAAG






2FH21F_06_132
21
32844750
R
G
96
6
156617684
R
A
 96
GCACAGGCTTTTAAACCCA
 348
GAGACATTGTCCTTTTGAAG
1600
TTTGAAGATGTGGAAA
2852

















GTAAT






2FH21F_06_133
21
32844772
F
G
117
6
156617706
F
T
117
GCAATTTTGACACCTTAAAG
 349
TTGTCCTTTTGAAGATGTGG
1601
AGCAGGCTGAATCCTC
2853













C



T






2FH21F_06_134
21
32844793
R
A
120
6
156617727
R
G
120
AGTGAGCAGGCTGAATCCTC
 350
GCAGCAGGGTATAACAAAGC
1602
TGACACCTTAAAGCAG
2854

















AA






2FH21F_06_135
21
32844826
F
T
103
6
156617760
F
C
103
TGGGTTTAAAAGCCTGTGC
 351
TATCTGTGTAGCAGCAGGG
1603
GCAGGGTATAACAAAG
2855

















CTAAA






2FH21F_06_137
21
32844977
R
T
113
6
156617917
R
C
114
TATATATGTTAGCACAGAC
 352
CTGTTTGACTATTCTGATCT
1604
TGATCTCTTAAGATGC
2856















C

ATCTGAAAAA






2FH21F_06_138
21
32845021
F
A
114
6
156617961
F
C
113
ACTAGCTGTAACCTTTGTGC
 353
CTTAAGAGATCAGAATAGTC
1605
ATCAGAATAGTCAAAC
2857

















AGTAG






2FH21F_06_140
21
32845086
F
C
102
6
156618025
F
T
102
ACGAGGTCAAATCTGCTCC
 354
CCATCTTCAAGTTTTAAGCA
1606
GCACAAAGGTTACAGC
2858















C

TAGT






2FH21F_06_141
21
32845096
R
T
85
6
156618035
R
C
 85
GCACAAAGGTTACAGCTAGT
 355
ACGAGGTCAAATCTGCTCC
1607
TCCAACAGTGGAAATA
2859

















AAAT






2FH21F_06_142
21
32845163
F
T
104
6
156618102
F
C
104
CTTCATTCAGAATCTTTTTC
 356
CAGATTTGACCTCGTCTCTC
1608
GCAGAAAACTTCAACA
2860

















AAGG






2FH21F_06_144
21
32845265
F
T
105
6
156618204
F
C
105
CACTGGGGAAAAGTGCACCT
 357
ATGCAGTGCTTAGGAAGTGG
1609
GTGCTTAGGAAGTGGA
2861

















TAAAAGTCAA






2FH21F_06_147
21
32845497
F
G
103
6
156618436
F
A
103
TCTTTTGGAATGGGAGGGAG
 358
TGCCACTGCACCAGGAGAAA
1610
AGGAGAAAAGGAGTCA
2862

















CTAG






2FH21F_06_148
21
32845501
R
C
103
6
156618440
R
T
103
TGCCACTGCACCAGGAGAAA
 359
TCTTTTGGAATGGGAGGGAG
1611
TTTTCTCTTCCCCATC
2863

















C






2FH21F_06_149
21
32845574
F
C
118
6
156618513
F
T
118
GATGACATTCTTCCTGTCT
 360
TCCCTCCCATTCCAAAAGAG
1612
GAAGAAAAAACCTGGA
2864

















CAGCCAGATA






2FH21F_06_150
21
32845973
F
G
112
6
156618922
F
T
112
GCCTGAGTCTCTCTAATT
 361
TGCTTCAGCTAGGTGCTTAC
1613
AGGTGCTTACAGGTGA
2865

















A






2FH21F_06_153
21
32846019
R
A
102
6
156618968
R
G
102
CATGTAGCAAATTTGGTTTC
 362
GGAGAAGAGCATAGCTAGAC
1614
GCCTGAGTCTCTCTAA
2866

















TT






2FH21F_06_155
21
32846052
R
C
108
6
156619001
R
T
108
CATGTAGCAAATTTGGTTTC
 363
GAGGCTGGAGAAGAGCATAG
1615
AGAAGAGCATAGCTAG
2867

















AC






2FH21F_06_156
21
32846079
F
T
109
6
156619028
F
C
116
CCATTCAAACAAAAGCCCG
 364
GTCTAGCTATGCTCTTCTCC
1616
CTAGCTATGCTCTTCT
2868

















CCAGCCTC






2FH21F_06_159
21
32846617
F
T
87
6
156619266
F
C
 87
AGAACCGAGGGATGCAAAAC
 365
TCTTTGAAACAGCATGACTC
1617
AAACAGCATGACTCAG
2869

















ATAG






2FH21F_06_163
21
32849012
R
T
99
6
156621662
R
C
 99
GGAACCAAGACTACACTGAG
 366
TGGTGTTTATGGATGAGTGG
1618
GAGGTTGAAGGAGAGG
2870

















C






2FH21F_06_165
21
32849060
R
A
93
6
156621710
R
G
 93
GGGCTGTTTCAATGAGGGAC
 367
GGTACCACTCATCCATAAAC
1619
CTCATCCATAAACACC
2871

















AACACT






2FH21F_06_166
21
32849104
F
C
120
6
156621754
F
T
119
GATGTCTGTGTCTAAAATTG
 368
TGTGTATCATAAAGTCCCTC
1620
CCTCATTGAAACAGCC
2872













G



C






2FH21F_06_168
21
32849148
R
A
113
6
156621797
R
G
112
GTCCCTCATTGAAACAGCCC
 369
GGGAGGATGTCTGTGTCTAA
1621
GGAGGATGTCTGTGTC
2873

















TAAAATTGGT






2FH21F_06_172
21
32849578
F
A
112
6
156622258
F
C
113
ATTGTGCAATTAAATGACC
 370
CTCTCTTCTGGAAATCATCG
1622
GGAAATCATCGATGAA
2874

















AAAGCATGTT






2FH21F_06_176
21
32849896
F
A
111
6
156622572
F
T
110
AGACCTTGTTGTCTAGGGTG
 371
AACAGCCAAAAGCCTATC
1623
CCAAAAGCCTATCATC
2875

















ACA






2FH21F_06_179
21
32850613
R
G
103
6
156622980
R
A
107
CCTCATCATTTTCAGCCTGG
 372
TATGGGAGAGGGTAAAAAG
1624
GGGAGAGGGTAAAAAG
2876

















AGGTTAA






2FH21F_06_182
21
32850954
R
A
118
6
156625339
F
C
118
GCTCAGGTATTTTATAAGGC
 373
AGTTAGTTACCAACTCCTAG
1625
CCAACTCCTAGAAGCC
2877

















A






2FH21F_06_183
21
32850996
F
A
113
6
156625297
R
C
113
GCTCAGGTATTTTATAAGGC
 374
GTTACCAACTCCTAGAAGCC
1626
GATGTGTAAAATAACT
2878

















GAGAAAA






2FH21F_06_194
21
32863500
R
A
102
6
161178437
F
A
102
CAGAACCGCCTAGAAGGCAA
 375
TTCCGCAGCCCACAGCTAAG
1627
CAGCTAAGTCACTCTG
2879

















A






2FH21F_06_196
21
32863965
R
C
112
6
167684833
R
G
127
TCACTGAAAACCGCGGAAG
 376
GGCAGCGAAGGGGCCTCAC
1628
GCAGCGAAGGGGCCTC
2880

















ACGGGGAC






2FH21F_06_198
21
32864171
R
C
115
6
167685060
R
T
114
GCGAAATGACCTGTTTACC
 377
TGTAAACACAACGCAGGAAC
1629
CGCAGGAACATCATGA
2881

















AAA






2FH21F_06_204
21
32867314
R
G
102
6
167521102
F
T
102
AGCTGTCCAGATAATTTGGG
 378
GAAGCCACAGGCTCACAG
1630
GGATAAGAACCAGGAA
2882

















AACAT






2FH21F_06_218
21
32883453
F
G
100
6
167724992
F
A
100
ACCCTCAGTACCACTATCTC
 379
GAAAGTTCTTGTATTAAAAG
1631
GAAAGTTCTTGTATTA
2883

















AAAGAAGTGG






2FH21F_06_219
21
32883480
R
T
93
6
167725019
R
C
 93
CTTGTATTAAAAGAAGTGG
 380
ACCCTCAGTACCACTATCTC
1632
TCAGTACCACTATCTC
2684

















AATCTT






2FH21F_06_224
21
32885410
F
G
107
6
167728703
F
A
107
GGAGTCAAGGGAGCATTTTA
 381
CAAGGATTCCAGTACTGGAG
1633
CAGTACTGGAGAATGT
2885

















CT






2FH21F_06_228
21
32885661
R
T
88
6
167728958
R
C
 90
GATGTCACCTCTCTGCCTTC
 382
ACGTAAGTCCCCACAGTTTG
1634
GGGAGGCTTAGGGAGA
2886

















A






2FH21F_06_229
21
32885700
F
C
118
6
167728997
F
G
142
GGGAGGTCAGGACAATTTTT
 383
CTCCCAAACTGTGGGGACTT
1635
AAACTGTGGGGACTTA
2887

















CGTGT






2FH21F_06_233
21
32886101
R
A
99
6
167729422
R
G
 99
ATGGGTGGACAAAACGAC
 384
GAAAATTGCATCTGGCTACA
1636
CAGCTCCTTGGTGTAG
2888















C

A






2FH21F_06_238
21
32886328
F
C
115
6
167729649
F
G
115
TGTGTGCAAGGCTCTAGAAG
 385
TGTTCTTGGTTGACTTTAC
1637
CAAACAGAGAAAATTA
2889

















AAATCAAACA






2FH21F_06_239
21
32886535
F
T
116
6
167729855
F
G
116
TTTTGCCACTTTCCAGGTG
 386
CTGTTCCTGAGCTGATTGGG
1638
TCCTGAGCTGATTGGG
2890

















GTTCTGG






2FH21F_06_241
21
32886578
F
G
116
6
167729898
F
A
116
TTTTGCCACTTTCCAGGTG
 387
CTGTTCCTGAGCTGATTGGG
1639
AAGCTCAGGAGGACAA
2891

















A






2FH21F_06_242
21
32888205
R
A
108
6
167732826
R
C
108
GAAGACAAGTAGCTGACCTG
 388
AGGACATGGGGCTGGTTTTG
1640
GGAGAAGGGCCTAGGT
2892

















G






2FH21F_06_243
21
32888229
R
G
108
6
167732850
R
C
108
GAAGACAAGTAGCTGACCTG
 389
AGGACATGGGGCTGGTTTT
1641
AGGACATGGGGCTGGT
2893

















TTTGGTAAA






2FH21F_06_250
21
32889347
R
T
120
6
167733959
R
C
119
TGTATGACAAGCCATGTGGG
 390
TCCTGTGTTTCTAGGAAGGC
1642
TTCTAGGAAGGCAACA
2894

















ACT






2FH21F_06_251
21
32889391
F
C
119
6
167734003
F
T
119
CCTGTCAGTTCAATGTGTAA
 391
GAAACACAGGAATAACCTGC
1643
GGAATAACCTGCAGCA
2895

















CCA






2FH21F_06_252
21
32889422
R
A
114
6
167734034
R
C
114
ACAGGAATAACCTGCAGCAC
 392
CCTGTCAGTTCAATGTGTAA
1644
AAAAGCACAAAAGTAG
2896

















ATTCCT






2FH21F_06_253
21
32889464
F
A
113
6
167734076
F
G
113
ATTCATCGAATGTGGGCGTC
 393
GTGCTTTTACACATTGAACT
1645
TGCTTTTACACATTGA
2897















G

ACTGACAGGT






2FH21F_06_254
21
32889504
F
A
85
6
167734116
F
G
 85
GCAGGATTCATCGAATGTGG
 394
AGGCATCGACTGTCACAGG
1646
CAGGGGCCAGTGGAGA
2898

















GGT






2FH21F_06_258
21
32889591
R
A
124
6
167734195
R
G
116
CCCACATTCGATGAATCCTG
 395
AGCTGCCTTTATTCGTGCTC
1647
TTTATTCGTGCTCAAG
2899

















TTAT






2FH21F_06_259
21
32889621
F
T
103
6
167734225
F
C
103
ACAGGAGCAGTGTTTAGAGC
 396
ACTTGAGCACGAATAAAGGC
1648
CGAATAAAGGCAGCTC
2900

















A






2FH21F_06_263
21
34679715
F
A
119
6
86502282
R
C
119
CTTTCAGCCTCCAGTTTTTG
 397
GGCAGCAAAAACATTAATTC
1649
AGCAAAAACATTAATT
2901

















CTCTGCCTG






2FH21F_06_264
21
34679765
R
A
115
6
86502232
F
C
115
AACATTAATTCTCTGCCTG
 398
TCTTCCTTTCAGCCTCCAG
1650
CTTCCTTTCAGCCTCC
2902

















AGTTTTTG






2FH21F_06_268
21
36424803
R
C
107
6
135260845
R
A
107
CCACTTGTTTATAAGCATGG
 399
CAAAAAGACCTGCTAGAGCC
1651
GCTAGAGCCATTATTG
2903













G



C






2FH21F_06_275
21
36680355
R
C
103
6
106220938
R
T
103
AGACTCAGGAGGATGAAAG
 400
CATGCTGGAAGTCCAGGCT
1652
AAGTCCAGGCTGTACA
2904

















C






2FH21F_06_277
21
36707214
F
T
111
6
106222106
F
C
111
GGGTCTTGGGTTCTGCTGG
 401
CAGCAAAGAAAACCAAGAGT
1653
ACCAAGAGTCAGACAC
2905















C

A






2FH21F_06_278
21
36707282
F
G
84
6
106222174
F
A
 84
TGGGGCCTGTCTGGCCTGAG
 402
TGCCAGCAGAACCCAAGAC
1654
AGAACCCAAGACCCCA
2906

















GCA






2FH21F_06_279
21
36707299
R
C
93
6
106222191
R
A
 93
TGCCAGCAGAACCCAAGAC
 403
TGTTGGGGCTGGGGCCTGT
1655
TGGGGCCTGTCTGGCC
2907

















TGAG






2FH21F_06_284
21
36710882
F
C
93
6
106222912
F
A
 94
CTTTCTCATCTTCCTAATTC
 404
CTGGCATCCTCGTGAAAGTG
1656
ATGGAGGGACTCCTTT
2908

















T






2FH21F_06_288
21
44005258
R
C
96
6
14831246
R
T
 96
ATGTTTCCTGTTCTCAGTGC
 405
TGAAAGGCAGGAACGTGGT
1657
AGGCAGGAACGTGGTT
2909

















TTAGAC






2FH21F_07_002
21
10017549
R
T
81
7
151532773
R
C
 81
GAAAGGCTTTGGAGATGACC
 406
GGTTTAGGGACTGAATAAC
1658
GGACTGAATAACTTAG
2910

















TTACATAA






2FH21F_07_003
21
10017701
F
G
107
7
151532925
F
A
107
TGATGAAAGGATTTGAGTGC
 407
AGTCTATTGGATTTAAACC
1659
ACCATTTCCTTATAAA
2911

















ACCTGATT






2FH21F_07_004
21
10017727
R
T
117
7
151532951
R
C
117
CCATTTCCTTATAAAACCTG
 408
CTCAATAAGAGTCTTATTGC
1660
GATGAAAGGATTTGAG
2912















C

TGC






2FH21F_07_009
21
10018035
F
G
114
7
151533262
F
A
114
TATCCTGTGTACTGTGGAAA
 409
TTGCCGCACCATAAATCCAC
1661
CACCAATACCTATCCA
2913

















AAAAAGAAATT






2FH21F_07_016
21
10018739
F
A
112
7
151533969
F
G
112
TGTATAAATGCCCTCATAC
 410
CACAAACTACCTAGATGACA
1662
TGACTGATATGATTTC
2914















C

AGGGGGAC






2FH21F_07_017
21
10019087
F
C
99
7
151534313
F
A
105
TGCAGATTTCTTCCAGGAAC
 411
CCCTCAATTAGAGGGTTGAC
1663
GAGGCAGAGGAAAAGA
2915

















AAA






2FH21F_07_018
21
10019153
F
T
119
7
151534385
F
C
119
GGTCATATCTATAATAAGG
 412
AAAAGTACACTTATAAGCC
1664
ACACTTATAAGCCTCA
2916

















TGAT






2FH21F_07_021
21
10019238
R
C
88
7
151534470
R
T
 88
GGTCCTTATTATAGATATGA
 413
CATTCGTATTCCATGAGACC
1665
TTCCATGAGACCTTAA
2917













C



AAGATAACCT






2FH21F_07_022
21
10019293
R
A
92
7
151534525
R
C
 92
GGTCTCATGGAATACGAATG
 414
GTAAGAGTGATCTAAATCCC
1666
TGATCTAAATCCCTTT
2918

















TGATATG






2FH21F_07_025
21
10019407
R
G
89
7
151534640
R
C
 90
CAATTTAAAACCTCATTGG
 415
CACACGTGTTGAGTAGGCTT
1667
TGTTGAGTAGGCTTTC
2919

















CTTAG






2FH21F_07_026
21
10019536
R
G
113
7
151534770
R
A
113
GCCTACAACTTCTGTATTGT
 416
TCAGGAGTGGAGAGAAAAGC
1668
GAGAAAAGCGGTCTTG
2920













G



C






2FH21F_07_027
21
10019592
R
A
103
7
151534826
R
G
103
AAGACCGCTTTTCTCTCCAC
 417
GGCTCCTAGAATTTATAGTC
1669
AGTCCAGTTAAAAACC
2921

















ATGA






2FH21F_07_028
21
10019645
R
G
101
7
151534879
R
A
101
GGACTATAAATTCTAGGAGC
 418
TGTTTATGCAGGAGTGCCAG
1670
AAGTATACAGTGTGAA
2922

















GGGGAA






2FH21F_07_029
21
10019826
F
A
118
7
151535060
F
C
118
GTCCAAGTATGAACAAAAGC
 419
GTGAATACTTCACAATGAAT
1671
TCCCAAATGTTAACCA
2923













C

C

TTTTATTAAA






2FH21F_07_030
21
10019853
F
T
118
7
151535087
F
C
118
GTGAATACTTCACAATGAAT
 420
GTCCAAGTATGAACAAAAGC
1672
AAATGGTTAACATTTG
2924













C

C

GGA






2FH21F_07_033
21
10020153
F
T
90
7
151535387
F
C
 90
TCAGAATCTAGTCCTGAGCG
 421
ACACCATCTGTTCCTTCCAC
1673
CCACTCCCTTAGTTTC
2925

















ATCAT






2FH21F_07_035
21
10020360
F
C
102
7
151535594
F
A
102
AACACTGCACTAAGCAGCAC
 422
ATCCCTGTTGGTAGGGAAAG
1674
GGAAAGTATGAAAGGA
2926

















GATAGAAG






2FH21F_07_036
21
10020375
R
C
102
7
151535609
R
G
102
ATCCCTGTTGGTAGGGAAAG
 423
AACACTGCACTAAGCAGCAC
1675
ACTAAGCAGCACAATT
2927

















TCTA






2FH21F_07_037
21
10020466
R
C
115
7
151535700
R
T
115
AAGGGGAACACAGAACTCAG
 424
AGAGACCTGGACCTGAAGAC
1676
AGTGAATTTGTTAAGT
2928

















GCAAATGG






2FH21F_07_042
21
10021598
R
A
101
7
151536832
R
G
101
CATGAACAGGGTATTTGTC
 425
GCCATTATCAGATTGTTATG
1677
TTGTTATGGAATTGGC
2929

















CT






2FH21F_07_050
21
10054407
F
C
112
7
151569685
F
G
113
CCAATGGAAATATTGAGAG
 426
CCACCTAGGACGTTTTATTG
1678
ATTTAGTGGTAGGCAG
2930

















TGGGG






2FH21F_07_052
21
10054485
F
T
104
7
151569764
F
C
104
GAACTGTCTACTGCCAACAT
 427
GGTTTTTCTCTGAGATTTGG
1679
TGGCTAACATACATCT
2931















C

TAAATTC






2FH21F_07_053
21
10054494
R
A
104
7
151569773
R
G
104
GGTTTTTCTCTGAGATTTGG
 428
GAACTGTCTACTGCCAACAT
1680
ACTGCCAACATAATAT
2932













C



TAAACTAT






2FH21F_07_057
21
10054889
F
T
81
7
151570171
F
C
 81
CTGCCCCTGTAATGTATGG
 429
ACAGTGTAAAAAGTGCTGCA
1681
CTGCAACTGGATTGTA
2933

















GG






2FH21F_07_058
21
10054933
F
A
106
7
151570215
F
C
107
TGCTGAACAGGGTGCTTAAC
 430
CTACAATCCAGTTGCAGCAC
1682
CACTTTTTACACTGTA
2934

















ATTAAAGAT






2FH21F_07_059
21
10054956
R
C
110
7
151570239
R
T
111
CTACAATCCAGTTGCAGCAC
 431
TAAGTGCTGAACAGGGTG
1683
TGAACAGGGTGCTTAA
2935

















C






2FH21F_07_061
21
10055024
R
G
116
7
151570307
R
A
116
TCTGCTGAGCATCTATTATC
 432
TACTGGTGGAGGCATTAGTG
1684
TTGTTTATTGATGAAT
2936

















TCATACACA






2FH21F_07_063
21
10055125
F
A
119
7
151570408
F
G
118
CAGTTTGTAGATTAAGGAGG
 433
CCACCAGTAATAACCTAGAA
1685
ATCTTGAATTTCTTCA
2937

















CTTAAAAAAA






2FH21F_07_064
21
10055296
F
T
108
7
151570578
F
C
108
CAGAAAGAAACTTAATGCT
 434
AAACACTACCTGGCAGGGAC
1686
GGCAGGGACTGAATTT
2938

















GAACC






2FH21F_07_067
21
10055438
R
A
119
7
151570720
R
C
119
CTCAGGTAAACTGTCCAAGC
 435
GTTGCTTCTAAATAGCCTAT
1687
TAAATAGCCTATCCTC
2939















C

CAC






2FH21F_07_071
21
10055681
R
C
107
7
151570963
R
G
107
CCAAGGTTGCTTATAAACAG
 436
CTTTTACCAGTTATCTTCC
1688
TCTTCATTGCTTTCAC
2940

















TTTTC






2FH21F_07_072
21
10055703
F
C
107
7
151570985
F
G
107
CTTTTACCAGTTATCTTCC
 437
CCAAGGTTGCTTATAAACAG
1689
GAAAAGTGAAAGCAAT
2941

















GAAGA






2FH21F_07_074
21
10055918
R
T
95
7
151571200
R
C
 95
GTAGAACAAGAAATTAGACC
 438
TTATTGAAGGCTAAAGCTG
1690
TATTGAAGGCTAAAGC
2942

















TGATAATA






2FH21F_07_081
21
10056637
R
G
112
7
151571928
R
C
112
GAAAGCAATTAGAACATGA
 439
ACCCTGTATGTATCATCACG
1691
AATGTAATCACACTAC
2943

















TATGATCTA






2FH21F_07_082
21
10056705
R
C
102
7
151571996
R
A
102
GACGTGATGATACATACAGG
 440
GTATTCCCATTCTAATTAGG
1692
AATAATCTTAGGTCTT
2944

















CTTGTAT






2FH21F_07_084
21
10057393
F
A
92
7
151572685
F
C
 95
GCAGGATTTCACAAAGATGA
 441
CAATATCCAATTTGCTGTCT
1693
CCAATTTGCTGTCTGT
2945













G

G

TACTTCT






2FH21F_07_088
21
10057855
R
A
116
7
151573150
R
G
117
ATTTAAAACTGAATATACTT
 442
TTCTGTTGTTCATGGAACAC
1694
ACACATTTTAATGCAG
2946













G



ATAATTG






2FH21F_07_090
21
10058493
R
A
104
7
151573797
R
G
104
ATTTGCCCACCATGAAACAG
 443
CAATTCTTTGGTCTTTACCA
1695
CTAACCAAAGAAATGT
2947















G

AGATTTAC






2FH21F_07_094
21
10059025
R
A
105
7
151574328
R
C
105
ACTAAAAAGCTGGAGGGAGG
 444
GCCCCTCTTGTTACTACTTC
1696
GCCCCTCTTGTTACTA
2948

















CTTCATCATTT






2FH21F_07_095
21
10059172
F
A
101
7
151574474
F
G
101
CCAGGTTCAATACATTAGGA
 445
TAAGCCTGGAAATACACCCC
1697
CCCCTCCCCAATATTT
2949













C



C






2FH21F_07_105
21
10059545
F
G
106
7
151574848
F
T
107
AGACAAGGTACACGAAAGGG
 446
GGCCTAGTTTTACTGCACAC
1698
GCCTAGTTTTACTGCA
2950

















CACGTCTTT






2FH21F_07_106
21
10059627
F
T
92
7
151574931
F
G
 92
TGTGAAAATTAGTCTCCTC
 447
TCCCTTTCGTGTACCTTGTC
1699
GTCTTTAGAGAATAAA
2951

















ATATATCTGG






2FH21F_07_109
21
10059776
R
C
116
7
151575081
R
A
116
GCCAAACTTTAATCCATTT
 448
TCACAATAGTAATTTGGAG
1700
TGATTGAAATTGCTTC
2952

















AAGT






2FH21F_07_112
21
10059962
F
G
82
7
151575268
F
A
 86
CTACCCTTTAAGAATGAGTT
 449
CATTTTGCCATGCAGTTTTA
1701
GCAGTTTTACTTAAAT
2953













C

C

CTCACTTA






2FH21F_07_115
21
10061071
F
A
115
7
151576385
F
C
115
CTGCAGTTGTTAGAGGAACC
 450
GTTTCTAGTGGAAGAGTGAC
1702
TTTCTAGTGGAAGAGT
2954

















GACAGATTC






2FH21F_07_116
21
10061077
R
A
109
7
151576391
R
T
109
AGTGGAAGAGTGACAGATTC
 451
CTGCAGTTGTTAGAGGAACC
1703
GAATCAAGGCCTCCAA
2955

















AATT






2FH21F_07_117
21
10061102
F
T
109
7
151576416
F
C
109
CTGCAGTTGTTAGAGGAACC
 452
AGTGGAAGAGTGACAGATTC
1704
GAGGCCTTGATTCTTC
2956

















T






2FH21F_07_119
21
10061143
R
C
110
7
151576457
R
T
110
TTTGGAGGCCTTGATTCTTC
 453
TCGTTACACACCAGATCAC
1705
ACCAGATCACTGTGCA
2957

















GCAAGA






2FH21F_07_122
21
10061299
R
G
116
7
151576613
R
A
116
TATGCTTCACTTCAGAAGAC
 454
TATCATCCCAACATACAGT
1706
TCCCAACATACAGTGA
2958

















ATAC






2FH21F_07_128
21
10061656
R
C
100
7
151576973
R
G
100
TGTTATGTGAGGTACCTAAG
 455
CATCTGGGTATCTACTATTA
1707
TGCCTACACATTCTAG
2959















G

ATCA






2FH21F_07_130
21
10061746
R
G
92
7
151577063
R
A
 92
AGACTCAAAAGCACAGACAG
 456
GGTTGGCAGGTATGGTTAAG
1708
GCAAAATAAATATTGG
2960

















TGGTTAG






2FH21F_07_131
21
10061791
F
G
120
7
151577108
F
C
120
GATTTCCTGAGATTAGTCTT
 457
TTTGCTTAACCATACCTGCC
1709
CCATACCTGCCAACCT
2961

















A






2FH21F_07_135
21
10062478
F
T
112
7
151577796
F
C
112
ATCCCAAAGACATTTTTGC
 458
CCATTGTCAATTCTTTTCCA
1710
ATCTCTTAACTAAAAG
2962















G

ATTTAGTTAC






2FH21F_07_136
21
10062502
R
T
118
7
151577820
R
C
118
CCATTGTCAATTCTTTTCCA
 459
GTCTTTATCCCAAAGACA
1711
TTTATCCCAAAGACAT
2963













G



TTTTGC






2FH21F_07_138
21
10066094
R
A
93
7
151587748
R
C
 93
ACCTATCTGACAATGACTGG
 460
TGCTCCCTGGTGAGCTGGA
1712
CCTGGTGAGCTGGAGT
2964

















GGGG






2FH21F_07_142
21
10066675
R
A
99
7
151588323
R
G
 99
CTCTCAAAAGAGAATAGCAG
 461
TCTCAGCTTGTTCTGTCTCC
1713
CCCCTTTGGTGTGCTT
2965

















CTTT






2FH21F_07_143
21
10066747
F
C
116
7
151588395
F
T
115
AATATCTAGTAACTACTGG
 462
CACCCAGAATTCTCTACCAG
1714
CCCAGAATTCTCTACC
2966

















AGTTCTCAAGA






2FH21F_07_147
21
10067472
F
C
104
7
151589126
F
A
108
GTTGAATGGTTATCTTTTCA
 463
GTTACCTCTATTAAGCTTTT
1715
CCTCTATTAAGCTTTT
2967













C

C

CAAAAGATA






2FH21F_07_150
21
10067666
R
C
108
7
151589324
R
G
108
CATTACATAGAATAAAGAAC
 464
TGTGGCTGTTATTTAGCAAG
1716
GTGGCTGTTATTTAGC
2968

















AAGTAGGTCA






2FH21F_07_151
21
10067696
F
T
97
7
151589354
F
G
 97
GACCACTATTAATTGTTCCT
 465
GACCTACTTGCTAAATAACA
1717
CTTGCTAAATAACAGC
2969















G

CACAAG






2FH21F_07_152
21
10067754
F
T
96
7
151589412
F
A
 99
GATAGGAACAATTAATAGTG
 466
GTTAGATGAAGTCCTTTTAC
1718
GACTTGTTGATTCAAC
2970













G

C

AAGTT






2FH21F_07_153
21
10067846
F
A
102
7
151589507
F
C
 99
AATTTAACTAAGGTAGGTTT
 467
TAAACACAAATGCTACACC
1719
ATGCTACACCTTTAAA
2971

















AAGTCA






2FH21F_07_156
21
10068270
F
G
103
7
151589937
F
A
103
GGCCAGAGTTCATCACAATC
 468
AAAGAGCTGCTGGGTAACTG
1720
GGCTACCTGGGAAGTG
2972

















GG






2FH21F_07_157
21
10068378
F
G
112
7
151590045
F
A
112
CTGCAAGCAGTATTACCAGG
 469
GAGAGAAAGCCCCTCCCCT
1721
CCACCACTCAGGCAGA
2973

















TGCCTA






2FH21F_07_160
21
10068563
F
C
101
7
151590229
F
A
101
AAGGCACAGCATTGTCATTG
 470
ACATCACCCTCCTTTCCCAG
1722
AGGCCCTCCACCTCCT
2974

















C






2FH21F_07_161
21
10068616
F
T
120
7
151590282
F
C
120
TGACCCTCAGGTGCTGCAT
 471
AATGACAATGCTGTGCCTTC
1723
TGCTGTGCCTTCCACT
2975

















CC






2FH21F_07_164
21
10068653
R
G
120
7
151590319
R
A
120
AATGCTGTGCCTTCCACTCC
 472
ATGGAGATGACCCTCAGGTG
1724
TGGGCCTGGAGCGGGT
2976

















T






2FH21F_07_166
21
10068814
F
C
109
7
151590480
F
A
109
CCTACCTCACTTGGCTTCTG
 473
ATTCCAAGGGCTATCTCCAC
1725
CCCAACCCGGCTCTGA
2977

















ACGCCTC






2FH21F_07_168
21
10069480
F
G
94
7
151591156
F
A
 94
AAACATAAGTTTAAAGATAA
 474
GCATCTTGCTATCTTCTCCC
1726
GCTATCTTCTCCCGAT
2978













G



TGTCTAAAAA






2FH21F_07_176
21
10070235
F
G
116
7
151591914
F
A
115
AGCTCTTCTTGCTTTCCCTG
 475
CTCTGTTGAGATTTTTGAC
1727
GATTTTAAATTCAAGA
2979

















GGAGGGGAA






2FH21F_07_178
21
10070329
R
G
113
7
151592007
R
A
113
GTGACTTTTTATGGAGAGG
 476
GAATGAAATCTGGGGGATAA
1728
ACAGGAAGATGGGTCA
2980

















GTT






2FH21F_07_179
21
10070373
F
A
116
7
151592051
F
G
114
GAGTACTTGTCCTCCAAGAT
 477
GCCTCCAATTATTATTCAG
1729
CCTCTCCATAAAAAGT
2981

















CAC






2FH21F_07_180
21
10070397
R
C
92
7
151592073
R
G
 90
CCTCTCCATAAAAAGTCAC
 478
CTTGAAGGAAGAGTACTTG
1730
ACTTGTCCTCCAAGAT
2982

















CTTT






2FH21F_07_181
21
10070432
R
C
82
7
151592108
R
T
 82
AAAGATCTTGGAGGACAAGT
 479
CTCAGTTTCTTGGGAAGGAT
1731
TTCTTGGGAAGGATTA
2983

















AAAGA






2FH21F_07_183
21
10070468
R
C
107
7
151592144
R
T
107
AGGACAAGTACTCTTCCTTC
 480
ATTCAGTAAACATTTATTCG
1732
ATTCAGTAAACATTTA
2984

















TTCGATACCTT






2FH21F_07_186
21
10070670
R
G
119
7
151592346
R
A
119
TTGGGCATAATTCTTGCTGG
 481
ACCCCCATGATTCTAATGAG
1733
GATAATTTGGGGATGT
2985

















TACCAG






2FH21F_07_187
21
10070767
F
A
119
7
151592443
F
C
119
ATCCTGGTCAGCATAATTCC
 482
GGAGAAATGACCAAGAGATG
1734
GAGAAATGACCAAGAG
2986

















ATGAAATAC






2FH21F_07_188
21
10070815
R
A
120
7
151592491
R
G
120
TTGAGTAGATCCTGGTCAGC
 483
TGACCAAGAGATGAAATAC
1735
AAATTTGTAAATGCCA
2987

















CATATTTC






2FH21F_07_194
21
10071259
F
T
99
7
151592988
F
C
 99
ATTCAAAGCTGTGTATTGGG
 484
GAACAACCTCTATTATATTA
1736
ACAACCTCTATTATAT
2988















C

TACACAAAC






2FH21F_07_195
21
10071393
R
G
96
7
151593122
R
A
 96
TTCTGGCACACTTTGCACTC
 485
TGTGGTCAGCACTATCATGG
1737
TCATGGAATGTGCCTG
2989

















GATA






2FH21F_07_198
21
10071650
F
T
115
7
151593379
F
C
115
GCATCATGAACCTTTCAGAC
 486
GATTAAATACCCTACAGTG
1738
ATACCCTACAGTGTTT
2990

















TTATTG






2FH21F_07_200
21
10071825
F
C
108
7
151593554
F
T
108
GTTACACTGCAAAGCATTTC
 487
GCTGGATACCTAATTAATGC
1739
TACCTAATTAATGCTC
2991

















AATATATGCT






2FH21F_07_202
21
10071854
F
C
108
7
151593583
F
A
108
GTTACACTGCAAAGCATTTC
 488
GCTGGATACCTAATTAATGC
1740
GAACCAAACAAGGAAA
2992

















ATAC






2FH21F_07_203
21
10071857
R
C
114
7
151593586
R
T
114
GCTGGATACCTAATTAATGC
 489
GTTTATGTTACACTGCAAAG
1741
CACTGCAAAGCATTTC
2993















C

TTA






2FH21F_07_207
21
10072259
F
A
102
7
151593988
F
C
102
TATGCATAAGTTTAACTGTA
 490
TACTAACAGTTCTTTTACC
1742
AAATATAAGGATAAAC
2994

















TGCCCTG






2FH21F_07_210
21
10072886
F
T
93
7
151594614
F
C
 93
GTTTCAAGATGCTTGACTGG
 491
GAAGGTTTGGTCAATCCTAT
1743
CAATCCTATCAATTTC
2995

















TCTCTGACTCA






2FH21F_07_211
21
10074617
F
C
93
7
151596351
F
T
 91
GTTTCTTGTAAGCATATGGG
 492
GAATACCTATTACCACACCC
1744
TACCTATTACCACACC
2996

















CAAATACC






2FH21F_07_212
21
10074885
R
G
119
7
151596617
R
A
119
CATCCCAGTTATGTCCTTTC
 493
TGGCTCTTTAAGTGATAGGC
1745
ATCTAACAATGGAAGC
2997

















ATCATAAATT






2FH21F_07_214
21
10075482
F
A
96
7
151597193
F
T
 96
TCAGTAAGGAATTGGTGGA
 494
CTCTGCAACAAGACAACTG
1746
CTGTCATTGTCACAAA
2998

















AATCAC






2FH21F_07_215
21
10075500
R
C
116
7
151597211
R
T
116
GTCATTGTCACAAAAATCAC
 495
CCAATGATCCATAGTAATC
1747
TCAGTAAGGAATTGGT
2999

















GGA






2FH21F_07_216
21
10075520
F
C
116
7
151597231
F
T
116
CCAATGATCCATAGTAATC
 496
GTCATTGTCACAAAAATCAC
1748
TCCACCAATTCCTTAC
3000

















TGA






2FH21F_07_219
21
10075639
R
T
101
7
151597352
R
A
 99
CTGGTGCAAAAACACTTAA
 497
GTTGGAACCAACCTCATTTC
1749
TTTCTTTGTGTAGTGC
3001

















TTTTAAAAAT






2FH21F_07_220
21
10075694
R
A
81
7
151597407
R
G
 81
AATGAGGTTGGTTCCAACCC
 498
GGTTGTTTCAGTATTCCCAC
1750
TTCCCACACATCTTCT
3002

















C






2FH21F_07_223
21
10076079
R
A
113
7
151597787
R
G
113
GAAAGTGATGAGTATTTGAG
 499
AACCTTGCTCCCTTTACTTC
1751
CCCTTTACTTCATTTA
3003

















GCTTCAT






2FH21F_07_226
21
10076263
F
T
110
7
151597971
F
C
110
GCTGTTCACCAATGCTTTTA
 500
TAGAACAGAGCTTATCACAG
1752
GCTTATCACAGATCCT
3004

















TAAAC






2FH21F_07_228
21
10076329
R
G
117
7
151598037
R
C
117
CCAGACAACACATAAGAAT
 501
CAATGCTGATTTGGTCCTTC
1753
TGACAGCTATTTTGAC
3005

















TTTT






2FH21F_07_229
21
10076363
F
G
104
7
151598071
F
T
104
GAAAGCAATGCTGATTTGGT
 502
TAAAAGCATTGGTGAACAGC
1754
AATAGCTGTCATACAG
3006













C



TGTGAATT






2FH21F_07_230
21
10076479
F
A
118
7
151598187
F
G
118
TCTAGCCTCTTTGGATGAC
 503
TTTCATCACTGGCAGGACAC
1755
TTTGTCTATAAAAGAG
3007

















AATCTCTGG






2FH21F_07_233
21
10078516
F
A
119
7
151600224
F
T
114
ACCTTCAGTTACATGTTAG
 504
CATTATATACATGATCAACA
1756
ATTATATACATGATCA
3008

















ACAACAGCA






2FH21F_07_234
21
10078568
R
A
119
7
151600271
R
G
114
ATACATGATCAACAACAGC
 505
AGTGTATACCTTCAGTTAC
1757
TATACCTTCAGTTACA
3009

















TGTTAG






2FH21F_07_235
21
10078595
R
A
84
7
151600298
R
G
 84
CTAACATGTAACTGAAGGT
 506
ATGGCAGTGCTACTTTCTAC
1758
TTCTACTGAAAACTGT
3010

















GTTCTAA






2FH21F_07_238
21
10078870
F
A
100
7
151600575
F
G
 99
TCAATCTGGAAGAGAAGAAC
 507
TGCACTTGCTGAAGTAACTC
1759
AGTAACTCAGTACATA
3011

















AATAGTAGCC






2FH21F_07_239
21
10078889
R
A
111
7
151600593
R
G
110
TGCACTTGCTGAAGTAACTC
 508
TTGTACACTCTTCAATCTGG
1760
TTCAATCTGGAAGAGA
3012

















AGAACTT






2FH21F_07_240
21
10079022
R
G
89
7
151600722
R
T
 85
GTTTGCCTTACCTATAATTT
 509
TGTGTCCACATATGTAATC
1761
CCACATATGTAATCAT
3013













G



ATCACC






2FH21F_07_241
21
10079119
R
C
106
7
151600819
R
T
107
AAAGGGTAATGATCATGTA
 510
CTTCTCCAGGTCTGTGAAAC
1762
CCAGGCTTAAACTAAT
3014

















CTCAAATAC






2FH21F_07_242
21
10079159
F
T
82
7
151600859
F
C
 82
GAGATTAGTTTAAGCCTGGG
 511
TTTCCTATCTTCTCCAGGTC
1763
TTCCTATCTTCTCCAG
3015

















GTCTGTGAAAC






2FH21F_07_243
21
10079191
F
A
117
7
151600891
F
G
116
CTTTTTTATGTCACCTCTTA
 512
CACAGACCTGGAGAAGATAG
1764
CCTGGAGAAGATAGGA
3016













G



AAAAA






2FH21F_07_245
21
10079219
F
G
117
7
151600918
F
A
116
CTTTTTTATGTCACCTCTTA
 513
CACAGACCTGGAGAAGATAG
1765
AACATTGCTAAGGAAC
3017













G



AG






2FH21F_07_247
21
10079325
F
T
105
7
151601024
F
G
105
GAGATCTCTCCTTTTTCTTA
 514
GGAAATTCAATAGACTAGGA
1766
TTCAATAGACTAGGAG
3018













C

G

AAAAAA






2FH21F_07_253
21
10079512
F
T
99
7
151601209
F
A
 95
GAATTATAAAATACTATTTG
 515
CCTTTTCATGATTCATCTAT
1767
CCTTTTCATGATTCAT
3019













G

C

CTATCTTAGTC






2FH21F_07_254
21
10079748
R
C
131
7
151601433
R
T
119
ACTGGATGGCTTTTTAGTGT
 516
CCACTGTAGAAAGATGTAA
1768
CTGTAGAAAGATGTAA
3020

















ATAGGGACT






2FH21F_07_256
21
10079996
R
C
120
7
151601681
R
T
120
ACACTCAGGGAATTTACAAC
 517
GACCAAGCTCCTGAAAGATG
1769
CTTTTAAACTTCAACC
3021

















AATGT






2FH21F_07_262
21
10080693
R
A
110
7
151602391
R
C
118
TACAAAATAAACTCATCAAT
 518
GTTGATTGCTACATTGAAG
1770
TTGCTACATTGAAGTA
3022













T



TGTAGTTTT






2FH21F_07_264
21
10080826
R
A
99
7
151602525
R
G
 99
TTCCCATTTCAACCTGCCTC
 519
TAGACTGCCCCTCTTGTTTG
1771
TTGTTTGGGGCTTATT
3023

















TCTGTG






2FH21F_07_268
21
10081077
F
G
101
7
151602776
F
A
101
GATCATGTAATGGCATAAGC
 520
CTCTGTGGGAAATGACTATC
1772
TATCTAACATAAATTT
3024

















TTGTTTACACC






2FH21F_07_269
21
10081089
R
C
103
7
151602788
R
T
103
CTCTGTGGGAAATGACTATC
 521
CTGATCATGTAATGGCATAA
1773
TGATCATGTAATGGCA
3025















G

TAAGCAAGTA






2FH21F_07_270
21
10081127
F
T
99
7
151602826
F
C
 99
CAAAGATAGTATGGTGCCTC
 522
CTTATGCCATTACATGATCA
1774
GCCATTACATGATCAG
3026















G

TTTATCTTTT






2FH21F_07_271
21
10081152
R
G
117
7
151602851
R
A
117
GCCATTACATGATCAGTTT
 523
CAGCATTTTTGGTGCTTTGG
1775
GATAGTATGGTGCCTC
3027

















AA






2FH21F_07_277
21
10081324
R
G
99
7
151603023
R
C
 99
GTGGCTCATAAACAGCTTAG
 524
CCACAGTAATGTTAGCAGGG
1776
ATGTTAGCAGGGTCCA
3028

















ACTGTCT






2FH21F_07_279
21
10081461
R
T
95
7
151603160
R
A
 95
TGTTTTCAATGTTTTATGTG
 525
GCAGTAGACTGATGACAGTG
1777
GTGAGGAAGAGTTTGA
3029

















TAGTATGTGA






2FH21F_07_282
21
10081890
F
T
116
7
151603589
F
C
118
CCTGTTTTGTAAAAGCTGGT
 526
GCTATTTTGGCACTCAAGGG
1778
TTTTGGCACTCAAGGG
3030

















TATTAATG






2FH21F_07_283
21
10081972
R
G
103
7
151603668
R
A
 98
ACCAGCTTTTACAAAACAGG
 527
CTGGGTTCTGTTAATGCACT
1779
TATTTAGATACCTTGG
3031

















GAGTTA






2FH21F_07_289
21
10082542
F
C
92
7
151604238
F
T
 92
TAGGAAGATACATTCCAGAC
 528
AGCTAATGAAGAGCACTCGG
1780
CACTCGGCATTAAAAG
3032

















AAAA






2FH21F_07_293
21
10083271
F
G
109
7
151604963
F
C
112
TTGAAAATTCCTCAGACTC
 529
CCCATATTAATCCAAGAAC
1781
CCATATTAATCCAAGA
3033

















ACACAATAA






2FH21F_07_298
21
10083542
R
G
84
7
151605235
R
A
 84
TGGTTTTAGGCTACGTGCTC
 530
AAACAAATTTGGAGCATGGG
1782
ATTTGGAGCATGGGGA
3034

















GCCTTA






2FH21F_07_302
21
10085885
R
G
112
7
151607541
R
A
112
TGCTGTTAATGAGATCCGAG
 531
GAATAATTTCATAGATTAGG
1783
TTTTATTTCAGTCAGC
3035

















TTTATTTCA






2FH21F_07_303
21
10085999
F
T
110
7
151607655
F
C
110
GACCTGAAGTAATGAACAGT
 532
GTGTGTTTAATAGTATGCC
1784
GTATGCCAACTAGAAT
3036

















GATTA






2FH21F_07_304
21
10086054
R
T
115
7
151607715
R
A
120
GGCATACTATTAAACACAC
 533
ATCCCACTCTTAGCAGTCTC
1785
TGTAATGTCGTTTGAT
3037

















GTTATTT






2FH21F_07_305
21
10087226
F
G
106
7
151608858
F
C
106
CATAGTGTTAAGACATTGTG
 534
GCTTTGGTCTCTGCCAAATC
1786
GGTCTCTGCCAAATCA
3038

















CTATTA






2FH21F_07_306
21
10087247
R
C
106
7
151608879
R
A
106
GCTTTGGTCTCTGCCAAATC
 535
CATAGTGTTAAGACATTGTG
1787
CATTGTGTAATGTAAG
3039

















TATAATGT






2FH21F_07_307
21
10087343
F
A
96
7
151608975
F
T
 94
ACTCAGAAAGCTTGCCTCTC
 536
ACTCTGGCTTGGAAATGAGG
1788
GAGGAGGCAGAATCTC
3040

















AGA






2FH21F_07_308
21
10087356
R
T
100
7
151608986
R
A
 98
ATGAGGAGGCAGAATCTCAG
 537
TAGAGGGCACTTTTGTGGAC
1789
ACTCAGAAAGCTTGCC
3041

















TCTCCTATTTT






2FH21F_07_309
21
10087427
F
T
111
7
151609057
F
C
111
AAGTGCCCTCTACCTATTGG
 538
AGGGACCTATTTCTTCAGGG
1790
TATGTATGTTGTTACA
3042

















AATAGAGA






2FH21F_07_312
21
10089160
F
C
89
7
151610833
F
A
 87
TATATATAAAATTCACTTTG
 539
GGGTATTCCTAGAAATGTG
1791
TGTACCTATTATTCAC
3043













C



TTGCT






2FH21F_07_321
21
10089979
F
T
104
7
151611658
F
G
104
CGAGTTTCTCCAAACAGATG
 540
TCAACCAGAATCTGGTTCAC
1792
ATCTGGTTCACCTTAT
3044

















TGACTCA






2FH21F_07_323
21
10090076
F
A
92
7
151611755
F
G
 91
GCAGGTACTGGAAATCTGCT
 541
TGGTGAACAAACTGTTTGTG
1793
TGTTTTCCACTTTTCT
3045

















TAAAAAA






2FH21F_07_325
21
10090219
R
T
118
7
151611897
R
A
118
AATCACAGAAGGGCTATCAG
 542
GCTGTGATTATATAAATACT
1794
TATATAAATACTCTTT
3046















C

TGATGCATAA






2FH21F_07_329
21
10101118
R
G
88
7
151706710
R
A
 88
CTTTGGTACCAATTCTAGAT
 543
AGGAACATGAGACCAGGAAG
1795
GAGACCAGGAAGTTAA
3047

















ATACC






2FH21F_07_331
21
10101393
R
T
113
7
151706985
R
C
113
ACAAGCTCTATCTTCCTTAC
 544
GGGAAGTTTTTTGAAGATGG
1796
TTGAAGATGGGAGAAA
3048















G

GA






2FH21F_07_332
21
10101424
F
T
98
7
151707016
F
G
 97
TTCTACAGACCAGGCTGTTG
 545
TCTCCCATCTTCAAAAAAC
1797
CCATCTTCAAAAAACT
3049

















TCCCCC






2FH21F_07_333
21
10101607
R
A
107
7
151707208
R
T
117
CTGAAACTTTTTTCAATGCC
 546
AAAGTGGTTCAACTGAAAG
1798
GTGGTTCAACTGAAAG
3050













C



AATGAAAAG






2FH21F_07_334
21
10103626
F
T
101
7
151708905
F
C
101
TTCAGCCATGTTCAAAAGGG
 547
GCTTGGGATTCAAGTCATAA
1799
AGGTCTGTCTTACCTT
3051

















TC






2FH21F_07_335
21
10103674
R
A
107
7
151708953
R
C
107
CTTTTGGAGTCTCTCTGCTA
 548
GAAAGGTAAGACAGACCTAG
1800
ATATTTATGACTTGAA
3052

















TCCCAAGCTA






2FH21F_07_337
21
10103849
F
A
92
7
151709127
F
G
 92
AACAGAACAAAACTTGATG
 549
AGATGTTCAATGGACATCCC
1801
CCCATTTCTTTTGTAA
3053

















AAGCAACTTGA






2FH21F_07_340
21
10104391
R
T
120
7
151709669
R
A
120
CTGTTCTACAATAGAGGCTT
 550
GTGAAATCTCAGGATTCAT
1802
AATCTCAGGATTCATG
3054

















GTATC






2FH21F_07_343
21
10104535
F
C
104
7
151709817
F
T
104
AAAGAACTGGCAGAATGTGG
 551
GCTAAAAGCTTTGAGTGATG
1803
TAAAAGCTTTGAGTGA
3055

















TGTTTGATTA






2FH21F_07_347
21
10104730
F
C
100
7
151710012
F
T
100
CCATATGGACTTTTGAGCAG
 552
CAATGTCCATGTCTCCTTCC
1804
TATCCCTACCCATTAA
3056

















TACTGTA






2FH21F_07_349
21
10104785
R
A
118
7
151710067
R
T
118
CTCAAAAGTCCATATGGTTG
 553
AAGTGGATTGTAGCTAGTTG
1805
GAATGTCAAGCTTTAG
3057













C



GAATT






2FH21F_07_351
21
10104973
R
T
115
7
151710255
R
C
115
TCAAAAGCCATTCAGGCTTC
 554
CATGGCTAGATCTGGTTTCC
1806
ACTGTTATTCTGAGTT
3058

















GAATGC






2FH21F_07_352
21
10104999
R
G
115
7
151710281
R
A
115
CATGGCTAGATCTGGTTTCC
 555
TCAAAAGCCATTCAGGCTTC
1807
TCAACTCAGAATAACA
3059

















GTAAG






2FH21F_07_354
21
10105057
R
C
117
7
151710339
R
T
117
AACCAGATCTAGCCATGTTC
 556
GAAGTAGAAAGGCAAATAGG
1808
GACAGTGGCATGAGCC
3060















G

AAC






2FH21F_07_355
21
10105089
R
A
82
7
151710371
R
G
 82
GTTCAGAGAAGTAGAAAGGC
 557
GTTGGCTCATGCCACTGTC
1809
GCCACTGTCCTTATTT
3061

















ATAAC






2FH21F_07_356
21
10105122
F
A
105
7
151710404
F
C
105
TGAGGTGACTCTGTGTTTGG
 558
CCCCTATTTGCCTTTCTAC
1810
CCTTTCTACTTCTCTG
3062

















AACTC






2FH21F_07_357
21
10105140
R
A
105
7
151710422
R
C
105
GCCTTTCTACTTCTCTGAAC
 559
CTGAGGTGACTCTGTGTTTG
1811
GTGTTTGGGTTTTTGA
3063

















AAAGAT






2FH21F_07_358
21
10105198
R
T
95
7
151710480
R
C
 95
AACCCAAACACAGAGTCACC
 560
CCTGAAAGCCACAGGCATTG
1812
TGAAAGCCACAGGCAT
3064

















TGGGTGGGGT






2FH21F_07_359
21
10105280
F
C
119
7
151710562
F
T
119
GAATCTATCATAATCTCAGC
 561
CCTGTGGCTTTCAGGTCATT
1813
CAATTTTACTGGTTCT
3065

















CTTTTAGA






2FH21F_07_360
21
10105284
R
C
92
7
151710566
R
G
 92
CTCAATTTTACTGGTTCTC
 562
CCCATTCAGCTTACTAATGA
1814
ATCTATCATAATCTCA
3066

















GCTGT






2FH21F_07_365
21
10106079
F
G
113
7
151711353
F
A
113
TACAGGAATGTAGGAAGATG
 563
GCAGTCTTACAAAACCTAAG
1815
TACAAAACCTAAGCAA
3067















C

CCTT






2FH21F_07_366
21
10106087
R
C
108
7
151711361
R
T
108
GCAGTCTTACAAAACCTAAG
 564
TACAGGAATGTAGGAAGATG
1816
AAATGCTTTTCCCACA
3068













C



GATA






2FH21F_07_367
21
10106124
R
G
116
7
151711398
R
A
116
CTGTGGGAAAAGCATTTTTA
 565
AGTGAGAGTCACCAACATAG
1817
ACAGGAATGTAGGAAG
3069













G



ATG






2FH21F_07_368
21
10106166
R
C
107
7
151711440
R
A
107
CTTCCTACATTCCTGTAATC
 566
CCTAGGATTTCTGGTTCAGC
1818
AAAGTGAGAGTCACCA
3070

















ACATAG






2FH21F_07_369
21
10106228
F
C
113
7
151711502
F
T
113
TATAATCCCTCCTTTCCCAG
 567
CATAGGCTGAACCAGAAATC
1819
TCCTAGGAAAAACTGA
3071

















TGA






2FH21F_07_370
21
10106248
F
T
113
7
151711522
F
C
113
TATAATCCCTCCTTTCCCAG
 568
CATAGGCTGAACCAGAAATC
1820
AGAAAGCTAAGGGGAA
3072

















GGA






2FH21F_07_371
21
10106297
F
C
96
7
151711571
F
T
 96
CTAAGTGTATGCTCTGTGCC
 569
CCTGGGAAAGGAGGGATTAT
1821
GGAGGGATTATATTAC
3073

















ACATGTTA






2FH21F_07_373
21
10106738
F
C
85
7
151712012
F
A
 88
TCGGATCTCCTTCTAGAGTC
 570
TCTAGCCTTGTTAGTTGCCC
1822
AGTTGCCCAAATTCTG
3074

















AAAAAAA






2FH21F_07_374
21
10106828
R
C
119
7
151712103
R
T
117
AAGGAGATCCGAGAGGCAGA
 571
TTGGCATTACTCCTGATTCC
1823
ATTACTCCTGATTCCT
3075

















CCTTC






2FH21F_07_375
21
10106864
F
T
94
7
151712139
F
A
 90
GACTCATGATGCCCCTTTTC
 572
GGAGGAATCAGGAGTAATGC
1824
GTAATGCCAAGAATGA
3076

















GAA






2FH21F_07_376
21
10107874
R
T
95
7
151712663
R
C
 95
GCACTGATCCACCACTAGC
 573
CTGTATAGGACAGTATCTGG
1825
AATACCCAAAGACAAG
3077

















ATCTCTAAAG






2FH21F_07_377
21
10109898
R
C
80
7
151715027
R
T
 80
AAGTAACACTATTCTGTGG
 574
AGTATTCCTTAAAATATCAC
1826
CTTAAAATATCACTTT
3078

















AATATGCCA






2FH21F_07_380
21
10110237
R
G
114
7
151715373
R
C
116
GATTTCAGTTATATATGTAG
 575
TTAATGTAGGTGCAGTTCAG
1827
AATGTAGGTGCAGTTC
3079

















AGTAATGATT






2FH21F_07_381
21
10110269
F
T
92
7
151715405
F
C
 92
TTGGCATACTAGTATATGT
 576
CATTACTGAACTGCACCTAC
1828
GAACTGCACCTACATT
3080

















AATCA






2FH21F_07_385
21
10110756
F
G
99
7
151715879
F
A
 99
TCAGTTTTACTCCCCAGAGG
 577
GTCTTATCTACAAACCAAA
1829
TATCTACAAACCAAAA
3081

















ACATCT






2FH21F_07_391
21
10111466
R
C
98
7
151716644
R
T
 98
TCTAATCAGGAGATTTTGG
 578
CCAGGTATTCTTCAGGTTAG
1830
TTCAGGTTAGAACTCA
3082

















GTTTCACAA






2FH21F_07_393
21
10112627
F
T
100
7
151717812
F
C
100
GGATTTAAATATGGACCAGC
 579
CTTTTTTTAAACTAGCAGGG
1831
GTGAATAGTGGGATTA
3083

















CAGA






2FH21F_07_394
21
10113252
F
G
96
7
151718440
F
A
100
CACTGTTGTATACTTCGTAG
 580
GGAAGTAGAAACTGAAGAAC
1832
GTAGAAACTGAAGAAC
3084













C



ACTTTGTTAA






2FH21F_07_395
21
10114677
F
T
110
7
151719888
F
C
110
GTATGTATATGATAAAGCTA
 581
AAGCTCCTCAAAAGAGCTGG
1833
TCCTCAAAAGAGCTGG
3085













G



AGTATAAA






2FH21F_07_397
21
10115023
F
G
92
7
151720234
F
C
 92
CACTAAGGCCTTTCCAAT
 582
TTATCTGTTCTCCCCTACCC
1834
CTCCCCTACCCCCCAC
3086

















AAC






2FH21F_07_398
21
10115084
R
T
85
7
151720295
R
C
 85
ATTGGAAAGGCCTTAGTG
 583
GTAGTAGTATGTGAGTTTGG
1835
GTAGTATGTGAGTTTG
3087

















GATCATTTCT






2FH21F_07_399
21
10115123
F
C
81
7
151720334
F
T
 81
TCATTTTAGTTTGGAGAAC
 584
GATCCAAACTCACATACTAC
1836
AAACTCACATACTACT
3088

















ACTTCTTTATT






2FH21F_07_402
21
10115294
R
T
83
7
151720505
R
G
 83
TAGTTATTAGTAAACAACTC
 585
TGAGAACAGTTCCATAGCCC
1837
CCATAGCCCTTCATTT
3089

















TTA






2FH21F_07_403
21
10115433
F
A
103
7
151720644
F
G
103
GGGAGGGCATTCACACAAAA
 586
GCGCAGTGTTTAATGAACTT
1838
CACATCAGAACCACCA
3090















G

G






2FH21F_07_405
21
10116089
F
C
101
7
151721214
F
G
101
GCAGAGTCCAATGCATAATT
 587
AAGATAACTACCTGGCATTC
1839
AACTACCTGGCATTCA
3091

















GGTTAAAAT






2FH21F_07_406
21
10116140
R
A
119
7
151721265
R
G
119
CCTGGCATTCAGGTTAAAAT
 588
TGAAATTTACAAGTAGGGGC
1840
AAGTAGGGGCTGGTGA
3092

















T






2FH21F_07_407
21
10116273
R
T
120
7
151721398
R
C
120
CTGATCTCAGAGTTTAAAAC
 589
GTAAATAATTTTTGCATGCT
1841
AATAATTTTTGCATGC
3093













C



TAAGAAA






2FH21F_07_416
21
10119088
R
T
95
7
151729790
R
C
 95
GCATAACTGTTCTCAACCTT
 590
CCTTTCCTTTTCCCTTTATG
1842
CTTTATGTGCTTACAT
3094













G



CTGTCATTTCT






2FH21F_07_419
21
26029711
R
C
82
7
62991014
R
G
 82
GAGAGACGCTGCACGTGGA
 591
CGCCCGCACTCCAGAGCC
1843
TCCAGAGCCGGCTGAG
3095

















AAC






2FH21F_07_420
21
26052351
R
G
120
7
62991802
R
A
120
GTTCCAGATGACTCCAGAGA
 592
ACCACACTCAACATTTCGGG
1844
AGAAGATTTTTTTCAG
3096

















CGGGTTCCTC






2FH21F_07_421
21
26058471
R
A
103
7
62991971
R
G
103
GTGCAATCTGCTACACCTAC
 593
GAAATCCTCGGCGCTCTTTG
1845
TCTTTGTACTTTGGCT
3097

















GC






2FH21F_07_422
21
26058506
R
G
87
7
62992003
R
C
 84
CTGTTCTGTTCCCAGGTGAG
 594
GCAGCCAAAGTACAAAGAGC
1846
AGTACAAAGAGCGCCG
3098

















AGGATTTCAG






2FH21F_07_423
21
26063275
F
T
100
7
62992312
F
C
100
TGTGTACAAGTTTGTCTGTG
 595
CACATTCTGTGACCAAACGG
1847
CAACTCCGCTGCACTG
3099

















TATCCA






2FH21F_07_426
21
26063642
R
A
119
7
62992679
R
G
122
GTTAAAGGATCTCCACAAT
 596
GCTACACATTAATACTGACC
1848
ATTCCACAATGAACCT
3100

















GCCTTCACAC






2FH21F_07_427
21
26063674
F
A
107
7
62992711
F
G
110
CTGGCTATTTTTGGTAGGGC
 597
TGAAGGCAGGTTCATTGTGG
1849
AAGGCAGGTTCATTGT
3101

















GGAATAGTTT






2FH21F_07_429
21
26063792
F
T
90
7
62992832
F
A
 90
TCTCTAGGAAACAGTCTGGC
 598
CATCTAAAGCAGCAGAGAGG
1850
AGGGGAAACAGTTATA
3102

















TTTTCAAA






2FH21F_07_430
21
26063870
F
C
106
7
62992910
F
A
106
CCTCTCTGCTGCTTTAGATG
 599
GATTAGATGAAACAGGCACA
1851
TAGATGAAACAGGCAC
3103















C

ACATGCTTTA






2FH21F_07_431
21
26064006
R
A
86
7
62993046
R
G
 86
AAACCTGGATCTCCTCCTTC
 600
TGCAAGCAAAGGACAGTAAG
1852
TGCAAGCAAAGGACAG
3104

















TAAGAAGTTG






2FH21F_07_434
21
26064248
R
T
113
7
62993288
R
G
113
AACTGAAAAGGTATACCTC
 601
AATAAACTGGCACTACAGGG
1853
AAAAAGGAAGCCATAA
3105

















CAAACCAAA






2FH21F_07_437
21
26064421
F
A
113
7
62993461
F
T
113
GTCTTAAAGAGAAGACTGCC
 602
AGTACTTTACCTTTCAAGGC
1854
TGCAAATAGTTTTAAA
3106

















AGGAAAAT






2FH21F_07_438
21
26064428
R
T
113
7
62993468
R
C
113
AGTACTTTACCTTTCAAGGC
 603
GTCTTAAAGAGAAGACTGCC
1855
AGAGAAGACTGCCTAT
3107

















AACA






2FH21F_07_439
21
26064471
F
G
118
7
62993511
F
A
122
TGATCAACTGAATATGTATA
 604
TAGGCAGTCTTCTCTTTAAG
1856
AAGACAATACTTTTCC
3108

















ACTT






2FH21F_07_443
21
26064690
F
C
115
7
62993736
F
T
120
AATAGCTATCTGCCAGTCTC
 605
CAAAAATGGCTAGAAATGTC
1857
TGTCTTTTTCTTTCTT
3109

















TTCTCT






2FH21F_07_444
21
26064883
R
G
104
7
62993934
R
A
104
TAACAATGCCATCTTGCCTG
 606
AAAGCTTCTTAAGAGCTCAG
1858
TGACTTAACTAGGAGA
3110

















AAAAG






2FH21F_07_445
21
26064992
R
G
107
7
62994042
R
A
106
GAATGAATCCTAAGAGGCAG
 607
AAGATTACCAGAGAAAGAG
1859
GATTACCAGAGAAAGA
3111

















GATCAAAGAT






2FH21F_07_447
21
26065229
F
A
120
7
62994284
F
G
120
CCTTTCTTGCTGTCTATTTG
 608
AATTTGGGCACTGTGGTT
1860
ATGAAATAATAAACAG
3112

















AAGCTCTA






2FH21F_07_452
21
26065616
R
A
98
7
62994670
R
C
 98
CTCATAATTTGAACAGAGAC
 609
TGTCATGCATAAATGATGG
1861
AAAAAGCATCTGATCA
3113

















TGTA






2FH21F_07_454
21
26065675
R
G
80
7
62994734
R
A
 85
CCATCATTTATGCATGACA
 610
GAGTTTCTTGAATCAACTGG
1862
AATCAACTGGAGAAAT
3114

















TAGTCA






2FH21F_07_457
21
26066063
R
G
86
7
62995130
R
T
 86
GAAGATCAACCACACATAGC
 611
ATATTTGTGTTGGCATCAG
1863
TGTTGGCATCAGAAAA
3115

















ACAAAT






2FH21F_07_459
21
26066149
R
T
79
7
62995221
R
C
 84
TATTTTTGTATCAGTCTATG
 612
AATCAGGGGAGAAAACAA
1864
AATCAGGGGAGAAAAC
3116

















AACTAAACA






2FH21F_07_460
21
26066207
R
T
109
7
62995279
R
C
109
GTTTAGTTGTTTTCTCCCCT
 613
CAGCAGACCTCACAAAAATA
1865
CTCACAAAAATATTTG
3117













G



GTGGTACA






2FH21F_07_462
21
28675597
R
C
87
7
57161078
R
T
 87
CCCACTATTCAGACATTAG
 614
GTCTTTTTAAATGAGGCCTG
1866
TAAATGAGGCCTGTCA
3118

















TTATGTCATC






2FH21F_07_463
21
28675666
F
C
119
7
57161147
F
T
119
CTCAGTGAATGCGTGAGATT
 615
CAGGCCTCATTTAAAAAGAC
1867
AAACCATGTGTATTTC
3119

















TACAA






2FH21F_07_464
21
28900500
F
G
120
7
42279914
R
T
120
GGCAAACATAATTTGGATGG
 616
AGGTAGTTCTCTAAGTTAC
1868
GGTAGTTCTCTAAGTT
3120













G



ACCAAAATC






2FH21F_07_465
21
28900549
R
C
119
7
42279865
F
C
119
GTTCTCTAAGTTACCAAAAT
 617
CATGGGCAAACATAATTTGG
1869
AAACATAATTTGGATG
3121













C



GGTCT






2FH21F_07_466
21
28900702
F
G
104
7
42280104
F
A
104
GCTTCTACCAAGTTTATTTG
 618
CTCCCATTATTACTCTTCAG
1870
GTAGAAAATAACTTTG
3122

















GGGTAACAA






2FH21F_07_474
21
34400356
F
G
99
7
130139932
F
T
 99
GAATTGCTAACATTTCCAT
 619
GCAAAGTACATTCCTTTCTG
1871
GTACATTCCTTTCTGT
3123

















GGTATTTT






2FH21F_07_475
21
35894307
R
C
114
7
148135521
F
T
114
GTTTGAAATTCTGAATTTGC
 620
CTTTGCAGCTGGTGAGAAGG
1872
CTGGTGAGAAGGCAAT
3124

















AAAAAGTTGA






2FH21F_07_476
21
40333032
F
C
118
7
121388053
R
A
118
TTCATCTGCATAATTTAATC
 621
GAAAAACTAAAGTCTAACAG
1873
TAAAGTCTAACAGGGG
3125

















AAA






2FH21F_07_479
21
45508375
R
G
82
7
125645926
R
A
 82
TGTTTTATACAGCTCTCAG
 622
TGTTCTAGAAACAGTGCCTT
1874
AAACAGTGCCTTTTTC
3126

















AT






2FH21F_07_480
21
45508426
R
T
93
7
125645977
R
C
 93
AAAGGCACTGTTTCTAGAAC
 623
GTTACTCAAAGCTGTGCAGG
1875
AAGCTGTGCAGGGTAA
3127

















ATG






2FH21F_07_482
21
45508473
R
C
91
7
125646024
R
T
 91
TTACCCTGCACAGCTTTGAG
 624
CTCAAGCTTTTAAAATTGAC
1876
CTCAAGCTTTTAAAAT
3128















C

TGACCCTGAAC






2FH21F_07_483
21
45508504
F
A
107
7
125646055
F
G
113
GAGGGACAGACAGCTCTTC
 625
CAGGGTCAATTTTAAAAGC
1877
TCAATTTTAAAAGCTT
3129

















GAGAAG






2FH21F_08_001
21
14371001
R
A
118
8
47060648
R
T
128
ACTTCACAGAAACCGTTCCC
 626
TCTTTCTCCTTCTGAGATGC
1878
CCTTCTGAGATGCATC
3130

















TTCAAAC






2FH21F_08_003
21
17783776
R
G
107
8
52794904
R
A
107
CACATCTTCCTGGATTGGAG
 627
AAATATTCTGCTTGAATCC
1879
TACTCTGGAAGAATTT
3131

















TTGAA






2FH21F_08_004
21
17783855
R
A
106
8
52794983
R
G
106
ATACTCACAGTCTTAGATG
 628
GGATTCAAGCAGAATATTT
1880
TTTTTTCAAAGATCAG
3132

















TAAGCGGTGC






2FH21F_08_008
21
23758768
R
G
89
8
131135676
F
T
 89
TTAGCTCCATGACAGACCAG
 629
CCAAAGTAGGTTTTTGTAGC
1881
GGTTTTTGTAGCTGTA
3133

















AACTGTG






2FH21F_08_009
21
23758804
F
C
99
8
131135640
R
A
 99
GCTGAAGGAATAACACTTAC
 630
TACAGCTACAAAAACCTAC
1882
TACAAAAACCTACTTT
3134

















GGTATT






2FH21F_08_010
21
23758828
R
T
103
8
131135616
F
T
103
GCTACAAAAACCTACTTTGG
 631
CAGTGAATATTTTGCTGAAG
1883
TTGCTGAAGGAATAAC
3135















G

ACTTACA






2FH21F_08_013
21
23759109
F
A
116
8
131135335
R
C
116
CTGCTTTAATGGCAATCAAG
 632
TGCATTTAGAAGCTTACCTG
1884
CATTTAGAAGCTTACC
3136

















TGAAATCT






2FH21F_08_014
21
39452121
R
A
100
8
121215010
R
G
100
TCTTCATAACTACTACAATA
 633
TAGTAAATTTCCATCTGTG
1885
CATCTGTGTAAACTTT
3137

















ATTGAG






2FH21F_08_016
21
40846776
F
T
96
8
25626542
F
A
 95
GGGTTGGATTTGCATCCTAA
 634
GTAAAACATTATACAGCTC
1886
GGAAACAGCTTTCTAA
3138

















TTTTTT






2FH21F_08_017
21
46479557
F
T
119
8
130332631
R
G
119
ATGGTGGACATTTGAGCAG
 635
TGCATCAAGCATCTGAGAA
1887
CAAGCATCTGAGAATA
3139

















ACAT






2FH21F_09_004
21
20468633
F
T
94
9
114431868
R
G
 99
GTGTGTATAATGTTTGCCTC
 636
CCATAAGTTTTAGGCTGTAC
1888
TTAGGCTGTACCAACA
3140















C

AA






2FH21F_09_005
21
20468658
R
T
99
9
114431838
F
G
104
CCATAAGTTTTAGGCTGTAC
 637
CCATTGTGTGTATAATGTT
1889
CCATTGTGTGTATAAT
3141













C



GTTTGCCTCT






2FH21F_09_007
21
20468716
R
C
103
9
114431780
F
A
103
GAGGCAAACATTATACACAC
 638
CATATTTGTCTGTGTACTTG
1890
CTGTGTACTTGTGCTC
3142

















T






2FH21F_09_010
21
20468878
F
T
80
9
114431624
R
G
 80
CTGTGTCAAATATGTGACTG
 639
ACAAATATTGACAGGCAGCA
1891
GACAGGCAGCAGATTA
3143

















T






2FH21F_09_013
21
20469264
R
T
102
9
114431015
R
C
103
CCATGGTCAGTAATAGTTTG
 640
TTCCCACCAGGTTTCAGGC
1892
GGGTTAGAGTTACATT
3144

















TTCAG






2FH21F_09_016
21
20469522
F
C
108
9
114431266
F
T
108
AATTGTGGTTATTGTATTTC
 641
GGAAGTTAATTGGGAATAA
1893
ATTGGGAATAAAAAGA
3145

















TTTATCAATT






2FH21F_09_018
21
32523837
R
A
111
9
15976292
F
C
118
TGCAGACAGACATGGTCC
 642
GATGTGAATAAACACAAGC
1894
TGTGAATAAACACAAG
3146

















CTGATAA






2FH21F_10_003
21
26638582
F
T
88
10
69347648
F
G
 88
CTTTCAAGAAGTTCATACT
 643
ATGTTCAAAAATGGTCTGA
1895
AATGGTCTGAAAAATA
3147

















AATGCTTA






2FH21F_10_005
21
26638665
R
G
118
10
69347731
R
A
118
TCAGACCATTTTTGAACAT
 644
GAACAGCTATATTTCAAACC
1896
ACAGCTATATTTCAAA
3148















C

CCCTTTTTA






2FH21F_10_006
21
26638706
F
T
111
10
69347772
F
C
111
GGGAAATGGCCATTCAATAC
 645
GGGTTTGAAATATAGCTGTT
1897
AGCTGTTCTTTATGCA
3149















C

TAAAA






2FH21F_10_007
21
26638769
R
A
92
10
69347835
R
T
 92
GTATTGAATGGCCATTTCCC
 646
ACTGCATTCTTTAGTGTAGC
1898
AAATAAATTCAGATTG
3150

















AGACATCTT






2FH21F_10_011
21
26639000
F
C
100
10
69348063
F
T
100
TTAAAACAGTGTACAAGTAA
 647
GTAGACTGTTTAATGACTGG
1899
AATGACTGGATATCTT
3151

















CCT






2FH21F_10_016
21
36780234
R
A
106
10
95708632
R
G
106
AGGCCAGGGAGCCCACAG
 648
CTGAGTTCCTTCAGAGTGTC
1900
CCAACAATGAAGCCAT
3152

















T






2FH21F_10_018
21
36780339
F
G
116
10
95708737
F
C
116
AGACATTGATGCCAGCTCAG
 649
ACACTCTGAAGGAACTCAGG
1901
TAATCATCCTCCTCCT
3153

















TGGCTGGCT






2FH21F_10_019
21
36780343
R
A
116
10
95708741
R
T
116
ACACTCTGAAGGAACTCAGG
 650
AGACATTGATGCCAGCTCAG
1902
GATGCCAGCTCAGCCA
3154

















TGGACAC






2FH21F_10_020
21
46486292
F
A
100
10
28159033
R
C
100
GGCACAGGATGGTGGAACTT
 651
GTATCATGGAGTTGGAGAAG
1903
ACTTCAAGGATCTCTA
3155

















TGGGGA






2FH21F_11_001
21
23395848
F
G
113
11
124150014
R
A
113
GGGCTGAGCATCCCATCCT
 652
TGAAAGAACATGGTGTTG
1904
AAAAGAAAGAGCAGTT
3156

















ACACA






2FH21F_11_002
21
23395850
R
A
113
11
124150012
F
A
113
TGAAAGAACATGGTGTTG
 653
GGGCTGAGCATCCCATCCT
1905
ACACCTGTTCCAACTG
3157

















TTC






2FH21F_11_003
21
23395873
F
C
95
11
124149989
R
C
 95
GGGCTGAGCATCCCATCCT
 654
GAAAAGAAAGAGCAGTTACA
1906
GAACAGTTGGAACAGG
3158















C

TGTTTG






2FH21F_11_005
21
23395905
F
A
116
11
124149957
R
G
116
GACTCCAGCTCCTGGTACAA
 655
ACAGTTGGAACAGGTGTTTG
1907
GATGCTCAGCCCTGCC
3159

















AG






2FH21F_11_006
21
23396494
F
T
120
11
124143062
R
G
119
GGCCAGTTTATTAGAAAGA
 656
ATCGGTACAGTTGAAATGGG
1908
AATGGGAACTTTTTCA
3160

















GAG






2FH21F_11_007
21
23396572
F
G
108
11
124142985
R
T
108
GAAGTCGCTTGCCAAGGG
 657
GGAATTGGTTATAACACCCG
1909
ATAACACCCGTTGGAA
3161

















AG






2FH21F_11_008
21
23396581
R
A
108
11
124142976
F
C
108
GGAATTGGTTATAACACCCG
 658
GAAGTCGCTTGCCAAGGG
1910
TGATCTCAGCATAATG
3162

















GTAA






2FH21F_11_010
21
23396894
F
T
119
11
124142661
R
G
119
GAAAGGGTTTCCAGGTCAA
 659
GGCTATGAAGAATGTATTG
1911
GAAGAATGTATTGAGA
3163

















GGC






2FH21F_11_012
21
23397275
R
G
116
11
124142280
F
A
116
GGCTCTTTAGTTGAGTGC
 660
AGGAGCTAAGAGCCCAAATC
1912
GCCCAAATCCTTATGA
3164

















AGGATGAC






2FH21F_11_013
21
23397327
F
T
105
11
124142228
R
G
105
GTTTCCATGAAGAGTCTGA
 661
GCTCTTAGCTCCTTCTTCTC
1913
CCTTCTTCTCTACTCA
3165

















CTT






2FH21F_11_014
21
23397405
R
T
110
11
124142150
F
G
110
TCAGACTCTTCATGGAAAC
 662
TGAGGTCTGTTTTTTCTGGC
1914
AAGTCTACTATGATTC
3166

















CTTAGAAGTC






2FH21F_11_015
21
23397432
F
T
120
11
124142123
R
C
120
CATTTTCAGGTGAGGTCTGT
 663
TCAGACTCTTCATGGAAAC
1915
GACTTCTAAGGAATCA
3167

















TAGTAGACTT






2FH21F_11_019
21
25986415
F
A
115
11
109811803
F
G
117
TTCAACCACAACATCTAGCA
 664
GAAGATAAAATAACAGTCCA
1916
TAACAGTCCACTTTAT
3168















C

AAACC






2FH21F_11_020
21
25986457
R
T
108
11
109811847
R
C
110
ACAGTCCACTTTATAAACC
 665
ATTATTTTCAACCACAACAT
1917
AACCACAACATCTAGC
3169

















A






2FH21F_11_022
21
29170479
F
A
98
11
92982462
F
T
 99
ACTGAAGTCATTCATTAGG
 666
GGAATGTTCCACCTTTCTAC
1918
TGTTCCACCTTTCTAC
3170

















CTTTTTTT






2FH21F_11_023
21
29170506
R
G
100
11
92982490
R
T
101
GAATGTTCCACCTTTCTACC
 667
GAAACTGAAGTCATTCATT
1919
CTGAAGTCATTCATTA
3171

















GGTAA






2FH21F_11_024
21
29170534
R
A
121
11
92982518
R
G
121
ACCTAATGAATGACTTCAG
 668
CAGTCCTCAAGTTCACCAAG
1920
GAAACTGAATGCATTT
3172

















AGCATAT






2FH21F_11_026
21
29170588
R
G
121
11
92982572
R
A
119
ATGCATTCAGTTTCCAGTAG
 669
TGCACTTTCCAGACAAGCAG
1921
TCAGTCCTCAAGTTCA
3173

















CCAAGT






2FH21F_11_027
21
29170613
R
G
96
11
92982595
R
A
 94
ACTTGGTGAACTTGAGGAC
 670
AAAGGTCTGCAAGGAACCAC
1922
TCCAGACAAGCAGGCC
3174

















AAGAAACT






2FH21F_11_028
21
37392976
R
C
107
11
66718478
F
A
107
GTATATATAACTCCTGATC
 671
CTGTGTCAATGGCACATCTG
1923
ATGGCACATCTGAATT
3175

















ACT






2FH21F_11_029
21
37393011
F
C
81
11
66718443
R
A
 81
ACTCAGATAAAAGTCTTTC
 672
GTAATTCAGATGTGCCATTG
1924
TGCCATTGACACAGGA
3176

















GGACC






2FH21F_11_030
21
39479721
F
C
83
11
77021841
F
G
 83
AATAGGATTTAATTTGTTGT
 673
ATTCATTTAATCTGGCAATT
1925
CATTTAATCTGGCAAT
3177













T



TTTAATTT






2FH21F_11_033
21
40282355
R
A
115
11
8662624
R
G
115
AGATTTTCCATAGAGTGCTG
 674
TCTTATTTCCTGGAACCA
1926
TTTCCTGGAACCAGGA
3178

















TAAA






2FH21F_12_003
21
14364374
F
T
88
12
36842346
R
G
 88
GCGCTGCCACTAGAGCTG
 675
TGAGGTGTGTCTGGCTGTC
1927
TGTCCATCAGCCTCTC
3179

















TCTCC






2FH21F_12_011
21
14365323
R
T
81
12
36841410
F
G
 81
CTCCTCGTGGGGGTCCACC
 676
AAGGCGGAAGAGGTGGGATG
1928
GGATGCTGCTGCCTGG
3180

















CGGT






2FH21F_12_012
21
14368770
R
C
101
12
36831590
F
A
101
CTGCTTATGCACATCAACGG
 677
AAAGGTGAGCCAATGGGGTA
1929
GGTGAGCCAATGGGGT
3181

















ACAAAAT






2FH21F_12_013
21
14368851
R
C
120
12
36831509
F
T
120
AATCTTCAGGCACAACGAGG
 678
ACCCCATTGGCTCACCTTTC
1930
CACACTCCTTCCCCGC
3182

















C






2FH21F_12_015
21
14368945
F
G
83
12
36831415
R
T
 83
CCAGGCAACGGCCCTGAT
 679
TCTGCCTTACGACCAAAAGC
1931
CTGTGGCAAATTTTGA
3183

















GT






2FH21F_12_016
21
14369156
R
A
112
12
36831204
F
G
112
CGCACTTGGCAGAGTGGAG
 680
AGGCGGATGAGTGAGGCAG
1932
GCAGGCCCCTCCCACT
3184

















C






2FH21F_12_032
21
14396950
R
G
109
12
36794298
F
G
109
GAATCAGAGAATGTGATCAC
 681
TGAGCTATTGTCCCTCCAG
1933
CTATTGTCCCTCCAGC
3185













T



CTTTGGCCCT






2FH21F_12_036
21
14400021
R
T
117
12
36791807
F
G
115
GAAAAAAGACTAGATGCAGG
 682
GTTTAATTTACTGGTGCCC
1934
TTTACTGGTGCCCACA
3186













G



AGAAAAAAA






2FH21F_12_039
21
18364641
R
T
93
12
19154702
R
G
 93
CCAGCAGTCCTTAGGATTAC
 683
ATCTCATCTCCAATTTTAC
1935
TCTCCAATTTTACTTT
3187

















TTTTTTTCCCT






2FH21F_12_048
21
31116128
F
C
81
12
107311641
F
A
 81
TGACCTGCTGCCTCTGCTTG
 684
CAGCTTTGATTCTTAAACCC
1936
TTAAACCCCTTTACCC
3188















C

CAA






2FH21F_12_049
21
35466901
R
T
109
12
98716977
F
G
109
AAGAGGGAAGATGACTTTTC
 685
CTTCCTGTGAACCTGCTTTC
1937
GCTATCTTACTTTTCT
3189

















TTATTCCAC






2FH21F_12_050
21
35466974
F
C
109
12
98716904
R
A
109
GCAGGTTCACAGGAAGTTTC
 686
CTTCAAGGCAATCTTTCTCC
1938
TCCACTATTTAAAAAC
3190

















AAAACAAA






2FH21F_12_051
21
35467003
F
A
109
12
98716875
R
C
109
CTTCAAGGCAATCTTTCTCC
 687
GCAGGTTCACAGGAAGTTTC
1939
TTGTTTTGTTTTTAAA
3191

















TAGTGGAAAG






2FH21F_12_052
21
35467007
R
A
109
12
98716871
F
C
109
GCAGGTTCACAGGAAGTTTC
 688
CTTCAAGGCAATCTTTCTCC
1940
AGGCAATCTTTCTCCA
3192

















TAAACATA






2FH21F_12_053
21
35467047
F
A
107
12
98716831
R
G
107
GAAAGATTGCCTTGAAGATG
 689
CTCCACTTGTGCTCTTTATT
1941
TTCTTGAATTTTGATC
3193















C

ATCTCT






2FH21F_12_054
21
35467071
R
A
101
12
98716807
F
G
101
TTGCCTTGAAGATGCAAGAG
 690
CTCCACTTGTGCTCTTTATT
1942
TGCTCTTTATTCTATC
3194















C

ACTTTCTGCT






2FH21F_12_057
21
35467870
F
A
81
12
98716023
R
A
 89
TCAGAGCTTAGCTGCACTGG
 691
GCAGGCTTCAGGATAATTAT
1943
GGATAATTATGGTTGG
3195















G

AGTGC






2FH21F_12_058
21
35467877
R
G
102
12
98716008
F
T
110
CAGGCTTCAGGATAATTATG
 692
ATGGAAAAGGGATGCAAAG
1944
AGCTTAGCTGCACTGG
3196













G



TT






2FH21F_12_060
21
36344402
R
C
103
12
8757741
R
T
103
GCACAAGCTGATCAAGAT
 693
GAGGATAGTCTTCCCTGATG
1945
ACCACAACTTGGCAGC
3197

















CAC






2FH21F_12_064
21
36344480
R
A
98
12
8757819
R
G
 98
CATCAGGGAAGACTATCCTC
 694
CTAAAGTCCAGTTCCTCCTC
1946
CTCCTCACAACATTTG
3198

















GCCTT






2FH21F_12_066
21
36344707
R
T
116
12
8793866
R
C
116
CGCCATCTAGAGAAGATGGG
 695
GCCAGCCAACTCTTGAAATG
1947
AGTTCAGGATGGCTTG
3199

















A






2FH21F_12_068
21
36344961
F
T
112
12
8797830
F
C
112
CTGATCTAAGCCATCTTAT
 696
GACAATGACACGTACATCCC
1948
TCTTTAACATACTTCT
3200

















GGAACA






2FH21F_12_071
21
36345046
R
A
100
12
8797915
R
G
100
GGGATGTACGTGTCATTGTC
 697
GCTTTGCATTCTCCCATCTG
1949
TGCATTCTCCCATCTG
3201

















TTGAACAA






2FH21F_12_072
21
36345177
F
G
102
12
8817273
F
A
102
TTTGCATTGGCCTCACAGAC
 698
ATATCCTGGGGATGGATGTG
1950
ATATCCTGGGGATGGA
3202

















TGTGTGTGGC






2FH21F_12_073
21
36345212
R
T
108
12
8817308
R
C
108
ATATCCTGGGGATGGATGTG
 699
CCTACATTTGCATTGGCCTC
1951
ATTTGCATTGGCCTCA
3203

















CAGAC






2FH21F_12_074
21
36345252
R
C
87
12
8817348
R
G
 87
CTGTGAGGCCAATGCAAATG
 700
ACCAGCTACATCTAGATTAC
1952
ACATCTAGATTACAAG
3204

















CCTTAT






2FH21F_12_075
21
36345286
F
G
120
12
8817382
F
T
120
CAGAGGGTAGAAGGGAGGC
 701
GAGGCCAATGCAAATGTAGG
1953
CTAGATGTAGCTGGTA
3205

















TCA






2FH21F_12_076
21
36345299
R
T
105
12
8817395
R
C
105
TGTAATCTAGATGTAGCTGG
 702
GAGAGCAGGGACATACGC
1954
CCAGAGGGTAGAAGGG
3206

















AGGC






2FH21F_12_077
21
36345331
R
A
98
12
8817427
R
G
 98
GCCTCCCTTCTACCCTCTG
 703
ACTAGTCTCACTGGCAGTGG
1955
GAGAGCAGGGACATAC
3207

















GC






2FH21F_12_078
21
36345350
F
T
98
12
8817446
F
C
 98
ACTAGTCTCACTGGCAGTGG
 704
GCCTCCCTTCTACCCTCTG
1956
CGTATGTCCCTGCTCT
3208

















C






2FH21F_12_079
21
36345382
F
T
108
12
8817478
F
C
108
AACAGAGCTGGAACTTGCAC
 705
GTCCACTGCCAGTGAGACTA
1957
CAGTGAGACTAGTGAG
3209

















C






2FH21F_12_080
21
36345422
R
A
107
12
8817518
R
G
107
CTGTCAACAGAGCTGGAAC
 706
TGCCAGTGAGACTAGTGAGC
1958
ACTGCTGTTGACAACA
3210

















T






2FH21F_12_081
21
36345599
F
T
115
12
8817695
F
C
111
TGAACAGCATTGCAAGTTGG
 707
GGACTGACTCCACTGGTAAT
1959
CAAAACCCTTGTAAAA
3211

















CTTTCTTTCTT






2FH21F_12_082
21
36345703
F
C
119
12
8817795
F
T
123
TTCTATACCCCACCTATTCT
 708
ATTAGTTGGAGAGAGTGGGA
1960
GGAGAGAGTGGGAGAT
3212

















AGA






2FH21F_12_083
21
36345712
R
C
115
12
8817804
R
T
119
TTGGAGAGAGTGGGAGATAG
 709
TTTCTATACCCCACCTATTC
1961
TGAAAGTAACATCTTA
3213

















CTAGC






2FH21F_12_084
21
36345749
F
G
115
12
8817841
F
A
119
TTTCTATACCCCACCTATTC
 710
TTGGAGAGAGTGGGAGATAG
1962
TACTTTCATTTACAAA
3214

















TCCTACA






2FH21F_12_086
21
36345790
R
C
106
12
8817888
R
T
108
GGGTATAGAAAAATGTCAGG
 711
AAGTATTTGTTCCTCATGG
1963
TAGCAATTTAAAAGGG
3215

















TAACT






2FH21F_12_088
21
36345832
R
A
111
12
8817930
R
G
113
GTTACCCTTTTAAATTGCT
 712
CAAAAACAAAAGCAAGGGAC
1964
AAAAAAGTATTTGTTC
3216

















CTCATGG






2FH21F_12_094
21
36589553
R
A
84
12
119386208
R
G
 84
AGGGCATATTCCATGTCTTC
 713
ATGTGCAGAAGGATGGAGTG
1965
GAAGGATGGAGTGGGG
3217

















ATGGT






2FH21F_12_095
21
36589583
F
C
97
12
119386238
F
A
 97
TGGCAGGACCTGAAGGATCA
 714
ATCCCCACTCCATCCTTCTG
1966
CCATCCTTCTGCACAT
3218

















C






2FH21F_12_098
21
36589734
R
C
114
12
119391656
R
T
114
GGGTCCTCGAAGCGCACG
 715
AGGACCTGTTCTACAAGTA
1967
GCGAGATCGAGCTCAA
3219

















GA






2FH21F_12_103
21
40338511
F
T
81
12
43603073
R
C
 81
TTTAATTGCAGTTGCAAAC
 716
CTGTGCTAGAGAATGACTTG
1968
ATGACTTGAGAGAGGT
3220

















ACTT






2FH21F_12_104
21
40770445
R
A
99
12
56310838
F
C
 99
AGGGACTCTAGGAATTTCAG
 717
CCAATGGTTAGTCAGCAAAG
1969
CCCCAAAACTCCCCAG
3221

















TTA






2FH21F_12_105
21
40770469
F
G
99
12
56310814
R
A
 99
CCAATGGTTAGTCAGCAAAG
 718
AGGGACTCTAGGAATTTCAG
1970
CTGGGGAGTTTTGGGG
3222

















GAAA






2FH21F_12_106
21
40770473
R
T
103
12
56310810
F
G
103
AGGGACTCTAGGAATTTCAG
 719
CTAACCAATGGTTAGTCAGC
1971
ATGGTTAGTCAGCAAA
3223

















GAATA






2FH21F_12_107
21
40770509
F
G
120
12
56310774
R
A
120
CACTGTATAACATAGCCTAC
 720
CTGACTAACCATTGGTTAGG
1972
AACCATTGGTTAGGTG
3224

















GTGG






2FH21F_12_112
21
43408873
F
T
103
12
6472542
F
C
104
CTTATTTGGTGTGCTGTTG
 721
AGTCCCACAGGCGCCTAC
1973
ACAGGCGCCTACCTGC
3225

















CC






2FH21F_12_113
21
43408884
R
C
103
12
6472553
R
T
104
AGTCCCACAGGCGCCTACCT
 722
CTTATTTGGTGTGCTGTTG
1974
AGACTAGAGAAATGGC
3226

















AGGGA






2FH21F_12_114
21
43408906
F
G
103
12
6472575
F
C
104
CTTATTTGGTGTGCTGTTG
 723
AGTCCCACAGGCGCCTACCT
1975
CTGCCATTTCTCTAGT
3227

















CT






2FH21F_13_005
21
9991870
F
A
85
13
18965568
F
T
 89
GAGGCACCTGCGAAAGAAAG
 724
ATGCACACTTATGCTGACGG
1976
ATGCTGACGGGTGACT
3228

















TTA






2FH21F_13_019
21
14093183
F
G
105
13
18171241
F
T
105
GGTCTAAATGTCAGTGTAGC
 725
CTCTAACATAAACCCTGCTG
1977
AAACCCTGCTGCTTCC
3229

















A






2FH21F_13_020
21
14093198
R
T
104
13
18171256
R
C
104
ACATAAACCCTGCTGCTTCC
 726
CAGTTACCTTCTAGTAGGTC
1978
AGTGTAGCATAACAAG
3230

















GGG






2FH21F_13_022
21
14093293
R
A
116
13
18171351
R
G
116
GACCTACTAGAAGGTAACTG
 727
GTAATTGATGTTGGGTATGC
1979
ATGTTGGGTATGCAAT
3231

















GTACCTTTT






2FH21F_13_023
21
14093337
R
C
112
13
18171395
R
T
112
GGAAACATACGATGCTTTGC
 728
CCCAATAAGAGTCCCTGAAG
1980
AACATCAATTACATTT
3232

















ATCTTCC






2FH21F_13_026
21
14096743
F
T
96
13
18174798
F
C
 96
AGAGGAAGAGCAAAAGCCTG
 729
ATCCTATGTATCTTATTCC
1981
TCTTATTCCAATGAAT
3233

















AACTCT






2FH21F_13_028
21
14099425
R
T
119
13
18177481
R
G
119
CCATTCAATGGAATAGACAA
 730
GCTTTTCTATATTCCCCAGC
1982
TATTCCCCAGCATTTT
3234













G



GTA






2FH21F_13_031*
21
14102405
F
C
109
13
18180495
F
T
109
AGGGTTAATGACCAGGGCTC
 731
TAGTCCCTCCTAGCTCAACC
1983
TAGCTCAACCTCTAAT
3235

















TTGTTCTC






2FH21F_13_032*
21
14102433
F
A
116
13
18180523
F
G
116
GACAACTTCTGAGAATCAGG
 732
TGGAGCACTGCAGAGAAGTC
1984
GGAGCACTGCAGAGAA
3236

















GTCAAAACAC






2FH21F_13_033
21
14102490
F
G
104
13
18180580
F
A
104
ATTCTGAATGACGAGCCCTG
 733
CTGCAAAGGCACAGAGACT
1985
AGGCACAGAGACTGCA
3237

















GAATC






2FH21F_13_035
21
14103122
R
C
80
13
18181212
R
G
 80
TGTTTCCCTTCCTTATCCTT
 734
CCAGTATTTTGAAACAGAGG
1986
AGTATTTTGAAACAGA
3238

















GGTTAATT






2FH21F_13_036
21
14103149
F
A
116
13
18181239
F
G
116
GAGTTCTAGTTTGGCAAACT
 735
CTTATCCTTTGGGTCTTCTC
1987
CCTCTGTTTCAAAATA
3239













T



CTGG






2FH21F_13_039
21
14106660
R
T
120
13
18184718
R
C
120
AGCCTCAGGCCTTTCTATAC
 736
GCCATATCCAAACCACATTG
1988
ATCCAAACCACATTGT
3240

















AGATTCTCAAA






2FH21F_13_040
21
14109261
F
T
89
13
18187316
F
G
 89
GTCTTTGTGTTATCTCTGGC
 737
GATCTTCCAGGCTGAAAGTG
1989
GGAGGAGAACACATGT
3241

















TGT






2FH21F_13_041
21
14109738
R
C
106
13
18187793
R
A
106
TTGTGTGTAGGATTATGAGC
 738
ATGCTGATGAACCGCACTTC
1990
TCTCAGGTCTCAGCAC
3242

















TCA






2FH21F_13_042
21
14109824
R
G
99
13
18187879
R
A
 99
GGATCATTGGCCAACCATAC
 739
ATTTGTGAGGTGGAAGGTGG
1991
GGGCCTTAATGGATAA
3243

















CC






2FH21F_13_043
21
14109914
R
A
101
13
18187969
R
G
101
CTGAATGTGGATTTGGCCAG
 740
TGATCAGAGGGATGAGCTTG
1992
TTGGGATGCATGACAG
3244

















GATG






2FH21F_13_046
21
14111144
R
A
103
13
18189204
R
T
104
TTACCAAGAGATTGGTGGAG
 741
GTCACATCAAAATTTGGAG
1993
AAAATTTGGAGAAGAA
3245

















GTAAAAA






2FH21F_13_047
21
14111203
R
G
88
13
18189263
R
A
 88
ACTCCACCAATCTCTTGGTA
 742
AGCACTCTAAAAGGATGCAC
1994
AAGGATGCACACAGCT
3246

















TA






2FH21F_13_048
21
14111249
R
G
95
13
18189309
R
A
 95
AGCTGTGTGCATCCTTTTAG
 743
TGCATGACCAAGATCAGCAG
1995
CAGCAGCAACTTCAAT
3247

















G






2FH21F_13_049
21
14111290
F
T
92
13
18189350
F
C
 92
GAAGTTGCTGCTGATCTTGG
 744
GAACCCCAACAGCATCCAAG
1996
CATCCAAGTCTGCTGA
3248

















TAAGCAC






2FH21F_13_051
21
14111371
F
A
99
13
18189431
F
G
 99
CTTCTAGGACTTGTCTATTG
 745
GCAATTTTTCCAAGACAGGC
1997
TTCCAAGACAGGCTTT
3249

















CTGTTGCCCA






2FH21F_13_052
21
14111381
R
G
90
13
18189441
R
T
 90
CCAAGACAGGCTTTCTGTTG
 746
CTTCTAGGACTTGTCTATTG
1998
TTGTCTATTGAGAAAC
3250

















AGCAGCTAC






2FH21F_13_054
21
14116424
F
C
85
13
18194428
F
T
 85
ACCATATAGCAGTTGGTAA
 747
TAACTGTAAATTCTGAATAC
1999
GTAAATTCTGAATACT
3251

















TAGTATGG






2FH21F_13_057
21
14118994
F
C
120
13
18196941
F
T
120
GAGATATACTTATGACATGG
 748
CTTGATTGCCCATGTAAATC
2000
TTGATTGCCCATGTAA
3252













C

T

ATCTTGATTG






2FH21F_13_059
21
14119045
F
T
120
13
18196992
F
C
120
GATATGACAAACTGTGTGAC
 749
GCCCATGTAAATCTTGATTG
2001
GCCATGTCATAAGTAT
3253

















ATCTC






2FH21F_13_060
21
14119121
R
T
114
13
18197068
R
C
114
GTCACACAGTTTGTCATATC
 750
GTGGAAAAACTGGAGTAAAC
2002
CTGGAGTAAACCCTGG
3254

















A






2FH21F_13_062
21
14120815
F
C
100
13
18198762
F
T
100
AATACACAAAAGATATGTAG
 751
ACCGGGGACTGTCTTTTTTC
2003
AGTTTGCAAGATTTTG
3255

















TTTTC






2FH21F_13_065
21
14120978
R
C
92
13
18198925
R
G
 92
TCTTGGCGGACGTCCAGAAC
 752
TCCAGCTGCGGAGCTCTAC
2004
AGCTCTACCTCCTTCT
3256

















G






2FH21F_13_066
21
14121175
F
G
87
13
18199129
F
T
 87
GGGTTCATGCTGTAGCTGA
 753
AGAACTGGTACCAGCTAGAA
2005
CTCTCCAACCTCCTCA
3257

















AG






2FH21F_13_068
21
14121570
R
G
100
13
18199524
R
A
100
CAGATGGGTACAAGCAAGTG
 754
AGCTTCGTGTCGTAGATGTG
2006
CGTGTCGTAGATGTGC
3258

















CACCGGGTCC






2FH21F_13_071
21
14141636
F
T
114
13
18214549
F
C
114
CCAAGGCCACGTTCAAGACT
 755
GCTGCATTCTACCTCCCAAA
2007
TCTACCTCCCAAATTA
3259

















AGATAC






2FH21F_13_077*
21
14643157
F
G
120
13
71046877
F
A
120
GCTGTCATGGTTTCTTGTAA
 756
CTTCAGCAATCAAACAAAGC
2008
AATGAAAAGAATCAAT
3260

















TAAAATGGAT






2FH21F_13_079*
21
17407356
F
C
113
13
50189919
R
A
113
CTGAAAGACTTCCATTTCTG
 757
AGCAGAATTGATGCAACTAC
2009
AAAACAGAAAGGGAGA
3261

















CA






2FH21F_13_082*
21
19162925
F
G
87
13
49661632
R
T
 87
GCTTGAATGATAGTTTAAAG
 758
GAGACAACCCAAGTTAGATG
2010
ACAACCCAAGTTAGAT
3262

















GGAGCTA






2FH21F_13_083*
21
19162941
R
T
121
13
49661616
F
C
121
CAACCCAAGTTAGATGGAGC
 759
CTAGCTACTTTAAAAGGAAC
2011
TGCTTGAATGATAGTT
3263

















TAAAGAATT






2FH21F_13_084*
21
19162971
F
C
121
13
49661586
R
T
121
CTAGCTACTTTAAAAGGAAC
 760
CAACCCAAGTTAGATGGAGC
2012
CTTTAAACTATCATTC
3264

















AAGCAAAAC






2FH21F_13_088*
21
19163145
R
A
116
13
49661415
F
G
116
CTTTTCATAGAACAGAGGA
 761
TCCTCTGCTTCATCTAACTC
2013
CTGCTTCATCTAACTC
3265

















GTAGGG






2FH21F_13_099
21
35999919
R
A
90
13
88808282
F
G
 90
GATGAGAGAACCAAAAGC
 762
ATGTTCATTCCTTCAACTG
2014
AATTTTCCTTCTGACT
3266

















GTATT






2FH21F_13_101*
21
36000063
R
C
107
13
88808136
F
T
107
TTTAGGGGATTCTCCTTC
 763
CTGATGATGGGAAAGAACA
2015
AAGAACAAAAAGACAA
3267

















CATCC






2FH21F_13_105
21
36000702
F
C
94
13
88807508
R
A
 91
GCTATGAGATTTCAAACCC
 764
TTGATCCCTTTGCCAAGTTC
2016
TTGCCAAGTTCTTTCA
3268

















ATTAATGTTA






2FH21F_13_107
21
36001079
R
G
100
13
88807132
F
T
100
TGACCCATTCCCAAAATGAA
 765
AATGGTGGGACACAGAAGAG
2017
GGGACATGCTTCTGGT
3269

















TAGTGGA






2FH21F_13_108
21
36001146
F
G
121
13
88807065
R
A
121
ACTGGGAGAAATTGGTAGTG
 766
CTTCTGTGTCCCACCATTAG
2018
ATTAGAAAATCAAAAG
3270

















CTGACT






2FH21F_13_110
21
36001377
F
T
116
13
88806834
R
G
116
CAGTACTTGACCATTGAAGC
 767
GAGTCACATTCCAATTCAGC
2019
CCACCTTGCATTATTC
3271

















TAA






2FH21F_13_111
21
36001406
R
T
119
13
88806805
F
G
119
GAGTCACATTCCAATTCAGC
 768
GTTCAGTACTTGACCATTG
2020
TCAGTACTTGACCATT
3272

















GAAGCTTTTG






2FH21F_13_112
21
36001435
F
A
98
13
88806776
R
G
 98
AGAACTTGTTATAGCAGG
 769
CAAAAGCTTCAATGGTCAAG
2021
AAGCTTCAATGGTCAA
3273

















GTACTGAAC






2FH21F_14_006
21
13879750
R
C
104
14
19381806
F
C
104
GAAAAAGACCATGTACTACC
 770
ATATAAAAGGAACTTGTGC
2022
AAGGAACTTGTGCCAT
3274

















TTT






2FH21F_14_008
21
13879926
F
T
101
14
19381630
R
G
 99
AATTATATATGACTTAAAGA
 771
CTCCTTTTCATCACCAGAA
2023
TTTTCATCACCAGAAA
3275













C



GAATG






2FH21F_14_010
21
13880089
F
A
87
14
19381469
R
A
 91
GCTAGGTGCATAACTGGTAG
 772
GCAAACCACAACTGCTTCTG
2024
AACTGCTTCTGAAGAC
3276

















CCT






2FH21F_14_011
21
13880128
R
G
102
14
19381426
F
T
102
TGGTGATTTCAGTAGGCTTG
 773
TCTAGCTTTTAACCTACCAG
2025
TAACCTACCAGTTATG
3277

















CACCTAGC






2FH21F_14_012
21
13880152
F
A
92
14
19381402
R
C
 92
CTATGGTGATTTCAGTAGGC
 774
CTACCAGTTATGCACCTAGC
2026
AAAAACACCATTTCCT
3278

















CCGAG






2FH21F_14_013
21
13880155
R
C
108
14
19381399
F
C
108
CTACCAGTTATGCACCTAGC
 775
GCTTACTAAAGAACTATGGT
2027
GTGATTTCAGTAGGCT
3279















G

TGT






2FH21F_14_015
21
14921613
R
T
113
14
41185950
F
G
113
GTCTTCCAAAATTTTTCACC
 776
GGCAAGGATGGAGAGTATTC
2028
TTTGTTTTCCAGGAGT
3280

















CT






2FH21F_14_016
21
14921832
F
G
99
14
41185732
R
T
 99
GTGCATGACAATGCTCACTG
 777
AAATTGTCTGGAGGCCCAT
2029
GAGGCCCATGGCCAAT
3281

















ATCAACAG






2FH21F_14_017
21
14921834
R
T
99
14
41185730
F
G
 99
AAATTGTCTGGAGGCCCAT
 778
GTGCATGACAATGCTCACTG
2030
GGATCTCTTTCCTCAC
3282

















AAA






2FH21F_14_018
21
14921856
F
C
102
14
41185708
R
A
102
GCATTCATGCTGTGCATGAC
 779
CCCATGGCCAATATCAACAG
2031
TGAGGAAAGAGATCCC
3283

















C






2FH21F_14_026
21
14922069
R
G
119
14
41185495
F
T
119
AGACAAGGGAGAAGTCTCAG
 780
GCTAAAGGAAGCATTTTGGG
2032
GGAAGCATTTTGGGAG
3284

















TTAACTAC






2FH21F_14_027
21
14922093
F
T
119
14
41185471
R
C
119
AGACAAGGGAGAAGTCTCAG
 781
GCTAAAGGAAGCATTTTGGG
2033
AGGATAAGTGATTCTA
3285

















GGAAATG






2FH21F_14_028
21
14922116
R
T
114
14
41185448
F
G
114
GCATTTTGGGAGTTAACTAC
 782
TCCCCAGACAAGGGAGAAGT
2034
GACAAGGGAGAAGTCT
3286

















CAGG






2FH21F_14_033
21
17946653
R
C
99
14
103092721
R
A
 99
TATTTCAAGAATAACTAAGG
 783
ATTGGAACAGTATGTCTTC
2035
GGAACAGTATGTCTTC
3287

















AATAAT






2FH21F_14_035
21
17947627
R
T
111
14
103093055
R
A
109
CTTCTCAAACTAAATTATAT
 784
AATAAATGTAATGAATATGT
2036
AATGTAATGAATATGT
3288













C

C

CTACAAAG






2FH21F_14_037
21
25973901
F
T
111
14
49818843
F
G
111
ATTGGTGGTTAGAATGAAGG
 785
TTGGTGTCCTACTTTCCTAG
2037
CTCTTAGCTTCCACCT
3289

















TCCT






2FH21F_14_039
21
28867125
F
C
99
14
51943094
R
A
 99
GGTGCAACATAAAGTCAAA
 786
GACTCATGGCCCAAGTTTTG
2038
CAAGTTTTGGACAGAA
3290

















ATATG






2FH21F_14_040
21
28867172
F
T
119
14
51943047
R
G
119
CCACATTCATATTGAGTGGA
 787
CAAGTTTTGGACAGAAATAT
2039
ATTTTGACTTTATGTT
3291















G

GCACC






2FH21F_15_002
21
9885955
F
T
106
15
18428903
R
C
106
CCAGAGGTATTTTCAGAGGG
 788
CTGGACTTTTAGAGGCATGG
2040
TTAGAGGCATGGATAG
3292

















GAATA






2FH21F_15_004
21
9886039
R
G
117
15
18428819
F
T
117
GCCTCTAAAAGTCCAGCAAG
 789
GGCCTCATACATGACATCTC
2041
ACATGACATCTCTCAT
3293

















GG






2FH21F_15_005
21
9886081
F
T
113
15
18428777
R
G
113
TGCATTTGCTGCAAAAAGGG
 790
TGTATGAGGCCCTGTAGATG
2042
GAGGCCCTGTAGATGG
3294

















ATTAC






2FH21F_15_009
21
9886376
R
A
108
15
18428482
F
C
108
TCTGCTTGCTTGCCAGTGTC
 791
TTAGTGGGAGGAGGTTTGTG
2043
TCCAGAGTGCACCCCA
3295

















A






2FH21F_15_010
21
9886443
F
A
99
15
18428415
R
A
 99
TATCCCTGCAGGCGCATATC
 792
AGATGCACACAAACCTCCTC
2044
CCCACTAATTATCCAC
3296

















TACTAA






2FH21F_15_011
21
9886468
R
G
105
15
18428390
F
T
105
AGATGCACACAAACCTCCTC
 793
TTTATCCCTGCAGGCGCATA
2045
CCCTGCAGGCGCATAT
3297

















CCATTT






2FH21F_15_015
21
9886738
R
G
118
15
18428120
F
T
118
ATGGAAACATCCTTCTGCGG
 794
GATTTGTATGAACAAATGCC
2046
TTTACTCATAATTTAT
3298















C

TTCCTCTCC






2FH21F_15_016
21
9886765
F
T
118
15
18428093
R
T
118
GATTTGTATGAACAAATGCC
 795
ATGGAAACATCCTTCTGCGG
2047
GGAGAGGAAATAAATT
3299













C



ATGAGTAAAA






2FH21F_15_017
21
9886774
R
T
118
15
18428084
F
G
118
ATGGAAACATCCTTCTGCGG
 796
GATTTGTATGAACAAATGCC
2048
ACAAATGCCCATACTT
3300















C

TATTC






2FH21F_15_018
21
9886872
F
T
118
15
18427986
R
G
119
AAGGGGCTGGGAAATATC
 797
AGCCACCATTAGCTGAGAAC
2049
TGAGAACAAACATTTC
3301

















ACC






2FH21F_15_019
21
9886898
F
C
118
15
18427960
R
C
119
AAGGGGCTGGGAAATATC
 798
AGCCACCATTAGCTGAGAAC
2050
CATGGGGAGGTCAAGC
3302

















AG






2FH21F_15_021
21
9886939
F
T
113
15
18427918
R
G
114
ACACAGAGGCCCAGGGATGA
 799
TGCATGGGGAGGTCAAGCAG
2051
ATATTTCCCAGCCCCT
3303

















T






2FH21F_15_024
21
9887096
F
C
108
15
18427761
R
A
108
ATACGGGATGGTCAACTTGG
 800
CTCATCTGCAACATAGCACA
2052
CATCTGCAACATAGCA
3304

















CATGACAG






2FH21F_15_025
21
9887136
R
A
111
15
18427721
F
C
111
ACTGTCAGCTATACGGGATG
 801
TGCAACATAGCACATGACAG
2053
AATTGGCAAAGGAGAC
3305

















C






2FH21F_15_026
21
9887170
F
A
99
15
18427687
R
G
 99
CAGATGATGTTCCGACACAG
 802
AGTTGACCATCCCGTATAGC
2054
CCCGTATAGCTGACAG
3306

















TGAC






2FH21F_15_027
21
9887176
R
G
99
15
18427681
F
T
 99
AGTTGACCATCCCGTATAGC
 803
CAGATGATGTTCCGACACAG
2055
TTGTGGAGGGGACGTT
3307

















GACC






2FH21F_15_030
21
9887369
R
C
120
15
18427488
F
T
120
GGACAGAGAGAGCTGAATAC
 804
TAGAGTGGTCTGCGCAGATA
2056
GCAGATAAGAAATTAG
3308

















AAAGTGA






2FH21F_15_031
21
9887415
F
C
86
15
18427442
R
A
 87
CTTGATATTCAGAATGCTGG
 805
ATTTCTTATCTGCGCAGACC
2057
CTGCGCAGACCACTCT
3309

















ACAGATTTTT






2FH21F_15_032
21
9887447
F
G
98
15
18427409
R
T
 98
ATGATGAGAAGCTGGTGCTG
 806
CTGTTGTGACCAGCATTCTG
2058
TGTGACCAGCATTCTG
3310

















AATATCAAGT






2FH21F_15_033
21
9887470
R
G
102
15
18427386
F
T
102
CTGTTGTGACCAGCATTCTG
 807
TGAAATGATGAGAAGCTGG
2059
GATGAGAAGCTGGTGC
3311

















TGAA






2FH21F_15_034
21
9887497
R
C
80
15
18427359
F
C
 80
TTCAGCACCAGCTTCTCAT
 808
ACACATTGTGTAAGTTAGAG
2060
AGTTAGAGTGGTCAGT
3312

















GAGGA






2FH21F_15_038
21
9887692
R
T
115
15
18427165
F
T
114
TGTGCTTACTTTAATCAGGC
 809
CAGCTGTTGGCTTACTTACC
2061
TTGGCTTACTTACCTT
3313

















AAATATTAC






2FH21F_15_040
21
9887823
F
G
108
15
18427034
R
G
108
GGTATCTGTGCTGAGTCTTC
 810
ATTAATACTGCTACGCAAG
2062
ACTGCTACGCAAGTTA
3314

















TAGT






2FH21F_15_041
21
9887904
R
A
92
15
18426953
F
G
 92
ATCACTATCAGCTCAGGCAC
 811
GAAGACTCAGCACAGATACC
2063
GATACCTTCCACCAGA
3315

















CTAACCTAG






2FH21F_15_042
21
9888098
F
T
103
15
18426760
R
C
102
AACTTGGACAGTGGCGTTAG
 812
TCCTATCTTCACATGGGATG
2064
ACATGGGATGTTTTTA
3316

















GGTTTTGT






2FH21F_15_043
21
9888188
R
G
88
15
18426671
F
T
 88
TTCCCAGTATGAGAGACTGC
 813
CTCCTATCCCTAACAACAGC
2065
ACATTCCTTTGTGTCA
3317

















GA






2FH21F_15_044
21
9888229
F
T
108
15
18426630
R
G
108
GAATGTAGCTGTTGTTAGGG
 814
CTGGGCAACTGTGAAAAGAC
2066
TCCCTGCTCATGTTCT
3318

















TACGATCAC






2FH21F_15_045
21
9888343
F
C
103
15
18426516
R
A
103
CAGTGGCATAAAACATCTGG
 815
AGAGACCCAGGAGAACAATG
2067
CAGTCTCTCCAGTCCC
3319

















ATA






2FH21F_15_046
21
9888409
R
C
110
15
18426450
F
T
110
CCAGATGTTTTATGCCACTG
 816
GAAGGATACTGGAAAATAG
2068
GAAAATAGTATTCTGG
3320

















TCAAAAC






2FH21F_15_047
21
9888447
F
C
117
15
18426412
R
A
117
TTTTCTAGGCCCAGGTCTTG
 817
GAGGACAATACTATTTTCCA
2069
CTATTTTCCAGTATCC
3321















G

TTCAAA






2FH21F_15_048
21
9888478
F
G
83
15
18426381
R
T
 83
TTGTTTTCTAGGCCCAGGTC
 818
CAAATCAGAGAGCACCACAG
2070
GAGCACCACAGTGCCC
3322

















C






2FH21F_15_050
21
9888657
F
C
99
15
18426202
R
C
 99
GCTGGTCTAACAGCATAAGG
 819
ATAAACTGGTCTGCAGTGGG
2071
GCAGTGGGTACAGAAT
3323

















TA






2FH21F_15_054
21
9889047
F
T
100
15
18425811
R
G
100
GAGGCTCAAGGTTTGCTTTC
 820
TAGATGGTGGAAGGGAAGAC
2072
TAACATCTAGGGAAAT
3324

















TTCAGGG






2FH21F_15_057
21
9889172
R
A
91
15
18425686
F
C
 91
CCTGGTCATGGAATAGTCTC
 821
GCATCATCCCACTTACACAC
2073
TCCCACTTACACACAA
3325

















TGTTCTA






2FH21F_15_061
21
9890285
F
T
119
15
18424581
R
G
120
AGAGTCACAGGTAATGACCC
 822
GCTAGTGTGACCAGGAATAT
2074
ATTTGAGTGTGTGTGT
3326

















GCTCTTTG






2FH21F_15_068
21
9891452
F
T
95
15
18423412
R
G
 95
TGAAACATGAGACTCAGGGC
 823
TGTCCCAGAAATGTCATTAC
2075
GTATGTGAGCGCCAAT
3327

















AG






2FH21F_15_069
21
9892865
F
G
108
15
18422004
R
G
108
AAGGTTTCAGGATCTGGGAG
 824
TCAAAGTCTACCATCAGAGC
2076
CAGAGCTTTGGTCCTC
3328

















TTG






2FH21F_15_070
21
9892920
F
G
91
15
18421949
R
G
 91
AGGTGAGAGACTGCAGGTG
 825
ACTTGGTCTCCTGTGATTCC
2077
TCCCAGATCCTGAAAC
3329

















CTT






2FH21F_15_074
21
9893038
F
T
93
15
18421831
R
G
 93
CCACATCCCCTTTCAATTTC
 826
TCCTATGGCCCATGCAAATG
2078
ATGATTTCCCCAACAC
3330

















AG






2FH21F_15_075
21
9893077
F
G
105
15
18421792
R
T
113
GGACTCCTTTTGTACCACTG
 827
CCTGTATGAAATTGAAAGGG
2079
AAATTGAAAGGGGATG
3331

















TGGG






2FH21F_15_076
21
9893140
R
A
90
15
18421721
F
G
 90
TCACAGTGGTACAAAAGGAG
 828
CTTGAGTGACAACATCACCC
2080
CACCCTAGTTCACAAC
3332

















ACCTTAGCA






2FH21F_15_077
21
9893181
R
G
111
15
18421680
F
T
111
GGTACAAAAGGAGTCCTCAG
 829
GATTTCTCTTCATGGAGCCC
2081
TCTTCATGGAGCCCCC
3333

















ATTGTAG






2FH21F_15_079
21
9893313
R
G
102
15
18421548
F
G
102
ACCAGGAGCGGTGACTCAAC
 830
TTCTCCTCTTTGCTGAGCAC
2082
CTGAGCACAGAATTCT
3334

















CACCCTCT






2FH21F_15_082
21
9893385
F
A
99
15
18421476
R
G
 99
CCATTGTGAACTTTCCTGGC
 831
CAGCAAAGAGGAGAACTCAC
2083
CTCAATTTTCCCTCAA
3335

















GAA






2FH21F_15_083
21
9893447
R
G
88
15
18421414
F
T
 88
ACTGGGGAAAAACCTTGTGC
 832
TGGAGAATCTCCAGCTCCAG
2084
GGTGGGACCCCAAAAG
3336

















A






2FH21F_15_084
21
9893475
F
G
118
15
18421386
R
A
118
TGGAGAATCTCCAGCTCCAG
 833
GGTAGGAACTGGGGAAAAAC
2085
GTAGGAACTGGGGAAA
3337

















AACCTTGTGC






2FH21F_15_085
21
9893847
F
A
110
15
18421032
R
C
110
AGCCAAGGAACAAATTCCCC
 834
TGCAAAGCTGTCAGCAAAGG
2086
TCTTCTTGAGAGAAAG
3338

















AATAATG






2FH21F_15_086
21
9893944
R
T
109
15
18420935
F
C
109
TTTGCTGACAGCTTTGCAGG
 835
TAAGAGGGAACATCCTGGTG
2087
GAGTCACACAGAGAGC
3339

















TCACTTGTCC






2FH21F_15_091
21
9894548
R
G
89
15
18420331
F
T
 89
TTCATGTTTCCTCCAGGGAC
 836
GGTATTTTAGAGATGTAGAG
2088
GATGTAGAGCTAGACA
3340















C

CAGCA






2FH21F_15_092
21
9894701
F
T
103
15
18420178
R
G
103
TAAGGTTCCTGTCCCGAATG
 837
AGTGGTCACTAGGATCACAG
2089
CTGAGGGTAACCTGGT
3341

















GAATCTTCT






2FH21F_15_093
21
9894729
F
C
102
15
18420150
R
A
102
GATTCCTGAGACTGTTCTCC
 838
GGGTAACCTGGTGAATCTTC
2090
AGTCACATTCGGGACA
3342

















GGAACCTTAG






2FH21F_15_097
21
9903575
F
T
119
15
18413155
R
G
119
CTTCCCTTTAGCATTATAAC
 839
TGTCTGCTGTGGAAAGAAG
2091
CTGCTGTGGAAAGAAG
3343

















ACATAG






2FH21F_15_101
21
9903915
R
T
110
15
18412815
F
C
110
TAGTGAGGGCTCATCACTAC
 840
AAGAGATGGTCTCCACTTGC
2092
GGTCTCCACTTGCTGT
3344

















AAGCTCACACT






2FH21F_15_103
21
9905185
F
A
118
15
18411551
R
C
118
CACAGCTTGGTGCAAATGAG
 841
TACAAGTGATTCAACACAG
2093
ATTCAACACAGAGCCT
3345

















G






2FH21F_15_106
21
9906091
F
T
91
15
18410645
R
G
 91
CTGTGAGAAGATTCACGGAC
 842
CTGCCTGTATTTGACCACAC
2094
TGTATTTGACCACACT
3346

















TTATCTT






2FH21F_15_107
21
9906394
F
C
88
15
18410342
R
A
 88
GGGAGATTTTGCGACTTTTC
 843
AACACTGGAAAGCTCACACC
2095
CTCACACCCAGACTCA
3347

















G






2FH21F_15_119
21
13976012
F
A
110
15
19264667
R
C
110
TCTCCCCTCCCGGGGCTAA
 844
TAGGGCGCTGGAGAGCGGG
2096
GCTGGAGAGCGGGGAT
3348

















CCTCTGGT






2FH21F_15_126
21
14329606
F
C
98
15
44318884
F
T
 98
TACCAAATATTCAAGTGAG
 845
GTGGCATTTTATCTTGCAAA
2097
ATCTTGCAAACATTTG
3349















C

CCACA






2FH21F_15_128
21
14329861
R
G
113
15
44319637
R
A
118
GGGCCAGAAGTTCTCGAGC
 846
AGGAGCCTTCAGATTCTGTG
2098
TGTGGATTCTCTGTAC
3350

















C






2FH21F_15_130
21
14330105
R
C
113
15
44319887
R
T
113
CACATGCTGTCAGCTAATT
 847
TTCTCCTGGAATAAGACCCC
2099
AAAGGCTGAGGAATCT
3351

















GT






2FH21F_15_134
21
14330189
R
T
107
15
44319971
R
C
107
GGGTCTTATTCCAGGAGAA
 848
GCTCTGCACTGAAGCTACTG
2100
GTTATTGTGGCATAAA
3352

















TTAAATAAG






2FH21F_15_135
21
14330252
F
A
110
15
44320034
F
G
110
TTTACTTGCAGGCAGTTTTC
 849
ACAGTAGCTTCAGTGCAGAG
2101
CTGCAGCTTCAAGCTT
3353

















TAC






2FH21F_15_137
21
14330414
F
T
94
15
44320198
F
C
 95
TCTCCAGTATCTCAGTTCCC
 850
AAGTATCATTCCCCCTCACC
2102
CCCCTCACCTTGCTAT
3354

















T






2FH21F_15_139
21
14330464
F
C
83
15
44320249
F
G
 84
TTCTTCTGTCACACTGTAA
 851
AGTGGGAACTGAGATACTGG
2103
CTGAGATACTGGAGAA
3355

















AGT






2FH21F_15_142
21
14330613
F
G
102
15
44320399
F
T
102
TGTGACCACCTGCCAGTC
 852
TGGCATGCTGAGAAACTCAC
2104
GTTTGTGGTCTTTTTG
3356

















TGAATAA






2FH21F_15_144
21
14330885
R
T
106
15
44320643
R
A
101
CAAGTACTGTGTGCAGGATG
 853
TTCTTCCCAGCATAGGGTTG
2105
GCATAGGGTTGGAAAA
3357

















ATTGCTTA






2FH21F_15_146
21
14331549
R
C
84
15
44321301
R
T
 84
AATTATTGAATCTGGTTGG
 854
GTCTGAAGTATTGCAAAGC
2106
AGCAGTATGAAAAGAC
3358

















ATTAT






2FH21F_15_147
21
14331587
R
A
80
15
44321339
R
G
 80
CATTAATGTTCAGATTCCAT
 855
GTCTTTTCATACTGCTTTGC
2107
TACTGCTTTGCAATAC
3359

















TTCAGAC






2FH21F_15_148
21
14331644
R
C
105
15
44321396
R
A
105
ACTTGTATGGAATCTGAAC
 856
AGCTTGTAATTCAAGAGTG
2108
GTAATTCAAGAGTGTA
3360

















CTATCTTA






2FH21F_15_149
21
14332091
F
G
100
15
44321855
F
A
 96
CACTCAATATGACCTCCTTC
 857
CACCTTAATTTGCAAAAGTG
2109
AAAAGTGGAGCTTGGG
3361















G

T






2FH21F_15_150
21
14332119
R
G
96
15
44321879
R
C
 92
TTGCAAAAGTGGAGCTTGGG
 858
TTTTACACTCAATATGACC
2110
CTCAATATGACCTCCT
3362

















TCT






2FH21F_15_151
21
14332566
R
G
119
15
44322320
R
T
124
AGAGCTCCTGGTGGGACAG
 859
CACTTTGCTGTTGAAATTC
2111
CAAGCAGTGGCTCTTC
3363

















T






2FH21F_15_152
21
14332589
F
A
114
15
44322343
F
G
119
CACTTTGCTGTTGAAATTC
 860
TCCTGGTGGGACAGGGACT
2112
AGAGCCACTGCTTGGA
3364

















GAG






2FH21F_15_153
21
14332612
R
G
109
15
44322371
R
C
114
AAGAGCCACTGCTTGGAGAG
 861
TTAAATGTGTGGATATGTC
2113
TTTGCTGTTGAAATTC
3365

















ATTTA






2FH21F_15_156
21
14333098
R
G
102
15
44322880
R
A
102
GAATTGGTGGAGGACCCTT
 862
TGATGTAGGGCATCTCTAGG
2114
CCCCTAATCCAGACTC
3366

















ATGGGTCTC






2FH21F_15_157
21
14333124
F
A
106
15
44322906
F
G
106
TTGTGATGATGGTAACAAGG
 863
AATCCAGACTCATGGGTCTC
2115
AAGGGTCCTCCACCAA
3367

















TTC






2FH21F_15_160
21
14333462
R
G
101
15
44323242
R
A
101
CAGTATGCAATTATGACAC
 864
CTTGTTAAAGAAGCACTGTC
2116
GCACTGTCCAACATTA
3368

















AATATAC






2FH21F_15_165
21
14333667
R
A
95
15
44323445
R
C
 95
GCTTGACTGGTCTGTCTTAC
 865
ATTTCAAAGCTAGTAACAG
2117
AAAGCTAGTAACAGAG
3369

















AGATT






2FH21F_15_170
21
14334200
R
C
109
15
44323975
R
G
106
CAAGTAATTTCAAACTTGAC
 866
TGCTGCTTGCAGTGCCTA
2118
GCTGCTTGCAGTGCCT
3370

















ACCAAGT






2FH21F_15_175
21
14334530
F
G
105
15
44324302
F
A
105
CTCTAGAGGAGTCATAAGCC
 867
CCAGCAATGACATGATTACC
2119
CCCCAAAATGTTCTGA
3371

















AACCCTGC






2FH21F_15_178
21
14334783
R
T
89
15
44324556
R
A
 89
TGGAAGTCATTCTTGAAGTG
 868
CATTAACATAAAGAGAGGC
2120
TTAACATAAAGAGAGG
3372

















CTGAAACC






2FH21F_15_180
21
14335783
R
G
111
15
44325553
R
A
111
TCATAGCACTGCCCTACTAC
 869
GAATTCTTATATGAGAGGAC
2121
AGAGGACCTCATGGAC
3373

















A






2FH21F_15_182
21
14335875
R
G
108
15
44325644
R
A
107
GTAGTAGGGCAGTGCTATGA
 870
GGACAATTAATCTATTCCCC
2122
TCCCCATCTCATTTAA
3374

















ATAAC






2FH21F_15_191
21
22732455
R
T
110
15
50126130
F
C
111
TCAAACACTTTCACAATGT
 871
TCCTTACTGATCCCCAGAG
2123
CCTTACTGATCCCCAG
3375

















AGTGTCAAA






2FH21F_15_193
21
22909478
F
T
110
15
57893049
R
G
110
GAGCTTGATCCTGATTCTTC
 872
TCAAGTAGTGTCTCCCTT
2124
CAAGTAGTGTCTCCCT
3376

















TTCATTC






2FH21F_15_195
21
22909551
F
T
82
15
57892976
R
G
 82
CCCTACGACCTGTCAGAAA
 873
CCTGAAGAATCAGGATCAAG
2125
ATCAGGATCAAGCTCT
3377

















CAAAAT






2FH21F_15_196
21
22909563
R
C
121
15
57892964
F
C
121
CAGGATCAAGCTCTCAAAAT
 874
GATAGGATGAGCAACCAAAA
2126
CCCTACGACCTGTCAG
3378

















AAA






2FH21F_15_198
21
22909608
R
G
94
15
57892919
F
A
 94
TTTCTGACAGGTCGTAGGG
 875
GGACATCATGATAGGATGAG
2127
TAGGATGAGCAACCAA
3379

















AA






2FH21F_15_200
21
22909683
F
G
114
15
57892844
R
A
114
GCTCATCCTATCATGATGTC
 876
AGCTATCTGGTAGATAGTGG
2128
CTATCTGGTAGATAGT
3380

















GGAATTTTGC






2FH21F_15_209
21
31354944
F
G
94
15
23136353
F
T
 94
ACAGACAGAGCACCTGTGG
 877
CTTTCTGTGTCTGGGCCATT
2129
GTCTGGGCCATTTTTG
3381

















GCTA






2FH21F_15_210
21
31354964
R
G
90
15
23136373
R
A
 90
CTGTGTCTGGGCCATTTTTG
 878
ACAGACAGAGCACCTGTGG
2130
ACAGACAGAGCACCTG
3382

















TGGGAGGAC






2FH21F_15_211
21
31354995
R
C
119
15
23136404
R
T
119
GTCTGGGCCATTTTTGGCTA
 879
CAGAAAAGACTCTTCTTGCA
2131
AAGACTCTTCTTGCAG
3383















G

TTTACA






2FH21F_15_212
21
31355097
F
C
83
15
23150634
F
T
 83
ATTGCTTATATGTGGAAGCC
 880
GAGTCCCTGGTATAGCCAC
2132
GTATAGCCACCGTCAT
3384

















ATTC






2FH21F_15_214
21
31355171
R
G
114
15
23150809
R
A
215
TCTTCTAGTGCTTGGAAATC
 881
CCAATGAATCTCCCTTAAAG
2133
TGAATCTCCCTTAAAG
3385

















TACTTA






2FH21F_15_217
21
31355249
F
G
81
15
23150887
F
T
 85
CTCCGAAAAGCCTTGAACTG
 882
GTCAATCTTTATTCTGACTA
2134
AATCTTTATTCTGACT
3386















C

ACATTCTCAAT






2FH21F_15_218
21
31355355
F
G
118
15
23152205
F
A
119
AAGGGAATGTGGAAGATGAC
 883
TATCACCATTTTCCTTTAG
2135
CATCATTGGGTTACCA
3387

















AAA






2FH21F_15_219
21
31355370
R
C
101
15
23152221
R
T
102
CATCATTGGGTTACCAAAA
 884
GGAGTATGAAGGGAATGTGG
2136
AAGATGACATGATGAT
3388

















CACTTTCCAG






2FH21F_15_220
21
31355525
R
C
100
15
23153010
R
T
100
TTCCTGTAGACAACCATGGG
 885
CTGATTGGCATAGTACTGGG
2137
GGCTATTTACAATAAC
3389

















TGTATACTGG






2FH21F_15_221
21
31356019
R
A
98
15
23156668
R
C
104
GGATTGAAGATTTCCTCCAC
 886
GGAATTTGAAGGAGAACAAG
2138
AGGGAGGTGTTTCCAA
3390















G

A






2FH21F_15_222
21
31356039
F
C
103
15
23156688
F
T
109
TATGTGGAATTTGAAGGAG
 887
GGATTGAAGATTTCCTCCAC
2139
TTTGGAAACACCTCCC
3391

















TCA






2FH21F_15_223
21
31356065
F
G
97
15
23156720
F
C
106
CTATGGAAAATCCTGCAGAC
 888
TTTGGAAACACCTCCCTCA
2140
TCTCCTTCAAATTCCA
3392

















CATA






2FH21F_15_228
21
31356399
F
G
86
15
23167097
F
T
 86
GGTGTTAAAACCCTGGATTG
 889
CAGTGGTTCATTAATAAACT
2141
AACTCTTCAAAAGGGA
3393















C

TAAG






2FH21F_15_231
21
31356477
F
C
119
15
23167175
F
T
119
ATGAAGAGCCCATCCCTGAG
 890
TTTCCAGGGGGTCCACTC
2142
GACCTTTCTTGTTTCT
3394

















TCT






2FH21F_15_234
21
31356543
F
C
120
15
23167241
F
T
119
GCTTCGAAGTGCTTGAAAAT
 891
CTCAGGGATGGGCTCTTCAT
2143
GATGGGCTCTTCATCA
3395













G



TCTTC






2FH21F_15_236
21
31356594
R
T
101
15
23167291
R
C
100
TGATTTGTGTCCACTTCCC
 892
ATGCCATTGTTGCTGCTTCG
2144
CTTCGAAGTGCTTGAA
3396

















AATG






2FH21F_15_237
21
31356757
R
C
86
15
23167454
R
T
 86
CTGGTCTGCATTGTATTTAG
 893
AGCAAGCTACCCCTTGCAG
2145
CTTGCAGCCCAAGGAA
3397

















A






2FH21F_15_238
21
31356790
F
T
104
15
23167487
F
C
104
TCTCCACAGTCCTGAATATC
 894
TTTCCTTGGGCTGCAAGGG
2146
CTGCAAGGGGTAGCTT
3398

















GCTCAT






2FH21F_15_239
21
31356911
R
A
93
15
23167608
R
G
 93
GAACAAATTCAGATAATTAG
 895
GTCACCTAACGTGGAATGTG
2147
CGTGGAATGTGACTTG
3399













G



A






2FH21F_15_241
21
31357019
R
G
112
15
23167716
R
A
112
GAGAGCAATCTGGTGTAGAC
 896
TTAGGCCCTGATGATGTGTC
2148
CTGATGATGTGTCTGT
3400

















GGATA






2FH21F_15_242
21
31357085
F
G
100
15
23167782
F
A
100
CATGTTCTGCTGCTGCTATG
 897
ATTGTTGTCTCCCTGTGAGC
2149
TGTCTCCCTGTGAGCT
3401

















ATCACCT






2FH21F_15_243
21
31357087
R
G
100
15
23167784
R
C
100
ATTGTTGTCTCCCTGTGAGC
 898
CATGTTCTGCTGCTGCTATG
2150
GAAGACTCAGAAGCAT
3402

















CTTCCTCAAG






2FH21F_15_244
21
31357145
F
G
117
15
23167842
F
T
117
ACACACCAAGGAAGAACTG
 899
TTCCATAGCAGCAGCAGAAC
2151
AGCAGAACATGCAGCT
3403

















TTT






2FH21F_15_247
21
31357316
R
T
115
15
23168014
R
G
116
GCACTAGAAAAAACTCTTCC
 900
AACAGAAGAGAAGGTATAT
2152
CAGAAGAGAAGGTATA
3404

















TGAAATT






2FH21F_15_248
21
36589643
F
T
90
15
81157835
F
G
214
GCAGAGGATGCTATTTATGG
 901
TGTGATCCTTCAGGTCCTGC
2153
GGTCCTGCCAGCTGCC
3405

















TGA






2FH21F_16_004
21
15052250
F
T
117
16
56061461
F
G
117
GTATTCAAAAGCCACCCCTG
 902
AAAGGGCCAGGAGCTGAGAC
2154
CAAGGAGCATGCCAAG
3406

















T






2FH21F_16_005
21
15052256
R
T
117
16
56061467
R
C
117
AAAGGGCCAGGAGCTGAGAC
 903
GTATTCAAAAGCCACCCCT
2155
AAGCCACCCCTGCAGT
3407

















A






2FH21F_16_006
21
16577428
F
T
115
16
75226732
R
T
115
CACAATACTTTATCACTCT
 904
GTTTTCTTGCTTTTTGTCAG
2156
GCTTTTTGTCAGTTTC
3408

















AAATA






2FH21F_16_010
21
29192727
R
A
82
16
20653095
R
T
 79
AGCAGACTTGCTCCAAGACA
 905
CGAGTCCTTTTGTCTTGCAC
2157
TTTTGTCTTGCACTAT
3409

















CAAAATA






2FH21F_16_011
21
29192949
F
T
117
16
20653317
F
C
117
TTCCTGCACAAGTGGCTATG
 906
CTAGTCTGGTTTACCAAACA
2158
TTTACCAAACAGAACC
3410

















AC






2FH21F_16_012
21
29192996
R
A
115
16
20653364
R
G
115
GGTTTACCAAACAGAACCAC
 907
TAGGCTTCCTGCACAAGTGG
2159
AGGCTTCCTGCACAAG
3411

















TGGCTATGTT






2FH21F_16_014
21
29193036
R
C
119
16
20653404
R
T
119
CACTTGTGCAGGAAGCCTAA
 908
GAATATTAAGGAGCTGTAA
2160
CCTTAAGTTTTAAAAA
3412

















GTTAGGAA






2FH21F_16_015
21
29193084
R
C
120
16
20653452
R
T
120
CTTTTTAAAACTTAAGGATA
 909
GACACCAACAAAGTCTGCAA
2161
AAGAATATTAAGGAGC
3413













AG



TGTAAA






2FH21F_16_016
21
29196058
F
T
117
16
20655783
F
C
120
AAATAACCAGCAGGTACCAG
 910
AAGTTCAGGTTTGGCTCCTC
2162
GGCTCCTCCCTCATTT
3414

















A






2FH21F_16_018
21
29197551
F
C
100
16
20657780
F
T
100
CTTGAAGAAAGAAGTTGGTG
 911
TTGCTCCACTTTCCACTGAC
2163
TTCCACTGACTGGAAT
3415

















C






2FH21F_16_019
21
29197558
R
G
100
16
20657787
R
C
100
TTGCTCCACTTTCCACTGAC
 912
CTTGAAGAAAGAAGTTGGTG
2164
AGGCATCTACAGAGAT
3416

















GAG






2FH21F_16_021
21
29197604
F
A
114
16
20657833
F
G
114
ATCAGCAGCCCTCTGGAAGT
 913
CTCATCTCTGTAGATGCCT
2165
CACCAACTTCTTTCTT
3417

















CAAG






2FH21F_16_022
21
29197624
R
T
114
16
20657853
R
C
114
CTCATCTCTGTAGATGCCT
 914
ATCAGCAGCCCTCTGGAAGT
2166
CTCTGGAAGTGAGGGA
3418

















GA






2FH21F_16_023
21
29197908
R
G
102
16
20660574
R
A
102
TTCCCGCCGCCAGGCTGAG
 915
GGAGAAACGTTTCTCTTTCC
2167
ACGTTTCTCTTTCCTC
3419

















TCAG






2FH21F_16_024
21
32671407
F
G
98
16
30338481
F
A
 97
CATGCCAGAGCAAACTGTAG
 916
CAACCCACTTCAGTGCCAG
2168
ACCCACTTCAGTGCCA
3420

















GCAGCCTAC






2FH21F_16_025
21
32671471
F
T
88
16
30338544
F
C
 88
GGGTTTGGATTTATGATGGG
 917
TACAGTTTGCTCTGGCATGG
2169
TGGCATGGGGTACTAT
3421

















GAGAGG






2FH21F_17_004
21
24615434
F
T
83
17
44843420
R
C
 83
CTGAACTGGGCACCAAGAGA
 918
TTCCAGAGATCAGGGAGTTG
2170
GGAGTTGTAGGTATTA
3422

















ATACATT






2FH21F_17_006
21
38532100
F
C
94
17
45987947
R
A
 94
TGCCTTTCCTGAGTACCCTC
 919
TGAGCAGGCTTGATTCTCAC
2171
GCTTGATTCTCACACA
3423

















CATA






2FH21F_17_008
21
38532123
R
T
96
17
45987924
F
C
 96
TGAGCAGGCTTGATTCTCAC
 920
GCTGCCTTTCCTGAGTACC
2172
CTGCCTTTCCTGAGTA
3424

















CCCTCCGA






2FH21F_17_009
21
38532149
F
C
91
17
45987898
R
A
 91
TGCTGATTCTGGCTGATGGG
 921
TCGGAGGGTACTCAGGAAA
2173
TCGGAGGGTACTCAGG
3425

















AAAGGCAGC






2FH21F_17_010
21
38532403
F
A
95
17
45986684
R
C
 95
TTCCGTGTCAGCCCACAACC
 922
ACACACACTTGTCCATCCAG
2174
ACTTGTCCATCCAGTC
3426

















CTTGTG






2FH21F_17_011
21
38532428
R
G
99
17
45986659
F
T
 99
ACACACACTTGTCCATCCAG
 923
GCCAATTCCGTGTCAGCCC
2175
TCCGTGTCAGCCCACA
3427

















ACC






2FH21F_17_012
21
39486280
R
A
95
17
41026587
F
C
 95
TTATTTCCTTGATATCCAC
 924
GTCATTGTAGAACTTTTCAC
2176
TGTTGAAGTTATACCT
3428















C

CTGAA






2FH21F_17_014
21
39486350
R
A
93
17
41026517
F
C
 93
CAGGTGAAAAGTTCTACAAT
 925
GTTGTATGGAAATTATAGTT
2177
TGTATGGAAATTATAG
3429













G

C

TTCAATTATT






2FH21F_17_015
21
39486380
F
T
107
17
41026487
R
G
107
GTTGATATATTTATTTATCA
 926
AATAATTGAACTATAATTTC
2178
AATTGAACTATAATTT
3430













GG

C

CCATACAACA






2FH21F_17_020
21
39486682
R
A
99
17
41026180
F
C
104
CACAATCAAGTTCAACTTGT
 927
TTTACTAACCTCCCTGTTTG
2179
TAACCTCCCTGTTTGA
3431













A



TATTAAAAA






2FH21F_17_021
21
39486851
R
C
82
17
41026004
F
T
 82
ACCATCTGAGGGTGTTACTG
 928
GTGCAAAGGGCTTAGTGATG
2180
CTTAGTGATGCATCTT
3432

















ATTCTTTA






2FH21F_17_022
21
39486902
R
T
100
17
41025953
F
G
100
AGCACTTCAAAACAGAAGGG
 929
ACAGTAACACCCTCAGATGG
2181
GGTATTTTTATTGGTT
3433

















TGTTTTATAT






2FH21F_17_023
21
39486997
F
G
102
17
41025858
R
A
101
AGAAAGGTTCCTTTCAAAT
 930
AGTTCTTTGCCTCCATTTTC
2182
AACCCAATTTCCTCTT
3434

















TAG






2FH21F_18_002
21
13567219
R
A
102
18
15086411
F
G
101
AGATATTGCCAGCCACCTAC
 931
TAAGAGAGCTACAGGTGGTG
2183
AGGTGGTGGTGTCAGT
3435

















AATGG






2FH21F_18_005
21
13583906
R
G
86
18
15072096
F
T
 86
GAGGGCCACATTTCACTATG
 932
CCCTTTAAGGGGAAATGATT
2184
GGGAAATGATTAGAAA
3436

















TAGAAACTTC






2FH21F_18_006
21
13585163
F
T
119
18
15070881
R
G
119
TTAGGGTAATGGTGAGAGAG
 933
TTAGAAAAGAGACTAAATTC
2185
ATTTTACATAGTCCTT
3437

















AAAATTTGT






2FH21F_18_007
21
13585166
R
C
119
18
15070878
F
A
119
TTAGAAAAGAGACTAAATTC
 934
TTAGGGTAATGGTGAGAGAG
2186
TAAGAGTGAAGCGAAA
3438

















ATC






2FH21F_18_019
21
13607464
R
A
108
18
15048533
F
G
109
TAGACGTTTTAGGAATTTG
 935
TTCGGATGAAGATAGTGGGC
2187
AGAATGGAGGGATCTA
3439

















TTAGCAAAAA






2FH21F_18_020
21
13608759
F
T
100
18
15047227
R
C
100
GACCAAAGTGTATACATAG
 936
TCCCTCTCTCCCTGAAAAAG
2188
TGAAAAAGAGACACAT
3440

















TTGCCTTTG






2FH21F_18_021
21
13609221
F
C
97
18
15046765
R
A
 97
GTGTAGTAAGCGGGAATGAG
 937
GGGATGATTCTTAAAAGGG
2189
AAAAGGGATTCTGGAA
3441

















GTGG






2FH21F_18_023
21
13613775
R
T
111
18
15039544
F
C
111
CAATAAGGTGGTATTCTCTC
 938
AGTGGGGCACATGTATTTTG
2190
TGGGGCACATGTATTT
3442













C



TGTAGATT






2FH21F_18_031
21
13676899
F
C
80
18
14843884
R
A
 80
GTGTGAGGCTTCACTAAAGG
 939
AGCCTCTATTGATGCCTCAG
2191
GCCTCAGAGAGTGAGA
3443

















A






2FH21F_18_035
21
13677129
F
C
98
18
14843654
R
A
 98
GCTGCTTGTTAGTGAATTTA
 940
GTGTCTAGTAAGACAGTACC
2192
ACCAATTTGGCAGAAA
3444













C



GATT






2FH21F_18_042
21
13678531
F
T
119
18
14842335
R
C
119
GAAAGTTAACAAAAGCAAGG
 941
CCATAATTGAATACCTCCTC
2193
ATAATTGAATACCTCC
3445

















TCATTTTTCTC






2FH21F_18_044
21
13678653
F
T
86
18
14842213
R
C
 86
AGGAGTCTCTGGAGCAGAAA
 942
CTAATTGCTGTCGAAGCCAC
2194
ACCTATTTTTGCTTTC
3446

















TAGTT






2FH21F_18_045
21
13678937
F
T
120
18
14841929
R
C
120
CAAGAACTTGCTTTCCACAG
 943
GTTGATGGAGCACCTCATTG
2195
ATCAACATTCATTATT
3447

















CCTTGCAAA






2FH21F_18_046
21
13679258
F
C
81
18
14841608
R
A
 81
TTTATTTTCCTTCACCTGG
 944
TGCCATGCTAAAACTGGAAG
2196
AGTAGCCACACTGAAA
3448

















C






2FH21F_18_047
21
13679689
R
G
108
18
14841172
F
T
108
ACTAAGGCTCTTAGTATGGG
 945
TAAAAGATTAATCAATTTGA
2197
AAGATTAATCAATTTG
3449















C

ACTACATAC






2FH21F_18_048
21
13679727
R
G
85
18
14841134
F
G
 85
TATATGTAGCACTAAGGCTC
 946
TGGGTTTACACTCTGATGTC
2198
TTACACTCTGATGTCT
3450

















AACCTATACAA






2FH21F_18_050*
21
13680033
F
T
108
18
14840828
R
G
108
CTTTACCACTTTTGTTTTG
 947
GGACTTCTCCACCAAATCTC
2199
CAGTTAATTCTACTGG
3451

















GTAAATA






2FH21F_18_051*
21
13680058
R
C
113
18
14840803
F
A
113
GGACTTCTCCACCAAATCTC
 948
ATCTTCTTTACCACTTTTG
2200
ATCTTCTTTACCACTT
3452

















TTGTTTTGA






2FH21F_18_054
21
13680768
F
C
103
18
14840088
R
A
104
TGTCATTTGGAAGAGGTTAC
 949
ATAAAATTCCTATATTCCTG
2201
TCCTATATTCCTGAAT
3453

















TTTTTTTT






2FH21F_18_055
21
13680796
R
T
107
18
14840059
F
G
108
CCTATATTCCTGAATTTTTT
 950
TTTTCTCACTATTTTTCAAG
2202
TGTCATTTGGAAGAGG
3454

















TTAC






2FH21F_18_059
21
13686501
R
T
101
18
14834328
F
T
101
ATATTTCAAGTATCACTATG
 951
GCTTTAATGGTCCATAGGTC
2203
ACAGTTTTCACTTTTA
3455

















TTAAAGTAGA






2FH21F_18_060
21
13686840
F
A
100
18
14833989
R
C
100
ATTATTCCTATGCATGCTT
 952
AGCAGTTGAAAACAAATTC
2204
AACAAATTCTACATAT
3456

















TATCTATGACC






2FH21F_18_061
21
13686860
R
G
111
18
14833969
F
A
111
CTACATATTATCTATGACC
 953
GTAAACAATTGTCTAAACTG
2205
TATTATTCCTATGCAT
3457















G

GCTTAAA






2FH21F_18_063
21
13687524
F
A
98
18
14833315
R
C
 98
GTAGCTTTAATTTCAGGTG
 954
TAAGTCATACAGACATTCCC
2206
CAAAACATTACAGTAT
3458

















GAGGAC






2FH21F_18_065*
21
13687741
R
A
120
18
14833098
F
A
120
TTGGGGAGATGAGACTATTA
 955
GGAAGAATAAACAAACATTG
2207
AAACATTGAGAGCAGG
3459

















T






2FH21F_18_066
21
13688025
R
A
116
18
14832818
F
A
112
CAGCCACAAATGAATCCAG
 956
CATACCGAAAGAAAACCCCC
2208
GCTGAGAAAAAGGACT
3460

















TAG






2FH21F_18_067
21
13688314
R
A
115
18
14832529
F
C
115
AGGAGCAAATTATGACCCAG
 957
GATATAAATTATTCCAGTGT
2209
AGTGTATTTCACTGAA
3461

















TATATGG






2FH21F_18_068*
21
13688562
F
T
111
18
14832281
R
G
111
TTTGCATGAGTGAATCAAG
 958
TGTTTCCCATATCCTTGCAG
2210
TATGCCTACATTGCTG
3462

















TATC






2FH21F_18_070
21
13688877
F
C
118
18
14831965
R
T
118
GAGATATTTGAATCTAAGAG
 959
TATGGTAAGTGTCTAATAG
2211
ACCAAAACAATTTGCT
3463













C



TCATTAAA






2FH21F_18_071*
21
13689014
F
T
90
18
14831828
R
G
 90
AGATTGTGGGTACTCCAGAG
 960
AGTCACCATGGTTTACTCC
2212
TCCAATTCTAGTAATC
3464

















CTCC






2FH21F_18_072
21
13689107
F
T
117
18
14831735
R
G
116
ATAGCCAGCCAACTTTGGAG
 961
CCCACAATCTAATCTTCTGG
2213
TGAATTCACTCAAATT
3465

















TCCTTT






2FH21F_18_074
21
13689632
R
C
111
18
14831211
F
T
111
AAGTGTGAAAACTTCTCGTC
 962
CTGCAGTATGTGAATATAAG
2214
CAGTATGTGAATATAA
3466















C

GCATATTT






2FH21F_18_076
21
13690808
F
T
88
18
14830029
R
T
 89
CCTTTTAAAATATGCACGAG
 963
GACTAGGTTACTGAGCAAGG
2215
GTTACTGAGCAAGGAA
3467

















AATAA






2FH21F_18_078
21
13691635
R
G
114
18
14829201
F
T
114
TGCTGCAGTAGTAGGAAGAG
 964
CTCTTAAGGAACATTCTCTG
2216
TTTTAAAGAAAAAGTT
3468

















ACAGTAATTT






2FH21F_18_083*
21
13694498
F
G
89
18
14826337
R
A
 89
AGTGCTTGAGCATTTCATGG
 965
ACTCTGTCATCTGGTTTCCC
2217
TGGTTTCCCCATCCTA
3469

















GTAAATAACA






2FH21F_18_086*
21
13695423
F
T
120
18
14825419
R
G
120
GAATCATACGTAAGGGAAGA
 966
TATAAAATATCCCCATTGC
2218
TATCCCCATTGCAAGA
3470

















GATA






2FH21F_18_090*
21
13697020
R
C
83
18
14823821
F
C
 83
CTCCTTTTCTTTCACGACTG
 967
CGGAAGAAACAAACAAGAGC
2219
CGGAAGAAACAAACAA
3471

















GAGCCATGAT






2FH21F_18_094
21
13706648
R
A
100
18
14814186
F
C
100
GTGGCTATGAAAGACAGCCT
 968
AGCTCCGCTTTGATTTCAGG
2220
ATTTCAGGCTTCATAG
3472

















TTTG






2FH21F_18_101
21
13713284
R
G
103
18
14807188
F
T
103
GACTCTCTTCATGATGACTC
 969
GAAGTAAGACATACACTTA
2221
AAGTAAGACATACACT
3473

















TAAACAAA






2FH21F_18_103
21
13714932
F
G
99
18
14805576
R
A
 99
AGCAACATAACGCTTTCTCC
 970
CTTTCATGGGAGAAATGTGG
2222
ATGTGGAAGAAGAGTA
3474

















ATTGGATAA






2FH21F_18_117
21
13723496
R
A
115
18
14718593
F
C
115
GCTATGATGCATTTGCCAAT
 971
AGCACTGCAGGTCCAAAATG
2223
AGATTTTTAGATGCCT
3475

















TCTTC






2FH21F_18_120
21
13724769
R
A
85
18
14717315
F
G
 85
AATGTCTCTTTCCTCTGCTG
 972
ATGCATTCATCAAGCAACT
2224
GCATTCATCAAGCAAC
3476

















TGGAGAT






2FH21F_18_122
21
13725010
R
A
85
18
14717074
F
C
 85
TAGCATAACAAGTTGGTGAG
 973
AGTGAACTATGATAGGAAGC
2225
AAGCTAATTGGCACAT
3477

















TT






2FH21F_18_123
21
13732060
F
C
93
18
14710050
R
A
 93
CCTCTTTCTTCATAGGTAGG
 974
GAGCTGGATCCATCCATCAC
2226
TCACCAGGGAATCTTT
3478

















ACTA






2FH21F_18_126
21
13734197
F
G
104
18
14707921
R
T
105
TTGTGACATGATAAAGCTGG
 975
GTCTGAAAAACTGTCATTC
2227
CTGTCATTCAGCGACT
3479

















A






2FH21F_18_127
21
13734217
R
C
102
18
14707900
F
A
103
AAAACTGTCATTCAGCGACT
 976
TTGTGACATGATAAAGCTGG
2228
AGCTGGATATTGAAAA
3480

















CCAAAA






2FH21F_18_132
21
13735676
R
C
106
18
14706441
F
T
106
CAGTACTAAGTATGAACATG
 977
TATACCTTAGAATAGTCAG
2229
ATACCTTAGAATAGTC
3481













A



AGAAGTCAG






2FH21F_18_133
21
13736390
F
G
116
18
14705733
R
T
118
GGTCTAGAGAACTCTGAAAG
 978
TGCAATTCACTTGGACACGG
2230
TCACTTGGACACGGCC
3482

















TAAC






2FH21F_18_136
21
13739171
F
C
97
18
14702950
R
A
 97
AGGAAGTTGCACTCTGTTGG
 979
ATATACACACCCTTCCCTGC
2231
GCACATTTGACTTTCT
3483

















GTACAACA






2FH21F_18_137
21
13739241
R
G
110
18
14702880
F
T
110
TGCAACTAAGAGACATCAGC
 980
AGTCAAATGTGCAGGGAAGG
2232
AGTAGCCAGAGGGCAG
3484

















CCAGG






2FH21F_18_138
21
13739280
R
C
111
18
14702841
F
A
111
AGCTGATGTCTCTTAGTTGC
 981
TGGTAGAGACTCACGCAAAG
2233
AGCTTCACCAGAAACC
3485

















CAGAGG






2FH21F_18_139*
21
13739359
F
C
115
18
14702762
R
A
115
AGTCTCTACCACAAGAACAC
 982
ATTAGGGTGCAGACAAGGAG
2234
CTTCTCTAGCCTATTG
3486

















TCTCC






2FH21F_18_141
21
13739493
F
T
104
18
14702628
R
G
104
TAGTGAAGCTGTCGGTAGTG
 983
ATTCAGCCTGGTGAATGAAG
2235
GCCCTCCAATAACAAG
3487

















A






2FH21F_18_142
21
13739495
R
T
104
18
14702626
F
G
104
ATTCAGCCTGGTGAATGAAG
 984
TAGTGAAGCTGTCGGTAGTG
2236
GTCAGAGACATTGTCA
3488

















ACCAGACAC






2FH21F_18_143*
21
13739563
R
C
100
18
14702558
F
C
100
CAATGTCTCTGACACTACCG
 985
TGACTTTGGAGGTGGGATAC
2237
TGACTTTGGAGGTGGG
3489

















ATACTGTGTG






2FH21F_18_144*
21
13740079
R
G
100
18
14702029
F
T
100
CAGATGCCATTAGATGGTGC
 986
TGCTCCTCCTAAACCTTCTC
2238
CCTAAACCTTCTCATC
3490

















TTGCTGTG






2FH21F_18_145
21
13740111
F
A
108
18
14701997
R
C
108
CTGTTACCACCTTGCCTGC
 987
GCAAGATGAGAAGGTTTAGG
2239
AGAAGGTTTAGGAGGA
3491

















GCA






2FH21F_18_149
21
13740288
R
T
108
18
14701820
F
G
108
TGGTGGCACTAGTACACAAG
 988
TTCATAGAACCATGCCACCC
2240
AGAACCATGCCACCCA
3492

















GATATTCTC






2FH21F_18_151
21
13740658
R
A
81
18
14701478
F
C
 81
GTTTATTGCACCATCTACA
 989
GAAGCAATTTCAAGCTAACA
2241
AGCAATTTCAAGCTAA
3493















G

CAGAAAGAC






2FH21F_18_153*
21
13740789
F
T
106
18
14701347
R
G
106
TCCATGTTGCCAGTAAACAC
 990
CAAGCTTTTCTCTTGTAGTC
2242
TAGTCTATCTTACAGG
3494

















TACTTCCA






2FH21F_18_154
21
13741100
F
T
81
18
14701036
R
G
 81
TAAATGAGCAGAGACTCAAG
 991
TTAGATTGTTATCCCCACT
2243
TTGTTATCCCCACTTC
3495

















TTTAA






2FH21F_18_156
21
13741318
F
C
85
18
14700818
R
A
 86
AGGACGTGTAAGAGAAAGGG
 992
GTTCTTCGTAAATCAAACCC
2244
CGTAAATCAAACCCTT
3496

















TGTCATTT






2FH21F_18_158
21
13741417
R
A
118
18
14700718
F
C
118
CGTCCTTTGACACATTTTAG
 993
GAAACACTTCAGTTTCTTG
2245
CTTCAGTTTCTTGAAA
3497

















TGTTT






2FH21F_18_159
21
13741498
F
C
112
18
14700637
R
T
112
CCAAACATTTATAATCTGAC
 994
GTGTTTCTTTTTTCACCTGC
2246
TCTTATTATATCTGGA
3498

















TTTTAACATTT






2FH21F_18_160
21
13741575
F
T
115
18
14700560
R
G
118
AACCAGTACAGATTAGTTGC
 995
AATGATTCTGACTGGTTTCC
2247
TGATTCTGACTGGTTT
3499

















CCTACATATA






2FH21F_18_161
21
13741601
R
T
112
18
14700531
F
G
115
GATTCTGACTGGTTTCCTAC
 996
AACCAGTACAGATTAGTTGC
2248
TGCAAATAATTAGAAA
3500

















GTAAAGG






2FH21F_18_162*
21
13741741
R
A
114
18
14700391
F
G
114
TCTCATGAAAAAGCAGCAG
 997
GCATGCTTCTAGTGGTTTAC
2249
TTACTATTAGACAATA
3501

















ATGGGTTGGC






2FH21F_18_171
21
13746965
R
C
114
18
14695171
F
C
114
GGGAAGATCTTAAAGGGAGC
 998
TTCCTGATGATAATCTTCCC
2250
TATAGCCAATAAATTA
3502

















CTCTTATTTTA






2FH21F_18_172
21
13753460
R
C
114
18
14688687
F
A
114
TTCTGCAAATTACCATTTC
 999
TCTATGCCTAAAATAAGTG
2251
CTATGGGTCAGTTGGA
3503

















G






2FH21F_18_173
21
13753479
F
A
114
18
14688668
R
C
114
TCTATGCCTAAAATAAGTG
1000
TTCTGCAAATTACCATTTC
2252
TCCAACTGACCCATAG
3504

















A






2FH21F_18_174
21
13754373
R
G
90
18
14687774
F
T
 90
GATCTCTGCAAAGAATACC
1001
GGGAACTGTTAAGAAACTC
2253
AAGGAAGTGAATGGAT
3505

















CTTAC






2FH21F_18_175
21
13754850
R
A
98
18
14687294
F
G
 98
CAGGAGTATGCATTTTCCTC
1002
GTCACACAGAGTTCTGTGAG
2254
AGCACCACCTAAATAC
3506

















TTTTCA






2FH21F_18_176
21
13756658
F
G
104
18
14685428
R
A
104
ACACCACATTTCTACCACTG
1003
AACGGCCAGGGTGGACACT
2255
GGCCAGGGTGGACACT
3507

















GTTACT






2FH21F_18_178
21
13769627
F
C
100
18
14672247
R
A
100
TCTGTGACACAGAGCATGAG
1004
GCATCAGGACAAACTGATGG
2256
TAAGCAGCCTAGGTTT
3508

















TCCTC






2FH21F_18_186
21
13771387
F
C
98
18
14670492
R
A
 98
CAGAGCTGATTTGTTCCAGT
1005
ACCCAGTCTTCCTGAGTATG
2257
CTTGTGGGCGATGTCT
3509

















A






2FH21F_18_188
21
13771486
F
C
111
18
14670393
R
A
111
AACTCCAGGGCTACTTGAAC
1006
GTGCTATAAAGCTTTAACAA
2258
TAAAGCTTTAACAAGT
3510















G

TGGCGA






2FH21F_18_190
21
13771524
R
A
107
18
14670355
F
G
107
AAAGCTTTAACAAGTTGGCG
1007
AAGAACTCCAGGGCTACTTG
2259
ACTCCAGGGCTACTTG
3511

















AACAATT






2FH21F_18_191
21
13771649
R
T
90
18
14670230
F
G
 90
TGATACAGAAATGTCAACCC
1008
GATGCTTCTAAGGACCATGT
2260
GGACCATGTAATTTCT
3512

















TTAATTC






2FH21F_18_194
21
13775207
R
A
120
18
14666674
F
G
120
TGTGACAAATTCTATGGC
1009
TGCACAGTTGAAAAGTAACC
2261
AAAGCATTTAAAAAAA
3513

















GATTAGGAG






2FH21F_18_195
21
13775250
F
T
119
18
14666631
R
C
119
TGCACAGTTGAAAAGTAACC
1010
CAAATTCTATGGCATCTTTC
2262
AATGCTTTTGTTTGGT
3514

















ATTTGATAA






2FH21F_18_197
21
13775571
F
C
101
18
14666302
R
C
101
GCCATTTGAAGAATGGTATG
1011
GCCTAACATATTGTATGCAC
2263
ACTAAGCAAGTACTAG
3515

















TAAAATTATT






2FH21F_18_198
21
13775577
R
A
101
18
14666296
F
C
101
GCCTAACATATTGTATGCAC
1012
GCCATTTGAAGAATGGTATG
2264
TTGAAGAATGGTATGA
3516

















AGATGATAA






2FH21F_18_199
21
13775783
F
C
101
18
14666090
R
A
101
AGTCTGTCTATTGTAGGATG
1013
GTACCTTATTTTCCTCACAC
2265
ACACAAAAATGTAAAC
3517

















ATTAAGGA






2FH21F_18_200
21
13775825
R
C
85
18
14666048
F
A
 85
GATTCATCCTACAATAGAC
1014
GAGAGTGAGTGAGACTTCAG
2266
CAGCCCAATCAATGAA
3518

















TGACCC






2FH21F_18_201
21
13775885
F
C
96
18
14665988
R
A
 96
GTGTAGTAGATTTTCTAGGC
1015
TGATTGGGCTGAAGTCTCAC
2267
ACTCTCTATTATTTCT
3519

















AATTTTTTCA






2FH21F_18_202
21
13777903
F
G
96
18
14663972
R
T
 96
TCTTATCACCTATGTTCTGG
1016
ATTCAGCAGGCAATGGAGAG
2268
CAGAAAAGCTTAAGCA
3520

















AAAATGAGCA






2FH21F_18_203
21
13777939
F
T
119
18
14663936
R
C
119
CCAGAACATAGGTGATAAGA
1017
GCCTTTATCTTCACAGCCC
2269
TGTTATAAACCTGATG
3521













C



TTTCATA






2FH21F_18_204
21
13778733
F
C
96
18
14663146
R
A
 96
AAAATCTTTATATAGCTTGG
1018
GGTTAGTCTAAGATAAAACT
2270
GAGTAAAAGGAAGGAA
3522















C

AGGA






2FH21F_18_212
21
13783264
F
T
96
18
14658612
R
C
 96
GTTCCTCATGTCAGCTCTTG
1019
ACAACCAAGTCCTACTGAAC
2271
TGAACTACTGAATGTT
3523

















AGAAC






2FH21F_18_213
21
13783324
R
G
100
18
14658552
F
T
100
ACTCAAGAGCTGACATGAGG
1020
GTCTACCCTGTCCATTGAAG
2272
TCCATTGAAGATGAGG
3524

















ACTCCTA






2FH21F_18_216
21
13784000
F
A
101
18
14657872
R
C
101
CTGTGTTGATGTGGTAGCCC
1021
GTCACCCAGTATATTTCTCC
2273
TTCTCCCAAATAAAAG
3525

















AGGA






2FH21F_18_217
21
13784009
R
C
101
18
14657863
F
A
101
GTCACCCAGTATATTTCTCC
1022
CTGTGTTGATGTGGTAGCCC
2274
GTGGTAGCCCATCACT
3526

















GGGTTGTAAA






2FH21F_18_219
21
13785807
F
C
120
18
14656075
R
A
120
TATGTGTTATATTTTTTTCT
1023
CAATGCAAACACTTTTAAGA
2275
TGATCCTTTTAACTCA
3527













G

C

ATCCAAA






2FH21F_18_223
21
13787653
F
T
83
18
14654244
R
C
 83
CTTTAGAAGGATTTTCTTAT
1024
GTCAATAACAACAATGTCC
2276
CAACAATGTCCATGAA
3528

















AAACTTGATT






2FH21F_18_224
21
13787882
F
C
95
18
14654015
R
T
 95
CCAAATTAATCTTCCATTCT
1025
TATAGATTATTGAATCTGAC
2277
ATTGAATCTGACAATA
3529













G



AATCATATT






2FH21F_18_226
21
13788781
R
A
100
18
14653118
F
G
100
AGTACATCATTGGCACCTTG
1026
ACTGCATTTGAAGTAGATGG
2278
ATTTGAAGTAGATGGT
3530

















AATGTAATAC






2FH21F_18_233
21
13809100
F
T
101
18
14633715
R
G
101
GATGGAAGGAGTGGTAGTG
1027
TGTGCCTTTGCCGAAACCAG
2279
TGACCAGCATGACAAG
3531

















GTGA






2FH21F_18_234
21
13817921
F
T
110
18
14624683
R
G
110
TTTTTCTATTTTAACTAACT
1028
TACCTCTGATGAGCATCAGC
2280
TGAGCATCAGCTAATA
3532













G



TTTAATC






2FH21F_18_241
21
13825443
F
C
87
18
14617161
R
C
 87
TGACACATGACTTTTGTGCC
1029
TCATTTAATTAATCATCAGG
2281
AATTAATCATCAGGTT
3533

















CTTTATCCTTA






2FH21F_18_243
21
13825600
F
T
108
18
14617004
R
T
108
CAGTATTGGCTTATTATGTC
1030
CAGAGTAGGTGTCCTTACAG
2282
GACACGTTCCAGTATA
3534

















AAATA






2FH21F_18_244
21
13825929
R
G
116
18
14616676
F
T
116
CCAGGTACTGTTGTTTTTGA
1031
CAGGTGTTTTTGGTAACCAG
2283
AAGGCACAGAAAGAAG
3535













C



TAATATC






2FH21F_18_245
21
13825963
F
G
116
18
14616642
R
G
116
CAGGTGTTTTTGGTAACCAG
1032
CCAGGTACTGTTGTTTTTGA
2284
CTGTGCCTTCAAAATT
3536















C

TCA






2FH21F_18_252
21
13837138
F
A
103
18
14605452
R
G
102
GAAAACAAATGTGCATTAGC
1033
TACTACGTTTTTATACTTAC
2285
TACTACGTTTTTATAC
3537

















TTACTTTTTTT






2FH21F_18_254
21
13846782
R
T
92
18
14595816
F
C
 92
GCTTCTCTAAGCTACTTTA
1034
TAGTCGACCCTGGGCAATT
2286
CCTGGGCAATTCCTTA
3538

















AATACCAGATA






2FH21F_18_255
21
13847349
R
T
112
18
14595239
F
G
112
CGTCTCCTGAGTAAACTCAC
1035
GTAAGATGAATACACAAAGG
2287
AGGCTAAATCTTCTAA
3539















C

AATCAAG






2FH21F_18_260
21
13852890
R
T
120
18
14589935
F
G
120
GACAGAGAGGGTTAAGTTCT
1036
GGTTACATATCACTGCAAG
2288
ACTAAATCAATCTCAT
3540

















CATACATTC






2FH21F_18_261
21
13853735
F
C
113
18
14589078
R
C
113
CCATAGCAAGATGAATTCAC
1037
ATGATACTCCCCAAAGTCTC
2289
CTCCCCAAAGTCTCAG
3541

















ATAG






2FH21F_18_262
21
13853770
R
G
107
18
14589043
F
T
107
CCATAGCAAGATGAATTCAC
1038
CTCCCCAAAGTCTCAGATAG
2290
AATTGCAAAAGCCAAT
3542

















TAAAAAAC






2FH21F_18_268
21
13856320
F
C
93
18
14586489
R
A
 93
ACCCTCATATGTCTGGTAGC
1039
AGAGAATTTGGGGCCTGGCT
2291
GGCCTGGCTGACAGTA
3543

















AAC






2FH21F_18_269
21
13856700
F
C
83
18
14586110
R
A
 83
AGTTCCACATGAACCTAGCG
1040
TGAGATAAGTGGCTACGTTG
2292
TAAGTGGCTACGTTGT
3544

















TGTCATATTG






2FH21F_18_270
21
13856890
R
A
104
18
14585922
F
C
104
GTTGTGACTATTGTTATAG
1041
TGGTTCTCAACACTGACCAC
2293
CCACTAGTATTAACAT
3545

















ACAGTTTA






2FH21F_18_271
21
13862329
R
G
105
18
14579738
F
T
105
GAGTGTAGAGCTGTTACTGG
1042
GGACATATGGCCTTGCTTAG
2294
AGAAAGGTGACTAAGA
3546

















ATTGTAGTTC






2FH21F_18_272
21
13862406
F
T
110
18
14579661
R
G
110
CTCAGATTATAGGAGACAGA
1043
GCTCTACACTCTAGAAGAAG
2295
AAAATTGATGAATACT
3547













G



TAGTTCCC






2FH21F_18_273
21
13862436
R
A
116
18
14579631
F
G
116
TGATGAATACTTAGTTCCC
1044
ATCTACAAAGGATAATCAG
2296
TGCACTGGAGAAATTA
3548

















AAA






2FH21F_18_274
21
13862459
F
T
97
18
14579608
R
C
 97
AGGAAATTATCTACAAAGG
1045
ACTCTGTCTCCTATAATCTG
2297
TTAATTTCTCCAGTGC
3549

















AGTG






2FH21F_18_275
21
13862500
F
A
106
18
14579567
R
C
106
CCTGAAGTATGTTAGTAGAC
1046
TCCTTTGTAGATAATTTCC
2298
TGTAGATAATTTCCTT
3550

















TGTAAGTA






2FH21F_18_276
21
13862519
R
A
106
18
14579548
F
C
106
TCCTTTGTAGATAATTTCC
1047
CCTGAAGTATGTTAGTAGAC
2299
AGTAGACAAAGAAGAA
3551

















AAGTGAAG






2FH21F_18_277
21
13869305
F
G
102
18
14567749
R
G
102
TTTGTCCTTCATCTCTTACC
1048
TAAGTCATTTACTTCTCAG
2300
TAGAAGACAGCATTTC
3552

















CATTA






2FH21F_18_284
21
13877545
F
A
83
18
14559499
R
C
 83
ATATTGACTATAACTTAAAT
1049
TGGTGGACGAATGTCAAAAA
2301
CGAATGTCAAAAATTT
3553













AT



TAAAATATCA






2FH21F_18_292
21
13895590
F
C
98
18
14541965
R
A
 98
GTGATTGTAAAAATTATAGC
1050
CAGATTGACCACCTCCAAAG
2302
AGAAAGAGGGGAGGTA
3554

















AATAATAAGA






2FH21F_18_293
21
13896370
F
C
99
18
14541176
R
A
 99
TGCTTTCGAATTTTTTCAC
1051
CCCATTCTTCTTAATGTCAG
2303
AATGTCAGAAGCCCTT
3555

















A






2FH21F_18_296
21
13897380
F
G
105
18
14540150
R
T
105
CCCAAAGATTTAACTTGAT
1052
ATATATCTGGGCCTGGCTAC
2304
TTCTCTTGGTTCAAAT
3556

















TTCC






2FH21F_18_300
21
13898463
F
C
111
18
14539060
R
T
111
CTCTCCATGATGTACTGTAG
1053
GCATACAGAGAGGAGCTAGT
2305
GAGGAGCTAGTCAGAA
3557

















CA






2FH21F_18_301
21
13898498
R
A
105
18
14539025
F
C
105
AGAGAGGAGCTAGTCAGAAC
1054
CTCTCCATGATGTACTGTAG
2306
ATGATGTACTGTAGTA
3558

















ACACAC






2FH21F_18_303
21
13898901
R
C
118
18
14538622
F
T
118
GATCTAGGTTGAAACTAGTT
1055
ATTTGCCCAATGCAAGCCAG
2307
CAGAAGTGCAAGTTCA
3559













G



G






2FH21F_18_304
21
13898938
R
A
111
18
14538585
F
C
111
GTTGAAACTAGTTGGGCTTC
1056
ATTTGCCCAATGCAAGCCAG
2308
GCAAGCCAGTAAATAA
3560

















TAAAAC






2FH21F_18_305
21
13899002
F
C
110
18
14538521
R
A
110
GCCTCTTTCACTACCATGAG
1057
ATCTAACGAGGATCTGCACC
2309
TCTGCACCACCTTTCT
3561

















T






2FH21F_18_307
21
13899589
R
G
95
18
14537930
F
T
 99
GTTAATCAGAGCCAGCCAAG
1058
TCAATTCCTCTCTAAGAGCC
2310
AGCCACGGTAACTCTT
3562

















TC






2FH21F_18_314
21
13958107
F
T
92
18
14479443
R
G
 92
GAAGGAAGGTGGGTTCTGTG
1059
CGCCGCACATCCCCTCTCG
2311
CGCCGCACATCCCCTC
3563

















TCGCCCCTC






2FH21F_18_319
21
14043808
R
A
90
18
14396652
F
C
 90
CTGAATTCTTTGGGAAGGGC
1060
TGAGAGTCATCAAAAAGGTC
2312
GTCCAAGTTTAGTGAA
3564

















GATG






2FH21F_18_326
21
14121932
F
G
114
18
14347928
R
T
116
AAAGGAACGAAAGCAACGGG
1061
AACCTGTTCAGTGCTGCC
2313
CAGTGCTGCCAGTCAA
3565

















C






2FH21F_18_327
21
14121941
R
G
109
18
14347918
F
A
111
TGTTCAGTGCTGCCAGTCAA
1062
AAAGGAACGAAAGCAACGGG
2314
TGATCCCACGCTGCTA
3566

















CTCA






2FH21F_18_328
21
14121971
R
G
109
18
14347887
F
A
111
TGTTCAGTGCTGCCAGTCAA
1063
AAAGGAACGAAAGCAACGGG
2315
CGAAAGCAACGGGGAA
3567

















AAAAAA






2FH21F_18_329
21
14122272
R
A
110
18
14347585
F
C
111
CCCGCAAAAGTTTCAAGAAG
1064
ACTGATTTCCCAGCACCCAC
2316
CTGATTTCCCAGCACC
3568

















CACTGTCCC






2FH21F_18_330
21
14124875
R
T
81
18
14344986
F
C
 81
TTCCCTGATTACACTGTGCC
1065
CATTTATAGTCTATACGTGC
2317
ATAGTCTATACGTGCA
3569

















GTGCAGGGTT






2FH21F_18_332
21
14128493
F
T
81
18
14341370
R
G
 81
ATGTAGGCATTGTAATGAGG
1066
GACTTGAATTTAACTGCTCC
2318
TTGAATTTAACTGCTC
3570

















CAGTAAGG






2FH21F_18_333
21
14221264
R
T
116
18
14222905
F
C
116
AGTATAATATTTTGGCATTC
1067
CTGGGGCAAGGTTGGGAT
2319
AAGAGAAACAACATAA
3571

















TCTGA






2FH21F_18_340
21
14274503
R
A
114
18
14168976
F
C
114
AGCGCACAGCGTTTCCGCA
1068
TGGGGCTGCAGCTGCGAGA
2320
GGGCCTTGCCATTCTC
3572

















A






2FH21F_18_344
21
14282925
R
G
92
18
14159539
F
T
 92
CGAGTAAGTAAATGTGAGTG
1069
CCCTTTTCTACTCACATTCC
2321
GCTAATTAGTGCTATT
3573













G



GGCTG






2FH21F_18_346
21
14283763
F
T
116
18
14158701
R
G
116
AACTTGCCTTCAAGATCTG
1070
GATAACATAAGATTAGGAAC
2322
AACATAAGATTAGGAA
3574

















CAAGAATA






2FH21F_18_349
21
14296262
R
G
119
18
14146201
F
T
120
TCAGAACCTTTTTGAAAAC
1071
CCAATAGGCATTGCTAAACT
2323
CTTTGCATATTTCTTT
3575

















TTACGAAACGC






2FH21F_18_350
21
14296320
F
C
119
18
14146143
R
A
119
TTCGGTCAAGGCTTACTATG
1072
GTTCTGAATTTAGATGTACG
2324
ACGGAATAGGAAAATT
3576















G

TCTCCA






2FH21F_18_351
21
14296558
F
A
115
18
14145905
R
A
115
AGTGTGCTATACTGGACTAC
1073
ACTCTTAGCCCTTTCACAGC
2325
CTCTTAGCCCTTTCAC
3577

















AGCATTTGAT






2FH21F_18_352
21
14296560
R
A
115
18
14145903
F
C
115
ACTCTTAGCCCTTTCACAGC
1074
AGTGTGCTATACTGGACTAC
2326
GGTAAGGTGGCAAGTC
3578

















AA






2FH21F_18_354
21
14298284
F
A
119
18
14144184
R
C
114
TTAGCCTTTTCCCTGCTTTG
1075
CGTCAAGTGAGTATACTGTG
2327
AAAACGTGGAAAATAC
3579

















AAAAAAAA






2FH21F_18_355
21
14298299
R
G
110
18
14144174
F
T
105
GTTAAAACGTGGAAAATAC
1076
AAAATATATATTGAAAGAAA
2328
TTAGCCTTTTCCCTGC
3580















AC

TTTGATTTT






2FH21F_18_357
21
14298722
F
C
100
18
14143752
R
T
100
AAAGAATAAAACGTAAACTC
1077
TGGGAGGAATGTGAGTTGGG
2329
TTGTAGAATTGGAGTT
3581

















AAGATAGGAT






2FH21F_18_364
21
14301415
F
T
119
18
14141056
R
G
119
TGCACGCAGCATCACCAGT
1078
CCACACACAGTAAGAGCCAC
2330
CACAGTAAGAGCCACT
3582

















CGGACA






2FH21F_18_365
21
14301450
F
A
117
18
14141021
R
C
117
ACACACAGTAAGAGCCACTC
1079
TGCACGCAGCATCACCAGT
2331
GTGCCCGGCTGAGGTG
3583

















CGT






2FH21F_18_369
21
14301678
R
G
106
18
14140795
F
T
106
CCCACCAGGCACCTGCTCT
1080
AAGATCAGGAATGGACAGGG
2332
CCCGCAAGAGGGCAAA
3584

















G






2FH21F_18_370
21
14301937
F
A
83
18
14140537
R
C
 83
AGCCTCTGCTTCCCCACA
1081
ATATGAGGAGGGACTCACTG
2333
CTGGAGCTGGGAGGGG
3585

















TTTGA






2FH21F_18_375
21
14302390
F
C
118
18
14140084
R
A
118
TGAGGTGGCCTATGTTCCC
1082
ATGGGTCTGGCAAGGTTGG
2334
TGTGGCTTTTAGGGCG
3586

















A






2FH21F_18_380
21
14302721
F
C
109
18
14139753
R
A
109
GAGTCACCAACTGCCCCCA
1083
AGTTCTGTTGGGCAGACTTC
2335
GTTGGGCAGACTTCTG
3587

















TGGAGACC






2FH21F_18_386
21
14302985
R
G
104
18
14139489
F
T
104
TCATAGCACAAGTCTCAGGG
1084
ACATGTGGTGTGCCTGTGTC
2336
TGCCTGTGTCCACCTA
3588

















A






2FH21F_18_388
21
14303062
F
A
108
18
14139412
R
C
108
AGGAGACCCCTCACCCTATG
1085
ATGGCCCCTCCTCCCTATAC
2337
TCCTCCCTATACCGGT
3589

















ACAA






2FH21F_18_398
21
14303787
R
A
117
18
14138688
F
C
117
AGCGCCTGAGTGCCCTGAG
1086
TCCTAGCAGCCATGGCAATC
2338
TGGCAATCCACAGGGA
3590

















GC






2FH21F_18_399
21
14303884
F
T
91
18
14138591
R
G
 90
TCCTGCGTCCCAGCACCAT
1087
GGAACACTGTGGACTTGTTG
2339
TGGACTTGTTGAGGAG
3591

















GCT






2FH21F_18_402
21
14304050
F
G
100
18
14138426
R
T
100
CTGCACACTTGCAGGGTATG
1088
AGGCCAAGAGAGGCACAAG
2340
GCACACCTGCCTGCTC
3592

















CTCTTGGAC






2FH21F_18_403
21
14304106
R
C
107
18
14138370
F
T
107
CAAGTGTGCAGTCTGTCCTC
1089
AGAGGTCCTCAGAGACCAG
2341
AGGACAGGGTCTGTGT
3593

















T






2FH21F_18_405
21
14304976
R
C
117
18
14137500
F
T
117
TGAGGACTGCTCTATGACCG
1090
CTGCTGGATCTGGTAGTCA
2342
GATCTGGTAGTCAGAG
3594

















AAG






2FH21F_18_408
21
14305188
F
A
106
18
14137291
R
C
106
GGAGATAACAGGTGTTTCC
1091
TGCTCATCTGAGGCCTCAGT
2343
GGGGCCTCAGCACCCT
3595

















CA






2FH21F_18_409
21
14305214
R
A
101
18
14137265
F
G
101
TGCTCATCTGAGGCCTCAGT
1092
TAACAGGTGTTTCCAGTTGC
2344
GGGTGCTGAGGCCCCC
3596

















AGTGAG






2FH21F_18_412
21
14305608
R
C
96
18
14136938
F
C
 96
TCGCGGAGATCAACTTCAAC
1093
TGCCTGCATGACCCCGCAC
2345
TCGCTCACACTGTCCT
3597

















C






2FH21F_18_414
21
14305697
R
G
101
18
14136849
F
G
101
TTCCCAGGCAGCTCAGGCCG
1094
TCCACAGAGGGGCCTCTCC
2346
CCAGCCCCACCGCACA
3598

















GGCCCAC






2FH21F_18_415
21
14305767
F
C
108
18
14136779
R
A
108
AGGCCCCTCTGTGGAGCTA
1095
GCTTAGTTCAGGATGTGGGC
2347
GCCATGGGCTGGAGGG
3599

















CATGATGGG






2FH21F_18_417
21
14305947
R
A
115
18
14136603
F
C
115
GCCTTCACCTGGGCAGCAC
1096
TGAGGCCTGCTGCAGCGAC
2348
CATCCAGCACTTTGAT
3600

















GA






2FH21F_18_419
21
14306173
R
T
110
18
14136378
F
G
110
GTCCTGCAAGCACTGGCG
1097
AGAATGCCCTGAGTGAGGAG
2349
TCCAGGCCTCAGCTCC
3601

















G






2FH21F_18_427
21
14306777
R
A
115
18
14135780
F
C
115
TGGGTGGTGTCCACCTAGT
1098
GCTGGGGTGGGCATCAGG
2350
CTGGGGTGGGCATCAG
3602

















GCCTGTG






2FH21F_18_428
21
14306802
F
A
88
18
14135755
R
C
 88
CCTAGATGTCAGCCGTGAG
1099
CACAGGCCTGATGCCCAC
2351
CTGATGCCCACCCCAG
3603

















C






2FH21F_18_429
21
14306814
R
C
88
18
14135743
F
C
 88
CACAGGCCTGATGCCCACC
1100
CCTAGATGTCAGCCGTGAG
2352
CGTGAGGGTGGAGGCC
3604

















AG






2FH21F_18_430
21
14306846
R
G
99
18
14135711
F
T
 99
AAGGAGAGGGGTCTTATCAG
1101
CCTCCACCCTCACGGCTGA
2353
CACGGCTGACATCTAG
3605

















G






2FH21F_18_432
21
14306875
R
C
98
18
14135682
F
A
 98
ACCCTCACGGCTGACATCTA
1102
GGGTAAGGAGAGGGGTCTT
2354
GAGAGGGGTCTTATCA
3606

















GCC






2FH21F_18_434
21
14307078
R
G
117
18
14135479
F
T
117
ACGTCCCAGATAGGAGGAAG
1103
AGGACCGCATCCAACAGAGA
2355
GCAGCTCACCAAGCAC
3607

















CAC






2FH21F_18_435
21
14307099
R
A
118
18
14135458
F
C
118
TCATCCTTGAGGCCAGGGAG
1104
ATGCCACTGCCCTGTCCTAT
2356
CCAGGACCGCATCCAA
3608

















CAGAGA






2FH21F_18_441
21
14307877
R
G
118
18
14134766
F
T
118
TTTCTGCTGGTAACAAATG
1105
GAGGACAGGGTCAGTCCCG
2357
CACTTCCTGACACGGC
3609

















CCC






2FH21F_18_446
21
14308106
F
T
92
18
14134537
R
C
 92
TCCTGCAGAGGCCTAGCCTT
1106
TCCCACTGACCCCAAGGAG
2358
GCTGGCCTCAGGCCTT
3610

















A






2FH21F_18_457
21
14311562
R
A
104
18
14131075
F
C
104
TGACACTGGGCATAGTGTGG
1107
CAGAGCAAGCCCCTTAGATG
2359
CCCCTCCTGTACCTTG
3611

















G






2FH21F_18_459
21
14311633
R
G
101
18
14131004
F
T
101
TTGGGATCATGGCACAGG
1108
TCCAGGCTGCGTTCAGATTC
2360
TCAAGCACCTCATTCT
3612

















C






2FH21F_18_460
21
14311656
R
G
118
18
14130981
F
T
118
TGATGACCTCAAACCTCCG
1109
TTGGGATCATGGCACAGG
2361
GAGAATGAGGTGCTTG
3613

















ATGATG






2FH21F_18_461
21
14312314
F
C
103
18
14130330
R
A
103
TTCTTTGTTCGTGGGTAGTG
1110
GCAGTTTAAACCACCATTTC
2362
CCACCATTTCTGTGAA
3614

















GCTTTCT






2FH21F_18_462
21
14312342
F
C
92
18
14130302
R
A
 92
TGCCTGTTACCAGGTACTAC
1111
GTGCAGCACAGAACAACGC
2363
CTTTGTTCGTGGGTAG
3615

















TGT






2FH21F_18_463
21
14312574
R
G
80
18
14130070
F
A
 80
CTGATTATCTTTTTCTAAGC
1112
AGTCCTAACTGAAAGACAGA
2364
GAAAGACAGACAAGAA
3616

















CATCTTA






2FH21F_18_466
21
14312692
F
T
117
18
14129952
R
G
117
AATCTGGGTTTCCTTGAGGG
1113
TTAGCAACTGACTGTCATA
2365
AACTGACTGTCATAAG
3617

















AGAT






2FH21F_18_467
21
14312732
R
C
114
18
14129912
F
A
114
GCAACTGACTGTCATAAGAG
1114
AATCTGGGTTTCCTTGAGGG
2366
GGGTTTCCTTGAGGGC
3618

















TAAGATTACT






2FH21F_18_468
21
14313209
F
C
118
18
14129421
R
A
118
GGAAGAATCTGAGAAGTAGC
1115
ATAAGGTGAGGCTTGCGCTG
2367
GGATGCAGTTCTGGAA
3619

















ACAAGA






2FH21F_18_469
21
14313390
R
G
100
18
14129240
F
T
100
AGCTCTTAGTTCCTCCAGAC
1116
CTTCCCTGATGATGAATGGC
2368
TGAATGGCTCATCCCA
3620

















G






2FH21F_18_470
21
14313610
F
T
100
18
14129020
R
C
100
GCAGCCCAGATCTTGGTTAC
1117
CCTCAGAAATAGCATGCAGG
2369
TGAAGTGGTGGTGGTT
3621

















G






2FH21F_18_472
21
14313830
R
T
109
18
14128800
F
G
109
TCCTAGACTCTTTCCTGTGG
1118
ACCTGAATGTGCATGGGAAG
2370
GAATGTGCATGGGAAG
3622

















GTTCTGGAAT






2FH21F_18_474
21
14313944
R
A
120
18
14128688
F
C
120
TGAGATTGAGTTCGCTCCTG
1119
CAAGGCTTGGGTAAGAAGGG
2371
TGGCATTCAGAGAGCA
3623

















T






2FH21F_18_475
21
14314051
R
A
115
18
14128579
F
A
117
AAGGACACCTGACAAGATAG
1120
AAGAAGACCCCTTCTTACCC
2372
GGATAAAAAAGCAAGA
3624

















CTCT






2FH21F_18_476
21
14314089
F
T
101
18
14128541
R
T
 99
AGAATCAGAGTCCAGCTCAG
1121
CTGCTCTATCTTGTCAGGTG
2373
TCTTGTCAGGTGTCCT
3625

















TGAAATT






2FH21F_18_480
21
14314502
F
G
102
18
14128129
R
T
102
GACCCACAAATATGAGTCAG
1122
TAGTGGAAAAGGGAGTTCGG
2374
TAGACCCAGAGTCCCA
3626

















TA






2FH21F_18_481
21
14314586
F
C
104
18
14128045
R
A
104
GGAAATGGATTACAGCCCTC
1123
CGTCAAAAGTGAGTGGGAAG
2375
GAGTGGGAAGAATACA
3627

















GT






2FH21F_18_482
21
14314695
F
C
119
18
14127936
R
C
119
GGGCTGTAATCCATTTCCTG
1124
TATGAAGGTTGCAAAGAGGG
2376
GAAGGTTGCAAAGAGG
3628

















GGTGGAAT






2FH21F_18_483
21
14314743
F
C
106
18
14127888
R
A
106
TCTCTTTCCATTCCAGTGA
1125
CACCCCTCTTTGCAACCTTC
2377
AACAGCCCAAGGTCTT
3629

















AC






2FH21F_18_485
21
14314908
F
C
103
18
14127723
R
A
103
GTGTAAGAGAGAGGACCTTT
1126
TTGGATGGAGGCACAGTGAG
2378
ACAGTGAGAATTTTGG
3630

















TCTG






2FH21F_18_490
21
14315928
R
A
99
18
14126706
F
C
 99
TCCCTTGAATGTTGGAAGGA
1127
ATTGAGTTAGCACTGGCTCC
2379
GCACTGGCTCCAATCT
3631

















GATCAATT






2FH21F_18_491
21
14316557
F
T
119
18
14126077
R
G
119
AGAGCCAGTTTTGCATTCAC
1128
GGAACTAAGGCAAAGATGAG
2380
CACCTGTCACCAAGAC
3632

















AC






2FH21F_18_494
21
14316694
R
C
95
18
14125936
F
T
 99
TCAGAATGGGTCTGAGTTTC
1129
CAGGCAAGAGGTCTTTCCAG
2381
TCTTTCCAGATTCCCC
3633

















A






2FH21F_18_497
21
14317060
F
C
99
18
14125570
R
A
 99
CATGGGCTAAGCCATGTAAG
1130
GTTGCCTCATCTTTCCCTTC
2382
TCCCTTCTGAGAAGTC
3634

















TA






2FH21F_18_501
21
14318981
F
C
98
18
14123650
R
A
 98
CACATTCAGGAGCAGCTATG
1131
CAGGGTGAGGAATACATTGG
2383
GGAATACATTGGCTGT
3635

















ATGTGATTTT






2FH21F_18_502
21
14319138
R
G
90
18
14123493
F
T
 90
CTAAATCAAATTACTGTGCC
1132
TCAGCAGCTCTGTCTTTATG
2384
CTTGCCTTCAAAGCAA
3636

















AAG






2FH21F_18_503
21
14321397
F
T
112
18
14122673
R
T
113
TTGGCTCCAGTCACTTTCAG
1133
CCTTCATAACGTTATACACC
2385
ATACACCACAATGCTA
3637

















AAAAA






2FH21F_18_504
21
14321408
R
T
112
18
14122661
F
G
113
CCTTCATAACGTTATACACC
1134
AGGGCTTTCTGTCTGTGCTG
2386
TCTGTGCTGCGCCTGG
3638

















CTCT






2FH21F_18_505
21
14321469
R
A
96
18
14122600
F
A
 96
TGAAAGTGACTGGAGCCAAG
1135
TGCGTGTCAGAAGATGCTAC
2387
ACGGAATGAGCCGAGA
3639

















GTG






2FH21F_18_506
21
14321489
R
A
96
18
14122580
F
C
 96
TGCGTGTCAGAAGATGCTAC
1136
TGAAAGTGACTGGAGCCAAG
2388
CTCTCGGCTCATTCCG
3640

















T






2FH21F_18_508
21
14321836
F
A
117
18
14122233
R
C
117
ACTCGCAGACTAGGTCCCGT
1137
CGAGAAATGGTGAGTGTGGG
2389
CCGAGACTGGGGAGGG
3641

















G






2FH21F_18_509
21
14321892
R
C
111
18
14122180
F
C
108
ACGGGACCTAGTCTGCGAGT
1138
TGCAGGGACAGGACAGGAC
2390
GAGGGGACTGAGGGCT
3642

















GAGCTGCAGA






2FH21F_18_510
21
14322704
R
A
98
18
14121367
F
C
 98
CTTGCTGACATTCCCCAAAG
1139
CTGAAATGTGCAATAAAGG
2391
ATGTGCAATAAAGGAC
3643

















AAAAA






2FH21F_18_511
21
14322742
F
T
92
18
14121329
R
C
 92
CAAATTGCCATCCACTGCTC
1140
GTCCTTTATTGCACATTTCA
2392
TATTGCACATTTCAGA
3644















G

AACAGTATTT






2FH21F_18_512
21
14322792
F
G
105
18
14121279
R
T
105
GAGCAGTGGATGGCAATTTG
1141
AGTGCCAGGGGATTATTTTC
2393
ATGTGAAATATTTGTA
3645

















AGTAGAAAA






2FH21F_18_513
21
14322852
R
G
105
18
14121219
F
T
105
AGCAGAAAATAATCCCCTGG
1142
TAAGGGCGTTTGTGCTAAGG
2394
AGAAACAGCAGAAAGA
3646

















TTTTTTACAG






2FH21F_18_515
21
14322938
F
C
109
18
14121133
R
C
109
AGCACAAACGCCCTTATTAG
1143
CCGAATGTGGCTAAGGAAAC
2395
AAACATTGCCCCATAA
3647

















AGTTTCCCAA






2FH21F_18_516
21
14323047
R
C
100
18
14121024
F
T
100
GATGGCCCAAGATACAAACC
1144
CTGGAAGATTACCAAAGGGC
2396
TATTCACCAGAACTCC
3648

















CAAAA






2FH21F_18_517
21
14323069
F
T
100
18
14121002
R
C
100
TGTGTCCTCTGGAAGATTAC
1145
GCCCAAGATACAAACCAGAG
2397
TTTGGGAGTTCTGGTG
3649

















AATA






2FH21F_18_518
21
14323100
F
A
92
18
14120971
R
G
 92
CATTCAGCTGCTCCTTTGAG
1146
CAGCCCTTTGGTAATCTTCC
2398
ATCTTCCAGAGGACAC
3650

















A






2FH21F_18_519
21
14323115
R
A
105
18
14120956
F
C
105
GGTAATCTTCCAGAGGACAC
1147
GATATTTCTCTCACCCCCAG
2399
CTGCTCCTTTGAGAAG
3651

















CTG






2FH21F_18_520
21
14323420
R
G
103
18
14120654
F
G
103
AGTGCAAGAACCTGCAAAGC
1148
TCACTGAAGTGCTCAATGCC
2400
CTGCACTGTGCCCCAC
3652

















T






2FH21F_18_521
21
14324503
F
C
100
18
14119577
R
A
100
CAGAAGAAAGACATCACTGG
1149
TGTGTGCAGAACAAAGCCTC
2401
TTCCCTCAGACACCTG
3653

















GAGTCTCCTT






2FH21F_18_522
21
14324706
F
A
99
18
14119374
R
G
 99
GTAAAACTTTGTCGTGGGAG
1150
CCTACATGCTTCTAACCCAC
2402
ACCCACTCCTGAACAT
3654

















A






2FH21F_18_523
21
14324731
F
T
118
18
14119349
R
G
118
CTTCTAACCCACTCCTGAAC
1151
AAGCTGTTGTGAGCACAATT
2403
GTAAAACTTTGTCGTG
3655

















GGAGGA






2FH21F_18_524
21
14324792
F
A
94
18
14119288
R
A
 94
TAAGCCAGGAGTCTTCTAGG
1152
TGTGCTCACAACAGCTTTCC
2404
CAGCTTTCCTCCTAGA
3656

















G






2FH21F_18_525
21
14324801
R
G
94
18
14119279
F
T
 94
TGTGCTCACAACAGCTTTCC
1153
TAAGCCAGGAGTCTTCTAGG
2405
GCACCTGTGTATGTTC
3657

















T






2FH21F_18_526
21
14324841
F
A
89
18
14119239
R
G
 89
CAGGTTCCCGATAGAGATTC
1154
CATACACAGGTGCCTAGAAG
2406
AGACTCCTGGCTTATC
3658

















T






2FH21F_18_527
21
14324931
R
G
118
18
14119149
F
T
118
TGCTACAGATACAGGCTCAG
1155
ACCCAGGTTTCTTGGACTAC
2407
ACCTGATCATAATCTC
3659

















TTCTGATTGT






2FH21F_18_529
21
14327004
F
T
105
18
14117104
R
G
105
CAGAGCCATAATCACAACTG
1156
AGCTAAGTCTGAGGTAAGGG
2408
ACTCTACTCCACTAAC
3660

















AGTTTACA






2FH21F_18_530
21
14327071
R
C
86
18
14117035
F
C
 88
TGTTCTTCCCCTTACCTCAG
1157
CAGATCCCGAATCTAGCTGT
2409
AGATCCCGAATCTAGC
3661

















TGTAATATCCC






2FH21F_18_534
21
14327453
F
A
102
18
14116653
R
G
102
GACCATGACTGCTTCATCTC
1158
GATCTGGAGACTCAAACTGG
2410
GGAGACTCAAACTGGT
3662

















CAATAAGCTA






2FH21F_18_535
21
14327664
R
A
104
18
14116442
F
A
104
TTGATGCCACCAACTGAAGG
1159
AATATTTATTCTTAGCAAGG
2411
AATAATAACTTCTCTT
3663

















CTGTCC






2FH21F_18_536
21
14327693
R
G
112
18
14116413
F
T
112
ACCCTTACGTTTTCCTAGAG
1160
GGACAGAAGAGAAGTTATT
2412
ACAGAAGAGAAGTTAT
3664

















TATTTGTATT






2FH21F_18_537
21
14327880
R
A
90
18
14116226
F
C
 90
TTGGGACAGATCTCCATGC
1161
CAGATTTCTCTTGGTCAGGC
2413
GCTTAGAAAAGATAAA
3665

















ACTGAAA






2FH21F_18_538
21
14327930
R
A
103
18
14116176
F
C
103
TTTCAGTGTGGGATCAGACC
1162
CATGGAGATCTGTCCCAACC
2414
GCGCAGATCCACCCTC
3666

















T






2FH21F_18_539
21
14328545
F
T
105
18
14115563
R
G
105
GCTCATTTTAGACAGATGGA
1163
TTCTTCACAAGTCTCAAAG
2415
GAATTGCAGTTAACAG
3667













G



TTCCTTTC






2FH21F_18_543
21
17841257
R
G
111
18
14469188
F
G
111
CCAGAAGTTTGAGTATCAC
1164
GGACTAAGCGTAAATTTGC
2416
TTTCCCCTTTGGCTTT
3668

















TTCAATCATCT






2FH21F_18_545
21
25676417
F
A
102
18
13654900
F
G
 99
CTATTTCAGTTCTAACCCT
1165
GCAGATAAGTCAAAACAAGG
2417
TCAAAACAAGGACAAT
3669

















CTAA






2FH21F_18_548
21
28291001
F
T
111
18
15073195
R
G
111
GAGACATATCAAGGAATAA
1166
GTTTCAAAACCAACATGGTA
2418
AAAACCAACATGGTAA
3670

















AATCTAAATA






2FH21F_18_549
21
28291458
F
C
96
18
15072738
R
A
 96
CCTCTGACAAAAAGAGGAGC
1167
GAGGTCCTTGCCTTATCAC
2419
GTCCTTGCCTTATCAC
3671

















CACCATT






2FH21F_18_555*
21
28308411
R
G
104
18
15055759
F
G
104
CAAGGAATTTAGAAAATGC
1168
AAGTTTCCTGTAGAAAGAG
2420
TTCCTGTAGAAAGAGT
3672

















TAAAGTGAAT






2FH21F_18_565*
21
28318201
F
T
96
18
15046074
R
G
 96
TCACATTTACCAACTACTG
1169
TTCTACATTCCTGGCCTGAG
2421
AACAGAAGTACCTTTT
3673

















GCTTAT






2FH21F_18_566*
21
28318293
F
G
119
18
15045982
R
T
119
AATGTCAGGTTGTTGACTGC
1170
TTAGATATGGCTGAGAAGTG
2422
ATATGGCTGAGAAGTG
3674

















GGGTGA






2FH21F_18_567*
21
28318296
R
C
117
18
15045979
F
T
117
AGATATGGCTGAGAAGTGGG
1171
AATGTCAGGTTGTTGACTGC
2423
TAAGTTAAAGTGGGTC
3675

















AGGT






2FH21F_18_570*
21
28318429
F
A
100
18
15045847
R
A
 99
GACAGGAGCTCTATATTTA
1172
CATACAAGTAAAGAACCCA
2424
CTAACCTGCTACCTAC
3676

















CTT






2FH21F_18_571
21
28318455
R
A
94
18
15045821
F
C
 94
CTAACCTGCTACCTACCTT
1173
TGAAGTTATAAATCAGTAAG
2425
GTTATAAATCAGTAAG
3677

















AAACAGGA






2FH21F_18_574*
21
28318711
F
G
114
18
15045565
R
A
114
TCTCTCTGTAAGATGTGAAG
1174
ATGGAGAGATGGCAAGTGAG
2426
GCTGAGGAACACAGCT
3678

















CCCTTATG






2FH21F_18_576*
21
28318759
R
T
95
18
15045517
F
G
 95
TCTTCACATCTTACAGAGAG
1175
GCTGACAGCATCAGCTTTAG
2427
AACAGATTAGATTCCA
3679

















TGTAACTA






2FH21F_18_577*
21
28318824
R
G
89
18
15045452
F
T
 89
CTACTAAAGCTGATGCTGTC
1176
CTCAAAATGTGTCTACAAGC
2428
GTGTCTACAAGCATAA
3680

















TGAA






2FH21F_18_579*
21
28318862
R
A
111
18
15045414
F
A
111
CTGTCAGCTGCCATGCTTAG
1177
ACCTTCTTAGAAGTTTCTC
2429
CTTCTTAGAAGTTTCT
3681

















CTTCTAGAT






2FH21F_18_583*
21
28319085
R
T
115
18
15045191
F
G
115
CTTGGTAATAATATATAGTG
1178
GAGCACTATGTATTGTTTTC
2430
ACTTGCTTGCATCATA
3682

















CAT






2FH21F_18_585
21
28328803
F
T
120
18
15040341
F
C
117
TGAATGTCTTCAGGGTGAGG
1179
CTGAAGGAGAAGAAGGGAAC
2431
ACTTCCTCCCCTGAGT
3683

















C






2FH21F_18_590
21
28349711
F
G
80
18
15014464
R
G
 80
AAACAAAGCCTTTGAGACC
1180
ACAACATACTCGTATCTCC
2432
CGTATCTCCTGAAATC
3684

















CTG






2FH21F_18_594
21
46813934
F
G
94
18
953658
F
A
 94
AAAACATTTTAATGCACTTC
1181
GTATTGAAAGGTCAGTGGTG
2433
CAGTGGTGGTAAGACA
3685

















A






2FH21F_19_004
21
31210897
R
G
117
19
53404855
R
A
117
AATTTTCATCTATTCTCAAG
1182
CTTTTATATCCTTCTCATGT
2434
AATTCATATGCTTTGC
3686

















TACTC






2FH21F_19_005
21
31210922
F
T
120
19
53404880
F
C
120
CCAGAAGGCCTTCAAAATAA
1183
GAGTAGCAAAGCATATGA
2435
GTAGCAAAGCATATGA
3687













G



ATTTTA






2FH21F_19_006
21
31210930
R
A
120
19
53404888
R
G
120
GAGTAGCAAAGCATATGAA
1184
CCAGAAGGCCTTCAAAATAA
2436
AACTTTTATATCCTTC
3688















G

TCATGT






2FH21F_19_007
21
31210962
F
C
120
19
53404920
F
T
120
CCAGAAGGCCTTCAAAATAA
1185
GAGTAGCAAAGCATATGAA
2437
GAAGGATATAAAAGTT
3689













G



TGTTTTCTG






2FH21F_19_010
21
32791147
R
C
99
19
7785166
F
A
 99
GCAACTAAAAGAAACAGACC
1186
CCATGTCTTTATTAGCAACC
2438
GCCATAGATGAGATCT
3690

















CCAACCT






2FH21F_19_012
21
33743482
R
C
80
19
57303531
R
A
 80
TCATCAAACAAGATGGTAT
1187
CAGAGTATGAAGCAGTTG
2439
AGAGTATGAAGCAGTT
3691

















GTGGAGC






2FH21F_19_014
21
33743785
R
T
119
19
57303833
R
C
119
ACTGCAAACTCAGTAAAAGG
1188
GCTCTAGCTCTCAAGCTTTG
2440
TCAAGCTTTGGGTGAA
3692

















T






2FH21F_19_015
21
33743831
R
G
115
19
57303881
R
A
117
CCAAAGCTTGAGAGCTAGAG
1189
TCCCAAAGGGAATTATCACC
2441
GCATTTCATCTACTCA
3693

















GTTAC






2FH21F_19_016
21
33743853
F
A
115
19
57303903
F
G
117
TCCCAAAGGGAATTATCACC
1190
CCAAAGCTTGAGAGCTAGAG
2442
GTAACTGAGTAGATGA
3694

















AATGC






2FH21F_19_018
21
33743924
F
A
120
19
57303974
F
T
120
TTCAATAGCAAGCAAGTTT
1191
ATTCCCTTTGGGAAGAAGTG
2443
ATCTTTAATTATTCCA
3695

















CTTTTTGTTA






2FH21F_19_022
21
33744128
F
C
117
19
57304180
F
T
119
AGAATTCCTCTAATATGAC
1192
GCTGCCTTACACAGTCTTTT
2444
GTTTATTTGATCATGT
3696

















ATTATCCCTT






2FH21F_19_026
21
33744255
R
C
83
19
57304303
R
A
 82
CTTCTTCAATACATAAGAAC
1193
TTTGGCCTAAAAATGAGGT
2445
TTGGCCTAAAAATGAG
3697

















GTTTTTTTG






2FH21F_19_027
21
33744286
F
G
83
19
57304334
F
A
 84
GAGCACTGAGCCATAAAAGG
1194
AAAAACCTCATTTTTAGGC
2446
AACCTCATTTTTAGGC
3698

















CAAAATAA






2FH21F_19_028
21
33744302
R
A
87
19
57304351
R
T
 88
CCTCATTTTTAGGCCAAAAT
1195
GAATGAGCACTGAGCCATA
2447
AATGAGCACTGAGCCA
3699













A



TAAAAGGT






2FH21F_19_030
21
33744768
F
A
118
19
57304825
F
G
114
TTTTTCATTGCATAGACTG
1196
GATCAAGTTCTAAATCTCAG
2448
AAGTTCTAAATCTCAG
3700















G

GAATAAAA






2FH21F_19_031
21
33761256
F
T
106
19
57305651
F
C
102
GiTTTTTACAGGCTGGTGG
1197
CACATGTGTGAAAGGCATGG
2449
ATGGTTCAACTGTTCT
3701

















GGC






2FH21F_20_003
21
10014053
F
T
109
20
51652429
F
C
109
AGAAGGATAGGATTTGTGAG
1198
GTTCTACGCTAGAAATCAAC
2450
TAGAAATCAACTTTCC
3702

















TTCTATGC






2FH21F_20_004
21
10014083
R
G
109
20
51652459
R
A
109
GTTCTACGCTAGAAATCAAC
1199
AGAAGGATAGGATTTGTGA
2451
GGATAGGATTTGTGAG
3703

















ATTTA






2FH21F_20_006
21
10014138
F
C
98
20
51652514
F
T
 98
AAAGAAACATGGGTGGTGAG
1200
TCTCACAAATCCTATCCTTC
2452
CTGAAATGTATGTACC
3704

















CTTTCC






2FH21F_20_007
21
10014203
R
C
105
20
51652579
R
T
105
TCACCACCCATGTTTCTTTG
1201
TGGACTAGAAAGAAGGCAGG
2453
AAGAAGGCAGGTACAG
3705

















GAG






2FH21F_20_008
21
10014238
F
G
100
20
51652614
F
T
100
TCACACAAAGCAGTAGCAGG
1202
TCCTGTACCTGCCTTCTTTC
2454
CCTGCCTTCTTTCTAG
3706

















TCCAGAATAC






2FH21F_20_009
21
10014255
R
C
100
20
51652631
R
T
100
TCCTGTACCTGCCTTCTTTC
1203
TCACACAAAGCAGTAGCAGG
2455
CAGTAGCAGGATGGTT
3707

















ATT






2FH21F_20_010
21
10014324
R
G
119
20
51652700
R
A
119
GGGACCATGGTGTGGTTTTG
1204
TCCTGCTACTGCTTTGTGTG
2456
AATTTTACTTTTCCAA
3708

















AATAAGTCA






2FH21F_20_011
21
10014342
R
A
118
20
51652718
R
G
118
CTGCTTTGTGTGAAATTCTC
1205
ATTGGCTGGGACCATGGTGT
2457
ACCATGGTGTGGTTTT
3709













C



G






2FH21F_20_012
21
10015428
F
C
109
20
51653799
F
T
109
AGGGTGGTTACAGGTTGATG
1206
TGCTCTATTCTGACTGCCTG
2458
CTCTATTCTGACTGCC
3710

















TGCACCCCTC






2FH21F_20_013
21
10015493
F
G
89
20
51653864
F
A
 89
GAGAGTAACTGAAGGAGGTG
1207
AACATCAACCTGTAACCACC
2459
CCTGTAACCACCCTAA
3711

















TC






2FH21F_20_014
21
10015509
R
A
88
20
51653880
R
G
 88
ACATCAACCTGTAACCACCC
1208
GAGAGTAACTGAAGGAGGTG
2460
AGTAACTGAAGGAGGT
3712

















GGCATTT






2FH21F_20_015
21
10015560
F
A
106
20
51653931
F
G
106
AGAAATAACATACCCAGGGC
1209
CACCTCCTTCAGTTACTCTC
2461
CTTTGTTCAATGCCTC
3713

















CTTTAT






2FH21F_20_016
21
10015572
R
A
106
20
51653943
R
G
106
CACCTCCTTCAGTTACTCTC
1210
AGAAATAACATACCCAGGGC
2462
CCCAGGGCTAGGCATA
3714

















A






2FH21F_20_017
21
10015607
F
C
98
20
51653978
F
T
 98
AGGAAACTGGTCTTCCCTTG
1211
TATGCCTAGCCCTGGGTATG
2463
CCTGGGTATGTTATTT
3715

















CTCTTAC






2FH21F_20_018
21
10015618
R
T
98
20
51653989
R
G
 98
TATGCCTAGCCCTGGGTATG
1212
AGGAAACTGGTCTTCCCTTG
2464
TCTTCCCTTGGAAGAG
3716

















CCTCCCC






2FH21F_20_020
21
10016927
F
T
116
20
51655279
F
C
116
TTCAGCAAAGGAGAGAGACC
1213
ATGGCCGGGCTCGGTTAGT
2465
GCTCGGTTAGTAAGTG
3717

















G






2FH21F_22_012
21
10131022
F
G
120
22
41759969
F
A
120
GTGTTAAACGGGGTTTGAGC
1214
GTAGCGTGGCCTTTCTGAAC
2466
GCAGTTTACCTCCTTC
3718

















TAC






2FH21F_22_016
21
10131733
F
G
100
22
41760983
F
C
101
TCAGCAGGAACAAGTCTAGG
1215
GAATGTTGGCCAAGTGGCAG
2467
AGGGTGGGCCTGGGCC
3719

















TGAGGGAA






2FH21F_22_017
21
10131740
R
A
100
22
41760991
R
G
101
GAATGTTGGCCAAGTGGCAG
1216
CTCTGTCAGCAGGAACAAG
2468
TCAGCAGGAACAAGTC
3720

















TAGGGG






2FH21F_22_018
21
10131768
F
T
100
22
41761019
F
C
100
CTCCAGTGACAGATGCAAAC
1217
CCCTAGACTTGTTCCTGCTG
2469
AGACTTGTTCCTGCTG
3721

















ACAGAG






2FH21F_22_019
21
10131932
F
A
115
22
41761183
F
G
115
TGAGGACCCTTTGTGAGCAG
1218
GGGCAAATCAGTGAAGATCA
2470
GTGAAGATCAAAATCC
3722

















CTC






2FH21F_22_021
21
10132070
F
A
104
22
41761321
F
G
104
TCTCCTGCAGGGCCCTGCCT
1219
GACACACAAACAGCCTGAG
2471
GCCTGAGGGTGCCCAG
3723

















TC






2FH21F_22_025
21
10132318
R
C
106
22
41761569
R
T
106
ATGGTGTGTGGCAGTGTGAG
1220
TCCACACAGTGGTTCTTCAG
2472
AAGCCTCCTATGCTTG
3724

















CC






2FH21F_22_026
21
10132343
F
T
108
22
41761594
F
G
108
CCTCCACACAGTGGTTCTTC
1221
ATGGTGTGTGGCAGTGTGAG
2473
GGCAAGCATAGGAGGC
3725

















TTTATGGA






2FH21F_22_028
21
10132521
F
A
90
22
41761775
F
G
 90
ATCCTTCACCTCCTTTGCAC
1222
AGTGAGAAGGTTGTCACCAG
2474
TCACCAGGCCCTCACT
3726

















AATACCC






2FH21F_22_029
21
10132527
R
A
90
22
41761781
R
C
 90
AGTGAGAAGGTTGTCACCAG
1223
ATCCTTCACCTCCTTTGCAC
2475
CTCCTTTGCACACGGG
3727

















CT






2FH21F_22_030
21
10132914
F
A
103
22
41762133
F
C
102
GGTCCCAGGCCAGAGGGTT
1224
GAGGATGGGTTTATATTG
2476
GGATGGGTTTATATTG
3728

















GGAAAA






2FH21F_22_035
21
10133104
R
G
111
22
41762322
R
T
111
TGTTCCTGGCCCGACAGCCT
1225
GGGCAGATGTTTCCTCTGA
2477
AGGGTGCGGTGTTGGC
3729

















AGC






2FH21F_22_036
21
10133131
F
T
80
22
41762349
F
C
 80
GGGCAGATGTTTCCTCTGA
1226
CTGCCAACACCGCACCCTT
2478
AACACCGCACCCTTCC
3730

















CACC






2FH21F_22_037
21
10133227
F
G
101
22
41762445
F
A
101
GTGGTTAGTTTGCTGGTGAC
1227
GAGACAGTCACTATATGACA
2479
ATGACATAAATCCACT
3731

















TAGC






2FH21F_22_040
21
10133361
F
A
106
22
41762579
F
G
106
GCTCTTCCACCGGTTTTTAC
1228
AACCAGGGACTCCACCCTTC
2480
GACTCCACCCTTCTCC
3732

















CAGAG






2FH21F_22_042
21
10133484
F
T
93
22
41762702
F
G
 93
CTCTGGCGAGCCCTCTTAC
1229
TGTAGGAGCCGAGGTGGAG
2481
GGTGGAGCCGCCAGCT
3733

















GT






2FH21F_22_043
21
10133506
R
G
97
22
41762724
R
A
 97
TGTAGGAGCCGAGGTGGAG
1230
CTGGCTCTGGCGAGCCCT
2482
TCTGGCGAGCCCTCTT
3734

















ACC






2FH21F_22_044
21
10134693
R
A
119
22
41763868
R
G
119
TTGGTGCCATTTGGGAGAAC
1231
CTGAAGTTTCACTCGCTGTC
2483
TTAAAGCTTGCCACCT
3735

















GTTTTTGTTG






2FH21F_22_047
21
10136147
F
T
110
22
41765342
F
C
110
ACAAAACAAATCTTATAGAC
1232
CAGTCAAGTAAAAAGAAACG
2484
GAAACGCAACTAAAAG
3736















C

AGC






2FH21F_22_048
21
10136171
F
A
97
22
41765366
F
C
 97
ACAAAACAAATCTTATAGAC
1233
AGAAACGCAACTAAAAGAGC
2485
TCAGTTAAATACATTC
3737

















CTCTCT






2FH21F_22_051
21
10136258
R
C
119
22
41765459
R
T
119
TTTAATGTTTAAACCTTGTG
1234
TAACCTAAGCAGAATTTTC
2486
TTTGACAGAAAGTAAC
3738

















AGCTTCA






2FH21F_22_055
21
10136453
R
G
113
22
41765655
R
C
113
TAACCTTCCAAAGAAGTGCC
1235
CTGCTGAAGCCCTATTTTG
2487
AGCCCTATTTTGAAAT
3739

















TTCCCTTTT






2FH21F_22_056
21
10136486
F
C
109
22
41765688
F
G
109
TCACCACCTGGAAGTGAGTC
1236
GGGAAATTTCAAAATAGGGC
2488
GAAATTTCAAAATAGG
3740

















GCTTCAGCAG






2FH21F_22_057
21
10136520
F
T
102
22
41765722
F
C
102
TCAAAATAGGGCTTCAGCAG
1237
CTCACCACCTGGAAGTGAGT
2489
CCTGGAAGTGAGTCCC
3741

















ACC






2FH21F_22_059
21
10136569
R
T
115
22
41765772
R
G
116
ACTTCCAGGTGGTGAGGAC
1238
CTGACCGGGAGCTGAGAAG
2490
GGCCCAGAGCAGGCCG
3742

















AT






2FH21F_22_061
21
10136684
F
T
84
22
41765887
F
C
 84
TGGCCCTGCCTGTTGCCTT
1239
TACCTGGAGACAGAAACAGC
2491
GAGACAGAAACAGCCA
3743

















GGATCA






2FH21F_22_062
21
10136700
R
G
99
22
41765903
R
A
 99
TACCTGGAGACAGAAACAGC
1240
CACACAGCAGCCTGGTGG
2492
GCCTGGTGGCCCTGCC
3744

















TGTTGCCTT






2FH21F_22_067
21
10168905
F
C
115
22
15875490
F
T
116
CATGGACCTTCCAGCTTATG
1241
TTCTCTCCTTCTATAATGGC
2493
TTCTCTCCTTCTATAA
3745

















TGGCTTATTTT






2FH21F_22_068
21
10169081
R
G
111
22
15875667
R
A
111
GCCAACAATTATGAAGGCAG
1242
GGAATATCTCCTTGGCCTTC
2494
GAATATCTCCTTGGCC
3746

















TTCCTATCTAA






2FH21F_22_073
21
10169966
F
T
109
22
15876544
F
C
110
TTGGGCGCTTTTTCCCAAGG
1243
AGGACCCACCCTGGCTCTCA
2495
TCAGCGGGAGAGCAGG
3747

















GA






2FH21F_22_074
21
10170094
F
C
112
22
15876672
F
T
112
ATCAGGCAGCTGGTGGTCCT
1244
TATTGGAGAGTCCGCATGAG
2496
CCCTGCTGCACTCACT
3748

















C






2FH21F_22_075
21
10170099
R
A
83
22
15876677
R
G
 83
TGGTCCCTGCTGCACTCACT
1245
TGCTCCATGCTCACCATCAG
2497
TCAGGCAGCTGGTGGT
3749

















CCTT






2FH21F_22_076
21
10173355
R
G
102
22
15879914
R
A
102
TCAGGTATGGTTTTGCTGGG
1246
TTTACCACAGCTATTCCCCC
2498
GCTATTCCCCCTAATC
3750

















CTA






2FH21F_22_077
21
10173724
R
A
101
22
15880283
R
G
101
GTTTGAACCCACTCTTCCTG
1247
GGTCCAGAAATAGCTACAGG
2499
CAGAAATAGCTACAGG
3751

















AGAAGA






2FH21F_22_078
21
10173774
R
A
106
22
15880332
R
C
105
CTGTAGCTATTTCTGGACCC
1248
TTCCTTGCCTGGATGATTTC
2500
TTTCTCTTTCTCCTCC
3752

















C






2FH21F_22_079
21
10173857
F
C
100
22
15880415
F
G
 98
AAGTAGCAAAATCAGCTTC
1249
AGAAAGCAGAGGTTTAGGAG
2501
TTTAGGAGAAGAAAAA
3753

















GAAGAGA






2FH21F_22_080
21
10175430
R
A
118
22
15881989
R
G
119
GAGATTTGCTTGCCAATAGG
1250
GTCTCTCACCCCTTCATTTT
2502
TTATTTTCTTCTTGAG
3754

















TACACTCTTA






2FH21F_22_081
21
10175471
F
A
118
22
15882030
F
C
119
CTGTCTCTCACCCCTTCATT
1251
GATTTGCTTGCCAATAGGAG
2503
AGAAGAAAATAACATT
3755

















TTCCTGTATA






2FH21F_22_082
21
10175474
R
G
118
22
15882034
R
T
119
GATTTGCTTGCCAATAGGAG
1252
CTGTCTCTCACCCCTTCATT
2504
TCACCCCTTCATTTTA
3756

















ATTTTA






2FH21F_22_085
21
10176077
F
C
99
22
15882635
F
T
 99
CCAATGAATGTCCTCATCAG
1253
GCAGCGTGATTCCTATGAAG
2505
GAAGAAGGCATCTCTG
3757

















GATAATGA









Table 4B shows the common nucleotide sequence for each assay and a mismatch in brackets between the first nucleotide sequence species and the second nucleotide sequence species.










Lengthy table referenced here




US20220098644A1-20220331-T00001


Please refer to the end of the specification for access instructions.






Example 3: Detecting Fetal Aneuplodies—Model Systems and Plasma Samples

The multiplexed assays designed according to the methods of Example 3 and provided in Table 4 were tested in a series of model systems to identify the best performing assays. Assays were analyzed based on the following characteristics:

    • 1. Low overall process variability.
    • 2. Low differences between ethnic groups.
    • 3. Large differences between normal and T21 samples.
    • 4. Strong relationship between allele frequency and fraction of T21 DNA in the sample.
    • 5. High ‘discernibility’ between normal samples and samples containing T21 DNA.


After the assays were screened across the different model systems, the best performing assays from the model systems were further validated in plasma samples.


Model System Selection


Processes and compositions described herein are useful for testing circulating cell-free DNA from the maternal plasma for the presence or absence of fetal aneuplodies. Plasma samples from pregnant women, however, are limited and variable in nature. Thus, they are not the ideal sample for performing controlled studies designed to specifically challenge performance aspects of the marker performance. Therefore, synthetic model systems were created that meet the following criteria:

    • 1) Come from a renewable resource to allow for follow-up and subsequent longitudinal studies
    • 2) Provide an indication of how the marker will perform when assayed against plasma samples
    • 3) Be able to assess the basic functionality of each marker with metrics such as extension rate and allele skew
    • 4) Provide a genetically and ethnically diverse sample set to indicate the population coverage of each marker
    • 5) Allow for repeated measurement of the same biological sample to assess marker stability
    • 6) Be dynamic and tunable to allow for analysis at defined ranges, such as fetal contribution, to develop a more robust characterization of each marker's capabilities and limitations


Model System Design


From the list of model system performance criteria provided above, a series model system sets were derived. The model system can be broken down into three major components: basic functionality, technical replicate variance and biological replicate variance. These model system sets allowed for the analyses at extremes of fetal contribution and provided an ethnically and genetically diverse sampling.


DNA Set 1: Basic Marker Functionality


This set was composed of 121 normal euploid samples (normal karyotype cell lines) representing African, Asian, Caucasian, and Mexican ethnic groups, as well as 55 T21 aneuploid samples (T21 cell lines). These samples were distributed over two 96-well plates. These samples were used to assess the following:

    • 1) If the marker is functional on a basic level, including extension rate and allele skew from the 50% theoretical;
    • 2) If the marker is able to distinguish 100% normal euploid samples from 100% T21 aneuploid samples; and
    • 3) If the marker has a strong ethnic bias when compared to other ethnic populations.


Assays that performed well in this model set showed minimal ethnic bias, have a significant difference between N and T21, and low CV's. See FIG. 5. Assays that performed poorly showed an ethnic bias, do not have a significant difference between N and T21, and high CV's. See FIG. 6.


DNA Set 2: Variances in Replicates


This set was composed of a single euploid DNA sample (from a single diploid cell line) to simulate the maternal background, and a single spiked-in T21 aneuploid DNA sample (from a single T21 cell line) to simulate circulating fetal DNA. The simulated fetal T21 spike-in DNA was replicated 22 times at 0, 5, 7.5, 10, 12.5, 15, 20 and 30% of the simulated maternal background for a total of 176 samples. These samples were distributed over two 96-well plates. These samples were used to assess the following:

    • 1) What is the CV (technical variance) of each marker in the 22 PCR technical replicates; and
    • 2) What affect does increasing the simulated fetal DNA T21 spike-in, from 0 to 30.0%, have on the T21 allele frequency of each marker in the technical replicate samples?


Assays that performed well in this model set showed a linear response, a good match of expected “allele” frequency vs. observed “allele” frequency (where “allele” refers to the detectable sequence mismatch), and a large difference between N00 and N30. See FIG. 7—good assay. Assays that performed poorly showed no linear response, no difference between N00 and N30, and large technical variance. See FIG. 7—poor assay.


DNA Set 3: Variances in Biological Replicates


This set was composed of 44 different euploid DNA samples (from diploid cell lines) to simulate circulating maternal background paired with 44 different aneuploid T21 DNA samples (from T21 cell lines) to simulate circulating fetal DNA. The simulated fetal T21 spike-in DNA was replicated 44 times at 0, 5, 10, and 20% of the simulated maternal background for a total of 176 samples. These samples were distributed over two 96-well plates. These samples were used to assess the ‘discernibility’ between normal samples and T21 DNA, or more specifically:

    • 1) What is the CV of each marker in the 44 biological replicates; and
    • 2) What affect does increasing the simulated fetal DNA T21 spike-in, from 0 to 20.0%, have on the T21 allele frequency of each marker in the biological replicate samples?


Assays that performed well in DNA Set 3 showed a significant difference between N00 and N20 samples, small variances in each group, and the ability of an algorithm to discern between N00 and N20.


Model DNA Samples


Concentrations


Concentrations in the model system were adjusted to simulate, in a simplified manner, plasma derived samples. For a clinical test, 10 mL of whole blood would likely be obtained from the mother, which yields ˜4 mL of plasma. Under optimized conditions, DNA extraction from plasma obtains ˜25 ng of DNA in 100 μL. Given this clinical constraint for tests that assay nucleic acid from plasma samples, the model DNA concentrations were normalized to ˜0.25 ng/pL. The DNA concentrations of the spiked-in DNA used to simulate the fetal contributions were selected to range from 0% -30% with a mean value of 15%. These values were selected based the estimated ranges and mean values for fetal DNA contribution in maternal plasma.


Sample Source


The model DNA was provided by Coriell DNA repository from a total DNA extraction of cultured cell lines with known ethnicity and T21 aneuploidy status. Coriell was chosen as a source of DNA for the model system because of their extensive history of providing essential research reagents to the scientific community. These collections, supported by funds from the National Institutes of Health (NIH) and several foundations, are utilized by scientists around the world and are extensive, well characterized and can be replenished at any time.


Euploid Model DNA


The euploid samples were chosen from well characterized DNA panels in the Coriell repository that represent four (4) ethnic groups:

    • African (AF)—INTERNATIONAL HAPMAP PROJECT—YORUBA IN IBADAN, NIGERIA. The HAPMAPPT04 plate, from the Yoruba in Ibadan, Nigeria includes a set of 28 trios, 2 duos, and 2 singletons with 90 samples. The concentration of each DNA sample is normalized and then this concentration is verified.
    • Asian (AS)—INTERNATIONAL HAPMAP PROJECT—JAPANESE IN TOKYO, JAPAN AND HAN CHINESE IN BEIJING, CHINA. The HAPMAPPT02 plate of 90 individual samples includes 45 Japanese in Tokyo and 45 Han Chinese in Beijing. The concentration of each DNA sample is normalized and then this concentration is verified.
    • Caucasian (CA)—INTERNATIONAL HAPMAP PROJECT—CEPH (UTAH RESIDENTS WITH ANCESTRY FROM NORTHERN AND WESTERN EUROPE). The HAPMAPPT01 plate, from the CEPH Collection, includes a set of 30 trios (90 samples). The concentration of each DNA sample is normalized and then this concentration is verified.
    • Mexican (MX)—INTERNATIONAL HAPMAP PROJECT—MEXICAN ANCESTRY IN LA, USA. These cell lines and DNA samples were prepared from blood samples collected from trios (mother, father, and child) from Communities of Mexican Origin in Los Angeles; Calif. DNA samples from thirty trios have been included in the panel designated as HAPMAPV13. The concentration of each DNA sample is normalized and then this concentration is verified.


T21 Aneploid Model DNA


Fifty-five T21 DNA samples in the Coriell repository were used to generate a biologically diverse sampling of T21 to help increase the genetic robustness of the marker screening. The T21 samples were selected by identifying those Coriell samples with “Trisomy 21” as a description. The concentration of each DNA sample was normalized and verified.


Plasma Derived Samples


To extract DNA from maternal plasma samples, the QlAamp Circulating Nucleic Acid Kit (4mL Procedure) was used. An outline of the extraction procedure is provided below.


Sample Collection and Preparation


The method is preferably performed ex vivo on a blood sample that is obtained from a pregnant female. “Fresh” blood plasma or serum, or frozen (stored) and subsequently thawed plasma or serum may be used.


Frozen (stored) plasma or serum optimally is frozen shortly after it's collected (e.g., less than 6-12 hours after collection) and maintained at storage conditions of −20 to −70 degrees centigrade until thawed and used. “Fresh” plasma or serum should be refrigerated or maintained on ice until used. Blood may be drawn by standard methods into a collection tube, preferably siliconized glass, either without anticoagulant for preparation of serum, or with EDTA, sodium citrate, heparin, or similar anticoagulants for preparation of plasma. The preferred method of preparing plasma or serum for storage, although not an absolute requirement, is that plasma or serum is first fractionated from whole blood prior to being frozen. “Fresh” plasma or serum may be fractionated from whole blood by centrifugation, using gentle centrifugation at 300-800×g for five to ten minutes, or fractionated by other standard methods. A second centrifugation step often is employed for the fractionation of plasma or serum from whole blood for five to ten minutes at about 20,000 to 3,000×g, and sometimes at about 25,000×g, to improve the signal to noise ratio in subsequent DNA detection methods.


Fetal DNA is usually detected in equal to or less than 10 ml maternal blood, plasma or serum, more preferably in equal or less than 20, 15, 14, 13, 12, 11, 10, 9, 8.5, 8.0, 7.5, 7.0, 6.5, 6.0, 5.5, 5.0, 4.5, 4.0, 3.5, 3.0, 2.5, 2.0, 1.6, 0. 8, 0.4, 0.2 or 0.1 ml, and any intermediates values, of maternal blood, plasma or serum. Such fetal DNA is preferably detectable in a maternal blood sample during early pregnancy, more preferably in the first trimester of pregnancy and most preferably prior to week 22, 21, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9 or 8 of gestation.


DNA Extraction Preparation Suggestions

    • Equilibrate samples to room temperature
    • If samples are less than 4 mL, bring up volume with PBS
    • Set up QIAvac 24 Plus
    • Heat waterbath to 60° C.
    • Heat heating block to 56° C.
    • Equilibrate Buffer AVE to RT
    • Ensure Buffer ACB, ACW1, ACW2 have been prepared properly
    • Add reconstituted carrier RNA to Buffer ACL according to chart


Note: After thawing, spin plasma samples at 1600 RPM for 10 minutes. This helps remove precipitates that may occur due to freeze/thawing cycles.


Procedure

    • 1. Pipet 400 uL of Qiagen Proteinase K into a 50 mL centrifuge tube
    • 2. Add 4 mL plasma to tube
    • 3. Add 3.2 mL ACL with carrier. Close cap, and mix by pulse-vortexing for 30 seconds
    • 4. Incubate at 60° C. for 30 minutes
    • 5. Briefly Centrifuge the tube to remove drops from the inside of the lid
    • 6. Add 7.2 mL of ACB to the lysate in the tube. Close cap and pulse-vortex for 15-30 seconds.
    • 7. Incubate lysate/Buffer ACB mixture in the tube on ice for 5 minutes
    • 8. Add lysate/Buffer ACB to tube extenders on columns. Switch on pump and when lysates have been drawn through column, turn each one off with their individual valves
    • 9. After all lysates have gone through, control vacuum using valve for manifold
    • 10. Using a Eppendorf Repeater and 5 mL Combitip, add 600 uL of ACW1 to each column
    • 11. Add 750 uL of ACW2 to each column
    • 12. Add 750 uL of absolute ethanol (200 proof) to each column
    • 13. Remove tube extenders, close the lids on the columns, place columns in clean 2 mL collection tubes, and centrifuge at 20,000 rcf for 3 minutes
    • 14. Place columns in new 2 mL collection tubes. Open the lids and incubate on 56° C. heat block for 10 minutes to dry the membrane completely
    • 15. Place columns in clean collection tubes and add 110 uL of Buffer AVE to center of column
    • 16. Close lids and incubate at RT for 3 minutes
    • 17. Centrifuge columns in collection tubes at 20,000 rcf for 1 minute to elute DNA


Assay Biochemistry and Protocol


The nucleotide sequence species of a set share primer hybridization sequences that, in one embodiment, are substantially identical, thus they will amplify in a reproducible manner with substantially equal efficiency using a single pair of primers for all members of the set. Sequence differences or mismatches between the two or more species sequences are identified, and the relative amounts of each mismatch, each of which represents a chromosome, are quantified. Detection methods that are highly quantitative can accurately assay the ratio between the chromosomes. For example, provided below are exemplary methods and compositions for the detection and quantification of nucleotide sequence species using Sequenom's MassARRAY® System.


Polymerase Chain Reaction (PCR)


PCR Configuration


Samples to be analyzed, whether from the model system or from plasma, were subjected to PCR amplification. Given the dilute nature of the ccfDNA, the PCR will be performed in 96-well plate format with a total reaction volume of 50 μL composed of reagents and samples as outlined below in Table 5. In general, standard PCR conditions as outlined by the manufacture were used for the various experiments.









TABLE 5







Example PDA PCR Reaction










Reagent
Supplier
Final Concentration
Volume (μL)





Water
N/A
N/A
 8.625


10 × PCR Buffer
*02100/
1.0×
 5.000


(contains 20 mM MgCl2)
Sequenom




PCR Nucleotide Mix (10 mM) ea
Roche
 200 μM
 1.000


Primer Mix (0.5 μM) ea
IDT
 0.1 μM
 10.000


10 U/μL UNG
Roche
6.25 U/rxn
 0.625


5 U/μL Fast Start
01462/Sequenom
  5 U/rxn
 1.000


MgCl2-dye (20 mM)
*02100/Sequenom
**3.5 mM
 3.75


Total =
30.000





*Item Number 02100 is a kit and includes 10 × PCR Buffer and 25 mM MgCl2.


**The final concentration of MgCl2 is 3.5 mM in each 50 μL reaction (2.0 mM from 10 × PCR Buffer and 1.5 mM from the 20 mM MgCl2-dye solution).






A 50 μL reaction volume was chosen for two reasons. The first is that the low concentration of circulating cell free DNA in plasma is between 1000 and 2000 genomic copies per μL, or 0.15-0.30 ng/pL requires more volume of sample to meet a minimum practical target value outlined by the reagent manufacture of ˜5 ng per reaction. Secondly, because the PDA method relies on small copy number differences between two paralogous DNA regions in different chromosomal loci, a larger volume PCR reduces the effect from small changes of volume and concentration that may occur in the ordinary course of PCR preparation and may increase variability in the PCR amplification.


Post-PCR


Distribution to 384 Well Plate and Dephosphorylate


After transferring aliquots of the PCR amplicons to 384 well format, the remaining PCR primers and dNTPs were dephosphorylated using Shrimp Alkaline Phosphatase (SAP). The dephosphorylation reaction is performed at 37° C. and the enzyme is heat inactivated at 85° C.









TABLE 6







SAP Mixture











Item Number/
Volume for
Final


SAP Mix Reagent
Vendor
N = 1 (μL)
Concentration





Nanopure Water
N/A
 1.536
N/A


SAP Buffer (10×)
10055/
 0.17
0.85×



Sequenom




Shrimp Alkaline Phosphatase
10002.1*/
 0.294
0.5 U/rxn


(SAP) (1.7 U/μL)
Sequenom











Total Volume
2  






*equivalent to SQNM product #10144






The 96 well PCR plates are centrifuged in a benchtop centrifuge to consolidate the PCR product. Using a Hamilton™ liquid handler, 4×5 μL aliquots are distributed to quadrants in a 384 well plate. Remaining PCR product (˜30 μL) is stored at −20° C. for future use.

    • 1. Using the Beckman 96 head MultiMek, 2 μL of SAP mixture dispensed to each 5 μL aliquot.
    • 2. The plates were sealed with adhesive sealing film and centrifuge.
    • 3. SAP dephosphorylation was performed in ABI 9700 thermal cyclers with the following program:









TABLE 7







SAP Reaction Thermal Profile












Temperature
Time
Cycles
Comments







37° C.
40 minutes
1
Dephosphorylation step



85° C.
 5 minutes
1
Inactivate SAP



 4° C.
forever
1
Store reaction










Primer Extension Reaction


Single base primer extension was used to detect the allele genotype at a SNP location, or in this case, at the nucleotide mismatch location of interest. An extension primer with a specific sequence is designed such that the 3′ end of the primer was located one base upstream of the fixed heterozygote location. During the extension portion of the cycle, a single base was incorporated into the primer sequence (single base extension), which was determined by the sequence of the target allele. The mass of the extended primer product will vary depending on the nucleotide added. The identity and amount of each allele was determined by mass spectrometry of the extended products using the Sequenom MassARRAY platform.


The extension mixture components are as described in the following table:









TABLE 8







Extension Mix Reagent Formulation









Extension
Item Number/
Volume for


Reagent
Vendor
N = 1 (μL)





Water (HPLC grade)
VWR_JT4218-2
0.4


TypePLEX detergent
01431*/
0.2


free buffer (10×)
Sequenom



TypePLEX
01533**/
0.2


Termination Mix
Sequenom



Extend Primer Mix
IDT
1  


Thermosequenase
10052***/
0.2


(32 U/μL)
Sequenom









Total Volume
2  





*equivalent to SQNM product #01449


**equivalent to SQNM products #01430 or #01450


***equivalent to SQNM products #10138 or #10140








    • 1. 2 μL of extension reaction mixture was added using the 96 head Beckman Coulter Multimek, bringing the total reaction volume to 9 μL.

    • 2. The plate was sealed with adhesive sealing film and centrifuge with benchtop centrifuge.

    • 3. The base extension reaction was performed in an ABI 9700 thermal cycler with the following cycling profile:












TABLE 9







Single Base Extension Thermal Cycling Profile











Temperature

Number of


Purpose
(° C.)
Time
Cycles





Initial Denaturation
94
30 seconds
1












Cycled Template
94
 5 seconds





Denaturation


















Cycled primer
52
 5 seconds






Annealing




{close oversize brace}
40


Cycled primer
80
 5 seconds
{close oversize brace}
5




Extension
















Final Extension
72
 3 minutes
1


Hold
 4
overnight
1











    • 4. After the extension reaction is complete, store the plate at 4° C. or continue to the desalting step.





Desalt Reaction with CLEAN Resin


The extension products were desalted of divalent cations (especially sodium cations) by incubating the samples with a cation-exchange resin prior to MALDI-TOF analysis.


Procedure

    • 1. The plates were centrifuged in a benchtop centrifuge.
    • 2. The 96 head Beckman Multimek was used to add 20 μL of autoclaved water to each well of the sample plate.
    • 3. The Sequenom Resin Dispenser (Model #XXX) was used to add resin slurry to each sample well.
    • 4. The plate was covered with an aluminum foil adhesive seal and rotated for at least ten minutes at room temperature.
    • 5. The plate was centrifuged at 4000 rpm for five minutes before dispensing the sample to a SpectroCHIP.


Dispense Sample onto a SpectroCHIP and Analyze on MassARRAY System


Approximately 15-20 nL of each sample was dispensed onto a pad of a SpectroCHIP using a MassARRAY Nanodispenser. Following rapid crystallization of the sample, the analytes were ready to be scanned by MALDI-TOF.


Procedure

    • 1. 3-point calibrant and samples were dispensed to a 384-spot SpectroCHIP using the RS-1000 Nanodispenser. Refer to the RS-1000 user's guide for more detailed instructions.
    • 2. Note: different dispensing speeds may be necessary depending on the ambient temperature and humidity in the dispensing chamber. Typical dispensing speeds are 80 mm/sec for analytes and 100mm/sec for the calibrant solution.
    • 3. After dispensing, the plate was resealed and stored at 4° C. or −20° C. for longer term storage. The plate can be re-centrifuged and re-spotted if necessary.
    • 4. The SpectroCHIP was placed in its storage case and stored in a dessicated chamber, if not analyzed immediately after spotting.
    • 5. The SpectroCHIP was loaded into the PHOENIX MassARRAY analyzer and the user's guide was followed to analyze the chip and acquire/store the mass spectrum data.


Three Experiments Across Four Tiers (and 3 Model Sets+A Plasma Set)


The assays provided in Table 4 were tested during three different experiments:


Experiment 1—Selected Markers with Mix 1 Biochemistry (2 acyclo's+2 ddNTP's)


Experiment 2—Selected Markers with TypePLEX Biochemistry (all acylco's)


Experiment 3—Remaining assays not included in Experiment 1 or 2


During each experiment, samples were tested across four different tiers (or a combination thereof). Within each tier, the different DNA Sets (1, 2 or 3, or combinations thereof) were used to test the assay's performance.


Tier I. Run multiplex (MP) set on model system and filter out poor performing assays


Tier II. Re-Plex selected assays into new multiplex and run on model system


Tier III. Genomic Screening and select best performing 3 multiplex


Tier IV. Run the best assays on plasma samples for assessment of true performance. (Plasma sample extraction methods are described in below in the “Plasma Derived Samples” section)


Experiment 1


The results from the different tiers for Experiment 1 are described below, and the binary performance of each assay is outlined in Table 13, where “yes” indicates the assay passed the tier, and “-” indicates the assay was not tested or did not test.


Results from Tier I

    • 250 assays in 10 multiplexes were tested on 6 different DNA plates
    • 50% assays did not meet quality criteria
    • Good quality assays show some biological signal for the discrimination of euploid and Normal/T21 mixed samples
    • More T21 DNA allows better discrimination


Conclusion: The DNA model system is concise and can be used for marker identification.


Results from Tier II

    • From TIER ONE 5 Multiplexes are carried forward.
    • A total of 4 re-plexed Multiplexes (comprising top 40 assays) are tested.


Conclusions: Re-plexed assays show good performance and low dropout rate. Redesign of extend primers better than ‘simple’ re-plexing.


Results from Tier III

    • More than 400 genomic DNAs from 4 ethnic groups were tested on TIER II Multiplexes
    • less than 10% of the assays show genomic variability
    • For the remaining assays variability is observed in less than 1% of the samples


(Processing Variability Needs to be Excluded)


Conclusion: The filter criteria used during assays design are sufficient to identify highly stable genomic regions.


Results from Tier IV

    • 57 assays were measured
    • 75 Normal samples
    • 23 T21 samples


The results from Experiment I, Tier IV are provided in Table 10 and shown in FIG. 9. FIG. 9 results are based on a Simple Principle Component Analysis, and shows the two main components can separate euploid samples from aneuploid samples.









TABLE 10







Experiment 1, Tier IV Plasma Results












Method
Sensitivity
Specificity
AUC







Decision Tree
55%
85%
0.73



SVM-linear kernel
77%
91%
0.84



Logistic Regression
77%
84%
0.89



Naïve Bayes
86%
91%
0.95



Multilayer Perceptron
91%
93%
0.97










Experiment 2—TypePLEX Extension Biochemistry


Experiment 2 was run using TypePLEX extension biochemistry and a new set of assays (see Table 4).

    • The entire feasibility was repeated using the TypePLEX biochemistry.
    • Selection of genomic target regions did not have to be repeated.
    • Assays were replexed after TIER 1.
    • Tier four included 150 euploid samples and 25 T21 samples.


Results of Experiment 2: TypePLEX Study

    • 250 Markers were tested.
    • 120 passed QC criteria to be replexed into 9 multiplexes.
    • 3 Multiplexes comprising 54 markers were tested on Plasma samples.
    • >90% classification accuracy in the DNA model system.
    • 150 euploid samples tested
    • 24 T21 samples tested
    • Fetal Quantifer Assay (FQA) used to determine the amount of fetal DNA present in the samples after DNA extraction.









TABLE 11







Experiment 2, Tier IV Results (from all samples)









Method
Sensitivity
Specificity





Naïve Bayes
34%
97%


AdaBoost
48%
98%


Logistic Regression
50%
87%


Multilayer Perceptron
61%
94%
















TABLE 12







Experiment 2, Tier IV Results (from all


samples with >12.5% or >15% fetal DNA)









Method
Sensitivity
Specificity





Naïve Bayes
 43% (52%)*
97% (96%)


AdaBoost
55% (72%)
100% (100%)


Logistic Regression
93% (99%)
98% (99%)


Multilayer Perceptron
75% (81%)
97% (99%)





*values in paranthesis represent samples with >15% fetal DNA






Of all the samples tested in Experiment 2, 111 samples had more than 12.5% fetal DNA and 84 samples had more than 15% fetal DNA. The FQA assay refers to the Fetal Quantifier Assay described in U.S. patent application No. 12/561,241 filed Sep. 16, 2009, which is hereby incorporated by reference. The assay is able to determine the amount (or concentration) of fetal DNA present in a sample.


Experiment 3—Remaining Assays


The remaining assays were analyzed across DNA Sets 1, 2 and 3 using Type PLEX biochemistry, and the results are provided in Table 13 below.


In one embodiment, a multiplexed assay is selected from the group consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55 or more of the following nucleotide sequence sets 2FH21F_01-030, 2FH21F_01-041, 2FH21F_02-075, 2FH21F_02_076, 2FH21F_02-089, 2FH21F_02-091, 2FH21F_02-107, 2FH21F_02-111, 2FH21F_02-116, 2FH21F_02-148, 2FH21F_02-254, 2FH21F_03_005, 2FH21F_03_022, 2FH21F_05_003, 2FH21F_05_006, 2FH21F_05_027, 2FH21F_05_033, 2FH21F_05_061, 2FH21F_06_114, 2FH21F_06_165, 2FH21F_06_218, 2FH21F_06_219, 2FH21F_06_224, 2FH21F_06_238, 2FH21F_07_071, 2FH21F_07_166, 2FH21F_07_202, 2FH21F_07_464, 2FH21F_07_465, 2FH21F_09_007, 2FH21F_09_010, 2FH21F_10_005, 2FH21F_11-022, 2FH21F_11-028, 2FH21F_12-049, 2FH21F_12-052, 2FH21F_12-074, 2FH21F_12-075, 2FH21F_13_036, 2FH21F_13_041, 2FH21F_15_044, 2FH21F_18_020, 2FH21F_18_059, 2FH21F_18_076, 2FH21F_18_094, 2FH21F_18_154, 2FH21F_18_171, 2FH21F_18_176, 2FH21F_18_178, 2FH21F_18_188, 2FH21F_18_190, 2FH21F_18_191, 2FH21F_18_262, 2FH21F_18_270, 2FH21F_18_332 and 2FH21F_18_346, which correspond to those sequence sets carried to Tier IV of Experiment 3 (although not run on plasma samples). See Table 13 below.


Based on analysis of the designs and the results (both from the models and the plasma samples from all three experiments), one can conclude that investigating several regions in parallel, reduces the measurement variance and enabled accurate quantification of ccff DNA. Also, due to the low copy numbers that have to be detected it is desirable to have redundant measurements, which will increase the confidence in the results.












TABLE 13








Experiment 1
Experiment 2
Experiment 3


















M1_
M1_
_



Full_
Full_
_
_



All_
Replex_
M1
TypePLEX_
TypePLEX_

Screen_
Screen_
Full
Full



211
90
Plasma_
All_246
Replex_117
TypePLEX_
All_1004
Replexl_236
Screen_
Screen_



tier 1
tier 2
47
tier 1
tier 2
Plasma_50
tier 1
tier 2
Replex2_92
Plasma56



DNA set
DNA set
tier 4
DNA set
DNA set
tier 4
DNA
DNA set
tier 3
tier 4*


Marker_ID
1, 2, 3
1, 2, 3
plasma
1, 2, 3
1, 2, 3
plasma
set 1
1, 3
1, 3
1, 3





2FH21F_01_003






Yes





2FH21F_01_006






Yes





2FH21F_01_007



Yes








2FH21F_01_009






Yes





2FH21F_01_010






Yes





2FH21F_01_011






Yes





2FH21F_01_012






Yes





2FH21F_01_013



Yes








2FH21F_01_014






Yes





2FH21F_01_015



Yes
Yes
Yes






2FH21F_01_017






Yes





2FH21F_01_018






Yes





2FH21F_01_020






Yes





2FH21F_01_021
Yes
Yes
Yes



Yes





2FH21F_01_022






Yes





2FH21F_01_023






Yes





2FH21F_01_025






Yes





2FH21F_01_026



Yes








2FH21F_01_027
Yes





Yes





2FH21F_01_029






Yes





2FH21F_01_030






Yes
Yes
Yes
Yes


2FH21F_01_031






Yes
Yes




2FH21F_01_033






Yes
Yes
Yes



2FH21F_01_034






Yes





2FH21F_01_036



Yes
Yes
Yes






2FH21F_01_037



Yes
Yes
Yes






2FH21F_01_038






Yes





2FH21F_01_039






Yes





2FH21F_01_040
Yes
Yes

Yes
Yes







2FH21F_01_041






Yes
Yes
Yes
Yes


2FH21F_01_043






Yes





2FH21F_01_044
Yes





Yes





2FH21F_01_045






Yes





2FH21F_01_046












2FH21F_01_049



Yes
Yes







2FH21F_01_050






Yes





2FH21F_01_057



Yes








2FH21F_01_058
Yes
Yes




Yes





2FH21F_01_059






Yes





2FH21F_01_060






Yes





2FH21F_01_062






Yes





2FH21F_01_063






Yes





2FH21F_01_064






Yes





2FH21F_01_065






Yes





2FH21F_01_067






Yes





2FH21F_01_068






Yes





2FH21F_01_071












2FH21F_01_072
Yes
Yes




Yes





2FH21F_01_073






Yes





2FH21F_01_077






Yes





2FH21F_01_078






Yes





2FH21F_01_080






Yes





2FH21F_01_081






Yes





2FH21F_01_082
Yes
Yes
Yes



Yes





2FH21F_01_083



Yes
Yes
Yes






2FH21F_01_084






Yes





2FH21F_01_086






Yes





2FH21F_01_088






Yes





2FH21F_01_090



Yes








2FH21F_01_093
Yes





Yes





2FH21F_01_094
Yes


Yes








2FH21F_01_099






Yes





2FH21F_01_101






Yes





2FH21F_01_102
Yes





Yes





2FH21F_01_104






Yes





2FH21F_02_003






Yes
Yes




2FH21F_02_007



Yes
Yes


Yes




2FH21F_02_015



Yes
Yes







2FH21F_02_017



Yes
Yes







2FH21F_02_018






Yes





2FH21F_02_019






Yes





2FH21F_02_020
Yes
Yes




Yes





2FH21F_02_021






Yes





2FH21F_02_022



Yes








2FH21F_02_023






Yes





2FH21F_02_027






Yes





2FH21F_02_034






Yes





2FH21F_02_035



Yes
Yes
Yes






2FH21F_02_036






Yes





2FH21F_02_037



Yes








2FH21F_02_038






Yes





2FH21F_02_040






Yes





2FH21F_02_041






Yes





2FH21F_02_043






Yes





2FH21F_02_045






Yes





2FH21F_02_050






Yes
Yes




2FH21F_02_055



Yes
Yes
Yes

Yes




2FH21F_02_057






Yes





2FH21F_02_058






Yes





2FH21F_02_061



Yes
Yes







2FH21F_02_062






Yes





2FH21F_02_063



Yes
Yes







2FH21F_02_065






Yes





2FH21F_02_066






Yes





2FH21F_02_067






Yes





2FH21F_02_072






Yes





2FH21F_02_073






Yes





2FH21F_02_074






Yes
Yes
Yes



2FH21F_02_075






Yes
Yes
Yes
Yes


2FH21F_02_076






Yes

Yes
Yes


2FH21F_02_077
Yes


Yes
Yes
Yes






2FH21F_02_088



Yes








2FH21F_02_089



Yes
Yes


Yes
Yes
Yes


2FH21F_02_090






Yes





2FH21F_02_091






Yes
Yes
Yes
Yes


2FH21F_02_103






Yes





2FH21F_02_107
Yes
Yes
Yes



Yes
Yes
Yes
Yes


2FH21F_02_108






Yes





2FH21F_02_111
Yes
Yes
Yes



Yes
Yes
Yes
Yes


2FH21F_02_113



Yes
Yes







2FH21F_02_116



Yes
Yes


Yes
Yes
Yes


2FH21F_02_127






Yes





2FH21F_02_129






Yes





2FH21F_02_132






Yes





2FH21F_02_134






Yes





2FH21F_02_139
Yes
Yes




Yes





2FH21F_02_143






Yes





2FH21F_02_144
Yes





Yes





2FH21F_02_145






Yes





2FH21F_02_146






Yes





2FH21F_02_148






Yes
Yes
Yes
Yes


2FH21F_02_150



Yes








2FH21F_02_151






Yes





2FH21F_02_155






Yes





2FH21F_02_156






Yes





2FH21F_02_157






Yes





2FH21F_02_158






Yes





2FH21F_02_159






Yes





2FH21F_02_163






Yes





2FH21F_02_168






Yes





2FH21F_02_170



Yes
Yes







2FH21F_02_172






Yes





2FH21F_02_173






Yes





2FH21F_02_174
Yes
Yes




Yes





2FH21F_02_175
Yes





Yes





2FH21F_02_177






Yes





2FH21F_02_178






Yes





2FH21F_02_181






Yes





2FH21F_02_182
Yes
Yes




Yes





2FH21F_02_184






Yes





2FH21F_02_185






Yes





2FH21F_02_189






Yes





2FH21F_02_190






Yes





2FH21F_02_191






Yes





2FH21F_02_193






Yes





2FH21F_02_194



Yes
Yes
Yes






2FH21F_02_195






Yes





2FH21F_02_200






Yes





2FH21F_02_204
Yes
Yes
Yes



Yes





2FH21F_02_206






Yes





2FH21F_02_207






Yes





2FH21F_02_208
Yes


Yes
Yes
Yes






2FH21F_02_211






Yes





2FH21F_02_212






Yes





2FH21F_02_213
Yes
Yes
Yes



Yes





2FH21F_02_214
Yes
Yes
Yes



Yes





2FH21F_02_215
Yes
Yes
Yes



Yes





2FH21F_02_216






Yes





2FH21F_02_217






Yes





2FH21F_02_218






Yes





2FH21F_02_219






Yes





2FH21F_02_220



Yes








2FH21F_02_223






Yes





2FH21F_02_226






Yes





2FH21F_02_227






Yes





2FH21F_02_228






Yes
Yes




2FH21F_02_230






Yes





2FH21F_02_232






Yes





2FH21F_02_234






Yes





2FH21F_02_235






Yes





2FH21F_02_236






Yes





2FH21F_02_239






Yes





2FH21F_02_241
Yes





Yes
Yes




2FH21F_02_243






Yes
Yes




2FH21F_02_248






Yes





2FH21F_02_249






Yes





2FH21F_02_250
Yes





Yes





2FH21F_02_254



Yes
Yes


Yes
Yes
Yes


2FH21F_03_005






Yes
Yes
Yes
Yes


2FH21F_03_007






Yes
Yes




2FH21F_03_008



Yes
Yes


Yes




2FH21F_03_011






Yes





2FH21F_03_012






Yes





2FH21F_03_013






Yes





2FH21F_03_014






Yes
Yes




2FH21F_03_015






Yes
Yes




2FH21F_03_017






Yes





2FH21F_03_018



Yes








2FH21F_03_021
Yes
Yes
Yes



Yes
Yes




2FH21F_03_022






Yes
Yes
Yes
Yes


2FH21F_03_025



Yes
Yes







2FH21F_03_026
Yes
Yes




Yes
Yes




2FH21F_03_027



Yes








2FH21F_03_028



Yes
Yes
Yes

Yes




2FH21F_03_030






Yes





2FH21F_03_031






Yes





2FH21F_03_039



Yes








2FH21F_03_040






Yes





2FH21F_03_043






Yes





2FH21F_03_053



Yes








2FH21F_03_058






Yes





2FH21F_03_061



Yes








2FH21F_03_062






Yes





2FH21F_03_063






Yes





2FH21F_03_064
Yes





Yes





2FH21F_03_065






Yes





2FH21F_03_071






Yes





2FH21F_03_073






Yes





2FH21F_03_079






Yes





2FH21F_03_080
Yes





Yes





2FH21F_03_081






Yes





2FH21F_03_083
Yes





Yes





2FH21F_03_084






Yes





2FH21F_03_085






Yes





2FH21F_03_087






Yes





2FH21F_03_088






Yes





2FH21F_03_089






Yes





2FH21F_03_091



Yes
Yes
Yes






2FH21F_03_093






Yes





2FH21F_03_094






Yes





2FH21F_03_095






Yes





2FH21F_03_097






Yes





2FH21F_03_098






Yes





2FH21F_03_100






Yes





2FH21F_03_101
Yes
Yes
Yes
Yes
Yes


Yes




2FH21F_04_006



Yes








2FH21F_04_008






Yes





2FH21F_04_010
Yes
Yes

Yes
Yes







2FH21F_04_011






Yes





2FH21F_04_014



Yes
Yes







2FH21F_04_015
Yes





Yes





2FH21F_04_017






Yes





2FH21F_04_018






Yes
Yes




2FH21F_04_019






Yes





2FH21F_04_021
Yes
Yes
Yes
Yes
Yes


Yes




2FH21F_04_022
Yes
Yes
Yes
Yes
Yes
Yes






2FH21F_04_023
Yes





Yes





2FH21F_04_024






Yes





2FH21F_05_003






Yes
Yes
Yes
Yes


2FH21F_05_005






Yes
Yes




2FH21F_05_006






Yes

Yes
Yes


2FH21F_05_007






Yes





2FH21F_05_008






Yes
Yes
Yes



2FH21F_05_013






Yes

Yes



2FH21F_05_015






Yes





2FH21F_05_016
Yes





Yes
Yes




2FH21F_05_018
Yes





Yes
Yes




2FH21F_05_019



Yes
Yes
Yes






2FH21F_05_025






Yes
Yes




2FH21F_05_026






Yes





2FH21F_05_027






Yes

Yes
Yes


2FH21F_05_028






Yes
Yes
Yes



2FH21F_05_032






Yes
Yes




2FH21F_05_033



Yes



Yes
Yes
Yes


2FH21F_05_034






Yes





2FH21F_05_035



Yes
Yes







2FH21F_05_040






Yes





2FH21F_05_041



Yes
Yes
Yes

Yes




2FH21F_05_044






Yes





2FH21F_05_045






Yes
Yes




2FH21F_05_047



Yes








2FH21F_05_051






Yes





2FH21F_05_054






Yes





2FH21F_05_058






Yes
Yes




2FH21F_05_061






Yes
Yes
Yes
Yes


2FH21F_05_064
Yes
Yes
Yes



Yes
Yes




2FH21F_05_066
Yes
Yes
Yes



Yes





2FH21F_05_067






Yes





2FH21F_05_069






Yes





2FH21F_05_072






Yes
Yes




2FH21F_05_073






Yes





2FH21F_05_074






Yes





2FH21F_05_076






Yes





2FH21F_05_080






Yes





2FH21F_05_083






Yes
Yes




2FH21F_05_088






Yes





2FH21F_05_091



Yes
Yes
Yes

Yes
Yes



2FH21F_05_092






Yes





2FH21F_05_094



Yes
Yes







2FH21F_05_096
Yes
Yes
Yes
Yes








2FH21F_05_097



Yes








2FH21F_05_098






Yes





2FH21F_05_099






Yes





2FH21F_05_101






Yes





2FH21F_05_102
Yes





Yes





2FH21F_05_109
Yes





Yes





2FH21F_05_110






Yes





2FH21F_06_001






Yes
Yes




2FH21F_06_004






Yes





2FH21F_06_005
Yes





Yes





2FH21F_06_006






Yes





2FH21F_06_007
Yes


Yes
Yes
Yes






2FH21F_06_011






Yes





2FH21F_06_012



Yes








2FH21F_06_013






Yes





2FH21F_06_015






Yes





2FH21F_06_018






Yes





2FH21F_06_023






Yes





2FH21F_06_025






Yes





2FH21F_06_026



Yes








2FH21F_06_028
Yes
Yes




Yes





2FH21F_06_029



Yes








2FH21F_06_031






Yes





2FH21F_06_034






Yes





2FH21F_06_035



Yes








2FH21F_06_037






Yes





2FH21F_06_038






Yes





2FH21F_06_045






Yes
Yes




2FH21F_06_046
Yes
Yes
Yes



Yes





2FH21F_06_047
Yes
Yes
Yes
Yes
Yes
Yes

Yes




2FH21F_06_051






Yes
Yes
Yes



2FH21F_06_052
Yes
Yes
Yes



Yes





2FH21F_06_053






Yes
Yes
Yes



2FH21F_06_060






Yes





2FH21F_06_061






Yes





2FH21F_06_062



Yes
Yes


Yes




2FH21F_06_064



Yes
Yes







2FH21F_06_065






Yes





2FH21F_06_068






Yes





2FH21F_06_073



Yes



Yes




2FH21F_06_075






Yes





2FH21F_06_076






Yes





2FH21F_06_077






Yes
Yes




2FH21F_06_079
Yes
Yes
Yes
Yes
Yes







2FH21F_06_082






Yes





2FH21F_06_083






Yes





2FH21F_06_084






Yes
Yes




2FH21F_06_088
Yes





Yes





2FH21F_06_092






Yes
Yes
Yes



2FH21F_06_093






Yes
Yes




2FH21F_06_095






Yes





2FH21F_06_099






Yes
Yes




2FH21F_06_102






Yes





2FH21F_06_107






Yes





2FH21F_06_110






Yes
Yes




2FH21F_06_111






Yes





2FH21F_06_112






Yes





2FH21F_06_113






Yes
Yes




2FH21F_06_114






Yes
Yes
Yes
Yes


2FH21F_06_117






Yes





2FH21F_06_118



Yes
Yes
Yes

Yes




2FH21F_06_119






Yes





2FH21F_06_127
Yes





Yes





2FH21F_06_128






Yes
Yes




2FH21F_06_129






Yes
Yes




2FH21F_06_130
Yes
Yes
Yes
Yes
Yes


Yes




2FH21F_06_132
Yes





Yes





2FH21F_06_133






Yes





2FH21F_06_134






Yes





2FH21F_06_135
Yes
Yes




Yes





2FH21F_06_137






Yes





2FH21F_06_138






Yes





2FH21F_06_140






Yes





2FH21F_06_141






Yes
Yes
Yes



2FH21F_06_142






Yes





2FH21F_06_144






Yes
Yes




2FH21F_06_147






Yes





2FH21F_06_148



Yes
Yes
Yes

Yes




2FH21F_06_149






Yes
Yes




2FH21F_06_150






Yes





2FH21F_06_153






Yes





2FH21F_06_155






Yes





2FH21F_06_156



Yes



Yes




2FH21F_06_159






Yes
Yes




2FH21F_06_163






Yes





2FH21F_06_165
Yes
Yes
Yes
Yes
Yes


Yes
Yes
Yes


2FH21F_06_166






Yes





2FH21F_06_168






Yes





2FH21F_06_172






Yes
Yes




2FH21F_06_176






Yes





2FH21F_06_179






Yes





2FH21F_06_182
Yes
Yes
Yes



Yes
Yes




2FH21F_06_183






Yes





2FH21F_06_194






Yes
Yes




2FH21F_06_196






Yes





2FH21F_06_198



Yes








2FH21F_06_204






Yes





2FH21F_06_218



Yes



Yes
Yes
Yes


2FH21F_06_219
Yes


Yes
Yes


Yes
Yes
Yes


2FH21F_06_224






Yes
Yes
Yes
Yes


2FH21F_06_228






Yes
Yes




2FH21F_06_229






Yes





2FH21F_06_233






Yes





2FH21F_06_238






Yes

Yes
Yes


2FH21F_06_239






Yes
Yes




2FH21F_06_241






Yes
Yes




2FH21F_06_242






Yes





2FH21F_06_243






Yes





2FH21F_06_250
Yes
Yes
Yes
Yes
Yes







2FH21F_06_251
Yes





Yes





2FH21F_06_252






Yes





2FH21F_06_253
Yes





Yes





2FH21F_06_254






Yes





2FH21F_06_258
Yes
Yes
Yes



Yes





2FH21F_06_259






Yes
Yes




2FH21F_06_263



Yes
Yes
Yes






2FH21F_06_264
Yes





Yes





2FH21F_06_268






Yes





2FH21F_06_275






Yes





2FH21F_06_277






Yes





2FH21F_06_278



Yes
Yes
Yes


Yes



2FH21F_06_279






Yes
Yes




2FH21F_06_284






Yes





2FH21F_06_288






Yes





2FH21F_07_002






Yes





2FH21F_07_003
Yes
Yes




Yes





2FH21F_07_004






Yes





2FH21F_07_009






Yes





2FH21F_07_016






Yes





2FH21F_07_017






Yes





2FH21F_07_018
Yes
Yes




Yes





2FH21F_07_021






Yes





2FH21F_07_022






Yes





2FH21F_07_025



Yes








2FH21F_07_026






Yes





2FH21F_07_027






Yes





2FH21F_07_028






Yes





2FH21F_07_029






Yes





2FH21F_07_030






Yes





2FH21F_07_033






Yes





2FH21F_07_035






Yes





2FH21F_07_036






Yes





2FH21F_07_037






Yes





2FH21F_07_042






Yes





2FH21F_07_050






Yes





2FH21F_07_052






Yes





2FH21F_07_053






Yes





2FH21F_07_057
Yes





Yes





2FH21F_07_058






Yes





2FH21F_07_059
Yes


Yes
Yes







2FH21F_07_061
Yes





Yes





2FH21F_07_063






Yes





2FH21F_07_064



Yes
Yes







2FH21F_07_067






Yes





2FH21F_07_071






Yes
Yes
Yes
Yes


2FH21F_07_072






Yes





2FH21F_07_074
Yes
Yes




Yes





2FH21F_07_081






Yes





2FH21F_07_082






Yes





2FH21F_07_084






Yes





2FH21F_07_088






Yes





2FH21F_07_090
Yes





Yes





2FH21F_07_094






Yes





2FH21F_07_095



Yes








2FH21F_07_105






Yes





2FH21F_07_106






Yes





2FH21F_07_109






Yes





2FH21F_07_112






Yes





2FH21F_07_115






Yes





2FH21F_07_116






Yes





2FH21F_07_117






Yes





2FH21F_07_119






Yes





2FH21F_07_122






Yes





2FH21F_07_128






Yes





2FH21F_07_130






Yes





2FH21F_07_131



Yes








2FH21F_07_135
Yes





Yes





2FH21F_07_136






Yes





2FH21F_07_138






Yes





2FH21F_07_142






Yes





2FH21F_07_143






Yes





2FH21F_07_147






Yes





2FH21F_07_150



Yes








2FH21F_07_151






Yes





2FH21F_07_152






Yes





2FH21F_07_153






Yes





2FH21F_07_156



Yes








2FH21F_07_157






Yes





2FH21F_07_160



Yes








2FH21F_07_161






Yes





2FH21F_07_164






Yes





2FH21F_07_166



Yes
Yes
Yes

Yes
Yes
Yes


2FH21F_07_168






Yes





2FH21F_07_176






Yes





2FH21F_07_178
Yes
Yes




Yes





2FH21F_07_179






Yes





2FH21F_07_180






Yes





2FH21F_07_181



Yes








2FH21F_07_183






Yes
Yes




2FH21F_07_186






Yes
Yes




2FH21F_07_187






Yes





2FH21F_07_188






Yes





2FH21F_07_194
Yes





Yes





2FH21F_07_195






Yes





2FH21F_07_198






Yes





2FH21F_07_200






Yes





2FH21F_07_202






Yes
Yes
Yes
Yes


2FH21F_07_203
Yes





Yes





2FH21F_07_207






Yes





2FH21F_07_210
Yes
Yes




Yes





2FH21F_07_211






Yes





2FH21F_07_212






Yes





2FH21F_07_214






Yes





2FH21F_07_215






Yes





2FH21F_07_216






Yes





2FH21F_07_219






Yes





2FH21F_07_220
Yes





Yes





2FH21F_07_223






Yes





2FH21F_07_226






Yes





2FH21F_07_228






Yes
Yes




2FH21F_07_229






Yes
Yes
Yes



2FH21F_07_230






Yes





2FH21F_07_233






Yes





2FH21F_07_234






Yes





2FH21F_07_235
Yes
Yes
Yes



Yes





2FH21F_07_238






Yes
Yes




2FH21F_07_239






Yes





2FH21F_07_240



Yes








2FH21F_07_241
Yes





Yes





2FH21F_07_242



Yes



Yes
Yes



2FH21F_07_243






Yes





2FH21F_07_245






Yes





2FH21F_07_247



Yes








2FH21F_07_253






Yes





2FH21F_07_254






Yes
Yes




2FH21F_07_256






Yes





2FH21F_07_262






Yes





2FH21F_07_264






Yes





2FH21F_07_268






Yes
Yes
Yes



2FH21F_07_269






Yes





2FH21F_07_270






Yes





2FH21F_07_271
Yes





Yes





2FH21F_07_277






Yes





2FH21F_07_279






Yes





2FH21F_07_282



Yes
Yes







2FH21F_07_283






Yes





2FH21F_07_289






Yes





2FH21F_07_293






Yes





2FH21F_07_298






Yes





2FH21F_07_302






Yes





2FH21F_07_303






Yes





2FH21F_07_304






Yes





2FH21F_07_305






Yes





2FH21F_07_306






Yes





2FH21F_07_307






Yes





2FH21F_07_308






Yes





2FH21F_07_309



Yes








2FH21F_07_312






Yes





2FH21F_07_321






Yes





2FH21F_07_323






Yes





2FH21F_07_325






Yes





2FH21F_07_329






Yes





2FH21F_07_331






Yes
Yes




2FH21F_07_332






Yes





2FH21F_07_333






Yes





2FH21F_07_334






Yes





2FH21F_07_335






Yes





2FH21F_07_337






Yes





2FH21F_07_340






Yes





2FH21F_07_343






Yes





2FH21F_07_347
Yes





Yes





2FH21F_07_349






Yes





2FH21F_07_351






Yes





2FH21F_07_352






Yes





2FH21F_07_354






Yes





2FH21F_07_355
Yes
Yes




Yes





2FH21F_07_356






Yes





2FH21F_07_357






Yes





2FH21F_07_358






Yes





2FH21F_07_359






Yes





2FH21F_07_360






Yes





2FH21F_07_365






Yes





2FH21F_07_366






Yes





2FH21F_07_367



Yes








2FH21F_07_368



Yes








2FH21F_07_369






Yes





2FH21F_07_370
Yes





Yes





2FH21F_07_371






Yes





2FH21F_07_373






Yes





2FH21F_07_374



Yes








2FH21F_07_375






Yes





2FH21F_07_376






Yes





2FH21F_07_377






Yes





2FH21F_07_380






Yes





2FH21F_07_381






Yes





2FH21F_07_385
Yes
Yes
Yes



Yes





2FH21F_07_391






Yes





2FH21F_07_393
Yes
Yes




Yes





2FH21F_07_394



Yes








2FH21F_07_395



Yes








2FH21F_07_397



Yes








2FH21F_07_398
Yes
Yes

Yes








2FH21F_07_399



Yes








2FH21F_07_402



Yes








2FH21F_07_403






Yes





2FH21F_07_405






Yes





2FH21F_07_406



Yes








2FH21F_07_407






Yes
Yes




2FH21F_07_416
Yes





Yes





2FH21F_07_419






Yes





2FH21F_07_420
Yes





Yes
Yes




2FH21F_07_421






Yes
Yes




2FH21F_07_422






Yes





2FH21F_07_423






Yes





2FH21F_07_426



Yes
Yes







2FH21F_07_427






Yes





2FH21F_07_429



Yes








2FH21F_07_430



Yes
Yes







2FH21F_07_431
Yes
Yes




Yes





2FH21F_07_434






Yes





2FH21F_07_437






Yes





2FH21F_07_438
Yes
Yes




Yes





2FH21F_07_439






Yes





2FH21F_07_443






Yes





2FH21F_07_444






Yes





2FH21F_07_445






Yes





2FH21F_07_447






Yes





2FH21F_07_452






Yes





2FH21F_07_454






Yes





2FH21F_07_457






Yes





2FH21F_07_459






Yes





2FH21F_07_460






Yes





2FH21F_07_462






Yes
Yes




2FH21F_07_463



Yes








2FH21F_07_464






Yes
Yes
Yes
Yes


2FH21F_07_465






Yes
Yes
Yes
Yes


2FH21F_07_466






Yes





2FH21F_07_474






Yes





2FH21F_07_475






Yes





2FH21F_07_476






Yes





2FH21F_07_479






Yes





2FH21F_07_480






Yes





2FH21F_07_482



Yes








2FH21F_07_483



Yes








2FH21F_08_001



Yes








2FH21F_08_003
Yes


Yes








2FH21F_08_004
Yes





Yes
Yes




2FH21F_08_008
Yes
Yes
Yes
Yes
Yes
Yes






2FH21F_08_009
Yes
Yes
Yes



Yes





2FH21F_08_010



Yes
Yes
Yes

Yes




2FH21F_08_013






Yes





2FH21F_08_014






Yes





2FH21F_08_016



Yes








2FH21F_08_017
Yes





Yes





2FH21F_09_004



Yes
Yes



Yes



2FH21F_09_005



Yes
Yes
Yes

Yes




2FH21F_09_007






Yes
Yes
Yes
Yes


2FH21F_09_010
Yes
Yes
Yes
Yes
Yes


Yes
Yes
Yes


2FH21F_09_013
Yes
Yes
Yes



Yes





2FH21F_09_016
Yes


Yes








2FH21F_09_018






Yes





2FH21F_10_003



Yes
Yes
Yes






2FH21F_10_005
Yes


Yes
Yes


Yes
Yes
Yes


2FH21F_10_006
Yes
Yes

Yes
Yes







2FH21F_10_007






Yes





2FH21F_10_011
Yes





Yes





2FH21F_10_016






Yes





2FH21F_10_018






Yes
Yes




2FH21F_10_019



Yes



Yes




2FH21F_10_020
Yes





Yes





2FH21F_11_001






Yes





2FH21F_11_002






Yes





2FH21F_11_003






Yes





2FH21F_11_005






Yes





2FH21F_11_006
Yes





Yes





2FH21F_11_007
Yes


Yes








2FH21F_11_008
Yes





Yes





2FH21F_11_010






Yes





2FH21F_11_012



Yes
Yes







2FH21F_11_013
Yes
Yes

Yes








2FH21F_11_014
Yes


Yes








2FH21F_11_015






Yes





2FH21F_11_019






Yes





2FH21F_11_020
Yes
Yes
Yes
Yes
Yes
Yes






2FH21F_11_022






Yes
Yes
Yes
Yes


2FH21F_11_023






Yes





2FH21F_11_024






Yes
Yes




2FH21F_11_026



Yes
Yes







2FH21F_11_027



Yes
Yes


Yes
Yes



2FH21F_11_028



Yes
Yes


Yes
Yes
Yes


2FH21F_11_029






Yes





2FH21F_11_030






Yes





2FH21F_11_033



Yes
Yes







2FH21F_12_003



Yes








2FH21F_12_011
Yes


Yes



Yes




2FH21F_12_012
Yes
Yes

Yes








2FH21F_12_013



Yes








2FH21F_12_015






Yes





2FH21F_12_016






Yes





2FH21F_12_032



Yes
Yes
Yes

Yes




2FH21F_12_036



Yes
Yes


Yes




2FH21F_12_039






Yes





2FH21F_12_048






Yes





2FH21F_12_049






Yes
Yes
Yes
Yes


2FH21F_12_050






Yes





2FH21F_12_051






Yes
Yes
Yes



2FH21F_12_052
Yes





Yes
Yes
Yes
Yes


2FH21F_12_053






Yes
Yes
Yes



2FH21F_12_054






Yes
Yes




2FH21F_12_057






Yes





2FH21F_12_058






Yes





2FH21F_12_060



Yes
Yes


Yes




2FH21F_12_064






Yes





2FH21F_12_066






Yes





2FH21F_12_068






Yes





2FH21F_12_071






Yes





2FH21F_12_072






Yes
Yes




2FH21F_12_073



Yes
Yes
Yes






2FH21F_12_074



Yes
Yes


Yes
Yes
Yes


2FH21F_12_075



Yes
Yes
Yes

Yes
Yes
Yes


2FH21F_12_076






Yes
Yes




2FH21F_12_077






Yes





2FH21F_12_078
Yes


Yes
Yes


Yes




2FH21F_12_079






Yes





2FH21F_12_080






Yes





2FH21F_12_081






Yes





2FH21F_12_082
Yes
Yes




Yes





2FH21F_12_083
Yes





Yes





2FH21F_12_084






Yes





2FH21F_12_086






Yes

Yes



2FH21F_12_088






Yes





2FH21F_12_094






Yes
Yes




2FH21F_12_095



Yes








2FH21F_12_098






Yes





2FH21F_12_103



Yes
Yes







2FH21F_12_104






Yes





2FH21F_12_105






Yes





2FH21F_12_106
Yes
Yes
Yes



Yes





2FH21F_12_107






Yes





2FH21F_12_112






Yes





2FH21F_12_113
Yes





Yes





2FH21F_12_114






Yes





2FH21F_13_005






Yes
Yes




2FH21F_13_019






Yes





2FH21F_13_020






Yes





2FH21F_13_022
Yes





Yes





2FH21F_13_023






Yes





2FH21F_13_026



Yes








2FH21F_13_028






Yes





2FH21F_13_031
Yes





Yes





2FH21F_13_032
Yes





Yes





2FH21F_13_033
Yes
Yes




Yes





2FH21F_13_035






Yes





2FH21F_13_036






Yes
Yes
Yes
Yes


2FH21F_13_039






Yes





2FH21F_13_040






Yes





2FH21F_13_041






Yes
Yes
Yes
Yes


2FH21F_13_042






Yes





2FH21F_13_043






Yes





2FH21F_13_046






Yes





2FH21F_13_047






Yes





2FH21F_13_048
Yes


Yes
Yes







2FH21F_13_049






Yes





2FH21F_13_051
Yes


Yes
Yes







2FH21F_13_052






Yes





2FH21F_13_054






Yes
Yes




2FH21F_13_057
Yes
Yes




Yes





2FH21F_13_059






Yes





2FH21F_13_060






Yes





2FH21F_13_062






Yes





2FH21F_13_065






Yes





2FH21F_13_066






Yes





2FH21F_13_068



Yes








2FH21F_13_071



Yes








2FH21F_13_077






Yes





2FH21F_13_079






Yes
Yes




2FH21F_13_082






Yes





2FH21F_13_083






Yes





2FH21F_13_084






Yes





2FH21F_13_088






Yes





2FH21F_13_099






Yes





2FH21F_13_101



Yes
Yes
Yes






2FH21F_13_105






Yes





2FH21F_13_107






Yes





2FH21F_13_108






Yes





2FH21F_13_110
Yes


Yes
Yes







2FH21F_13_111






Yes





2FH21F_13_112






Yes





2FH21F_14_006






Yes





2FH21F_14_008
Yes





Yes





2FH21F_14_010






Yes





2FH21F_14_011



Yes








2FH21F_14_012
Yes
Yes

Yes
Yes
Yes






2FH21F_14_013






Yes





2FH21F_14_015






Yes





2FH21F_14_016
Yes





Yes





2FH21F_14_017






Yes





2FH21F_14_018
Yes
Yes
Yes



Yes





2FH21F_14_026
Yes


Yes
Yes
Yes






2FH21F_14_027






Yes





2FH21F_14_028






Yes





2FH21F_14_033



Yes
Yes







2FH21F_14_035






Yes





2FH21F_14_037



Yes
Yes







2FH21F_14_039
Yes





Yes





2FH21F_14_040






Yes





2FH21F_15_002






Yes





2FH21F_15_004






Yes





2FH21F_15_005






Yes





2FH21F_15_009
Yes





Yes





2FH21F_15_010






Yes





2FH21F_15_011






Yes





2FH21F_15_015
Yes


Yes








2FH21F_15_016






Yes





2FH21F_15_017



Yes








2FH21F_15_018






Yes





2FH21F_15_019






Yes





2FH21F_15_021






Yes





2FH21F_15_024






Yes





2FH21F_15_025
Yes





Yes





2FH21F_15_026






Yes





2FH21F_15_027






Yes





2FH21F_15_030






Yes





2FH21F_15_031






Yes





2FH21F_15_032
Yes
Yes




Yes





2FH21F_15_033






Yes





2FH21F_15_034






Yes





2FH21F_15_038






Yes





2FH21F_15_040






Yes





2FH21F_15_041






Yes





2FH21F_15_042






Yes





2FH21F_15_043






Yes





2FH21F_15_044
Yes





Yes
Yes
Yes
Yes


2FH21F_15_045



Yes
Yes







2FH21F_15_046






Yes





2FH21F_15_047
Yes


Yes








2FH21F_15_048






Yes





2FH21F_15_050






Yes





2FH21F_15_054






Yes





2FH21F_15_057
Yes
Yes




Yes





2FH21F_15_061



Yes








2FH21F_15_068



Yes








2FH21F_15_069






Yes





2FH21F_15_070






Yes





2FH21F_15_074






Yes





2FH21F_15_075






Yes





2FH21F_15_076






Yes





2FH21F_15_077






Yes





2FH21F_15_079



Yes








2FH21F_15_082






Yes





2FH21F_15_083
Yes
Yes

Yes








2FH21F_15_084






Yes
Yes




2FH21F_15_085
Yes
Yes




Yes





2FH21F_15_086






Yes





2FH21F_15_091






Yes





2FH21F_15_092






Yes





2FH21F_15_093






Yes





2FH21F_15_097
Yes
Yes

Yes








2FH21F_15_101






Yes





2FH21F_15_103



Yes








2FH21F_15_106
Yes





Yes





2FH21F_15_107






Yes





2FH21F_15_119






Yes





2FH21F_15_126






Yes





2FH21F_15_128






Yes





2FH21F_15_130






Yes





2FH21F_15_134






Yes





2FH21F_15_135
Yes
Yes
Yes



Yes





2FH21F_15_137






Yes





2FH21F_15_139






Yes





2FH21F_15_142






Yes





2FH21F_15_144






Yes





2FH21F_15_146






Yes
Yes
Yes



2FH21F_15_147



Yes
Yes







2FH21F_15_148






Yes





2FH21F_15_149






Yes
Yes




2FH21F_15_150






Yes





2FH21F_15_151






Yes





2FH21F_15_152






Yes





2FH21F_15_153






Yes





2FH21F_15_156






Yes





2FH21F_15_157
Yes
Yes
Yes



Yes





2FH21F_15_160






Yes





2FH21F_15_165



Yes








2FH21F_15_170



Yes
Yes


Yes




2FH21F_15_175






Yes





2FH21F_15_178






Yes





2FH21F_15_180






Yes





2FH21F_15_182
Yes





Yes





2FH21F_15_191






Yes





2FH21F_15_193






Yes





2FH21F_15_195






Yes





2FH21F_15_196






Yes





2FH21F_15_198






Yes





2FH21F_15_200






Yes





2FH21F_15_209






Yes





2FH21F_15_210






Yes
Yes




2FH21F_15_211






Yes
Yes




2FH21F_15_212



Yes








2FH21F_15_214






Yes





2FH21F_15_217






Yes





2FH21F_15_218






Yes
Yes




2FH21F_15_219
Yes
Yes
Yes
Yes
Yes
Yes






2FH21F_15_220






Yes
Yes
Yes



2FH21F_15_221






Yes
Yes




2FH21F_15_222






Yes





2FH21F_15_223






Yes
Yes




2FH21F_15_228






Yes





2FH21F_15_231






Yes





2FH21F_15_234



Yes
Yes
Yes

Yes




2FH21F_15_236
Yes
Yes
Yes



Yes





2FH21F_15_237
Yes





Yes





2FH21F_15_238
Yes





Yes





2FH21F_15_239






Yes





2FH21F_15_241



Yes
Yes







2FH21F_15_242
Yes
Yes
Yes



Yes





2FH21F_15_243



Yes








2FH21F_15_244






Yes





2FH21F_15_247






Yes
Yes




2FH21F_15_248






Yes





2FH21F_16_004






Yes





2FH21F_16_005
Yes


Yes
Yes







2FH21F_16_006






Yes





2FH21F_16_010






Yes





2FH21F_16_011






Yes
Yes




2FH21F_16_012






Yes
Yes




2FH21F_16_014
Yes





Yes
Yes




2FH21F_16_015



Yes
Yes
Yes






2FH21F_16_016
Yes





Yes
Yes




2FH21F_16_018



Yes
Yes







2FH21F_16_019






Yes





2FH21F_16_021






Yes





2FH21F_16_022
Yes
Yes




Yes





2FH21F_16_023
Yes


Yes
Yes
Yes

Yes




2FH21F_16_024
Yes
Yes

Yes








2FH21F_16_025






Yes





2FH21F_17_004






Yes





2FH21F_17_006
Yes





Yes





2FH21F_17_008



Yes








2FH21F_17_009






Yes





2FH21F_17_010



Yes








2FH21F_17_011
Yes
Yes




Yes





2FH21F_17_012



Yes








2FH21F_17_014
Yes





Yes





2FH21F_17_015






Yes





2FH21F_17_020






Yes





2FH21F_17_021






Yes





2FH21F_17_022
Yes


Yes








2FH21F_17_023



Yes
Yes







2FH21F_18_002



Yes








2FH21F_18_005






Yes
Yes




2FH21F_18_006






Yes





2FH21F_18_007






Yes





2FH21F_18_019



Yes



Yes
Yes



2FH21F_18_020






Yes
Yes
Yes
Yes


2FH21F_18_021



Yes



Yes




2FH21F_18_023






Yes
Yes




2FH21F_18_031






Yes





2FH21F_18_035






Yes





2FH21F_18_042



Yes








2FH21F_18_044






Yes





2FH21F_18_045






Yes
Yes




2FH21F_18_046
Yes
Yes
Yes
Yes
Yes
Yes






2FH21F_18_047






Yes
Yes




2FH21F_18_048






Yes





2FH21F_18_050






Yes





2FH21F_18_051






Yes
Yes




2FH21F_18_054



Yes








2FH21F_18_055






Yes





2FH21F_18_059






Yes
Yes
Yes
Yes


2FH21F_18_060
Yes
Yes




Yes
Yes
Yes



2FH21F_18_061






Yes
Yes
Yes



2FH21F_18_063






Yes





2FH21F_18_065






Yes





2FH21F_18_066






Yes
Yes




2FH21F_18_067
Yes





Yes





2FH21F_18_068






Yes





2FH21F_18_070






Yes





2FH21F_18_071
Yes
Yes
Yes



Yes





2FH21F_18_072






Yes
Yes




2FH21F_18_074






Yes
Yes




2FH21F_18_076






Yes
Yes
Yes
Yes


2FH21F_18_078
Yes





Yes





2FH21F_18_083






Yes
Yes
Yes



2FH21F_18_086






Yes





2FH21F_18_090






Yes





2FH21F_18_094






Yes
Yes
Yes
Yes


2FH21F_18_101






Yes
Yes




2FH21F_18_103






Yes
Yes




2FH21F_18_117






Yes





2FH21F_18_120



Yes








2FH21F_18_122






Yes





2FH21F_18_123
Yes





Yes





2FH21F_18_126
Yes





Yes





2FH21F_18_127






Yes
Yes




2FH21F_18_132






Yes





2FH21F_18_133






Yes





2FH21F_18_136



Yes








2FH21F_18_137






Yes





2FH21F_18_138






Yes





2FH21F_18_139
Yes


Yes








2FH21F_18_141



Yes








2FH21F_18_142






Yes





2FH21F_18_143






Yes





2FH21F_18_144
Yes





Yes





2FH21F_18_145
Yes
Yes
Yes
Yes
Yes
Yes






2FH21F_18_149
Yes
Yes
Yes
Yes
Yes
Yes

Yes




2FH21F_18_151






Yes
Yes
Yes



2FH21F_18_153






Yes





2FH21F_18_154






Yes
Yes
Yes
Yes


2FH21F_18_156






Yes





2FH21F_18_158






Yes





2FH21F_18_159






Yes
Yes




2FH21F_18_160






Yes





2FH21F_18_161
Yes
Yes




Yes





2FH21F_18_162






Yes





2FH21F_18_171






Yes
Yes
Yes
Yes


2FH21F_18_172






Yes





2FH21F_18_173






Yes





2FH21F_18_174






Yes





2FH21F_18_175






Yes





2FH21F_18_176






Yes
Yes
Yes
Yes


2FH21F_18_178
Yes





Yes
Yes
Yes
Yes


2FH21F_18_186






Yes





2FH21F_18_188






Yes
Yes
Yes
Yes


2FH21F_18_190



Yes



Yes
Yes
Yes


2FH21F_18_191
Yes





Yes
Yes
Yes
Yes


2FH21F_18_194






Yes
Yes




2FH21F_18_195



Yes








2FH21F_18_197






Yes





2FH21F_18_198
Yes


Yes
Yes


Yes




2FH21F_18_199






Yes





2FH21F_18_200






Yes





2FH21F_18_201






Yes





2FH21F_18_202






Yes
Yes




2FH21F_18_203



Yes








2FH21F_18_204
Yes





Yes





2FH21F_18_212






Yes





2FH21F_18_213






Yes
Yes




2FH21F_18_216






Yes
Yes
Yes



2FH21F_18_217






Yes





2FH21F_18_219
Yes





Yes





2FH21F_18_223






Yes





2FH21F_18_224






Yes
Yes




2FH21F_18_226






Yes





2FH21F_18_233
Yes
Yes
Yes



Yes
Yes




2FH21F_18_234






Yes





2FH21F_18_241






Yes
Yes




2FH21F_18_243



Yes
Yes


Yes




2FH21F_18_244
Yes





Yes





2FH21F_18_245






Yes





2FH21F_18_252



Yes








2FH21F_18_254






Yes





2FH21F_18_255






Yes





2FH21F_18_260






Yes





2FH21F_18_261






Yes
Yes
Yes



2FH21F_18_262






Yes
Yes
Yes
Yes


2FH21F_18_268






Yes
Yes




2FH21F_18_269



Yes








2FH21F_18_270






Yes
Yes
Yes
Yes


2FH21F_18_271






Yes





2FH21F_18_272






Yes





2FH21F_18_273






Yes
Yes




2FH21F_18_274






Yes





2FH21F_18_275
Yes





Yes





2FH21F_18_276



Yes



Yes
Yes



2FH21F_18_277






Yes
Yes
Yes



2FH21F_18_284






Yes





2FH21F_18_292






Yes





2FH21F_18_293






Yes





2FH21F_18_296



Yes
Yes
Yes






2FH21F_18_300






Yes





2FH21F_18_301






Yes





2FH21F_18_303



Yes








2FH21F_18_304






Yes





2FH21F_18_305






Yes





2FH21F_18_307



Yes








2FH21F_18_314
Yes
Yes

Yes








2FH21F_18_319






Yes
Yes




2FH21F_18_326






Yes
Yes




2FH21F_18_327






Yes





2FH21F_18_328






Yes





2FH21F_18_329
Yes





Yes





2FH21F_18_330






Yes





2FH21F_18_332



Yes
Yes
Yes

Yes
Yes
Yes


2FH21F_18_333






Yes





2FH21F_18_340






Yes





2FH21F_18_344






Yes
Yes




2FH21F_18_346






Yes
Yes
Yes
Yes


2FH21F_18_349
Yes





Yes





2FH21F_18_350
Yes





Yes





2FH21F_18_351



Yes
Yes







2FH21F_18_352






Yes





2FH21F_18_354






Yes





2FH21F_18_355






Yes





2FH21F_18_357






Yes





2FH21F_18_364






Yes
Yes




2FH21F_18_365






Yes





2FH21F_18_369



Yes
Yes







2FH21F_18_370






Yes





2FH21F_18_375






Yes





2FH21F_18_380



Yes
Yes
Yes






2FH21F_18_386
Yes
Yes




Yes





2FH21F_18_388






Yes
Yes




2FH21F_18_398






Yes





2FH21F_18_399






Yes





2FH21F_18_402






Yes





2FH21F_18_403






Yes





2FH21F_18_405






Yes





2FH21F_18_408






Yes





2FH21F_18_409






Yes





2FH21F_18_412






Yes





2FH21F_18_414






Yes





2FH21F_18_415






Yes





2FH21F_18_417



Yes








2FH21F_18_419






Yes





2FH21F_18_427






Yes





2FH21F_18_428






Yes





2FH21F_18_429






Yes





2FH21F_18_430






Yes





2FH21F_18_432






Yes





2FH21F_18_434






Yes





2FH21F_18_435






Yes





2FH21F_18_441






Yes





2FH21F_18_446






Yes





2FH21F_18_457






Yes





2FH21F_18_459






Yes





2FH21F_18_460






Yes
Yes




2FH21F_18_461






Yes





2FH21F_18_462



Yes
Yes







2FH21F_18_463



Yes








2FH21F_18_466






Yes





2FH21F_18_467






Yes
Yes




2FH21F_18_468
Yes
Yes
Yes



Yes
Yes




2FH21F_18_469






Yes





2FH21F_18_470






Yes





2FH21F_18_472



Yes
Yes
Yes






2FH21F_18_474






Yes





2FH21F_18_475



Yes
Yes







2FH21F_18_476






Yes





2FH21F_18_480



Yes
Yes
Yes






2FH21F_18_481
Yes





Yes





2FH21F_18_482



Yes



Yes




2FH21F_18_483
Yes
Yes
Yes



Yes





2FH21F_18_485






Yes





2FH21F_18_490






Yes





2FH21F_18_491



Yes








2FH21F_18_494






Yes





2FH21F_18_497






Yes





2FH21F_18_501






Yes





2FH21F_18_502






Yes





2FH21F_18_503






Yes





2FH21F_18_504






Yes
Yes




2FH21F_18_505






Yes





2FH21F_18_506






Yes





2FH21F_18_508






Yes





2FH21F_18_509






Yes
Yes




2FH21F_18_510






Yes
Yes




2FH21F_18_511



Yes
Yes
Yes

Yes




2FH21F_18_512






Yes





2FH21F_18_513
Yes
Yes




Yes





2FH21F_18_515






Yes





2FH21F_18_516






Yes





2FH21F_18_517






Yes





2FH21F_18_518






Yes





2FH21F_18_519






Yes





2FH21F_18_520






Yes





2FH21F_18_521






Yes
Yes




2FH21F_18_522



Yes
Yes


Yes




2FH21F_18_523
Yes


Yes
Yes
Yes

Yes




2FH21F_18_524






Yes





2FH21F_18_525






Yes





2FH21F_18_526






Yes





2FH21F_18_527






Yes





2FH21F_18_529
Yes


Yes
Yes
Yes

Yes




2FH21F_18_530






Yes
Yes




2FH21F_18_534






Yes





2FH21F_18_535






Yes





2FH21F_18_536






Yes

Yes



2FH21F_18_537



Yes
Yes
Yes






2FH21F_18_538



Yes



Yes




2FH21F_18_539
Yes





Yes





2FH21F_18_543



Yes








2FH21F_18_545






Yes





2FH21F_18_548






Yes
Yes
Yes



2FH21F_18_549






Yes
Yes




2FH21F_18_555






Yes





2FH21F_18_565






Yes
Yes




2FH21F_18_566






Yes
Yes
Yes



2FH21F_18_567






Yes
Yes




2FH21F_18_570






Yes





2FH21F_18_571






Yes





2FH21F_18_574






Yes





2FH21F_18_576
Yes





Yes





2FH21F_18_577






Yes
Yes
Yes



2FH21F_18_579






Yes





2FH21F_18_583






Yes





2FH21F_18_585






Yes





2FH21F_18_590






Yes





2FH21F_18_594






Yes
Yes
Yes



2FH21F_19_004






Yes





2FH21F_19_005
Yes


Yes








2FH21F_19_006






Yes





2FH21F_19_007
Yes


Yes








2FH21F_19_010
Yes
Yes

Yes
Yes







2FH21F_19_012



Yes








2FH21F_19_014



Yes
Yes
Yes






2FH21F_19_015






Yes





2FH21F_19_016






Yes
Yes




2FH21F_19_018






Yes
Yes




2FH21F_19_022
Yes





Yes
Yes




2FH21F_19_026






Yes





2FH21F_19_027






Yes
Yes
Yes



2FH21F_19_028






Yes
Yes




2FH21F_19_030






Yes





2FH21F_19_031
Yes
Yes
Yes



Yes
Yes




2FH21F_20_003






Yes





2FH21F_20_004
Yes


Yes








2FH21F_20_006






Yes





2FH21F_20_007
Yes





Yes





2FH21F_20_008






Yes





2FH21F_20_009
Yes


Yes








2FH21F_20_010



Yes








2FH21F_20_011






Yes





2FH21F_20_012



Yes








2FH21F_20_013
Yes
Yes
Yes
Yes








2FH21F_20_014






Yes





2FH21F_20_015






Yes





2FH21F_20_016
Yes





Yes





2FH21F_20_017



Yes








2FH21F_20_018






Yes





2FH21F_20_020






Yes





2FH21F_22_012






Yes





2FH21F_22_016






Yes





2FH21F_22_017






Yes





2FH21F_22_018






Yes





2FH21F_22_019






Yes





2FH21F_22_021
Yes
Yes




Yes





2FH21F_22_025
Yes





Yes





2FH21F_22_026






Yes





2FH21F_22_028
Yes





Yes





2FH21F_22_029






Yes





2FH21F_22_030






Yes





2FH21F_22_035






Yes





2FH21F_22_036



Yes








2FH21F_22_037



Yes








2FH21F_22_040






Yes





2FH21F_22_042






Yes





2FH21F_22_043






Yes





2FH21F_22_044






Yes





2FH21F_22_047






Yes





2FH21F_22_048






Yes





2FH21F_22_051






Yes





2FH21F_22_055






Yes





2FH21F_22_056






Yes





2FH21F_22_057



Yes








2FH21F_22_059






Yes





2FH21F_22_061






Yes





2FH21F_22_062






Yes





2FH21F_22_067






Yes
Yes




2FH21F_22_068
Yes


Yes








2FH21F_22_073
Yes





Yes





2FH21F_22_074
Yes


Yes



Yes




2FH21F_22_075






Yes





2FH21F_22_076
Yes





Yes





2FH21F_22_077






Yes





2FH21F_22_078






Yes





2FH21F_22_079



Yes
Yes







2FH21F_22_080
Yes


Yes








2FH21F_22_081



Yes








2FH21F_22_082






Yes





2FH21F_22_085



Yes











*Experiment 3, Tier IV sequence sets have not been tested on plasma samples.






Multiplex Scheme


Provided in Table 14 below is a multiplex scheme with a subset of nucleotide sequence sets that perform well. The multiplex scheme was designed by first including top-performing sequence sets from DNA Sets 1 and 3 from Experiment 3 and replexing these sets. This approach ensures that these top-performing sets are included in a design and are more highly represented in a single multiplex scheme. Next, a “superplex” was performed. Superplexing takes an existing assay (in this case, the top-performing replex from DNA Sets 1 and 3) and adds additional top-performing sequence sets to fill in to a desired plex level (in this case 56 sequence sets). This approach optimizes markers in a consolidated mulitplex scheme. When designing the multiplex schemes, those markers that are in close proximity (<1000 bases) and may co-amplify are not included in the same, single multiplex reaction. In Table 14, the WELL corresponds to those sequence sets included in the same single reaction, i.e., all of the sequence sets from well W1 are assayed in the same single reaction.









TABLE 14







Multiplex Scheme

















SEQ

SEQ

SEQ





ID

ID

ID


WELL
MARKER_ID
PCR Primer 1
NO:
PCR Primer 2
NO:
Extension Primer
NO:





W1
2FH21F_01_030
GTACTCAAATCAAATTGGC
5010
GAGGCAACTAGGACTTAAGG
5066
TCAAATTGGCTTACTTGC
5122





W1
2FH21F_02_075
GAAAAAAGTGCATGTCTTTG
5011
AGATTATGATGCACTGGCCT
5067
TGATGAATGCAGTGAAGTC
5123





W1
2FH21F_02_107
CCCAGATGAAGGGGTTTTAG
5012
GGAAAGTTAGAAGGCCACAC
5068
GTTTTAGTATTGAATTTAG
5124








TGCTTAG






W1
2FH21F_02_148
AAGACCAAGATTCAGAAGC
5013
TTGTTGCTCCAAGTTTAAG
5069
GCAGGGCTATGCGGGAG
5125





W1
2FH21F_05_006
GTGAATTCTTCCCACTTCTC
5014
GTTTTCCCATATCTAGATGTC
5070
CACTTCTCACTTATCATCT
5126








G






W1
2FH21F_06_114
GAGAATTAAAATGAACTGAG
5015
TACTTAATCCTTTTGCCTC
5071
GAATTAAAATGAACTGAGG
5127








ATTTC






W1
2FH21F_06_165
GGTACCACTCATCCATAAAC
5016
GGGCTGTTTCAATGAGGGAC
5072
TCCATAAACACCAACACT
5128





W1
2FH21F_06_219
ACCCTCAGTACCACTATCTC
5017
CTTGTATTAAAAGAAGTGG
5073
CCTCAGTACCACTATCTCA
5129








ATCTT






W1
2FH21F_06_224
CAAGGATTCCAGTACTGGAG
5018
GGAGTCAAGGGAGCATTTTA
5074
CCAGTACTGGAGAATGTCT
5130





W1
2FH21F_09_007
CATATTTGTCTGTGTACTTG
5019
GAGGCAAACATTATACACAC
5075
TTGTCTGTGTACTTGTGCT
5131








CT






W1
2FH21F_11_022
GGAATGTTCCACCTTTCTAC
5020
ACTGAAGTCATTCATTAGG
5076
AATGTTCCACCTTTCTACC
5132








TTTTTTT






W1
2FH21F_12_052
CTTCAAGGCAATCTTTCTCC
5021
GCAGGTTCACAGGAAGTTTC
5077
GCAATCTTTCTCCATAAAC
5133








ATA






W1
2FH21F_12_074
ACCAGCTACATCTAGATTAC
5022
CTGTGAGGCCAATGCAAATG
5078
GCTACATCTAGATTACAAG
5134








CCTTAT






W1
2FH21F_18_094
AGCTCCGCTTTGATTTCAGG
5023
GTGGCTATGAAAGACAGCCT
5079
TTGATTTCAGGCTTCATAG
5135








TTTG






W1
2FH21F_18_171
TTCCTGATGATAATCTTCCC
5024
GGGAAGATCTTAAAGGGAGC
5080
TATAGCCAATAAATTACTC
5136








TTATTTTA






W1
2FH21F_18_176
AACGGCCAGGGTGGACACT
5025
ACACCACATTTCTACCACTG
5081
GCCAGGGTGGACACTGTTA
5137








CT






W1
2FH21F_18_191
GATGCTTCTAAGGACCATGT
5026
TGATACAGAAATGTCAACCC
5082
GGACCATGTAATTTCTTTA
5138








ATTC






W1
2FH21F_18_262
CCATAGCAAGATGAATTCAC
5027
CTCCCCAAAGTCTCAGATAG
5083
CAAGATGAATTCACTTAAC
5139








GAAGTT






W2
2FH21F_01_041
CACCAGTATCAGCAATAGCTT
5028
GGAACAGTGTTGATAAAGACT
5084
TCAGCAATAGCTTTGACTT
5140





W2
2FH21F_02_091
GTGCCTAAGGACAACTTTTTC
5029
CCAAATTTTCAAGCAAAGC
5085
GGACAACTTTTTCTTTTTC
5141








TTCT






W2
2FH21F_05_003
GAACCATGGTTTGGGTTTAC
5030
GAAGTGGCCTATCAGGTCT
5086
CTGTTCTATTACAGTGTTC
5142








TTC






W2
2FH21F_05_033
AATAAAGTCCAGAGTATGGC
5031
GGACTTTGGCACCCAAGGA
5087
AGAGTATGGCTGGGAATT
5143





W2
2FH21F_07_166
ATTCCAAGGGCTATCTCCAC
5032
TTCCTACCTCACTTGGCTTC
5088
CCGGCTCTGAACGCCTC
5144





W2
2FH21F_07_202
GCTGGATACCTAATTAATGC
5033
GTTACACTGCAAAGCATTTC
5089
GAACCAAACAAGGAAAATA
5145








C






W2
2FH21F_07_464
AGGTAGTTCTCTAAGTTAC
5034
GGCAAACATAATTTGGATGGG
5090
AGGTAGTTCTCTAAGTTAC
5146








CAAAATC






W2
2FH21F_09_010
ACAAATATTGACAGGCAGCA
5035
CTGTGTCAAATATGTGACTG
5091
GACAGGCAGCAGATTAT
5147





W2
2FH21F_10_005
GAACAGCTATATTTCAAACCC
5036
TTTCAGACCATTTTTGAAC
5092
AACAGCTATATTTCAAACC
5148








CTTTTTA






W2
2FH21F_12_049
CTTCCTGTGAACCTGCTTTC
5037
AAGAGGGAAGATGACTTTTC
5093
GCTATCTTACTTTTCTTTA
5149








TTCCAC






W2
2FH21F_12_075
GAGGCCAATGCAAATGTAGG
5038
CAGAGGGTAGAAGGGAGGC
5094
GTAATCTAGATGTAGCTGG
5150








TATCA






W2
2FH21F_13_036
CTTATCCTTTGGGTCTTCTC
5039
GAGTTCTAGTTTGGCAAACTT
5095
TTAACCTCTGTTTCAAAAT
5151








ACTGG






W2
2FH21F_13_041
TTGTGTGTAGGATTATGAGC
5040
ATGCTGATGAACCGCACTTC
5096
TGTGTGTAGGATTATGAGC
5152








ATCCATT






W2
2FH21F_15_044
GAATGTAGCTGTTGTTAGGG
5041
CTGGGCAACTGTGAAAAGAC
5097
TGTAGCTGTTGTTAGGGAT
5153








AGGAGA






W2
2FH21F_18_020
TCCCTCTCTCCCTGAAAAAG
5042
GACCAAAGTGTATACATAG
5098
AAAAGAGACACATTTGCCT
5154








TTG






W2
2FH21F_18_076
GACTAGGTTACTGAGCAAGG
5043
CCTTTTAAAATATGCACGAG
5099
GTTACTGAGCAAGGAAAAT
5155








AA






W2
2FH21F_18_154
TTAGATTGTTATCCCCACT
5044
TAAATGAGCAGAGACTCAAG
5100
TGTTATCCCCACTTCTTTA
5156








A






W2
2FH21F_18_190
AAGAACTCCAGGGCTACTTG
5045
AAAGCTTTAACAAGTTGGCG
5101
AGGGCTACTTGAACAATT
5157





W2
2FH21F_18_270
TGGTTCTCAACACTGACCAC
5046
GTTGTGACTATTGTTATAG
5102
CCACTAGTATTAACATACA
5158








GTTTA






W2
2FH21F_18_332
ATGTAGGCATTGTAATGAGG
5047
GACTTGAATTTAACTGCTCC
5103
AATGAGGTTTTTGGTCTTT
5159








G






W2
2FH21F_18_346
GATAACATAAGATTAGGAAC
5048
AACTTGCCTTCAAGATCTG
5104
ACATAAGATTAGGAACAAG
5160








AATA






W3
2FH21F_02_076
GATTATGATGCACTGGCCTG
5049
GAAAAAAGTGCATGTCTTTG
5105
GACTTCACTGCATTCATCA
5161








GC






W3
2FH21F_02_089
CTGAAGAAGTGTAAAAATGGC
5050
GTCTACCAAACTACAATTAG
5106
GGCAACATGCATATAGAG
5162





W3
2FH21F_02_111
CTGCTAACTCAGATACCTGC
5051
CTTTCCAAAAACCCACAATC
5107
CAGATACCTGCATGTCA
5163





W3
2FH21F_02_116
GTCTCACATCCCATTTACAG
5052
AGGGCTGCAGGGACAGTAG
5108
CCCATTTACAGTTTATGTG
5164








TCAGCTAC






W3
2FH21F_02_254
TCAATTAGAAATCTAGTGC
5053
TATTTTTATTTCCAATGTAG
5109
CAATTAGAAATCTAGTGCA
5165








AAAGAAT






W3
2FH21F_03_005
TATATAATACTTAGTTTTGG
5054
TCATCCCCATTTCTCAACTC
5110
ATACTTAGTTTTGGTCATC
5166








AA






W3
2FH21F_03_022
TTCCTTTATGGGAGGAGGAG
5055
GCTGATCAAGGCAGTTTTTC
5111
TTTCTTTCTATGTCTTTGG
5167








TTAT






W3
2FH21F_05_027
ATTGGCCAACATCTCAACAG
5056
TTTAGCATTCCCAGACTCAG
5112
ACATCTCAACAGAGTTACA
5168





W3
2FH21F_05_061
GTGTGCTTGCCTCCTAATTT
5057
ACTGTTATGTACATTATATC
5113
CCTCCTAATTTAAAATACT
5169








GTATTC






W3
2FH21F_06_218
GAAAGTTCTTGTATTAAAAG
5058
ACCCTCAGTACCACTATCTC
5114
AAGTTCTTGTATTAAAAGA
5170








AGTGG






W3
2FH21F_06_238
TGTTCTTGGTTGACTTTAC
5059
TGTGTGCAAGGCTCTAGAAG
5115
AACAGAGAAAATTAAAATC
5171








AAACA






W3
2FH21F_07_071
CTTTTACCAGTTATCTTCC
5060
CCAAGGTTGCTTATAAACAG
5116
CTTCATTGCTTTCACTTTT
5172








C






W3
2FH21F_07_465
CATGGGCAAACATAATTTGG
5061
GTTCTCTAAGTTACCAAAATC
5117
CAAACATAATTTGGATGGG
5173








TCT






W3
2FH21F_11_028
CTGTGTCAATGGCACATCTG
5062
GTATATATAACTCCTGATC
5118
TGTGTCAATGGCACATCTG
5174








AATTACT






W3
2FH21F_18_059
ATATTTCAAGTATCACTATG
5063
CAGCATAGCTTTAATGGTCC
5119
ATTTCAAGTATCACTATGT
5175








ACAATC






W3
2FH21F_18_178
GCATCAGGACAAACTGATGG
5064
TCTGTGACACAGAGCATGAG
5120
CAGCCTAGGTTTTCCTC
5176





W3
2FH21F_18_188
GTGCTATAAAGCTTTAACAAG
5065
AACTCCAGGGCTACTTGAAC
5121
ATAAAGCTTTAACAAGTTG
5177








GCGA









Example 4: Detecting Fetal Chromosomal Abnormalities in Maternal Plasma

Embodiments of a method for detecting the presence or absence of a fetal chromosomal abnormality in a maternal blood sample are described hereafter. The method comprises a) preparing a set of amplified nucleic acid species by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequences in the set is present on different chromosomes, (iii) each nucleotide sequence in the set differs by one or more mismatch nucleotides; (iv) each nucleotide sequence in the set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in the set comprises a nucleotide sequence having the one or more mismatch nucleotides; and b) determining the amount of each amplified nucleic acid species in the set; whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species. Step (a) of the method often involves (1) extraction of nucleic acid from maternal blood, preferably from blood plasma or serum; (2) application of a nucleic acid amplification process to the extracted nucleic acids, where the nucleotide sequences of a set are amplified by a single set of primers; and (3) quantification of the nucleotide sequence amplification products based on the ratio of the specific products. A single assay has duplicate confirmation that utilizes internal controls to identify the presence of trisomy. (See FIG. 1).


The amplification and detection steps (2) and (3) may be performed so as to allow quantitative detection of the fetal-derived DNA in a background of maternal nucleic acid. Assays described herein can be optimized for biological and experimental variability by performing the assays across a number of samples under identical conditions. Likewise, the ratio of nucleotide sequence species can be compared to a standard control representing a ratio of nucleotide sequences from comparable biological samples obtained from pregnant women each carrying a chromosomally normal (euploid) fetus. Also, the ratio of nucleotide sequence species can be determined without amplification, wherein the amount of each species is determined, for example, by a sequencing and/or hybridization reaction.


Example 5: Analytical Models

Given the very high cost and scarcity of plasma samples, not every set of markers can be tested on these samples. Therefore, one can either make the assumption that the assays which show best classification accuracy on the model systems will also work best on plasma, or attempt to infer a conditional distribution of probability of the classification accuracy on plasma based on the observed discriminating power on the model systems.


One of the variables affecting the performance of each paralog region is the actual assay design. Since all the markers are evaluated in the context of a multiplex environment, one needs to investigate the effect of various multiplexing scenarios on the performance of the assays undergoing screening. One way in which this analysis can be accomplished is to compare changes in the following (or combinations thereof):

    • 1) reaction performance (as characterized, e.g., by average extension rate and call rate);
    • 2) significance of differences between population of allele frequencies corresponding to Normal and T21 samples;
    • 3) significance of differences between apparent ethnic bias for both Normal as well as T21 samples;
    • 4) changes in the dependency of the average separation between Normal and T21 allele frequencies as a function of the fraction of T21 contribution; and
    • 5) changes in the information content for each individual assay. This content can be represented by a plurality of metrics, such as Information Gain, Gain Ratio, Gini index, ReliefF index. Graphical methods such as heatmaps can be very useful in the process of comparing multiple metrics.


Finally, for the selection of groups of markers that will be evaluated on plasma samples, one can consider standard metrics from the theory of statistical inference—e.g., true positive rate, false positive rate, true negative rate, false negative rate, positive predictive value, negative predictive value. These metrics can be obtained by applying a plurality of classifiers—e.g., Linear/Quadratic/Mixture Discriminant analysis, NaiveBayes, Neural Networks, Support Vector Machines, Boosting with Decision Trees, which are further described below. The classification accuracy of individual multiplexes or groups of multiplexes can be calculated, in conjecture with various methods of preventing over-fitting—e.g., repeated 10-fold cross-validation or leave-one-out cross validation. For robust estimates of such accuracy, a paired t-test can be applied in order to validate the significance of any observed differences. Comparisons with random selection of multiple assays (as coming from different multiplexes) can also be performed, as well as with “all stars” groups of assays (assays which, though coming from different multiplexes, show highest information content).


Some of the different models and methods that can be employed to analyze the data resulting from the methods and compositions are provided herein. Exemplary models include, but are not limited to, Decision Tree, Support Vector Machine (SVM)—Linear Kernel, Logistic Regression, Adaptive Boosting (AdaBoost), Naïve Bayes, Multilayer Perceptron, and Hidden Markov Model (HMM).


Support Vector Machine (SVM)—Linear Kernel—SVM (linear kernel) analyzes data by mapping the data into a high dimensional feature space, where each coordinate corresponds to one feature of the data items, transforming the data into a set of points in a Euclidean space.


Logistic Regression is used for prediction of the probability of occurrence of an event by fitting data to a logistic curve. It is a generalized linear model used for binomial regression.


AdaBoost is a meta-algorithm, and can be used in conjunction with many other learning algorithms to improve their performance. AdaBoost is adaptive in the sense that subsequent classifiers built are tweaked in favor of those instances misclassified by previous classifiers.


Naïve Bayes is a simple probabilistic classifier based on applying Bayes' theorem (from Bayesian statistics) with strong (naive) independence assumptions. A more descriptive term for the underlying probability model would be “independent feature model”.


Hidden Markov Model (HMM) is defined by a collection of states and transitions from each state to one or more other states, along with a probability for each transition. Specifically, HMM is a double stochastic process with one underlying process (i.e. the sequence of states) that is not observable but may be estimated through a set of data that produce a sequence of observations. HMMs are helpful in treating problems where information is uncertain and/or incomplete. HMMs generally are established in two stages: (1) a training stage, where the stochastic process is estimated through extensive observation, and (2) an application stage where the model may be used in real time to obtain classifications of maximum probability.


Example 6: Examples of Embodiments

Provided hereafter are certain non-limiting examples of some embodiments of the technology.


A1. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and
    • b. determining the amount of each amplified nucleic acid species in each set;
    • whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets.


A2. The method of embodiment Al, wherein the chromosome abnormality is aneuploidy of a target chromosome.


A3. The method of embodiment A2, wherein the target chromosome is chromosome 21.


A4. The method of embodiment A2, wherein the target chromosome is chromosome 18.


A4. The method of embodiment A2, wherein the target chromosome is chromosome 13.


A6. The method of embodiment A2, wherein the target chromosome is chromosome X.


A7. The method of embodiment A2, wherein the target chromosome is chromosome Y.


A8. The method of embodiment A2, wherein each nucleotide sequence in a set is not present in a chromosome other than each target chromosome.


A9. The method of any one of embodiments A1-A8, wherein the extracellular nucleic acid is from blood.


A10. The method of embodiment A9, wherein the extracellular nucleic acid is from blood plasma.


A11. The method of embodiment A9, wherein the extracellular nucleic acid is from blood serum.


A12. The method of any one of embodiments A9-A11, wherein the blood is from a pregnant female subject.


A13. The method of embodiment A12, wherein the extracellular nucleic acid template is from a female subject in the first trimester of pregnancy.


A14. The method of embodiment A12, wherein the extracellular nucleic acid template is from a female subject in the second trimester of pregnancy.


A15. The method of embodiment A12, wherein the extracellular nucleic acid template is from a female subject in the third trimester of pregnancy.


A16. The method of embodiment A12, wherein the extracellular nucleic acid template comprises a mixture of maternal nucleic acid and fetal nucleic acid.


A17(a). The method of embodiment A16, wherein the fetal nucleic acid is about 5% to about 40% of the extracellular nucleic acid; or the number of fetal nucleic acid copies is about 10 copies to about 2000 copies of the total extracellular nucleic acid.


A17(b). The method of embodiment A16, wherein the fetal nucleic acid is greater than about 15% of the extracellular nucleic acid.


A18. The method of embodiment A16 or A17, which comprises determining the fetal nucleic acid concentration in the extracellular nucleic acid.


A19. The method of any one of embodiments A16-A18, which comprises enriching the extracellular nucleic acid for fetal nucleic acid.


A20. The method of any one of embodiments A1-A11, wherein the extracellular nucleic acid comprises a mixture of nucleic acid from cancer cells and nucleic acid from non-cancer cells.


A21. The method of any one of embodiments A1-A20, wherein each nucleotide sequence in a set is substantially identical to each other nucleotide sequence in the set.


A22. The method of embodiment A21, wherein each nucleotide sequence in a set is a paralog sequence.


A22. The method of embodiment A20 or A21, wherein each nucleotide sequence in each set shares about 50%, 60%, 70%, 80% or 90% identity with another nucleotide sequence in the set.


A23. The method of any one of embodiments A1-A22, wherein one or more of the nucleotide sequences are non-exonic.


A24. The method of embodiment A23, wherein one or more of the nucleotide sequences are intronic.


A25. The method of any one of embodiments A1-24, wherein the one or more nucleotide sequence species are selected from the group of nucleotide species shown in Table 4B.


A26. The method of any one of embodiments A1-A25, wherein one or more of the sets comprises two nucleotide sequences.


A27. The method of any one of embodiments A1-A26, wherein one or more of the sets comprises three nucleotide sequences.


A28. The method of any one of embodiments A1-A27, wherein in a set, nucleotide sequence species are on chromosome 21 and chromosome 18.


A29. The method of any one of embodiments A1-A27, wherein in a set, nucleotide sequence species are on chromosome 21 and chromosome 13.


A30. The method of any one of embodiments A1-A27, wherein in a set, nucleotide sequence species are on chromosome 21, chromosome 18 and chromosome 13.


A31. The method of any one of embodiments A1-A27, wherein each nucleotide sequence in all sets is present on chromosome 21, chromosome 18 and chromosome 13.


A32. The method of any one of embodiments A1-A32, wherein the amplification species of the sets are generated in one reaction vessel.


A33. The method of any one of embodiments A1-A33, wherein the amplified nucleic acid species in a set are prepared by a process that comprises contacting the extracellular nucleic acid with one reverse primer and one forward primer.


A34. The method of any one of embodiments A1-A34, wherein the amounts of the amplified nucleic acid species in each set vary by about 50% or less.


A35. The method of any one of embodiments A1-A35, wherein the amounts of the amplified nucleic acid species in each set vary by up to a value that permits detection of the chromosome abnormality with a confidence level of about 95% or more.


A36. The method of any one of embodiments A1-A35, wherein the amounts of the amplified nucleic acid species in each set vary by up to a value that permits detection of the chromosome abnormality with a sensitivity of about 90% or more, and a specificity of about 95% or more.


A37. The method of any one of embodiments A1-A36, wherein the length of each of the amplified nucleic acid species independently is about 30 to about 500 base pairs.


A38. The method of any one of embodiments A1-A37, wherein the amount of each amplified nucleic acid species is determined by primer extension, sequencing, digital PCR, QPCR, mass spectrometry.


A39. The method of any one of embodiments A1-A38, wherein the amplified nucleic acid species are detected by:

    • contacting the amplified nucleic acid species with extension primers,
    • preparing extended extension primers, and
    • determining the relative amount of the one or more mismatch nucleotides by analyzing the extended extension primers.


A40. The method of embodiment A39, wherein the one or more mismatch nucleotides are analyzed by mass spectrometry.


A41. The method of any one of embodiments A1-A40, wherein there are about 4 to about 100 sets.


A42. The method of any one of embodiments A1-A41, wherein the presence or absence of the chromosome abnormality is based on the amounts of the amplified nucleic acid species in 80% or more of the sets.


A43. The method of any one of embodiments A1-A42, wherein the amounts of one or more amplified nucleic acid species are weighted differently than other amplified nucleic acid species for identifying the presence or absence of the chromosome abnormality.


A44. The method of any one of embodiments A1-A43, wherein the number of sets provides a sensitivity of 85% or greater for determining the absence of the chromosome abnormality.


A45. The method of any one of embodiments A1-A43, wherein the number of sets provides a specificity of 85% or greater for determining the presence of the chromosome abnormality.


A46. The method of any one of embodiments A1-A43, wherein the number of sets is determined based on (i) a 85% or greater sensitivity for determining the absence of the chromosome abnormality, and (ii) a 85% or greater specificity for determining the presence of the chromosome abnormality.


A47. The method of any one of embodiments A1-A46, which further comprises determining a ratio between the relative amount of (i) an amplified nucleic acid species and (ii) another amplified nucleic acid species, in each set; and determining the presence or absence of the chromosome abnormality is identified by the ratio.


A48. The method of any one of embodiments A1-A47, wherein the presence or absence of the chromosome abnormality is based on nine or fewer replicates.


A49. The method of embodiment A48, wherein the presence or absence of the chromosome abnormality is based on four replicates.


A50. The method of any one of embodiments A1-A47, wherein the nucleotide sequence species in the sets are not found on chromosome 18 or chromosome 13.


A51. The method of any one of embodiments A1-A47, wherein the nucleotide sequence species in the sets are any described herein, with the proviso that they are not selected from any designated by an asterisk in Table 4A.


A52. The method of any one of embodiments A1-A47, wherein there are about 10 to about 70 sets, and about 10 or more of the sets are selected from Table 14.


A53. The method of embodiment A52, wherein there are about 56 sets, wherein the sets are set forth in Table 14.


B1. A multiplex method for identifying the presence or absence of an abnormality of a target chromosome in a subject, which comprises:

    • a. preparing three or more sets of amplified nucleic acid species by amplifying three or more nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) a nucleotide sequence in a set is present on a target chromosome and at least one other nucleotide sequence in the set is present on one or more reference chromosomes, (iii) the target chromosome is common for all of the sets; (iv) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (v) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (vi) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vii) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and
    • b. determining the amount of each amplified nucleic acid species in each set;
    • c. detecting the presence or absence of a decrease or increase of the target chromosome from the amount of each amplified nucleic acid species in the sets;
    • whereby the presence or absence of the chromosome abnormality is identified based on a decrease or increase of the target chromosome relative to the one or more reference chromosomes.


C1. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. preparing a set of amplified nucleic acid species by amplifying nucleotide sequences from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in the set is present on three or more different chromosomes, (iii) each nucleotide sequence in the set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in the set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in the set comprises a nucleotide sequence having the one or more mismatch nucleotides; and
    • b. determining the amount of each amplified nucleic acid species in the set;
    • whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species.


D1. A method for identifying the presence or absence of a chromosome abnormality associated with cancer in a subject, which comprises:

    • a. preparing a set of amplified nucleic acid species by amplifying nucleotide sequences from nucleic acid template, wherein: (i) the nucleic acid template is from a cell-free sample from a subject and is heterogenous, (ii) each nucleotide sequence in the set is present on chromosome 21, chromosome 18 and chromosome 13, (iii) each nucleotide sequence in the set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in the set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in the set comprises a nucleotide sequence having the one or more mismatch nucleotides; and
    • b. determining the amount of each amplified nucleic acid species in the set; whereby the presence or absence of the chromosome abnormality associated with cancer is identified based on the amount of the amplified nucleic acid species in the set.


E1. A computer program product, comprising a computer usable medium having a computer readable program code embodied therein, said computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, said method comprising:

    • providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • receiving, by the logic processing module, the signal information;
    • calling the presence or absence of a chromosomal abnormality by the logic processing module;
    • organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


E2. A computer program product, comprising a computer usable medium having a computer readable program code embodied therein, said computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, said method comprising:

    • providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • parsing a configuration file into definition data that specifies: the amount of each amplified nucleic acid species in each set of claim Al;
    • receiving, by the logic processing module, the definition data;
    • calling the presence or absence of a chromosomal abnormality by the logic processing module;
    • organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


E3. A computer program product, comprising a computer usable medium having a computer readable program code embodied therein, said computer readable program code adapted to be executed to implement a method for identifying the presence or absence of a chromosome abnormality in a subject, said method comprising:

    • providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • receiving signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • calling the presence or absence of a chromosomal abnormality by the logic processing module;
    • organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


F1. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • b. detecting signal information derived from determining the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • c. receiving, by the logic processing module, the signal information;
    • d. calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • e. organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


F2. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. obtaining a plurality of sets of amplified nucleic acid species prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • b. providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • c. parsing a configuration file into definition data that specifies: the amount of each amplified nucleic acid species;
    • d. receiving, by the logic processing module, the definition data;
    • e. calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • f. organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


F3. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • b. providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module; (please have someone review which modules are needed, or if we need more steps/description)
    • c. parsing a configuration file into definition data that specifies: the amount of each amplified nucleic acid species;
    • d. receiving, by the logic processing module, the definition data;
    • e. calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • f. organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


F4. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. providing signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • b. providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • c. receiving, by the logic processing module, the signal information;
    • d. calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • e. organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


F5. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. providing a system, wherein the system comprises distinct software modules, and wherein the distinct software modules comprise a signal detection module, a logic processing module, and a data display organization module;
    • b. receiving, by the logic processing module, signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • c. calling the presence or absence of a chromosomal abnormality by the logic processing module, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • d. organizing, by the data display organization model in response to being called by the logic processing module, a data display indicating the presence or absence of a chromosome abnormality in the subject.


G1. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. detecting signal information, wherein the signal information represents the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • b. transforming the signal information representing the amount of each amplified nucleic acid species in each set into identification data, wherein the identification data represents the presence or absence of the chromosome abnormality,
      • whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • c. displaying the identification data.


G2. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and
    • b. obtaining a data set of values representing the amount of each amplified nucleic acid species in each set;
    • c. transforming the data set of values representing the amount of each amplified nucleic acid species in each set into identification data, wherein the identification data represents the presence or absence of the chromosome abnormality, whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • d. displaying the identified data.


G3. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. providing signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • b. transforming the signal information indicating the amount of each amplified nucleic acid species in each set into identification data, wherein the identification data represents the presence or absence of the chromosome abnormality,
      • whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • c. displaying the identification data.


G4. A method for identifying the presence or absence of a chromosome abnormality in a subject, which comprises:

    • a. receiving signal information indicating the amount of each amplified nucleic acid species in each of a plurality of sets of amplified nucleic acid species, wherein the plurality of sets of amplified nucleic acid species are prepared by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides;
    • b. transforming the signal information indicating the amount of each amplified nucleic acid species in each set into identification data, wherein the identification data represents the presence or absence of the chromosome abnormality,
      • whereby the presence or absence of the chromosome abnormality is identified based on the amount of the amplified nucleic acid species from two or more sets; and
    • c. displaying the identification data.


H1. A method for transmitting prenatal genetic information to a human pregnant female subject, which comprises:

    • a. identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, wherein the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and
    • b. transmitting the presence or absence of the chromosomal abnormality to the pregnant female subject.


H2. A method for transmitting prenatal genetic information to a human pregnant female subject, which comprises:

    • a. identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, wherein the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set;
      • whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and
    • b. transmitting prenatal genetic information representing the chromosome number in cells in the fetus to the pregnant female subject.


I1. A method for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprises:

    • a. identifying the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, wherein the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and
    • b. providing a medical prescription based on the presence or absence of the chromosomal abnormality to the pregnant female subject.


I2. A method for providing to a human pregnant female subject a medical prescription based on prenatal genetic information, which comprises:

    • a. reporting to a pregnant female subject the presence or absence of a chromosomal abnormality in the fetus of the pregnant female subject, wherein the presence or absence of the chromosomal abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets; and
    • b. providing a medical prescription based on the presence or absence of the chromosome abnormality to the pregnant female subject.


I3. The method of embodiment I1 or I2, wherein the medical prescription is for the pregnant female subject to undergo an amniocentesis procedure.


I4. The method of embodiment I1 or I2, wherein the medical prescription is for the pregnant female subject to undergo another genetic test.


I5. The method of embodiment I1 or I2, wherein the medical prescription is medical advice to not undergo further genetic testing.


J1. A file comprising the presence or absence of a chromosome abnormality in the fetus of a pregnant female subject, wherein the presence or absence of the chromosome abnormality has been determined by preparing a plurality of sets of amplified nucleic acid species by amplifying a plurality of nucleotide sequence sets from extracellular nucleic acid template from placenta-expressed nucleic acid in the blood of the pregnant female subject, of a subject, wherein: (i) the extracellular nucleic acid template is heterogenous, (ii) each nucleotide sequence in a set is present on two or more different chromosomes, (iii) each nucleotide sequence in a set differs by one or more mismatch nucleotides from each other nucleotide sequence in the set; (iv) each nucleotide sequence in a set is amplified at a substantially reproducible level relative to each other nucleotide sequence in the set, (v) the primer hybridization sequences in the extracellular nucleic acid template are substantially identical; and (vi) each amplified nucleic acid species in a set comprises a nucleotide sequence having the one or more mismatch nucleotides; and determining the amount of each amplified nucleic acid species in each set; whereby the presence or absence of the chromosome abnormality is determined based on the amount of the amplified nucleic acid species from two or more sets.


J2. The file of embodiment J1, which is a computer readable file.


J3. The file of embodiment J1, which is a paper file.


J4. The file of embodiment J1, which is a medical record file.


The entirety of each patent, patent application, publication and document referenced herein hereby is incorporated by reference. Citation of the above patents, patent applications, publications and documents is not an admission that any of the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these publications or documents.


Modifications may be made to the foregoing without departing from the basic aspects of the technology. Although the technology has been described in substantial detail with reference to one or more specific embodiments, those of ordinary skill in the art will recognize that changes may be made to the embodiments specifically disclosed in this application, yet these modifications and improvements are within the scope and spirit of the technology.


The technology illustratively described herein suitably may be practiced in the absence of any element(s) not specifically disclosed herein. Thus, for example, in each instance herein any of the terms “comprising,” “consisting essentially of,” and “consisting of” may be replaced with either of the other two terms. The terms and expressions which have been employed are used as terms of description and not of limitation, and use of such terms and expressions do not exclude any equivalents of the features shown and described or portions thereof, and various modifications are possible within the scope of the technology claimed. The term “a” or “an” can refer to one of or a plurality of the elements it modifies (e.g., “a reagent” can mean one or more reagents) unless it is contextually clear either one of the elements or more than one of the elements is described. The term “about” as used herein refers to a value within 10% of the underlying parameter (i.e., plus or minus 10%), and use of the term “about” at the beginning of a string of values modifies each of the values (i.e., “about 1, 2 and 3” is about 1, about 2 and about 3). For example, a weight of “about 100 grams” can include weights between 90 grams and 110 grams. Further, when a listing of values is described herein (e.g., 50%, 60%, 70%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93% or 94%), the listing includes all intermediate values thereof (e.g., 62%, 77%). Thus, it should be understood that although the present technology has been specifically disclosed by representative embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and such modifications and variations are considered within the scope of this technology.


Non-limiting embodiments of the technology are set forth in the claim that follows.










LENGTHY TABLES




The patent application contains a lengthy table section. A copy of the table is available in electronic form from the USPTO web site (). An electronic copy of the table will also be available from the USPTO upon request and payment of the fee set forth in 37 CFR 1.19(b)(3).





Claims
  • 1. (canceled)
  • 2. A multiplex method for identifying the presence or absence of an aneuploidy of a target chromosome in a sample from a pregnant female subject, which comprises: (a) providing a plurality of amplification primer pairs, wherein each amplification primer pair specifically hybridizes with a nucleotide sequence species set, wherein: (i) the nucleotide sequence species of a set are present on two or more different chromosomes, comprising a target chromosome and one or more reference chromosomes not associated with the aneuploidy; (ii) the nucleotide sequence species in a set differ by one or more mismatch nucleotides; and (iii) the nucleotide sequence species of a set are reproducibly amplified by a single pair of amplification primers relative to each other;(b) contacting in one or more reaction vessels under amplification conditions, extracellular nucleic acid of the sample comprising fetally derived and maternally derived nucleic acid with amplification primer pairs, wherein each reaction vessel comprises at least two amplification primer pairs and each amplification primer pair in a reaction vessel amplifies the nucleotide sequence species of a set, thereby producing a plurality of sets of amplified nucleic acid species;(c) determining the amount of each amplified nucleic acid species in each set by detecting the one or more mismatch nucleotides in each amplified nucleic acid species;(d) determining a ratio between the relative amount of (i) an amplified target nucleic acid species and (ii) an amplified reference nucleic acid species, for each set; and(e) identifying the presence or absence of an aneuploidy of a target chromosome based on the ratios from the plurality of sets of amplified nucleic acid species.
  • 3. The method of claim 2, wherein the extracellular nucleic acid is from blood, blood plasma, or blood serum of the pregnant female subject.
  • 4. The method of claim 2, wherein the extracellular nucleic acid is from a female subject in the first trimester of pregnancy, second trimester of pregnancy, or third trimester of pregnancy.
  • 5. The method of claim 2, wherein the fetal nucleic acid is about 5% to about 40% of the extracellular nucleic acid and/or or the number of fetal nucleic acid copies is about 10 copies to about 2000 copies of the total extracellular nucleic acid.
  • 6. The method of claim 2, which comprises enriching the extracellular nucleic acid for fetal nucleic acid.
  • 7. The method of claim 2, which comprises determining the fetal nucleic acid concentration in the extracellular nucleic acid.
  • 8. The method of claim 2, wherein the amounts of the amplified nucleic acid species in each set vary by up to a value that permits detection of the aneuploidy of a target chromosome with a confidence level of about 95% or more.
  • 9. The method of claim 2, wherein the amounts of the amplified nucleic acid species in each set vary by up to a value that permits detection of the aneuploidy of a target chromosome with a sensitivity of about 90% or more, and a specificity of about 95% or more.
  • 10. The method of claim 2, wherein the number of sets of amplified nucleic acid species is based on (i) the number of sets that provides a 85% or greater sensitivity for determining the absence of the aneuploidy of a target chromosome or (ii) the number of sets that provides a 85% or greater specificity for determining the presence of the aneuploidy of a target chromosome; or (i) the number of sets that provides a 85% or greater sensitivity for determining the absence of the aneuploidy of a target chromosome and (ii) the number of sets that provides a 85% or greater specificity for determining the presence of the aneuploidy of a target chromosome.
  • 11. The method of claim 2, wherein the nucleotide sequence species sets have nucleotide sequences corresponding to nucleotide sequences shown in Table 4B, or portions thereof.
  • 12. The method of claim 2, wherein detecting the one or more mismatch nucleotides in each amplified nucleic acid species in a set is by primer extension, sequencing, Q-PCR or mass spectrometry.
  • 13. The method of claim 2, wherein the amounts of one or more amplified nucleic acid species are weighed differently than other amplified nucleic acid species for identifying the presence or absence of the aneuploidy of the target chromosome.
  • 14. The method of claim 2, wherein the plurality of amplification primer pairs are chosen from the primer pairs in Table 14.
  • 15. A kit comprising a plurality of amplification primer pairs for amplifying a nucleotide sequence species of a set chosen from nucleotide sequence species sets shown in Table 4B or portions thereof.
  • 16. The kit of claim 15, wherein the plurality of amplification primer pairs are chosen from the primer pairs in Table 14.
  • 17. The kit of claim 16, comprising 56 amplification primers.
  • 18. The kit of claim 15, wherein the amplification primer pairs are in one reaction vessel.
  • 19. The kit of claim 15, wherein the kit comprises one or more extension primers for discriminating between amplified nucleotide sequence species of a set.
RELATED PATENT APPLICATION(S)

This application is a continuation application of U.S. patent application Ser. No. 15/892,241, filed on Feb. 8, 2018, entitled PROCESSES AND KITS FOR IDENTIFYING ANEUPLOIDY, naming Mathias Ehrich, Guy Del Mistro, Cosmin Deciu, Yong Qing Chen, Ron Michael McCullough and Roger Chan Tim as inventors, and designated by attorney docket no. PLA-6027-CT, which is a continuation application of U.S. patent application Ser. No. 13/518,368, filed on Feb. 6, 2013, entitled PROCESSES AND KITS FOR IDENTIFYING ANEUPLOIDY, naming Mathias Ehrich, Guy Del Mistro, Cosmin Deciu, Yong Qing Chen, Ron Michael McCullough and Roger Chan Tim as applicants and inventors, and designated by attorney docket no. PLA-6027-US, which is a national stage of international patent application no. PCT/US2010/061319 filed on Dec. 20, 2010, entitled PROCESSES AND KITS FOR IDENTIFYING ANEUPLOIDY, naming Mathias Ehrich, Guy Del Mistro, Cosmin Deciu, Yong Qing Chen, Ron Michael McCullough and Roger Chan Tim as applicants and inventors, and designated by Attorney Docket No. SEQ-6027-PC, which claims the benefit of U.S. provisional patent application No. 61/289,370 filed on Dec. 22, 2009, entitled PROCESSES AND KITS FOR IDENTIFYING ANEUPLOIDY, naming Mathias Ehrich, Guy Del Mistro, Cosmin Deciu, Yong Qing Chen, Ron Michael McCullough and Roger Chan Tim as inventors and designated by Attorney Docket No. SEQ-6027-PV. The entire content of the foregoing patent applications are incorporated herein by reference, including, without limitation, all text, tables and drawings.

Provisional Applications (1)
Number Date Country
61289370 Dec 2009 US
Continuations (2)
Number Date Country
Parent 15892241 Feb 2018 US
Child 17502842 US
Parent 13518368 Feb 2013 US
Child 15892241 US