COMPOSITIONS AND METHODS FOR ONCOLOGY PRECISION ASSAYS

SEQUENCE LISTING

The material in the electronic Sequence Listing is submitted as an XML file entitled 9748-107215-02_Sequence_Listing.xml created on Oct. 6, 2022, which has a file size of 11,734,191 bytes, and is herein incorporated by reference in its entirety.

FIELD OF THE INVENTION

The present invention relates to compositions and methods of preparing a library of target nucleic acid sequences and uses therefor.

BACKGROUND OF THE INVENTION

Advances in cancer therapies have started to provide promising results across oncology. Targeted therapies, immune checkpoint inhibitors, cancer vaccines and T-cell therapies have shown sustainable results in responsive populations over conventional chemotherapies. However, effective identification of responsive candidates and/or monitoring response has proven challenging. The need of a better understanding of the tumor microenvironment, tumor evolution and drug response biomarkers is immediate. Higher-throughput, systematic and standardized assay solutions that can efficiently and effectively detect multiple relevant biomarkers in a variety of sample types are desirable.

BRIEF SUMMARY OF THE INVENTION

In one aspect of the invention compositions are provided for a single stream multiplex determination of actionable oncology biomarkers in a sample. In some embodiments the composition consists of a plurality of primer reagents directed to a plurality of target sequences to rapidly and effectively detect low level targets in the sample. Provided compositions target oncology gene sequences wherein the plurality of gene sequences are selected from targets among DNA hotspot mutation genes, copy number variation (CNV) genes, inter-genetic fusion genes, and intra-genetic fusion genes. Provided compositions maximize detection of key biomarkers, e.g., EGFR, ALK, BRAF, ROS1, HER2, MET, NTRK, and RET from a variety of samples (e.g., FFPE tissue, plasma) in a single-day in an integrated and automated workflow.

In some embodiments the plurality of actionable target genes in a sample determines a change in oncology activity in the sample indicative of a potential diagnosis, prognosis, candidate therapeutic regimen, and/or adverse event. In particular embodiments, provided compositions include a plurality of primer reagents selected from Table A. In some embodiments a multiplex assay comprising compositions of the invention is provided. In some embodiments a test kit comprising compositions of the invention is provided.

In another aspect of the invention, methods are provided for determining actionable oncology biomarkers in a biological sample. Such methods comprise performing multiplex amplification of a plurality of target sequences from a biological sample containing target sequences. Amplification comprises contacting at least a portion of the sample comprising multiple target sequences of interest using a plurality of target-specific primers in the presence of a polymerase under amplification conditions to produce a plurality of amplified target sequences. The methods further comprise detecting the presence of each of the plurality of target oncology sequences, wherein detection of one or more actionable oncology biomarkers as compared with a control sample determines a change in oncology activity in the sample indicative of a potential diagnosis, prognosis, candidate therapeutic regimen, and/or adverse event. The methods described herein utilize compositions of the invention provided herein. In some embodiments target genes are selected from the group consisting of DNA hotspot mutation genes, copy number variation (CNV) genes, inter-genetic fusion genes, and intra-genetic fusion genes. In certain embodiments target genes are selected from the genes of Table 1. In particular embodiments the target genes consist of the genes of Table 1.

Still further, uses of provided compositions and kits comprising provided compositions for analysis of sequences of the nucleic acid libraries are additional aspects of the invention. In some embodiments, analysis of the sequences of the resulting libraries enables detection of low frequency alleles, improved detection of gene fusions and novel fusions, and/or detection of genetic mutations in a sample of interest and/or multiple samples of interest is provided. In certain embodiments, manual, partially automated and fully automated implementations of uses of provided compositions and methods are contemplated. In a particular embodiment, use of provide compositions is implemented in a fully integrated library preparation, templating and sequencing system for genetic analysis of samples. In certain embodiments, uses of provided compositions and method of the invention provide benefit for research and clinical applications including first line testing of tissue and/or plasma specimens as well as ongoing monitoring of specimens for recurrence and/or resistance detection of biomarkers.

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.

Efficient methods for production of targeted libraries encompassing actionable oncology biomarkers from complex samples is desirable for a variety of nucleic acid analyses. The present invention provides, inter alia, methods of preparing libraries of target nucleic acid sequences, allowing for rapid production of highly multiplexed targeted libraries, including unique tag sequences; and resulting library compositions are useful for a variety of applications, including sequencing applications. Provided compositions are designed for the detection of mutations, copy number variations (CNVs), and gene fusions in tissue and plasma derived samples. Provided compositions comprise targeted primer panels and reagents for use in high throughput sample to results next generation workflows for genetic analysis. In particular embodiments, use is implemented on a completely integrated sample to analysis system. Novel features of the invention are set forth with particularity in the appended claims; and a complete understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized.

DESCRIPTION OF THE INVENTION

Section headings used herein are for organizational purposes only and are not to be construed as limiting the described subject matter in any way. All literature and similar materials cited in this application, including but not limited to, patents, patent applications, articles, books, treatises, and internet web pages are expressly incorporated by reference in their entirety for any purpose. When definitions of terms in incorporated references appear to differ from the definitions provided in the present teachings, the definition provided in the present teachings shall control. It will be appreciated that there is an implied “about” prior to the temperatures, concentrations, times, etc discussed in the present teachings, such that slight and insubstantial deviations are within the scope of the present teachings herein. In this application, the use of the singular includes the plural unless specifically stated otherwise. It is noted that, as used in this specification, singular forms “a,” “an,” and “the,” and any singular use of a word, include plural referents unless expressly and unequivocally limited to one referent. Also, the use of “comprise”, “comprises”, “comprising”, “contain”, “contains”, “containing”, “include”, “includes”, and “including” are not intended to be limiting. It is to be understood that both the general description is exemplary and explanatory only and not restrictive of the invention.

Unless otherwise defined, scientific and technical terms used in connection with the invention described herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. Generally, nomenclatures utilized in connection with, and techniques of, cell and tissue culture, molecular biology, and protein and oligo- or polynucleotide chemistry and hybridization used herein are those well-known and commonly used in the art. The practice of the present subject matter may employ, unless otherwise indicated, conventional techniques and descriptions of organic chemistry, molecular biology (including recombinant techniques), cell biology, and biochemistry, which are within the skill of the art. Such conventional techniques include, but are not limited to, preparation of synthetic polynucleotides, polymerization techniques, chemical and physical analysis of polymer particles, preparation of nucleic acid libraries, nucleic acid sequencing and analysis, and the like. Specific illustrations of suitable techniques can be used by reference to the examples provided herein. Other equivalent conventional procedures can also be used. Such conventional techniques and descriptions can be found in standard laboratory manuals such as Genome Analysis: A Laboratory Manual Series (Vols. I-IV), PCR Primer: A Laboratory Manual, and Molecular Cloning: A Laboratory Manual (all from Cold Spring Harbor Laboratory Press), Hermanson, Bioconjugate Techniques, Second Edition (Academic Press, 2008); Merkus, Particle Size Measurements (Springer, 2009); Rubinstein and Colby, Polymer Physics (Oxford University Press, 2003); and the like. As utilized in accordance with embodiments provided herein, the following terms, unless otherwise indicated, shall be understood to have the following meanings:

As used herein, “amplify”, “amplifying” or “amplification reaction” and their derivatives, refer generally to an action or process whereby at least a portion of a nucleic acid molecule (referred to as a template nucleic acid molecule) is replicated or copied into at least one additional nucleic acid molecule. The additional nucleic acid molecule optionally includes sequence that is substantially identical or substantially complementary to at least some portion of the template nucleic acid molecule. A template target nucleic acid molecule may be single-stranded or double-stranded. The additional resulting replicated nucleic acid molecule may independently be single-stranded or double-stranded. In some embodiments, amplification includes a template-dependent in vitro enzyme-catalyzed reaction for the production of at least one copy of at least some portion of a target nucleic acid molecule or the production of at least one copy of a target nucleic acid sequence that is complementary to at least some portion of a target nucleic acid molecule. Amplification optionally includes linear or exponential replication of a nucleic acid molecule. In some embodiments, such amplification is performed using isothermal conditions; in other embodiments, such amplification can include thermocycling. In some embodiments, the amplification is a multiplex amplification that includes simultaneous amplification of a plurality of target sequences in a single amplification reaction. At least some target sequences can be situated on the same nucleic acid molecule or on different target nucleic acid molecules included in a single amplification reaction. In some embodiments, “amplification” includes amplification of at least some portion of DNA- and/or RNA-based nucleic acids, whether alone, or in combination. An amplification reaction can include single or double-stranded nucleic acid substrates and can further include any amplification processes known to one of ordinary skill in the art. In some embodiments, an amplification reaction includes polymerase chain reaction (PCR). In some embodiments, an amplification reaction includes isothermal amplification.

As used herein, “amplification conditions” and derivatives (e.g., conditions for amplification, etc.) generally refers to conditions suitable for amplifying one or more nucleic acid sequences. Amplification can be linear or exponential. In some embodiments, amplification conditions include isothermal conditions or alternatively include thermocyling conditions, or a combination of isothermal and themocycling conditions. In some embodiments, conditions suitable for amplifying one or more target nucleic acid sequences includes polymerase chain reaction (PCR) conditions. Typically, amplification conditions refer to a reaction mixture that is sufficient to amplify nucleic acids such as one or more target sequences, or to amplify an amplified target sequence ligated to one or more adaptors, e.g., an adaptor-ligated amplified target sequence. Generally, amplification conditions include a catalyst for amplification or for nucleic acid synthesis, for example a polymerase; a primer that possesses some degree of complementarity to the nucleic acid to be amplified; and nucleotides, such as deoxyribonucleoside triphosphates (dNTPs) to promote extension of a primer once hybridized to a nucleic acid. Amplification conditions can require hybridization or annealing of a primer to a nucleic acid, extension of the primer and a denaturing step in which the extended primer is separated from the nucleic acid sequence undergoing amplification. Typically, though not necessarily, amplification conditions can include thermocycling. In some embodiments, amplification conditions include a plurality of cycles wherein steps of annealing, extending and separating are repeated. Typically, amplification conditions include cations such as Mg⁺⁺ or Mn⁺⁺ (e.g., MgCl₂, etc.) and can also optionally include various modifiers of ionic strength.

As used herein, “target sequence” “target nucleic acid sequence” or “target sequence of interest” and derivatives, refers generally to any single or double-stranded nucleic acid sequence that can be amplified or synthesized according to the disclosure, including any nucleic acid sequence suspected or expected to be present in a sample. In some embodiments, the target sequence is present in double-stranded form and includes at least a portion of the particular nucleotide sequence to be amplified or synthesized, or its complement, prior to the addition of target-specific primers or appended adaptors. Target sequences can include the nucleic acids to which primers useful in the amplification or synthesis reaction can hybridize prior to extension by a polymerase. In some embodiments, the term refers to a nucleic acid sequence whose sequence identity, ordering or location of nucleotides is determined by one or more of the methods of the disclosure.

The term “portion” and its variants, as used herein, when used in reference to a given nucleic acid molecule, for example a primer or a template nucleic acid molecule, comprises any number of contiguous nucleotides within the length of the nucleic acid molecule, including the partial or entire length of the nucleic acid molecule.

As used herein, “contacting” and its derivatives, when used in reference to two or more components, refers generally to any process whereby the approach, proximity, mixture or commingling of the referenced components is promoted or achieved without necessarily requiring physical contact of such components, and includes mixing of solutions containing any one or more of the referenced components with each other. The referenced components may be contacted in any particular order or combination and the particular order of recitation of components is not limiting. For example, “contacting A with B and C” encompasses embodiments where A is first contacted with B then C, as well as embodiments where C is contacted with A then B, as well as embodiments where a mixture of A and C is contacted with B, and the like. Furthermore, such contacting does not necessarily require that the end result of the contacting process be a mixture including all of the referenced components, as long as at some point during the contacting process all of the referenced components are simultaneously present or simultaneously included in the same mixture or solution. For example, “contacting A with B and C” can include embodiments wherein C is first contacted with A to form a first mixture, which first mixture is then contacted with B to form a second mixture, following which C is removed from the second mixture; optionally A can then also be removed, leaving only B. Where one or more of the referenced components to be contacted includes a plurality (e.g., “contacting a target sequence with a plurality of target-specific primers and a polymerase”), then each member of the plurality can be viewed as an individual component of the contacting process, such that the contacting can include contacting of any one or more members of the plurality with any other member of the plurality and/or with any other referenced component (e.g., some but not all of the plurality of target specific primers can be contacted with a target sequence, then a polymerase, and then with other members of the plurality of target-specific primers) in any order or combination.

As used herein, the term “primer” and its derivatives refer generally to any polynucleotide that can hybridize to a target sequence of interest. In some embodiments, the primer can also serve to prime nucleic acid synthesis. Typically, a primer functions as a substrate onto which nucleotides can be polymerized by a polymerase; in some embodiments, however, a primer can become incorporated into a synthesized nucleic acid strand and provide a site to which another primer can hybridize to prime synthesis of a new strand that is complementary to the synthesized nucleic acid molecule. A primer may be comprised of any combination of nucleotides or analogs thereof, which may be optionally linked to form a linear polymer of any suitable length. In some embodiments, a primer is a single-stranded oligonucleotide or polynucleotide. (For purposes of this disclosure, the terms ‘polynucleotide” and “oligonucleotide” are used interchangeably herein and do not necessarily indicate any difference in length between the two). In some embodiments, a primer is double-stranded. If double stranded, a primer is first treated to separate its strands before being used to prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. A primer must be sufficiently long to prime the synthesis of extension products. Lengths of the primers will depend on many factors, including temperature, source of primer and the use of the method. In some embodiments, a primer acts as a point of initiation for amplification or synthesis when exposed to amplification or synthesis conditions; such amplification or synthesis can occur in a template-dependent fashion and optionally results in formation of a primer extension product that is complementary to at least a portion of the target sequence. Exemplary amplification or synthesis conditions can include contacting the primer with a polynucleotide template (e.g., a template including a target sequence), nucleotides and an inducing agent such as a polymerase at a suitable temperature and pH to induce polymerization of nucleotides onto an end of the target-specific primer. If double-stranded, the primer can optionally be treated to separate its strands before being used to prepare primer extension products. In some embodiments, the primer is an oligodeoxyribonucleotide or an oligoribonucleotide. In some embodiments, the primer can include one or more nucleotide analogs. The exact length and/or composition, including sequence, of the target-specific primer can influence many properties, including melting temperature (Tm), GC content, formation of secondary structures, repeat nucleotide motifs, length of predicted primer extension products, extent of coverage across a nucleic acid molecule of interest, number of primers present in a single amplification or synthesis reaction, presence of nucleotide analogs or modified nucleotides within the primers, and the like. In some embodiments, a primer can be paired with a compatible primer within an amplification or synthesis reaction to form a primer pair consisting or a forward primer and a reverse primer. In some embodiments, the forward primer of the primer pair includes a sequence that is substantially complementary to at least a portion of a strand of a nucleic acid molecule, and the reverse primer of the primer of the primer pair includes a sequence that is substantially identical to at least of portion of the strand. In some embodiments, the forward primer and the reverse primer are capable of hybridizing to opposite strands of a nucleic acid duplex. Optionally, the forward primer primes synthesis of a first nucleic acid strand, and the reverse primer primes synthesis of a second nucleic acid strand, wherein the first and second strands are substantially complementary to each other, or can hybridize to form a double-stranded nucleic acid molecule. In some embodiments, one end of an amplification or synthesis product is defined by the forward primer and the other end of the amplification or synthesis product is defined by the reverse primer. In some embodiments, where the amplification or synthesis of lengthy primer extension products is required, such as amplifying an exon, coding region, or gene, several primer pairs can be created than span the desired length to enable sufficient amplification of the region. In some embodiments, a primer can include one or more cleavable groups. In some embodiments, primer lengths are in the range of about 10 to about 60 nucleotides, about 12 to about 50 nucleotides and about 15 to about 40 nucleotides in length. Typically, a primer is capable of hybridizing to a corresponding target sequence and undergoing primer extension when exposed to amplification conditions in the presence of dNTPS and a polymerase. In some instances, the particular nucleotide sequence or a portion of the primer is known at the outset of the amplification reaction or can be determined by one or more of the methods disclosed herein. In some embodiments, the primer includes one or more cleavable groups at one or more locations within the primer.

As used herein, “target-specific primer” and its derivatives, refers generally to a single stranded or double-stranded polynucleotide, typically an oligonucleotide, that includes at least one sequence that is at least 50% complementary, typically at least 75% complementary or at least 85% complementary, more typically at least 90% complementary, more typically at least 95% complementary, more typically at least 98% or at least 99% complementary, or identical, to at least a portion of a nucleic acid molecule that includes a target sequence. In such instances, the target-specific primer and target sequence are described as “corresponding” to each other. In some embodiments, the target-specific primer is capable of hybridizing to at least a portion of its corresponding target sequence (or to a complement of the target sequence); such hybridization can optionally be performed under standard hybridization conditions or under stringent hybridization conditions. In some embodiments, the target-specific primer is not capable of hybridizing to the target sequence, or to its complement, but is capable of hybridizing to a portion of a nucleic acid strand including the target sequence, or to its complement. In some embodiments, the target-specific primer includes at least one sequence that is at least 75% complementary, typically at least 85% complementary, more typically at least 90% complementary, more typically at least 95% complementary, more typically at least 98% complementary, or more typically at least 99% complementary, to at least a portion of the target sequence itself; in other embodiments, the target-specific primer includes at least one sequence that is at least 75% complementary, typically at least 85% complementary, more typically at least 90% complementary, more typically at least 95% complementary, more typically at least 98% complementary, or more typically at least 99% complementary, to at least a portion of the nucleic acid molecule other than the target sequence. In some embodiments, the target-specific primer is substantially non-complementary to other target sequences present in the sample; optionally, the target-specific primer is substantially non-complementary to other nucleic acid molecules present in the sample. In some embodiments, nucleic acid molecules present in the sample that do not include or correspond to a target sequence (or to a complement of the target sequence) are referred to as “non-specific” sequences or “non-specific nucleic acids”. In some embodiments, the target-specific primer is designed to include a nucleotide sequence that is substantially complementary to at least a portion of its corresponding target sequence. In some embodiments, a target-specific primer is at least 95% complementary, or at least 99% complementary, or identical, across its entire length to at least a portion of a nucleic acid molecule that includes its corresponding target sequence. In some embodiments, a target-specific primer can be at least 90%, at least 95% complementary, at least 98% complementary or at least 99% complementary, or identical, across its entire length to at least a portion of its corresponding target sequence. In some embodiments, a forward target-specific primer and a reverse target-specific primer define a target-specific primer pair that can be used to amplify the target sequence via template-dependent primer extension. Typically, each primer of a target-specific primer pair includes at least one sequence that is substantially complementary to at least a portion of a nucleic acid molecule including a corresponding target sequence but that is less than 50% complementary to at least one other target sequence in the sample. In some embodiments, amplification can be performed using multiple target-specific primer pairs in a single amplification reaction, wherein each primer pair includes a forward target-specific primer and a reverse target-specific primer, each including at least one sequence that substantially complementary or substantially identical to a corresponding target sequence in the sample, and each primer pair having a different corresponding target sequence. In some embodiments, the target-specific primer can be substantially non-complementary at its 3′ end or its 5′ end to any other target-specific primer present in an amplification reaction. In some embodiments, the target-specific primer can include minimal cross hybridization to other target-specific primers in the amplification reaction. In some embodiments, target-specific primers include minimal cross-hybridization to non-specific sequences in the amplification reaction mixture. In some embodiments, the target-specific primers include minimal self-complementarity. In some embodiments, the target-specific primers can include one or more cleavable groups located at the 3′ end. In some embodiments, the target-specific primers can include one or more cleavable groups located near or about a central nucleotide of the target-specific primer. In some embodiments, one of more targets-specific primers includes only non-cleavable nucleotides at the 5′ end of the target-specific primer. In some embodiments, a target specific primer includes minimal nucleotide sequence overlap at the 3′end or the 5′ end of the primer as compared to one or more different target-specific primers, optionally in the same amplification reaction. In some embodiments 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more, target-specific primers in a single reaction mixture include one or more of the above embodiments. In some embodiments, substantially all of the plurality of target-specific primers in a single reaction mixture includes one or more of the above embodiments.

As used herein, the term “adaptor” denotes a nucleic acid molecule that can be used for manipulation of a polynucleotide of interest. In some embodiments, adaptors are used for amplification of one or more target nucleic acids. In some embodiments, the adaptors are used in reactions for sequencing. In some embodiments, an adaptor has one or more ends that lack a 5′ phosphate residue. In some embodiments, an adaptor comprises, consists of, or consist essentially of at least one priming site. Such priming site containing adaptors can be referred to as “primer” adaptors. In some embodiments, the adaptor priming site can be useful in PCR processes. In some embodiments an adaptor includes a nucleic acid sequence that is substantially complementary to the 3′ end or the 5′ end of at least one target sequences within the sample, referred to herein as a gene specific target sequence, a target specific sequence, or target specific primer. In some embodiments, the adaptor includes nucleic acid sequence that is substantially non-complementary to the 3′ end or the 5′ end of any target sequence present in the sample. In some embodiments, the adaptor includes single stranded or double-stranded linear oligonucleotide that is not substantially complementary to an target nucleic acid sequence. In some embodiments, the adaptor includes nucleic acid sequence that is substantially non-complementary to at least one, and preferably some or all of the nucleic acid molecules of the sample. In some embodiments, suitable adaptor lengths are in the range of about 10-75 nucleotides, about 12-50 nucleotides and about 15-40 nucleotides in length. Generally, an adaptor can include any combination of nucleotides and/or nucleic acids. In some aspects, adaptors include one or more cleavable groups at one or more locations. In some embodiments, the adaptor includes sequence that is substantially identical, or substantially complementary, to at least a portion of a primer, for example a universal primer. In some embodiments, adaptors include a tag sequence to assist with cataloguing, identification or sequencing. In some embodiments, an adaptor acts as a substrate for amplification of a target sequence, particularly in the presence of a polymerase and dNTPs under suitable temperature and pH.

As used herein, “polymerase” and its derivatives, generally refers to any enzyme that can catalyze the polymerization of nucleotides (including analogs thereof) into a nucleic acid strand. Typically but not necessarily, such nucleotide polymerization can occur in a template-dependent fashion. Such polymerases can include without limitation naturally occurring polymerases and any subunits and truncations thereof, mutant polymerases, variant polymerases, recombinant, fusion or otherwise engineered polymerases, chemically modified polymerases, synthetic molecules or assemblies, and any analogs, derivatives or fragments thereof that retain the ability to catalyze such polymerization. Optionally, the polymerase can be a mutant polymerase comprising one or more mutations involving the replacement of one or more amino acids with other amino acids, the insertion or deletion of one or more amino acids from the polymerase, or the linkage of parts of two or more polymerases. Typically, the polymerase comprises one or more active sites at which nucleotide binding and/or catalysis of nucleotide polymerization can occur. Some exemplary polymerases include without limitation DNA polymerases and RNA polymerases. The term “polymerase” and its variants, as used herein, also refers to fusion proteins comprising at least two portions linked to each other, where the first portion comprises a peptide that can catalyze the polymerization of nucleotides into a nucleic acid strand and is linked to a second portion that comprises a second polypeptide. In some embodiments, the second polypeptide can include a reporter enzyme or a processivity-enhancing domain. Optionally, the polymerase can possess 5′ exonuclease activity or terminal transferase activity. In some embodiments, the polymerase can be optionally reactivated, for example through the use of heat, chemicals or re-addition of new amounts of polymerase into a reaction mixture. In some embodiments, the polymerase can include a hot-start polymerase and/or an aptamer based polymerase that optionally can be reactivated.

The terms “identity” and “identical” and their variants, as used herein, when used in reference to two or more nucleic acid sequences, refer to similarity in sequence of the two or more sequences (e.g., nucleotide or polypeptide sequences). In the context of two or more homologous sequences, the percent identity or homology of the sequences or subsequences thereof indicates the percentage of all monomeric units (e.g., nucleotides or amino acids) that are the same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 95%, 98% or 99% identity). The percent identity can be over a specified region, when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters described below, or by manual alignment and visual inspection. Sequences are said to be “substantially identical” when there is at least 85% identity at the amino acid level or at the nucleotide level. Preferably, the identity exists over a region that is at least about 25, 50, or 100 residues in length, or across the entire length of at least one compared sequence. A typical algorithm for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977). Other methods include the algorithms of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), and Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), etc. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent hybridization conditions.

The terms “complementary” and “complement” and their variants, as used herein, refer to any two or more nucleic acid sequences (e.g., portions or entireties of template nucleic acid molecules, target sequences and/or primers) that can undergo cumulative base pairing at two or more individual corresponding positions in antiparallel orientation, as in a hybridized duplex. Such base pairing can proceed according to any set of established rules, for example according to Watson-Crick base pairing rules or according to some other base pairing paradigm. Optionally there can be “complete” or “total” complementarity between a first and second nucleic acid sequence where each nucleotide in the first nucleic acid sequence can undergo a stabilizing base pairing interaction with a nucleotide in the corresponding antiparallel position on the second nucleic acid sequence. “Partial” complementarity describes nucleic acid sequences in which at least 20%, but less than 100%, of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, at least 50%, but less than 100%, of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, at least 70%, 80%, 90%, 95% or 98%, but less than 100%, of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. Sequences are said to be “substantially complementary” when at least 85% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, two complementary or substantially complementary sequences are capable of hybridizing to each other under standard or stringent hybridization conditions. “Non-complementary” describes nucleic acid sequences in which less than 20% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. Sequences are said to be “substantially non-complementary” when less than 15% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. In some embodiments, two non-complementary or substantially non-complementary sequences cannot hybridize to each other under standard or stringent hybridization conditions. A “mismatch” is present at any position in the two opposed nucleotides are not complementary. Complementary nucleotides include nucleotides that are efficiently incorporated by DNA polymerases opposite each other during DNA replication under physiological conditions. In a typical embodiment, complementary nucleotides can form base pairs with each other, such as the A-T/U and G-C base pairs formed through specific Watson-Crick type hydrogen bonding, or base pairs formed through some other type of base pairing paradigm, between the nucleobases of nucleotides and/or polynucleotides in positions antiparallel to each other. The complementarity of other artificial base pairs can be based on other types of hydrogen bonding and/or hydrophobicity of bases and/or shape complementarity between bases.

As used herein, “amplified target sequences” and its derivatives, refers generally to a nucleic acid sequence produced by the amplification of/amplifying the target sequences using target-specific primers and the methods provided herein. The amplified target sequences may be either of the same sense (the positive strand produced in the second round and subsequent even-numbered rounds of amplification) or antisense (i.e., the negative strand produced during the first and subsequent odd-numbered rounds of amplification) with respect to the target sequences. For the purposes of this disclosure, amplified target sequences are typically less than 50% complementary to any portion of another amplified target sequence in the reaction.

As used herein, terms “ligating”, “ligation” and derivatives refer generally to the act or process for covalently linking two or more molecules together, for example, covalently linking two or more nucleic acid molecules to each other. In some embodiments, ligation includes joining nicks between adjacent nucleotides of nucleic acids. In some embodiments, ligation includes forming a covalent bond between an end of a first and an end of a second nucleic acid molecule. In some embodiments, for example embodiments wherein the nucleic acid molecules to be ligated include conventional nucleotide residues, the ligation can include forming a covalent bond between a 5′ phosphate group of one nucleic acid and a 3′ hydroxyl group of a second nucleic acid thereby forming a ligated nucleic acid molecule. In some embodiments, any means for joining nicks or bonding a 5′phosphate to a 3′ hydroxyl between adjacent nucleotides can be employed. In an exemplary embodiment, an enzyme such as a ligase can be used.

As used herein, “ligase” and its derivatives, refers generally to any agent capable of catalyzing the ligation of two substrate molecules. In some embodiments, the ligase includes an enzyme capable of catalyzing the joining of nicks between adjacent nucleotides of a nucleic acid. In some embodiments, a ligase includes an enzyme capable of catalyzing the formation of a covalent bond between a 5′ phosphate of one nucleic acid molecule to a 3′ hydroxyl of another nucleic acid molecule thereby forming a ligated nucleic acid molecule. Suitable ligases may include, but not limited to, T4 DNA ligase; T7 DNA ligase; Taq DNA ligase, and E. coli DNA ligase.

As defined herein, a “cleavable group” generally refers to any moiety that once incorporated into a nucleic acid can be cleaved under appropriate conditions. For example, a cleavable group can be incorporated into a target-specific primer, an amplified sequence, an adaptor or a nucleic acid molecule of the sample. In an exemplary embodiment, a target-specific primer can include a cleavable group that becomes incorporated into the amplified product and is subsequently cleaved after amplification, thereby removing a portion, or all, of the target-specific primer from the amplified product. The cleavable group can be cleaved or otherwise removed from a target-specific primer, an amplified sequence, an adaptor or a nucleic acid molecule of the sample by any acceptable means. For example, a cleavable group can be removed from a target-specific primer, an amplified sequence, an adaptor or a nucleic acid molecule of the sample by enzymatic, thermal, photo-oxidative or chemical treatment. In one aspect, a cleavable group can include a nucleobase that is not naturally occurring. For example, an oligodeoxyribonucleotide can include one or more RNA nucleobases, such as uracil that can be removed by a uracil glycosylase. In some embodiments, a cleavable group can include one or more modified nucleobases (such as 7-methylguanine, 8-oxo-guanine, xanthine, hypoxanthine, 5,6-dihydrouracil or 5-methylcytosine) or one or more modified nucleosides (i.e., 7-methylguanosine, 8-oxo-deoxyguanosine, xanthosine, inosine, dihydrouridine or 5-methylcytidine). The modified nucleobases or nucleotides can be removed from the nucleic acid by enzymatic, chemical or thermal means. In one embodiment, a cleavable group can include a moiety that can be removed from a primer after amplification (or synthesis) upon exposure to ultraviolet light (i.e., bromodeoxyuridine). In another embodiment, a cleavable group can include methylated cytosine. Typically, methylated cytosine can be cleaved from a primer for example, after induction of amplification (or synthesis), upon sodium bisulfite treatment. In some embodiments, a cleavable moiety can include a restriction site. For example, a primer or target sequence can include a nucleic acid sequence that is specific to one or more restriction enzymes, and following amplification (or synthesis), the primer or target sequence can be treated with the one or more restriction enzymes such that the cleavable group is removed. Typically, one or more cleavable groups can be included at one or more locations with a target-specific primer, an amplified sequence, an adaptor or a nucleic acid molecule of the sample.

As used herein, “digestion”, “digestion step” and its derivatives, generally refers to any process by which a cleavable group is cleaved or otherwise removed from a target-specific primer, an amplified sequence, an adaptor or a nucleic acid molecule of the sample. In some embodiments, the digestion step involves a chemical, thermal, photo-oxidative or digestive process.

As used herein, the term “hybridization” is consistent with its use in the art, and generally refers to the process whereby two nucleic acid molecules undergo base pairing interactions. Two nucleic acid molecule molecules are said to be hybridized when any portion of one nucleic acid molecule is base paired with any portion of the other nucleic acid molecule; it is not necessarily required that the two nucleic acid molecules be hybridized across their entire respective lengths and in some embodiments, at least one of the nucleic acid molecules can include portions that are not hybridized to the other nucleic acid molecule. The phrase “hybridizing under stringent conditions” and its variants refers generally to conditions under which hybridization of a target-specific primer to a target sequence occurs in the presence of high hybridization temperature and low ionic strength. As used herein, the phrase “standard hybridization conditions” and its variants refers generally to conditions under which hybridization of a primer to an oligonucleotide (i.e., a target sequence), occurs in the presence of low hybridization temperature and high ionic strength. In one exemplary embodiment, standard hybridization conditions include an aqueous environment containing about 100 mm magnesium sulfate, about 500 mM Tris-sulfate at pH 8.9, and about 200 mM ammonium sulfate at about 50-55° C., or equivalents thereof.

As used herein, the term “end” and its variants, when used in reference to a nucleic acid molecule, for example a target sequence or amplified target sequence, can include the terminal 30 nucleotides, the terminal 20 and even more typically the terminal 15 nucleotides of the nucleic acid molecule. A linear nucleic acid molecule comprised of linked series of contiguous nucleotides typically includes at least two ends. In some embodiments, one end of the nucleic acid molecule can include a 3′ hydroxyl group or its equivalent, and can be referred to as the “3′ end” and its derivatives. Optionally, the 3′ end includes a 3′ hydroxyl group that is not linked to a 5′ phosphate group of a mononucleotide pentose ring. Typically, the 3′ end includes one or more 5′ linked nucleotides located adjacent to the nucleotide including the unlinked 3′ hydroxyl group, typically the 30 nucleotides located adjacent to the 3′ hydroxyl, typically the terminal 20 and even more typically the terminal 15 nucleotides. Generally, the one or more linked nucleotides can be represented as a percentage of the nucleotides present in the oligonucleotide or can be provided as a number of linked nucleotides adjacent to the unlinked 3′ hydroxyl. For example, the 3′ end can include less than 50% of the nucleotide length of the oligonucleotide. In some embodiments, the 3′ end does not include any unlinked 3′ hydroxyl group but can include any moiety capable of serving as a site for attachment of nucleotides via primer extension and/or nucleotide polymerization. In some embodiments, the term “3′ end” for example when referring to a target-specific primer, can include the terminal 10 nucleotides, the terminal 5 nucleotides, the terminal 4, 3, 2 or fewer nucleotides at the 3′end. In some embodiments, the term “3′ end” when referring to a target-specific primer can include nucleotides located at nucleotide positions 10 or fewer from the 3′ terminus. As used herein, “5′ end”, and its derivatives, generally refers to an end of a nucleic acid molecule, for example a target sequence or amplified target sequence, which includes a free 5′ phosphate group or its equivalent. In some embodiments, the 5′ end includes a 5′ phosphate group that is not linked to a 3′ hydroxyl of a neighboring mononucleotide pentose ring. Typically, the 5′ end includes to one or more linked nucleotides located adjacent to the 5′ phosphate, typically the 30 nucleotides located adjacent to the nucleotide including the 5′ phosphate group, typically the terminal 20 and even more typically the terminal 15 nucleotides. Generally, the one or more linked nucleotides can be represented as a percentage of the nucleotides present in the oligonucleotide or can be provided as a number of linked nucleotides adjacent to the 5′ phosphate. For example, the 5′ end can be less than 50% of the nucleotide length of an oligonucleotide. In another exemplary embodiment, the 5′ end can include about 15 nucleotides adjacent to the nucleotide including the terminal 5′ phosphate. In some embodiments, the 5′ end does not include any unlinked 5′ phosphate group but can include any moiety capable of serving as a site of attachment to a 3′ hydroxyl group, or to the 3′end of another nucleic acid molecule. In some embodiments, the term “5′ end” for example when referring to a target-specific primer, can include the terminal 10 nucleotides, the terminal 5 nucleotides, the terminal 4, 3, 2 or fewer nucleotides at the 5′end. In some embodiments, the term “5′ end” when referring to a target-specific primer can include nucleotides located at positions 10 or fewer from the 5′ terminus. In some embodiments, the 5′ end of a target-specific primer can include only non-cleavable nucleotides, for example nucleotides that do not contain one or more cleavable groups as disclosed herein, or a cleavable nucleotide as would be readily determined by one of ordinary skill in the art. A “first end” and a “second end” of a polynucleotide refer to the 5′ end or the 3′end of the polynucleotide. Either the first end or second end of a polynucleotide can be the 5′ end or the 3′ end of the polynucleotide; the terms “first” and “second” are not meant to denote that the end is specifically the 5′ end or the 3′ end.

As used herein “tag,” “barcode,” “unique tag” or “tag sequence” and its derivatives, refers generally to a unique short (6-14 nucleotide) nucleic acid sequence within an adaptor or primer that can act as a ‘key’ to distinguish or separate a plurality of amplified target sequences in a sample. For the purposes of this disclosure, a barcode or unique tag sequence is incorporated into the nucleotide sequence of an adaptor or primer. As used herein, “barcode sequence” denotes a nucleic acid fixed sequence that is sufficient to allow for the identification of a sample or source of nucleic acid sequences of interest. A barcode sequence can be, but need not be, a small section of the original nucleic acid sequence on which the identification is to be based. In some embodiments a barcode is 5-20 nucleic acids long. In some embodiments, the barcode is comprised of analog nucleotides, such as L-DNA, LNA, PNA, etc. As used herein, “unique tag sequence” denotes a nucleic acid sequence having at least one random sequence and at least one fixed sequence. A unique tag sequence, alone or in conjunction with a second unique tag sequence, is sufficient to allow for the identification of a single target nucleic acid molecule in a sample. A unique tag sequence can, but need not, comprise a small section of the original target nucleic acid sequence. In some embodiments a unique tag sequence is 2-50 nucleotides or base-pairs, or 2-25 nucleotides or base-pairs, or 2-10 nucleotides or base-pairs in length. A unique tag sequence can comprise at least one random sequence interspersed with a fixed sequence.

As used herein, “comparable maximal minimum melting temperatures” and its derivatives, refers generally to the melting temperature (Tm) of each nucleic acid fragment for a single adaptor or target-specific primer after digestion of a cleavable groups. The hybridization temperature of each nucleic acid fragment generated by an adaptor or target-specific primer is compared to determine the maximal minimum temperature required preventing hybridization of a nucleic acid sequence from the target-specific primer or adaptor or fragment or portion thereof to a respective target sequence. Once the maximal hybridization temperature is known, it is possible to manipulate the adaptor or target-specific primer, for example by moving the location of one or more cleavable group(s) along the length of the primer, to achieve a comparable maximal minimum melting temperature with respect to each nucleic acid fragment to thereby optimize digestion and repair steps of library preparation.

As used herein, “addition only” and its derivatives, refers generally to a series of steps in which reagents and components are added to a first or single reaction mixture. Typically, the series of steps excludes the removal of the reaction mixture from a first vessel to a second vessel in order to complete the series of steps. Generally, an addition only process excludes the manipulation of the reaction mixture outside the vessel containing the reaction mixture. Typically, an addition-only process is amenable to automation and high-throughput.

As used herein, “polymerizing conditions” and its derivatives, refers generally to conditions suitable for nucleotide polymerization. In typical embodiments, such nucleotide polymerization is catalyzed by a polymerase. In some embodiments, polymerizing conditions include conditions for primer extension, optionally in a template-dependent manner, resulting in the generation of a synthesized nucleic acid sequence. In some embodiments, the polymerizing conditions include polymerase chain reaction (PCR). Typically, the polymerizing conditions include use of a reaction mixture that is sufficient to synthesize nucleic acids and includes a polymerase and nucleotides. The polymerizing conditions can include conditions for annealing of a target-specific primer to a target sequence and extension of the primer in a template dependent manner in the presence of a polymerase. In some embodiments, polymerizing conditions can be practiced using thermocycling. Additionally, polymerizing conditions can include a plurality of cycles where the steps of annealing, extending, and separating the two nucleic strands are repeated. Typically, the polymerizing conditions include a cation such as MgCl₂. Generally, polymerization of one or more nucleotides to form a nucleic acid strand includes that the nucleotides be linked to each other via phosphodiester bonds, however, alternative linkages may be possible in the context of particular nucleotide analogs.

As used herein, the term “nucleic acid” refers to natural nucleic acids, artificial nucleic acids, analogs thereof, or combinations thereof, including polynucleotides and oligonucleotides. As used herein, the terms “polynucleotide” and “oligonucleotide” are used interchangeably and mean single-stranded and double-stranded polymers of nucleotides including, but not limited to, 2′-deoxyribonucleotides (nucleic acid) and ribonucleotides (RNA) linked by internucleotide phosphodiester bond linkages, e.g. 3′-5′ and 2′-5′, inverted linkages, e.g. 3′-3′ and 5′-5′, branched structures, or analog nucleic acids. Polynucleotides have associated counter ions, such as H⁺, NH₄⁺, trialkylammonium, Mg²⁺, Na⁺ and the like. An oligonucleotide can be composed entirely of deoxyribonucleotides, entirely of ribonucleotides, or chimeric mixtures thereof. Oligonucleotides can be comprised of nucleobase and sugar analogs. Polynucleotides typically range in size from a few monomeric units, e.g. 5-40, when they are more commonly frequently referred to in the art as oligonucleotides, to several thousands of monomeric nucleotide units, when they are more commonly referred to in the art as polynucleotides; for purposes of this disclosure, however, both oligonucleotides and polynucleotides may be of any suitable length. Unless denoted otherwise, whenever a oligonucleotide sequence is represented, it will be understood that the nucleotides are in 5′ to 3′ order from left to right and that “A” denotes deoxyadenosine, “C” denotes deoxycytidine, “G” denotes deoxyguanosine, “T” denotes thymidine, and “U’ denotes deoxyuridine. As discussed herein and known in the art, oligonucleotides and polynucleotides are said to have “5′ ends” and “3′ ends” because mononucleotides are typically reacted to form oligonucleotides via attachment of the 5′ phosphate or equivalent group of one nucleotide to the 3′ hydroxyl or equivalent group of its neighboring nucleotide, optionally via a phosphodiester or other suitable linkage.

As used herein, the term “polymerase chain reaction” (“PCR”) refers to the method of K. B. Mullis U.S. Pat. Nos. 4,683,195 and 4,683,202, hereby incorporated by reference, which describe a method for increasing the concentration of a segment of a polynucleotide of interest in a mixture of genomic DNA without cloning or purification. This process for amplifying the polynucleotide of interest consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired polynucleotide of interest, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands of the double stranded polynucleotide of interest. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the polynucleotide of interest molecule. Following annealing, the primers are extended with a polymerase to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired polynucleotide of interest. The length of the amplified segment of the desired polynucleotide of interest (amplicon) is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of repeating the process, the method is referred to as the “polymerase chain reaction” (hereinafter “PCR”). Because the desired amplified segments of the polynucleotide of interest become the predominant nucleic acid sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified”. As defined herein, target nucleic acid molecules within a sample including a plurality of target nucleic acid molecules are amplified via PCR. In a modification to the method discussed above, the target nucleic acid molecules can be PCR amplified using a plurality of different primer pairs, in some cases, one or more primer pairs per target nucleic acid molecule of interest, thereby forming a multiplex PCR reaction. Using multiplex PCR, it is possible to simultaneously amplify multiple nucleic acid molecules of interest from a sample to form amplified target sequences. It is also possible to detect the amplified target sequences by several different methodologies (e.g., quantitation with a bioanalyzer or qPCR, hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; incorporation of ³²P-labeled deoxynucleotide triphosphates, such as dCTP or dATP, into the amplified target sequence). Any oligonucleotide sequence can be amplified with the appropriate set of primers, thereby allowing for the amplification of target nucleic acid molecules from genomic DNA, cDNA, formalin-fixed paraffin-embedded DNA, fine-needle biopsies and various other sources. In particular, the amplified target sequences created by the multiplex PCR process as disclosed herein, are themselves efficient substrates for subsequent PCR amplification or various downstream assays or manipulations.

As defined herein “multiplex amplification” refers to selective and non-random amplification of two or more target sequences within a sample using at least one target-specific primer. In some embodiments, multiplex amplification is performed such that some or all of the target sequences are amplified within a single reaction vessel. The “plexy” or “plex” of a given multiplex amplification refers generally to the number of different target-specific sequences that are amplified during that single multiplex amplification. In some embodiments, the plexy can be about 12-plex, 24-plex, 48-plex, 96-plex, 192-plex, 384-plex, 768-plex, 1536-plex, 3072-plex, 6144-plex or higher.

Compositions

We have developed a single stream multiplex next generation sequencing workflow for determination of actionable oncology tumor biomarkers in a sample, in order to determine oncology status in a sample. The oncology precision assay compositions and methods of the invention offer a specific and robust solution for biomarker screening for understanding mechanisms involved with tumor immune response. Thus, provided are compositions for multiplex library preparation and use in conjunction with next generation sequencing technologies and workflow solutions (e.g., Ion Torrent™ NGS workflow), manual or automated, to evaluate low level biomarker targets in a variety of sample types to assess oncology status.

Thus, provided are compositions for a single stream multiplex determination of actionable oncology biomarkers in a sample. In some embodiments, the composition consists of a plurality of sets of primer pair reagents directed to a plurality of target sequences to detect low level targets in the sample, wherein the target genes are selected from oncology response genes consisting of the following function: DNA hotspot mutation genes, copy number variation (CNV) genes, inter-genetic fusion genes, and intra-genetic fusion genes. In some embodiments, the target genes are selected from oncology genes consisting of one or more function of Table 1. In some embodiments, the target genes are selected from one or more actionable target genes in a sample that determines a change in oncology activity in the sample indicative of a potential diagnosis, prognosis, candidate therapeutic regimen, and/or adverse event likelihood. In total, the various functions of genes comprising the provided multiplex panel of the invention provide a comprehensive picture recommending actionable approaches to cancer therapy.

In certain embodiments, target oncology sequences are directed to sequences having mutations associated with cancer. In some embodiments, the target sequences or amplified target sequences are directed to sequences having mutations associated with one or more solid tumor cancers selected from the group consisting of head and neck cancers (e.g., HNSCC, nasopharyngeal, salivary gland), brain cancer (e.g., glioblastoma, glioma, gliosarcoma, glioblastoma multiforme, neuroblastoma), breast cancer (e.g., TNBC, trastuzumab resistant HER2+ breast cancer, ER+/HER− breast cancer), gynecological (e.g., uterine, ovarian cancer, cervical cancer, endometrial cancer, fallopian cancer), colorectal cancer, gallbladder cancer, esophageal cancer, gastrointestinal cancer, gastric cancer, bladder cancer, prostate cancer, testicular cancer, urothelial cancer, liver cancer (e.g., hepatocellular, HCC), lung cancer (e.g., non-small cell lung, small cell lung), kidney (renal cell) cancer, pancreatic cancer (e.g., adenocarcinoma, ductal), thyroid cancer, bile duct cancer, pituitary tumor, wilms tumor, kaposi sarcoma, hairy cell carcinoma, osteosarcoma, thymus cancer, skin cancer, melanoma, heart cancer, oral and larynx cancer, neuroblastoma, mesothelioma, and other solid tumors (thymic, bone, soft tissue, oral SCC, myelofibrosis, synovial sarcoma). In one embodiment, the mutations can include substitutions, insertions, inversions, point mutations, deletions, mismatches and translocations. In some embodiments, the target sequences or amplified target sequences are directed to sequences having mutations associated with one or more blood/hematologic cancers selected from the group consisting of multiple myeloma, diffuse large B cell lymphoma (DLBCL), lymphoma, Hodgkins lymphoma, non-Hodgkins lymphoma, follicular lymphoma, leukemia, acute myeloid leukemia (AML), chronic lymphocytic leukemia (CLL), myelodisplastic syndrome. In one embodiment, the mutant biomarker associated with cancer is located in at least one of the genes provided in Table 1.

In some embodiments, one or more mutant oncology sequences are located in at least one of the genes selected from, Table 1. In some embodiments the one or more mutant sequences indicate cancer activity.

In some embodiments the one or more mutant sequences indicate a patient's likelihood to response to a therapeutic agent. In some embodiments, the one or more mutant oncology biomarker sequences indication a patient's likelihood to not be responsive to a therapeutic agent. In certain embodiments, relevant therapeutic agents can be oncology therapies including but not limited to kinase inhibitors, cell signaling inhibitors, checkpoint blockades, T cell therapies, and therapeutic vaccines.

In some embodiments, target sequences or mutant target sequences are directed to mutations associated with cancer. In some embodiments, the target sequences or mutant target sequences are directed to mutations associated with one or more solid tumor cancers selected from the group consisting of head and neck cancers (e.g., HNSCC, nasopharyngeal, salivary gland), brain cancer (e.g., glioblastoma, glioma, gliosarcoma, glioblastoma multiforme, neuroblastoma), breast cancer (e.g., TNBC, trastuzumab resistant HER2+ breast cancer, ER+/HER− breast cancer), gynecological (e.g., uterine, ovarian cancer, cervical cancer, endometrial cancer, fallopian cancer), colorectal cancer, gallbladder cancer, esophageal cancer, gastrointestinal cancer, gastric cancer, bladder cancer, prostate cancer, testicular cancer, urothelial cancer, liver cancer (e.g., hepatocellular, HCC), lung cancer (e.g., non-small cell lung, small cell lung), kidney (renal cell) cancer, pancreatic cancer (e.g., adenocarcinoma, ductal), thyroid cancer, bile duct cancer, pituitary tumor, wilms tumor, kaposi sarcoma, hairy cell carcinoma, osteosarcoma, thymus cancer, skin cancer, melanoma, heart cancer, oral and larynx cancer, neuroblastoma, mesothelioma, and other solid tumors (thymic, bone, soft tissue, oral SCC, myelofibrosis, synovial sarcoma). In one embodiment, the mutations can include substitutions, insertions, inversions, point mutations, deletions, mismatches and translocations. In one embodiment, the mutations can include variation in copy number. In one embodiment, the mutations can include germline or somatic mutations. In some embodiments, the target sequences or amplified target sequences are directed to sequences having mutations associated with one or more blood/hematologic cancers selected from the group consisting of multiple myeloma, diffuse large B cell lymphoma (DLBCL), lymphoma, Hodgkins lymphoma, non-Hodgkins lymphoma, follicular lymphoma, leukemia, acute myeloid leukemia (AML), chronic lymphocytic leukemia (CLL), myelodisplastic syndrome.

In one embodiment, the mutations associated with cancer are located in at least one of the genes provided in Table 1. In some embodiments, mutant target sequences are directed to any one of more of the genes provided in Table 1. In some embodiments, mutant target sequences comprise any one or more amplicon sequences of the genes provided in Table 1. In some embodiments, mutant target sequences consist of any one or more amplicon sequences of the genes provided in Table 1. In some embodiments, mutant target sequences include amplicon sequences of each of the genes provided in Table 1.

In some embodiments, compositions comprise any one or more of oncology target-specific primer pairs provided in Table A. In some embodiments, compositions comprise all of the oncology target-specific primer pairs provided in Table A. In some embodiments, any one or more of the oncology target-specific primer pairs provided in Table A can be used to amplify a target sequence present in a sample as disclosed by the methods described herein.

In some embodiments, the oncology target-specific primers from Table A include 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600 or more, target-specific primer pairs. In some embodiments, the amplified target sequences can include any one or more of the amplified target sequences produced using target-specific primers provided in Table A. In some embodiments, at least one of the target-specific primers associated with cancer is at least 90% identical to at least one nucleic acid sequence produced using target specific primers selected from SEQ ID NOs: 1-1563. In some embodiments, at least one of the target-specific primers associated with oncology is complementary across its entire length to at least one target sequence in a sample. In some embodiments, at least one of the target-specific primers associated with immune response includes a non-cleavable nucleotide at the 3′ end. In some embodiments, the non-cleavable nucleotide at the 3′ end includes the terminal 3′ nucleotide. In one embodiment, the amplified target sequences are directed to one or more individual exons having mutations associated with cancer. In one embodiment, the amplified target sequences are directed to individual exons having a mutation associated with cancer.

Methods

Provided methods of the invention comprise efficient procedures which enable rapid preparation of highly multiplexed libraries suitable for downstream analysis. The methods optionally allow for incorporation of one or more unique tag sequences. Certain methods comprise streamlined, addition-only procedures conveying highly rapid library generation.

Provided herein are methods for determining oncology activity in a sample. In some embodiments, the method comprises multiplex amplification of a plurality of oncology sequences from a biological sample, wherein amplifying comprises contacting at least a portion of the sample with a plurality of sets of primer pair reagents directed to the plurality of target sequences, and a polymerase under amplification conditions, to thereby produce amplified target expression sequences. The method further comprises detecting the presence of a mutation of the one or more target sequences in the sample, wherein a mutation of one or more oncology markers as compared with a control determines a change in oncology activity in the sample. In some embodiments the oncology sequences of the methods are selected from oncology response genes consisting of the following function: DNA hotspot mutation genes, copy number variation (CNV) genes, inter-genetic fusion genes, and intra-genetic fusion genes. In some embodiments, the target genes are selected from oncology genes consisting of one or more function of Table 1. In some embodiments, the target genes are selected from one or more actionable target genes in a sample that determines a change in oncology activity in the sample indicative of a potential diagnosis, prognosis, candidate therapeutic regimen, and/or adverse event likelihood. In total, the various functions of genes comprising the provided multiplex panel of the invention provide a comprehensive picture recommending actionable approaches to cancer therapy.

In certain embodiments, target oncology sequences of the methods are directed to sequences having mutations associated with cancer. In some embodiments, the target sequences or amplified target sequences are directed to sequences having mutations associated with one or more solid tumor cancers selected from the group consisting of head and neck cancers (e.g., HNSCC, nasopharyngeal, salivary gland), brain cancer (e.g., glioblastoma, glioma, gliosarcoma, glioblastoma multiforme, neuroblastoma), breast cancer (e.g., TNBC, trastuzumab resistant HER2+ breast cancer, ER+/HER− breast cancer), gynecological (e.g., uterine, ovarian cancer, cervical cancer, endometrial cancer, fallopian cancer), colorectal cancer, gallbladder cancer, esophageal cancer, gastrointestinal cancer, gastric cancer, bladder cancer, prostate cancer, testicular cancer, urothelial cancer, liver cancer (e.g., hepatocellular, HCC), lung cancer (e.g., non-small cell lung, small cell lung), kidney (renal cell) cancer, pancreatic cancer (e.g., adenocarcinoma, ductal), thyroid cancer, bile duct cancer, pituitary tumor, wilms tumor, kaposi sarcoma, hairy cell carcinoma, osteosarcoma, thymus cancer, skin cancer, melanoma, heart cancer, oral and larynx cancer, neuroblastoma, mesothelioma, and other solid tumors (thymic, bone, soft tissue, oral SCC, myelofibrosis, synovial sarcoma). In one embodiment, the mutations can include substitutions, insertions, inversions, point mutations, deletions, mismatches and translocations. In some embodiments, the target sequences or amplified target sequences are directed to sequences having mutations associated with one or more blood/hematologic cancers selected from the group consisting of multiple myeloma, diffuse large B cell lymphoma (DLBCL), lymphoma, Hodgkins lymphoma, non-Hodgkins lymphoma, follicular lymphoma, leukemia, acute myeloid leukemia (AML), chronic lymphocytic leukemia (CLL), myelodisplastic syndrome. In one embodiment, the mutant biomarker associated with cancer is located in at least one of the genes provided in Table 1.

In some embodiments, one or more mutant oncology sequences of the methods are located in at least one of the genes selected from, Table 1. In some embodiments the one or more mutant sequences indicate cancer activity.

In some embodiments the one or more mutant sequences of the methods indicate a patient's likelihood to response to a therapeutic agent. In some embodiments, the one or more mutant oncology biomarker sequences indication a patient's likelihood to not be responsive to a therapeutic agent. In certain embodiments, relevant therapeutic agents can be oncology therapies including but not limited to kinase inhibitors, cell signaling inhibitors, checkpoint blockades, T cell therapies, and therapeutic vaccines.

In some embodiments, target sequences or mutant target sequences of the methods are directed to mutations associated with cancer. In some embodiments, the target sequences or mutant target sequences of the methods are directed to mutations associated with one or more solid tumor cancers selected from the group consisting of head and neck cancers (e.g., HNSCC, nasopharyngeal, salivary gland), brain cancer (e.g., glioblastoma, glioma, gliosarcoma, glioblastoma multiforme, neuroblastoma), breast cancer (e.g., TNBC, trastuzumab resistant HER2+ breast cancer, ER+/HER− breast cancer), gynecological (e.g., uterine, ovarian cancer, cervical cancer, endometrial cancer, fallopian cancer), colorectal cancer, gallbladder cancer, esophageal cancer, gastrointestinal cancer, gastric cancer, bladder cancer, prostate cancer, testicular cancer, urothelial cancer, liver cancer (e.g., hepatocellular, HCC), lung cancer (e.g., non-small cell lung, small cell lung), kidney (renal cell) cancer, pancreatic cancer (e.g., adenocarcinoma, ductal), thyroid cancer, bile duct cancer, pituitary tumor, wilms tumor, kaposi sarcoma, hairy cell carcinoma, osteosarcoma, thymus cancer, skin cancer, melanoma, heart cancer, oral and larynx cancer, neuroblastoma, mesothelioma, and other solid tumors (thymic, bone, soft tissue, oral SCC, myelofibrosis, synovial sarcoma). In one embodiment, the mutations can include substitutions, insertions, inversions, point mutations, deletions, mismatches and translocations. In one embodiment, the mutations can include variation in copy number. In one embodiment, the mutations can include germline or somatic mutations. In some embodiments, the target sequences or amplified target sequences are directed to sequences having mutations associated with one or more blood/hematologic cancers selected from the group consisting of multiple myeloma, diffuse large B cell lymphoma (DLBCL), lymphoma, Hodgkins lymphoma, non-Hodgkins lymphoma, follicular lymphoma, leukemia, acute myeloid leukemia (AML), chronic lymphocytic leukemia (CLL), myelodisplastic syndrome.

In some embodiments, methods comprise use of any one or more of oncology target-specific primer pairs provided in Table A. In some embodiments, methods comprise use of all of the oncology target-specific primer pairs provided in Table A. In some embodiments, use of any one or more of the oncology target-specific primer pairs provided in Table A can be used to amplify a target sequence present in a sample as disclosed by the methods described herein.

In some embodiments, methods comprise use of the oncology target-specific primers from Table A include 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600 or more, target-specific primer pairs. In some embodiments, methods comprising detection of amplified target sequences can include any one or more of the amplified target sequences produced using target-specific primers provided in Table A. In some embodiments, methods comprise use of at least one of the target-specific primers associated with cancer is at least 90% identical to at least one nucleic acid sequence produced using target specific primers selected from SEQ ID NOs: 1-1563. In some embodiments, at least one of the target-specific primers associated with oncology is complementary across its entire length to at least one target sequence in a sample. In some embodiments, at least one of the target-specific primers associated with immune response includes a non-cleavable nucleotide at the 3′ end. In some embodiments, the non-cleavable nucleotide at the 3′ end includes the terminal 3′ nucleotide. In one embodiment, the amplified target sequences are directed to one or more individual exons having mutations associated with cancer. In one embodiment, the amplified target sequences are of the methods are directed to individual exons having a mutation associated with cancer.

In some embodiments, methods comprise detection and optionally, the identification of clinically actionable markers. As defined herein, the term “clinically actionable marker” includes clinically actionable mutations and/or clinically actionable expression patterns that are known or can be associated by one of ordinary skill in the art with, but not limited to, prognosis for the treatment of cancer. In one embodiment, prognosis for the treatment of cancer includes the identification of mutations and/or expression patterns associated with responsiveness or non-responsiveness of a cancer to a drug, drug combination, or treatment regime. In one embodiment, methods comprise amplification of a plurality of target sequences from a population of nucleic acid molecules linked to, or correlated with, the onset, progression or remission of cancer. In some embodiments, provided methods comprise selective amplification of more than one target sequences in a sample and the detection and/or identification of mutations associated with cancer. In some embodiments, the amplified target sequences include two or more nucleotide sequences of the genes provided in Table 1. In some embodiments, the amplified target sequences can include any one or more the amplified target sequences generated using the target-specific primers provided in Table A. In one embodiment, the amplified target sequences include 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600 or more amplicons of the genes from Table 1.

In one aspect of the invention, methods for preparing a library of target nucleic acid sequences are provided. In some embodiments, methods comprise contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicons; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and optionally one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety and the universal handle sequence does not include the cleavable moiety. In some embodiments where an optional tag sequence is included in at least one adaptor, the cleavable moieties are included in the adaptor sequence flanking either end of the tag sequence.

In one aspect of the invention, methods for preparing a tagged library of target nucleic acid sequences are provided. In some embodiments, methods comprise contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicons; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety, the universal handle sequence does not include the cleavable moiety, and the cleavable moieties are included flanking either end of the tag sequence.

In certain embodiments, the comparable maximal minimum melting temperature of each universal sequence is higher than the comparable maximal minimum melting temperature of each target nucleic acid sequence and each tag sequence present in an adaptor.

In some embodiments, each of the adaptors comprise unique tag sequences as further described herein and each further comprise cleavable groups flanking either end of the tag sequence in each adaptor. In some embodiments wherein unique taq sequences are employed, each generated target specific amplicon sequence includes at least 1 different sequence and up to 10⁷different sequences. In certain embodiments each target specific pair of the plurality of adaptors includes up to 16,777,216 different adaptor combinations comprising different tag sequences.

In some embodiments, methods comprise contacting the plurality of gapped polynucleotide products with digestion and repair reagents simultaneously. In some embodiments, methods comprise contacting the plurality of gapped polynucleotide products sequentially with the digestion then repair reagents.

A digestion reagent useful in the methods provided herein comprises any reagent capable of cleaving the cleavable site present in adaptors, and in some embodiments includes, but is not limited to, one or a combination of uracil DNA glycosylase (UDG). apurinic endonuclease (e.g., APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease III, endonuclease VIII, polynucleotide kinase (PNK), Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta.

A repair reagent useful in the methods provided herein comprises any reagent capable of repair of the gapped amplicons, and in some embodiments includes, but is not limited to, any one or a combination of Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase, SuperFiU DNA polymerase, E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, and/or 9° N DNA ligase.

Thus, in certain embodiments, a digestion and repair reagent comprises any one or a combination of one or a combination of uracil DNA glycosylase (UDG). apurinic endonuclease (e.g., APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease II, endonuclease VIII, polynucleotide kinase (PNK), Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta; and any one or a combination of Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase, SuperFiU DNA polymerase, E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, and/or 9° N DNA ligase. In certain embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), apurinic endonuclease (e.g., APE1), Taq DNA polymerase, Phusion U DNA polymerase, SuperFiU DNA polymerase, T7 DNA ligase. In certain embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), formamidopyrimidine [fapy]-DNA glycosylase (fpg), Phusion U DNA polymerase, Taq DNA polymerase, SuperFiU DNA polymerase, T4 PNK and T7 DNA ligase.

In some embodiments, methods comprise the digestion and repair steps carried out in a single step. In other embodiments, methods comprise the digestion and repair of steps carried out in a temporally separate manner at different temperatures.

In some embodiments methods of the invention are carried out wherein one or more of the method steps is conducted in manual mode. In particular embodiments, methods of the invention are carried out wherein each of the method steps is conducted manually. In some embodiments methods of the invention are carried out wherein one or more of the method steps is conducted in an automated mode. In particular embodiments, methods of the invention are carried wherein each of the method steps is automated. In some embodiments methods of the invention are carried out wherein one or more of the method steps is conducted in a combination of manual and automated modes.

In some embodiments, methods of the invention comprise at least one purification step. For example, in certain embodiments a purification step is carried out only after the second amplification of repaired amplicons. In some embodiments two purification steps are utilized, wherein a first purification step is carried out after the digestion and repair and a second purification step is carried out after the second amplification of repaired amplicons.

In some embodiments a purification step comprises conducting a solid phase adherence reaction, solid phase immobilization reaction or gel electrophoresis. In certain embodiments a purification step comprises separation conducted using Solid Phase Reversible Immobilization (SPRI) beads. In particular embodiments a purification step comprises separation conducted using SPRI beads wherein the SPRI beads comprise paramagnetic beads.

In some embodiments, methods comprise contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicons, then purifying repaired amplicons; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences; and then purifying resulting library. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and optionally one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety and the universal handle sequence does not include the cleavable moiety. In some embodiments where an optional tag sequence is included in at least one adaptor, the cleavable moieties are included in the adaptor sequence flanking either end of the tag sequence. In some embodiments a digestion and repair reagent comprises any one or a combination of one or a combination of uracil DNA glycosylase (UDG). apurinic endonuclease (e.g., APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease III, endonuclease VIII, polynucleotide kinase (PNK), Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta; and any one or a combination of Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase, SuperFiU DNA polymerase, E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, and/or 9° N DNA ligase. In certain embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), apurinic endonuclease (e.g., APE1), Taq DNA polymerase, Phusion U DNA polymerase, SuperFiU DNA polymerase, T7 DNA ligase. In certain embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), formamidopyrimidine [fapy]-DNA glycosylase (fpg), Phusion U DNA polymerase, Taq DNA polymerase, SuperFiU DNA polymerase, T4 PNK and T7 DNA ligase.

In some embodiments, methods comprise contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicons, and purifying repaired amplicons; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences; and then purifying resulting library. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety, the universal handle sequence does not include the cleavable moiety, and cleavable moieties are included in the flanking either end of the tag sequence. In some embodiments a digestion and repair reagent comprises any one or a combination of one or a combination of uracil DNA glycosylase (UDG). apurinic endonuclease (e.g., APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease III, endonuclease VIII, polynucleotide kinase (PNK), Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta; and any one or a combination of Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase, SuperFiU DNA polymerase, E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, and/or 9° N DNA ligase. In certain embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), apurinic endonuclease (e.g., APE1), Taq DNA polymerase, Phusion U DNA polymerase, SuperFiU DNA polymerase, T7 DNA ligase. In certain embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), formamidopyrimidine [fapy]-DNA glycosylase (fpg), Phusion U DNA polymerase, Taq DNA polymerase, SuperFiU DNA polymerase, T4 PNK and T7 DNA ligase.

In certain embodiments methods of the invention are carried out in a single, addition only workflow reaction, allowing for rapid production of highly multiplexed targeted libraries. For example, in one embodiment, methods for preparing a library of target nucleic acid sequences comprise contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicons; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences, and purifying the resulting library. In certain embodiments the purification comprises a single or repeated separating step that is carried out following production of the library following the second amplification; and wherein the other method steps are conducted in a single reaction vessel without requisite transferring of a portion (aliquot) of any of the products generated in steps to another reaction vessel. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and optionally one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety and the universal handle sequence does not include the cleavable moiety. In some embodiments where an optional tag sequence is included in at least one adaptor, the cleavable moieties are included in the adaptor sequence flanking either end of the tag sequence.

In one embodiment, methods for preparing a library of target nucleic acid sequences comprise contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicon; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences, and purifying the resulting library.

In some embodiments a digestion reagent comprises any one or any combination of: uracil DNA glycosylase (UDG). AP endonuclease (APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease III, endonuclease VIII, polynucleotide kinase, Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta. In certain embodiments a digestion reagent comprises any one or any combination of: uracil DNA glycosylase (UDG). AP endonuclease (APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease III, endonuclease VIII, polynucleotide kinase, Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta wherein the digestion reagent lacks formamidopyrimidine [fapy]-DNA glycosylase (fpg).

In some embodiments a digestion reagent comprises a single-stranded DNA exonuclease that degrades in a 5′-3′ direction. In some embodiments a cleavage reagent comprises a single-stranded DNA exonuclease that degrades abasic sites. In some embodiments herein the digestions reagent comprises an RecJf exonuclease. In particular embodiments a digestion reagent comprises APE1 and RecJf, wherein the cleavage reagent comprises an apurinic/apyrimidinic endonuclease. In certain embodiments the digestion reagent comprises an AP endonuclease (APE1).

In some embodiments a repair reagent comprises at least one DNA polymerase; wherein the gap-filling reagent comprises: any one or any combination of: Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase and/or SuperFi U DNA polymerase. In some embodiments a repair reagent further comprises a plurality of nucleotides.

In some embodiment a repair reagent comprises an ATP-dependent or an ATP-independent ligase; wherein the repair reagent comprises any one or any combination of: E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, 9° N DNA ligase

In certain embodiments a digestion and repair reagent comprises any one or a combination of one or a combination of uracil DNA glycosylase (UDG). apurinic endonuclease (e.g., APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease II, endonuclease VIII, polynucleotide kinase (PNK), Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta; and any one or a combination of Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase, SuperFiU DNA polymerase, E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, and/or 9° N DNA ligase. In particular embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), apurinic endonuclease (e.g., APE1), Taq DNA polymerase, Phusion U DNA polymerase, SuperFiU DNA polymerase, T7 DNA ligase. In certain embodiments a purification comprises a single or repeated separating step that is carried out following production of the library following the second amplification; and wherein method steps are conducted in a single reaction vessel without requisite transferring of a portion of any of the products generated in steps to another reaction vessel until a first purification. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and optionally one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety and the universal handle sequence does not include the cleavable moiety. In some embodiments where an optional tag sequence is included in at least one adaptor, the cleavable moieties are included in the adaptor sequence flanking either end of the tag sequence.

In another embodiment, methods for preparing a tagged library of target nucleic acid sequences are provided comprising contacting a nucleic acid sample with a plurality of adaptors capable of amplification of one or more target nucleic acid sequences in the sample under conditions wherein the target nucleic acid(s) undergo a first amplification; digesting resulting first amplification products to reduce or eliminate resulting primer dimers and prepare partially digested target amplicons, thereby producing gapped, double stranded amplicons. The methods further comprise repairing the partially digested target amplicons; then amplifying the repaired target amplicons in a second amplification using universal primers, thereby producing a library of target nucleic acid sequences, and purifying the resulting library. In certain embodiments a digestion and repair reagent comprises any one or a combination of one or a combination of uracil DNA glycosylase (UDG). apurinic endonuclease (e.g., APE1), RecJf, formamidopyrimidine [fapy]-DNA glycosylase (fpg), Nth endonuclease III, endonuclease VIII, polynucleotide kinase (PNK), Taq DNA polymerase, DNA polymerase I and/or human DNA polymerase beta; and any one or a combination of Phusion DNA polymerase, Phusion U DNA polymerase, SuperFi DNA polymerase, Taq DNA polymerase, Human DNA polymerase beta, T4 DNA polymerase and/or T7 DNA polymerase, SuperFiU DNA polymerase, E. coli DNA ligase, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq DNA ligase, and/or 9° N DNA ligase. In particular embodiments, a digestion and repair reagent comprises any one or a combination of uracil DNA glycosylase (UDG), apurinic endonuclease (e.g., APE1), Taq DNA polymerase, Phusion U DNA polymerase, SuperFiU DNA polymerase, T7 DNA ligase. In certain embodiments the purification comprises a single or repeated separating step that is carried out following production of the library following the second amplification; and wherein steps the other method steps are conducted in a single reaction vessel without requisite transferring of a portion (aliquot) of any of the products generated in steps to another reaction vessel. Each of the plurality of adaptors used in the methods herein comprise a universal handle sequence and a target nucleic acid sequence and a cleavable moiety and one or more tag sequences. At least two and up to one hundred thousand target specific adaptor pairs are included in the provided methods, wherein the target nucleic acid sequence of each adaptor includes at least one cleavable moiety, the universal handle sequence does not include the cleavable moiety, and the cleavable moieties are included flanking either end of the tag sequence.

In some embodiments, adaptor-dimer byproducts resulting from the first amplification of step of the methods are largely removed from the resulting library. In certain embodiments the enriched population of amplified target nucleic acids contains a reduced amount of adaptor-dimer byproduct. In particular embodiments adaptor dimer byproducts are eliminated.

In some embodiments, the library is prepared in less than 4 hours. In some embodiments, the library is prepared, enriched and sequenced in less than 3 hours. In some embodiments, the library is prepared, enriched and sequenced in 2 to 3 hours. In some embodiments, the library is prepared in approximately 2.5 hours. In some embodiments, the library is prepared in approximately 2.75 hours. In some embodiments, the library is prepared in approximately 3 hours.

Compositions

Additional aspects of the invention comprise composition comprising a plurality of nucleic acid adaptors, as well as library compositions prepared according to the methods of the invention. Provided compositions are useful in conjunction with the methods described herein as well as for additional analysis and applications known in the art.

Thus, provided are composition comprising a plurality of nucleic acid adaptors, wherein each of the plurality of adaptors comprises a 5′ universal handle sequence, optionally one or more tag sequences, and a 3′ target nucleic acid sequence wherein each adaptor comprises a cleavable moiety, wherein the target nucleic acid sequence of the adaptor includes at least one cleavable moiety, and when tag sequences are present cleavable moieties are included flanking either end of the tag sequence and wherein the universal handle sequence does not include the cleavable moiety. At least two and up to one hundred thousand target specific adaptor pairs are included in provided compositions. Provided composition allow for rapid production of highly multiplexed targeted libraries.

In some embodiments, provided compositions comprise plurality of nucleic acid adaptors, wherein each of the plurality of adaptors comprise a 5′ universal handle sequence, one or more tag sequences, and a 3′ target nucleic acid sequence wherein each adaptor comprises a cleavable moiety; wherein the target nucleic acid sequence of the adaptor includes at least one cleavable moiety, cleavable moieties are included flanking either end of the tag sequence and the universal handle sequence does not include the cleavable moiety. At least two and up to one hundred thousand target specific adaptor pairs are included in provided compositions. Provided composition allow for rapid production of highly multiplexed, tagged, targeted libraries.

Primer/adaptor compositions may be single stranded or double stranded. In some embodiments adaptor compositions comprise are single stranded adaptors. In some embodiments adaptor compositions comprise double stranded adaptors. In some embodiments adaptor compositions comprise a mixture of single stranded and double stranded adaptors.

In some embodiments, compositions include a plurality of adaptors capable of amplification of one or more target nucleic acid sequences comprising a multiplex of adaptor pairs capable of amplification of at least two different target nucleic acid sequences wherein the target-specific primer sequence is substantially non-complementary to other target specific primer sequences in the composition. In some embodiments, the composition comprises at least 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 750, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3250, 3500, 3750, 4000, 4500, 5000, 5500, 6000, 7000, 8000, 9000, 10000, 11000, or 12000, or more target-specific adaptor pairs. In some embodiments, target-specific adpator pairs comprise about 15 nucleotides to about 40 nucleotides in length, wherein at least one nucleotide is replaced with a cleavable group. In some embodiments the cleavable group is a uridine nucleotide. In some embodiments, the target-specific adaptor pairs are designed to amplify an exon, gene, exome or region of the genome associated with a clinical or pathological condition, e.g., amplification of one or more sites comprising one or more mutations (e.g., driver mutation) associated with a cancer, e.g., lung, colon, breast cancer, etc., or amplification of mutations associated with an inherited disease, e.g., cystic fibrosis, muscular dystrophies, etc. In some embodiments, the target-specific adaptor pairs when hybridized to a target sequence and amplified as provided herein generates a library of adaptor-ligated amplified target sequences that are about 100 to about 600 base pairs in length. In some embodiments, no one adaptor-ligated amplified target sequence is overexpressed in the library by more than 30% as compared to the remainder of other adaptor-ligated amplified target sequences in the library. In some embodiments, an adaptor-ligated amplified target sequence library is substantially homogenous with respect to GC content, amplified target sequence length or melting temperature (Tm) of the respective target sequences.

In some embodiments, the target-specific primer sequences of adaptor pairs in the compositions of the invention are target-specific sequences that can amplify specific regions of a nucleic acid molecule. In some embodiments, the target-specific adaptors can amplify genomic DNA or cDNA. In some embodiments, target-specific adaptors can amplify mammalian nucleic acid, such as, but not limited to human DNA or RNA, murine DNA or RNA, bovine DNA or RNA, canine DNA or RNA, equine DNA or RNA, or any other mammal of interest. In other embodiments, target specific adaptors include sequences directed to amplify plant nucleic acids of interest. In other embodiments, target specific adaptors include sequences directed to amplify infectious agents, e.g., bacterial and/or viral nucleic acids. In some embodiments, the amount of nucleic acid required for selective amplification is from about 1 ng to 1 microgram. In some embodiments, the amount of nucleic acid required for selective amplification of one or more target sequences is about 1 ng, about 5 ng or about 10 ng. In some embodiments, the amount of nucleic acid required for selective amplification of target sequence is about 10 ng to about 200 ng.

As described herein, each of the plurality of adaptors comprises a 5′ universal handle sequence. In some embodiments a universal handle sequence comprises any one or any combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence. In some embodiments the comparable maximal minimum melting temperatures of each adaptor universal handle sequence is higher than the comparable maximal minimum melting temperatures of each target nucleic acid sequence and each tag sequence present in the same adaptor. Preferably, the universal handle sequences of provided adaptors do not exhibit significant complementarity and/or hybridization to any portion of a unique tag sequence and/or target nucleic acid sequence of interest. In some embodiments a first universal handle sequence comprises any one or any combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence. In some embodiments a second universal handle sequence comprises any one or any combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence. In certain embodiments first and second universal handle sequences correspond to forward and reverse universal handle sequences and in certain embodiments the same first and second universal handle sequences are included for each of the plurality of target specific adaptor pairs. Such forward and reverse universal handle sequences are targeted in conjunction with universal primers to carry out a second amplification of repaired amplicons in production of libraries according to methods of the invention. In certain embodiments a first 5′ universal handle sequence comprises two universal handle sequences (e.g., a combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence); and a second 5′ universal sequence comprises two universal handle sequences (e.g., a combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence), wherein the 5′ first and second universal handle sequences do not exhibit significant hybridization to any portion of a target nucleic acid sequence of interest.

The structure and properties of universal amplification primers or universal primers are well known to those skilled in the art and can be implemented for utilization in conjunction with provided methods and compositions to adapt to specific analysis platforms. Universal handle sequences of the adaptors provided herein are adapted accordingly to accommodate a preferred universal primer sequences. For example, e.g., as described herein universal P1 and A primers with optional barcode sequences have been described in the art and utilized for sequencing on Ion Torrent sequencing platforms (Ion Xpress™ Adapters, Thermo Fisher Scientific). Similarly, additional and other universal adaptor/primer sequences described and known in the art (e.g., Illumina universal adaptor/primer sequences can be found, e.g., at https://support.illumina.com/content/dam/illumina-support/documents/documentation/chemistry_documentation/experiment-design/illumina-adapter-sequences_1000000002694-01.pdf; PacBio universal adaptor/primer sequences, can be found, e.g., at https://s3.amazonaws.com/files.pacb.com/pdf/Guide_Pacific_Biosciences_Template_Preparation_and_Sequencing.pdf; etc.) can be used in conjunction with the methods and compositions provided herein. Suitable universal primers of appropriate nucleotide sequence for use with adaptors of the invention are readily prepared using standard automated nucleic acid synthesis equipment and reagents in routine use in the art. One single type of universal primer or separate types (or even a mixture) of two different universal primers, for example a pair of universal amplification primers suitable for amplification of repaired amplicons in a second amplification are included for use in the methods of the invention. Universal primers optionally include a different tag (barcode) sequence, where the tag (barcode) sequence does not hybridize to the adaptor. Barcode sequences incorporated into amplicons in a second universal amplification can be utilized e.g., for effective identification of sample source.

In some embodiments adaptors further comprise a unique tag sequence located between the 5′ first universal handle sequence and the 3′ target-specific sequence, and wherein the unique tag sequence does not exhibit significant complementarity and/or hybridization to any portion of a unique tag sequence and/or target nucleic acid sequence of interest. In some embodiments the plurality of primer adaptor pairs has 104-10⁹different tag sequence combinations. Thus in certain embodiments each generated target specific adaptor pair comprises 104-10⁹different tag sequences. In some embodiments the plurality of primer adaptors comprise each target specific adaptor comprising at least 1 different unique tag sequence and up to 10⁵different unique tag sequences. In some embodiments the plurality of primer adaptors comprise each target specific adaptor comprising at least 1 different unique tag sequence and up to 10⁵different unique tag sequences. In certain embodiments each generated target specific amplicon generated comprises at least two and up to 10⁹different adaptor combinations comprising different tag sequences, each having two different unique tag sequences. In some embodiments the plurality of primer adaptors comprise each target specific adaptor comprising 4096 different tag sequences. In certain embodiments each generated target specific amplicon generated comprises up to 16,777,216 different adaptor combinations comprising different tag sequences, each having two different unique tag sequences.

In some embodiments individual primer adaptors in the plurality of adaptors include a unique tag sequence (e.g., contained in a tag adaptor) comprising different random tag sequences alternating with fixed tag sequences. In some embodiments, the at least one unique tag sequence comprises a at least one random sequence and at least one fixed sequence, or comprises a random sequence flanked on both sides by a fixed sequence, or comprises a fixed sequence flanked on both sides by a random sequence. In some embodiments a unique tag sequence includes a fixed sequence that is 2-2000 nucleotides or base-pairs in length. In some embodiments a unique tag sequence includes a random sequence that is 2-2000 nucleotides or base-pairs in length.

In some embodiments, unique tag sequences include a sequence having at least one random sequence interspersed with fixed sequences. In some embodiments, individual tag sequences in a plurality of unique tags have the structure (N)_n(X)_x(M)_m(Y)_y, wherein “N” represents a random tag sequence that is generated from A, G, C, T, U or I, and wherein “n” is 2-10 which represents the nucleotide length of the “N” random tag sequence; wherein “X” represents a fixed tag sequence, and wherein “x” is 2-10 which represents the nucleotide length of the “X” random tag sequence; wherein “M” represents a random tag sequence that is generated from A, G, C, T, U or I, wherein the random tag sequence “M” differs or is the same as the random tag sequence “N”, and wherein “m” is 2-10 which represents the nucleotide length of the “M” random tag sequence; and wherein “Y” represents a fixed tag sequence, wherein the fixed tag sequence of “Y” is the same or differs from the fixed tag sequence of “X”, and wherein “y” is 2-10 which represents the nucleotide length of the “Y” random tag sequence. In some embodiments, the fixed tag sequence “X” is the same in a plurality of tags. In some embodiments, the fixed tag sequence “X” is different in a plurality of tags. In some embodiments, the fixed tag sequence “Y” is the same in a plurality of tags. In some embodiments, the fixed tag sequence “Y” is different in a plurality of tags. In some embodiments, the fixed tag sequences “(X),” and “(Y)” within the plurality of adaptors are sequence alignment anchors.

In some embodiments, the random sequence within a unique tag sequence is represented by “N”, and the fixed sequence is represented by “X”. Thus, a unique tag sequence is represented by N₁N₂N₃X₁X₂X₃or by N₁N₂N₃X₁X₂X₃N₄N₅N₆X₄X₅X₆. Optionally, a unique tag sequence can have a random sequence in which some or all of the nucleotide positions are randomly selected from a group consisting of A, G, C, T, U and I. For example, a nucleotide for each position within a random sequence is independently selected from any one of A, G, C, T, U or I, or is selected from a subset of these six different types of nucleotides. Optionally, a nucleotide for each position within a random sequence is independently selected from any one of A, G, C or T. In some embodiments, the first fixed tag sequence “X₁X₂X₃” is the same or different sequence in a plurality of tags. In some embodiments, the second fixed tag sequence “X₄X₅X₆” is the same or different sequence in a plurality of tags. In some embodiments, the first fixed tag sequence “X₁X₂X₃” and the second fixed tag sequence “X₄X₅X₆” within the plurality of adaptors are sequence alignment anchors.

In some embodiments, a unique tag sequence comprises the sequence 5′-NNNACTNNNTGA-3′, where “N” represents a position within the random sequence that is generated randomly from A, G, C or T, the number of possible distinct random tags is calculated to be 4⁶(or 4{circumflex over ( )}6) is about 4096, and the number of possible different combinations of two unique tags is 4¹²(or 4{circumflex over ( )}12) is about 16.78 million. In some embodiments, the underlined portions of 5′-NNNACTNNNTGA-3′ are a sequence alignment anchor.

In some embodiments, the fixed sequences within the unique tag sequence is a sequence alignment anchor that can be used to generate error-corrected sequencing data. In some embodiments fixed sequences within the unique tag sequence is a sequence alignment anchor that can be used to generate a family of error-corrected sequencing reads.

Adaptors provided herein comprise at least one cleavable moiety. In some embodiments a cleavable moiety is within the 3′ target-specific sequence. In some embodiments a cleavable moiety is at or near the junction between the 5′ first universal handle sequence and the 3′ target-specific sequence. In some embodiments a cleavable moiety is at or near the junction between the 5′ first universal handle sequence and the unique tag sequence, and at or near the junction between the unique tag sequence and the 3′ target-specific sequence. The cleavable moiety can be present in a modified nucleotide, nucleoside or nucleobase. In some embodiments, the cleavable moiety can include a nucleobase not naturally occurring in the target sequence of interest.

In some embodiments the at least one cleavable moiety in the plurality of adaptors is a uracil base, uridine or a deoxyuridine nucleotide. In some embodiments a cleavable moiety is within the 3′ target-specific sequence and the junctions between the 5′ universal handle sequence and the unique tag sequence and/or the 3′target specific sequence wherein the at least one cleavable moiety in the plurality of adaptors is cleavable with uracil DNA glycosylase (UDG). In some embodiments, a cleavable moiety is cleaved, resulting in a susceptible abasic site, wherein at least one enzyme capable of reacting on the abasic site generates a gap comprising an extendible 3′ end. In certain embodiments the resulting gap comprises a 5′-deoxyribose phosphate group. In certain embodiments the resulting gap comprises an extendible 3′ end and a 5′ ligatable phosphate group.

In another embodiment, inosine can be incorporated into a DNA-based nucleic acid as a cleavable group. In one exemplary embodiment, EndoV can be used to cleave near the inosine residue. In another exemplary embodiment, the enzyme hAAG can be used to cleave inosine residues from a nucleic acid creating abasic sites.

Where a cleavable moiety is present, the location of the at least one cleavable moiety in the adaptors does not significantly change the melting temperature (Tm) of any given double-stranded adaptor in the plurality of double-stranded adaptors. The melting temperatures (Tm) of any two given double-stranded adaptors from the plurality of double-stranded adaptors are substantially the same, wherein the melting temperatures (Tm) of any two given double-stranded adaptors does not differ by more than 10° C. of each other. However, within each of the plurality of adaptors, the melting temperatures of sequence regions differs, such that the comparable maximal minimum melting temperature of, for example, the universal handle sequence, is higher than the comparable maximal minimum melting temperatures of either the unique tag sequence and/or the target specific sequence of any adaptor. This localized differential in comparable maximal minimum melting temperatures can be adjusted to optimize digestion and repair of amplicons and ultimately improved effectiveness of the methods provided herein.

Further provided are compositions comprising a nucleic acid library generated by methods of the invention. Thus, provided are composition comprising a plurality of amplified target nucleic acid amplicons, wherein each of the plurality of amplicons comprises a 5′ universal handle sequence, optionally a first unique tag sequences, an intermediate target nucleic acid sequence, optionally a second unique tag sequences and a 3′ universal handle sequence. At least two and up to one hundred thousand target specific amplicons are included in provided compositions. Provided compositions include highly multiplexed targeted libraries. In some embodiments, provided compositions comprise a plurality of nucleic acid amplicons, wherein each of the plurality of amplicons comprise a 5′ universal handle sequence, a first unique tag sequences, an intermediate target nucleic acid sequence, a second unique tag sequences and a 3′ universal handle sequence. At least two and up to one hundred thousand target specific tagged amplicons are included in provided compositions. Provided compositions include highly multiplexed tagged targeted libraries.

In some embodiments, library compositions include a plurality of target specific amplicons comprising a multiplex of at least two different target nucleic acid sequences. In some embodiments, the composition comprises at least 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 750, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3250, 3500, 3750, 4000, 4500, 5000, 5500, 6000, 7000, 8000, 9000, 10000, 11000, or 12000, or more target-specific amplicons. In some embodiments, the target-specific amplicons comprise one or more exon, gene, exome or region of the genome associated with a clinical or pathological condition, e.g., amplicons comprising one or more sites comprising one or more mutations (e.g., driver mutation) associated with a cancer, e.g., lung, colon, breast cancer, etc., or amplicons comprising mutations associated with an inherited disease, e.g., cystic fibrosis, muscular dystrophies, etc. In some embodiments, the target-specific amplicons comprise a library of adaptor-ligated amplicon target sequences that are about 100 to about 750 base pairs in length.

As described herein, each of the plurality of amplicons comprises a 5′ universal handle sequence. In some embodiments a universal handle sequence comprises any one or any combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence. Preferably, the universal handle sequences of provided adaptors do not exhibit significant complementarity and/or hybridization to any portion of a unique tag sequence and/or target nucleic acid sequence of interest. In some embodiments a first universal handle sequence comprises any one or any combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence. In some embodiments a second universal handle sequence comprises any one or any combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence. In certain embodiments first and second universal handle sequences correspond to forward and reverse universal handle sequences and in certain embodiments the same first and second universal handle sequences are included for each of the plurality of target specific amplicons. Such forward and reverse universal handle sequences are targeted in conjunction with universal primers to carry out a second amplification of a preliminary library composition in production of resulting amplified according to methods of the invention. In certain embodiments a first 5′ universal handle sequence comprises two universal handle sequences (e.g., a combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence); and a second 5′ universal sequence comprises two universal handle sequences (e.g., a combination of an amplification primer binding sequence, a sequencing primer binding sequence and/or a capture primer binding sequence), wherein the 5′ first and second universal handle sequences do not exhibit significant hybridization to any portion of a target nucleic acid sequence of interest.

The structure and properties of universal amplification primers or universal primers are well known to those skilled in the art and can be implemented for utilization in conjunction with provided methods and compositions to adapt to specific analysis platforms. Universal handle sequences of the adaptors and amplicons provided herein are adapted accordingly to accommodate a preferred universal primer sequences. For example, e.g., as described herein universal P1 and A primers with optional barcode sequences have been described in the art and utilized for sequencing on Ion Torrent sequencing platforms (Ion Xpress™ Adapters, Thermo Fisher Scientific). Similarly, additional and other universal adaptor/primer sequences described and known in the art (e.g., Illumina universal adaptor/primer sequences can be found, e.g., at https://support.illumina.com/content/dam/illumina-support/documents/documentation/chemistry_documentation/experiment-design/illumina-adapter-sequences_1000000002694-01.pdf; PacBio universal adaptor/primer sequences, can be found, e.g., at https://s3.amazonaws.com/files.pacb.com/pdf/Guide_Pacific_Biosciences_Template_Preparation_and_Sequencing.pdf; etc.) can be used in conjunction with the methods and compositions provided herein. Suitable universal primers of appropriate nucleotide sequence for use with libraries of the invention are readily prepared using standard automated nucleic acid synthesis equipment and reagents in routine use in the art. One single type or separate types (or even a mixture) of two different universal primers, for example a pair of universal amplification primers suitable for amplification of a preliminary library may be used in production of the libraries of the invention. Universal primers optionally include a tag (barcode) sequence, where the tag (barcode) sequence does not hybridize to adaptor sequence or to target nucleic acid sequences. Barcode sequences incorporated into amplicons in a second universal amplification can be utilized e.g., for effective identification of sample source to thereby generate a barcoded library. Thus provided compositions include highly multiplexed barcoded targeted libraries. Provided compositions also include highly multiplexed barcoded tagged targeted libraries.

In some embodiments amplicon libraries comprise a unique tag sequence located between the 5′ first universal handle sequence and the 3′ target-specific sequence, and wherein the unique tag sequence does not exhibit significant complementarity and/or hybridization to any portion of a unique tag sequence and/or target nucleic acid sequence. In some embodiments the plurality of amplicons has 104-10⁹different tag sequence combinations. Thus in certain embodiments each of the plurality of amplicons in a library comprises 104-10⁹different tag sequences. In some embodiments each of the plurality of amplicons in a library comprises at least 1 different unique tag sequence and up to 10⁵different unique tag sequences. In certain embodiments each target specific amplicon in a library comprises at least two and up to 10⁹different combinations comprising different tag sequences, each having two different unique tag sequences. In some embodiments each of the plurality of amplicons in a library comprise a tag sequence comprising 4096 different tag sequences. In certain embodiments each target specific amplicon of a library comprises up to 16,777,216 different combinations comprising different tag sequences, each having two different unique tag sequences.

In some embodiments individual amplicons in the plurality of amplicons of a library include a unique tag sequence (e.g., contained in a tag adaptor sequence) comprising different random tag sequences alternating with fixed tag sequences. In some embodiments, the at least one unique tag sequence comprises a at least one random sequence and at least one fixed sequence, or comprises a random sequence flanked on both sides by a fixed sequence, or comprises a fixed sequence flanked on both sides by a random sequence. In some embodiments a unique tag sequence includes a fixed sequence that is 2-2000 nucleotides or base-pairs in length. In some embodiments a unique tag sequence includes a random sequence that is 2-2000 nucleotides or base-pairs in length.

Kits, Systems

Further provided herein are kits for use in preparing libraries of target nucleic acids using methods of the first or second aspects of the invention. Embodiments of a kit comprise a supply of at least a pair of target specific adaptors as defined herein which are capable of producing a first amplification product; as well as optionally a supply of at least one universal pair of amplification primers capable of annealing to the universal handle(s) of the adaptor and priming synthesis of an amplification product, which amplification product would include a target sequence of interest ligated to a universal sequence. Adaptors and/or primers may be supplied in kits ready for use, or more preferably as concentrates requiring dilution before use, or even in a lyophilized or dried form requiring reconstitution prior to use. In certain embodiments kits further include a supply of a suitable diluent for dilution or reconstitution of the components. Optionally, kits further comprise supplies of reagents, buffers, enzymes, dNTPs, etc., for use in carrying out amplification, digestion, repair, and/or purification in the generation of library as provided herein. Non-limiting examples of such reagents are as described in the Materials and Methods sections of the accompanying Exemplification. Further components which optionally are supplied in the kit include components suitable for purification of libraries prepared using the provided methods._In some embodiments, provided is a kit for generating a target-specific library comprising a plurality of target-specific adaptors having a 5′ universal handle sequence, a 3′ target specific sequence and a cleavable group, a DNA polymerase, an adaptor, dATP, dCTP, dGTP, dTTP, and a digestion reagent. In some embodiments, the kit further comprises one or more antibodies, a repair reagent, universal primers optionally comprising nucleic acid barcodes, purification solutions or columns.

Particular features of adaptors for inclusion in kits are as described elsewhere herein in relation to other aspects of the invention. The structure and properties of universal amplification primers are well known to those skilled in the art and can be implemented for utilization in conjunction with provided methods and compositions to adapt to specific analysis platforms (e.g., as described herein universal P1 and A primers have been described in the art and utilized for sequencing on Ion Torrent sequencing platforms). Similarly, additional and other universal adaptor/primer sequences described and known in the art (e.g., Illumina universal adaptor/primer sequences, PacBio universal adaptor/primer sequences, etc.) can be used in conjunction with the methods and compositions provided herein. Suitable primers of appropriate nucleotide sequence for use with adaptors included in the kit is readily prepared using standard automated nucleic acid synthesis equipment and reagents in routine use in the art. A kit may include a supply of one single type of universal primer or separate types (or even a mixture) of two different universal primers, for example a pair of amplification primers suitable for amplification of templates modified with adaptors in a first amplification. A kit may comprise at least a pair of adaptors for first amplification of a sample of interest according to the methods of the invention, plus at least two different amplification primers that optionally carry a different tag (barcode) sequence, where the tag (barcode) sequence does not hybridize to the adaptor. A kit can be used to amplify at least two different samples where each sample is amplified according to methods of the invention separately and a second amplification comprises using a single universal primer having a barcode, and then pooling prepared sample libraries after library preparations. In some embodiments a kit includes different universal primer-pairs for use in second amplification step described herein. In this context the ‘universal’ primer-pairs may be of substantially identical nucleotide sequence but differ with respect to some other feature or modification.

Further provided are systems, e.g., systems used to practice methods provided herein, and/or comprising compositions provided herein. In some embodiments, systems facilitate methods carried out in automated mode. In certain embodiments, systems facilitate high throughput mode. In certain embodiments, systems include, e.g., a fluid handling element, a fluid containing element, a heat source and/or heat sink for achieving and maintaining a desired reaction temperature, and/or a robotic element capable of moving components of the system from place to place as needed (e.g., a multiwell plate handling element).

Samples

As defined herein, “sample” and its derivatives, is used in its broadest sense and includes any specimen, culture and/or the like that is suspected of including a target nucleic acid. In some embodiments, a sample comprises DNA, RNA, TNA, chimeric nucleic acid, hybrid nucleic acid, multiplex-forms of nucleic acids or any combination of two or more of the foregoing. In some embodiments a sample useful in conjunction with methods of the invention includes any biological, clinical, surgical, agricultural, atmospheric or aquatic-based specimen containing one or more target nucleic acid of interest. In some embodiments, a sample includes nucleic acid molecules obtained from an animal such as a human or mammalian source. In another embodiment, a sample includes nucleic acid molecules obtained from a non-mammalian source such as a plant, bacteria, virus or fungus. In some embodiments, the source of the nucleic acid molecules may be an archived or extinct sample or species. In some embodiments a sample includes isolated nucleic acid sample prepared, for example, from a source such as genomic DNA, RNA TNA or a prepared sample such as, e.g., fresh-frozen or formalin-fixed paraffin-embedded (FFPE) nucleic acid specimen. It is also envisioned that a sample is from a single individual, a collection of nucleic acid samples from genetically related members, multiple nucleic acid samples from genetically unrelated members, multiple nucleic acid samples (matched) from a single individual such as a tumor sample and normal tissue sample, or genetic material from a single source that contains two distinct forms of genetic material such as maternal and fetal DNA obtained from a maternal subject, or the presence of contaminating bacteria DNA in a sample that contains plant or animal DNA. In some embodiments, a source of nucleic acid material includes nucleic acids obtained from a newborn (e.g., a blood sample for newborn screening). In some embodiments, provided methods comprise amplification of multiple target-specific sequences from a single nucleic acid sample. In some embodiments, provided methods comprise target-specific amplification of two or more target sequences from two or more nucleic acid samples or species. In certain embodiments, provided methods comprise amplification of highly multiplexed target nucleic acid sequences from a single sample. In particular embodiments, provided methods comprise amplification of highly multiplexed target nucleic acid sequences from more than one sample, each from the same source organism.

In some embodiments a sample comprises a mixture of target nucleic acids and non-target nucleic acids. In certain embodiments a sample comprises a plurality of initial polynucleotides which comprises a mixture of one or more target nucleic acids and may include one or more non-target nucleic acids. In some embodiments a sample comprising a plurality of polynucleotides comprises a portion or aliquot of an originating sample; in some embodiments, a sample comprises a plurality of polynucleotides which is the entire originating sample. In some embodiments a sample comprises a plurality of initial polynucleotides is isolated from the same source or from the same subject at different time points.

In some embodiments, a nucleic acid sample includes cell-free nucleic acids from a biological fluid, nucleic acids from a tissue, nucleic acids from a biopsied tissue, nucleic acids from a needle biopsy, nucleic acids from a single cell or nucleic acids from two or more cells. In certain embodiments, a single reaction mixture contains 1-100 ng of the plurality of initial polynucleotides. In some embodiments a plurality of initial polynucleotides comprises a formalin fixed paraffin-embedded (FFPE) sample; genomic DNA; RNA; TNA; cell free DNA or RNA or TNA; circulating tumor DNA or RNA or TNA; fresh frozen sample, or a mixture of two or more of the foregoing; and in some embodiments a the plurality of initial polynucleotides comprises a nucleic acid reference standard. In some embodiments, a sample includes nucleic acid molecules obtained from biopsies, tumors, scrapings, swabs, blood, mucus, urine, plasma, semen, hair, laser capture micro-dissections, surgical resections, and other clinical or laboratory obtained sample. In some embodiments, a sample is an epidemiological, agricultural, forensic or pathogenic sample. In certain embodiments, a sample includes a reference. In some embodiments a sample is a normal tissue or well documented tumor sample. In certain embodiments a reference is a standard nucleic acid sequence (e.g., Hg19).

Target Nucleic Acid Sequence Analysis

Provided methods and compositions of the invention are particularly suitable for amplifying, optionally tagging, and preparing target sequences for subsequent analysis. Thus, in some embodiments, methods provided herein include analyzing resulting library preparations. For example, methods comprise analysis of a polynucleotide sequence of a target nucleic acid, and, where applicable, analysis of any tag sequence(s) added to a target nucleic acid. In some embodiments wherein multiple target nucleic acid regions are amplified, provided methods include determining polynucleotide sequences of multiple target nucleic acids. Provided methods further optionally include using a second tag sequence(s), e.g., barcode sequence, to identify the source of the target sequence (or to provide other information about the sample source). In certain embodiments, use of prepared library composition is provided for analysis of the sequences of the nucleic acid library.

In particular embodiments, use of prepared tagged library compositions is provided for further analyzing the sequences of the target nucleic acid library. In some embodiments determination of sequences comprises determining the abundance of at least one of the target sequences in the sample. In some embodiments determination of a low frequency allele in a sample is comprised in determination of sequences of a nucleic acid library. In certain embodiments, determination of the presence of a mutant target nucleic acid in the plurality of polynucleotides is comprised in determination of sequences of a nucleic acid library. In some embodiments, determination of the presence of a mutant target nucleic acid comprises detecting the abundance level of at least one mutant target nucleic acid in the plurality of polynucleotides. For example, such determination comprises detecting at least one mutant target nucleic acid is present at 0.05% to 1% of the original plurality of polynucleotides in the sample, detecting at least one mutant target nucleic acid is present at about 1% to about 5% of the polynucleotides in the sample, and/or detecting at least 85%-100% of target nucleic acids in sample. In some embodiments, determination of the presence of a mutant target nucleic acid comprises detecting and identification of copy number variation and/or genetic fusion sequences in a sample.

In some embodiments, nucleic acid sequencing of the amplified target sequences produced by the teachings of this disclosure include de novo sequencing or targeted re-sequencing. In some embodiments, nucleic acid sequencing further includes comparing the nucleic acid sequencing results of the amplified target sequences against a reference nucleic acid sequence. In some embodiments, nucleic acid sequencing of the target library sequences further includes determining the presence or absence of a mutation within a nucleic acid sequence. In some embodiments, nucleic acid sequencing includes the identification of genetic markers associated with disease (e.g., cancer and/or inherited disease).

In some embodiments, prepared library of target sequences of the disclosed methods is used in various downstream analysis or assays with, or without, further purification or manipulation. In some embodiments analysis comprises sequencing by traditional sequencing reactions, high throughput next generation sequencing, targeted multiplex array sequence detection, or any combination of two or more of the foregoing. In certain embodiments analysis is carried out by high throughput next generation sequencing. In particular embodiments sequencing is carried out in a bidirectional manner, thereby generating sequence reads in both forward and reverse strands for any given amplicon.

In some embodiments, library prepared according to the methods provided herein is then further manipulated for additional analysis. For example, prepared library sequences is used in downstream enrichment techniques known in the art, such a bridge amplification or emPCR to generate a template library that is then used in next generation sequencing. In some embodiments, the target nucleic acid library is used in an enrichment application and a sequencing application. For example, sequence determination of a provided target nucleic acid library is accomplished using any suitable DNA sequencing platform. In some embodiments, the library sequences of the disclosed methods or subsequently prepared template libraries is used for single nucleotide polymorphism (SNP) analysis, genotyping or epigenetic analysis, copy number variation analysis, gene expression analysis, analysis of gene mutations including but not limited to detection, prognosis and/or diagnosis, detection and analysis of rare or low frequency allele mutations, nucleic acid sequencing including but not limited to de novo sequencing, targeted resequencing and synthetic assembly analysis. In one embodiment, prepared library sequences are used to detect mutations at less than 5% allele frequency. In some embodiments, the methods disclosed herein is used to detect mutations in a population of nucleic acids at less than 4%, 3%, 2% or at about 1% allele frequency. In another embodiment, libraries prepared as described herein are sequenced to detect and/or identify germline or somatic mutations from a population of nucleic acid molecules. In certain embodiments, sequencing adaptors are ligated to the ends of the prepared libraries generate a plurality of libraries suitable for nucleic acid sequencing.

In some embodiments, methods for preparing a target-specific amplicon library are provided for use in a variety of downstream processes or assays such as nucleic acid sequencing or clonal amplification. In some embodiments, the library is amplified using bridge amplification or emPCR to generate a plurality of clonal templates suitable for nucleic acid sequencing. For example, optionally following target-specific amplification a secondary and/or tertiary amplification process including, but not limited to, a library amplification step and/or a clonal amplification step is performed. “Clonal amplification” refers to the generation of many copies of an individual molecule. Various methods known in the art is used for clonal amplification. For example, emulsion PCR is one method, and involves isolating individual DNA molecules along with primer-coated beads in aqueous bubbles within an oil phase. A polymerase chain reaction (PCR) then coats each bead with clonal copies of the isolated library molecule and these beads are subsequently immobilized for later sequencing. Emulsion PCR is used in the methods published by Marguilis et al. and Shendure and Porreca et al. (also known as “polony sequencing”, commercialized by Agencourt and recently acquired by Applied Biosystems). Margulies, et al. (2005) Nature 437: 376-380; Shendure et al., Science 309 (5741): 1728-1732. Another method for clonal amplification is “bridge PCR,” where fragments are amplified upon primers attached to a solid surface. These methods, as well as other methods of clonal amplification, both produce many physically isolated locations that each contain many copies derived from a single molecule polynucleotide fragment. Thus, in some embodiments, the one or more target specific amplicons are amplified using for example, bridge amplification or emPCR to generate a plurality of clonal templates suitable for nucleic acid sequencing.

In some embodiments, at least one of the library sequences to be clonally amplified are attached to a support or particle. A support can be comprised of any suitable material and have any suitable shape, including, for example, planar, spheroid or particulate. In some embodiments, the support is a scaffolded polymer particle as described in U.S. Published App. No. 20100304982, hereby incorporated by reference in its entirety. In certain embodiments methods comprise depositing at least a portion of an enriched population of library sequences onto a support (e.g., a sequencing support), wherein the support comprises an array of sequencing reaction sites. In some embodiments, an enriched population of library sequences are attached to the sequencing reaction sites on the support wherein the support comprises an array of 10²-10¹⁰sequencing reaction sites.

Sequence determination means determination of information relating to the sequence of a nucleic acid and may include identification or determination of partial as well as full sequence information of the nucleic acid. Sequence information may be determined with varying degrees of statistical reliability or confidence. In some embodiments sequence analysis includes high throughput, low depth detection such as by qPCR, rtPCR, and/or array hybridization detection methodologies known in the art. In some embodiments, sequencing analysis includes the determination of the in depth sequence assessment, such as by Sanger sequencing or other high throughput next generation sequencing methods. Next-generation sequencing means sequence determination using methods that determine many (typically thousands to billions) nucleic acid sequences in an intrinsically massively parallel manner, e.g. where many sequences are read out, e.g., in parallel, or alternatively using an ultra-high throughput serial process that itself may be parallelized. Thus, in certain embodiments, methods of the invention include sequencing analysis comprising massively parallel sequencing. Such methods include but are not limited to pyrosequencing (for example, as commercialized by 454 Life Sciences, Inc., Branford, Conn.); sequencing by ligation (for example, as commercialized in the SOLiD™. technology, Life Technologies, Inc., Carlsbad, Calif.); sequencing by synthesis using modified nucleotides (such as commercialized in TruSeq™ and HiSeg™ and MiSeq™ and/or NovaSeq™ technology by Illumina, Inc., San Diego, Calif.; HeliScope™ by Helicos Biosciences Corporation, Cambridge, Mass.; and PacBio Sequel® or RS systems by Pacific Biosciences of California, Inc., Menlo Park, Calif.), sequencing by ion detection technologies (e.g., Ion Torrent™ technology, Life Technologies, Carlsbad, Calif.); sequencing of DNA nanoballs (Complete Genomics, Inc., Mountain View, Calif.); nanopore-based sequencing technologies (for example, as developed by Oxford Nanopore Technologies, LTD, Oxford, UK), and like highly parallelized sequencing methods.

For example, in certain embodiments, libraries produced by the teachings of the present disclosure are sufficient in yield to be used in a variety of downstream applications including the Ion Xpress™ Template Kit using an Ion Torrent™ PGM system (e.g., PCR-mediated addition of the nucleic acid fragment library onto Ion Sphere™ Particles)(Life Technologies, Part No. 4467389) or Ion Torrent Proton™ system). For example, instructions to prepare a template library from the amplicon library can be found in the Ion Xpress Template Kit User Guide (Life Technologies, Part No. 4465884), hereby incorporated by reference in its entirety. Instructions for loading the subsequent template library onto the Ion Torrent™ Chip for nucleic acid sequencing are described in the Ion Sequencing User Guide (Part No. 4467391), hereby incorporated by reference in its entirety.

The initiation point for the sequencing reaction may be provided by annealing a sequencing primer to a product of a solid-phase amplification reaction. In this regard, one or both of the adaptors added during formation of template library may include a nucleotide sequence which permits annealing of a sequencing primer to amplified products derived by whole genome or solid-phase amplification of the template library. Depending on implementation of an embodiment of the invention, a tag sequence and/or target nucleic acid sequence may be determined in a single read from a single sequencing primer, or in multiple reads from two different sequencing primers. In the case of two reads from two sequencing primers, a ‘tag read’ and a ‘target sequence read’ are performed in either order, with a suitable denaturing step to remove an annealed primer after the first sequencing read is completed.

In some embodiments, a sequencer is coupled to server that applies parameters or software to determine the sequence of the amplified target nucleic acid molecules. In certain embodiments, the sequencer is coupled to a server that applies parameters or software to determine the presence of a low frequency mutation allele present in a sample.

EXEMPLIFICATION
Example 1 Materials and Methods

Reverse Transcription (RT) Reaction method (21 uL reaction)_may be carried out in samples where RNA and DNA are analyzed, e.g., FFPE RNA and cfTNA:

- 1. Thaw the 5×URT buffer at room temperature for at least 5 minutes. (NOTE: Check for white precipitate in the tube. Vortex to mix as needed)

URT Buffer
5× concentration

TrisHCl ph 8.4
125 mM

Ammonium sulfate
50 mM

MgCl2
20 mM

dNTP pH 7.6
5 mM

- 2. In a MicroAmp EnduraPlate 96-well plate, set up the RT reaction by adding the following components.

(5-15ng RNA or DNA//5-40ng cfTNA)

Component
Volume

20 ng input cfTNA/10 ng FFPE RNA
15 μL

5 × URT buffer
4 μL

10 × RT (SSIV) Enzyme Mix
2 μL

Total volume
21 L

- 3. Mix entire contents by vortexing or pipetting. Spin down briefly.
- 4. Add 20 μl Parol 40 C oil to the top of each reaction mix.
- 5. Load the plate into thermocycler (e.g., SimpliAmp Thermocycler), and run the following program:

Stage
Temperature
Time

Stage 1
25° C.
10 min

Stage 2
50° C.
10 min

Stage 3
85° C.
5 min

Hold
4° C.
∞

Low-Cycle Tagging PCR (38 uL Reaction Volume+20 uL Oil):

Assemble tagging PCR reaction in 96-well PCR plate wells:

FFPE DNA Samples Only

1. Assemble the reaction by adding the following components to a MicroAmp EnduraPlate 96-well plate:

a. Prepare UDG mix: 1 ul+5 ul 5×URT buffer

b. Add the 6 ul diluted UDG to 15 μl FFPE DNA samples.

c. Mix by vortexing. Briefly spin down to collect reaction at the bottom of the wells.

d. Add 20 μL Parol 40 C Oil to the top of each sample.

e. Perform the reaction as following:

Stage
Temperature
Time

Stage 1
37° C.
2 min

Stage 2
50° C.
10 min

Hold
4° C.
>=1 min

2. Prepare Amplification Master Mix:

Component
Volume

Hawkeye panel FWD pool (125 nM)
3.75 μL

Hawkeye panel REV pool (125 nM)
3.75 μL

4 × SuperFiU MM v2.0
9.5 μL

Total volume
17 μL

3. Add 17 μL PCR Master Mix to 21 μL UDG treated FFPE DNA samples.

Set a pipette at 20 μL volume. Mix the reaction below oil by pipetting up and down 20 times to ensure thorough mix of the reaction without disturbing the oil phase. Spin down the plate briefly.

FFPE RNA and cfTNA Samples Only

1. Add components directly to the RT reactions from RT steps above:

Component
Volume

RT reaction
21 μL

Hawkeye panel FWD pool (10×, 125 nM)
3.8 μL

Hawkeye panel REV pool (10×, 125 nM)
3.8 μL

4 × SuperFiU MM v2.0
9.5 μL

Total volume
38 L

2. Set a pipette at 20 μL volume. Mix the reaction below oil by pipetting up and down 20 times to ensure thorough mix of the reaction without disturbing the oil phase. Spin down the plate briefly.

3. Perform 3-cycles tagging PCR using the following cycling condition on SimpliAmp:

For FFPE DNA and RNA Libraries:

Stage
Temperature
Time

Hold
99° C.
1
min

Cycle:
99° C.
30
sec

3
64° C.
2
min

60° C.
12
min

66° C.
2
min

72° C.
2
min

Hold
72° C.
2
min

Hold
4° C.
∞

For cfTNA Libraries

Stage
Temperature
Time

Cycle:
99° C.
30
sec

3
64° C.
2
min

60° C.
12
min

66° C.
2
min

72° C.
2
min

Hold
72° C.
2
min

Hold
4° C.
∞

Digestion-Filling-Ligation (45.6 μL Reaction Volume+20 μL Oil):

1. Add 7.6 μL of SUPA into each of the above PCR reaction well. Add SUPA directly to the sample below the oil layer.
2. Set a pipette at 25 μL. Mix the reaction below oil layer by pipetting up and down for 20 times. Spin down the plate briefly.
3. Load the plate into thermocycler and run the following program:

Stage
Temperature
Time

Stage 1
30° C.
15
min

Stage 2
50° C.
15
min

Stage 3
55° C.
15
min

Stage 4
25° C.
10
min

Stage 5
98° C.
2
min

Hold
4° C.
∞

Library Amplification (˜51 μL Reaction Volume+20 μL Oil)

1. Carefully transfer 30 μL the above post digestion-filling-ligation reaction to AmpliSeq HD Dual Barcodes. Mix well by pipetting up and down 20 times. Transfer all the reactions back to the original well under the oil layer.

2. Set a pipette at 30 μL. Mix entire reaction below oil by pipetting up and down 20 times. Spin down the plate briefly.

3. Load the plate into thermocycler and run the following program:

Stage
Temperature
Time

Hold
99° C.
15
sec

Cycle: 5
99° C.
15
sec

62° C.
20
sec

72° C.
20
sec

Cycle: 15 (FFPE DNA
99° C.
15
sec

and cfFNA)

Cycle: 18 (FFPE RNA)
70° C.
40
sec

Hold
72° C.
5
min

Hold
4° C.
∞

2—Round AmpurcXP library purification

Resulting repaired sample is purified using 36.8 ul Ampure® beads (Beckman Coulter, Inc.) according to the manufacturer instructions for two rounds. Briefly:

- Transfer 46 μL of library reaction below oil layer to new, clean wells on the PCR plate.
- Add 36.8 μl of Agencourt™ AMPure™ XP Reagent to each sample and mix by pipetting then incubate at room temperature for 5 minutes.
- Place the plate on magnet until the solutions in wells become clear.
- Carefully remove the supernatant; then remove residual supernatant.
- Add 150 uL of 80% ethanol in 10 mM pH 8 Tris-HCl. Do not disturb the bead pellet.
- Toggle plate on magnet 3 times with 5 seconds interval; Remove the supernatant; Repeat wash steps one more time. Use a pipette to remove residual buffer in the wells.
- Dry wells at room temperature for 5 min.
- Add 30 uL of low TE buffer to the wells and pipette to resuspend beads.
- Incubate the solution at room temperature for 5 min, Place plate on magnet to clear solution.
- Transfer 30 uL of the eluent into clean well on a plate.
- Add into the above well 30 μL (1× Volume) of AmpureXP beads; Pipette in well to mix.
- Repeat steps as above, using 40 uL of low TE buffer to elute after second purification.
- Transfer 40 uL of the library into a new clean well.

Library Normalization with Individual Equalizer

First, warm all reagents in the Ion Library Equalizer™ Kit to room temperature. Vortex and centrifuge all reagents. Wash the Equalizer™ Beads (f previously performed skip to Add Equalizer™ Beads and Wash).

- 1. For each 4 reaction, add 12 μL of beads into a clean 1.5-mL tube and 24 μL/reaction Equalizer™ Wash Buffer.
- 2. Place tube in a magnetic rack for 3 minutes or until the solution is completely clear.
- 3. Carefully remove and discard the supernatant without disturbing the pellet.
- 4. Remove from magnet, add 24 μL per reaction Equalizer™ Wash Buffer, and resuspend.

Amplify the Library

- 5. Remove plate with purified libraries from the magnet, then add 10 μL of 5× DV-Amp Mix and
- 2 μL of Equalizer™ Primers (pink cap in Equalizer kit). Total volume=52 μL
- 6. Mix.
- 7. Add 20 μL Parol 40 C Oil gently on top of samples.
- 8. run the following program on thermocycler:
  - 98 C for 2 min
  - 9-cycles amplification for FFPE DNA/RNA OR 6-cycles amplification for cfTNA:
  - 98 C for 15 sec
  - 64 C for 1 min
  - Then
  - Hold at 4 C for infinite
- 9. (Optional) after thermal cycling, centrifuge plate to collect any droplets.

Add Equalizer™ Capture to the amplified library

- 10. Add 10 μL of Equalizer Capture to each library amplification reaction beneath the oil layer.
- 11. mix up and down 10×.
- 12. Incubate at room temperature for 5 minutes.

Add Equalizer™ Beads and wash

- 13. Transfer 60 μL amplified library samples beneath the oil layer into well with washed beads.
- 14. mix thoroughly.
- 15. Incubate at room temperature for 5 minutes.
- 16. Place plate in magnet, then incubate for 2 minutes or until the solution is clear.
- 17. remove the supernatant.
- 18. Add 150 μL of Equalizer™ Wash Buffer to each reaction.
- 19. With the plate still in the magnet, remove, and discard supernatant.
- 20. Repeat the bead wash Elute the Equalized Library.

Elute the Equalized library

- 21. Remove plate from magnet, add 100 μL of Equalizer™ Elution Buffer to each pellet.
- 22. Pipette mix with 50 ul volume 5×.
- 23. Elute library by incubating on thermo cycler at 32° C. for 5 minutes.
- 24. Remove immediately, place plate in magnet, as soon as solution is clear, move to new wells.
- 25. Perform qPCR and adjust pool @100 pM for templating and sequencing.

Example 2 Compositions and Methods

The first step of provided methods comprises a few rounds of amplification, for example, three to six cycles of amplification, and in certain instances, three cycles of amplification using forward and reverse adaptors to each gene specific target sequence. Each adaptor contains a 5′universal sequence, and a 3′ gene specific target sequence. In some embodiments adaptors optionally comprise a unique tag sequence located between the 5′ universal and the 3′ gene specific target sequences.

In specific embodiments wherein unique tag sequences are utilized, each gene specific target adaptor pair includes a multitude of different unique tag sequences in each adaptor. For example, each gene specific target adaptor comprises up to 4096 TAGs. Thus, each target specific adaptor pair comprises at least four and up to 16,777,216 possible combinations.

Each of the provided adaptors comprises a cleavable uracil in place of thymine at specific locations in the forward and reverse adaptor sequences. Positions of uracils (Us) are consistent for all forward and reverse adaptors having unique tag sequences, wherein uracils (Us) are present flanking the 5′ and 3′ ends of the unique tag sequence when present; and Us are present in each of the gene specific target sequence regions, though locations for each gene specific target sequence will inevitably vary. Uracils flanking each unique tag sequence (UT) and in gene-specific sequence regions are designed in conjunction with sequences and calculated Tm of such sequences, to promote fragment dissociation at a temperature lower than melting temperature of the universal handle sequences, which are designed to remain hybridized at a selected temperature. Variations in Us in the flanking sequences of the UT region are possible, however designs keep the melting temperature below that of the universal handle sequences on each of the forward and reverse adaptors. Exemplary adaptor sequence structures comprise:

Forward Adaptor:

SEQ ID NO: 1564

------A Handle------------*UT*------ -Gene Specific--

TCTGTACGGTGACAAGGCG-U-NNNACTNNNTGA-U-XXXXXXXXXXXXXXXX

Reverse Adaptor

SEQ ID NO: 1565

TGACAAGGCGTAGTCACGG-U-NNNACTNNNTGA-U-XXXXXXXXXXXXXXXX

-----B Handle------- ------UT------- -------Gene Specific-------

Wherein each N is a base selected from A, C, G, or T and the constant sections of the UT region are used as anchor sequences to ensure correct identification of variable (N) portion. The constant and variable regions of the UT can be significantly modified (e.g., alternative constant sequence, >3Ns per section) as long as the Tm of the UT region remains below that of the universal handle regions. Importantly, cleavable uracils are absent from each forward (e.g., TCTGTACGGTGACAAGGCG (SEQ ID NO:1566 and reverse (e.g., TGACAAGGCGTAGTCACGG (SEQ ID NO:1567) universal handle sequence. In the present example, universal sequences have been designed to accommodate follow on amplification and addition of sequencing sequences on the ION Torrent platform, however, one skilled in the art would understand that such universal sequences could be adaptable to use other universal sequences which may be more amenable to alternative sequencing platforms (e.g., ILLUMINA sequencing systems, QIAGEN sequencing systems, PACBIO sequencing systems, BGI sequencing systems, or others).

Methods of use of provided compositions comprise library preparation via AmpliSeq HD technology with slight variations thereof and using reagents and kits available from Thermo Fisher Scientific. SuperFiU DNA comprises a modification in the uracil-binding pocket (e.g., AA 36) and a family B polymerase catalytic domain (e.g., AA 762). SuperFiU is described in U.S. Provisional patent application No. 62/524,730 filed Jun. 26, 2017, which is hereby incorporated by reference. Polymerase enzymes may be limited in their ability to utilize uracil and/or any alternative cleavable residues (e.g., inosine, etc.) included into adaptor sequences. In certain embodiments, it may also be advantageous to use a mixture of polymerases to reduce enzyme specific PCR errors.

The second step of methods involves partial digestion of resulting amplicons, as well as any unused uracil-containing adaptors. For example, where uracil is incorporated as a cleavable site, digestion and repair includes enzymatic cleavage of the uridine monophosphate from resulting primers, primer dimers and amplicons, and melting DNA fragments, then repairing gapped amplicons by polymerase fill-in and ligation. This step reduces and potentially eliminates primer-dimer products that occur in multiplex PCR. In some instances, digestion and repair are carried out in a single step. In certain instances, it may be desirable to separate digestion and repair-steps temporally. For example, thermolabile polymerase inhibitors may be utilized in conjunction with methods, such that digestion occurs at lower temperatures (25-40° C.), then repair is activated by increasing temperature enough to disrupt a polymerase-inhibitor interaction (e.g., polymerase-Ab), though not high enough to melt the universal handle sequences.

Uracil-DNA Glycosylase (UDG) enzyme can be used to remove uracils, leaving abasic sites which can be acted upon by several enzymes or enzyme combinations including (but not limited to): APE 1-Apurinic/apyrimidinic endonuclease; FPG-Formamidopyrimidine [fapy]-DNA glycosylase; Nth-Endonuclease II; Endo VIII-Endonuclease VIII; PNK-Polynucleotide Kinase; Taq-Thermus aquaticus DNA polymerase; DNA pol I-DNA polymerase I; Pol beta-Human DNA polymerase beta. In a particular implementation, the method uses Human apurinic/apyrimidinic endonuclease, APE1. APE1 activity leaves a 3′-OH and a 5′deoxyribose-phosphate (5′-dRP). Removal of the 5′-dRP can be accomplished by a number of enzymes including recJ, Polymerase beta, Taq, DNA pol I, or any DNA polymerase with 5′-3′ exonuclease activity. Removal of the 5′-dRP by any of these enzymes creates a ligatable 5′-phosphate end. In another implementations, UDG activity removes the Uracil and leaves and abasic site which is removed by FPG, leaving a 3′ and 5′-phosphate. The 3′-phosphate is then removed by T4 PNK, leaving a polymerase extendable 3′-OH. The 5′-deoxyribose phosphate can then be removed by Polymerase beta, fpg, Nth, Endo VIII, Taq, DNA pol I, or any other DNA polymerase with 5′-3′ exonuclease activity. In a particular implementation Taq DNA polymerase is utilized.

Repair fill-in process can be accomplished by almost any polymerase, possibly the amplification polymerase used for amplification in step 1 or by any polymerase added in step 2 including (but not limited to): Phusion DNA polymerase; Phusion U DNA polymerase; SuperFi DNA polymerase; SuperFi U DNA polymerase; TAQ; Pol beta; T4 DNA polymerase; and T7 DNA polymerase. Ligation repair of amplicons can be performed by many ligases including (but not limited to): T4 DNA ligase; T7 DNA ligase; Taq DNA ligase. In a particular implementation of the methods, Taq DNA polymerase is utilized and ligation repaired in accomplished by T7 DNA ligase.

A last step of library preparation involves amplification of the repaired amplicons by standard PCR protocols using universal primers that contain sequences complementary to the universal handle sequences on the 5′ and 3′ ends of prepared amplicons. For example, an A-universal primer, and a P1 universal primer, each part of the Ion Express Adaptor Kit (Thermo Fisher Scientific, Inc.) may optionally contain a sample specific barcode. The last library amplification step may be performed by many polymerases including, but not limited to: Phusion DNA polymerase; Phusion U DNA polymerase; SuperFi DNA polymerase; SuperFi U DNA polymerase; Taq DNA polymerase; Veraseq Ultra DNA polymerase.

Example 3 Assay Content and Methods

With primers directed to target sequences specific to targets in Table 1, adaptors each comprise 4096 unique tag sequences for each gene specific target sequence, resulting in an estimate of 16,777,216 different unique tag combinations for each gene specific target sequence pair.

Preparation of library was carried out according to the method described above. Prepared libraries are prepared for templating and sequenced, and analyzed. Sequencing can be carried out by a variety of known methods, including, but not limited to sequencing by synthesis, sequencing by ligation, and/or sequencing by hybridization. Sequencing has been carried out in the examples herein using the Ion Torrent platform (Thermo Fisher Scientific, Inc.), however, libraries can be prepared and adapted for analysis, e.g., sequencing, using any other platforms, e.g., Illumina, Qiagen, PacBio, etc. Results may be analyzed using a number of metrics to assess performance, for example:

- # of families (with ng input DNA captured) The median # of families is a measure of the number of families that maps to an individual target. In this case, each unique molecular tag is a family.
- Uniformity is a measure of the percentage of target bases covered by at least 0.2× the average read depth. This metric is used to ensure that the technology does not selectively under-amplify certain targets.
- Positives/Negatives: When a control sample with known mutations is utilized is analyzed (e.g., Acrometrix Oncology Hotspot Control DNA, Thermo Fisher Scientific, Inc.), the number of True Positives can be tracked.
  - True Positives: The number of True Positives informs on the number of mutations that were present and correctly identified.
  - False positives (FP): (Hot spot and Whole Target) The number of False Positives informs on the number of mutations that are determined to be present, but known not to be in the sample.
  - False negatives (FN) (if acrometrix spike-in is used) The number of False Negatives informs on the number of mutations that were present but not identified.
- On/Off Target is the percentage of mapped reads that were aligned/not aligned over a target region. This metric is used to ensure the technology amplifies predominantly the targets to which the panel was designed.
- Low quality is tracked to ensure the data is worth analyzing. This metric is a general system metric and isn't directly related to this technology.

TABLE 1

Precision Assay Gene Content by Variant Class

DNA

Inter-Genetic
Intra-Genetic

Hotspots
CNV
Fusions
Fusions

AKT1
GNAS
ALK

ALK

AR

AKT2
HRAS
AR
BRAF
EGFR

AKT3
IDH1
CD274
ESR1
MET

ALK
IDH2
CDKN2A

FGFR1

AR
KIT
EGFR

FGFR2

ARAF
KRAS
ERBB2

FGFR3

BRAF
MAP2K1
ERBB3
MET

CDK4
MAP2K2
FGFR1
NRG1

CDKN2A
MET
FGFR2

NTRK1

CHEK2
MTOR
FGFR3

NTRK2

CTNNB1
NRAS
KRAS

NTRK3

EGFR
NTRK1
MET
NUTM1

ERBB2
NTRK2
PIK3CA

RET

ERBB3
NTRK3
PTEN
ROS1

ERBB4
PDGFRA

RSPO2

ESR1
PIK3CA

RSPO3

FGFR1
PTEN

50

Total Genes

FGFR2
RAF1

45
DNA Hotspot Genes

FGFR3
RET

14
CNV Genes

FGFR4
ROS1

16
Inter-Genetic Fusions

FLT3
SMO

3
Intra-Genetic Fusions

GNA11
TP53

GNAQ

Bold includes non-targeted fusion

Clinical evidence is defined as number of instances that a gene/variant combination appears in drug labels, guidelines, and/or clinical trials. Tables 2 and 3 depict top genes/variants and indications relevant to provided assay, as supported by clinical evidence.

TABLE 2

Top 5 assay genes/variant types

with the most clinical evidence

ERBB2 (HER2) amplification

EGFR hotspot mutations

BRAF hotspot mutations

KRAS hotspot mutations

ALK fusions

TABLE 3

Top 5 indications

with the most clinical evidence

NSCLC

Breast

Colorectal

Melanoma

Kidney

Up to 29 gene and variant combinations covered under the provided assay are on drug labels and/or guidelines (NCCN and ESMO)

TABLE 4

Cancer Indications Ranked by Clinical Evidence

Non-Small Cell Lung Cancer
Ovarian Cancer
Thyroid Cancer

Unspecified Solid Tumor
Bladder Cancer
Glioblastoma

Breast Cancer
Esophageal Cancer
Soft Tissue Sarcoma

Colorectal Cancer
Head and Neck Cancer
Gastrointestinal Stromal Tumor

Melanoma
Endometrial Cancer
Small Cell Lung Cancer

Kidney Cancer
Pancreatic Cancer
Cervical Cancer

Gastric Cancer
Liver Cancer

Example 4 Results

Primers were designed using the composition design approach provided herein and targeted to oncology genes using those of the panel target genes as described above in Table 1, where the library amplification step utilized two primer pairs (to put the two universal sequences on each end of amplicons, e.g., an A-universal handle and a P1-universal handle on each end) to enable bi-directional sequencing as described herein. Prepared library was sequenced using Ion Gene Studio Templating/and Sequencing kits and instrumentation (Thermo Fisher Scientific, Inc.) and/or a new fully integrated library preparation, templating and sequencing system. Performance with the instant panel indicates the technology is able to appropriately detect targeted mutations, copy number variations and fusions as intended.

A. Fusion Detection Capability in Various ALK and ROS Isoforms from NSCLC FFPE Samples

Libraries were prepared and sequenced as described above. Various fusion isoform detection was demonstrated as expected:

HIP1-ALK.H21A20
KIF5B-ALK.K17A20
EML4-ALK.E20A20

Read Count: 373
Read Count: 602
Read Count: 671

Molecular Count: 3
Molecular Count: 10
Molecular Count: 12

CD74-ROS1. C6R34
SLC34A2-ROS1.S13R32

Read Count: 1947
Read Count: 2518

Molecular Count: 82
Molecular Count: 82

4B: Mutation Detection in Matched Samples with GeneStudio S5 and New Sequencer

Sample
GeneStudio S5
NEW System

Type
FFPE
Plasma
FFPE
Plasma

Breast
✓ PIK3CA G1049R
✓ PIK3CA G1049R
✓ PIK3CA G1049R
✓ PIK3CA G1049R

Colon
✓ None
✓ None
✓ None
✓ None

Colon
✓ KRAS G12V
✓ KRAS G12V
✓ KRAS G12V
✓ KRAS G12V

Colon
✓ KRAS G12D
✓ KRAS G12D
✓ KRAS G12D
✓ KRAS G12D

NSCLC
✓ KRAS Q61H
✓ KRAS Q61H
✓ KRAS Q61H
✓ KRAS Q61H

Library preparation, sequencing and analysis was carried out for mutation detection in matched samples as described above using both manual preparation and sequencing on ION GeneStudio S5 as well as an automated and integrated library preparation, templating and sequencing system. The precision assay demonstrated concordant PIK3CA and KRAS mutation detection across matched tissue and plasma samples with both the GeneStudio S5 with manual workflow as compared to the automated system.

4C: DNA Variant Detection Across Various Cancer Indications

Library preparation, sequencing and analysis was carried out for mutation detection in a variety of different sample types as described above using both manual preparation and sequencing on ION GeneStudio S5 as well as an automated and integrated library preparation, templating and sequencing system. The precision assay demonstrated detected various driver mutations across different cancer indication sample types.

Squamous

Cell

Carcinoma

Solid

Melanoma
Breast
Colon
(SCC)
Pancreas
Glioblastoma
Tumor
NSCLC

NRAS
NRAS
EGFR
KRAS
KRAS
EGFR
NTRK
EGFR

BRAF
BRAF
BRAF
TP53

Amplification
fusions
BRAF

KIT
PIK3CA
KRAS

KRAS

ERBB2
NRAS

ERBB2

amplification

MET

ALK fusions

ROS fusions

RET fusions

4D: Detection Using Cohort of Matched FFPE and Plasma Samples

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detecting variants across cohort of matched FFPE and plasma samples. The assay demonstrated detection of various driver mutations across different cancer indication sample types. Using the assay on a set of matched FFPE and plasma samples, 4 out of 8 had concordant PIK3CA(1) and KRAS(3) mutations; while 1 out of 8 had concordant NO variants detected

Sample
Cancer
Tumor

FFPE Variant
Plasma Variant

#
Type
Grade
Stage
Results
Results

1
Breast
G2
IIIA
PIK3CA
PIK3CA

G1049R
G1049R

(32.26%)
(12.88%)

2
Breast
G3
IIIA
FGFR1 CNV
TBD*

(7.2)

3
Breast
G3
IIIC
TP53 H179R
None

(68.42%)

ERBB2 CNV
TBD*

(16.2)

4
Colon
G2
IIIA
None
None

5
Colon
G3
IIIA
KRAS G12V
KRAS G12V

(41.35%)
(12.81%)

PTEN 1.14
TBD*

6
Colon
G2
IIIA
KRAS G12D
KRAS G12D

(11.29%)
(15.42%)

7
NSCLC
Unknown
IIIB
KRAS G12D
None

(12.24%)

8
NSCLC
Unknown
IIIA
KRAS Q61H
KRAS Q61H

(26.41%)
(1.67%)

4E: Detection of Variants in FFPE Cancer Samples with Known Variants

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detecting variants in 16 FFPE samples (NSCLC, Breast, and CRC) with known mutations previously confirmed using ONCOMINE cfDNA assays. Samples were tested within a single run (chip) using the assay on an integrated system. 8 samples had previously been characterized using Oncomine cfDNA assays In this cohort, 17 mutations were detected by the present assay, within EGFR, ERBB4, IDH1, KRAS, MET, PIK3CA, and TP53 Additionally, 3 amplifications in EGFR, ERBB2, and FGFR1 were detected Lastly, 3 fusions with FGFR2 and RSPO3 driver genes were also detected. The assay was able to detect a number of variants, including SNV mutations, CNV amplifications, and fusions in a cohort of FFPE samples.

4F. SNV and CNV Detection with Multiple Cancer Tvpe from FFPE

The assay was used to detect various driver mutations across different cancer indications. All results were concordant with previous characterization using different assay and system:

Allele

Allele

freq.
Allele
freq.
Expected

Sample

Previous
FFPE
freq.
FFPE
CNV

Pathological
ID

SNV
family-
FFPE
read-
based on
Detected

diagnosis
FFPE
Previous SNV
AF
based
hybrid
based
S5
CNV

Adenocarcinoma
AB 2
KRAS
4.03%
9.4%
9.4%
10.8%

ductal

COSM521_p.G12D

Invasive
AB 3
IDH1
45.54%
48.7%
51.2%
49.9%

adenocarcinoma

COSM97131_p.V178I

EGFR
13.81%
9.5%
13.9%
15.1%

COSM6224_p.L858R

Colon
AB 4
BRAE
18.75%
34.2%
17.2%
17.3%

adenocarcinoma

COSM476_p.V600E

Invasive
AB 5
none
none
none
none
none

carcinoma of no

special type

(NCT)

Glioblastoma
AB 8
none
none
none
none
none
EGFR,
EGFR

multiforme

40
gain

(22x)

Melanoma
AB 10
NRAS
18.57%
28.6%
25.3%
26.5%

COSM584_p.Q61R

Squamous cell
AB 11
HRAS
11.83%
25.0%
21.6%
21.6%

carcinoma

COSM487_p.G13S

Infiltrative
AB 14
none
none
none
none
none
FGFR1,
FGFR1

urothelial

6
gain (3x)

carcinoma (high

grade)

Melanoma
AB 1
BRAE
63.16%
88.9%
65.2%
65.2%

COSM476_p.V600E

Invasive
AB 6
PIK3CA
27.59%
14.6%
13.2%
13.2%

carcinoma of no

COSM763_p.E545K

special type

(NCT)

Squamous cell
AB 12
TP53
23.88%
23.7%
22.2%
23.4%

CDKN2

carcinoma

COSM10660_p.R273H

A loss

(0.6x)

Glioblastoma
AB 20
none
none
none
none
none
EGFR,
EGFR

multiforme

42
gain

(24x)

4G. Detection of Fusions Across ALK, ROS1, RET, NTRK1, NTRK2, and NTRK3 Driver Genes

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detecting fusion variants The assay was able to reproducibly detect 11 fusion isoforms representing 6 driver genes (ALK, BRAF FGFR3, NTRK1, NTRK3, RET, and ROS1), as well as 15 NTRK fusion isoforms representing 3 driver genes (NTRK1, NTRK2, and NTRK3) using targeted isoform detection.

Replicate 1
Replicate 2

(read/
(read/

molecular
molecular

Material
Fusion Isoform
counts)
counts)

SeraCare Seraseq
CD74-ROS1.C6R34
78/6
141/12

FFPE Tumor
EML4-ALK.E13A20
583/18
530/21

Fusion RNA
ETV6-NTRK3.E5N15
639/34
307/17

Reference
FGFR3-BAIAP2L1.F17B2
343/14
443/24

Material
FGFR3-TACC3.F17T11
504/26
360/22

KIF5B-RET.K24R11
333/20
153/3

LMNA-NTRK1.L2N11
819/22
815/26

NCOA4-RET.N7R12
664/22
805/25

SLC34A2-ROS1.S4R34
74/4
121/9

SLC45A3-BRAF.S1B8
519/35
566/34

TPM3-NTRK1.T7N10
553/28
464/30

SeraCare Seraseq
AFAP1-NTRK2.A14N10
2542/157
1823/137

FFPE NTRK
BTBD1-NTRK3.B4N14
1676/153
1753/169

Fusion RNA
ETV6-NTRK3.E4N14
1892/144
1815/161

Reference
ETV6-NTRK3.E4N15
2224/132
2348/164

Material
ETV6-NTRK3.E5N14
1605/132
1775/145

ETV6-NTRK3.E5N15
2811/160
2208/165

IRF2BP2-NTRK1.I1N9
1584/122
1877/156

LMNA-NTRK1.L11N11
3618/184
3000/185

NACC2-NTRK2.N4N12
1839/94
1626/106

PAN3-NTRK2.P1N15
772/48
783/68

QKI-NTRK2.Q6N14
2376/196
1891/181

SQSTM1-NTRK1.S5N9
4679/186
3260/159

TFG-NTRK1.T5N9
1842/141
1529/134

TPM3-NTRK1.T7N9
2645/185
2891/207

TRIM24-NTRK2.T12N13
2940/112
1969/92

4H: Detection of EGFR and KRAS Variants in Control Materials

Expected
Replicate 1
Replicate 2

Material
Variant
Variant Type
AF %
Observed AF %

Horizon
EGFR p.G719S
SNV
5%
6.02%
8.45%

5% FFPE
EGFR
INDEL
5%
3.26%
5.26%

p.E746_A750delELREA

EGFR p.T790M
SNV
5%
6.76%
4.40%

EGFR p.L858R
SNV
5%
5.00%
5.29%

Horizon
EGFR p.G719S
SNV
1%
ND
1.46%

1% FFPE
EGFR
INDEL
1%
1.35%
1.08%

p.E746_A750delELREA

EGFR p.T790M
SNV
1%
2.21%
0.77%

EGFR p.L858R
SNV
1%
1.53%
1.00%

KRAS
NRAS p.Q61K
SNV
5%
2.46%
3.67%

Gene-
NRAS p.G12V
SNV
5%
5.51%
4.26%

Specific
KRAS p.A146T
SNV
5%
2.94%
4.00%

Multiplex
KRAS p.Q61H
SNV
5%
6.45%
5.66%

Reference
KRAS p.G13D
SNV
5%
2.48%
6.14%

Standard
KRAS p.G12D
SNV
5%
2.97%
3.07%

5% FFPE

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detecting variants of EGFR Using Horizon EGFR Gene-Specific Multiplex Reference Standard 5% and 1% FFPE Controls; and in detecting fusion variants of RAS Using Horizon KRAS Gene-Specific Multiplex Reference Standard 5% FFPE. The assay was able to detect all EGFR variants at 5% allele frequency using a Horizon FFPE control. At 1% allele frequency—which is below typical LOD—the assay picked up 7 out of 8 instances across 2 replicates: The assay was able to reproducibly detect 6 RAS mutations using Horizon control.

4I: Detection of KRAS, BRAF, KIT, EGFR Mutations Using cfDNA Controls

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detecting KRAS, BRAF, KIT, EGFR mutations using cfDNA controls SeraCare Seraseq ctDNA Reference Material v2 AF 0.125% or Horizon Multiplex I cfDNA Reference Standard Set (1% and 0.1%). The assay was able to detect mutations down to an allele frequency of 0.1% using cfDNA controls.

Replicate
Replicate

Expected
1
2

Material
Variant
Type
AF
Observed AF

Seraseq
KRAS G12D
SNV
0.11%
0.089%
ND

ctDNA
BRAF V600E
SNV
0.14%
0.111%
0.206%

KIT D816V
SNV
0.125%
0.193%
ND

EGFRp.E746_A750
INDEL
0.12%
0.155%
0.143%

delELREA

EGFR
INDEL
0.18%
0.101%
0.086%

p.D770_N771insG

EGFR T790M
SNV
0.18%
0.177%
0.184%

EGFRL858R
SNV
0.17%
0.225%
ND

Horizon
KRAS G12D
SNV
1%
1.272%
1.269%

1%

Horizon
KRAS G12D
SNV
0.1%
0.094%
0.232%

0.1%

4J: Detection of MET and PTEN Copy Number Variation

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detection of MET copy gain and PTEN copy loss using control and cell line. Structural Multiplex FFPE Reference Standard (Horizon) was utilized to detect MET, and a PTEN cell line (ATCC) was used for detection of PTEN copy number variation. The assay was able to detect the MET copy number gain and PTEN copy number loss using control and cell line, respectively.

Expected
Expected
Observed
Observed

CNV
Copy
Copy
Copy

Sample
Gene
Status
Number
Number R1
Number R2

Wild-
MET

2
2.3
2.3

type

Horizon
MET

4.5
4.6
4.6

Wild-
PTEN

2
2.5
2.6

type

Cell Line
PTEN

0
0
0

4K: Detection of NTRK1, FGFR3, RET Fusions in Cell Lines.

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detection in NTRK1, FGFR3, RET fusion cell lines. KM12 cell line (ATCC); SW780 cell line (ATCC); LC-2/ad cell line (Sigma Aldrich) were used for nucleic acid preparation and evaluation. The assay was able to detect the TPM3-NTRK1 fusion isoform using both the targeted isoform and imbalance assay methods of the assay. The assay was able to detect the FGFR3-BAIAP2L1 fusion isoform using both the targeted isoform and imbalance assay methods of the assay. Interestingly, an ALK imbalance was also defected in this cell line; research is ongoing to understand these results further The assay was able to detect the CCDC6-RET fusion isoform using both the targeted isoform and imbalance assay methods

Targeted

Targeted
Read
Molecular
Imbalance
Imbalance
Imbalance

Isoform
Rep
Isoform
Count
Counts
Assay
Score
p-value

TPM3-
1
DETECTED
4430
360
DETECTED
2.781
0.0017

NTRK1.T7N10
2
DETECTED
6665
531

FGFR3-
1
DETECTED
9096
763
DETECTED
1.716
0.0066

BAIAP2L1.F17B2
2
DETECTED
10913
952

CCDC6-
1
DETECTED
3342
318
DETECTED
1.763
0.0055

RET.C1R12
2
DETECTED
3233
301

4L: Detection of ALK and ROS1 Fusions in FFPE Samples

Library preparation, sequencing and analysis was carried out for evaluation of performance of the assay in detection of ALK and ROS1 fusions in FFPE samples. The assay was able to detect the ALK fusions using both the targeted isoform and imbalance assay methods, and ROS1 fusions using the targeted isoform method for FFPE samples

Targeted

Read
Molecular
Imbalance
Imbalance
Imbalance

Sample
Isoform
Isoform
Count
Counts
Assay
Score
p-value

1
DETECTED
EML4-
1535
50
DETECTED
4.767
0.0007

ALK.E13A20

DETECTED
EML4-
6665
531

ALK.E6A20

2
DETECTED
CD74-
85
5
No imbalance assays for ROS1

ROS1.C6R33

DETECTED
CD74-
1566
110

ROS1.C6R34

While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

TABLE A

primer sequences of the oncology precision assay, FWD pool and REV pool

SEQ

SEQ

ID

ID

NO
PrimerSeqFWD(A)
NO
PrimerSeqREV(B)

1
TCTGTACGGTGACAAGGCGUNNNACTNN
997
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGAGUCGGGCTCUGGA

CCGCUGUGGCCCUCGUG

2
TCTGTACGGTGACAAGGCGUNNNACTNN
998
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCACGGGUCGGGUGAGA

GACAGCGGCUGCGAUCA

3
TCTGTACGGTGACAAGGCGUNNNACTNN
999
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGGUGCCGAGCCUCUG

GGUCGCCCUCCACGCAG

4
TCTGTACGGTGACAAGGCGUNNNACTNN
1000
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGCCGUUAGGGUGCAG

AAGUGCCCAGCGAGCUA

5
TCTGTACGGTGACAAGGCGUNNNACTNN
1001
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCCUCACCUCCACCGT

CAAUGCCGAUGGCCUCC

6
TCTGTACGGTGACAAGGCGUNNNACTNN
1002
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUUCUGCGCAGCUUCCC

AGACGACAGGGCUGGUT

7
TCTGTACGGTGACAAGGCGUNNNACTNN
1003
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAUGGCCAUGGCGCGGA

ACUGGCAUGACCCCCAC

8
TCTGTACGGTGACAAGGCGUNNNACTNN
1004
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUCGUCUCUCCAGCCC

GGCUCUCGCGGAGGAAG

9
TCTGTACGGTGACAAGGCGUNNNACTNN
1005
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCCCCUGAGCGUCAUC

UUUGUUGGCGGGCAACC

10
TCTGTACGGTGACAAGGCGUNNNACTNN
1006
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUCUGAGGAGCCCGUG

GCAGUCCGGCUUGGAGG

11
TCTGTACGGTGACAAGGCGUNNNACTNN
1007
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCGUCCUCCCAGCGUA

AAGUCCUGCCGAGCACT

12
TCTGTACGGTGACAAGGCGUNNNACTNN
1008
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCGACCCCCUCAUCAT

ACUGGUUGGUGGCUGGA

13
TCTGTACGGTGACAAGGCGUNNNACTNN
1009
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACUUGGAGGACCGUCGC

AGCUGCAUGGUGCGGUT

14
TCTGTACGGTGACAAGGCGUNNNACTNN
1010
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGGACAGUGGGCCAA

AGGCUCCAGUGCUGGUT

15
TCTGTACGGTGACAAGGCGUNNNACTNN
1011
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCUGGGCCAGAGUGT

AGGCUCCUCCAGGCUCA

16
TCTGTACGGTGACAAGGCGUNNNACTNN
1012
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACAUGGCCUCCUCCGC

AUCCUCUGCCCCACCCT

17
TCTGTACGGTGACAAGGCGUNNNACTNN
1013
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGGUCCCCAUGGUGGC

CAACAUGGCCUGGCAGC

18
TCTGTACGGTGACAAGGCGUNNNACTNN
1014
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGCGCCUUCCAUGGAG

CAAAAAGGGAUUCAAUUGCCAUCCA

19
TCTGTACGGTGACAAGGCGUNNNACTNN
1015
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGCAGCAGUGGAGCCA

CAUCUCCACCGCCGUGT

20
TCTGTACGGTGACAAGGCGUNNNACTNN
1016
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCAGGACGUGCUGCUC

CCCAUCCUCUGGAGCCA

21
TCTGTACGGTGACAAGGCGUNNNACTNN
1017
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGGCAACGUGGUUGG

CCCCCACCUGAGACUCC

22
TCTGTACGGTGACAAGGCGUNNNACTNN
1018
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCAUCGAGCCUCCGAC

CUGCUUGGCCUGGAGGG

23
TCTGTACGGTGACAAGGCGUNNNACTNN
1019
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGCAACCUGCAGCAC

GAGCUGAGCGCCUGGCA

24
TCTGTACGGTGACAAGGCGUNNNACTNN
1020
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUUUGGUGGCACGCAGC

GAUGUCCCGGCGCUUGA

25
TCTGTACGGTGACAAGGCGUNNNACTNN
1021
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUCUUCCCCAACGGCA

GCACACGCGGAUGUGCA

26
TCTGTACGGTGACAAGGCGUNNNACTNN
1022
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCGUGGAGCUAUGGGT

GCGUGACCGGGACUUCC

27
TCTGTACGGTGACAAGGCGUNNNACTNN
1023
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGCCCCCACUCCCAG

GGCAGCAGGGUGGUGAG

28
TCTGTACGGTGACAAGGCGUNNNACTNN
1024
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGGCCUCCUGCACUCC

GGCCACCUGGACCUUCC

29
TCTGTACGGTGACAAGGCGUNNNACTNN
1025
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGACGGUCGGACUCCC

GGCCAGACUGACCCUCC

30
TCTGTACGGTGACAAGGCGUNNNACTNN
1026
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGGCGAUGUCGCCGAA

GGCCCGUGUCUUGGAGG

31
TCTGTACGGTGACAAGGCGUNNNACTNN
1027
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCUUCGAGGCCGUUGA

GGGUAGGCCGUGUCUGG

32
TCTGTACGGTGACAAGGCGUNNNACTNN
1028
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCGAAGGCGUCUCCCUG

GUCAGCCAGGGCACCUG

33
TCTGTACGGTGACAAGGCGUNNNACTNN
1029
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCGGCUUGGGAGAAUG

GUCAGCCCCAGGGAUGG

34
TCTGTACGGTGACAAGGCGUNNNACTNN
1030
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUACACGGUGCGCGAGG

GUGUCCUCCGCUGAGGC

35
TCTGTACGGTGACAAGGCGUNNNACTNN
1031
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUGUCCAGAGGACCCC

UCAGGGCUCUGCAGCUC

36
TCTGTACGGTGACAAGGCGUNNNACTNN
1032
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAGCCAUGGGCUGCAT

UCCACGCUGCUCGGCAT

37
TCTGTACGGTGACAAGGCGUNNNACTNN
1033
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGCUCAGUGAGGCUCG

UCCGGCAUUCGUGUUGC

38
TCTGTACGGTGACAAGGCGUNNNACTNN
1034
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGUGCCACCCGCCUAUG

UCUCUGGGAGGGCACUG

39
TCTGTACGGTGACAAGGCGUNNNACTNN
1035
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAGUGCUGGCAUGCCG

UGAUGGUCGAGGUGCGG

40
TCTGTACGGTGACAAGGCGUNNNACTNN
1036
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGGUGGAGGACCUGG

UGCACGUCGGUUUUGGG

41
TCTGTACGGTGACAAGGCGUNNNACTNN
1037
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGGCACUGAGGGUCGC

UGGCCGCUCCAACUCAC

42
TCTGTACGGTGACAAGGCGUNNNACTNN
1038
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAACCCGCGCUCUCUGA

UGUUGCACUGUGCCUGG

43
TCTGTACGGTGACAAGGCGUNNNACTNN
1039
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAGCUCGGCUGUUCCA

UUUUUCCGCGGCACCUC

44
TCTGTACGGTGACAAGGCGUNNNACTNN
1040
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUCAGAGCCCCACCUG

CUCCUUCCUUGCCAACGC

45
TCTGTACGGTGACAAGGCGUNNNACTNN
1041
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUGCUGGAGAGACCCC

UGAGCCCACCUGACUUGG

46
TCTGTACGGTGACAAGGCGUNNNACTNN
1042
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGAGCCGUCAACGAUG

GCUGCCGAAGACCAACUG

47
TCTGTACGGTGACAAGGCGUNNNACTNN
1043
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUGCCCUCCGUGUUCA

GAGCCCAGGCCUUUCUUG

48
TCTGTACGGTGACAAGGCGUNNNACTNN
1044
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGGCGUCCACAACUCA

AGGAACUCCCGCAGGUUT

49
TCTGTACGGTGACAAGGCGUNNNACTNN
1045
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGAGAUGCCGUCGGUG

CUCCACCCCUGAAGCCUG

50
TCTGTACGGTGACAAGGCGUNNNACTNN
1046
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGGCCUUCGUACGGG

CCGUCUCCUCCACGGAUG

51
TCTGTACGGTGACAAGGCGUNNNACTNN
1047
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCUGCCAGCGGCUCAG

GUGAGGCAGAUGCCCAGC

52
TCTGTACGGTGACAAGGCGUNNNACTNN
1048
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCUGCAUGAUCUGCGG

GGAGGUGGUGGUGGUCCC

53
TCTGTACGGTGACAAGGCGUNNNACTNN
1049
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCGUCAUGAGACCCGA

CUAGUUGCAUGGGUGGCG

54
TCTGTACGGTGACAAGGCGUNNNACTNN
1050
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCCUCUCUGCCCAGC

CCAGAUCAUCCGCGAGCT

55
TCTGTACGGTGACAAGGCGUNNNACTNN
1051
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUUCUGCCUCCCGUGG

GUGAGCCUGCAAUCCCUG

56
TCTGTACGGTGACAAGGCGUNNNACTNN
1052
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCCGACCUUGAGGCUG

UUACCCUUGGCCGCGUAC

57
TCTGTACGGTGACAAGGCGUNNNACTNN
1053
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCUCUCAUGCCCGCAG

CUCACAGGUCGUGUGUGC

58
TCTGTACGGTGACAAGGCGUNNNACTNN
1054
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCCAGUGGCCCUCGG

GUACCGGAGGAAGCGGUT

59
TCTGTACGGTGACAAGGCGUNNNACTNN
1055
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGGCCACUGGGUCACC

AACCUGCAGCAUGAGCAC

60
TCTGTACGGTGACAAGGCGUNNNACTNN
1056
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGAGAUGGCCCGACA

ACAUGUCUCCGCUGGUCG

61
TCTGTACGGTGACAAGGCGUNNNACTNN
1057
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACCGUCUCCUCGGAGC

AUCAGCGAGAGUGGCAGG

62
TCTGTACGGTGACAAGGCGUNNNACTNN
1058
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGCUCCCAGCAAGCGA

AUGCCAGGUGCAAGCACA

63
TCTGTACGGTGACAAGGCGUNNNACTNN
1059
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCACCGCAUCGUGCAG

CACGACUGUUGGACCGUG

64
TCTGTACGGTGACAAGGCGUNNNACTNN
1060
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCCGCUCGUCCACCAG

CAGCGAAUGGGCAGCAUG

65
TCTGTACGGTGACAAGGCGUNNNACTNN
1061
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUCGCCCACGAGUAGC

CAGGAGUCCGAGGUGGUG

66
TCTGTACGGTGACAAGGCGUNNNACTNN
1062
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCGCCACCUGCUGAC

CCAGUAGCGCUGCUUCCT

67
TCTGTACGGTGACAAGGCGUNNNACTNN
1063
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGCCUCUCACCAUCGA

CCGUGGAGCUCCUCACAC

68
TCTGTACGGTGACAAGGCGUNNNACTNN
1064
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCGAGCCCGGGAAGUG

CGGGAAGCGGGAGAUCUT

69
TCTGTACGGTGACAAGGCGUNNNACTNN
1065
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGACUCGAUGGACCGC

CUCUUGCGGGUACCCACG

70
TCTGTACGGTGACAAGGCGUNNNACTNN
1066
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACUGCCUGGCUGGCUG

CUGCUUCCUCAAGGCCGA

71
TCTGTACGGTGACAAGGCGUNNNACTNN
1067
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGUGCCCACCAGGCAA

CUGUAGGGACACAGGGCA

72
TCTGTACGGTGACAAGGCGUNNNACTNN
1068
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAAUCUCCUGCGCCCUGG

GCCAUGCGGGUCUCUCUG

73
TCTGTACGGTGACAAGGCGUNNNACTNN
1069
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGAUUGCUCCGGCCGT

GCCUCCGGAAGGUCAUCT

74
TCTGTACGGTGACAAGGCGUNNNACTNN
1070
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGAGGCACUGAGGCG

GCCUUUUGUCCGGCUCCT

75
TCTGTACGGTGACAAGGCGUNNNACTNN
1071
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCGCCAUGCAAGGCT

GCUCGUGUCCCCCAACAA

76
TCTGTACGGTGACAAGGCGUNNNACTNN
1072
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGUCCAGGGAGAGCCUG

GGCGAUCUCCUCGUUUGC

77
TCTGTACGGTGACAAGGCGUNNNACTNN
1073
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCGACUCCGAGGACG

GGGCUUGUCUUGAGGCUG

78
TCTGTACGGTGACAAGGCGUNNNACTNN
1074
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCUGCAGAACGGGAG

GUGAGGGCUGACGCAGAG

79
TCTGTACGGTGACAAGGCGUNNNACTNN
1075
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUCGUUCCGCUUCGGG

GUGGGCUCAGGAACCGAG

80
TCTGTACGGTGACAAGGCGUNNNACTNN
1076
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCAGCAAGGCCUGGUG

UACCCGAGGUCCCUGGAG

81
TCTGTACGGTGACAAGGCGUNNNACTNN
1077
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUAGCUUUGGCGAGGG

UCAGCGCGAUCAGCAUCT

82
TCTGTACGGTGACAAGGCGUNNNACTNN
1078
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAGCUUGCCCUGACCC

UCCGCAGGCUUCCUUAGG

83
TCTGTACGGTGACAAGGCGUNNNACTNN
1079
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGAGGGCGAUGGGCUG

UCUGCAGAGGACUCCAGC

84
TCTGTACGGTGACAAGGCGUNNNACTNN
1080
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGUGAGCUGCCUGCGT

UGGGUCUCUGUGAGGGCA

85
TCTGTACGGTGACAAGGCGUNNNACTNN
1081
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCGGCGAGUCCUGAG

UUCUUCCCGCCUUUCCCG

86
TCTGTACGGTGACAAGGCGUNNNACTNN
1082
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGCUCCGGGUGACAGC

GAUCUCCCAGAGCAGGACC

87
TCTGTACGGTGACAAGGCGUNNNACTNN
1083
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAGCGGACUCCCCUCG

CUGCACACACCAGUUGAGC

88
TCTGTACGGTGACAAGGCGUNNNACTNN
1084
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCAGCAUCCGACCAC

ACAUAGUCCCGGAAGCUGC

89
TCTGTACGGTGACAAGGCGUNNNACTNN
1085
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCCUGCUGUCUGCCG

GGUACGCCUCCAGAUGAGC

90
TCTGTACGGTGACAAGGCGUNNNACTNN
1086
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAUGCAGCCGUGCCAG

UGUACACGUCCCGGGACAT

91
TCTGTACGGTGACAAGGCGUNNNACTNN
1087
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUCGGCAGCCGCAGAA

UGAGGUUGGAGUCCAUGGG

92
TCTGTACGGTGACAAGGCGUNNNACTNN
1088
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCAGGAGGUGGAGGG

GCACCUGGCUCCUCUUCAC

93
TCTGTACGGTGACAAGGCGUNNNACTNN
1089
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCCUGAGCCAGCAGGG

AUUCCUACCGGAAGCAGGT

94
TCTGTACGGTGACAAGGCGUNNNACTNN
1090
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCAUUCCCGGGAGGG

CGUGUGAUGCAGCUCUUCG

95
TCTGTACGGTGACAAGGCGUNNNACTNN
1091
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGCUCCACCUCAGCAG

CUUGUGCCCACGAAGGAGT

96
TCTGTACGGTGACAAGGCGUNNNACTNN
1092
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGAAGUCAGCCGGCUC

CUAGUCCCUGGCUGGACCA

97
TCTGTACGGTGACAAGGCGUNNNACTNN
1093
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAGGACGCCUUCUGCA

CUCUUGACCAGCACGUUCC

98
TCTGTACGGTGACAAGGCGUNNNACTNN
1094
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGCCCAGGCUGGGAAG

CGGUUUUCCCGGACAUGGT

99
TCTGTACGGTGACAAGGCGUNNNACTNN
1095
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUCACCACGAGCUGCC

CUGCUGUGUGCUGGCAGAT

100
TCTGTACGGTGACAAGGCGUNNNACTNN
1096
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGGACCAGACCCUGC

GAUGCACCACGGCCACAUA

101
TCTGTACGGTGACAAGGCGUNNNACTNN
1097
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUGAGUCGGCCUGUGG

AUCAGAACUGCCGACCACA

102
TCTGTACGGTGACAAGGCGUNNNACTNN
1098
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCACCCUUCCGACCUC

CCUCUUGACCUGUCCAGGC

103
TCTGTACGGTGACAAGGCGUNNNACTNN
1099
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUUCCCGGGUCCCGAG

CCAUGCUGGACCUUCUGCA

104
TCTGTACGGTGACAAGGCGUNNNACTNN
1100
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGUGGGCAGCCAGGAG

GGACGACCCAGAGCUGAUG

105
TCTGTACGGTGACAAGGCGUNNNACTNN
1101
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAAGGCUUUGGUCCAGCCA

CUUGCAGUGGAACUCCACG

106
TCTGTACGGTGACAAGGCGUNNNACTNN
1102
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGAGUGGGCAGGAGGC

CCCAUGGCAAACACCAUGA

107
TCTGTACGGTGACAAGGCGUNNNACTNN
1103
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAGUGGAGGCCGGAUG

ACUCCUGAACCCUGAAGGC

108
TCTGTACGGTGACAAGGCGUNNNACTNN
1104
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCUCGGGCAGUGACAC

CUCCCCUUCUCUGCCCAGA

109
TCTGTACGGTGACAAGGCGUNNNACTNN
1105
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGCCGUGCAGCGAUUG

GUUGCCCUUGGAGGCAUAC

110
TCTGTACGGTGACAAGGCGUNNNACTNN
1106
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCCACUGUGCUUCCUC

AUUGCAGGCUCACCCCAAT

111
TCTGTACGGTGACAAGGCGUNNNACTNN
1107
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUUCGCGCACACCCUA

AAUGCACCAGUGGUGGUCT

112
TCTGTACGGTGACAAGGCGUNNNACTNN
1108
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCGUCUGCUGUUGCT

GAAUCUGUCUGCUGCUCCUGT

113
TCTGTACGGTGACAAGGCGUNNNACTNN
1109
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAACGGCAGCUUCGUG

AGAUGCUGCAGAUGCUGCT

114
TCTGTACGGTGACAAGGCGUNNNACTNN
1110
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUGGCCCUGUAGGACCT

AGCAACCACUCGAUCCUGT

115
TCTGTACGGTGACAAGGCGUNNNACTNN
1111
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGAAUGCGCCCCGGACUT

AGCUCCAUCUGCAUGGCUT

116
TCTGTACGGTGACAAGGCGUNNNACTNN
1112
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGCUGGUGGAGGCUGAC

AGCUGAGGCCUUGCAGAAC

117
TCTGTACGGTGACAAGGCGUNNNACTNN
1113
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGCCCGUGAAGUGGAT

CAAAGCAGCCCUCUCCCAG

118
TCTGTACGGTGACAAGGCGUNNNACTNN
1114
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGUCCUCCACAGGCAT

CACCAGACACAGCAUCUGC

119
TCTGTACGGTGACAAGGCGUNNNACTNN
1115
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGACCCGGAGCACUUCC

CACUCCAGCCGUCUCUUGC

120
TCTGTACGGTGACAAGGCGUNNNACTNN
1116
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCUGCAACUGCUUCCCT

CAGUGGGCAGGUCCUUCAA

121
TCTGTACGGTGACAAGGCGUNNNACTNN
1117
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGAGUGGGCGAGUUUGC

CAUCGGAACCUGCACACAG

122
TCTGTACGGTGACAAGGCGUNNNACTNN
1118
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGCUCCUGACCUGGAGT

CCUUGUGGCUUUCAGGGUC

123
TCTGTACGGTGACAAGGCGUNNNACTNN
1119
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGCCCUCCCAGAAGGUC

CGAUGUCAUUCGCUGCAGT

124
TCTGTACGGTGACAAGGCGUNNNACTNN
1120
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCCUGACAUCCACGGT

CGCUCCAAAACACGACCUT

125
TCTGTACGGTGACAAGGCGUNNNACTNN
1121
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGCAUGGUCCACCACAG

CUGCAGGGCCAUCUUGGAG

126
TCTGTACGGTGACAAGGCGUNNNACTNN
1122
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUGGCACAGCCUCCCUT

CUGCCUUGUCCCACAUCAG

127
TCTGTACGGTGACAAGGCGUNNNACTNN
1123
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACACAGCUGGGCGCUUUG

CUUCCCCAUCCAUUUCGGG

128
TCTGTACGGTGACAAGGCGUNNNACTNN
1124
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCAGGCGCCAAGUAGGT

GAGUUGAACUGGCGGCCAT

129
TCTGTACGGTGACAAGGCGUNNNACTNN
1125
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCACCAUGCUGCAGCAC

GCAGGAGCCAAGGUCAGUG

130
TCTGTACGGTGACAAGGCGUNNNACTNN
1126
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAGAUGGACGCACUGGGC

GCCACGAGAGUGUGGUGAG

131
TCTGTACGGTGACAAGGCGUNNNACTNN
1127
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAAAGCCGGCUACGCGCUG

GCCACUCCGCAGGAUAAAC

132
TCTGTACGGTGACAAGGCGUNNNACTNN
1128
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAGCCCCUCCUCAGAUG

GGAUCCUUGUCCCCACCAT

133
TCTGTACGGTGACAAGGCGUNNNACTNN
1129
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUGUGACAACGGGCUGC

GUCACCCCUUCCUUGGCAC

134
TCTGTACGGTGACAAGGCGUNNNACTNN
1130
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGGACAGCAUCGGGAGC

GUCGGGAUGGAGAAAGCGA

135
TCTGTACGGTGACAAGGCGUNNNACTNN
1131
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAACAGGAGCAGCUGCG

UCCGGCUGCAAUGAUCAGG

136
TCTGTACGGTGACAAGGCGUNNNACTNN
1132
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCUCCUGUGUGCCCAGA

UCCUCAGCUCCCGGUUCUC

137
TCTGTACGGTGACAAGGCGUNNNACTNN
1133
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCCAAGUCCUCCUUGCC

UGCUUGGAGUCAGCUGAGG

138
TCTGTACGGTGACAAGGCGUNNNACTNN
1134
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACAGACAGGCUGUGUGC

UGGGAUCUCCUUGGGUGCC

139
TCTGTACGGTGACAAGGCGUNNNACTNN
1135
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCUGCCAAGAAGGCCA

UGUGUCCACACCUGUGUCC

140
TCTGTACGGTGACAAGGCGUNNNACTNN
1136
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCACAAGUCGGACCCCUA

UUGAGCGUGUGAAGACUGC

141
TCTGTACGGTGACAAGGCGUNNNACTNN
1137
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCGGAUCUGGAGGAGCAG

UUGCGCUUCUCCUCCUCCT

142
TCTGTACGGTGACAAGGCGUNNNACTNN
1138
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGUGCGGAAGAUUGCCC

CCACAUCCACCGAGGCAUT

143
TCTGTACGGTGACAAGGCGUNNNACTNN
1139
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACACGUUCACGGUGCCC

AGUCCUUGCGUGCAUUGUC

144
TCTGTACGGTGACAAGGCGUNNNACTNN
1140
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGAGGAAGCCCAUCGA

CAGCGAAUGGGCAGCAUUG

145
TCTGTACGGTGACAAGGCGUNNNACTNN
1141
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGGCUCUUACCGCAAG

GACUCGGCCCUGAGUGAUA

146
TCTGTACGGTGACAAGGCGUNNNACTNN
1142
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCCAAGAGUGCCAAGUG

CGGCUUUACCUCCAAUGGUG

147
TCTGTACGGTGACAAGGCGUNNNACTNN
1143
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUCGUCUGUCACCCAGG

GAGGAUGAGCCUGACCAGUG

148
TCTGTACGGTGACAAGGCGUNNNACTNN
1144
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGACCAGACGGUCUCAGA

AAACAGUAGCUUCCCUGGGT

149
TCTGTACGGTGACAAGGCGUNNNACTNN
1145
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCCCCAACCGCACUGAG

GUAGCUGACCCUGCCUACCT

150
TCTGTACGGTGACAAGGCGUNNNACTNN
1146
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUGACGGAGGAGCUUGT

CUUACCAGGCAAGGCCUUGG

151
TCTGTACGGTGACAAGGCGUNNNACTNN
1147
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGGCACUCAGCAGCAAG

AUCAUUUCUGCUGGCGCACA

152
TCTGTACGGTGACAAGGCGUNNNACTNN
1148
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGGUGGCCAUAGGAACG

CAUCGUAGACCUGGGUCCCT

153
TCTGTACGGTGACAAGGCGUNNNACTNN
1149
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAAAACCCAGUGCGUACGCAT

GAGUCCACAGUCUGGAAGCG

154
TCTGTACGGTGACAAGGCGUNNNACTNN
1150
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGAAGAGCACGCCAUG

CGGCCCAACACCUUCAUCAT

155
TCTGTACGGTGACAAGGCGUNNNACTNN
1151
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCUGCACGUUUCCUCC

GUAGAGUGUGCGUGGCUCUC

156
TCTGTACGGTGACAAGGCGUNNNACTNN
1152
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCACCAGCUCACUGCAC

UUGGCUCUGACUGUACCACC

157
TCTGTACGGTGACAAGGCGUNNNACTNN
1153
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCGUGGCCUUGACCUCC

UGGCCAUCUACAAGCAGUCA

158
TCTGTACGGTGACAAGGCGUNNNACTNN
1154
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAGCUGGUGGAAGACCT

CGCAUCGUGUACUUCCGGAT

159
TCTGTACGGTGACAAGGCGUNNNACTNN
1155
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACGACUCCGUGUUUGCC

UCCUUCCUGUCCUCCUAGCA

160
TCTGTACGGTGACAAGGCGUNNNACTNN
1156
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCUCACAGUCUCCUGGG

AGAAGGCGGGAGACAUAUGG

161
TCTGTACGGTGACAAGGCGUNNNACTNN
1157
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAACCUCCGUGAGGACG

AAGCCGAAGGUCACAAAGUC

162
TCTGTACGGTGACAAGGCGUNNNACTNN
1158
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCUGGUGUUGCUGAGGG

CAGGUCCUCAAGUCUUCGGG

163
TCTGTACGGTGACAAGGCGUNNNACTNN
1159
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGACAGUGCCCAGGGCUC

CAGCUGGCCUUACCAUCCUG

164
TCTGTACGGTGACAAGGCGUNNNACTNN
1160
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCCUGCUCUUCCUUGGG

ACACCUGGCCUUCAUACACC

165
TCTGTACGGTGACAAGGCGUNNNACTNN
1161
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGGGUUUCGAGGCCAAC

UUCUUUCUCUUCCGCACCCA

166
TCTGTACGGTGACAAGGCGUNNNACTNN
1162
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGCUUUCCUCCUGCGUC

ACACCUGGCCUUCAUACACC

167
TCTGTACGGTGACAAGGCGUNNNACTNN
1163
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAAUGCUGGGACGCUGCC

ACCAGGAAGGACUCCACUUC

168
TCTGTACGGTGACAAGGCGUNNNACTNN
1164
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCACCACCACUUCCCC

ACCAUGCCAUAGUCCAUGCC

169
TCTGTACGGTGACAAGGCGUNNNACTNN
1165
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGAGAUCCACGCCUACC

ACGGAGACCACUCUUCACGA

170
TCTGTACGGTGACAAGGCGUNNNACTNN
1166
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCGAAGCUUCGAGACCUG

AUGAUUUGCAAAGCGCACAC

171
TCTGTACGGTGACAAGGCGUNNNACTNN
1167
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCAGAGGAGGUCGUGGG

AUGUCUGUGUGUCCCGUCAA

172
TCTGTACGGTGACAAGGCGUNNNACTNN
1168
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGCAAGCUCCUUCCUG

AUGUUGCACAGCCUCCUUGG

173
TCTGTACGGTGACAAGGCGUNNNACTNN
1169
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCCGAGAAGCCAGUCA

CAAUCGCGGUAGAGGCUGUC

174
TCTGTACGGTGACAAGGCGUNNNACTNN
1170
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCAACGGAAGCACUGG

CAGGUGGAGAAGUUCCUGGT

175
TCTGTACGGTGACAAGGCGUNNNACTNN
1171
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGGAGUGGCAGCAGAAG

CCCGUGCCUGUAUUCAAGUG

176
TCTGTACGGTGACAAGGCGUNNNACTNN
1172
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGGAGGAGCAGCUUGA

CCGAGGGAAUUCCCACUUUG

177
TCTGTACGGTGACAAGGCGUNNNACTNN
1173
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCCGAGCCAAUCACGGG

CCUGGACAGCUUGUGGGAAG

178
TCTGTACGGTGACAAGGCGUNNNACTNN
1174
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACAUCUCCUACGCCCUGG

CCUUGUCCCUCCUUCAAGGG

179
TCTGTACGGTGACAAGGCGUNNNACTNN
1175
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCACCUGCUCCUUCCAG

CGCCCAGAGUGAAGAUCUCC

180
TCTGTACGGTGACAAGGCGUNNNACTNN
1176
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGACACGGUGGUACUGGC

CUCGUACGGUCAGGUUGACG

181
TCTGTACGGTGACAAGGCGUNNNACTNN
1177
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGAUUGCAGCUCAUGCT

CUGACCUAGUGUGAGGGAGG

182
TCTGTACGGTGACAAGGCGUNNNACTNN
1178
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCUGGAAGCCAAGGCAG

CUGGACGUUGAUGCCACUGA

183
TCTGTACGGTGACAAGGCGUNNNACTNN
1179
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUUGGCCAAGGAGUGCC

CUUCUUCUCCACCGGGUCUC

184
TCTGTACGGTGACAAGGCGUNNNACTNN
1180
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUGGCGGAGCAGAUGAG

GCUCAGCUUGUACUCAGGGC

185
TCTGTACGGTGACAAGGCGUNNNACTNN
1181
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAUACCCGGACCCUGGAG

GCUCCUUCAGUUGAGGCUGG

186
TCTGTACGGTGACAAGGCGUNNNACTNN
1182
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGAUCGCCGCCCUCAUT

GGGUGUUGGAGUUCAUGGAG

187
TCTGTACGGTGACAAGGCGUNNNACTNN
1183
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCACGCAGCCCAAAUC

GGUGACGUUGUGCAAGGAGA

188
TCTGTACGGTGACAAGGCGUNNNACTNN
1184
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACAUCCUGUUGCACCCCA

GGUGCACUUCACAACAGGGT

189
TCTGTACGGTGACAAGGCGUNNNACTNN
1185
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCGCCACCUCCAACCAUC

GUCAUAGUGGGCUUCAGCCG

190
TCTGTACGGTGACAAGGCGUNNNACTNN
1186
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGUGCGCAAGGUGAAAT

GUCUGGACGCCCGAUUCUUC

191
TCTGTACGGTGACAAGGCGUNNNACTNN
1187
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACUUGCUGGAUGGGCCUG

UAUAGGUCCGGUGGACAGGG

192
TCTGTACGGTGACAAGGCGUNNNACTNN
1188
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGAAGGAGGGUCACCGC

UCUCAGCUGAGGAGAUGGGT

193
TCTGTACGGTGACAAGGCGUNNNACTNN
1189
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUGUGCCAGUAGCCGUG

UCUUGAAGGCAUCCACGGAG

194
TCTGTACGGTGACAAGGCGUNNNACTNN
1190
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAUCUGGAGCUCCGUGA

UGCCCAAAGCAACCUUCUCC

195
TCTGTACGGTGACAAGGCGUNNNACTNN
1191
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAGAACGUGGUGGGCAT

UGUCUUCAGGCUGAUGUUGC

196
TCTGTACGGTGACAAGGCGUNNNACTNN
1192
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCUGAAGACCGGCCAC

UUCUCGCUUCAGCACGAUGT

197
TCTGTACGGTGACAAGGCGUNNNACTNN
1193
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCCCAACAGGCAGGUG

UUGUUGAGCACAAGGAGCAG

198
TCTGTACGGTGACAAGGCGUNNNACTNN
1194
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGCAUGGAGUACUUGGC

UUUCAGCAUCUUCACGGCCA

199
TCTGTACGGTGACAAGGCGUNNNACTNN
1195
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCAGUGUCAUGGGCAAG

ACCAUUCUGUUCUCUCUGGCA

200
TCTGTACGGTGACAAGGCGUNNNACTNN
1196
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGUGUCUGUCCUGGGAGT

GAAUCCUGCUGCCACACAUUG

201
TCTGTACGGTGACAAGGCGUNNNACTNN
1197
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUGCUUUUAGGGCCCACC

CUUGGAGCUGGAGCUCUUGUG

202
TCTGTACGGTGACAAGGCGUNNNACTNN
1198
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAAGCCCGCUCAUGAUCAA

CAGCAUCCAACAAGGCACUGA

203
TCTGTACGGTGACAAGGCGUNNNACTNN
1199
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGCACUGGGUCAAAGUCT

GGAUGCCUGACCAGUUAGAGG

204
TCTGTACGGTGACAAGGCGUNNNACTNN
1200
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCAAGAAUCGCCCGAGCC

GGAUGAGGAAGUAGCCUCCCA

205
TCTGTACGGTGACAAGGCGUNNNACTNN
1201
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGAUGGGACCCACUCCAT

CUUGGGCACUUGCACAGAGAT

206
TCTGTACGGTGACAAGGCGUNNNACTNN
1202
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCAAAAUGGCCCGAGAC

CGUUGAACUCUGACAGCAGGT

207
TCTGTACGGTGACAAGGCGUNNNACTNN
1203
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCAGGGCUUCUUCAGCA

AUUCGGACACACUGGCUGUAC

208
TCTGTACGGTGACAAGGCGUNNNACTNN
1204
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCGGGACAUGGACUCAAC

CCCUUCUCUGUCUCCCUUGGA

209
TCTGTACGGTGACAAGGCGUNNNACTNN
1205
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAUGUACUGGUCCCGCAT

GUCCCGUGAGCACAAUCUCAA

210
TCTGTACGGTGACAAGGCGUNNNACTNN
1206
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGAAUGCCAACCCAUGGA

GCACCUUCAUUGGCUACAAGG

211
TCTGTACGGTGACAAGGCGUNNNACTNN
1207
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAGGCGAGGAGCUCCAGUC

GUUUUUCCCUCAGGCCCUCAT

212
TCTGTACGGTGACAAGGCGUNNNACTNN
1208
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGUGAGGCUCCCCUUUCUT

UUCCUACAGUACUCCCCUGCC

213
TCTGTACGGTGACAAGGCGUNNNACTNN
1209
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCCCUCUGACGUCCAUC

CACUUCUCACACCGCUGUGUT

214
TCTGTACGGTGACAAGGCGUNNNACTNN
1210
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGUGCCCAUCAAGUGGAT

CCCAUGACACACCAUAACUCC

215
TCTGTACGGTGACAAGGCGUNNNACTNN
1211
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCUCCAGCUUCUUCUGCA

AACCCUCCUGAUGUACACGGT

216
TCTGTACGGTGACAAGGCGUNNNACTNN
1212
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCGGGCUUGGUUCUGAUGT

CUUGCCUUUCUCCCCAACCAG

217
TCTGTACGGTGACAAGGCGUNNNACTNN
1213
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUCCUGAAGCAGGUCAAC

ACCUUCAGCACUCUGCUUGUG

218
TCTGTACGGTGACAAGGCGUNNNACTNN
1214
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGUGAGGGUGUCUCUCUG

UUCUAUCGGCAAAGCGGUGUT

219
TCTGTACGGTGACAAGGCGUNNNACTNN
1215
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUAAAUACGGGCCCGACG

ACAUUGGGAGCUGAUGAGGAT

220
TCTGTACGGTGACAAGGCGUNNNACTNN
1216
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCCCCUCCAUUGUGGGC

ACUUCCUACAGGAAGCCUCCC

221
TCTGTACGGTGACAAGGCGUNNNACTNN
1217
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAUCCGAAAGCAGUCCAA

AGGUGGCACCAAAGCTGTAUT

222
TCTGTACGGTGACAAGGCGUNNNACTNN
1218
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGGCAGGAGUCAAGAUGC

AGUCUUCCCCACUUCUGCCUT

223
TCTGTACGGTGACAAGGCGUNNNACTNN
1219
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGGGCCCCUGGAUGGAUA

AUGCUUUCAGGAGGCAUCCAG

224
TCTGTACGGTGACAAGGCGUNNNACTNN
1220
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGUAGCAGCCGUCUGUCUC

AUUCCGAUGUCAGCACCAAAG

225
TCTGTACGGTGACAAGGCGUNNNACTNN
1221
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUACACACUGCAGCCCAAG

CACAGUGAUAGGAGGUGUGGG

226
TCTGTACGGTGACAAGGCGUNNNACTNN
1222
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUACUAUCCCUCGGGAGGC

CACCGUUCCACCUGAAAGACT

227
TCTGTACGGTGACAAGGCGUNNNACTNN
1223
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUUUAAGGCCCCAGCGUC

CCCCUGCUCUUCAAUACAGCC

228
TCTGTACGGTGACAAGGCGUNNNACTNN
1224
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACCGGAUUCGCAUGUGUG

CCCUCAGCUACCAGGAUGUUT

229
TCTGTACGGTGACAAGGCGUNNNACTNN
1225
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCAGCUCCUCUGACAGC

CCUCUUCGAACCUGUCCAUGA

230
TCTGTACGGTGACAAGGCGUNNNACTNN
1226
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCUCCACCCCAGCAAAAC

CCUGCUCAGUGUAGCUAGGUT

231
TCTGTACGGTGACAAGGCGUNNNACTNN
1227
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCAUUCAUGCCCCUCCUGG

CUACACUUGGCUGGGCAAAGA

232
TCTGTACGGTGACAAGGCGUNNNACTNN
1228
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGAAGUGCAAGGCACUGC

CUCCAUCCUGAGUCAUGGCUT

233
TCTGTACGGTGACAAGGCGUNNNACTNN
1229
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCACGUGCAGCACAUGG

CUCCCUCUGGAAAUCCUUCCG

234
TCTGTACGGTGACAAGGCGUNNNACTNN
1230
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGUUUCACGCCACCAACUT

CUCGCUGAGAUUGAACUGGAG

235
TCTGTACGGTGACAAGGCGUNNNACTNN
1231
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUUCUGGGCUGGGUGUGA

CUGAGUCCUCCUCACCACUGA

236
TCTGTACGGTGACAAGGCGUNNNACTNN
1232
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACGCUGGCCUAUAAGGUGC

CUUCGCCUAGCUCCCUUUUCA

237
TCTGTACGGTGACAAGGCGUNNNACTNN
1233
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUGACCUCCCAGACCGAG

GAAGACAUGAGCUCGAGUGCT

238
TCTGTACGGTGACAAGGCGUNNNACTNN
1234
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGGGCAUCACUGUCCUCG

GAAUAUGUGGAAGCCCACAGC

239
TCTGTACGGTGACAAGGCGUNNNACTNN
1235
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCUUGAGCAGCAGCUGAG

GCAGCAAGUCCAACUGCUAUG

240
TCTGTACGGTGACAAGGCGUNNNACTNN
1236
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACAGCUCUCUGUGAUGCG

GCAGGCUGGACGUACAUUCUT

241
TCTGTACGGTGACAAGGCGUNNNACTNN
1237
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACGACGGGAGGACAAUCUC

GCAGUGAUGCCUACCAACUGT

242
TCTGTACGGTGACAAGGCGUNNNACTNN
1238
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGCUGCAGGACUAUGAGG

GGAAUACUCCAGCUCACAGGG

243
TCTGTACGGTGACAAGGCGUNNNACTNN
1239
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCUUGAGCAUCGCAUCCA

GGAGCUUGCUCAGCUUGUACT

244
TCTGTACGGTGACAAGGCGUNNNACTNN
1240
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGGGACGUGAACGGAGUG

GGGUAGCAGACAAACCUGUGG

245
TCTGTACGGTGACAAGGCGUNNNACTNN
1241
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACACCCCCAGCUCCAGCUC

UUUUCCAGGAGAGAGACTCCAGA

246
TCTGTACGGTGACAAGGCGUNNNACTNN
1242
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCUUUCGAGCAGUACUCC

GGUGUCUUCAUCCUCGAUGGT

247
TCTGTACGGTGACAAGGCGUNNNACTNN
1243
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACAGUACCCGGCUGUAGA

GUGAUCCUUGCCAGGUAAUCC

248
TCTGTACGGTGACAAGGCGUNNNACTNN
1244
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGGCCAUAAAGGGCAACC

GUGGUUCGUGGCUCUCUUAUC

249
TCTGTACGGTGACAAGGCGUNNNACTNN
1245
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCAGCCCAGACCAUUCAG

UCCCAAAUUCUGCCAGGAAGC

250
TCTGTACGGTGACAAGGCGUNNNACTNN
1246
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCUCAGGCUACAUCUCGC

UCCUUCUCCAAGGCCAGAAUC

251
TCTGTACGGTGACAAGGCGUNNNACTNN
1247
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGACCACGGCAAAGAUG

UCUCCACUAGCACCAAGGACA

252
TCTGTACGGTGACAAGGCGUNNNACTNN
1248
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGUGGACGUGGAUUUGGG

UGGCCAAGCAAUCUGCGUAUT

253
TCTGTACGGTGACAAGGCGUNNNACTNN
1249
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAUCACAGAGCGAAGCUG

UAACGCCUGUUUUCUUUCUGCC

254
TCTGTACGGTGACAAGGCGUNNNACTNN
1250
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUGUUGUGAUCCGCCACT

CUGGCCAAGAGUUACGGGAUUC

255
TCTGTACGGTGACAAGGCGUNNNACTNN
1251
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUUCUGGGACUCAUGCCCT

AGGAGUGUGUACUCUUGCAUCG

256
TCTGTACGGTGACAAGGCGUNNNACTNN
1252
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCGCGACGACAAGAUCUG

GGGACAUUCACCACAUCGACUA

257
TCTGTACGGTGACAAGGCGUNNNACTNN
1253
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCUACAGCCCAGCCCAG

CUCUGAAUCUCUGUGCCCUCAG

258
TCTGTACGGTGACAAGGCGUNNNACTNN
1254
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGGCCAACAUUCAGCAGC

CCUUCCUGGUUGGCCGUUAUAT

259
TCTGTACGGTGACAAGGCGUNNNACTNN
1255
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGUUCUCUCCAUCGCCUT

UGAACUGCUAGCCUCUGGAUUT

260
TCTGTACGGTGACAAGGCGUNNNACTNN
1256
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCAGCUGCUUCCGUUGCUC

GACUUGGUGUCAUGCACCUACC

261
TCTGTACGGTGACAAGGCGUNNNACTNN
1257
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUUCCGAGGCUGGAAUGGA

GAAGGGAGUCACUCUGGUUUGG

262
TCTGTACGGTGACAAGGCGUNNNACTNN
1258
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUCAGAAGUCCAGCAGGC

CUGAAAUUGGUGUCGGUGCCUA

263
TCTGTACGGTGACAAGGCGUNNNACTNN
1259
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGGCUGUCAGAGCAGGAG

AGAUACUGAUCUCGCCAUCGCT

264
TCTGTACGGTGACAAGGCGUNNNACTNN
1260
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGACGAGAUCGCCAACAG

CUGGUUGGAGCGAAUCUGCUAG

265
TCTGTACGGTGACAAGGCGUNNNACTNN
1261
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGAGGAAGCCAUGGAGC

GAACAUGUGUGAGCACAGCAAC

266
TCTGTACGGTGACAAGGCGUNNNACTNN
1262
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCUGAUGGCUUGAAGGCG

UGCCUCUUGCUUCUCUUUUCCT

267
TCTGTACGGTGACAAGGCGUNNNACTNN
1263
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGUGUUUGCUGACGUCCA

GUUCUCCAGGUCGAAAGGGUAC

268
TCTGTACGGTGACAAGGCGUNNNACTNN
1264
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUACAACCAGCCCUCCGAC

CUGGAUCCUCAGGACUCUGUCT

269
TCTGTACGGTGACAAGGCGUNNNACTNN
1265
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGCCAGAGUCCGUCAUCG

CGUUGAAGCACUGGAUCCACUT

270
TCTGTACGGTGACAAGGCGUNNNACTNN
1266
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGGUACCAGCUCUCCAA

CCACAUCCUCUUCCUCAGGAUT

271
TCTGTACGGTGACAAGGCGUNNNACTNN
1267
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCACCUUACUGCCCAGGUG

AACGACCAAGUCACCAAGGAUG

272
TCTGTACGGTGACAAGGCGUNNNACTNN
1268
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACCCGCCAGCAUCCUUAG

CAGAGUUCAUGGAUGCACUGGA

273
TCTGTACGGTGACAAGGCGUNNNACTNN
1269
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGACUGCUCAGGGUGCC

UACUCCACAGUGAGCUCGAUCC

274
TCTGTACGGTGACAAGGCGUNNNACTNN
1270
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCCUGUCAUGAGACCUCC

ACAAGGCUGUUUUGGAGAUGGA

275
TCTGTACGGTGACAAGGCGUNNNACTNN
1271
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGCUCUUUCCAGCUGGCUA

ACAGCAUACAUGCAUUCCUCAG

276
TCTGTACGGTGACAAGGCGUNNNACTNN
1272
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACGAGGUUCCGGUGUGUC

ACCAUCGGUGUCAUCCUCAUCA

277
TCTGTACGGTGACAAGGCGUNNNACTNN
1273
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUCCCAGGACCUCCACUA

AGACUGUCUCGGACUGUAACUC

278
TCTGTACGGTGACAAGGCGUNNNACTNN
1274
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCGACACACUGUAGGCAGT

CAAACACUGCCGAGGUGAUUUT

279
TCTGTACGGTGACAAGGCGUNNNACTNN
1275
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCUGGGAACCUACUGUGG

CAUCAUUGCUGAUAACGGAGGC

280
TCTGTACGGTGACAAGGCGUNNNACTNN
1276
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGAAGCUGGACUACCGC

CCCAGAGCAAGGAAGUGUUAUC

281
TCTGTACGGTGACAAGGCGUNNNACTNN
1277
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUGGACCUCAGCAGCAUT

CCUCCCUCAGGACUGUAACAGA

282
TCTGTACGGTGACAAGGCGUNNNACTNN
1278
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCGUGGCUAUGCCUUCAT

CGAGCCCCCUAAAGUGAAGAUC

283
TCTGTACGGTGACAAGGCGUNNNACTNN
1279
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACAGGGAUUCCUCUUCCCC

CGCUUCCUUCAGGGUCUUCAUC

284
TCTGTACGGTGACAAGGCGUNNNACTNN
1280
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGGCACGGAACUGAACCA

CUUUCAAUGUUGCCACCACACT

285
TCTGTACGGTGACAAGGCGUNNNACTNN
1281
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGGAGAACCAGGACCUT

GACCUUGGCUGCAUGAAGUUUT

286
TCTGTACGGTGACAAGGCGUNNNACTNN
1282
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUUGACCGCAAGCUCCUCC

GAGCUUCCCUCUGGAUCUCUCA

287
TCTGTACGGTGACAAGGCGUNNNACTNN
1283
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCUUGCAAGCUGGUCAUT

GCAUCGUUUGUGGUUAGUGUCA

288
TCTGTACGGTGACAAGGCGUNNNACTNN
1284
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGGGACUCGUACGAGAA

GCCGTCTUCCTCCATCTCAUAG

289
TCTGTACGGTGACAAGGCGUNNNACTNN
1285
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGCAUGACAUGCAGACT

GGCUAUCUCCAGGUAGUCUGGG

290
TCTGTACGGTGACAAGGCGUNNNACTNN
1286
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCCCUUAGAGAGCUUGGG

GGGUGCUGUAUUCUGCAGGAUC

291
TCTGTACGGTGACAAGGCGUNNNACTNN
1287
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGCAUACCCGCCAUCUUCT

GGGUUGUAGUCGGUCAUGAUGG

292
TCTGTACGGTGACAAGGCGUNNNACTNN
1288
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCGGAUACAAAGGCGAC

GGUGGUGUUCAAAGAACUUGGA

293
TCTGTACGGTGACAAGGCGUNNNACTNN
1289
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAAAGCUGUGGCUGGAAACA

UCCUCCACAGUGAGGUUAGGUG

294
TCTGTACGGTGACAAGGCGUNNNACTNN
1290
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACAUCUGCUCCGGCUUAGC

UGAAGAUGACUUCCUUUCUCGC

295
TCTGTACGGTGACAAGGCGUNNNACTNN
1291
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCGACGACUUUAUCUGGGC

UUCACCAGCGUCAAGUUGAUGG

296
TCTGTACGGTGACAAGGCGUNNNACTNN
1292
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGUGACTGCUGCCACAAC

UUUCUGGCAUUGAUCUCGGCUT

297
TCTGTACGGTGACAAGGCGUNNNACTNN
1293
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGUACAUCCUGGUUGGG

AUCUUCAUCACGUUGUCCUCGG

298
TCTGTACGGTGACAAGGCGUNNNACTNN
1294
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAAACCCAACCGUGUGACC

CACAAGAACAGUGCAGAGGGUT

299
TCTGTACGGTGACAAGGCGUNNNACTNN
1295
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCAAGCCUGUCACCGUAG

CUGUUUCUGGGAAACUCCCAUUT

300
TCTGTACGGTGACAAGGCGUNNNACTNN
1296
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACGGCACACCCUACGUUA

GUGCUGGAAGCCUUUGUCUAUGA

301
TCTGTACGGTGACAAGGCGUNNNACTNN
1297
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAGUGCCACAACCUCCUG

UUCCAGACCAGGGUGUUGUUUUC

302
TCTGTACGGTGACAAGGCGUNNNACTNN
1298
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAUGGCUCCCAGCUUCCT

ACAGCAAAGCAGAAACUCACAUC

303
TCTGTACGGTGACAAGGCGUNNNACTNN
1299
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGGAUCCUCACAGAGCT

UUAGAGGGACUCUUCCCAAUGGA

304
TCTGTACGGTGACAAGGCGUNNNACTNN
1300
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGACGAAGUGAGUCCCACA

CUGAUAAAGCACCCUCCAUCGUT

305
TCTGTACGGTGACAAGGCGUNNNACTNN
1301
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAACGGGAAGCCCUCAUGUC

AUGACGGAAUAUAAGCUGGUGGT

306
TCTGTACGGTGACAAGGCGUNNNACTNN
1302
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGACUCUGGAUCCCAGAAG

CCCAAGCCUGGGACCUCUAUUAT

307
TCTGTACGGTGACAAGGCGUNNNACTNN
1303
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCUCCUUCUGGCCACCAUG

CAUCUGCAUGGUACUCUGUCUCG

308
TCTGTACGGTGACAAGGCGUNNNACTNN
1304
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUGCUGGACACGACAACAA

CCUCAUUUCUCCUCCAUCCUCAG

309
TCTGTACGGTGACAAGGCGUNNNACTNN
1305
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGACCCUGAAGGAUGCCAGT

CAACCUUGUCCUAACCUCUCUCC

310
TCTGTACGGTGACAAGGCGUNNNACTNN
1306
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGAUCGUUUGCAACCUGCUC

CUGGUUUACAGAGAAACCCACCA

311
TCTGTACGGTGACAAGGCGUNNNACTNN
1307
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGACAGGCUAUGUCCUCGUG

ACAUCAGAGAAAGGGACCCUAGT

312
TCTGTACGGTGACAAGGCGUNNNACTNN
1308
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCACAAGAGGCCCUAGAUT

GUUGCCACUUUCUCAACUUUCCC

313
TCTGTACGGTGACAAGGCGUNNNACTNN
1309
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCUACCUGACCGACGUUGA

UUUUUCCUCUCACUGGCUUCUCC

314
TCTGTACGGTGACAAGGCGUNNNACTNN
1310
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGACCAGCUCUUUCGGAAC

ACUGCUGUUCCUUCAUACACUUC

315
TCTGTACGGTGACAAGGCGUNNNACTNN
1311
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGUGAAGGUGCUUGGAUCUG

AUACCCCAGCUCAGAUCUUCUCC

316
TCTGTACGGTGACAAGGCGUNNNACTNN
1312
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUUCAGCAGGAAGUACCGT

GCAUGUUUGUUGGUGAUUCCAAG

317
TCTGTACGGTGACAAGGCGUNNNACTNN
1313
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGAGGUGGAAGAGACAGGC

AAAUGUGUAAAUUGCCGAGCACG

318
TCTGTACGGTGACAAGGCGUNNNACTNN
1314
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGAGGAGCUCUUCAAGCUG

AAUGCUUAUUCAUGGCAGGACCA

319
TCTGTACGGTGACAAGGCGUNNNACTNN
1315
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAAGACCCAAGCUGCCUGAC

AGAUGAUGAUCUCCAGGUACAGG

320
TCTGTACGGTGACAAGGCGUNNNACTNN
1316
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCUGUUGUGAAAAGGACGG

AGGUCUGUCCUCAAGGAAUGGAT

321
TCTGTACGGTGACAAGGCGUNNNACTNN
1317
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUGCUACAUACGGGCUGAA

CAUCACCACGAAAUCCUUGGUCT

322
TCTGTACGGTGACAAGGCGUNNNACTNN
1318
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGCCAUUGGCUCUAUGGAA

CAUCAGGAGUCUGUUGGACCUUG

323
TCTGTACGGTGACAAGGCGUNNNACTNN
1319
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGGCUAUCACAAGCUGCAC

CCAAACUGCUCCAGGUAAUCCAC

324
TCTGTACGGTGACAAGGCGUNNNACTNN
1320
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAUGUCUGGCUGUGAUGCT

CCAUCCUUCAUAGCUGUAUGCAC

325
TCTGTACGGTGACAAGGCGUNNNACTNN
1321
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGACGUACCAAACAGGCAC

CCCAAGAAAUCGAACUCCACAAG

326
TCTGTACGGTGACAAGGCGUNNNACTNN
1322
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCGGUUCCCACUGAUGACA

CCUCUUUGAGGUCUUGUCCAGUC

327
TCTGTACGGTGACAAGGCGUNNNACTNN
1323
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCAUUCAGCACCAGAGGCA

CGAUUCCUGGCUUUUCAUCUCUT

328
TCTGTACGGTGACAAGGCGUNNNACTNN
1324
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGCUGCUCAGUUACAGCAG

CUGACACCAGAUCAGAAAGGUCT

329
TCTGTACGGTGACAAGGCGUNNNACTNN
1325
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAAAUCUCUGGCCAACUCCG

CUGAGGAUUUCCAGCAAAUAGGG

330
TCTGTACGGTGACAAGGCGUNNNACTNN
1326
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAUGCAGAAUGCCACCAAG

GGUGAACUCCUGCAUGUCAUCAG

331
TCTGTACGGTGACAAGGCGUNNNACTNN
1327
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCACUCCUUGGAGCAAAAGC

GUAACAAUACCAGUGAAGACCCG

332
TCTGTACGGTGACAAGGCGUNNNACTNN
1328
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUGUUGGCCUGGCAGAAAA

GUAGUAGUGGUUGUGGCACUUGG

333
TCTGTACGGTGACAAGGCGUNNNACTNN
1329
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGAAGGAGAUUGCCCUGCT

GUCACAUUCAGGAUGUGCUUUCG

334
TCTGTACGGTGACAAGGCGUNNNACTNN
1330
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCACAGUUUGAGGCACAGG

GUUCCUCAGAUCAUUCUCCAGCT

335
TCTGTACGGTGACAAGGCGUNNNACTNN
1331
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCUCUCAUUGACCGGAACC

UACAGCUUCUCCCAGUAAGCAUC

336
TCTGTACGGTGACAAGGCGUNNNACTNN
1332
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCUCUCAUCGGCCAAUCA

UAUGGAGGCCAAUGCUCUCUUCA

337
TCTGTACGGTGACAAGGCGUNNNACTNN
1333
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUUCUGGACCAAGACGACT

UCUACAUUUGUAGGUGUGGCUGT

338
TCTGTACGGTGACAAGGCGUNNNACTNN
1334
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCCAUCCAGACCUACUCUG

UGACAGGAAGACCUUGAGGUAGA

339
TCTGTACGGTGACAAGGCGUNNNACTNN
1335
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGCCGCUAAAGAAGGGUC

UGGUUGAGGACUGUGAGACAGUT

340
TCTGTACGGTGACAAGGCGUNNNACTNN
1336
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGCAUCUCUCGCUGGUUT

UUAUCCUUAAGGAGCCCUGUGUG

341
TCTGTACGGTGACAAGGCGUNNNACTNN
1337
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGUGUCAUCCAGCCUUAGC

UUCAACACAGCUGUUGGUUUCUC

342
TCTGTACGGTGACAAGGCGUNNNACTNN
1338
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUACCAGAGGCAAUCCGCA

CAGACGUCACUUUCAAACGUGUAT

343
TCTGTACGGTGACAAGGCGUNNNACTNN
1339
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACAGCCGGAGGUCAUACUG

UCUCUUGGAAACUCCCAUCUUGAG

344
TCTGTACGGTGACAAGGCGUNNNACTNN
1340
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGAUAGUGGAUCCCAACGG

AGUGACAGAAAGGUAAAGAGGAGC

345
TCTGTACGGTGACAAGGCGUNNNACTNN
1341
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAGCAGAGGCAUAAGGUUC

UGGAGUUUGUCUGCUGAAUGAACC

346
TCTGTACGGTGACAAGGCGUNNNACTNN
1342
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAAAACGCCUGUGUUCCACC

GUCCUUCUCUUCCAGAGACUUCAG

347
TCTGTACGGTGACAAGGCGUNNNACTNN
1343
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCCGGCAAAUCACAGAUCG

CUGUGCCAGGGACCUUACCUUAUA

348
TCTGTACGGTGACAAGGCGUNNNACTNN
1344
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCUCCGGUGUGGAGUUCUG

UUGCCAUCAUUGUCCAACAAAGUC

349
TCTGTACGGTGACAAGGCGUNNNACTNN
1345
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCAACAUUGAAAGCCUCGT

UUGUUAACCUUGCAGAAUGGUCGA

350
TCTGTACGGTGACAAGGCGUNNNACTNN
1346
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACACAAGGGAGGUCCUCAA

UUUUUCAGCAUUAACAUGCGUGCT

351
TCTGTACGGTGACAAGGCGUNNNACTNN
1347
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGACGACGAGGAUGAGGAUG

GGCCAUGAAUUCGUCAGCUAGUUT

352
TCTGTACGGTGACAAGGCGUNNNACTNN
1348
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGAACAGAAACCCCUCCUC

GGUGACUGGAUCCACAACCAAAAT

353
TCTGTACGGTGACAAGGCGUNNNACTNN
1349
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCUGCAUCCAAUGGAUGCT

CUUGCAGCCAUGAUCCAAUUCUCA

354
TCTGTACGGTGACAAGGCGUNNNACTNN
1350
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUCUCUUCAUGGCCAGUGC

CUCCCAGAAUUACCAAGUGAGUCC

355
TCTGTACGGTGACAAGGCGUNNNACTNN
1351
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAUGGAGAGGCUGAAGCAG

GAAAGAGAAGUGCAUGUGCAAGAC

356
TCTGTACGGTGACAAGGCGUNNNACTNN
1352
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGAAGCCAUCAAACAGCUGC

GUGCCUUUAAAAAUUUGCCCCGAT

357
TCTGTACGGTGACAAGGCGUNNNACTNN
1353
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUUACAUACCCAGCACCGA

AGUGCCACUGGUCUAUAAUCCAGA

358
TCTGTACGGTGACAAGGCGUNNNACTNN
1354
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCUUGUGUCAAUGGAGGCA

UAAGGCCUGCUGAAAAUGACUGAA

359
TCTGTACGGTGACAAGGCGUNNNACTNN
1355
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACAGCAUCAAGGAUGUGCA

GAUCAUUGUUCCUUCCCCUCAGAC

360
TCTGTACGGTGACAAGGCGUNNNACTNN
1356
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCAAUACCUGCAGCUUCUG

AUCUGAUCCUAAAACCCAGCCUCT

361
TCTGTACGGTGACAAGGCGUNNNACTNN
1357
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACUACCAGGAUUGCCAACC

UUACAGCCCUGGAUUUGUCAAGUT

362
TCTGTACGGTGACAAGGCGUNNNACTNN
1358
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGAUGGAAACUUUGCUGCT

GCCGUUGUACACUCAUCUUCCUAG

363
TCTGTACGGTGACAAGGCGUNNNACTNN
1359
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUUGGAGCCCAUUCAGAGC

UGCAGUUGGUGGAACCAUUAACUC

364
TCTGTACGGTGACAAGGCGUNNNACTNN
1360
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCGCUCACCUGGAUGACAA

CUUCACCUUUAACACCUCCAGUCC

365
TCTGTACGGTGACAAGGCGUNNNACTNN
1361
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGAGGAGUACGUGGAGGUG

AAAAACUAUGAUGGUGACGUGCAG

366
TCTGTACGGTGACAAGGCGUNNNACTNN
1362
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUCUCCUUGGCCUCUCCUG

GUGAUGAUUGGGAGAUUCCUGAUG

367
TCTGTACGGTGACAAGGCGUNNNACTNN
1363
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCCCAACACUGUACCUCAG

AGACCCAAAGGGCAGUAAGAUAGG

368
TCTGTACGGTGACAAGGCGUNNNACTNN
1364
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGGGCAGGAAUCUGAUGAC

AAUAAGGUUCACAUCAGGAAGGGT

369
TCTGTACGGTGACAAGGCGUNNNACTNN
1365
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAGGGCAGCAACAUCUUUG

AAUAUGCUCAGACCAGUCAUCUGC

370
TCTGTACGGTGACAAGGCGUNNNACTNN
1366
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCUGCAACAGCAGCACAAA

AAUCUCCCAAUCAUCACUCGAGUC

371
TCTGTACGGTGACAAGGCGUNNNACTNN
1367
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCUGCAAUUCCUCGAACG

AGCAUCAAAUUUGCGCUGGAUUUC

372
TCTGTACGGTGACAAGGCGUNNNACTNN
1368
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGCCUCACUAAACUGUUGG

CAUCAUCUCCAUCUCAGACACCAG

373
TCTGTACGGTGACAAGGCGUNNNACTNN
1369
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGAAGGAGCUGGAGAAGCA

CAUUUUGAGAUGCUUGCAAUUGCC

374
TCTGTACGGTGACAAGGCGUNNNACTNN
1370
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGAGGAAAAGGUCGCCUC

CCCAGGUUUAUUAAAUUUCGCAGC

375
TCTGTACGGTGACAAGGCGUNNNACTNN
1371
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAAUUUAGAAGGGCUGGUGGC

CUCCUUCUCCGCACAUUUUACAAG

376
TCTGTACGGTGACAAGGCGUNNNACTNN
1372
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAAACCCCCUACAGAUGGC

CUCUUUGUCGGUGGUAUUAACUCC

377
TCTGTACGGTGACAAGGCGUNNNACTNN
1373
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACCAGUGGGAGGGUCUUAT

CUGGUCCAACUUCAUUUUCUGAGA

378
TCTGTACGGTGACAAGGCGUNNNACTNN
1374
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACAUCUGCAACAGCAAGCAC

GAGUCCAUUAUGAUGCUCCAGGUG

379
TCTGTACGGTGACAAGGCGUNNNACTNN
1375
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUACAAUGUCCUCCUGACAGC

GCCAAUUCACUGUGGUUUAAGUGC

380
TCTGTACGGTGACAAGGCGUNNNACTNN
1376
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUGAAGACAGGCCCAACUT

GCCAGAGUCAUAGCUGGAGUAACT

381
TCTGTACGGTGACAAGGCGUNNNACTNN
1377
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACGAGGCUGCAAGAGAGAUC

GGACAUCAGUGGUACUGAGCAAUA

382
TCTGTACGGTGACAAGGCGUNNNACTNN
1378
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGCACCCGAGGCAUUAUUT

GGUCUAUUCCUGUUGAAGCAGCAA

383
TCTGTACGGTGACAAGGCGUNNNACTNN
1379
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAGAAUGAGUACGGCAGCA

GGUGGCUAAUAGCUUCUUCUGUUC

384
TCTGTACGGTGACAAGGCGUNNNACTNN
1380
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGGACUCUCCCAUCACUCUG

GUCACAGCUGCAGUUGAAAAAGUT

385
TCTGTACGGTGACAAGGCGUNNNACTNN
1381
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGCUGAAGGAAGGACACA

GUUUUCCUUCCUUUAUCCCAGGUG

386
TCTGTACGGTGACAAGGCGUNNNACTNN
1382
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCAUGAGACUCAGUGCAGA

UAGCAGGUCAAAAGUGAACUGAUG

387
TCTGTACGGTGACAAGGCGUNNNACTNN
1383
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGACUCUGCUUCGCUGCAT

UAUCUCUUCCAUAGGCUCCUGCUG

388
TCTGTACGGTGACAAGGCGUNNNACTNN
1384
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGGAUAGCCUCCACCACCT

UAUGCUAUCUGAGCCGUCUAGACT

389
TCTGTACGGTGACAAGGCGUNNNACTNN
1385
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGUGAAUUAGGGACCGGGA

UCAGUCUCCAUGAUAGUGGUCCAG

390
TCTGTACGGTGACAAGGCGUNNNACTNN
1386
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCUCAAGGAGCCCUUUCCA

UGGACUUCCAUGUGCAAACACUAC

391
TCTGTACGGTGACAAGGCGUNNNACTNN
1387
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACCCCCAGUACUUCCGUCA

UGUGCUGUCCAUUUUCACUUUCUG

392
TCTGTACGGTGACAAGGCGUNNNACTNN
1388
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGAGUGCUACAACCUCAGCC

UUACCAAAAGGCAAAAUCCCACCA

393
TCTGTACGGTGACAAGGCGUNNNACTNN
1389
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUUCAGCCCAGAGCCUUUG

UUCACUUCCAAUAUUCUCUGCUGC

394
TCTGTACGGTGACAAGGCGUNNNACTNN
1390
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUCUGAAAGGCAGAGCAGG

UUCUGGAUUUCAGCUUUGGAAAGT

395
TCTGTACGGTGACAAGGCGUNNNACTNN
1391
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACCUACUCCCUCUCCGUGA

UUGCAGAAGGAACACCUAUUCGUT

396
TCTGTACGGTGACAAGGCGUNNNACTNN
1392
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCACUGCUGUGUCUGUAAACG

GCACACCAGAAAAGUCUUAGUAACC

397
TCTGTACGGTGACAAGGCGUNNNACTNN
1393
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCUCUGCGCAUUCAGGAGUG

UGUUUCCAAAUGACAACCAGGACAA

398
TCTGTACGGTGACAAGGCGUNNNACTNN
1394
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUACCAGAUGGAUGUGAACC

UCUUUGUGAUCCGACCAUGAGUAAG

399
TCTGTACGGTGACAAGGCGUNNNACTNN
1395
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGGACCUCCGGUCAGAAAAC

UGGUGAAACCUGUUUGUUGGACAUA

400
TCTGTACGGTGACAAGGCGUNNNACTNN
1396
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCACACGCAACUGUCUAGUGG

GCAAAUGUAAUCUACCAGGCUUUGG

401
TCTGTACGGTGACAAGGCGUNNNACTNN
1397
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCUACAGAUUGCGAGAGAGC

GAGCCAUAGUGGAGAGCUGUAAAUT

402
TCTGTACGGTGACAAGGCGUNNNACTNN
1398
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUUCCUGUGCAUGAAAGCACT

ACAUGUAUGCCAGCUGUUAGAGAUT

403
TCTGTACGGTGACAAGGCGUNNNACTNN
1399
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCUCUGUCACCAGGACAUUC

CACCCCAGCAAAGCAUUUUAAGAUC

404
TCTGTACGGTGACAAGGCGUNNNACTNN
1400
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACUUGGCAGCCAGAAACAUC

UACAUCAUGAGAGGAAUGCAGGAAT

405
TCTGTACGGTGACAAGGCGUNNNACTNN
1401
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACCACGUGACCUUGAAGCUC

CACUUAAUUUGGAUUGUGGCACAGA

406
TCTGTACGGTGACAAGGCGUNNNACTNN
1402
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCGUGUUCUUCAUUCGGCAC

CGGUGUCAGCCUCCACT

407
TCTGTACGGTGACAAGGCGUNNNACTNN
1403
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGUUCUGGAUCAGCUGGAUG

ACUUUGCGUGGUGUAGAUAUGAUCA

408
TCTGTACGGTGACAAGGCGUNNNACTNN
1404
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACACUCUUGAGGGCCACAAA

CUCUCCUUCCUCCUGUAGUUUCAGA

409
TCTGTACGGTGACAAGGCGUNNNACTNN
1405
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUCCUCAGGAGUCUCCACAT

CUCAGGACUUAGCAAGAAGUUAUGG

410
TCTGTACGGTGACAAGGCGUNNNACTNN
1406
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAUACUUACGCGCCACAGAG

UAUCACAGAAUUCCUCCAGGCUUCT

411
TCTGTACGGTGACAAGGCGUNNNACTNN
1407
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUGAUGAGCAGCAGCGAAAG

CACGGGAAAGUGGUGAAGAUAUGUG

412
TCTGTACGGTGACAAGGCGUNNNACTNN
1408
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCAAGCCCUCCAACAUCCUA

UGUGGGUCCUGAAUUGGAGGAAUAT

413
TCTGTACGGTGACAAGGCGUNNNACTNN
1409
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACUGACAACCACCCUUAACCC

CACCUGGAACUUGGUCUCAAAGAUT

414
TCTGTACGGTGACAAGGCGUNNNACTNN
1410
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACCCUCUCAGCGUACCCUUG

AACCAUAUCAAAUUCACACACUGGC

415
TCTGTACGGTGACAAGGCGUNNNACTNN
1411
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUUUGCUGGCUGCAAGAAGAT

GCUUUUCCAUCUUUUCUGUGUUGGT

416
TCTGTACGGTGACAAGGCGUNNNACTNN
1412
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCACUCACCAUGUGUUCCAUG

CCCAGUUGUGGGUACCUUUAGAUUC

417
TCTGTACGGTGACAAGGCGUNNNACTNN
1413
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUCGUACAUGACCACACCCA

AAAGAGAUCAUUUGCCCCAUCAAUT

418
TCTGTACGGTGACAAGGCGUNNNACTNN
1414
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAACGUUAGGUGGGACAGUAC

AAGAUCUAUGUCAUAAAAGCAGGGC

419
TCTGTACGGTGACAAGGCGUNNNACTNN
1415
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCAAGAGGCAGUUUCUGGCA

ACGGCGAUAUUUUGUCUGAUGUAGG

420
TCTGTACGGTGACAAGGCGUNNNACTNN
1416
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUUAGUCACUGGCAGCAACA

ACUAUCUGCAGGUUUCAUCUGAAUG

421
TCTGTACGGTGACAAGGCGUNNNACTNN
1417
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACAACGUGAUGAAGAUCGCA

ACUGCAUGCAAUUUCUUUUCCAUCT

422
TCTGTACGGTGACAAGGCGUNNNACTNN
1418
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUCUGCCUCUUCUUCUCCAG

AGUUGGUUGAACAGUUAUUUCUGCA

423
TCTGTACGGTGACAAGGCGUNNNACTNN
1419
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCUUUAGCCAUGGCAAGGUC

CAAGACCUCUCAGGUAUUGUAAGGG

424
TCTGTACGGTGACAAGGCGUNNNACTNN
1420
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCACGGAGUGUAUGACCAC

CAAGCUCAGAUAUUUGGGCUUCAAG

425
TCTGTACGGTGACAAGGCGUNNNACTNN
1421
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUCAGUCACUGGGAGAAGAA

CAGUGCUGUAUCAUCCCAAAUGUCA

426
TCTGTACGGTGACAAGGCGUNNNACTNN
1422
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCGGCUUUACACCAAAAGC

CAUUCAUCAGCUGUGUGUUCUGAAT

427
TCTGTACGGTGACAAGGCGUNNNACTNN
1423
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCCCUGCACUCUCAUCGCT

CCAGUCCCCAGGUAAUGUAAAUGUA

428
TCTGTACGGTGACAAGGCGUNNNACTNN
1424
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAUCCAACCAAUGGUGGACA

CCUUCUAGUAAUUUGGGAAUGCCUG

429
TCTGTACGGTGACAAGGCGUNNNACTNN
1425
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUUUAAUAACCCAGCCACGG

CUAGGUUUCAUGCUCAUAUCCGGUC

430
TCTGTACGGTGACAAGGCGUNNNACTNN
1426
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAAGUCCUCUCGGAAGGUAGC

CUGAUCCUCAGUGGUUUGAACAGUC

431
TCTGTACGGTGACAAGGCGUNNNACTNN
1427
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAAGAAGCAACUGAGAGCUG

GAUCGUCUCCUCUGAAAUGUCAUUC

432
TCTGTACGGTGACAAGGCGUNNNACTNN
1428
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAGCUCCCAGAAGUUGACAG

GCCAUCUCUUUAUCGGAGUCUCUUT

433
TCTGTACGGTGACAAGGCGUNNNACTNN
1429
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCAGGCAUUGCUACUCUGG

GGAAGUCAAAUAUUUGCCUCUCCAG

434
TCTGTACGGTGACAAGGCGUNNNACTNN
1430
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGUAGUAGACAUCACUCGCAC

UAAGGCAUUUCGCUCAACACUUUUC

435
TCTGTACGGTGACAAGGCGUNNNACTNN
1431
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCAUCUAGUCUUUCCGCUUC

UCAUAUGGCUAUCCCUUUGCAAUUC

436
TCTGTACGGTGACAAGGCGUNNNACTNN
1432
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUAUGGCACAAUCAGAGCUGT

UCCAUACUGCUCAACCUCUGCAAUA

437
TCTGTACGGTGACAAGGCGUNNNACTNN
1433
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAAGAUCAUGUGGCCUCAGT

UCCAUUUCUGAGAUCAGGUCUGACA

438
TCTGTACGGTGACAAGGCGUNNNACTNN
1434
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGCUUUGGAGCAGAAGAAGG

UCUAGCUGUAGCACAAAAUCUUCGT

439
TCTGTACGGTGACAAGGCGUNNNACTNN
1435
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUUGCAAAGACACAAGUGGG

UGACACCAACAUCUUUACUGCAGAA

440
TCTGTACGGTGACAAGGCGUNNNACTNN
1436
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCGGAACCUUUCUUCCCCUG

GUUUGGAUGAAUGGAGGUGAGGAAUT

441
TCTGTACGGTGACAAGGCGUNNNACTNN
1437
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCAACGAAAAGAGCUACCGC

CUUGCGCUUGUUAUACUCUUUAGUGC

442
TCTGTACGGTGACAAGGCGUNNNACTNN
1438
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCCUGACUUCCAGAAAACCA

AAAAGACUCGGAUGAUGUACCUAUGG

443
TCTGTACGGTGACAAGGCGUNNNACTNN
1439
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCACUGCUCUCAGUGAGAAG

CCUGACCCAAGAUGAAAUAAAACGUC

444
TCTGTACGGTGACAAGGCGUNNNACTNN
1440
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAUUAUGGGCAUCCCAGAAG

AGAGCCUAAACAUCCCCUUAAAUUGG

445
TCTGTACGGTGACAAGGCGUNNNACTNN
1441
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAAAUAUUGGGCCCUUCCUG

GGUGUGAAAUGACUGAGUACAAACUG

446
TCTGTACGGTGACAAGGCGUNNNACTNN
1442
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACUGAAGCUGUCAGGACAGA

GCUCACAGAAAUGUCUGCUAUACUGA

447
TCTGTACGGTGACAAGGCGUNNNACTNN
1443
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUUGGCUACCUUGGGACAUC

AGACUGCUAAGGCAUAGGAAUUUUCG

448
TCTGTACGGTGACAAGGCGUNNNACTNN
1444
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUGCAUCUCUUGUCGCAGGUT

GGUAAUAGUCGGUGCUGUAGAUAUCC

449
TCTGTACGGTGACAAGGCGUNNNACTNN
1445
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAACUGUGAGGAUGUGGCUGA

GUUCAAAUGAGUAGACACAGCUUGAG

450
TCTGTACGGTGACAAGGCGUNNNACTNN
1446
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUAAUCAAGCAGCAGCCAGA

UACCAGAUAGAACAGACACAGCUACT

451
TCTGTACGGTGACAAGGCGUNNNACTNN
1447
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAUUGGGACUCCUCUGCCCUG

CAAAGAUGCAGAGCUCUGAGUAGAAC

452
TCTGTACGGTGACAAGGCGUNNNACTNN
1448
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAAGCUGGUUUUGAAGUCGC

CUGCUUGAAGAUCAGAAGUUCCAAUG

453
TCTGTACGGTGACAAGGCGUNNNACTNN
1449
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCAGAGAGAGCAGCUUUGUG

CAUAGGCAAGAAGAUGGAACAGAUGA

454
TCTGTACGGTGACAAGGCGUNNNACTNN
1450
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAUUCAGCUCCUCUGUGUUT

AGAUGGAGAUGAUGAAGAUGAUUGGG

455
TCTGTACGGTGACAAGGCGUNNNACTNN
1451
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUUGAAGAGAUUGGCUGGUC

AAGAGAAAAGGAGAUUACAGCUUCCC

456
TCTGTACGGTGACAAGGCGUNNNACTNN
1452
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACAACUGCUACCAUGAGGGC

AGAAACCUGUCUCUUGGAUAUUCUCG

457
TCTGTACGGTGACAAGGCGUNNNACTNN
1453
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGUGGGAACGUGAAACAUCT

CCCAAAUAUCCCCAGUUUCCAGAAUC

458
TCTGTACGGTGACAAGGCGUNNNACTNN
1454
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUCACAUCUUCAGGUGCCUC

AUCUACUUCCAUCUUGUCAGGAGGAC

459
TCTGTACGGTGACAAGGCGUNNNACTNN
1455
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAUCUACAAGAAAGCCCCCA

GAUCUUCUCAAAGUCGUCAUCCUUCA

460
TCTGTACGGTGACAAGGCGUNNNACTNN
1456
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAGGUACCACCUUAUCCACA

CUCACAGAGUUCAAGCUGAAGAAGAT

461
TCTGTACGGTGACAAGGCGUNNNACTNN
1457
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACCUGGACAAGCACAUGGAG

GCUGAAACAAAAAGCACUCUUCUGUC

462
TCTGTACGGTGACAAGGCGUNNNACTNN
1458
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACAGUCUCUUGCAAUCGGCUA

ACAUGAUGGAUGUCACGUUCUCAAAG

463
TCTGTACGGTGACAAGGCGUNNNACTNN
1459
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCCGGGAAUUUCUUCGAAAA

CUUGUGAGUGGAUGGGUAAAACCUAT

464
TCTGTACGGTGACAAGGCGUNNNACTNN
1460
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACUUCAGUGGGCAUCGAGAT

CACAGCAGUCUUUCUUUCCCAUGUAA

465
TCTGTACGGTGACAAGGCGUNNNACTNN
1461
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCACAGACUGUUUCCACUCCT

AGCCUCUUGCUCAGUUUUAUCUAAGG

466
TCTGTACGGTGACAAGGCGUNNNACTNN
1462
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUAGAGGAUGCCGAGGAGAA

UCUCUUUAGGGAGCUUCUCUUCUUCC

467
TCTGTACGGTGACAAGGCGUNNNACTNN
1463
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCUGUGAUCGCACUGACAC

GCCAGAGAAAAGAGAGUUACUCACAC

468
TCTGTACGGTGACAAGGCGUNNNACTNN
1464
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGCAUCCUUGGCAGAAAGUG

ACACCUUGUCUUGAUUUUACUUUCCC

469
TCTGTACGGTGACAAGGCGUNNNACTNN
1465
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAACAACCUGUUGGAGCACAT

ACUAAUGAAUUCUUCUUCCUGCUCAG

470
TCTGTACGGTGACAAGGCGUNNNACTNN
1466
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCAGUGCAAGACUGAGACUC

AGUGAUCAGAGGUCUUGACAUAUUGG

471
TCTGTACGGTGACAAGGCGUNNNACTNN
1467
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCUCCAUCAGUGACCUGAAG

AUAAAUCAGGGAGUCAGAUGGAGUGG

472
TCTGTACGGTGACAAGGCGUNNNACTNN
1468
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGAGGCCUUCAUGGAAGGAA

CACUGACGGAAGUUCUCAUAAACGUC

473
TCTGTACGGTGACAAGGCGUNNNACTNN
1469
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGACUUCCACCAGGACUGUG

CACUUUGACCAAAGUCUCACUGACAA

474
TCTGTACGGTGACAAGGCGUNNNACTNN
1470
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUGACCAGUGCUACGUUUCCT

CAUCUUCAAAGUUGCAGUAAAAACCC

475
TCTGTACGGTGACAAGGCGUNNNACTNN
1471
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCACAUGGCGGAGAGUUUUA

CCUUUCACGAAUUCAUUUUCUUUGCG

476
TCTGTACGGTGACAAGGCGUNNNACTNN
1472
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUACUGUGCCACUUCAGUGUGC

CUAUCACAUUGUUCUCUCCAAUGCAG

477
TCTGTACGGTGACAAGGCGUNNNACTNN
1473
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUUCAGUGCCAUCAUCCUGG

CUCGUACAAGUCACAAAGUGUAUCCA

478
TCTGTACGGTGACAAGGCGUNNNACTNN
1474
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCCCUUCCACAGACGUCACT

GACAGACUUCUCUCACACAUUGUGUC

479
TCTGTACGGTGACAAGGCGUNNNACTNN
1475
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAACCGGAGCCUGGACCAUAG

GAGAGUGCAGUAUCAAGAAUCUUGUC

480
TCTGTACGGTGACAAGGCGUNNNACTNN
1476
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACGCUCUGGAGUCUCUCUCC

GAGAUAGUUUCACUUUCUUCCCAGCT

481
TCTGTACGGTGACAAGGCGUNNNACTNN
1477
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUAGAGCAAAUCCAUCCCCACA

GCCAUCUCCUCUUGCAUAAACAAGUT

482
TCTGTACGGTGACAAGGCGUNNNACTNN
1478
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCGAGCCACCAAUUUCAUAGGC

GGUGGAAAGUAAUAGUCAAUGGGCAA

483
TCTGTACGGTGACAAGGCGUNNNACTNN
1479
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCCAACCAAGCUCUCUUGAG

GUCUUCCCAACAAAUUUUGGGUGAAA

484
TCTGTACGGTGACAAGGCGUNNNACTNN
1480
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGUAUUCGAUGAUCCCUGUGG

UCAUCAUCAUCAUCAUCAUCCUCCGA

485
TCTGTACGGTGACAAGGCGUNNNACTNN
1481
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCCCCAAUGACCUGCUGAAAT

UGAUGCUUUGUUAAUGCGAAGUUCUG

486
TCTGTACGGTGACAAGGCGUNNNACTNN
1482
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGAAGCAUCUCACCGAAAUCC

UGUUGUACACUUUGAGGAGUGAUCUG

487
TCTGTACGGTGACAAGGCGUNNNACTNN
1483
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUAUGGUGGUGCCGACUACAAG

UUUUGUGAACAGUUCUUCUGGAUCAG

488
TCTGTACGGTGACAAGGCGUNNNACTNN
1484
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCUACCAGCUCACCAAGCUC

AACCAGACAGAAAAUUUCCACAUAAGC

489
TCTGTACGGTGACAAGGCGUNNNACTNN
1485
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACCCCCACUGAACCUCUCUUA

UGGGACACAAUUUGACAAAUAUGACCA

490
TCTGTACGGTGACAAGGCGUNNNACTNN
1486
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCAAUCCCCACACCAAGUAUCA

UAGCACAGUUUAAAAAUGAGGCCUACT

491
TCTGTACGGTGACAAGGCGUNNNACTNN
1487
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGCCUGAAGAUCCUACCUGAG

GAUUCUUAUAAAGUGCAGCUUCUGCAT

492
TCTGTACGGTGACAAGGCGUNNNACTNN
1488
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGCAAUCCGGAACCAGAUCAUA

UUGUGUGGAAGAUCCAAUCCAUUUUUG

493
TCTGTACGGTGACAAGGCGUNNNACTNN
1489
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGACUAGGCGUGGGAUGUUUUT

UGAAACUAAAAAUCCUUUGCAGGACUG

494
TCTGTACGGTGACAAGGCGUNNNACTNN
1490
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACUCCACACGCAAAUUUCCUUC

ACCAUACUCUACCACAUAUAGGUCCUT

495
TCTGTACGGTGACAAGGCGUNNNACTNN
1491
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUGCAGCAAAGACUGGUUCUCA

GCUUCUUUGAGUUUGUAUCUUGGAUGC

496
TCTGTACGGTGACAAGGCGUNNNACTNN
1492
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGAAUAACCAGCUGUCCUCCT

UGUGGAGUAUUUGGAUGACAGAAACAC

497
TCTGTACGGTGACAAGGCGUNNNACTNN
1493
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGGUAUUCUCGGAGGUUGCCUT

CAUGAACCGUUCUGAGAUGAAUUAGGA

498
TCTGTACGGTGACAAGGCGUNNNACTNN
1494
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUUCCCUCGGGAAAAACUGAC

CUGAUCUACAGAGUUCCAAAAGUGACA

499
TCTGTACGGTGACAAGGCGUNNNACTNN
1495
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCAAGCACAUGGAUCAGUGUT

CUUGAUUUCUUUUACUGACCCUUCUGC

500
TCTGTACGGTGACAAGGCGUNNNACTNN
1496
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAAGUACAAUUGCAGGCUGAACG

GAAGAUUUUCAAUCUCCUCUUGGGUUG

501
TCTGTACGGTGACAAGGCGUNNNACTNN
1497
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAAGCCCACAUAUCAGGACCGA

GAUCUUUGUGCUUACUCCUUCCUAGUT

502
TCTGTACGGTGACAAGGCGUNNNACTNN
1498
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACACCUGGACACCUUGUUAGAT

GGAAAUGUUUCCUAGACAAACUCGUCA

503
TCTGTACGGTGACAAGGCGUNNNACTNN
1499
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCUGGAUCUGCAGCUCUAUGG

GUGGAAUUGGAAUGGAUUUUGAAGGAG

504
TCTGTACGGTGACAAGGCGUNNNACTNN
1500
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCUACAGAGACACAACCCAUT

UGGAAGAUCUUAACUUCCCUUUCAAGA

505
TCTGTACGGTGACAAGGCGUNNNACTNN
1501
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAAUCGCAAGAGAAGCACCUT

UUCUGGUUGAGAGAUUUGGUAUUUGGT

506
TCTGTACGGTGACAAGGCGUNNNACTNN
1502
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGGCUCUUUCCACUAAACCAG

GUGUACGUUUGUCAGUUAUUAUAGUGCC

507
TCTGTACGGTGACAAGGCGUNNNACTNN
1503
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCAGUGUUUAGCAUUCUUGGG

AAAAUCUCCAGGCCUAACAUAAUUUCAG

508
TCTGTACGGTGACAAGGCGUNNNACTNN
1504
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCAAAACUACUGUAGAGCCCA

ACACAUGAAGCCAUCGUAUAUAUUCACA

509
TCTGTACGGTGACAAGGCGUNNNACTNN
1505
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCCUUUUGCUCCUGGUGGAAC

CAGCUCAGAAUUAACCAUAAAACUGGUG

510
TCTGTACGGTGACAAGGCGUNNNACTNN
1506
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACAUGUUGGAUGUGAAGGAGC

UCAUACCUACCUCUGCAAUUAAAUUUGG

511
TCTGTACGGTGACAAGGCGUNNNACTNN
1507
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUCCACCAAAGUCACCAGAGGG

AUUGAUUGUUUCUAAUAGAGCAGCCAGA

512
TCTGTACGGTGACAAGGCGUNNNACTNN
1508
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGUCUUAUAGCGGAAGAGGCAGA

UGUAGACUUGGAAUCUACUGAUAUCCCT

513
TCTGTACGGTGACAAGGCGUNNNACTNN
1509
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAGCCAUGGACACACUCAAGA

AAAUAAAGGACCCAUUAGAACCAACUCC

514
TCTGTACGGTGACAAGGCGUNNNACTNN
1510
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUCUUCUCCAUCGUCCAUGAC

UAGAAUGCCAGUUAAUGAAAACAGAACG

515
TCTGTACGGTGACAAGGCGUNNNACTNN
1511
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCAGAUUCCUCAUGGUCAUGGG

UGAAGACAGAUGGCUCAUUCAUAGGAUA

516
TCTGTACGGTGACAAGGCGUNNNACTNN
1512
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAAGAACUAGUCCAGCUUCGA

CAAUUCCUCUUGACUAUUCUACAGCAAA

517
TCTGTACGGTGACAAGGCGUNNNACTNN
1513
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAACCUGCGCAAACUCUUUGUUC

ACAAGAAUGAAAAGUCUUCAACACUUGG

518
TCTGTACGGTGACAAGGCGUNNNACTNN
1514
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCAAGUUGGUGAAAAGGCUUGG

CCAGAAAUGUUUUGGUAACAGAAAACAA

519
TCTGTACGGTGACAAGGCGUNNNACTNN
1515
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAAGUUGACCCUGGGUCUGAUC

AAGCAUCAGCAUUUGACUUUACCUUAUC

520
TCTGTACGGTGACAAGGCGUNNNACTNN
1516
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAGGAAAAGAGGAUGCUGGAG

GAAUCUCCAUUUUAGCACUUACCUGUGA

521
TCTGTACGGTGACAAGGCGUNNNACTNN
1517
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAACGUAAAAUGUGUCGCUCC

AGAGUUUUUCCAAGAACCAAGUUCUUCC

522
TCTGTACGGTGACAAGGCGUNNNACTNN
1518
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAUGUGCUGAAAAUCCGAAGUG

AUCUUCCACCUUAAAUUCUGGUUCUGUA

523
TCTGTACGGTGACAAGGCGUNNNACTNN
1519
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUUAAUGCCUCAGAAACCACA

CAUCAACUCAUGAAUUAGCUGGUUUCGA

524
TCTGTACGGTGACAAGGCGUNNNACTNN
1520
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACUUCUUGGCCAAGAGGAAGAC

GAUCUUCAAUGGCUUUAGUCUGUUCCAA

525
TCTGTACGGTGACAAGGCGUNNNACTNN
1521
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUUGGAGAUGGUUUCACAGCAC

UGGAGAGAGAACAAAUAAAUGGUUACCUG

526
TCTGTACGGTGACAAGGCGUNNNACTNN
1522
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGAAUCUCCCAGGCGGUAUUUG

UAUUUUCAGCCUUCUACUAGUCGAAAGCG

527
TCTGTACGGTGACAAGGCGUNNNACTNN
1523
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCGCACUGCCCCAAGUUUUACUA

AAAAUCCAAAUCAUAUACCAAAGCAUCCA

528
TCTGTACGGTGACAAGGCGUNNNACTNN
1524
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAAGACGAAAACUCUGCGGAAG

ACUUUUAACACUUCACCUUUAACUGCUUC

529
TCTGTACGGTGACAAGGCGUNNNACTNN
1525
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGAUCCUCUUCCCUCAGCUUCC

CGUUGAUGAUUUCUAACCUUUUCUGGUUT

530
TCTGTACGGTGACAAGGCGUNNNACTNN
1526
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAGAUCACUGAUGACCUGCACT

CUGGUUUCUGUAGAAUUCCAUGAGUAGUT

531
TCTGTACGGTGACAAGGCGUNNNACTNN
1527
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCUUCAAUGCACUGAUACACA

GCUCUGGUAGAAUUGACAUAUCUCAACAC

532
TCTGTACGGTGACAAGGCGUNNNACTNN
1528
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUAUGUCAGCGUUUGGCUUAACA

GGAGAUAUUUCACCUGACUUGAUUCAAGG

533
TCTGTACGGTGACAAGGCGUNNNACTNN
1529
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCACCAUAUACAGGAGCUCAGA

GUUUUUCUGGAUAAAAAGAGCCACUGUUC

534
TCTGTACGGTGACAAGGCGUNNNACTNN
1530
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGUCUUUGGAACCACACCAGAA

ACAACCCACUGAGGUAUAUGUAUAGGUAUT

535
TCTGTACGGTGACAAGGCGUNNNACTNN
1531
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGAUAGCUGCACUGAGUGUCA

AGAAAAUCAAAGCAUUCUUACCUUACUACA

536
TCTGTACGGTGACAAGGCGUNNNACTNN
1532
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACCUGGAUCCACAGGAAAGAA

UCCCAGAGAACAAAUUAAAAGAGUUAAGGA

537
TCTGTACGGTGACAAGGCGUNNNACTNN
1533
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGACUUGCUUCUGCACUAGACA

GCUUCUUUAAAUAGUUCAUGCUUUAUGGUT

538
TCTGTACGGTGACAAGGCGUNNNACTNN
1534
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGAUGACCUGGAAGAUGGAGUCT

UCCGAAUAUAGAGAACCUCAAUCUCUUUGT

539
TCTGTACGGTGACAAGGCGUNNNACTNN
1535
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCUGGAAGAAGCUGAAAAAGC

CAAUGCUUUUAAAUAUGUCAUUGUGGGCAT

540
TCTGTACGGTGACAAGGCGUNNNACTNN
1536
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAGAGAACGGUUGCAAAACUG

GUACAACAGAUUAUCUCUGAAUUAGAGCGA

541
TCTGTACGGTGACAAGGCGUNNNACTNN
1537
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGCUACAGUGAUGCCCACUACA

UAGAUAAUGCUUAAUAUUCACUUCCCCGUG

542
TCTGTACGGTGACAAGGCGUNNNACTNN
1538
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCUCAAACAGAACGGUCCAGUC

UUUUUCCAGUUUAUUGUAUUUGCAUAGCACA

543
TCTGTACGGTGACAAGGCGUNNNACTNN
1539
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUCAGCUAGAAGAGAAGCAGC

GAUCUACUGUUUUCCUUUACUUACUACACCT

544
TCTGTACGGTGACAAGGCGUNNNACTNN
1540
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGGAGUGAUUUGCGCCAUCAUC

AAAAUAUAGAACCUAAUGGAAGGAUUUGGUG

545
TCTGTACGGTGACAAGGCGUNNNACTNN
1541
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCCCCUGAUAGCAGAUUUGAT

GGGUAUGGCAUAUAUCCAAGAGAAAAGAUUT

546
TCTGTACGGTGACAAGGCGUNNNACTNN
1542
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUACCAGCUGAAGAGCGACAAG

ACCUUGCUAAGAGAUAUUCAUCUGUCUUUUC

547
TCTGTACGGTGACAAGGCGUNNNACTNN
1543
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAAACAUUCGUCUCGGAAACCC

AUUCUGAUCUGGUUGAACUAUUACUUUUCCA

548
TCTGTACGGTGACAAGGCGUNNNACTNN
1544
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUGGUGAUUUUGGCAUGAGCAG

CCCUUCUUAAAUUGCUCCUGUAUCAUUGAUT

549
TCTGTACGGTGACAAGGCGUNNNACTNN
1545
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCCGAGAUUGGAGCCUAACAGT

AAAAGAAUAUGAAAAGAUGAUUUGAGAUGGUG

550
TCTGTACGGTGACAAGGCGUNNNACTNN
1546
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCAGCAAGAAUAUUCCCCUGGCA

GGUUAGUAUGUUAUCAUUUGGGAAACCAAAUT

551
TCTGTACGGTGACAAGGCGUNNNACTNN
1547
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUCUCAUUUGCCUGGCAGAUCUC

AAAAUCUGUUUUCCAAUAAAUUCUCAGAUCCA

552
TCTGTACGGTGACAAGGCGUNNNACTNN
1548
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGACAUCAGCAAAGACCUGGAGA

CUAGAUAUGGUUAAGAAAACUGUUCCAAUACA

553
TCTGTACGGTGACAAGGCGUNNNACTNN
1549
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGAACUUGCUGGUGAAAAUCGG

AGAAUAGGAUAUUGUAUCAUACCAAUUUCUCG

554
TCTGTACGGTGACAAGGCGUNNNACTNN
1550
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUUCAUCCUGCACGAACAGAAAGA

GAAUUAAACACACAUCACAUACAUACAAGUCA

555
TCTGTACGGTGACAAGGCGUNNNACTNN
1551
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCCAGGCACUUGAUGAUACUCAC

UGUCUGUGUAAUCAAACAAGUUUAUAUUUCCC

556
TCTGTACGGTGACAAGGCGUNNNACTNN
1552
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCCUCUCUCUCUUGUCACGUAGC

UUCCCUUUUGUACUGAAUUUUAGAUUACUGAT

557
TCTGTACGGTGACAAGGCGUNNNACTNN
1553
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGGCCAUUUCUGUUUUCCUGUAGC

CUGGAAGCUUUAACUUCUUUAUUAAGUUCUUC

558
TCTGTACGGTGACAAGGCGUNNNACTNN
1554
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCUUCGCCUGUCCUCAUGUAUUGG

CUGUUCAAGAACUUCUGAAUUUAAAACAGUCT

559
TCTGTACGGTGACAAGGCGUNNNACTNN
1555
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCCGGGCUUUACGCAAAUAAGT

UAGUAAGUAUGAAACUUGUUUCUGGUAUCCAA

560
TCTGTACGGTGACAAGGCGUNNNACTNN
1556
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGGAUUUGACCCUCCAUGAUCAG

UCUUUGGCACAAUAUUAACUAGUCUAUUGUAG

561
TCTGTACGGTGACAAGGCGUNNNACTNN
1557
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAUCUGUACAGCAUGAAGUGCAAG

UGUCUGAUAUUCUUUCUCAUAUUUCUUCAGCT

562
TCTGTACGGTGACAAGGCGUNNNACTNN
1558
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUAGUCGUCAGCCUGAACAUAACAT

UUGCUCUUUUGAUUCUUUAAAUACAUCAAAGT

563
TCTGTACGGTGACAAGGCGUNNNACTNN
1559
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUUGCUUCUCAGAUGAAACCACCAG

GUUAUAUUGAAAAUGAUUAACAUGUAGAAGGGC

564
TCTGTACGGTGACAAGGCGUNNNACTNN
1560
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUGCUGCACCUUGACUUUAAGUGAG

UUAAGUGACAUACCAAUUUGUACAACAGUUAUC

565
TCTGTACGGTGACAAGGCGUNNNACTNN
1561
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCCGAGAAUGGUCAUAAAUGUGCA

CAACAUGCUGAUUCUUUUCAACGUUUUAUUUUC

566
TCTGTACGGTGACAAGGCGUNNNACTNN
1562
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUCACUAUGGAGCUCUCACAUGUGG

GUGGUAUUCUGUCUUUAAUUGUAAGAUAUGCAA

567
TCTGTACGGTGACAAGGCGUNNNACTNN
1563
TGACAAGGCGTAGTCACGGUNNNACTNNNTGAU

NTGAUACCCGAAGAAAGAGACUCUGGAA

UGUUCCUGUGUCAACUUAAUCAUUUGUUUGAUA

568
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUCACAUUGCCCCUGACAACAUA

569
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUAAGGGACCAGGGUCUAUGAAGC

570
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGUGUCCUUUCAGGAUGGUGGAUG

571
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACAGGAAGAGCACAGUCACUUUG

572
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGAUGCCCCCAAGAAUCCUAGUAG

573
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUCCUCAACCCUCUUCUCAUCAGG

574
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUUCUUUGAGGUGAAGCCAAACCT

575
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUCCCCUACCUAGACCCUCCUAAC

576
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGCUCCAGAAGCCCUGUUUGAUAG

577
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUCAUUCCUGUGUCGUCUAGCCUT

578
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGUUAUUAUGAGGAAGCUGUGCC

579
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUUUGAACUCCAAGCUGCUCAAG

580
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACGUCAUGGAGUAUAUGUGUGGG

581
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAACUACCACCUGUCCUACACCUG

582
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGUUGGUAUCCCUUCAGGACUAGG

583
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUCAGCAGACAAUAUCGGAUCGA

584
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACUUGGAGAAGCUGAGAGAAAAC

585
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUCCAGGUCAUGAAGGAGUACUUG

586
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGACCUUCAUGAGCUGCAAUCUCA

587
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUGUUUCAGUAUCCCUGCUCCAAA

588
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAAGAUGUCAUCAUCAACCAAGCA

589
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACUCCAUGUUCUUGGCCAUGCUA

590
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGAGCUGGUUCACAUGAUCAACT

591
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAUAUUUCUUCCGCAAGUGUGUCC

592
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAAACUCGAACUGAUUUCUCCUGG

593
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGCGCUGUCAACAGAAAGAAAAA

594
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCCAACGUUCAAGCAGUUGGUAGA

595
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUAACCAAGAGGAAGUUGGAGGUG

596
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGUAGAGGAGGUGUUUGAUGUUC

597
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGAUCCUCCUUGCUUACCACACAC

598
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGCCCUUCGAGAGCAAGUUUAAGA

599
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCAAGUGACUCUUCAGAUCCCUGC

600
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACAUGAAAGGGAGUUUGGUUCUG

601
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUGCUAAAAGAGAGGGAGAGUGAT

602
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGAGGAACUGGACUUCCAGAAGA

603
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUGGGAUCUUCGUAGCAUCAGUUG

604
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGCAGAACCAUCCACCAACAUAAG

605
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAGAGUUAAAUGCCCUCAAGUCGA

606
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUGGGUUUUUCCUGUGGCUGAAAA

607
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAGGACUGGGUGAAUGCUAUUGAG

608
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACCAGGGAUGAGCAGAAUGAAGA

609
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAAGACGGUCCGUAAACUGAAAAA

610
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCAGGGAUAUAUCCCCCAAAGGAT

611
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGUUAUUAAGGAGCUUCGCAAGG

612
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGCAUGUCCAGAGAUGUCUACAGC

613
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGCCGUAUUUGAAGCCUCAGGAAC

614
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUCCAGAAGUCCAGAGCUGAGAAG

615
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUAGACAUCUUCUCCCUCCCUUG

616
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUAUCGCAGGAGAGACUGUGAUUC

617
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCAGCAGAUGAAUCACCUUUCGUT

618
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUGAGGAUGCUCAAAGGGUUUUT

619
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAGGCUCCUGAGACCUUUGAUAAC

620
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGAUGAGCAAGACCUAAAUGAGC

621
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUAUUGUAAGCAGGCGAUGUUGT

622
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCCUUAGCUGUUGAAGGAAAACGA

623
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUGAAUUUCCUGAAGAACGUUGGG

624
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUACGUGAAGGAUGACAUCUUCCG

625
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGUGCCUUUGAAAAUCAACGACAA

626
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGACCUAAAGACCAUUGCACUUCG

627
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUCUUGUCAGGGAACAGGAAGAAUT

628
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAAUAAAACUUUGCUGCCACCUGT

629
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAACAACAGGAGUUGCCAUUCCAT

630
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGUGAUCCUAGUUUCUGGGCUCAA

631
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAUCAGGAAGAGGAAGAGUCCACA

632
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUAAAGAGGGACUGCCAUAACAUUC

633
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACUGCCUUCUGAAAGGUGGAAUC

634
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGUGGGAAUUGACAAAGACAAGCC

635
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUACAGCCCAAAGAUGAGAGUGAUT

636
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUGGUUUCAGACGCUGAAGGAUUUT

637
TCTGTACGGTGACAAGGCGUNNNACTNN

NTGAUUGAUGUGGACUGGAUAGUCACUG

638
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

AUUAACUCCGAGCACUUAGCGA

639
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

AUCUUCUAGCUCUCUGCCUACC

640
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

ACAGCCAUCAUCAAAGAGAUCG

641
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

UGUGAAGAUCUGUGACUUUGGC

642
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GCGAAUUCCUUUGGAAAACCUG

643
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

UCCAGUGUGCCCACUACAUUGA

644
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

CUUUUUCAGAGUGCAACCAGCA

645
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UGCAAGCAAAAAGUUUGUCCAC

646
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUGGUGUAGCACUGACAUUCAT

647
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UGGGUCACUGUAUGGGAUGUAG

648
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UGAUUUGCCAAGUUGCUCUCUT

649
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

UUUCUGUCCACCAGGGAGUAAC

650
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UACUGCCAUCGACUUACAUUGG

651
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GAUAGUGGUGAAGGACAAUGGC

652
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

CCUCAUGUACUGGUCCCUCAUT

653
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GAAUUAGCUGUAUCGUCAAGGC

654
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCG

AUGCUGAGAACCAAUACCAGAC

655
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CUGCUGGAUCAUGUGAGACAAC

656
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

UCUGGAUACAUGCCCAUGAACC

657
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

CAGAAUCUUGUUGGCUGCAUUG

658
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AGAAUGUGAAAAUUCCAGUGGC

659
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GAGUGUAUCCUGGAGGUUGUUG

660
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GACCAUGUGGACAUUAGGUGUG

661
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UGAAGAAGACCUUUGACUCUGT

662
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GGAGGAGGAUGAGAUUCUUCCA

663
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CCUAGAAGACUCCAAGGGAGUA

664
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

GACGACAUAUACCUGUGUGCUA

665
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

UGCUACGAAGUGGGAAUGAUGA

666
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AGGCUACCAUUAUGGAGUCUGG

667
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UUACAAUGGCAGGACCAUUCUG

668
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GGAAGUGGUCAUUUCAGAUGUG

669
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GAGAUGCGCCAAUUGUAAACAA

670
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

UUUCUCCUUCAGACAAUGCAGT

671
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

UCUUCCAGCUUAAGAAUGAACC

672
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

ACAGAAGCUGAUGGGCCAGAUA

673
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AGAAUUACCAAGCUAGGGAAGC

674
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

GUUAAAGUCUCUCUUCACCCUG

675
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

CCCAUCUAUGAGUUCAAGAUCA

676
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

GAGUGGCGGAAAGCAAUAAAAT

677
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

ACAAAGGGUGGAUGAAAUUGAT

678
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

AGUGAUGAUCUCAAUGGGCAAT

679
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCG

AGGUGUUUUUACCACCAAGACT

680
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

AAAUGACUGUGUCCAGCAAGUT

681
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

AGCUGCCUACAUAAAGGAAUGG

682
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UUGCAAGAUGAAAGGAGAAGGG

683
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

ACUGGAAAGGAAGAGAUUCAUG

684
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

CUGGAGGAGAUGGUCAAGAAUC

685
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GUACACAUGUACAAUGCCCAAT

686
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

ACUUUUUGGAUACUUUGUGCCT

687
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCG

CAAUUUAUGUUUUCCAAGCCAC

688
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

UCCCUGGAUAUUCUUAGUAGCG

689
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AGCUCGAAUUCCAGAAUGAUGA

690
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UCAGCGAGGAAGCUACACUUUT

691
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AUCAAGUCCUUUGACAGUGCAT

692
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AGAAGUGGUUUCCUUUCUCACC

693
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GUUUCGGACAGUACAAAGAACG

694
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AGUCCAAGUUGCUUCUCAGUCT

695
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

UUGGAGCAAGAAAAGGAAUUGC

696
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

CAAUCCAGAAAACCUUCCAUCG

697
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GUGUGCCAGAUACCAUUGAUGA

698
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AUCAUUAUUCUGGCUGGAGCAA

699
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GUAGGCUUUUGUUUCGUUUGUG

700
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GGCAACAAACAAGAUACUGGUG

701
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

GUGGCUUUUGACAAUAUCUCCA

702
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AAUAACUCCUCGGUUCUAGGGC

703
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UCUGAGUAUGAGCUUCCCGAAG

704
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

GUCCAUCUUUUUAAGGGAUUGC

705
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

CAUUACGUCAACGCAACGUCUA

706
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CACACAUAAACGGCAGUGUUAA

707
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GAAAAGCCUGUUUACCAAGGAG

708
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AGAUCUUCACCUAUGGAAAGCA

709
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCG

UGUUGUGGGAGAUUUUCACCUA

710
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

UGACUUUUACUCCAGGCUAACUT

711
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

UCCUGGUCAUUUAUAGAAACCGA

712
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

UUCGUGGGCUUGUUUUGUAUCAA

713
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UUGAAUGUAAGGCUUACAACGAT

714
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

GGUUCUGGAUUAGCUGGAUUGUC

715
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GUGCCUCCUUCAGGAAUUCAAUC

716
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

CCAAGUUCUUUCUUUUGCACAGG

717
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GUGGGCUACAAGAACUACCGAUA

718
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

CCUCACCAUAGCUAAUCUUGGGA

719
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

AGCACUUCUGCAUUGGAACUAUT

720
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GCAUUGUGUGUUUUUGACCACUG

721
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

CUCAUUCCUUUUUCCUCUGUGUA

722
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

CAGCCAAGUAGAAUGUGAAAGAC

723
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UUUUCCUCCUACUCACCAUCCUG

724
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UCAAAUUGUUGCCAUUUCAGGGT

725
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GCCUGUUUUGUGUCUACUGUUCT

726
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

ACCAGAGCUUCAAGACUGUUUAG

727
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CUCCUCCUCUUCCCUAGAUAACT

728
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

UCAUUCUUGAGGAGGAAGUAGCG

729
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

CUCUACCUCCAGCACAGAAUUUG

730
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUAGACAACUACCUUUCUACGGA

731
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CUUGUUGGUGUCCAUUUUCUUGT

732
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

UCCUCUUCAGAGGAGAAAGAAAC

733
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

ACUAAGAAUGGGAAGGAGUCACC

734
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CUGUUCCUCCCAGUUUAAGAUUT

735
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

CAACACAAGAGAAAAUAUUUGCT

736
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

AUCUCAUUAAUGACAAUCAGCCA

737
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GCAGAGGCAUCUGUAAAGUCAUG

738
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

CAGUUGAAAAACUCCUAGAAGCC

739
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GCAGUACACUACCAACAGAUCAA

740
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUACCAGCUUUGACAAUACAGGA

741
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

CUCUGAGAAGUAUGUCUGAUCCA

742
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AAAGUACCAAUCAGAAGGACGUG

743
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CAGAGCCAGAAUUUUGCAGAAGA

744
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AACCAGAUGCAGUAUGAGUACAC

745
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CAAGUCUUAUGGUUCUGGAUCAA

746
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AGUGAAACUGUGUGAGAAGAUGG

747
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GUCUAACUCGGGAGACUAUGAAA

748
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AAAGAAACUCUUUCAUCUGCUGC

749
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CUGGCGUUGGUGUUUUCAAAAUA

750
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UGAAUAACAACUUGAGUGACGAG

751
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AGAUGCUGAAAUCCAGAAGCUGA

752
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

CUAGCUGCCAAGUACUUGGAUAA

753
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

AAUGCUGUUUCCUUUACCUGGGA

754
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

GUCAAAGAAUAUGGCCAGAAGAG

755
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

AAACCAACAGCUCACAAAGGAGA

756
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GAAGAGCAUCAACAAGAAGACCA

757
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UGAAAUCCGCCUGAAUGAACAAG

758
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UUAAGGUUGAAGUGUGGUUCAGG

759
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

UUCUUUCUCAGAAAGCAGAGGCT

760
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

UAUCAACAUCACGGACAUCUCAA

761
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UCUAAAGAUCAAAACACCCCUGT

762
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

GAACGAGUAAAUCUGUCUGCAGC

763
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AUCGUGAUUCAGGAGACAAUUCT

764
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

CUGUUUCUGGUGUUAUCAGUGAC

765
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

CAUGGCACUAGAAGAACGCUUAG

766
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

AAAGACAAAUGUGAAAUUGUGGG

767
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

CACAUUCAUUCAUAACACUGGGA

768
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UAAUCAGCAAGCUUUCUCUGCUG

769
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GAUGCAAGCAGUUAUUGAUGCAA

770
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

AGCAAGAAGGAAGUGCCUAUCCA

771
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CCACAGCUAAUUUGGACCAAAAG

772
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UUCUUCGUCUUAUCUUUGGGACC

773
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GGAAGCCAGAGUUUAUUAACUGC

774
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AUGAGAAGAAGCACCAUGACAAT

775
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AGGGUAAAGUUCACAAAAGACCA

776
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

CAAAAAUGUGCAUACUCACAGAG

777
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

UAUCAUCUCCUGAAGCAACAUCT

778
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UGGAUAAUGAAAGACUCCUUCCC

779
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

AGAUAGCAUACAAGAGACCAUGC

780
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AAGACCAAGAAGAACUUACUCCCT

781
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

GAUCUAUUUUUCCCUUUCUCCCCA

782
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

CAAGAGGCUUUGGAGUAUUUCAUG

783
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

AAAUGCUGAAAGCUGUACCAUACC

784
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

UAGGUGAAUACUGUUCGAGAGGUT

785
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CCAUGCCUUUGAGAACCUAGAAAT

786
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

CAAAAGGAAGUAUCUUGGCCUCCA

787
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

ACGCUGUGCCAAUUUUGUAAAUGT

788
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

AUCAUGUUGCAGCAAUUCACUGUA

789
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

CUUUACCCUGUAAUAAUCCGUGCT

790
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GUGAUGAGAGUGACAUGUACUGUT

791
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GUCCCAACCAUGUCAAAAUUACAG

792
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

UGCCAACAUGACUUACUUGAUCCC

793
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

ACCCUCUUCAGCUCAGUUUCUUUC

794
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UGAGAUCCAUUGACCUCAAUUUUG

795
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UGGACCCCAAGCUUUAGUAAAUAT

796
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AUACCCCCUCCAUCAACUUCUUCA

797
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CCAAAGAUCAAAGAGACGAAGUCT

798
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

UAGAGAACUACCCUGGAAUGACCC

799
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUGCUUACCUGAGGAACUUAUUCA

800
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

ACAUUACAUACUUACCAUGCCACT

801
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

GAAGCUGUCCAUCAGUAUACAUUC

802
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

UCUAUAUCCAUCUCCAUGUCCUCT

803
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

UUAUUGUGGCCUGUUUGACUCUGT

804
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

ACUCUUUACUUCAAACUCUGAGCC

805
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

CGUCUUCGGAAAUGUUAUGAAGCA

806
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GAGAGUACUGAAUUCUUGCAGCAG

807
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

UUCAAAAUCAAGUUUGCUGAGACT

808
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

AAUUCACUAACAAGAAAACAGGGA

809
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AUACACAGACAAACUCCAGAAAGC

810
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

ACUGUUUGCUCCUAACUUGCUCUT

811
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GAGAGGAAAGUCCCUUAUUGAUUG

812
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

AGUGUGGUGGAGUUCAGUUUCUAT

813
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GACCUUGCAGAAAUAGGAAUUGCT

814
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AGAAAAUGAAAAGGAGUUAGCAGC

815
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GUAACACAUCUUCUCAACCAGGAC

816
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

GAUUUUUCUUACCACAACAUGACA

817
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

UUCAGUUUGCUGAAGUCAAGGAGG

818
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AUUUCUUCUGAUGGUAGCUUUUGT

819
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GUCUACAAAAAGACCUGCUAGAGC

820
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

GUGGAUGAAACUUUGAUGUGUUCA

821
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AAUUAUGGACCAGACUCAGUGCCT

822
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCG

GGAGGAAUUCAUCAUAUUCAACAG

823
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AAAGACAUGGAUGAAAGACGACGA

824
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

UAGGACUGUAGACAGUGAAACUUG

825
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

AUUGGGAUAUCCUUUCACUCUGCA

826
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GAUCGGGAAACACAAAAACAUCAT

827
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

CCUAGCUGAAUGCUAUAACCUCUG

828
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

CCUGGUUAUAGGAAAUUACACUGGC

829
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CCAAGCAAUUCUAUGCUAUACACAC

830
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

CAAAGAUUUGUGAUUUUGGUCUAGC

831
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GACCAACUUUUCCCAGUUUCUCAAT

832
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

CUUUCCUCUGGAGUAUCUACAUGAA

833
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

CAGCAUCAAGCUAUGUACGUAGUUC

834
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AUUUCUUGUUACUUUUUCCCCAGAC

835
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

CUGUUGUUUCACAAGAUGAUGUUUG

836
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UUGACAGUUAAAGGCAUUUCCUGUG

837
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

CUUCGGCUUUUUCAACCCUUUUUAA

838
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AAACAACAUUCAACUCCCUACUUUG

839
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AUCAUCAACAUCAACAUUGCAGACT

840
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

UGGCUGAUCUUGAAGGUUUACACUT

841
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

AGUAUUACAAUAGAGCUGGGAUGGA

842
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AGGAUCCUGUAAUUAUUGAAAGAGC

843
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

GAAGACUUGACUGGUCUUACAUUGC

844
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

GAAGAAGCAGAUCAGAUACGAAAAA

845
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

CAGUAAAGAGAUUGUGGCUAUCAGC

846
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCG

UAUACAAAGGAAACUCAGACUCCAG

847
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

UGAAUCAUUUGGAGGUGGAUUUGCT

848
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UCCACUUGUCAGUGAAGUUCAAAUA

849
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AGACUUGGAUCGAAUUCUCACUCUC

850
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AGGGAUCUUCCAGUAUGACUACCAT

851
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

AACGAAACAGACAGUCUUACAGAAG

852
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GAAACACUCAGAAAAACAGUUGAGG

853
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

AGAGGAAGAGUUAAGAAAGGCCAAC

854
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

AGAACAGGAUAUAACUACCUUGGAG

855
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

CAGUGAGAGACUUCAGUAUGAAAAA

856
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AUUAUGAGACCUACUGAUGUCCCUG

857
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

UCCUCAGGGAAUACUUUGAGAGGUT

858
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UUGCAAGCUGAUAAUGAUUUCACCA

859
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UGCUCCAGCACUAAGUGUAUUUAAT

860
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UCACUUUCAAUAUCACGAAGACCAT

861
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AGAAACCACUGGAUGGAGAAUAUUT

862
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AUCGGUAGCCAAGCUGGAAAAGACA

863
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

GUUACUAGUUUAGAAGAAUCCCUGA

864
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GUUAUCCAAGUUCCCAACACAGAUC

865
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

UGGAAAAAGAUUUAGCAGGCUAGAC

866
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GAAUGAAUCUGGCACAUGGAUUCAG

867
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

CGUGAUAGAAAAUAUACAGCGAGAA

868
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

UGGCUCAUAAAGCAUUUCUGAAAAA

869
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

CCUAUCAUAGUCAUUCAGUGAUUGUT

870
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

GUAGCUCCAAAUUAAUGAAUGUGCAT

871
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UCAUGUCUGAACUGAAGAUAAUGACT

872
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

CCCAAAUUGCUUCUGUCUGUUAAAUG

873
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

AAUGGUUUUCUUUUCUCCUCCAACCT

874
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

AGUGAUUAGUAAAGGAGCCCAAGAAT

875
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

UUAACUUACUUGCCACUGAAAAGUUG

876
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

GUGAAGAUCCCAUUGUCUAUGAAAUT

877
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CUUCCUAGAGAGUUAGAGUAACUUCA

878
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GUCUUUAUAUUCAUGACCUACUGGCA

879
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

GACCCCUUCCCAUCAAAAUUUUAUCT

880
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CCCUUUGGGUUAUAAAUAGUGCACUC

881
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CUUGACAAAGCAAAUAAAGACAAAGC

882
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

AAGGGAAAAUGACAAAGAACAGCUCA

883
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

GAAGAAAAGUGUUUUGAAAUGUGUUT

884
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

AAUCUUUUCUCAAUGAUGCUUGGCUC

885
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GGAACUGUGUGCAAAAUCUUCAAUUG

886
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

AGAAUAAAAUGUCUAGCAGCAAGAAG

887
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AUCAGAUCUGGACUAUAUUAGGUCCC

888
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AACAGAUAUCCAGAACUAGUGAACUT

889
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UGAAGAACUUAAAACUGUGACAGAGA

890
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UGAAAACCAAAUACGAUGAAGAAACT

891
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GUUGACAACUAUGAUGACAUCAGAAC

892
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

UGAAGACUUCCUAGAGAAUUCACAUC

893
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

AAAGGACAAGGUAAGAAGAAGACAAG

894
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

ACUGGAGAAGAUGAUGACUAUGUUGA

895
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

CUGUGAAGAAAAUGUGUGUUGAUUUT

896
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CAACAAACAGGACUAAGGAAAGGAAA

897
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UACUCAGCUGAAAAGCAGAGUUAAAA

898
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

GUUCUCACCCAUAUAUUGAUUUUCGT

899
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GAAAAUGAAGAGUUUGUUGAAGUGGG

900
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

AGUUCCUAGCAGAUUUAAUAGACGAG

901
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

GAAUUGGCUAUUCUUUACAACUGUAC

902
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AUUUCAAAGUGUUACCUCAAGAAGCA

903
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

ACAGAAUUGAAUCAGGGAGAUAUGAA

904
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

UAUAGUGAUCAGAGAUUAAGGCCAAG

905
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

AUAAAAUUCACAGGAAAUCAGAUCCA

906
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

UUUCAGGAGGUGUAAAACAAGAAAAA

907
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GAAAAAUGGCAAAGAAUUCAAACCUG

908
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UGUUCAAUUUUGUUGAGCUUCUGAAUT

909
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

AUCGGGAAGCAUAAGAAUAUCAUCAAC

910
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

AUUUAUUGGUCUCUCAUUCUCCCAUCC

911
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

CAAGUACAUAUCCUGUAAGACCAGAAT

912
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AGCCUAAUCUUUCAUUAUUACUGGGAA

913
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

CGGGUUAUUAACAUAUUUCAGAGCAAC

914
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

AAGCAGGGAUUUCAUUCAUCAUUAAGA

915
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UGUAAAUACGAAUCUUUCCAAAGGAGA

916
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GAAAGCGUUUGAGAAUCUUUUAGGACA

917
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GUUGAAGAUUUACCACUGAAACUGACA

918
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUUUUGGAAACAUACAGGAUAUCUACC

919
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

CAAUACUUCAGAAGACAAAUGUGAAAA

920
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

UUGAAGAAGCAUACAUGACAAAAUGUG

921
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

AUUUGGAUUUUCCUGCCUUAAGAAAAA

922
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UACAAACAUUUCAAGAAGACAAAAGAT

923
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

UGAUAAUUUGCAACAUAGUAAGAAGGG

924
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

AGUCAGAAUAUUCCUGUUCCUACUACA

925
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

CUCAAAGUAAACUAUUGUUAGCAACCAT

926
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

AUGCAAAUUAGUUUCUUGCAAGAGAAAA

927
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

AUAAACUUCAGAAAGAACUCAAUGUACT

928
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

CUGGAGAGAUAUGUCAAGUCUUGUUUAC

929
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

UGACAAAAAGCUUCAGAGUUCUCUAAAA

930
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GAGAAAGAAGAAGAAUUCCUCACUAAUG

931
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

AAAAGAACAAGAGAUGAAUUGAUAGAGT

932
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

UGCAAAUUUCACAGAGCCUCAGUUUUAT

933
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

AUACCAAAAGUUACCAAAACUGCAGACA

934
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AGAAGACCUUUCUGUGGAAAUAGAUGAC

935
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUU

GGAAAUUAUGGAAAUCAAGCAACUUCAA

936
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

GGAAAAGGAGCACUUAAAUAAGGUUCAG

937
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

CUUUCUUGAAAAUAAUCUUGAACAGCUC

938
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GCUUAAAGUUGAUAAAGAGAAGUGGUUA

939
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

CCGGCAAAUUAAAGCAAUUAUGAAAGAA

940
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GCUACAUCAAUCCUUGAGUAUCCUAUUG

941
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

ACUGCACUUUUAUUCAUCAAUUCAUAGA

942
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

CAAGAUAAAGUGAUUUCAGGAAUAGCAA

943
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AAAGGAAAAUCUGCAAAGAACUUUCCUG

944
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AAGGAAUUAGAGAAUGCAAAUGACCUUC

945
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AAAUGCAGUCAGAUAUGGAGAAAAUCCA

946
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UACUACGAAAUUCUUAAUUCCCCUGACC

947
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

UUCCUAAUAUGUAUUGGGAUGUUGGUAA

948
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UCUCUUCGUCAUGAUCAACAAAUAUGGT

949
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

AGAAAUGGUUUCAAAUGAAUCUGUAGACT

950
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

AUCAUAUUCACUAAGCGCUACUAGAAACA

951
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

CAUGUUCAUGCUGUGUAUGUAAUAGAAUG

952
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CAUCUCUAAGGUAUCUUCUAGAUCCAACA

953
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AAGCUUUAAAUGCACUAAAUAACCUGAGT

954
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AUAUGAUCAACUCCUGAAAGAACACUCUG

955
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GUUUACUCCAGUAAAAAUUGAAGGUUAUG

956
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

UAUUUGCGAUUAUUGAAGCUGCUUAAUGT

957
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

ACAGUUAAUAUGCCAGAAAAAGAAAGAAA

958
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

UCUGUGCUCAAUAAUCAGUUGUUAGAAAT

959
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUG

GAUUUGUUUCUCAUUCUCAUAUUUCACCA

960
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AAUAAUUCUGUGGGAUCAUGAUCUGAAUC

961
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

CCGGGUAUAAUAAUGAAGUUAAAAGAGCA

962
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

UCAUAUUCUACUUCAUUCAGAAGAUCAGG

963
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

CGAUUUAAUUCACAUUUAUAAAGGCUUUG

964
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

AACUAAAUUGGAGAAAAGCAUUGAUGACT

965
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGU

UUUCGAAUUUCUCGAACUAAUGUAUAGAAG

966
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

AAACAAAGUGGACAACUAGAAAGAUUUUGA

967
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UUCCUAAGUGCAAAAGAUAACUUUAUAUCA

968
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

CAUAACAGUUAUGAUUUUGCAGAAAACAGA

969
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUUCAGAAAUUUCUUCAAAUAAACAGAACC

970
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

AAGAAUGACAAAGAUAAGAAGAUAGCUGAG

971
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

UGUAGAUUUUAAUCUGAACUUUGAACCAUC

972
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

GAGGAACUCUUUACUAUGAAGUUAAUAGAA

973
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGG

AGAAGACAUCAACCAAUUAAUCAUAAAUAC

974
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAU

UAUUUUCAUGCUUUGGAGAUUGGAUAUAGG

975
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AUCCUUAUCAAUCAUCAAUGAAAAAGUACC

976
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

UGAUUUACUUGGAGAAGAUUUGCUAUCUGG

977
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GGAAGAAAAUCAUCAAUUACGAAGUGAAAA

978
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

GAAGAAAUCAAGAUUCUUACUGAUAAACUC

979
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

GAACCAAUGAGAGACUAUCUCAAGAACUUG

980
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUA

UGAUGAGACAGAUCCAUUUAUUGAUAACUC

981
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAA

GAACUUAAACGAAAAUUGAACAUUCUGACT

982
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

CAAGCUGGUAUUUUCAUACAAAUUCUUCUA

983
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UAGCUACACUGAAAAAUUAUAAUGAAGUAGG

984
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GUAAACUGACUCUAAACUUAAAAUCUUACCT

985
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

GAUAUUUAUCCAAACAUUAUUGCUAUGGGAT

986
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAC

UAAAUAGUUUAAGAUGAGUCAUAUUUGUGGG

987
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUUC

UCAUCUCUAAAGGAUUUAAUUACAAAGAUGC

988
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CUAUGUAGUCUCUGAAAAUGGAAGAAAAUAT

989
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

ACAAGAUAGAAGAUUUGGAGCAAGAAAUAAA

990
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCA

UGCAACUUACUGAAAAAUACUAUAAAUGACC

991
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UUAUUAAAGAACUUUCUAAAGUAAUUCGAGC

992
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCU

CAGGACUCAUUAUUUUAACAUUUGGGAGAAA

993
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGC

CCUAUAUUUGCAUUAAAAUGGAAUAAGAAAG

994
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUGA

AUUAAAUGCCCACAUAAAACUUUCUAAUUUG

995
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUAG

UUGCUAUAUUUACACUGAUGGUAGAAAUAAA

996
TCTGTACGGTGACAAGGCGUNNNACTNNNTGAUCC

UAUAUUACAGAUUCUAUUCAUGAACAAUGCT

Each N independently is A, C, G or T

	Number	Date	Country
Parent	16948046	Aug 2020	US
Child	17887727		US

COMPOSITIONS AND METHODS FOR ONCOLOGY PRECISION ASSAYS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATIONS

Provisional Applications (1)

Continuations (1)