The present application is being filed along with a Sequence Listing in electronic format. The Sequence Listing is provided as a file entitled ILLINC450WOSEQLIST, created Mar. 22, 2021, which is approximately 2 Kb in size. The information in the electronic format of the Sequence Listing is incorporated herein by reference in its entirety.
Embodiments relate to the preparation of nucleic acid libraries. Some embodiments relate to the preparation of normalized nucleic acid libraries, such as libraries in which the amounts of amplified nucleic acids are substantially the same for different amounts of input nucleic acids. Some embodiments relate to preparation of indexed nucleic acid libraries with certain adaptors.
As common practice, biological samples used to prepare nucleic acid libraries for downstream analyses are homogeneously processed according to standardized assay protocols without regard to customized procedures for each sample. An important quality control (QC) measurement is nucleic acid concentration of nucleic acid libraries since performance of many modern nucleic acid analysis technologies is dependent on the nucleic acid concentration of input nucleic acids. Since such downstream analyses typically use expensive reagents and incur significant costs to perform, samples not meeting nucleic acid concentration requirements and/or other QC measurements are simply discarded on the assumption that the sample preparation was intrinsically unsuitable for the desired application.
For example, in next-generation sequencing (NGS) applications, also known as high-throughput sequencing, it is often desirable but logistically challenging to prepare nucleic acid libraries with substantially uniform DNA molar concentrations. This problem often arises in the preparation of nucleic acid libraries constructed from a plurality of heterogeneous biological samples, particularly When a desirable source biological sample is disproportionately underrepresented. This problem additionally arises when a target nucleic acid molecule is a relatively rarer and/or more unstable nucleic acid species when compared to even other nucleic acids that are derived from the same biological sample. It is challenging to create a nucleic acid library from sources such as a genome, with relatively equal representation of different regions of the genome.
Some embodiments include a method of normalizing a nucleic acid library comprising target nucleic acids, comprising: (a) obtaining a substrate having a normalizing amount of capture probes attached thereto, wherein the capture probes comprise first amplification sites; (b) hybridizing a plurality of target nucleic acids to the capture probes; (c) extending the capture probes to obtain extended probes, wherein the extended probes comprise second amplification sites; and (d) amplifying the extended probes by hybridizing extension primers to the second amplification sites, wherein the amount of the extension primers is equal to or greater than the normalizing amount of capture probes, wherein the amplification is performed such that substantially all the capture probes are extended, thereby obtaining a normalizing amount of amplified target nucleic acids; wherein the first amplification sites and the extension primers are incapable of or are essentially incapable of hybridizing to one another.
In some embodiments, the first amplification sites and the extension primers are non-complementary to one another.
In some embodiments, the first amplification sites and the extension primers comprise non-complementary nucleotide sequences to one another.
In some embodiments, the first amplification sites comprise modified nucleotides that inhibit hybridization with the extension primers.
In some embodiments, the first amplification sites and the extension primers each lack the same type of nucleotide.
In some embodiments, the first amplification sites lack types of nucleotides complementary to one another, and each extension primer consists essentially of the same types of nucleotides as the first amplification site.
In some embodiments, the first amplification sites lack at least one type of nucleotide selected from adenine (A), cytosine (C), guanine (G), and thymine (T).
In some embodiments, the first amplification sites lack a combination of nucleotides selected from: adenine (A) and thymine (T); or guanine (G) and cytosine (C).
In some embodiments, the first amplification sites consist of a combination of nucleotides selected from: adenine (A) and guanine (G); adenine (A) and cytosine (C); cytosine (C) and thymine (T); or guanine (G) and thymine (T).
In some embodiments, the first amplification sites consist of a combination of adenine (A) and guanine (G) nucleotides,
In some embodiments, the extension primers are in solution.
In some embodiments, the capture probes comprise first indexes or first sequencing primer sites.
In some embodiments, the capture probes comprise first locus specific primers.
In some embodiments, the plurality of target nucleic acids hybridize to the first locus specific primers.
In some embodiments, extending the capture probes comprises ligating the first locus specific primers to the second locus specific primers.
In some embodiments, extending the capture probes comprises: (i) hybridizing second locus specific primers to the target nucleic acids; and (ii) ligating the first locus specific primers to the second locus specific primers.
In some embodiments, the second locus specific primers comprise the second amplification sites.
In some embodiments, the second locus specific primers comprise second indexes, or second sequencing primer sites.
Some embodiments also include: (iii) hybridizing extension oligonucleotides to the second locus specific primers; and (iv) extending the second locus specific primers with sequences complementary to the extension oligonucleotides by polymerase extension.
In some embodiments, the extension oligonucleotides comprise sites complementary to the second amplification sites, complementary to second indexes, or complementary to second sequencing primer sites.
In some embodiments, the target nucleic acids comprise sites complementary to the second amplification sites, and the extending comprises polymerase extension of the first locus specific primers with sequences complementary to the target nucleic acids.
Some embodiments also include preparing the target nucleic acids by adding adaptors to an end of the target nucleic acids, wherein the adaptors comprise sites complementary to the second amplification sites.
In some embodiments, adding adaptors comprises a tagmentation reaction.
In some embodiments, the substrate comprises a plurality of beads.
In some embodiments, the substrate comprises a flow cell.
In some embodiments, the amplification is performed under conditions such that the normalizing amount of capture probes limits the amount of amplification products.
Some embodiments also include sequencing the amplified target nucleic acids.
Some embodiments include a method of preparing a library of nucleic acids, comprising: (a) obtaining a substrate having a plurality of capture probes attached thereto, wherein the capture probes comprise first amplification sites and first locus specific primers; (b) hybridizing a plurality of target nucleic acids to the first locus specific primers; (c) hybridizing second locus specific primers to the target nucleic acids; (d) hybridizing the second locus specific primers to extension oligonucleotides, wherein the extension oligonucleotides comprise sites complementary to second amplification sites; and (e) extending the hybridized second locus specific primers to obtain a plurality of extended probes by: (i) ligating the first locus specific primers to the second locus specific primers, and (ii) extending the ligated second locus specific primers with sequences complementary to the extension oligonucleotides by polymerase extension.
In some embodiments, (i) comprises extending the first locus specific primers, and ligating the first locus specific primers to the second locus specific primers.
In some embodiments, the capture probes comprise first indexes, or first sequencing primer sites.
In some embodiments, the extension oligonucleotides comprise sites complementary to second indexes, or complementary to second sequencing primer sites.
In some embodiments, the substrate comprises a plurality of beads.
In some embodiments, the substrate comprises a flow cell.
Some embodiments also include obtaining a normalized amount of amplified target nucleic acids, comprising: amplifying the extended probes by hybridizing extension primers to the second amplification sites, wherein the substrate comprises a normalizing amount of the capture probes, and the amount of the extension primers is equal to or greater than the normalizing amount of capture probes,
In some embodiments, the extension primers are m solution.
In some embodiments, the amplification is performed under conditions such that the normalizing amount of capture probes limits the amount of amplification products.
Some embodiments also include sequencing the amplified target nucleic acids.
In some embodiments, the first amplification sites or second amplification sites comprise a P5 sequence, a complement of a P5 sequence, P7 sequence, or a complement of a P7 sequence.
In some embodiments, the first amplification sites and the extension primers are incapable of or are essentially incapable of hybridizing to one another.
In some embodiments, the first amplification sites and the extension primers are non-complementary to one another.
In some embodiments, the first amplification sites and the extension primers comprise non-complementary nucleotide sequences to one another.
In some embodiments, the first amplification sites comprise modified nucleotides that inhibit hybridization with the extension primers.
In some embodiments, the first amplification sites and the extension primers each lack the same type of nucleotide.
In some embodiments, the first amplification sites lack at least one type of nucleotide selected from adenine (A), cytosine (C), guanine (G), and thymine (T).
In some embodiments, the first amplification site lacks a combination of nucleotides selected from: adenine (A) and thymine (T); or guanine (G) and cytosine (C).
In some embodiments, the first amplification site consists of a combination of nucleotides selected from: adenine (A) and guanine (G); adenine (A) and cytosine (C); cytosine (C) and thymine (T); or guanine (G) and thymine (T).
In some embodiments, the first amplification site consists of a combination of adenine (A) and guanine (G) nucleotides.
Some embodiments include a method comprising: (a) obtaining a substrate comprising a plurality of capture probes and a plurality of extension primers attached thereto, wherein the capture probes comprise first amplification sites and first locus specific primers, and the extension primers are capable of hybridizing to second amplification sites, wherein the amount of the capture probes or the extension primers is a normalizing amount; (b) hybridizing a plurality of target nucleic acids to the first locus specific primers; (c) extending the capture probes to obtain extended probes, wherein the extended probes comprise second amplification sites; (e) amplifying the extended probes by hybridizing the extended probes to the extension primers, to obtain a normalized amount of amplified target nucleic acids.
In some embodiments, the normalizing amount limits the amount of amplified target nucleic acids.
In some embodiments, the capture probes comprise first indexes, or first sequencing primer sites.
In some embodiments, the extension oligonucleotides comprise sites complementary to second indexes, or complementary to second sequencing primer sites.
In some embodiments, the amplification is performed under conditions such that the normalizing amount of capture probes or the extension primers limits the amount of amplification products.
In some embodiments, the substrate comprises a plurality of beads.
In some embodiments, the substrate comprises a flow cell.
In some embodiments, the amplification comprises bridge amplification.
Some embodiments also include sequencing the amplified target nucleic acids.
In some embodiments, extending the capture probes comprises ligating the first locus specific primers to the second locus specific primers.
In some embodiments, extending the capture probes comprises: (i) hybridizing second locus specific primers to the target nucleic acids; and (ii) ligating the first locus specific primers to the second locus specific primers.
In some embodiments, the second locus specific primers comprise the second amplification sites.
In some embodiments, the second locus specific primers comprise second indexes or second sequencing primer sites.
Some embodiments also include: (iii) hybridizing extension oligonucleotides to the second locus specific primers and (iv) extending the second locus specific primers with sequences complementary to the extension oligonucleotides by polymerase extension.
In some embodiments, the extension oligonucleotides comprise sites complementary to the second amplification sites, complementary to second indexes, or complementary to second sequencing primer sites.
In some embodiments, the target nucleic acids comprise sites complementary to the second amplification sites, and the extending comprises polymerase extension of the first locus specific primers with sequences complementary to the target nucleic acids.
Some embodiments also include preparing the target nucleic acids by adding adaptors to an end of the target nucleic acids, wherein the adaptors comprise sites complementary to the second amplification sites.
In some embodiments, adding adaptors comprises a tagmentation reaction.
Some embodiments also include determining the presence or absence of a variant in the target nucleic acid.
Some embodiments include a method of preparing an indexed nucleic acid library, comprising: (a) adding Y-adaptors to a plurality of target nucleic acids, wherein the Y-adaptors are added to first and second ends of each target nucleic acid, and each Y-adaptor comprises a first strand comprising a first primer binding site and a mosaic element, and a second strand comprising a second primer binding site and a complement to the mosaic element; and (b) hybridizing a first extension oligonucleotide to the second primer binding site, wherein the first extension oligonucleotide comprises a first index and a third primer binding site; and (c) extending the second strand comprising the second primer binding site, thereby adding the first index to the target nucleic acids.
In some embodiments. a 5′ end of the first strand of the Y-adaptor is resistant to nuclease degradation.
In some embodiments, a 5′ end of the first strand of the Y-adaptor comprises a phosphorothioate bond between two consecutive nucleotides.
In some embodiments, a 5′ end of the second strand of the Y-adaptor is phosphorylated.
In some embodiments, (a) comprises contacting the plurality of target nucleic acids with a plurality of transposomes, wherein each transposome comprises a Y-adaptor and a transposase.
In some embodiments, the transposomes comprise dimers.
In some embodiments, the transposase comprises a Tn5 transposase.
In some embodiments, the transposomes are bound to a substrate.
In some embodiments, the substrate comprises a plurality of beads.
In some embodiments, the beads are magnetic.
In some embodiments, a 3′ end of the first extension oligonucleotide is blocked.
In some embodiments, a 5′ end of the first extension oligonucleotide is phosphorylated.
In some embodiments, the first extension oligonucleotide is bound to a bead.
Some embodiments also include cleaving the first extension oligonucleotide from the bead prior to (c).
In some embodiments, (c) comprises polymerase extension.
In some embodiments, (c) comprises extension with a ligase.
In some embodiments, extension with a ligase comprises: hybridizing a ligation oligonucleotide to the first extension oligonucleotide hybridized to the second primer binding site; and ligating the ligation oligonucleotide to the second strand comprising the second primer binding site.
In some embodiments, the ligation oligonucleotide comprises an additional index.
In some embodiments, the ligation oligonucleotide comprises an additional primer binding site.
Some embodiments also include removing the first extension oligonucleotide after step (c).
In some embodiments, the removing comprises an exonuclease treatment.
In some embodiments, the first extension oligonucleotide comprises uracil nucleotides, and the removing comprises degradation of the first extension oligonucleotide with a uracil-specific excision reagent (USER) enzyme.
Some embodiments also include amplifying the target nucleic acids comprising the first index.
In some embodiments, the amplification comprises hybridizing amplification primers to the first primer binding sites and to the third primer binding sites.
In some embodiments, the amplification comprises a PCR.
In some embodiments, the amplification comprises bridge amplification.
Some embodiments also include adding a second index to the target nucleic acids comprising the first index.
In some embodiments, adding the second index comprises: hybridizing a second extension oligonucleotide to the third primer binding site of the target nucleic acids comprising the first index, wherein the second extension oligonucleotide comprises the second index; and extending the second strand comprising the third primer binding site, thereby adding the second index to the target nucleic acids comprising the first index.
In some embodiments, a 3′ end of the second extension oligonucleotide is blocked.
In some embodiments, a 5′ end of the second extension oligonucleotide is phosphorylated.
In some embodiments, extending the second strand comprising the third primer binding site comprises polymerase extension.
In some embodiments, extending the second strand comprising the third primer binding site comprises extension with a ligase.
Some embodiments also include removing the second extension oligonucleotide after extending the third primer binding site.
Some embodiments also include adding a third index to the target nucleic acids comprising the second index.
Some embodiments also include adding an additional index to the target nucleic acids comprising the third index.
In some embodiments, the target nucleic acids comprise genomic DNA.
Some embodiments include a method of combinatorial indexing a plurality of target nucleic acids, comprising: (a) obtaining a pool of primary indexed nucleic acids, comprising: adding a first index to a plurality of subpopulations of target nucleic acids by extending the target nucleic acids with the first index, wherein a different first index is added to each subpopulation, and combining the subpopulations comprising the different first indexes to obtain the pool of primary indexed nucleic acids, wherein the first index is added to a subpopulation of target nucleic acids according to the method of any one of the foregoing methods; (b) splitting the pool into a plurality of subpopulations of primary indexed nucleic acids; (c) obtaining a pool of secondary indexed nucleic acids, comprising: adding a second index to the plurality of subpopulations of primary indexed nucleic acids by extending the primary indexed nucleic acids with the second index, wherein a different second index is added to each subpopulation, and combining the subpopulations comprising the different second indexes to obtain the pool of secondary indexed nucleic acids
Some embodiments also include repeating (b) and (c) and adding additional indexes to indexed subpopulations.
In some embodiments, adding a first index to a subpopulation of target nucleic acids is performed in a compartment selected from a well, a channel, or a droplet.
Some embodiments include a method of modifying a nucleic acid comprising: amplifying or extending the nucleic acid in the presence of a polymerase.
Some embodiments of any one of the foregoing methods include methods in Which the amplifying and/or the extending is performed under conditions suitable to remove inorganic pyrophosphate (PPi), to inhibit pyrohosphorolysis, and/or inhibit formation of a Mg2+-PPi complex. In some embodiments, the PPi is soluble. In some embodiments, the amplifying and/or the extending is performed in the presence of an inorganic pyrophosphatase.
In some embodiments, a rate of achieving a yield of a product of the amplifying and/or the extending is increased in the presence of the inorganic pyrophosphatase compared to a rate of achieving a yield of a product of the amplifying and/or the extending in the absence of the inorganic pyrophosphatase. In some embodiments, the rate of achieving a yield of a product of the amplifying and/or the extending is increased by at least 2-fold.
In some embodiments, the amplifying and/or the extending is performed under isothermal conditions.
In some embodiments, the amplifying and/or the extending comprises performing a reaction selected from a PCR, a bridge amplification, a whole genome amplification, a loop-mediated isothermal amplification (LAMP), an amplification from nucleic acids obtained from a single cell, a sequencing by synthesis (SBS) reaction, and an exclusion amplification (ExAMp), In some embodiments, the amplifying and/or the extending comprises performing an amplification from nucleic acids obtained from a single cell. In some embodiments, the amplifying and/or the extending comprises performing a sequencing by synthesis (SBS) reaction. In some embodiments, the amplifying and/or the extending comprises performing a bridge amplification. In some embodiments, a sequencing reaction comprises the amplifying and/or the extending.
Note that Xraw and Yraw intensities are greater for formulations that contain IPP at 30 min and 1 h.
Embodiments relate to the preparation of nucleic acid libraries. Some embodiments relate to the preparation of normalized nucleic acid libraries, such as libraries in which the amounts of (amplified) nucleic acids are substantially the same. Some embodiments relate to preparation of indexed nucleic acid libraries with certain adaptors.
Some embodiments relate to preparing normalized nucleic acid libraries having substantially similar amounts of target nucleic acids. In some such embodiments prepared normalized nucleic acid libraries having substantially similar amounts of amplified nucleic acids. In some embodiments, input nucleic acids are hybridized to capture probes attached to beads, the capture probes comprising first amplification sites lacking types of nucleotides complementary to one another. In some embodiments, the first amplification sites lack a single type nucleotide. In some embodiment the first amplification sites lack a combination of nucleotides selected from: adenine (A) and thymine (T); or guanine (G) and cytosine (C). In some embodiments, the first amplification sites consist of a combination of nucleotides selected from: adenine (A) and guanine (G); adenine (A) and cytosine (C); cytosine (C) and thymine (T); or guanine (G) and thymine (I). In some embodiments, the first amplification sites consist of a combination of adenine (A) and guanine (G) nucleotides. In some embodiments, the first amplification site can consist of one type of nucleotide.
The capture probe can be extended with sequences complementary to the hybridized input nucleic acid. In some embodiments, the capture probe is extended by polymerization. In some embodiments, the capture probe is extended by ligation of the capture probe with an oligonucleotide complementary to the hybridized input nucleic acid. in some embodiments, the capture probe is extended by polymerization and ligation. The extended capture probe can include a second amplification site and/or sample index.
The extended capture probe can be amplified on the bead by hybridizing extension primers in solution to the second amplification sites. in some embodiments, the amount of the extension primers is equal to or greater than a normalizing amount of capture probes. In some embodiments, the extension primers consists essentially of the same types of nucleotides as the first amplification site. In some such embodiments, the extension primers and the capture probes do not include complementary nucleotides, and interactions between the extension primers and capture probes are reduced, such as the formation of primer-dimers. The hybridized extension primers can be extended, and the products hybridized to non-extended capture probes on the beads. The extended capture probes are amplified such that substantially all the capture probes are extended, thereby obtaining a normalizing amount of amplified target nucleic acids.
Some embodiments relate to preparing a library of nucleic acids in which an adaptor is added by extending a target nucleic acid. in some embodiments, a target nucleic acid is hybridized to a capture probe comprising a first locus specific primer. The capture probe can be extended by hybridizing the hybridized target nucleic acid to a second locus specific primer. In some embodiments, the first and second locus specific primers can be ligated, thereby extending the capture probe. In some embodiments, the second locus specific primer can be extended by hybridizing an extension primer to the second locus specific primer, and extending the second locus specific primer with sequences complementary to the extension primer. In some embodiments, a sample index is introduced through an extension step.
As used herein, “hybridization”, can refer to the ability of nucleic acid molecules to join via complementary base strand pairing. Such hybridization may occur when nucleic acid molecules are contacted under appropriate conditions and/or circumstances. As used herein, two nucleic acid molecules are said to be capable of specifically hybridizing to one another if the two molecules are capable of forming an anti-parallel, double-stranded nucleic acid structure. A nucleic acid molecule is said to be the “complement” of another nucleic acid molecule if they exhibit complete complementarity. As used herein, nucleic acid molecules are said to exhibit “complete complementarity” when every nucleotide of one of the molecules is complementary to its base pairing partner nucleotide of the other. Two molecules are said to be “minimally complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional “low-stringency” conditions. In some instances, the molecules are said to be “complementary” if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional “high-stringency” conditions. Nucleic acid molecules that hybridize to other nucleic acid molecules, e.g., at least under low stringency conditions are said to be “hybridizable cognates” of the other nucleic acid molecules. Conventional stringency conditions are described by Sambrook et al., Molecular Cloning, A Laboratory Handbook, Cold Spring Harbor Laboratory Press, 1989), and by Haymes et al. In: Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985), each of which is herein incorporated by reference in its entirety, and for the disclosure discussed herein. Departures from complete complementarity are therefore permissible in some embodiments, as long as such departures do not completely preclude the capacity of the molecules to form a. double-stranded structure. Thus, in order for a nucleic acid molecule or fragment thereof of the present disclosure to serve as a primer or probe in some embodiments it needs only be sufficiently complementary in sequence to be able to form a stable double-stranded structure under the particular solvent and salt concentrations employed. In some embodiments, nucleic acids disclosed herein are fully complementary to their targets.
As used herein, “nucleic acid molecule” and “polynucleotide” are used interchangeably herein, and refer to both RNA and DNA molecules, including nucleic acid molecules comprising cDNA, genomic DNA, synthetic DNA, and DNA or RNA molecules containing nucleic acid analogs. Nucleic acid molecules can have any three-dimensional structure. A nucleic acid molecule can be double-stranded or single-stranded (e.g., a sense strand or an antisense strand). Non-limiting examples of nucleic acid molecules include genes, gene fragments, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, siRNA, micro-RNA, tracrRNAs, crRNAs, guide RNAs, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, nucleic acid probes and nucleic acid primers. A nucleic acid molecule may contain unconventional or modified nucleotides. The terms “polynucleotide sequence” and “nucleic acid sequence” as used herein interchangeably refer to the sequence of a polynucleotide molecule. The nomenclature for nucleotide bases as set forth in 37 CFR § 1.822 is used herein.
As used herein, the term a “nucleic acid sample” refers to a collection of nucleic acid molecules. In some embodiments, the nucleic acid sample is from a single biological source, e.g. one individual or one tissue sample, and in other embodiments the nucleic acid sample is a pooled sample, e.g., containing nucleic acids from more than one organism, individual or tissue.'
As used herein, “nucleic acid sample” encompasses “nucleic acid library” which, as used herein, includes a nucleic acid library that has been prepared by any method known in the art. In some embodiments, providing the nucleic acid library includes the steps required for preparing the library, for example, including the process of incorporating one or more nucleic acid samples into a vector-based collection, such as by ligation into a vector and transformation of a host. In some embodiments, providing a nucleic acid library includes the process of incorporating a nucleic acid sample into a non-vector-based collection, such as by ligation to adaptors. In some embodiments, the adaptors can anneal to PCR primers to facilitate amplification by PCR or can be universal primer regions such as, for example, sequencing tail adaptors. In some embodiments, the adaptors can be universal sequencing adaptors.
As used herein, “substantially” has its ordinary meaning as read in light of the specification, and can mean, for example, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%
Some embodiments relate to preparing nucleic acid libraries having normalized amounts of amplified products. For example, the amounts of certain amplified products obtained from different target nucleic acids can be substantially the same between the amplified products. In some embodiments, the nucleic acid amounts in normalized nucleic acid libraries vary by less than about 30%, 20%, 10%, 5%, 3%, 2%, or 1%. In some embodiments, the nucleic acid amounts in normalized nucleic acid libraries vary by less than about 2-fold, 3-fold, 4-fold, or 5-fold. In some embodiments, pooled nucleic acid libraries are generated in which the amount of constituent nucleic acids in the resultant pooled nucleic acid libraries are at substantially similar amounts regardless of the amount of input nucleic acids. In some embodiments, the amounts of constituent nucleic acids in a pooled nucleic acid libraries vary by less than about 30%, 20%, 10%, 5%, 3%, 2%, or 1%. In some embodiments, the amounts of constituent nucleic acids in a pooled nucleic acid libraries vary by less than about 2-fold, 3-fold, 4-fold, or 5-fold.
Some embodiments of preparing a normalized nucleic acid library can include providing amplification conditions in which an amplification primer is provided in a limited amount, and other amplification reagents are provided in non-limiting amounts. In some such embodiments, the amount of the amplification products are limited according to the amount of the limited amplification primer. In some such embodiments, more than one amplification reaction can be performed, and each amplification reaction can result in an amount of amplification product which is substantially the same as the other amplification reaction(s).
In some embodiments, a substrate can be provided which includes a plurality of capture probes and a plurality of extension primers attached to the substrate. In some embodiments, a substrate comprises a solid support in some embodiments, a substrate is a planar substrate. In some embodiments, a planar substrate includes discrete sites, such as wells. In some embodiments, a substrate can include a plurality of beads. In some embodiments, the capture probes can include first amplification sites. An amplification site can include a binding site for an amplification primer. In some embodiments, the extension primers can include sites capable of hybridizing to second amplification sites. In some embodiments, the capture probes, or the extension primers can be provided in a normalizing amount such that amplification of a target nucleic acid from the first and second amplification primers results in a normalized amount of amplification product. For example, the amount of the capture probes can be provided at an amount less than or equal to the extension primers, such that the amount of amplification products is limited by the amount of the capture probes; or the amount of the extension primers can be provided at an amount less than or equal to the capture probes, such that the amount of amplification products is limited by the amount of the extension primers.
In some embodiments, a nucleic acid sample can be hybridized to the capture probes, the capture probes can be extended to generate extended probes. In some embodiments, the extended probes can include a second amplification site capable of hybridizing to the extension primers. In some embodiments, the extended probes can be amplified by hybridizing the extended probes to the extension probes, the extension probes can be extended, and subsequent products hybridized to the capture probes and extension primers. In some embodiments, the amplification can include bridge amplification.
Some embodiments of normalizing a nucleic acid library comprising target nucleic acids can include obtaining a substrate having a normalizing amount of capture probes attached to the substrate, and a plurality of extension primers. In some embodiments, the substrate comprises a plurality of beads, In some embodiments, the capture probes comprise first amplification sites, and are incapable of or are essentially incapable of hybridizing to the extension primers. In some embodiments, the first amplification sites and the extension primers are non-complementary to one another. In some embodiments, the first amplification sites and the extension primers comprise non-complementary nucleotide sequences to one another. In some embodiments, the first amplification sites comprise modified nucleotides that inhibit hybridization with the extension primers. In some embodiments, the first amplification sites and the extension primers each lack the same type of nucleotide. In some embodiments, the capture probes comprise first amplification sites lacking types of nucleotides complementary to one another. In some embodiments, the first amplification sites lack a combination of nucleotides selected from: adenine (A) and thymine (T); or guanine (G) and cytosine (C). In some embodiments, the first amplification sites consist of a combination of nucleotides selected from: adenine (A) and guanine (G); adenine (A) and cytosine (C); cytosine (C) and thymine (T), or guanine (G) and thymine (T). In some embodiments, the first amplification sites consist of a combination of adenine (A) and guanine (G) nucleotides. In some embodiments, the primers can contain pseudo-complementary bases or bases and modifications that prevent dimer formation.
In some embodiments, the extension primers comprise sites capable of hybridizing to second amplification sites. In some embodiments, the extension primers consist essentially of the same types of nucleotides as the first amplification site. In some embodiments, the extension primers consist of the same types of nucleotides as the first amplification site. In some embodiments, the extension primers are in solution.
Some embodiments include hybridizing a plurality of target nucleic acids to the capture probes, and extending the capture probes to obtain extended probes. In some embodiments, the extended probes can be amplified by hybridizing the extended probes to the extension probes, the extended probes can be extended, and subsequent products hybridized to the capture probes and extension primers. In some embodiments, the amplification can include bridge amplification. In some embodiments, the amount of the extension primers is equal to or greater than the normalizing amount of capture probes, and the amplification is performed such that substantially all the capture probes are extended, to obtain a normalizing amount of amplified target nucleic acids.
In some embodiments, the DNA is amplified using whole genome amplification, hybridized to capture probes, formation of amplifiable targeted libraries, limited amplification with normalization, and sequencing of the normalized targeted sequencing library.
In some embodiments, a step is included for the analysis of the targeted sequencing library. For example, sequencing can include the capture probe and genomic DNA for the analysis of single nucleotide polymorphisms and variants. Sequencing of the capture probe can identify the targeted region, a variant, and sample index or barcode. In some embodiments, analysis does not include the alignment of the sequencing reads to a reference genome.
In some embodiments, the capture probes can include a first index, and/or the extension primer can include a sequence complementary to a second index. An index can include a sequence that can identify the source of a nucleic acid. For example, an index can identify the source of a target nucleic acid to a certain sample. In some embodiments, an index can identify the source of an amplification product of a target nucleic acid to a certain sample of nucleic acids. In some embodiments, an index can comprise less than 50, 40, 30, 20, 10, or 5 consecutive nucleotides, or a number of nucleotides between any two of the foregoing numbers.
In some embodiments, the capture probes can include a first sequencing primer site, and/or the extension primer can include a sequence complementary to a second sequencing primer site. A sequencing primer site can include a binding site for a sequencing primer. Examples of a sequencing primer site include a nucleic acids having the sequence P5, P7, or complements therefore.
In some embodiments, the capture probes can include a first locus specific primer. The first locus specific primer can include a nucleic acid sequence capable of binding to a first locus binding site in a target nucleic acid. In some embodiments, the capture probes can be extended by hybridizing a target nucleic acid to the first locus specific primer of the capture probe, and hybridizing a second locus specific primer to the target nucleic acid. In some embodiments, the first locus specific primer and second locus specific primer bind at locations on the target nucleic acid adjacent to one another. In some embodiments, the locations that the first locus specific primer and second locus specific primer bind on the target nucleic acid are less than 200, 100, 50, 40, 30, 20, 10 5, 1 consecutive nucleotides apart on the target nucleic acid, or any number of nucleotides apart on the target nucleic acid between any two of the foregoing numbers. In some embodiments, extending the capture probe can include ligating the first locus specific primer to the second locus specific primer. In some embodiments, extending the capture probe can include extending the first locus specific primer with a polymerase, and ligating the first locus specific primer to the second locus specific primer. In sonic embodiments, extending the capture probe can include extending the first locus specific primer with a polymerase.
In some embodiments, the second locus specific primers can include a second amplification site. In some embodiments, the second locus specific primers can include a second index. In some embodiments, the second locus specific primers can include a second sequencing primer site.
In some embodiments, an adaptor can be added to the second locus specific primer. The adaptor can include a second amplification site, a second index, and/or a second sequencing primer site. In some embodiments, the adaptor can be added by ligation or extension and ligation to the second locus specific primer. In some embodiments, an extension oligonucleotide can be hybridized to the second locus specific primer. The extension oligonucleotide can include sequences complementary to a second amplification site, a second index, and/or a second sequencing primer site. The second locus specific primer can be extended with sequences complementary to the extension oligonucleotide, and thereby include a second amplification site, a second index, and/or a second sequencing primer site. In some embodiments, a target nucleic acid is hybridized to a first locus specific primer and a second locus specific primer, and an extension oligonucleotide is hybridized to the second locus specific primer. In some embodiments, the target nucleic is hybridized to the first locus specific primer and the second locus specific primer, at the same time as the extension oligonucleotide is hybridized to the second locus specific primer. in some embodiments, the target nucleic is hybridized to the first locus specific primer and the second locus specific primer, prior to the extension oligonucleotide being hybridized to the second locus specific primer, In some embodiments, the target nucleic is hybridized to the first locus specific primer and the second locus specific primer, after the extension oligonucleotide being hybridized to the second locus specific primer. In some embodiments, the first locus specific primer is extended and joined to the second locus specific primer use of a polymerase and/or a ligase, the second locus specific primer is extended with a polymerase.
An example embodiment is illustrated in
An example embodiment is depicted in
Another example embodiment is depicted in
Yet another example embodiment is depicted in
Another example embodiments is depicted in
One other example embodiment is depicted in
An example embodiment is also depicted in
Differing amounts of input nucleic acids can result in substantially similar amounts of amplified products. As depicted in
In some embodiments, the target nucleic acid includes sequences complementary to the second amplification sites. In some embodiments, the target nucleic acid include sequences complementary to a second index and/or a second sequencing primer. In some such embodiments, a capture probe containing a first locus specific primer can be extended by hybridizing the target nucleic acid to the first locus specific primer, and extending the first locus specific primer with a polymerase to incorporate sequences complementary to the target nucleic acid, such as a second amplification site, a second index, and/or a second sequencing primer. In some embodiments, a target nucleic acid can be prepared by added adaptors that include sequences complementary to a second amplification sites, a second index and/or a second sequencing primer. In some embodiments, the target nucleic acid can be prepared by a tagmentation reaction. For example, an input nucleic acid comprising a target nucleic acid can be contacted with a plurality of transposomes. The transposomes can fragment the input nucleic acid, and attach adaptors to the ends of the nucleic acid fragments. Examples of tagmentation reactions are disclosed in U.S. Pat. No. 9,040,256, which is incorporated by reference in its entirety.
In some embodiments, prepared nucleic acid libraries or pooled libraries can be suitable for downstream analytical applications, including sequencing applications utilizing techniques such as next-generation sequencing (NGS) and related methodologies such as genotyping-by-sequencing (GBS). For example, the first locus specific primer and second locus specific primer can bind to the target nucleic acid at locations which have single nucleotide polymorphism SNP, insertion, deletion, or duplication between the two binding sites. The amplified target nucleic acids can be sequenced and the SNP, insertion, deletion, or duplication can be identified.
In some embodiments, amplification products can be separated from a solid support, such as a bead. Generally, any suitable method for removing nucleic acids from the solid support can be used, such as use of enzymes, and/or changes in temperature and/or pH. In some embodiments, the amplification products are separated from the solid support by elution. In some embodiments, the amplification products are eluted in a heated buffer. In some embodiments where streptavidin is included in solid phase support and the nucleic acids are biotinylated to facilitate binding of the nucleic acids to the solid support, the nucleic acid amplification products can be separated from the solid support via by heated avidin-biotin cleavage.
in some embodiments, nucleic acid amplification is performed under substantially isothermal conditions. The optimal temperature for amplification varies and may for example depend on primer characteristics, such as sequence length, melting temperature, and choice of polymerase. In some embodiments, the amplification temperature is lower than 60 degrees Celsius, such as lower than 50 degrees Celsius, lower than 45 degrees Celsius, such as, 42, 38, 35, 30, 25, or 20 degrees Celsius. In some preferred embodiments, the amplification temperature is about 38 degrees Celsius. In some embodiments, isothermal amplification can be performed by using kinetic exclusion amplification (KEA), also referred to as exclusion amplification (Ex-Amp).
In some embodiments, an amplified/normalized nucleic acid library can be constructed by reacting an amplification reagent to produce a plurality of amplification sites that each includes a substantially clonal population of amplicons from an individual target nucleic acid that has seeded the site. In some embodiments, the amplification reaction proceeds until a sufficient number of amplicons are generated to fill the capacity of the respective amplification site. Filling an already seeded site to capacity in this way inhibits target nucleic acids from landing and amplifying at the site thereby producing a clonal population of amplicons at the site. In some embodiments, apparent clonality can be achieved even if an amplification site is not filled to capacity prior to a second target nucleic acid arriving at the site. Under some conditions, amplification of a first target nucleic acid can proceed to a point that a sufficient number of copies are made to effectively outcompete or overwhelm production of copies from a second target nucleic acid that is transported to the site. For example in an embodiment that uses a bridge amplification process on a circular feature (e.g., bead) that is smaller than 500 nm in diameter, it has been determined that after 14 cycles of exponential amplification for a first target nucleic acid, contamination from a second target nucleic acid at the same site wilt produce an insufficient number of contaminating amplicons to adversely impact sequencing-by-synthesis analysis on an IIlumina sequencing platform.
In some embodiments, kinetic exclusion can occur when a process occurs at a sufficiently rapid rate to effectively exclude another event or process from occurring. For example, a solution of beads having universal primers where the beads are randomly seeded with target nucleic acids in a solution and copies of the target nucleic acids are generated in an amplification process to fill each of the beads to capacity. In accordance with the kinetic exclusion, the seeding and amplification processes can proceed simultaneously under conditions where the amplification rate exceeds the seeding rate. As such, the relatively rapid rate at which copies are made on a particular bead that has been seeded by a first target nucleic acid will effectively exclude a second nucleic acid from seeding that particular bead for amplification. Kinetic exclusion amplification methods can be performed as described in detail in the disclosure of U.S. Application Pub. No. 2013/0338042, which is incorporated herein by reference in its entirety.
In some embodiments, kinetic exclusion can exploit a relatively slow rate for initiating amplification, such as a slow rate of making a first copy of a target nucleic acid compared to a relatively rapid rate for making subsequent copies of the target nucleic acid, or of the first copy of the target nucleic acid. In some embodiments, kinetic exclusion occurs due to the relatively slow rate of target nucleic acid seeding, such as relatively slow diffusion or transport compared to the relatively rapid rate at which amplification occurs to fill the site, such as bead or other site on a solid substrate with copies of the nucleic acid seed. In another embodiment, kinetic exclusion can occur due to a delay in the formation of a first copy of a target nucleic acid that has seeded a site, such as delayed or slow activation, compared to the relatively rapid rate at which subsequent copies are made to fill the site. In some embodiments, an individual site may have been seeded with several different target nucleic acids, such as several target nucleic acids can be present at each site prior to amplification. However, first copy formation for any given target nucleic acid can be activated randomly such that the average rate of first copy formation is relatively slow compared to the rate at which subsequent copies are generated. In this case, although an individual site may have been seeded with several different target nucleic acids, kinetic exclusion will allow only one of those target nucleic acids to be amplified. More specifically, once a first target nucleic acid has been activated for amplification, the site will rapidly fill to capacity with its copies, thereby preventing copies of a second target nucleic acid from being made at the site.
An amplification reagent can include further components that facilitate amplicon formation and in some cases increase the rate of amplicon formation. An example is a recombinase. Recombinase can facilitate amplicon formation by allowing repeated invasion/extension. More specifically, recombinase can facilitate invasion of a target nucleic acid by the polymerase and extension of a primer by the polymerase using the target nucleic acid as a template for amplicon formation. This process can be repeated as a chain reaction where amplicons produced from each round of invasion/extension serve as templates in a subsequent round. The process can occur more rapidly than standard PCR since a denaturation cycle, such as via heating or chemical denaturation, is not required. As such, recombinase-facilitated amplification can be carried out isothermally. It is generally desirable to include ATP, or other nucleotides, or in some cases non-hydrolyzable analogs thereof, in a recombinase-facilitated amplification reagent to facilitate amplification. A mixture of recombinase and single stranded binding (SSB) protein is particularly useful as SSB can further facilitate amplification. Example formulations for recombinase-facilitated amplification include those sold commercially as TwistAmp kits by TwistDx (Cambridge, UK). Useful components of recombinase-facilitated amplification reagent and reaction conditions are set forth in U.S. Pat. Nos. 5,223,414 and 7,399,590, each of which is incorporated herein by reference in its entirety.
Another example of a component that can be included in an amplification reagent to facilitate amplicon formation and in some cases to increase the rate of amplicon formation is a helicase. Helicase can facilitate amplicon formation by allowing a chain reaction of amplicon formation. The process can occur more rapidly than standard PCR since a denaturation cycle, such as via heating or chemical denaturation, is not required. As such, helicase-facilitated amplification can be carried out isothermally. A mixture of helicase and single stranded binding (SSB) protein is particularly useful as SSB can further facilitate amplification. Exemplary formulations for helicase-facilitated amplification include those sold commercially as IsoAmp kits from Biohelix (Beverly, Ma.). Further, examples of useful formulations that include a helicase protein are described in U.S. Pat. Nos. 7,399,590 and 7,829,284, each of which is incorporated herein by reference in its entirety.
In some embodiments, amplification reagents can include one or more origin binding proteins. Without being bound by any particular theory, the inclusion of an origin binding protein in the amplification reaction facilitates amplicon formation and, in some case increases the rate of amplicon formation.
In some embodiments, the amplification reagents can further include a polymerase. In some embodiments, the polymerase can be a strand-displacing polymerase such as a Bst polymerase, a polD polymerase, a 9°N polymerase or phi29 polymerase. In some embodiments, the polymerase is a thermostable polymerase. In some embodiments, the template is RNA and the polymerase can include a reverse transcriptase.
The maintenance of sample representation or specificity is useful to many library preparation methods for several downstream analytical applications, such as next generation sequencing (NGS) and targeted sequencing. For example, for many cDNA library applications, such as searching for differentially expressed genes, it is useful to minimize the distortion of cDNA representation in a library with respect to initial mRNA population. Stated differently, the content of individual cDNAs in the library, in some downstream applications, can be proportional to the copy number of the initial RNAs. In contrast, for some other applications, the concentrations of different individual cDNAs in a library can be equalized. Embodiments described herein, can maintain the complexity of an input DNA library, including the DNA species-to-species ratio, and possible minor allele frequency calls, with little or no observed difference other than the amount of the library being amplified and normalized.
Some embodiments include methods and systems for genotyping by sequencing a target nucleic acid to identify a SNP, insertion, deletion, or duplication in the target nucleic acid. In some such embodiments, a nucleic acid library comprising a target nucleic acid can be prepared by obtaining a substrate having a plurality of capture probes attached thereto. In some embodiments, a substrate comprises a solid support. In some embodiments, a substrate is a planar substrate. In some embodiments, a planar substrate includes discrete sites, such as wells. In some embodiments, a substrate can include a plurality of beads. In some embodiments, the capture probes comprise first amplification sites and first locus specific primers. Some embodiments include hybridizing a plurality of target nucleic acids to the first locus specific primers. In some embodiments, the first locus specific primers can be extended to obtain extended probes containing second amplification sites. In some embodiments, the extended probes can be amplified and sequenced, thereby identifying a SNP, insertion, deletion, or duplication in the target nucleic acid.
In some embodiments, extending the first locus specific primers can include hybridizing second locus specific primers to the target nucleic acids, and ligating the first locus specific primers to the second locus specific primers. In some embodiments, an extension oligonucleotide can be hybridized to the second locus specific primer, and the second locus specific primer can be extended with a polymerase to include sequences complementary to the extension oligonucleotide. In some embodiments, the sequences complementary to the extension oligonucleotide can include a second amplification site, a second sequencing primer, and/or a second index.
In some embodiments, the extended probes can be amplified to obtain a normalized amount of amplification products. In some embodiments, the amplification is limited by either the amount of the capture probes or the amount of the extension primers. In some embodiments, the substrate comprises a normalizing amount of the capture probes, and the amount of the extension primers is equal to or greater than the normalizing amount of capture probes. In some embodiments, the substrate comprises a normalizing amount of the extension primers, and the amount of the capture probes is equal to or greater than the normalizing amount of capture probes. In some embodiments, the extension primers are attached to the substrate. In some embodiments, the extension primers are in solution.
In some embodiments, the first amplification sites and the extension primers are incapable of or are essentially incapable of hybridizing to one another. In some embodiments, the first amplification sites and the extension primers are non-complementary to one another. In some embodiments, the first amplification sites and the extension primers comprise non-complementary nucleotide sequences to one another. In some embodiments, the first amplification sites comprise modified nucleotides that inhibit hybridization with the extension primers. In some embodiments, the first amplification sites and the extension primers each tack the same type of nucleotide. In some embodiments, the first amplification sites lack types of nucleotides complementary to one another; and the extension primers consist essentially of the same types of nucleotides as the first amplification site. In some embodiments, the first amplification site lacks a combination of nucleotides selected from: adenine (A) and thymine (T); or guanine (G) and cytosine (C). In some embodiments, the first amplification site consists of a combination of nucleotides selected from: adenine (A) and guanine (G); adenine (A) and cytosine (C); cytosine (C) and thymine (T); or guanine (G) and thymine (T). In some embodiments, the first amplification site consists of a combination of adenine (A) and guanine (G) nucleotides.
Some embodiments include one or more substrates, such as solid supports. Solid supports suitable for the methods disclosed herein can generally be of any convenient size and fabricated from any number of known materials. Preferably, the solid support used in the embodiments disclosed herein can be of any suitable type that provides a known binding capacity, resulting in a substantially unfluctuating amount of bound nucleic acids per fixed amount of solid support. Example of such materials include: inorganics, natural polymers, and synthetic polymers. Specific examples of these materials include: cellulose, cellulose derivatives, acrylic resins, glass, silica gels, gelatin, polystyrene, polyvinyl pyrrolidone, co-polymers of vinyl and acrylamide, polystyrene cross-linked with divinylbenzene or the like, polyacrylamides, latex gels, silicon, plastics, nitrocellulose, polystyrene, dextran, rubber, natural sponges, silica gels, control pore glass, metals, cross-linked dextrans (e.g., Sephadex™) agarose gel (Sepharose™), and other solid supports known to those of skill in the art.
In some embodiments, the solid phase supports can include synthetic polymer supports, such as polystyrene, polypropylene, substituted polystyrene (e.g., carboxylated or aminated polystyrene), polyamides, polyacrylamides, polyvinylchloride, and the like, or any material useful in nucleic acid affinity chromatography. In some embodiments, the solid phase can be a flat surface, curved surface, a well, or part of a microfluidic device.
In some embodiments disclosed herein, the solid support can include beads. Beads may be any of a wide variety of shapes, such as spherical, generally spherical, egg shaped, disc shaped, cubical, amorphous and other three dimensional shapes. Beads may be manufactured using a wide variety of materials, including for example, resins, and polymers. The beads may be any suitable size, including for example, microbeads, microparticles, nanobeads and nanoparticles. In some cases, beads are magnetically responsive; in other cases beads are not significantly magnetically responsive. For magnetically responsive beads, the magnetically responsive material may constitute substantially all of a bead, a portion of a bead, or only one component of a bead. The remainder of the bead may include, among other things, polymeric material, coatings, and moieties which permit attachment of an assay reagent. Examples of suitable beads include flow cytometry microbeads, polystyrene microparticles and nanoparticles, functionalized polystyrene microparticles and nanoparticles, coated polystyrene microparticles and nanoparticles, silica microbeads, fluorescent microspheres and nanospheres, functionalized fluorescent microspheres and nanospheres, coated fluorescent microspheres and nanospheres, color dyed microparticles and nanoparticles, magnetic microparticles and nanoparticles, superparamagnetic microparticles and nanoparticles (e.g., DYNABEADS® particles, available from Invitrogen Group, Carlsbad, Calif.), fluorescent microparticles and nanoparticles, coated magnetic microparticles and nanoparticles, ferromagnetic microparticles and nanoparticles, coated ferromagnetic microparticles and nanoparticles, and those disclosed in U.S. Patent Pub. No. 20050260686; U.S. Patent Pub. No. 20030132538; U.S. Patent Pub. No. 20050118574; U.S. Patent Pub. No. 20050277197; U.S. Patent Pub. No. 20060159962, the entire disclosures of which are incorporated herein by reference in their entireties. Beads may be pre-coupled with a biomolecule or other substance that is able to bind to and form a complex with a biomolecule. Beads may be pre-coupled with an antibody, protein or antigen, DNA/RNA probe or any other molecule with an affinity for a desired target. Bead characteristics may be employed in embodiments of multiplexing aspects. Examples of beads having characteristics suitable for multiplexing, as well as methods of detecting and analyzing signals emitted from such beads, may be found in U.S. Patent Pub. No. 20080305481; U.S. Patent Pub. No. 20080151240; U.S. Patent Pub. No. 20070207513; U.S. Patent Pub. No. 20070064990; U.S. Patent Pub. No. 20060159962; U.S. Patent Pub. No. 20050277197; U.S. Patent Publication No. 20050118574, the disclosures of which are incorporated herein by reference in their entireties.
In some embodiments, beads can be of any convenient size and fabricated from any number of known materials. In some embodiments, the bead can be, for example, magnetic beads, paramagnetic beads, plastic beads, polystyrene beads, glass beads, agarose beads, and combinations thereof. In some embodiments, the beads are beads approximately 2 to 100 μm in diameter, or 10 to 80 μm in diameter, most preferably 20 to 40 μm in diameter. In some embodiments, the beads can be provided in solution. In some embodiments, or the beads can be immobilized on a solid support.
In some embodiments, the solid support can include streptavidin. In some embodiments, the solid support is, or can include, streptavidin-coated beads. In some embodiments, the solid support is or can include streptavidin-coated magnetic beads. In some embodiments where the solid phase support includes streptavidin, the nucleic acids molecules can be biotinylated to facilitate binding of the nucleic acids to the solid support.
In some embodiments, an input nucleic acid sample includes single-stranded nucleic acids. In some embodiments, at least one target nucleic acid in an input nucleic acid sample can be double-stranded, or can be rendered at least partly double-stranded using appropriate procedures. In some embodiments, the input nucleic acid sample includes a mixture of single-stranded nucleic acid molecules and double-stranded nucleic acid molecules. In some embodiments, the target nucleic acid molecules can be linear. In some embodiments, the target nucleic acid molecules can be circular, or include a combination of linear and circular regions.
In some embodiments, the amount of an input nucleic acid can be from about 0.01 ng to 100 ng. In some embodiments, the amount of input nucleic acid is about 0.1 ng, 0.2 ng, 0.3 ng, 0.4 ng, 0.5 ng, 0.6 ng, 0.7 ng, 0.8 ng, 0.9 ng, 1 ng, 1.1 ng, 1.2 ng, 1.3 ng, 1.5 ng, 2.0 ng, 2.5 ng, 3.0 ng, 3.5 ng, 4.0 ng, 4.5 ng, 5.0 ng, or within a range defined by any two of the aforementioned values. In some embodiments, the amount of input nucleic acid is about 5.5 ng, 6.0 ng, 6.5 ng, 7.0 ng,7.5 ng, 8.0 ng, 8.5 ng, 9.0 ng, 9.5 ng, 10.0 ng, 11.0 ng, 11.5 ng, 12.0 ng, 12.5 ng, 13.0 ng, 13.5 ng, 14.0 ng, 15.0 ng, 15.5 ng, 16.0 ng, 16.5 ng, 17.0 ng, 18.0 ng, 18.5 ng, 19.0 ng, 19.5 ng, 20.0 ng, or within a range defined by any two of the aforementioned values. In some embodiments, the amount of input nucleic acid is about 21.0 ng, 22.0 ng, 22.5 ng, 23.0 ng, 23.5 ng, 24.0 ng, 24.5 ng, 25.0 ng, 26.0 ng, 27.0 ng, 28.0 ng, 29.0 ng, 30.0 ng, 32.5 ng, 35.0 ng, 37.5 ng, 40.0 ng, 42.5 ng, 45.0 ng, 47.5 ng, 50.0 ng, 52.5 ng, 55.0 ng, 60.0 ng, 65.0 ng, 70.0 ng, 75.0 ng, 80.0 ng, 85.0 ng, 90.0 ng, or within a range defined by any two of the aforementioned values. In some embodiments, the amount of input nucleic acid is about 0.08, 0.4, 2.0, 10.0, or 50.0 ng.
In some embodiments, an amplification step is performed on a single nucleic acid libraries. In some embodiments, an amplification step is performed on a plurality of nucleic acid libraries. In some embodiments, the amplification products from the plurality of nucleic acid libraries are combined to form a combined pooled nucleic acid library. In some embodiments, the amplification products derived from the plurality of nucleic acid libraries are combined after being removed from the solid phase support. In some embodiments, the amplification products from the plurality of nucleic acid libraries are combined before being removed from the solid phase support. In some embodiments, the plurality of input nucleic acid samples is combined before the amplification step. In some embodiments, the amount of each input nucleic acid sample is not normalized across the plurality of nucleic acid samples.
In some embodiments, the library is generated from DNA. In some embodiments the library is generated from RNA. In some embodiments, the library is generated from proteins. In some embodiments, the library is generated from a combination of DNA, RNA, and protein. For example, antibody-oligo conjugates can be used to generate a library on a solid-support and subsequently normalized through limited amplification on a solid-support.
In some embodiments, the plurality of input nucleic acid samples comprises at least 2, 4, 8, 12, 24, 48, 96, 200, 384, 400, 500, 100, 1500, or a number of input nucleic acid samples within a range defined by any two of the aforementioned numbers.
In some embodiments, the relative representation of each population of constituent amplification products can be advantageously adjusted in the pooled nucleic acid library. By “advantageously adjusted”, as used herein, is meant that the amount of each constituent amplification products in the pooled nucleic acid library can be controlled or predetermined. In some embodiments, the amount of each constituent amplification products is substantially uniformly represented in the pooled nucleic acid library. Alternatively, the plurality of constituent amplification products can be present in the pooled nucleic acid library in different, predetermined concentrations. This may be achieved by assembling different amounts of the solid phase supports with amplification products remaining affixed thereon or by pooling different amounts of the plurality of constituent amplification products after being recovered from the solid phase support. Stated differently, the advantageous adjustment can include selectively adjusting both the proportional representation and the population number of constituent amplification products in the pooled nucleic acid library. In some embodiments, an advantageous adjustment may include subjecting a sample of constituent amplification products to at least one processing step in addition to recovering amplification products from the solid phase support.
Some embodiments provided herein include indexing of nucleic acid libraries by extension of indexing oligonucleotides hybridized to common sequences on nucleic acid molecules. Some protocols for DNA library indexing can include multiple enzymatic and purification steps, or can be performed in a final PCR amplification step. Such indexing protocols may also rely on a single round of indexing, thus not supporting combinatorial indexing which can include individually tagging large number of molecules in a single sample with individual combination of indices. Advantageously, embodiments provided herein can include both single and combinatorial indexing, such as multiple levels of indexing. Some embodiments include indexing nucleic acids in solution, or indexing nucleic acids attached to a substrate, such as beads. Some embodiments include indexing nucleic acids, such as DNA, such as crude chromatin bound to beads. Some embodiments include in situ indexing of nucleic acids, such as DNA present inside of cell/nuclei.
Some embodiments for preparing an indexed nucleic acid library include adding Y-adaptors to a plurality of target nucleic acids. In some embodiments, a Y-adaptor includes a double-stranded nucleic acid comprising a portion or end in which both strands are hybridized to one another, and a portion or end in which the strands do not hybridize to one another. In some embodiments, the Y-adaptors are added to first and second ends of each target nucleic acid, and each Y-adaptor comprises a first strand comprising a first primer binding site and a mosaic element, and a second strand comprising a second primer binding site and a complement to the mosaic element. Some embodiments also include hybridizing a first extension oligonucleotide to the second primer binding site, wherein the first extension oligonucleotide comprises a first index and a third primer binding site. Some embodiments also include extending the second strand comprising the second primer binding site, thereby adding the first index to the target nucleic acids.
In some embodiments, a 5′ end of the first strand of the Y-adaptor is resistant to nuclease degradation. For example, the 5′ end of the first strand of the Y-adaptor can include a phosphorothioate bond between at least two consecutive nucleotides. In some embodiments, a 5′ end of the second strand of the Y-adaptor is phosphorylated.
In some embodiments, adding Y-adaptors to a plurality of target nucleic acids includes contacting the plurality of target nucleic acids with a plurality of transposomes, wherein each transposome comprises a Y-adaptor and a transposase. In some embodiments, the transposomes comprise dimers. In some embodiments, the transposase comprises a Tn5 transposase. In some embodiments, the transposomes are bound to a substrate. In some embodiments, the substrate comprises a plurality of beads. In some embodiments, the beads are magnetic.
In some embodiments, a 3′ end of the first extension oligonucleotide is blocked. In some embodiments, a 5′ end of the first extension oligonucleotide is phosphorylated. In some embodiments, the first extension oligonucleotide is bound to a bead. Some embodiments also include cleaving the first extension oligonucleotide from the bead prior to extending the second strand comprising the second primer binding site.
In some embodiments, extending the second strand comprising the second primer binding site comprises polymerase extension, and/or extension with a ligase. In some embodiments, extension with a ligase comprises: hybridizing a ligation oligonucleotide to the first extension oligonucleotide hybridized to the second primer binding site; and ligating the ligation oligonucleotide to the second strand comprising the second primer binding site. In some embodiments, the ligation oligonucleotide comprises an additional index. In some embodiments, the ligation oligonucleotide comprises an additional primer binding site.
Some embodiments also include removing the first extension oligonucleotide after extending the second strand comprising the second primer binding site. In some embodiments, the removing comprises an exonuclease treatment. In some embodiments, the first extension oligonucleotide comprises uracil nucleotides, and the removing comprises degradation of the first extension oligonucleotide with a uracil-specific excision reagent (USER) enzyme.
Some embodiments also include adding a second index to the target nucleic acids comprising the first index. In some embodiments, adding the second index comprises: hybridizing a second extension oligonucleotide to the third primer binding site of the target nucleic acids comprising the first index, wherein the second extension oligonucleotide comprises the second index; and extending the second strand comprising the third primer binding site, thereby adding the second index to the target nucleic acids comprising the first index. In some embodiments, a 3′ end of the second extension oligonucleotide is blocked. In some embodiments, a 5′ end of the second extension oligonucleotide is phosphorylated.
In some embodiments, extending the second strand comprising the third primer binding site comprises polymerase extension. In some embodiments, extending the second strand comprising the third primer binding site comprises extension with a ligase. Some embodiments also include removing the second extension oligonucleotide after extending the third primer binding site.
Some embodiments also include adding a third index to the target nucleic acids comprising the second index. Some embodiments also include adding an additional index to the target nucleic acids comprising the third index.
Some embodiments also include amplifying the target nucleic acids comprising one or more indexes, such as a first index, second index, third index, or more, in some embodiments, the amplification comprises hybridizing amplification primers to the first primer binding sites and to a third, fourth, or fifth primer binding sites. In some embodiments, the primer binding sites can include generic sequences such as a P5 or P7 sequence or complement thereof. In some embodiments, the amplification comprises a PCR. In some embodiments, the amplification comprises bridge amplification. In some embodiments, the target nucleic acids comprise genomic DNA.
Some embodiments include a method of combinatorial indexing a plurality of target nucleic acids. Some embodiments include (a) obtaining a pool of primary indexed nucleic acids, comprising: adding a first index to a plurality of subpopulations of target nucleic acids, wherein a different first index is added to each subpopulation, and combining the subpopulations comprising the different first indexes to obtain the pool of primary indexed nucleic acids, wherein the first index is added to a subpopulation of target nucleic acids according to any of the foregoing embodiments; (b) splitting the pool into a plurality of subpopulations of primary indexed nucleic acids; and (c) obtaining a pool of secondary indexed nucleic acids, comprising: adding a second index to the plurality of subpopulations of primary indexed nucleic acids, wherein a different second index is added to each subpopulation, and combining the subpopulations comprising the different second indexes to obtain the pool of secondary indexed nucleic acids. Some embodiments also include repeating (b) and (c) and adding additional indexes to indexed subpopulations. In some embodiments, adding a first index to a subpopulation of target nucleic acids is performed in a compartment selected from a well, a channel, or a droplet.
An example of a single level indexing by extension of tagmented DNA in solution is presented in
Some embodiments of an indexing workflow applied for DNA tagmented on beads is shown in
Some embodiments that include the addition of multiple indices to a DNA molecule are depicted in
In some embodiments, nucleic acids may degrade with increasing number of cycles. To reduce degradation of nucleic acids, such as oligonucleotides, some embodiments can include one or more method steps depicted in
In some embodiments, indexing by hybridization-extension can be performed in individual compartments containing a single indexing bead from a large pool of beads covered by unique cleavable indexing oligonucleotides. Examples of compartments include wells in a microwell plate, Fluidigm-like channels, or droplets. To provide phasing information or single cell resolution, tagmented DNA for indexing can be either prebound to beads, or be contained inside of nuclear envelope. An example embodiment of single level indexing of bead bound tagmented DNA by hybridization-extension in droplets is depicted in
In some embodiments, droplets can be used for a hybridization step. In some embodiments, subsequent extension steps can be performed in bulk after contents of the droplets is released.
Some embodiments also include DNA molecules enriched on beads grafted by oligonucleotides containing locus specific sequences. Briefly, a sample indexing oligonucleotide is added to the enrichment/hybridization mixture and extended at the same time as the gap between locus specific enrichment oligonucleotides is sealed in post enrichment step. As shown in
Nucleic acid amplification reactions include the use of polymerases which extend primers by incorporating deoxynucleoside triphosphates (dNTPs) at the 3′ ends of primers in the presence of catalytic metals, such as Mg2+. A forward polymerization reaction includes addition of a nucleotide to a primer to generate an extended primer and inorganic pyrophosphate (PPi); however, the reverse reaction is possible, and increasing amounts of PPi can inhibit polymerization.
PPi accumulation is exacerbated when reaction conditions yield large quantities of DNA, such as >0.4 ug/uL during isothermal whole-genome amplification reactions. The accumulation of a magnesium PPi complex can be visualized as a white visible precipitate. The degree of observed precipitate is time dependent and increases with increasing time in whole genome amplification reactions. However, precipitation in amplification reactions may interfere with applications where high DNA yield is desired, such as single-cell genomics applications. DNA extension in nanowells may be impacted because the local concentration may be elevated due to the confined volumes, such as DNA clusters in nanowells as utilized during SBS chemistry events.
Some embodiments of the methods and compositions provided herein include modifying a nucleic acid comprising: amplifying or extending the nucleic acid in the presence of a polymerase. In some embodiments, the amplifying and/or the extending is performed under conditions suitable to remove PPi, to inhibit pyrohosphorolysis, and/or inhibit formation of a Mg2+-PPi complex. In some embodiments, the PPi is soluble. In some embodiments, the amplifying and/or the extending is performed in the presence of an inorganic pyrophosphatase. In some embodiments, a rate of achieving a yield of a product of the amplifying and/or the extending is increased in the presence of the inorganic pyrophosphatase compared to a rate of achieving a yield of a product of the amplifying and/or the extending in the absence of the inorganic pyrophosphatase. In some embodiments, the rate of achieving a yield of a product of the amplifying and/or the extending is increased by at least 2-fold, at least 3-fold, or at least 5-fold. In some embodiments, the amplifying and/or the extending is performed under isothermal conditions. In some embodiments, the amplifying and/or the extending comprises performing a reaction selected from a PCR, a bridge amplification, a whole genome amplification, a loop-mediated isothermal amplification (LAMP), an amplification from nucleic acids obtained from a single cell, a sequencing by synthesis (SBS) reaction, and an exclusion amplification (ExAMp).
In some embodiments, the amplifying and/or the extending comprises performing an amplification from nucleic acids obtained from a single cell, or from cell-free nucleic acids. In some embodiments, the amplifying and/or the extending comprises performing a sequencing by synthesis (SBS) reaction. In some embodiments, the amplifying and/or the extending comprises performing a bridge amplification.
In SBS, extension of a nucleic acid primer along a nucleic acid template (e.g. a target nucleic acid or amplicon thereof) is monitored to determine the sequence of nucleotides in the template. The underlying chemical process can be polymerization (e.g. as catalyzed by a polymerase enzyme). In a particular polymerase-based SBS embodiment, fluorescently labeled nucleotides are added to a primer (thereby extending the primer) in a template dependent fashion such that detection of the order and type of nucleotides added to the primer can be used to determine the sequence of the template. In some embodiments, SBS includes pyrosequencing. Pyrosequencing detects the release of PPi as particular nucleotides are incorporated into a nascent nucleic acid strand (Ronaghi, et al., Analytical Biochemistry 212(1), 84-9 (1996); Ronaghi, Genome Res. 11(1). 3-11 (2001); Ronaghi et al. Science 281(5375), 363 (1998); U.S. Pat. Nos. 6,210,891; 6,258,568 and 6,271,320, each of which is incorporated herein by reference in its entirety). In pyrosequencing, released PPi can be detected by being converted to adenosine triphosphate (ATP) by ATP sulfurylase, and the level of ATP generated can be detected via luciferase produced photons. Thus, the sequencing reaction can be monitored via a luminescence detection system. Flow cells provide a convenient format for housing amplified nucleic acid molecules produced by some methods and compositions provided herein. One or more amplified nucleic acid molecules in such a format can be subjected to an SBS or other detection technique that involves repeated delivery of reagents in cycles. For example, to initiate a first SBS cycle, one or more labeled nucleotides, DNA polymerase, etc., can be flowed into/through a flow cell that houses one or more amplified nucleic acid molecules. Those sites where primer extension causes a labeled nucleotide to be incorporated can be detected. Optionally, the nucleotides can further include a reversible termination property that terminates further primer extension once a nucleotide has been added to a primer. For example, a nucleotide analog having a reversible terminator moiety can be added to a primer such that subsequent extension cannot occur until a deblocking agent is delivered to remove the moiety. Thus, for embodiments that use reversible termination, a deblocking reagent can be delivered to the flow cell (before or after detection occurs). Washes can be carried out between the various delivery steps. The cycle can then be repeated n times to extend the primer by n nucleotides, thereby detecting a sequence of length n. Example SBS procedures, fluidic systems and detection platforms that can be readily adapted for use with methods and compositions provided herein are described, for example, in Bentley et al., Nature 456:53-59 (2008), WO 04/018497; U.S. Pat. No. 7,057,026; U.S. Pat. No. 7,329,492; U.S. Pat. No. 7,211,414; U.S. Pat. No. 7,315,019; U.S. Pat. No. 7,405,281, each of which is incorporated herein by reference.
Some embodiments include amplification and/or extension of nucleic acids comprising oligonucleotide extension and ligation, rolling circle amplification (RCA) (Lizardi et al., Nat. Genet. 19:225-232 (1998), which is incorporated herein by reference in its entirety) and oligonucleotide ligation assay (OLA). See e.g., U.S. Pat. Nos. 7,582,420, 5,185,243, 5,679,524 and 5,573,907; EP 0320308; EP 0336731; EP 0439182; WO 90101069; WO 89/12696; and WO 89109835, each of which is incorporated herein by reference in its entirety. It will be appreciated that these amplification methodologies can be designed to amplify immobilized nucleic acid fragments. For example, in some embodiments, the amplification method can include ligation probe amplification or oligonucleotide ligation assay (OLA) reactions that contain primers directed specifically to the nucleic acid of interest. In some embodiments, the amplification method can include a primer extension-ligation reaction that contains primers directed specifically to the nucleic acid of interest. As a non-limiting example of primer extension and ligation primers that can be specifically designed to amplify a nucleic acid of interest, the amplification can include primers used for the GoldenGate assay (Illumina, Inc., San Diego, Calif.) as exemplified by U.S. Pat. Nos. 7,582,420 and 7,611.869, each of which is incorporated herein by reference in its entirety.
Example isothermal amplification methods include Multiple Displacement Amplification (MDA) as exemplified by, for example, Dean et al., Proc. Natl. Acad. Sci. USA 99:5261-66 (2002) or isothermal strand displacement nucleic acid amplification as exemplified by, for example U.S. Pat. No. 6.214,587, each of Which is incorporated herein by reference in its entirety. Other non-PCR-based methods that can be used in the present disclosure include. for example, strand displacement amplification (SDA) which is described in, for example Walker et al., Molecular Methods for Virus Detection, Academic Press, Inc., 1995; U.S. Pat. Nos. 5,455,166, and 5,130,238, and Walker et al., Nucl. Acids Res. 20:1691-96 (1992) or hyperbranched strand displacement amplification which is described in, for example Lage et al., Genome Research 13:294-307 (2003), each of which is incorporated herein by reference in its entirety.
Some embodiments include amplification such as bridge amplification. Examples of bridge amplification are disclosed in U.S. 20200181686, U.S. 20190374923, U.S. 20180291444 and U.S. 20170342406 which are each incorporated by reference in its entirety. In some embodiments, bridge amplification can include double stranded or partially double stranded amplicons immobilized on a solid support and produced from an extension reaction, which are denatured and the immobilized strand annealed to a universal capture primer through hybridization to an adapter. The resulting structure is immobilized at both ends to create a bridge and the universal capture primer can be extended and then amplified using, for example, a universal primer region contained in the target specific capture primer. This second universal capture primer can be different than the first universal primer complementary to the adapter sequence. The strand that is not immobilized can, for example, be washed away, Bridge amplification can result in a uniform cluster or colony number for amplicons within a plurality. In some embodiments, amplification can include exclusion amplification. See e.g., U.S. 20200181686 and U.S. 20190374923.
Some embodiments relate to systems and kits comprising a substrate having capture probes attached to the substrate. In some embodiments, the substrate comprises a plurality of beads. Some embodiments also include extension primers In some embodiments, the extension primers are in solution. In some embodiments, the capture probes comprises first amplification sites. In some such embodiments, the first amplification sites and the extension primers are incapable of or are essentially incapable of hybridizing to one another. In some embodiments, the first amplification sites and the extension primers are non-complementary to one another. In some embodiments, the first amplification sites and the extension primers comprise non-complementary nucleotide sequences to one another. In some embodiments, the first amplification sites comprise modified nucleotides that inhibit hybridization with the extension primers. In some embodiments, the first amplification sites and the extension primers each tack the same type of nucleotide. In some embodiments, the first amplification sites lack types of nucleotides complementary to one another, and each extension primer consists essentially of the same types of nucleotides as the first amplification site. In some embodiments, the first amplification sites and extension primers lack types of nucleotides complementary to one another, and each extension primer. For example, the first amplification sites lack a combination of nucleotides selected from: adenine (A) and thymine (I); or guanine (G) and cytosine (C). In some embodiments, the first amplification sites consist of a combination of nucleotides selected from: adenine (A) and guanine (G); adenine (A) and cytosine (C); cytosine (C) and thymine (T); or guanine (G) and thymine (I). In some embodiments, the first amplification sites consist of a combination of adenine (A) and guanine (G) nucleotides. In some embodiments, the first amplification site can consist of one type of nucleotide. In some embodiments, the system or kit includes an extension primer. In some embodiments, the extension primer consists of the same types of nucleotides as the capture probes. In some embodiments, the extension primers are provided in an amount less than or equal to the amount of the capture probes. In some embodiments, a system or kit can also include a transposome, a ligase, a polymerase, and/or an inorganic pyrophosphatase.
A method was developed which utilized a “GGA/AAG” primer system. A GGA primer had the sequence of SEQ ID NO:03 (AAAAAGGAGGAGGAGGAGGAGGAAAA); a AAG primer had the sequence of SEQ ID NO:04 (AAGAAGAAGAAGAAGAAGAAGAAGAAGAAA). In this method, both an on-bead primer (capture probe) and an in-solution primer (extension primer) did not include Tor C nucleotides, and were used to drive isothermal amplification with Ex-amp. The primers could not interact with themselves or each other, and as a result would not form oligo dimers on the surface of the bead.
To compare the P5/P7 and GGA/AAG primer systems, two bead types were generated by grafting on the surface either GGA or P5 primers (
In a substantially similar study to Example 1, the relative amounts of output DNA was measured for various amounts of input DNA using either the P5/P7 primer system, or the GGA/AAG primer system. TABLE 1 and
Libraries were hybridized to beads for either 1 hour, or overnight, for various amounts of the library input DNA.
The GGA/AAG or P5/P7 primer systems were compared in a study with 12-plea bead pools. Briefly, 4.8 M beads/condition (800 K each bead type); replicates for each bead pool were used. Human gDNA was whole genome amplified, and about 50 μg of the amplified DNA was hybridized overnight in 24% formamide at room temperature with capture probes on the beads. Non-hybridized DNA was washed from the beads, capture probes were extended, hybridized DNA removed from the extended capture probes, and the capture probes amplified using either P7 or AAG primers in solution. Amplified products were analyzed by quantitative PCR (qPCR).
An example protocol substantially similar to the protocol used in Example 3 is provided. The nomenclature of the primers is consistent with
Over-night Hybridization: (1) Annealed LSP2 and index primer in a 10:9 ratio (LSP2:index ratio corresponded to 5 μM:4.5 μM) 1×. annealing buffer (10 mM Tris-HCl. pH 7.5, 100 mM NaCl, 0.1 mM EDTA) and kept on ice. (2) Prepared LSP1 bead pool by mixing equal amounts of a panel of beads individually pre-bound with capture oligos for specific genomic loci, and “GGA-amplification” oligo (SEQ ID NO:05) that served in later steps as a primer for ExAMP normalization. (3) Added to each sample tube desired amount of LSP1 bead pool (example: 500 IrSP1 beads/locus) and brought the total number of beads in each tube to 5 million with empty PMP beads. The empty PMP beads were used to decrease bead loss. (4) In a separate tube mixed 5 μl previously prepared whole genom amplified (WGA) DNA, 17 μl of Illumina RA1 buffer, and heat denature at 95° C. for 5 min. (5) During WGA DNA denaturation magnetized the beads, removed the supernatant, and transferred 22 μl of the denatured WGA DNA in RA1 buffer to beads pellets and resuspended. (6) Spiked in 2 μl of the 5:4.5 μM LSP2:index duplex. The LSP2:index duplex was at ˜400:360 nM final. (7) Performed over-night hybridization at 48° C. with constant rotation or frequent agitation.
Gap-fill ligate: (1) After incubation, magnetized the beads and discarded the supernatant. (2) Resuspended the beads in 100 μl of prewarmed to 42° C. RA1 buffer. Incubated beads at 42° C. for 5 min. (3) Repeated heated wash step 2 more times. (4) Washed 1× with 100 μl GBS-wash buffer (100 mM Tris-HCl, pH 7.5, 100 mM NaCl, 0.1% Tween20). (5) Magnetized the beads, removed supernatant, and resuspended in 20 μl Illumina ELM3 mix. Incubated beads at room temperature for 30 min. (6) After incubation, magnetized the beads and discarded the supernatant. (7) Washed 2× with 100 μl GBS-wash buffer. (8) Magnetized the beads, discarded the supernatant, and resuspended beads in 20 μl 0.1N NaOH. Incubated beads for 5 min at room temperature. (9) Magnetized the beads, removed supernatant, and washed 3× with 100 μl GBS-wash buffer.
ExAMP normalization: (1) Prepared. ExAMP reagents as follows (for 8 samples): Mixed 140 μl EPX1, 20 μl EPX2, 74 μl EPX3, 200 nM final in-solution ‘AAGGGGT’ primer (SEQ ID NO:06). (2) Magnetized the beads, removed supernatant, resuspended beads in 5 μl RSB, and added 20 μl EPX/in-solution primer mix to the beads. Proceeded with Single-step or Double-step ExAMP protocol.
Single-step ExAMP: (1) Incubated the beads at 38° C. for min. After 15 min, spiked in 350 nM final ‘P5-GGA’ (SEQ ID NO:07) and 350 nM final ‘AAGGGGT-P7’ (SEQ ID NO:08) adapter primers. (2) Incubated the beads for an additional 15 min at 38° C. rotating.
Two-step ExAMP: (1) Incubated the beads at 38° C. for 1 hr. (2) After incubation, prepared new ExAMP reagents as follows (for 8 samples): Mixed 140 μl EPX1, 20 μl EPX2, 74 μl EPX3, 500 nM ‘P5-GGA’ primer (SEQ ID NO:07), and 500 nM ‘AAGGGGT-P7’ (SEQ NO:08) primer. (3) Magnetized beads, removed supernatant, and washed 1X with GBS wash buffer. (4) Resuspended beads with 5 μl RSB and 20 μl new ExAMP/primer mix. (5) Incubated the beads for an additional for 1 hr at 38° C. rotating.
Sequencing: (1) After incubation, samples were cleaned up with 5 μg Zymo column, quantified with Qubit, and clustered for sequencing. (2) Sequencing was performed with a single read and single index run as follows: Read 1: 76 bps with ‘P5-GGA’ Read 1 primer (SEQ ID NO:07). Index 1: 8 bps with CS′ Index 1 primer (SEQ ID NO:09). TABLE 2 lists sequences of certain primers.
In this example, a whole genome amplification reaction was performed to determine if PPi accumulation had an adverse effect on reaction kinetics and inhibited polymerization. DNA yield during the amplification was measured with a PicoGreen dsDNA Quantification kit (Molecular Probes). DNA yield reached 80 μg after 3 hours at 37° C. and remained substantially constant up to 24 hours. Increasing the amount of the dNTPs did not increase the rate of the reaction.
An insoluble precipitate accumulated in reaction tubes during DNA amplification. It was hypothesized that the insoluble material was an PPi precipitate. The precipitate was treated with 0.1 to 1 unit inorganic pyrophosphatase (iPPase) (New England Biolabs, M0361L) post DNA amplification. However, attempts to hydrolyze the precipitate were unsuccessful. The iPPase activity may have been suppressed by insoluble PPi substrate and/or the iPPase might lack activity with a magnesium pyrophosphate complex as compared to an uncomplexed PPi substrate. It was hypothesized that the accumulation of a magnesium PPi complex could inhibit the reaction kinetics via the following two mechanisms: (1) PPi ions driving the reaction in the pyrophosphorolysis direction; and (2) Mg2+ ion depletion due to the formation of the magnesium pyrophosphate species.
In order to decrease the generation of PPi, minimize pyrophosphorolysis and the formation of magnesium PPi complexes, iPPase was added to isothermal whole-genome amplification reactions such that the iPPase would hydrolyze PPi as it is was being generated in real-time. Whole-genome amplification reactions were incubated from 0.5 to 3 hours in either the presence or absence of iPPase (IPP) with 200 ng input DNA.
Amplification reactions were repeated with 100 ng DNA. The DNA was processed with an Infinium EX workflow to measure functional genotyping performance with 60K SNP content per sample well. The Xraw and Yraw dye intensities originating from spectrally unique dyes post staining were measured (
The term “comprising” as used herein is synonymous with “including,” “containing,” or “characterized by,” and is inclusive or open-ended and does not exclude additional, unrecited elements or method steps.
The above description discloses several methods and materials of the present invention. This invention is susceptible to modifications in the methods and materials, as well as alterations in the fabrication methods and equipment. Such modifications will become apparent to those skilled in the art from a consideration of this disclosure or practice of the invention disclosed herein. Consequently, it is not intended that this invention be limited to the specific embodiments disclosed herein, but that it cover all modifications and alternatives coming within the true scope and spirit of the invention.
All references cited herein, including but not limited to published and unpublished applications, patents, and literature references, are incorporated herein by reference in their entirety and are hereby made a part of this specification. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
This application claims priority to U.S. Prov. App. No. 63/001,684 filed Mar. 30, 2020 entitled “METHOD AND COMPOSITIONS FOR PREPARING NORMALIZED NUCLEIC ACID LIBRARIES” which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/024695 | 3/29/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63001684 | Mar 2020 | US |