METHODS AND COMPOSITIONS FOR IDENTIFYING LIGANDS ON ARRAYS USING INDEXES AND BARCODES

Information

  • Patent Application
  • 20210087613
  • Publication Number
    20210087613
  • Date Filed
    September 18, 2020
    4 years ago
  • Date Published
    March 25, 2021
    3 years ago
Abstract
Some embodiments provided herein include methods and compositions for the detection of target ligands on an array. In some embodiments, a capture probe specifically binds to a target ligand from a sample, the location of a bead comprising the capture probe in an array is determined, and the bead is decoded to identify the capture probe and the sample. In some embodiments, a barcode is indicative of a capture probe attached to a bead; and an index is indicative of a subpopulation of beads. Some embodiments relate to sequencing target polynucleotides from several different nucleic acids samples on a bead array.
Description
FIELD OF THE INVENTION

Some embodiments provided herein include methods and compositions for the detection of target ligands on an array. In some embodiments, a capture probe specifically binds to a target ligand from a sample, the location of a bead comprising the capture probe in an array is determined, and the bead is decoded to identify the capture probe and the sample. In some embodiments, a barcode is indicative of a capture probe attached to a bead; and an index is indicative of a subpopulation of beads. Some embodiments relate to sequencing target polynucleotides from several different nucleic acids samples on a bead array.


BACKGROUND OF THE INVENTION

The detection of specific nucleic acid sequences present in a biological sample has been used, for example, as a method for identifying and classifying microorganisms, diagnosing infectious diseases, detecting and characterizing genetic abnormalities, identifying genetic changes associated with cancer, studying genetic susceptibility to disease, and measuring response to various types of treatment. A common technique for detecting specific nucleic acid sequences in a biological sample is nucleic acid sequencing.


Nucleic acid sequencing methodology has evolved significantly from the chemical degradation methods used by Maxam and Gilbert and the strand elongation methods used by Sanger. Several sequencing methodologies are now in use which allow for the parallel processing of thousands of nucleic acids all on a single chip. Some platforms include bead-based and microarray formats in which silica beads are functionalized with probes depending on the application of such formats in applications including sequencing, genotyping, gene expression profiling.


Methods of genotyping different samples on current bead-based arrays require gaskets to physically sub-divide different areas of bead chips into multiple sectors. Individual samples are then loaded into each discrete section created by the gasket. However, such methods might be used with relatively low sample number input but prove arduous or unmanageable as the density of samples per bead chip increase from 24 to 96, 384, 1536, or more samples per bead chip.


SUMMARY OF THE INVENTION

Some embodiments include a method of sequencing target nucleic acids on an array, comprising: (a) obtaining a first and a second population of beads, wherein: the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes, and barcode primer binding sites adjacent to the first barcodes, and the second population of beads comprise oligonucleotides comprising second capture probes, second barcodes, and barcode primer binding sites adjacent to the second barcodes; (b) obtaining a first and a second plurality of polynucleotides, wherein: the first plurality of polynucleotides comprises first target nucleic acids, wherein the plurality of first polynucleotides is in solution, and the second plurality of polynucleotides comprises second target nucleic acids, wherein the plurality of second polynucleotides is in solution; (c) hybridizing the first target nucleic acids to the first capture probes to obtain hybridized first beads, and hybridizing the second target nucleic acids to the second capture probes to obtain hybridized second beads; (d) randomly distributing the hybridized first beads and hybridized second bead on an array; (e) decoding the locations of the first and second beads on the array by sequencing the first and second barcodes; and (f) extending the first and second capture probes to obtain nucleic acid sequence data for the first and second target nucleic acids.


In some embodiments, the first plurality of polynucleotides comprises first indexes, and index primer binding sites adjacent to the first indexes; and the second plurality of polynucleotides comprises second target nucleic acids, second indexes, and index primer binding sites adjacent to the second indexes.


In some embodiments, the first or second plurality of polynucleotides is obtained by tagmenting a nucleic acid sample with a plurality of transposomes. In some embodiments, the plurality of transposomes comprise the first or second indexes. Some embodiments also include adding adaptors to the tagmented nucleic acid sample, wherein the adaptors comprise the first or second indexes. Some embodiments also include amplifying the tagmented nucleic acid sample with primers comprising the first or second indexes.


In some embodiments, extending the first and second capture probes incorporates sequences complementary to the first and second indexes and to the first and second index primer binding sites into the extended capture probes.


In some embodiments, the first population of beads comprise first indexes, and index primer binding sites adjacent to the first indexes, and the second population of beads comprise second indexes, and index primer binding sites adjacent to the second indexes. In some embodiments, the oligonucleotides of the first population of beads comprise the first indexes; and the oligonucleotides of the second population of beads comprise the second indexes. In some embodiments, the first indexes are indicative of a source of the first target nucleic acids, and the second indexes are indicative of a source of the second target nucleic acids. In some embodiments, the first indexes are the same as one another, and the second indexes are the same as one another.


Some embodiments also include sequencing the first and second indexes. In some embodiments, sequencing the first and second indexes comprises extending primers hybridized to the index primer binding sites. In some embodiments, the index binding primer sites are the same.


In some embodiments, the first and second target nucleic acids are obtained from different nucleic acid samples. In some embodiments, the first and second target nucleic acids are obtained from genomic DNA.


In some embodiments, the first and second barcodes are indicative of a nucleic acid sequence of the first or second capture probes. In some embodiments, the first barcodes are different from one another, and the second barcodes are different from one another. In some embodiments, sequencing the first and second barcodes comprises extending primers hybridized to the barcode primer binding sites. In some embodiments, the barcode primer binding sites are the same.


In some embodiments, extending the first and second capture probes comprises polymerase extension. In some embodiments, extending the first and second capture probes comprises addition of a single nucleotide to a capture probe. Some embodiments also include ligating locus specific oligonucleotides to the extended capture probes. In some embodiments, extending the first and second capture probes comprises ligating locus specific oligonucleotides to the capture probes.


In some embodiments, step (c) is performed in solution.


In some embodiments, the array is located on the surface of a flowcell. In some embodiments, the first and second beads are adapted to be attached to the array. In some embodiments, the first and second beads comprise biotin, streptavidin, or a derivative thereof and the array comprises biotin, streptavidin, or a derivative thereof. In some embodiments, the first and second beads are magnetic.


Some embodiments include a method of sequencing target nucleic acids on an array, comprising: (a) obtaining a first and a second population of beads, wherein: the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes, and barcode primer binding sites adjacent to the first barcodes, and the second population of beads comprise oligonucleotides comprising second capture probes, second barcodes, and barcode primer binding sites adjacent to the second barcodes; (b) obtaining a first and a second plurality of polynucleotides, wherein: the first plurality of polynucleotides comprises first target nucleic acids, first indexes, and index primer binding sites adjacent to the first indexes, wherein the plurality of first polynucleotides is in solution, and the second plurality of polynucleotides comprises second target nucleic acids, second indexes, and second primer binding sites adjacent to the second indexes wherein the plurality of second polynucleotides is in solution; (c) hybridizing the first target nucleic acids to the first capture probes to obtain hybridized first beads, and hybridizing the second target nucleic acids to the second capture probes to obtain hybridized second beads; (d) randomly distributing the hybridized first beads and hybridized second bead on an array; (e) decoding the locations of the first and second beads on the array by sequencing the first and second barcodes; (f) extending the first and second capture probes to obtain nucleic acid sequence data for the first and second target nucleic acids; and (g) determining a source for the nucleic acid sequence data of the first and second target nucleic acids by sequencing the first and second indexes.


In some embodiments, the first plurality of polynucleotides comprises first indexes, and index primer binding sites adjacent to the first indexes; and the second plurality of polynucleotides comprises second target nucleic acids, second indexes, and index primer binding sites adjacent to the second indexes.


In some embodiments, the first or second plurality of polynucleotides is obtained by tagmenting a nucleic acid sample with a plurality of transposomes. In some embodiments, the plurality of transposomes comprise the first or second indexes. Some embodiments also include adding adaptors to the tagmented nucleic acid sample, wherein the adaptors comprise the first or second indexes. Some embodiments also include amplifying the tagmented nucleic acid sample with primers comprising the first or second indexes.


In some embodiments, extending the first and second capture probes incorporates sequences complementary to the first and second indexes and to the first and second index primer binding sites into the extended capture probes.


Some embodiments also include a method of sequencing target nucleic acids on an array, comprising: (a) obtaining a first and a second population of beads, wherein: the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes and barcode primer binding sites adjacent to the first barcodes, and first indexes and index primer binding sites adjacent to the first indexes, and the second population of beads comprise oligonucleotides comprising second capture probes, second barcodes and barcode primer binding sites adjacent to the second barcodes, and second indexes and index primer binding sites adjacent to the second indexes; (b) obtaining a first and a second plurality of polynucleotides, wherein: the first plurality of polynucleotides comprises first target nucleic acids, wherein the plurality of first polynucleotides is in solution, and the second plurality of polynucleotides comprises second target nucleic acids, wherein the plurality of second polynucleotides is in solution; (c) hybridizing the first target nucleic acids to the first capture probes to obtain hybridized first beads, and hybridizing the second target nucleic acids to the second capture probes to obtain hybridized second beads; (d) randomly distributing the hybridized first beads and hybridized second bead on an array; (e) decoding the locations of the first and second beads on the array by sequencing the first and second barcodes; and (f) extending the first and second capture probes to obtain nucleic acid sequence data for the first and second target nucleic acids.


In some embodiments, the oligonucleotides of the first population of beads comprise the first indexes; and the oligonucleotides of the second population of beads comprise the second indexes. In some embodiments, the first indexes are indicative of a source of the first target nucleic acids, and the second indexes are indicative of a source of the second target nucleic acids. In some embodiments, the first indexes are the same as one another, and the second indexes are the same as one another.


Some embodiments also include sequencing the first and second indexes. In some embodiments, sequencing the first and second indexes comprises extending primers hybridized to the index primer binding sites. In some embodiments, the index binding primer sites are the same.


In some embodiments, the first and second target nucleic acids are obtained from different nucleic acid samples. In some embodiments, the first and second target nucleic acids are obtained from genomic DNA.


In some embodiments, the first and second barcodes are indicative of a nucleic acid sequence of the first or second capture probes. In some embodiments, the first barcodes are different from one another, and the second barcodes are different from one another. In some embodiments, sequencing the first and second barcodes comprises extending primers hybridized to the barcode primer binding sites. In some embodiments, the barcode primer binding sites are the same.


In some embodiments, extending the first and second capture probes comprises polymerase extension. In some embodiments, the first and second capture probes comprises addition of a single nucleotide to a capture probe. Some embodiments also include ligating locus specific oligonucleotides to the extended capture probes. In some embodiments, extending the first and second capture probes comprises ligating locus specific oligonucleotides to the capture probes.


In some embodiments, step (c) is performed in solution.


In some embodiments, a flowcell comprises the array. In some embodiments, the array comprises a plurality of wells. In some embodiments, the first and second beads are adapted to be attached to the array. In some embodiments, the first and second beads comprise biotin, streptavidin, or a derivative thereof; and the array comprises biotin, streptavidin, or a derivative thereof. In some embodiments, the first and second beads are magnetic.


Some embodiments include a kit comprising: a plurality of populations of beads comprising oligonucleotides attached to the beads, the oligonucleotides comprising indexes, index primer binding sites adjacent to the indexes, capture probes, barcodes, and barcode primer binding sites adjacent to the barcodes, wherein the indexes are different between the populations of beads. In some embodiments, the index primer binding sites are the same in the plurality of populations. In some embodiments, the barcodes are indicative of a nucleic acid sequence of the capture probes. In some embodiments, the barcodes are different in a population of beads. In some embodiments, the barcode primer binding sites are the same in the plurality of populations. Some embodiments also include a reagent selected from: a locus specific oligonucleotide; a transposome for tagmenting a nucleic acid sample; a transposome comprising an index and an index primer binding site; an adaptor comprising an index and an index primer binding site; a primer capable of hybridizing to an index primer binding site or complement thereof; and a primer capable of hybridizing to a barcode primer binding site or complement thereof. Some embodiments also include a flowcell.


Some embodiments include a method of preparing an indexed population of beads, comprising: (a) obtaining a population of beads, wherein each bead comprises an adaptor, a capture probe, and a first polynucleotide comprising a barcode, and a barcode primer binding site; (b) obtaining a plurality of index polynucleotides, wherein each index polynucleotide comprises an index, and an index primer binding site; and (c) attaching the plurality of index polynucleotides to the population of beads via the adaptors, thereby obtaining an indexed population of beads.


In some embodiments, (c) comprises extending the adaptors by polymerase extension.


In some embodiments, each index polynucleotide comprises an adaptor binding site, and the attaching comprises hybridizing the adaptor binding sites to the adaptors.


In some embodiments, (c) comprises ligating the index polynucleotides to the adaptors.


In some embodiments, the attaching comprises hybridizing a splint polynucleotide to the adaptor and to the index polynucleotide.


In some embodiments, (c) comprises attaching the plurality of index polynucleotides to the adaptors of the population of beads via a chemically reactive moiety. In some embodiments, the attaching comprises a click chemistry reaction.


In some embodiments, the first polynucleotides of the population of beads comprise capture probes different from one another.


In some embodiments, the index of each index polynucleotide is the same.


In some embodiments, the first polynucleotide comprises the capture probe.


Some embodiments also include contacting the indexed population of beads with a plurality of nucleic acids comprising a target nucleic acid.


Some embodiments also include mixing the indexed population of beads contacted with a plurality of nucleic acids comprising a target nucleic with an additional indexed population of beads, wherein the additional indexed population of beads comprises an index polynucleotide comprising an index different from the index of the indexed population of beads contacted with a plurality of nucleic acids.


In some embodiments, the capture probe comprises a protein.


In some embodiments, the method is performed on a flow cell.


Some embodiments include a method for detecting a target ligand, comprising: (a) obtaining a population of beads, wherein each bead comprises a capture probe, a first polynucleotide comprising a barcode, and a barcode primer binding site; (b) obtaining an index polynucleotide comprising an index, an index primer binding site, and an adaptor capable of binding to the barcode primer binding site; (c) specifically binding a target ligand to the capture probe; (d) hybridizing the index polynucleotide to the first polynucleotide via the adaptor; (e) detecting the target ligand on an array; and (f) determining the index and the barcode of the first polynucleotide.


In some embodiments, (e) comprises distributing the population of beads on an array.


In some embodiments, (f) comprises hybridizing an index primer to the index primer binding site, and determining the sequence of the index.


In some embodiments, dehybridizing the index polynucleotide from the first polynucleotide; hybridizing a barcode primer to the barcode primer binding site; and extending the barcode primer to determine the sequence of the barcode.


In some embodiments, the index polynucleotide further comprises a cleavable linker located between the adaptor and the index, and (f) comprises: (i) cleaving the cleavable linker; and (ii) extending the adaptor to determine the sequence of the barcode.


In some embodiments, the capture probe comprises a protein.


In some embodiments, the target ligand comprises a target nucleic acid. In some embodiments, the first polynucleotide comprises the capture probe. In some embodiments, (e) comprises extending the first polynucleotide hybridized to the target nucleic acid. In some embodiments, the extension comprises adding a detectable dideoxynucleotide.


In some embodiments, the method is performed on a flow cell.


Some embodiments include a method for detecting a target ligand, comprising: (a) obtaining a population of beads, wherein each bead comprises a capture probe, a first polynucleotide comprising a barcode, and a barcode primer binding site, and a second polynucleotide; (b) obtaining an index polynucleotide comprising an index, an index primer binding site, and an adaptor; (c) specifically binding a target ligand to the capture probe; (d) attaching the index polynucleotide to the second polynucleotide via the adaptor; (e) detecting the target ligand on an array; and (f) determining the index and the barcode of the first polynucleotide.


In some embodiments, the second polynucleotide comprises a barcode, and a barcode primer binding site.


In some embodiments, (d) comprises adding a reactive moiety to the second polynucleotide, wherein the adaptor is capable of attaching to the reactive moiety. In some embodiments, the adding a reactive moiety comprises a click chemistry reaction.


In some embodiments, (e) comprises distributing the population of beads on an array.


In some embodiments, (f) comprises hybridizing an index primer to the index primer binding site, and determining the sequence of the index.


In some embodiments, (f) comprises hybridizing a barcode primer to the barcode primer binding site, and determining the sequence of the barcode.


In some embodiments, the capture probe comprises a protein.


In some embodiments, the target ligand comprises a target nucleic acid. In some embodiments, the first polynucleotide comprises the capture probe. In some embodiments, (e) comprises extending the first polynucleotide hybridized to the target nucleic acid. In some embodiments, the extension comprises adding a detectable dideoxynucleotide.


In some embodiments, the method is performed on a flow cell.


Some embodiments include a method of detecting a target ligand on an array, comprising: (a) obtaining a first and a second population of beads, wherein each bead comprises: a capture probe, wherein the capture probe is capable of specifically binding to a target ligand, a nucleic acid encoding a barcode and a barcode primer binding site, wherein the barcode is indicative of the capture probe, and a nucleic acid encoding an index and an index primer binding site, wherein the index is indicative of the source of the bead from the first population or the second population; (b) contacting the first population of beads with a first sample comprising a first target ligand, wherein the first target ligand specifically binds to a capture probe of the first population of beads, and thereby obtaining a target-bound first population of beads; (c) contacting the second population of beads with a second sample comprising a second target ligand, wherein the second target ligand specifically binds to a capture probe of the second population of beads, and thereby obtaining a target-bound second population of beads; (d) randomly distributing the target-bound first population of beads and the target-bound second population of beads on an array; (e) detecting the location of the beads comprising the first target ligand and the second target ligand on the array; and (f) determining the sequence of the index and of the barcode of the beads comprising the first target ligand and the second target ligand on the array.


In some embodiments, the capture probe comprises a polynucleotide. In some embodiments, the target ligand comprises a nucleic acid. In some embodiments, detecting the location of the beads comprising the first target ligand and the second target ligand on the array comprises extending the capture probe by polymerase extension or by ligation.


In some embodiments, the capture probe comprises a protein.


In some embodiments, step (e) is performed after step (f).


In some embodiments, the barcodes of the first population of beads comprise barcodes different from one another, and the barcodes of the second population of beads comprise barcodes different from one another.


In some embodiments, the indexes of the first population of beads are the same as one another, and the indexes of the second population of beads are the same as one another.


In some embodiments, the array is located on the surface of a flowcell. In some embodiments, the first and a second population of beads are adapted to be attached to the array. In some embodiments, the first and a second population of beads comprise biotin, streptavidin, or a derivative thereof; and the array comprises biotin, streptavidin, or a derivative thereof. In some embodiments, the first and a second population of beads are magnetic.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 depicts an example embodiment of a polynucleotide comprising a barcode, primer binding site, and capture probe attached to a bead via a 5′ linker.



FIG. 2 depicts an example embodiment of a polynucleotide comprising a capture probe, barcode, and primer binding site, attached to a bead via a 5′ linker, and having a cleavable linker between the capture probe and barcode.



FIG. 3 depicts an example embodiment of a polynucleotide comprising a barcode, primer binding site, and capture probe attached to a bead via a 3′ linker.



FIG. 4 depicts an example embodiment of a polynucleotide comprising a spacer, barcode, primer binding site, and capture probe attached to a bead via a 5′ linker.



FIG. 5A depicts an example embodiment of a bead with an attached first polynucleotide comprising a barcode, a barcode primer binding site, and a capture probe, and with an attached second polynucleotide comprising an index and an index primer binding site.



FIG. 5B depicts an example embodiment of a target nucleic containing a single nucleotide polymorphism (SNP) hybridized to a capture probe attached to a bead.



FIG. 5C depicts an example embodiment of capture probe extended with a detectable marker.



FIG. 5D depicts an example embodiment of a barcode primer hybridized to a barcode primer binding site, and extension of the barcode primer.



FIG. 5E depicts an example embodiment of an index primer hybridized to an index primer binding site, and extension of the index primer.



FIG. 6A depicts example embodiment of: a bead comprising a single polynucleotide comprising a capture probe (left panel); a bead comprising a nucleic acid capture probe, a polynucleotide comprising a barcode indicative of the capture probe, and a polynucleotide comprising an index to distinguish a subpopulation of beads from another subpopulation of beads (middle panel); and a bead comprising a capture probe comprising an antibody or antigen-binding fragment of the antibody, a polynucleotide comprising a barcode indicative of the capture probe, and a polynucleotide comprising an index to distinguish a subpopulation of beads from another subpopulation of beads (right panel).



FIG. 6B depicts example embodiments of: a bead comprising first and second capture probes hybridized to a target nucleic acid, in which the first capture probe comprises a cleavable linker (left panel); and a bead comprising a protein capture probe which transiently binds a substrate to generate a signal comprising a detectable label (right panel).



FIG. 6C depicts an example embodiment of a dual probe assay in which a target nucleic acid is hybridized to first and second capture probes on a bead, the first capture probe is ligated to the second capture probe, and a cleavable linker is cleaved to generate a bead comprising an extended first capture probe comprising a detectable label.



FIG. 7A is a photograph of a bead pool immobilized on a flow cell.



FIG. 7B is a bar graph of the number of replicates for certain bead types in certain bins.



FIG. 7C is a graph of mean C intensity vs mean T intensity. The mean C intensity is the intensity of the fluorescence coming from labeled cytosine in a pair of nucleotides and the mean T intensity is the intensity of the fluorescence from labeled thymidine in a pair of nucleotides.



FIG. 8 is a histogram of the number of certain bead types in certain bins for a representative sample.



FIG. 9 depicts an embodiment of a work flow for sequencing target nucleic acids on an array, in which polynucleotides comprising target nucleic also include an index.



FIG. 10 depicts an embodiment of a work flow for sequencing target nucleic acids on an array, in which beads include an index.



FIG. 11 depicts an embodiment of a work flow for preparing a population of indexed beads, and use of the indexed beads in a multiwell plate, each well containing a different sample.



FIG. 12A is a schematic view of a bead with first and second polynucleotides attached to the bead. The first polynucleotide comprises a code, such as a barcode; a primer A, such as a barcode primer binding site; and a probe, such as a capture probe. The second polynucleotide comprises an index. The second polynucleotide is attached to the bead via a spacer and a linker X-Y.



FIG. 12B is a schematic view of a bead with first and second polynucleotides attached to the bead. The first polynucleotide comprises a code, such as a barcode; a primer A, such as a barcode primer binding site; and a probe, such as a capture probe. The second polynucleotide comprises an index; and a primer B, such as an index primer binding site. The second polynucleotide is attached to the bead via a splint hybridized to the second polynucleotide and the adapter. The second polynucleotide can be ligated to the Adapter.



FIG. 12C is a schematic view of a bead with first and second polynucleotides attached to the bead. The first polynucleotide comprises a code, such as a barcode; a primer A, such as a barcode primer binding site; and a probe, such as a capture probe. The second polynucleotide comprises an index; a primer B, such as an index primer binding site; and an adaptor binding site. The second polynucleotide is attached to the bead via hybridization to an adaptor attached to the bead. The Adapter can be extended to incorporate a complement of an index; a Primer B.



FIG. 13A is a schematic view of a bead with a first polynucleotide attached to the bead. The first polynucleotide comprises a code, such as a barcode; a primer, such as a barcode primer binding site; and a probe, such as a capture probe. A second polynucleotide is shown comprising an index; a primer 2, such as an index primer binding site; a spacer which can include an optional cleavable site, such as a uridine cleavage site; and an adaptor hybridized to the barcode primer binding site.



FIG. 13B depicts an embodiment of a work flow for detecting a target nucleic acid on an array in which an index polynucleotide is hybridized to a first polynucleotide attached to the bead.



FIG. 14 is a schematic view of a bead with first and second polynucleotides attached to the bead. The first polynucleotide comprises a code, such as a barcode; a primer A, such as a barcode primer binding site; and a probe, such as a capture probe. The second polynucleotide comprises repeats of an index, and of an index primer binding site.



FIG. 15A depicts an embodiment of a work flow for detecting a target nucleic acid on an array in which an index polynucleotide is attached to a bead via a reactive group.



FIG. 15B depicts an embodiment of a work flow for detecting a target nucleic acid on an array, and is a continuation of the workflow of FIG. 15A.





DETAILED DESCRIPTION

Some embodiments provided herein include methods and compositions for the detection of target ligands on an array. In some embodiments, a target ligand can include a nucleic acid, a protein, or other antigen. In some embodiments, a capture probe specifically binds to a target ligand from a sample, the location of a bead comprising the capture probe in an array is determined, and the bead is decoded to identify the capture probe and the sample. In some embodiments, a barcode is indicative of a capture probe attached to a bead; and an index is indicative of a subpopulation of beads. In some embodiments, the barcode and the index are determined by sequencing. Some embodiments also include a dual probe assay in which first and second capture probes attached to a bead are ligated together in the presence of a target nucleic acid, an end of the ligation product is cleaved from the bead, and the bead comprising the extended capture probe detected and decoded on an array.


Some embodiments provided herein in relate to high-throughput genotyping on arrays. Some embodiments relate to decoding the locations of microfeatures in an array. In some embodiments, microfeatures comprise polynucleotides having barcodes and indexes. Some embodiments include sequencing barcodes and indexes to identify the locations of polynucleotides in an array. Certain aspects that may be useful with the methods and compositions disclosed herein are disclosed in WO 2020/086746 which is incorporated by reference in its entirety.


Decoding by hybridization includes identifying the location of a capture probe in a randomly distributed array of capture probes. The method typically involves several successive cycles of hybridizing labeled hybridization probes to one or more portion of the capture probe, imaging hybridization events, and removing the hybridization probes. Decoding by hybridization requires specialized reagents, specialized fluidic devices and specialized detectors. In some embodiments, decoding by hybridization can take up to 8 hours with 7-8 successive cycles.


Embodiments provided herein include random-distributed arrays of polynucleotides comprising a primer binding site and a barcode. In some embodiments, the barcode can be readily sequenced to decode the array using a high throughput sequencing system. Some embodiments can significantly reduce the time taken to decode an array with no additional reagents, hybridization probes, or specialized decoding equipment.


Some embodiments include the use of next generation sequencing (NGS) techniques and bead-based microarrays. Some such embodiments deliver high-performance, low-cost and high throughput genotyping assays that can be run on a generic NGS sequencing platform with minor modifications to substrates and reagents.


In some multiplexing methods, multiple nucleic samples from different sources can be processed in parallel by keeping each sample physically separated. Some embodiments provided herein include a nucleic acid indexing method that removes the need for physical barriers separating individual samples in all steps by indexing each sample via an index on an associated bead. In some embodiments, index de-multiplexing is performed by sequencing and can be executed at a user site using standard sequencing by synthesis (SBS) chemistry. In addition, cost, space and time limitations due to internal decoding of a bead array are reduced by adopting a decode by sequencing (DBS) approach that can be implemented at the customer site using a standard platform and SBS chemistry.


Some embodiments include an indexed enrichment bead pool comprising a bead depicted in FIG. 5A. In some such embodiments, a bead includes a first polynucleotide comprising a locus-specific capture probe, a barcode for positional identification of the associated probe on an array, and a barcode primer binding site for SBS reading of the barcode; and a second polynucleotide comprising an index for sample multiplexing, and an index primer binding site for SBS reading of the index.


In some embodiments, a bead pool complexity is defined by the number of capture probes, or plexity (N) and by the number of samples supported (S). Thus, a bead pool supporting plexity N and S samples will consist of S×N unique bead types. In some embodiments, capture probes and/or index primers include additional 3′ orthogonal blockers to avoid interference during SBS on one of the two oligonucleotides.


Some embodiments include conducting a genotyping assay in which a multiwell plate containing S wells is loaded with S bead pools, each bead pool having a unique sample index, with each well containing N unique bead types. After nucleic acid library generation from samples, such treating a nucleic acid sample with steps including random primer amplification followed by enzymatic fragmentation and clean up, each sample library is added to an indexed well and allowed to hybridize to the capture probes. After hybridization is complete, a single base extension assay to probe the SNP of interest is executed by adding an incorporation mix that includes fluorescent nucleotides and an appropriate polymerase. At the end of the incorporation, all bead-capture samples in a plate are pooled and loaded into a flowcell. The flowcell can be plain or patterned, and the surface appropriately modified to support the immobilization of beads at the desired density. In some embodiments, upon bead immobilization, a SNP readout is performed which includes a single scan cycle to read the signal deriving from fluorescent incorporation at the SNP site. This cycle may include an SBS cycle on the instrument. A barcode readout is also performed which includes 12-20 SBS cycles, depending on beadpool plexity, to identify capture probe and position of a specific bead within the flowcell. In some embodiments, this step could be replaced by additional cycles of sequencing past the identified SNP. A sample index readout is also performed which includes 6-12 SBS cycles to read the sample index. In some embodiments, the entire on-flowcell assay can include less than about 30 SBS cycles and can be executed in less than 4 hours.


Some embodiments relate to methods and compositions of sequencing target polynucleotides on an array. Some embodiments relate to sequencing target polynucleotides from several different nucleic acids samples on a bead array. In some such embodiments, an index sequence is associated with target nucleic acids from a nucleic acid sample.


In one embodiment of the invention, gaskets are not used to subdivide the various areas of a gene array. Instead, index sequences are added to the samples to differentiate samples hybridized to beads and to support high plexity sample pooling. In some embodiments indexes can be added to either bead pools or samples.


Embodiments relate to the preparation of polynucleotide libraries from many different nucleic acid samples, and determining the presence of certain features, such as single nucleotide polymorphisms, insertions, deletions, in target nucleic acids of each nucleic acid sample. The polynucleotide libraries are interrogated in parallel on an array for the presence of the certain features of target nucleic acids.


In some embodiments, each nucleic acid sample is associated with a different index sequence, such that a library of polynucleotides derived from a nucleic acid sample include the same index. A target nucleic acid is identified in a library of polynucleotides by selectively hybridizing the target nucleic acid to a capture probe attached to a bead, extending the capture probe, and detecting the extension of the capture probe on an array. Each capture probe may include a barcode and thus the capture probe can be identified by sequencing the barcode associated with the capture probe on the array. In some such embodiments, an oligonucleotide attached to a bead includes the capture probe and a barcode.


In some embodiments, the index associated with a polynucleotide can be sequenced on the array. The location of signals on the array for the extension of the capture probe, the sequence of the barcode, and the sequence of the index can identify the presence of a certain feature, such as single nucleotide polymorphism, insertion, deletion, in a target nucleic acid from a particular nucleic acid sample. In some embodiments, many different nucleic acid samples may be tested on a single array.


As used herein, “array” can refer to a population of different microfeatures, such as microfeatures comprising polynucleotides, which are associated or attached with a surface such that the different microfeatures can be differentiated from each other according to relative location. An individual feature of an array can include a single copy of a microfeature or multiple copies of the microfeature can be present as a population of microfeatures at an individual feature of the array. The population of microfeatures at each feature typically is homogenous, having a single species of microfeature. Thus, multiple copies of a single nucleic acid sequence can be present at a feature, for example, on multiple nucleic acid molecules having the same sequence.


In some embodiments, a heterogeneous population of microfeatures can be present at a feature. Thus, a feature may but need not include only a single microfeature species and can instead contain a plurality of different microfeature species such as a mixture of nucleic acids having different sequences. Neighboring features of an array can be discrete one from the other in that they do not overlap. Accordingly, the features can be adjacent to each other or separated by a gap. In embodiments where features are spaced apart, neighboring sites can be separated, for example, by a distance of less than 100 μm, 50 μm, 10 μm, 5 μm, 1 μm, 0.5 μm, 100 nm, 50 nm, 10 nm, 5 nm, 1 nm, 0.5 nm, 100 pm, 50 pm, 1 pm or any distance within a range of any two of the foregoing distances. The layout of features on an array can also be understood in terms of center-to-center distances between neighboring features. An array useful in the invention can have neighboring features with center-to-center spacing of less than about 100 μm, 50 μm, 10 μm, 5 μm, 1 μm, 0.5 μm, 100 nm, 50 nm, 10 nm, 5 nm, 1 nm, 0.5 nm, 100 pm, 50 pm, 1 pm or any distance within a range of any two of the foregoing distances.


In some embodiments, the distance values described above and elsewhere herein can represent an average distance between neighboring features of an array. As such, not all neighboring features need to fall in the specified range unless specifically indicated to the contrary, for example, by a specific statement that the distance constitutes a threshold distance between all neighboring features of an array. Embodiments can be used with arrays having features at any of a variety of densities. Examples ranges of densities for certain embodiments include from about 10,000,000 features/cm2 to about 2,000,000,000 features/cm2; from about 100,000,000 features/cm2 to about 1,000,000,000 features/cm2; from about 100,000 features/cm2 to about 10,000,000 features/cm2; from about 1,000,000 features/cm2 to about 5,000,000 features/cm2; from about 10,000 features/cm2 to about 100,000 features/cm2; from about 20,000 features/cm2 to about 50,000 features/cm2; from about 1,000 features/cm2 to about 5,000 features/cm2, or any density within a range of any two of the foregoing densities.


As used herein, “surface” can refer to a part of a substrate or support structure that is accessible to contact with reagents, beads or analytes. The surface can be substantially flat or planar. Alternatively, the surface can be rounded or contoured. Example contours that can be included on a surface are wells, depressions, pillars, ridges, channels or the like. Example materials that can be used as a substrate or support structure include glass such as modified or functionalized glass; plastic such as acrylic, polystyrene or a copolymer of styrene and another material, polypropylene, polyethylene, polybutylene, polyurethane or TEFLON; polysaccharides or cross-linked polysaccharides such as agarose or Sepharose; nylon; nitrocellulose; resin; silica or silica-based materials including silicon and modified silicon; carbon-fibre; metal; inorganic glass; optical fibre bundle, or a variety of other polymers. A single material or mixture of several different materials can form a surface useful in the invention. In some embodiments, a surface comprises wells. In some embodiments, a support structure can include one or more layers. Example support structures can include a chip, a film, a multi-well plate, and a flow-cell.


As used herein, “bead” can refer to a small body made of a rigid or semi-rigid material. The body can have a shape characterized, for example, as a sphere, oval, microsphere, or other recognized particle shape whether having regular or irregular dimensions. Example materials that are useful for beads include glass such as modified or functionalized glass; plastic such as acrylic, polystyrene or a copolymer of styrene and another material, polypropylene, polyethylene, polybutylene, polyurethane or TEFLON; polysaccharides or cross-linked polysaccharides such as agarose or Sepharose; nylon; nitrocellulose; resin; silica or silica-based materials including silicon and modified silicon; carbon-fiber; metal; inorganic glass; or a variety of other polymers. Example beads include controlled pore glass beads, paramagnetic beads, thoria sol, Sepharose beads, nanocrystals and others known in the art. Beads can be made of biological or non-biological materials. Magnetic beads are particularly useful due to the ease of manipulation of magnetic beads using magnets at various steps of the methods described herein. Beads used in certain embodiments can have a diameter, width or length from about 0.1 μm to about 100 μm, from about 0.1 nm to about 500 nm. In some embodiments, beads used in certain embodiments can have a diameter, width or length less than about 100 μm, 50 μm, 10 μm, 5 μm, 1 μm, 0.5 μm, 100 nm, 50 nm, 10 nm, 5 nm, 1 nm, 0.5 nm, 100 pm, 50 pm, 1 pm or any diameter, width or length within a range of any two of the foregoing diameters, widths or lengths. Bead size can be selected to have reduced size, and hence get more features per unit area, whilst maintaining sufficient signal (template copies per feature) in order to analyze the features.


In some embodiments, polynucleotides can be attached to beads. In some embodiments, the beads can be distributed into wells on the surface of a substrate. Example bead arrays that can be used in certain embodiments include randomly ordered BEADARRAY technology (Illumina Inc., San Diego Calif.). Such bead arrays are disclosed in Michael et al., Anal Chem 70, 1242-8 (1998); Walt, Science 287, 451-2 (2000); Fan et al., Cold Spring Harb Symp Quant Biol 68:69-78 (2003); Gunderson et al., Nat Genet 37:549-54 (2005); Bibikova et al. Am J Pathol 165:1799-807 (2004); Fan et al., Genome Res 14:878-85 (2004); Kuhn et al., Genome Res 14:2347-56 (2004); Yeakley et al., Nat Biotechnol 20:353-8 (2002); and Bibikova et al., Genome Res 16:383-93 (2006), each of which is incorporated by reference in its entirety.


As used herein “polynucleotide” and “nucleic acid”, may be used interchangeably, and can refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Thus, this term includes single-, double-, or multi-stranded DNA or RNA. The term polynucleotide also refers to both double and single-stranded molecules. Examples of polynucleotides include a gene or gene fragment, genomic DNA, genomic DNA fragment, exon, intron, messenger RNA (mRNA), transfer RNA, ribosomal RNA, non-coding RNA (ncRNA) such as PIWI-interacting RNA (piRNA), small interfering RNA (siRNA), and long non-coding RNA (lncRNA), small hairpin (shRNA), small nuclear RNA (snRNA), micro RNA (miRNA), small nucleolar RNA (snoRNA) and viral RNA, ribozyme, cDNA, recombinant polynucleotide, branched polynucleotide, plasmid, vector, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probe, primer or amplified copy of any of the foregoing. A polynucleotide can include modified nucleotides, such as methylated nucleotides and nucleotide analogs including nucleotides with non-natural bases, nucleotides with modified natural bases such as aza- or deaza-purines. A polynucleotide can be composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); and thymine (T). Uracil (U) can also be present, for example, as a natural replacement for thymine when the polynucleotide is RNA. Uracil can also be used in DNA. Thus, the term ‘sequence’ refers to the alphabetical representation of a polynucleotide or any nucleic acid molecule, including natural and non-natural bases.


As used herein, “target nucleic acid” or grammatical equivalent thereof can refer to nucleic acid molecules or sequences that it is desired to sequence, analyze and/or further manipulate. In some embodiments, a target nucleic acid can be attached to an array. In some embodiments, a capture probe can be attached to an array and the array used subsequently to detect a target nucleic acid in a sample that interacts with the probe. In this regard, it will be understood that in some embodiments, the terms “target” and “probe” can be used interchangeably with regard to nucleic acid detection methods.


As used herein, “capture probe” can refer to a polynucleotide having sufficient complementarity to specifically hybridize to a target nucleic acid. A capture probe can function as an affinity binding molecule for isolation of a target nucleic acid from other nucleic acids and/or components in a mixture. In some embodiments, a target nucleic acid can be specifically bound by a capture probe through intervening molecules. Examples of intervening molecules include linkers, adaptors and other bridging nucleic acids having sufficient complementarity to specifically hybridize to both a target sequence and a capture probe.


As used herein, “hybridization”, “hybridizing” or grammatical equivalent thereof, can refer to a reaction in which one or more polynucleotides react to form a complex that is formed at least in part via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding can occur by Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner. The complex can have two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of thereof. The strands can also be cross-linked or otherwise joined by forces in addition to hydrogen bonding.


As used herein, “extending”, “extension” or any grammatical equivalents thereof can refer to the addition of dNTPs to a primer, polynucleotide or other nucleic acid molecule by an extension enzyme such as a polymerase. For example, in some methods disclosed herein, the resulting extended primer includes sequence information of an RNA. While some embodiments are discussed as performing extension using a polymerase such as a DNA polymerase, or a reverse transcriptase, extension can be performed in any other manner well known in the art. For example, extension can be performed by ligating short pieces of random oligonucleotides together, such as oligonucleotides that have hybridized to a strand of interest.


As used herein, “ligation” or “ligating” or other grammatical equivalents thereof can refer to the joining of two nucleotide strands by a phosphodiester bond. Such a reaction can be catalyzed by a ligase. A ligase refers to a class of enzymes that catalyzes this reaction with the hydrolysis of ATP or a similar triphosphate.


Decoding by Sequencing

Some embodiments of the methods and compositions provided herein include decoding the location of microfeatures of an array using high throughput sequencing. In some embodiments, the microfeatures of an array comprise polynucleotides. In some embodiments, the polynucleotides can be randomly distributed on the surface of the substrate. In some embodiments, a polynucleotide can include a primer binding site, and a barcode. In some embodiments, a polynucleotide can include a capture probe, a primer binding site, and a barcode.


Some embodiments to decode the location of polynucleotides in an array can include (a) obtaining a substrate having an array of polynucleotides distributed on a surface of the substrate, wherein each polynucleotide comprises a barcode and a primer binding site; (b) hybridizing a plurality of primers to the primer binding sites; and (c) determining the sequences of the barcodes by extending the hybridized primers. In some such embodiments, the sequence of each barcode can indicate the location of a polynucleotide in the array. For example, in some embodiments an array can be prepared with polynucleotides in which the barcodes are known to be associated with certain capture probes, such that identifying the location of a barcode on an array, can indicate the location of the associated capture probe. In such embodiments, each polynucleotide can be associated with a capture probe through a common element. For example, a polynucleotide and a capture probe can each be bound to the same microfeature, such as a bead. In more such embodiments, each polynucleotide can include a capture probe.


In some embodiments, a barcode can include a nucleic acid sequence that can be used to identify a polynucleotide within an array. The barcode can include a unique nucleotide sequence that is distinguishable from other barcodes. It can also be distinguishable from other nucleotide sequences within the polynucleotides and target nucleic acids by the barcode's sequence, and also by the barcode's location within the polynucleotide, for example by its location 5′ of the primer binding site. For example, in some embodiments, the sequence of a barcode may be present more than once in plurality of nucleic acids, however, the barcode which is located 5′ of the primer binding site can be detected. A barcode can be of any desired sequence length sufficient to be unique nucleotide sequence within a plurality of barcodes in a population and/or within a plurality of polynucleotides and target nucleic acids that are being analyzed or interrogated. In some embodiments, a barcode is a nucleic acid or region within a polynucleotide ranging from about 6-30 nucleotides. A barcode can be, for example, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides, or longer. For example, a barcode can be 35, 40, 45 or 50 nucleotides or longer. Suitable barcodes for some embodiments are disclosed in of U.S. Pat. No. 8,053,192, which is incorporated by reference in its entirety. In some embodiments, a barcode can distinguish a polynucleotide from another polynucleotide in an array, such that each barcode is different from another barcode. In some embodiments, a barcode can distinguish a population of polynucleotides from another population of polynucleotides in an array, such that a set of barcodes is different from another set of barcodes. Some aspects useful with methods and compositions provided herein are disclosed in U.S. 20180334711A1 and U.S. 20190085384A1 which are each incorporated by reference in its entirety. In some embodiments, a barcode can include a unique molecular identifier (UMI).


In some embodiments, the primer binding site can be 3′ of the barcode such that a primer hybridized to the primer binding site can be extended to provide the complement of the barcode. For example, the primer can be extended to obtain the sequence of the barcode. In some embodiments, the primer binding site can be directly adjacent to the barcode in a polynucleotide. In some embodiments, each primer binding site in a population of polynucleotides can have the same sequence. In some embodiments, a subpopulation of polynucleotides can include a primer binding sites with a first sequence, and another subpopulation of polynucleotides can include primer binding sites with a second sequence. In some embodiments, hybridizing different primers to a plurality of different primer binding sites can be simultaneous, sequential, or iterative.


Some embodiments include polynucleotides comprising a capture probe. In some embodiments, the capture probe can include a sequence capable of hybridizing with a target nucleic acid. In some embodiments, a population of polynucleotides can include capture probes that are different from one another. In some embodiments, each capture probe can be different from one another. In some embodiments, the capture probes can be similar to one another, for example they can have similar sequences, and/or similar sequences with similar lengths. In some embodiments, a capture probe can differ from one another capture probe by a number of less than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, or 2 nucleotides, in which the number of nucleotides can be consecutive nucleotides, non-consecutive nucleotides, inserted nucleotides, or deleted nucleotides in a capture probe. In some embodiments a capture probe can differ from one another capture probe by a single nucleotide. In some embodiments, the primer binding site and barcode can be 5′ of the capture probe. In some embodiments, the primer binding site and barcode can be 3′ of the capture probe.


Some embodiments include polynucleotides comprising cleavable linkers. In some such embodiments, the cleavable linker can be located such that cleaving the linker can separate the capture probe from the primer binding site and the barcode. In some embodiments, the cleavable linker can be located within the polynucleotide between the capture probe, and the primer binding site and the barcode. In some embodiments, a cleavable linker can remove the polynucleotide comprising the primer binding site and the barcode linked to a capture probe. For example, the polynucleotide and capture probe can both be bound to a bead. Cleavage of the cleavable linker can remove the polynucleotide comprising the primer binding site and the barcode from the bead.


In some embodiments, a cleavable linker can have a length corresponding to at least 2, 3, 5, 10, 15, 20, 25, 30, 50, 100, 500 nucleotides, or a length within a range of any two of the foregoing lengths. In some embodiments, a cleavable linker is susceptible to cleavage with agents such as light, base, acid and/or an enzyme such as a sequence specific restriction enzyme or a protease. A cleavable linker can include a certain sequence of nucleotides, such as the recognition site of an enzyme, and/or can include certain modified nucleotides susceptible to cleavage with an agent. In some embodiments, a cleavable linker can include uracil, which is cleavable by an exogenous base cleaving agent such as DNA glycosylase (UDG). In some embodiments, a cleavable linker can include 8-hydroxyguanine which can be cleaved by 8-hydroxyguanine DNA glycosylase (FPG protein). More examples of cleavable linkers are disclosed in in U.S. Pat. App. Pub. 2005/0181394, which is incorporated by reference in its entirety.


In some embodiments, the polynucleotides are attached to a substrate. In some embodiments, a substrate can include a bead. Polynucleotides can be immobilized to a substrate, such as bead or other surface by single point covalent attachment to the surface of the substrate at or near the 5′ end or the 3′ end of the polynucleotide. In some embodiments, a polynucleotide can include a spacer which is attached to the substrate. In some embodiments, a spacer can have a length corresponding to at least 2, 3, 5, 10, 15, 20, 25, 30, 50, 100, 500 nucleotides, or a length within a range of any two of the foregoing lengths. Any suitable covalent attachment means known in the art may be used for this purpose. The chosen attachment chemistry will depend on the nature of the solid support, and any derivatization or functionalization applied to it. The polynucleotide may include a moiety, which may be a non-nucleotide chemical modification, to facilitate attachment. In some embodiments, the polynucleotide may include a sulfur-containing nucleophile, such as phosphorothioate or thiophosphate, for example, located at the 5′ end. In the case of solid-supported polyacrylamide hydrogels, this nucleophile will bind to a bromoacetamide group present in the hydrogel. An example means of attaching polynucleotide to a solid support is via 5′ phosphorothioate attachment to a hydrogel comprised of polymerized acrylamide and N-(5-bromoacetamidylpentyl) acrylamide (BRAPA), which is disclosed in U.S. Pat. No. 8,168,388 which is incorporated by reference in its entirety.


In some embodiments, the location of polynucleotides in the array can be decoded by sequencing barcodes of the polynucleotides. For example, a sequence of a barcode can be associated with a site on an array, and the site can be associated with a particular capture probe. Some embodiments include Next Generation Sequencing (NGS) which can refer to sequencing methods that allow for massively parallel sequencing of clonally amplified molecules and of single nucleic acid molecules. Examples of NGS include sequencing-by-synthesis (SBS) using reversible dye terminators, and sequencing-by-ligation. In SBS, extension of a nucleic acid primer along a nucleic acid template is monitored to determine the sequence of nucleotides in the template. The underlying chemical process can be polymerization. In a particular polymerase-based SBS embodiment, fluorescently labeled nucleotides are added to extend a primer in a template dependent fashion such that detection of the order and type of nucleotides added to the primer can be used to determine the sequence of the template.


One or more amplified nucleic acids can be subjected to an SBS or other detection technique that involves repeated delivery of reagents in cycles. For example, to initiate a first SBS cycle, one or more labeled nucleotides, DNA polymerase, etc., can be flowed into/through a hydrogel bead that houses one or more amplified nucleic acid molecules. Those sites where primer extension causes a labeled nucleotide to be incorporated can be detected. Optionally, the nucleotides can further include a reversible termination property that terminates further primer extension once a nucleotide has been added to a primer. For example, a nucleotide analog having a reversible terminator moiety can be added to a primer such that subsequent extension cannot occur until a deblocking agent is delivered to remove the moiety. Thus, for embodiments that use reversible termination, a deblocking reagent can be delivered to the flow cell before or after detection occurs. Washes can be carried out between the various delivery steps. The cycle can then be repeated n times to extend the primer by n nucleotides, thereby detecting a sequence of length n.


Some SBS embodiments include detection of a proton released upon incorporation of a nucleotide into an extension product. For example, sequencing based on detection of released protons can use an electrical detector and associated techniques that are commercially available. Examples of such sequencing systems are pyrosequencing such as a commercially available platform from 454 Life Sciences a subsidiary of Roche; sequencing using γ-phosphate-labeled nucleotides, such as a commercially available platform from Pacific Biosciences; and sequencing using proton detection, such as a commercially available platform from Ion Torrent subsidiary of Life Technologies. Some embodiments include pyrosequencing which is described in U.S. Pat. App. Pub. 2005/0130173 and 2006/0134633 and U.S. Pat. Nos. 4,971,903; 6,258,568 and 6,210,891 which are each incorporated by reference in its entirety. Some embodiments include sequencing by ligation which is disclosed in U.S. Pat. Nos. 5,599,675; and 5,750,341, each of which is incorporated by reference in its entirety.


Some embodiments can utilize methods involving the real-time monitoring of DNA polymerase activity. For example, nucleotide incorporations can be detected through fluorescence resonance energy transfer (FRET) interactions between a fluorophore-bearing polymerase and γ-phosphate-labeled nucleotides, or with zero mode waveguides (ZMWs). Another useful sequencing technique is nanopore sequencing. In some nanopore embodiments, the target nucleic acid or individual nucleotides removed from a target nucleic acid pass through a nanopore. As the nucleic acid or nucleotide passes through the nanopore, each nucleotide type can be identified by measuring fluctuations in the electrical conductance of the pore.


As shown in FIG. 1, in some embodiments a microfeature of an array can include a polynucleotide attached to a bead 10 via a 5′ linker 20. The polynucleotide can include a bar code 30, a primer binding site 40, and a capture probe 50. A primer 60 can hybridize to the primer binding site and be extended to obtain the sequence of the barcode to decode the location of the microfeature in the array. In some embodiments, the capture probe can hybridize to target nucleic acids and the capture probe can be extended, for example by a polymerase or by a ligase. In some embodiments, the capture probe can participate in bridge amplification. Methods of bridge amplification are disclosed in U.S. Pat. Nos. 7,985,565 and 7,115,400, which are each incorporated by reference in its entirety.


As shown in FIG. 2, in some embodiments a microfeature of an array can include a polynucleotide comprising a capture probe 50, barcode 30, and primer binding site 40, in which the polynucleotide is attached to a bead 10 via a 5′ linker 20. In some embodiments, the polynucleotide can include a cleavable linker 70 between the capture probe and barcode. In some embodiments, a primer 60 can be hybridized to the primer binding site, and the sequence of the barcode determined, thereby decoding the location of the microfeature in the array. In some embodiments, the polynucleotide can be cleaved, and the barcode and primer binding site removed from the microfeature comprising the bead and capture probe. Some such embodiments provide decoded arrays before hybridizing target nucleic acids with the capture probes. In some embodiments, a target nucleic acid can be hybridized to the capture probe at the decoded location in the array. In some embodiments, the hybridized capture probe can be extended, for example by a polymerase or by a ligase. In some embodiments, the capture probe can participate in bridge amplification.


As shown in FIG. 3, in some embodiments a microfeature of an array can include a polynucleotide comprising a barcode 30, primer binding site 40, and capture probe 50 attached to a bead 10 via a 3′ linker 25. In some embodiments, the primer binding site can abut a linker attached to the bead. Some embodiments can include the use of such microfeatures in assays to screen for and develop certain polymerases. For example, the activity of polymerases can be screened for primer sites attached to beads without spacers, compared to primer sites attached to beads with spacers. In some embodiments, a target nucleic acid can be hybridized to the capture probe. In some embodiments, the hybridized target nucleic acids extended, for example by a polymerase or by a ligase.


As shown in FIG. 4, embodiments of a microfeature of an array can include a polynucleotide comprising a spacer 80, barcode 30, primer binding site 40, and capture probe 50 attached to a bead 10 via a 5′ linker 20.


Sequencing Multiple Target Nucleic Acids

Some embodiments include sequencing a plurality of target nucleic acids. In some embodiments, the target nucleic acids can be derived from different sources, such as different subjects, for example genomic DNA from different subjects. In some embodiments, different target nucleic acids can be associated with different indexes, such that an index can identify a particular population of target nucleic acids, such as a population derived from a single source.


Some embodiments include obtaining at least a first and a second subpopulation of beads, wherein each bead comprises a first polynucleotide comprising a capture probe, a barcode indicative of the capture probe of the same bead, and a barcode primer binding site 3′ of the barcode, and a second polynucleotide comprising an index and an index primer binding site 3′ of the index, wherein the indexes of the first subpopulation are different from the indexes of the second subpopulation.


In some embodiments, the nucleotide sequences of the indexes of the first subpopulation of beads comprise the same nucleotide sequence, and the nucleotide sequences of the indexes of the second subpopulation of beads comprise the same nucleotide sequence. In some embodiments, the nucleotide sequences of the index primer binding sites comprise the same nucleotide sequence.


In some embodiments, the capture probes of the first and/or the second subpopulations of beads each comprise different nucleotide sequences from one another. For example, the capture probes of the first subpopulation can be different from one another; and/or the capture probes of the first subpopulation can be different from one another. In some embodiments, the capture probes of the first subpopulations of beads each comprise capture probes having the same nucleotide sequences as the capture probes of the second subpopulations of beads. In some embodiments, a capture probe can comprise a nucleotide sequence capable of hybridizing to a single nucleotide polymorphism (SNP) or complement thereof. In some embodiments, the barcode primer binding sites comprise the same nucleotide sequence.


Some embodiments also include hybridizing first target nucleic acids to the capture probes of the first subpopulation of beads, and hybridizing second target nucleic acids to the capture probes of the second subpopulation of beads. In some such embodiments, the hybridizing first target nucleic acids to the capture probes of the first subpopulation of beads, and the hybridizing second target nucleic acids to the capture probes of the second subpopulation of beads are performed at different locations. For example, the different locations comprise different reaction volumes, such as different wells, such as different wells in a multiwall plate. For example, in a 96 well plate, 96 different subpopulations of beads could be hybridized with 96 different target nucleic acids, in which each different subpopulation of beads is hybridized to a different target nucleic acid in a different well.


In some embodiments, the different subpopulations of beads comprising the hybridized capture probes and target nucleic acids are distributed on a substrate, such as a planar substrate. In some embodiments, the distributed subpopulations of beads comprise an array. In some embodiments, the substrate comprises a plurality of discrete sites. In some embodiments, the substrate comprises a plurality of wells. In some embodiments, the substrate comprises a plurality of channels. In some embodiments, a flowcell comprises the substrate.


In some embodiments, the different subpopulations of beads comprising the hybridized capture probes and target nucleic acids are combined prior to being distributed on the substrate. In some embodiments, the different subpopulations of beads comprising the hybridized capture probes and target nucleic acids are distributed on the substrate sequentially. For example, a first subpopulation of beads comprising the hybridized capture probes and target nucleic acids is distributed on the substrate before a second subpopulation of beads comprising hybridized capture probes and target nucleic acids is distributed on the substrate.


In some embodiments, the hybridized capture probes are extended. In some embodiments, the hybridized capture probes are extended before the different subpopulations of beads comprising the hybridized capture probes and target nucleic acids are distributed on the substrate. In some embodiments, the hybridized capture probes are extended after the beads comprising the hybridized capture probes and target nucleic acids are distributed on the substrate. In some embodiments, extending the hybridized capture probes can include polymerase extension. In some embodiments, extending the hybridized capture probes can include ligase-based extension, such as ligation of an extension probe to a capture probe in the presence of a ligase. In some embodiments, the extension step can add a detectable marker to the extended capture probe. In some embodiments, the detectable marker can include a fluorescent marker.


Some embodiments include decoding the beads on the substrate. Some embodiments include decoding the beads by identifying the locations of extended capture probes, barcodes, and indexes on the substrate. In some embodiments, the presence of a particular barcode at a location on the substrate is indicative of a particular capture probe at the location. In some embodiments, the presence of a certain index at a location on the substrate is indicative that a target nucleic acid of a certain subpopulation of target nucleic acids is associated with the location on the substrate. In some embodiments, the presence of an extended capture probe at a location on the substrate is indicative of the presence of a certain target nucleic acid in the subpopulations of target nucleic acids. In some embodiments, the identity of a barcode, the identity of an index, and the presence of an extended capture probe at a single location on the surface is indicative of the presence of a specific target nucleic in a specific subpopulation of target nucleic acids.


In some embodiments, decoding the beads on the substrate includes detecting the locations of the hybridized capture probes or the extended capture probes. In some embodiments, detecting the location of the hybridized capture probes or the extended capture probes includes extending the hybridized capture probes with a detectable marker. In some embodiments, detecting the location of the hybridized capture probes or the extended capture probes comprises at least one cycle of sequencing by synthesis.


In some embodiments, decoding the beads on the substrate includes decoding the location of the indexes of the beads comprising an extended capture probe. Some embodiments include hybridizing a plurality of index primers to the index primer sites, and extending the hybridized index primers. In some embodiments, extending the hybridized index primers comprises at least one cycle of sequencing by synthesis. In some embodiments, decoding the location of the indexes of the beads comprises sequencing the indexes on the substrate.


In some embodiments, decoding the beads on the substrate includes decoding the location of the barcodes of the beads comprising an extended capture probe. Some embodiments include hybridizing a plurality of barcode primers to the barcode primer sites, and extending the hybridized barcode primers. In some embodiments, decoding the beads on the substrate includes extending the hybridized barcode primers comprises at least one cycle of sequencing by synthesis. In some embodiments, decoding the beads on the substrate includes decoding the location of the barcodes of the beads comprises sequencing the barcodes on the substrate.


In some embodiments, the first and second subpopulations of beads each comprises at least 50 capture probes comprising different nucleotide sequences. In some embodiments, the first and second subpopulations of beads each comprises at least 100, 200, 300, 400, or 500 or more capture probes comprising different nucleotide sequences. In some embodiments, the first and second subpopulations of beads each comprises at least 5000 capture probes comprising different nucleotide sequences. In some embodiments, the first and second subpopulations of beads each comprises at least 50,000 capture probes comprising different nucleotide sequences.


Some embodiments include at least 10 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation. Some embodiments include at least 100 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation. Some embodiments include at least 1000 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation. Some embodiments include at least 10,000 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation.


Aspects of some embodiments are shown in FIG. 5A-FIG. 5E. As shown in FIG. 5A, a subpopulation of beads includes a bead with an attached first polynucleotide comprising a barcode, a barcode primer binding site, and a capture probe, and with an attached second polynucleotide comprising an index and an index primer binding site. Different subpopulations of beads can include different indexes. Different subpopulations of beads, each subpopulation having a particular index can be distributed into wells of a 96 well plate, such that each well contains a single subpopulation of beads.


As shown in FIG. 5B, a target nucleic acid from a population of target nucleic acids is hybridized to the capture probe, in which the target nucleic acid contains a SNP. In some embodiments, a subpopulation of target nucleic acids is added to each well containing a subpopulation of beads. Each subpopulation of target nucleic acids can be derived from a different source, such as a different subject. For example, a subpopulation of target nucleic acids can be obtained by preparing a library of nucleic acids from a single source of nucleic acids, such as a single sample of nucleic acids from a subject. In some embodiments, the target nucleic acids do not require an adaptor to be sequenced.


As shown in FIG. 5C, a capture probe that is hybridized to a target nucleic acid containing a SNP can be extended. In some embodiments, the extension can be performed by executed by adding an incorporation mix that includes fluorescent nucleotides and an appropriate polymerase. In some embodiments, the extension is a single base extension.


In some embodiments, all beads are combined and distributed on to the surface of a flowcell. The flowcell can have a patterned surface, and the surface can be modified to support the immobilization of beads at a desired density.


In some embodiments, an SNP readout is performed. In some embodiments, a scan cycle is performed to read a signal from incorporation at the SNP site of the capture probe. In some embodiments, this cycle can include at least one cycle of sequence by synthesis. In some embodiments, the 3′ ends of the extended capture probes are cleaved and blocked.


As shown in FIG. 5D, a barcode primer is hybridized to the barcode primer binding site, and the barcode primer is extended. In some embodiments, extension of the barcode primer includes a barcode readout. In some embodiments, the extension can include at least one cycle of sequence by synthesis. In some embodiments, the number of cycles of sequence by synthesis can depend of the number of different barcodes in a subpopulation of beads. In some embodiments, the extension can include 12-20 sequence by synthesis cycles. In some embodiments, the extension identifies the capture probe and position of a specific bead within the flowcell.


As shown in FIG. 5E, an index primer is hybridized to the index primer binding site, and the index primer is extended. In some embodiments, extension of the index primer includes an index readout. In some embodiments, the extension can include at least one cycle of sequence by synthesis. In some embodiments, the number of cycles of sequence by synthesis can depend of the number of different indexes in a plurality of subpopulations of beads. In some embodiments, the extension can include 6-12 sequence by synthesis cycles.


Certain Methods of Detecting Target Ligands

Some embodiments include methods of detecting target ligands. In some embodiments, the target ligands can include nucleic acids, proteins, or other antigens. In some embodiments, the target ligands are obtained from different sources, for example, from different samples, different individual subjects, or different populations of subjects.


In some embodiments, methods of detecting target ligands can include obtaining a population of beads, wherein each bead comprises a capture probe which specifically binds to a target ligand. For example, a capture probe can include a nucleic acid, an antibody, or an antigen-binding fragment of an antibody. In some embodiments, a bead includes a first polynucleotide comprising a barcode indicative of the capture probe of the same bead, and a barcode primer binding site 3′ of the barcode. In some embodiments, each bead also includes a second polynucleotide comprising an index and an index primer binding site 3′ of the index. In some embodiments, the population of beads includes first and second subpopulations of beads. In some embodiments, the indexes of the first subpopulation of beads are different from the indexes of the second subpopulation of beads. For example, the indexes of the first subpopulation of beads can be used to identify the first subpopulation of beads from the indexes of the second subpopulation of beads.


Some embodiments include contacting first target ligands to the capture probes of the first subpopulation of beads, and contacting second target ligands to the capture probes of the second subpopulation of beads. For example, first target ligands can be obtained from a first sample of ligands, and second target ligands can be obtained from a second sample of ligands. Some embodiments also include distributing the first and second subpopulations of beads comprising the specifically bound first and second target ligands on a substrate. Some embodiments also include detecting the capture probes specifically bound to the first and second target ligands distributed on the substrate. Some embodiments also include decoding the locations of beads comprising the detected capture probes on the substrate.


In some embodiments, the capture probe comprises a nucleic acid and the target ligands comprise nucleic acids. In some embodiments, the first polynucleotide comprises the capture probe. In other embodiments, the capture probe is distinct from the first polynucleotide. In some embodiments, the capture probes of a subpopulation of beads comprises different nucleotide sequences from one another. In some embodiments, different subpopulations of beads can include the same different capture probes. In some embodiments, the capture probes comprise a nucleotide sequence capable of hybridizing to a single nucleotide polymorphism (SNP) or complement thereof.


In some such embodiments, detecting the capture probes specifically bound to the target ligands includes extending the capture probes specifically bound to the target ligands. In some embodiments, the extending can include polymerase extension, and/or ligase extension. In some embodiments the extension include a detectable nucleotide, such as a fluorescently-labeled nucleotide. In some embodiments, the extension include single nucleotide extension of the capture probes. In some embodiments, the extension includes extension of the capture probes with a plurality of nucleotides.


In some such embodiments, the capture probe comprises an antibody or antigen binding fragment thereof. In some embodiments, the capture probes of a subpopulation of beads specifically bind to different target ligands from one another. In some embodiments, different subpopulations of beads can include the same different capture probes. In some such embodiments, the different capture probes of a subpopulation of beads specifically bind to the same target ligands.


In some embodiments, detecting the capture probes specifically bound to the target ligands includes an immunoassay. For example, target ligands specifically bound to the capture probes are contacted with a secondary antibody or antigen-binding fragment thereof, wherein the secondary antibody or antigen-binding fragment thereof comprises a detectable label, such as a fluorescent label.


In some embodiments, the barcode primer binding sites comprise the same nucleotide sequence.


In some embodiments, the nucleotide sequences of the indexes of a subpopulation of beads comprise the same nucleotide sequence and can distinguish a subpopulation of beads from another subpopulation of beads. For example, the nucleotide sequences of the indexes of a first subpopulation of beads, and the nucleotide sequences of the indexes of the second subpopulation of beads comprise the same nucleotide sequence.


In some embodiments, the nucleotide sequences of the index primer binding sites comprise the same nucleotide sequence.


In some embodiments, contacting the first target ligand to the capture probes of the first subpopulation of beads, and contacting the second target ligands to the capture probes of the second subpopulation of beads are performed at different locations. For example, the different locations may comprise different reaction volumes, such as different volumes in different wells of a microtiter plate.


Some embodiments also include combining the first and second subpopulations of beads prior to distributing the first and second subpopulations of beads on the substrate. In other embodiments, the first subpopulation of beads is distributed on the substrate before the second subpopulation of beads is distributed on the substrate. In some embodiments, the subpopulations of beads are distributed on the substrate before the capture probe specifically bound to the ligand is detected.


In some embodiments, detecting the capture probes specifically bound to the target ligands can also include determining the location of the capture probes specifically bound to the target ligands on the substrate.


In some embodiments, decoding the location of the detected capture probes comprises decoding the location of the indexes of the beads comprising a detected capture probe. Some embodiments also include hybridizing a plurality of index primers to the index primer sites, and extending the hybridized index primers. In some embodiments, also include extending the hybridized index primers comprises at least one cycle of sequencing by synthesis. In some embodiments, decoding the location of the indexes of the beads comprises sequencing the indexes on the substrate.


In some embodiments, decoding the location of the detected capture probes comprises decoding the location of the barcodes of the beads comprising a detected capture probe. Some such embodiments include hybridizing a plurality of barcode primers to the barcode primer sites, and extending the hybridized barcode primers. In some embodiments, extending the hybridized barcode primers comprises at least one cycle of sequencing by synthesis. In some embodiments, decoding the location of the barcodes of the beads comprises sequencing the barcodes on the substrate.


In some embodiments, the substrate comprises a plurality of discrete sites. In some embodiments, the substrate comprises a plurality of wells. In some embodiments, the substrate comprises a plurality of channels. In some embodiments, a flowcell comprises the substrate. In some embodiments, the distributed first and second subpopulations of beads comprise an array.


In some embodiments, the first and second subpopulations of beads each comprises at least 50, 100, 500, 1000, or 5000 capture probes different from one another, or any number of capture probes between any two of the foregoing numbers. Some embodiments also include at least 5, 10, 20, 50, 100, 200, 500, 1000 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation, or any number of different subpopulations of beads between any two of the foregoing numbers.


An embodiment is depicted in FIG. 6A, left panel which includes a bead 200 having a polynucleotide 210 attached to the bead. The polynucleotide includes a capture probe which is hybridized to the target nucleic acid 220. The capture probe is extended with a nucleotide comprising a detectable label 230. In some embodiments, the polynucleotide can include a barcode indicative of the capture probe, and a barcode primer binding site useful to sequence and identify the barcode. In some embodiments, the polynucleotide can also include an index indicative of a subpopulation of beads from another subpopulation of beads, and an index primer binding site useful to sequence and identify the index. In some embodiments, the bead can be distributed in an array on a substrate, and decoded. Decoding can include determining the location of the detectable label on the array; determining the barcode attached to the bead on the array; and/or determining the index attached to the bead on the array.


An embodiment is depicted in FIG. 6A, middle panel which includes a bead 200 with a capture probe 240 attached to the bead. The capture probe is hybridized to the target nucleic acid 220. A first polynucleotide 260 comprising a barcode indicative of the capture probe, and a barcode primer binding site useful to sequence and identify the barcode is also attached to the bead. A second polynucleotide 250 comprising an index indicative of a subpopulation of beads from another subpopulation of beads is also attached to the bead. The capture probe is extended with a nucleotide comprising a detectable label 230. In some embodiments, the bead can be distributed in an array on a substrate, and decoded. Decoding can include determining the location of the detectable label on the array; determining the barcode attached to the bead on the array; and/or determining the index attached to the bead on the array.


An embodiment is depicted in FIG. 6A, right panel which includes a bead 200 with a capture probe 270 attached to the bead in which the capture probe is an antibody, or antigen-binding fragment of an antibody. The capture probe is specifically bound to a ligand 280. The ligand is also bound with a secondary antibody 290 which includes a detectable label 230. A first polynucleotide 260 comprising a barcode indicative of the capture probe, and a barcode primer binding site useful to sequence and identify the barcode is also attached to the bead. A second polynucleotide 250 comprising an index indicative of a subpopulation of beads from another subpopulation of beads is also attached to the bead. In some embodiments, the bead can be distributed in an array on a substrate, and decoded. Decoding can include determining the location of the detectable label on the array; determining the barcode attached to the bead on the array; and/or determining the index attached to the bead on the array.


An embodiment is depicted in FIG. 6B, right panel which includes a bead 200 with a capture probe 330 comprising a protein attached to the bead. In some embodiments, the bead can be distributed in an array on a substrate. A substrate 340 for the protein contact the protein to generate a signal comprising a detectable label 230. The location of the signal on the array can be determined. In some embodiments, the bead can include a first polynucleotide comprising a barcode indicative of the capture probe, and a barcode primer binding site useful to sequence and identify the barcode is also attached to the bead. In some embodiments, the bead can include a second polynucleotide comprising an index indicative of a subpopulation of beads from another subpopulation of beads is also attached to the bead. In some embodiments on the bead is decoded on the array. Decoding can include determining the location of the detectable label on the array; determining the barcode attached to the bead on the array; and/or determining the index attached to the bead on the array.


Some embodiments include methods of detecting a target ligand on an array, comprising: (a) obtaining a first and a second population of beads, wherein each bead comprises: a capture probe, wherein the capture probe is capable of specifically binding to a target ligand, a nucleic acid encoding a barcode and a barcode primer binding site, wherein the barcode is indicative of the capture probe, and a nucleic acid encoding an index and an index primer binding site, wherein the index is indicative of the source of the bead from the first population or the second population, and (b) contacting the first population of beads with a first sample comprising a first target ligand, wherein the first target ligand specifically binds to a capture probe of the first population of beads, and thereby obtaining a target-bound first population of beads; (c) contacting the second population of beads with a second sample comprising a second target ligand, wherein the second target ligand specifically binds to a capture probe of the second population of beads, and thereby obtaining a target-bound second population of beads; (d) randomly distributing the target-bound first population of beads and the target-bound second population of beads on an array; (e) detecting the location of the beads comprising the first target ligand and the second target ligand on the array; and (f) determining the sequence of the index and of the barcode of the beads comprising the first target ligand and the second target ligand on the array. In some embodiments, the capture probe comprises a polynucleotide. In some embodiments, the target ligand comprises a nucleic acid. In some embodiments, detecting the location of the beads comprising the first target ligand and the second target ligand on the array comprises extending the capture probe by polymerase extension or by ligation. In some embodiments, the capture probe comprises a protein. In some embodiments, step (e) is performed after step (f). In some embodiments, the barcodes of the first population of beads comprise barcodes different from one another, and the barcodes of the second population of beads comprise barcodes different from one another. In some embodiments, the indexes of the first population of beads are the same as one another, and the indexes of the second population of beads are the same as one another. In some embodiments, the array is located on the surface of a flowcell. In some embodiments, the first and a second population of beads are adapted to be attached to the array. In some embodiments, the first and a second population of beads comprise biotin, streptavidin, or a derivative thereof; and the array comprises biotin, streptavidin, or a derivative thereof. In some embodiments, the first and a second population of beads are magnetic.


Sequencing and Analysis of Target Nucleic Acids

Some embodiments include sequencing and/or analysis of target nucleic acids. Some embodiments include decoding the locations of polynucleotides in an array according to methods provided herein; hybridizing target nucleic acid to capture probes; extending the capture probes; and detecting the extension of the capture probes hybridized to the target nucleic acid at a location on the array. In some embodiments, the locations of the polynucleotides on an array can be decoded before hybridizing target nucleic acid to the polynucleotides. In some embodiments, the locations of the polynucleotides on an array can be decoded after detecting the extension of the capture probes hybridized to the target nucleic acid. In some such embodiments, each polynucleotide can be associated with a capture probe through a common element. For example, a polynucleotide and a capture probe can each be bound to the same microfeature, such as a bead. In more such embodiments, each polynucleotide can include the capture probe.


Some embodiments include single base extension (SBE) of capture probes. In some embodiments, SBE can be used for detection of an allele, mutations or other features in target nucleic acids. Briefly, SBE utilizes a capture probe that hybridizes to a target genome fragment at a location that is proximal or adjacent to a detection position, the detection position being indicative of a particular locus. A polymerase can be used to extend the 3′ end of the capture probe with a nucleotide analog labeled with a detection label. Based on the fidelity of the enzyme, a nucleotide is only incorporated into the capture probe if it is complementary to the detection position in the target nucleic acid. If desired, the nucleotide can be derivatized such that no further extensions can occur using a blocking group, including reversible blocking groups, and thus only a single nucleotide is added. The presence of the labeled nucleotide in the extended capture probe can be detected for example, at a particular location in an array and the added nucleotide identified to determine the identity of the locus or allele. SBE can be carried out under known conditions such as those described in U.S. Pat. Nos. 9,441,267 and 9,045,796 each of which is incorporated by reference in its entirety.


Some embodiments include allele specific primer extension (ASPE). In some embodiments ASPE can include extension of capture probes that differ in nucleotide composition at their 3′ end. An ASPE method can be performed using a nucleoside or nucleotide containing a cleavable linker, so that a label can be removed after a probe is detected. This allows further use of the probes or verification that the signal detected was due to the label that has now been removed. Briefly, ASPE can be carried out by hybridizing a target nucleic acid to a capture probe having a 3′ sequence portion that is complementary to a detection position and a 5′ portion that is complementary to a sequence that is adjacent to the detection position. Template directed modification of the 3′ portion of the capture probe, for example, by addition of a labeled nucleotide by a polymerase yields a labeled extension product, but only if the template includes the target sequence. The presence of such a labeled primer-extension product can then be detected, for example, based on its location in an array to indicate the presence of a particular allele. In some embodiments, ASPE can be carried out with multiple capture probes that have similar 5′ ends such that they anneal adjacent to the same detection position in a target nucleic acid but different 3′ ends, such that only capture probes having a 3′ end that complements the detection position are modified by a polymerase. A capture probe having a 3′ terminal base that is complementary to a particular detection position is referred to as a perfect match (PM) probe for the position, whereas capture probes that have a 3′ terminal mismatch base and are not capable of being extended in an ASPE reaction are mismatch (MM) probes for the position. The presence of the labeled nucleotide in the PM probe can be detected and the 3′ sequence of the capture probe determined to identify a particular allele at the detection position.


Some embodiments include methods for decoding the locations of polynucleotides in an array. Some such methods include (a) obtaining a substrate having an array of polynucleotides distributed on a surface of the substrate, wherein each polynucleotide comprises a primer binding site 3′ of a barcode, wherein each polynucleotide is linked to a capture probe; (b) hybridizing a plurality of primers to the primer binding sites; and (c) determining the sequences of the barcodes by extending the hybridized primers, wherein the sequence of each barcode is indicative of the location of a polynucleotide in the array. In some embodiments, each polynucleotide is linked to a capture probe via a bead. In some embodiments, each polynucleotide comprises the capture probe. In some embodiments, the capture probe is 3′ of the primer binding site. In some embodiments, the capture probe is 5′ of the barcode. In some embodiments, a capture probe comprises a different sequence from another capture probe. In some embodiments, a capture probe is different from another capture probe by less than 5 different nucleotides. In some embodiments, each capture probe comprises a different sequence. In some embodiments, the polynucleotides are randomly distributed on the surface of the substrate. In some embodiments, a barcode comprises a different sequence from another barcode. In some embodiments, each barcode comprises a different sequence. In some embodiments, each primer binding site comprises the same sequence. In some embodiments, the polynucleotides are attached to beads. In some embodiments, the beads are distributed in the wells. In some embodiments, each polynucleotide comprises a cleavable linker. In some embodiments, the cleavable linker is adapted to remove the capture probe from the primer binding site and the barcode. In some embodiments, the substrate comprises wells. In some embodiments, each polynucleotide comprises a spacer. In some embodiments, the spacer is attached to the substrate. In some embodiments, the spacer is attached to a bead. Some embodiments also include hybridizing a target nucleic acid to the capture probes. In some embodiments, the hybridizing a target nucleic acid to the capture probes is performed after the determining the sequences of the barcodes. In some embodiments, the hybridizing a target nucleic acid to the capture probes is performed before the determining the sequences of the barcodes. Some embodiments also include extending the hybridized target nucleic acid, or the polynucleotide. In some embodiments, the extending comprises ligation. Some embodiments also include amplifying the target nucleic acid.


Some embodiments include methods of sequencing a target nucleic acid. Some such methods (a) decoding the locations of polynucleotides in an array according to any one of the foregoing methods; (b) hybridizing the target nucleic acid to the capture probes; (c) extending the capture probes hybridized to the target nucleic acid; and (d) detecting the location of the extended capture probes. In some embodiments, (d) detecting the location of the extended capture probes is performed before (a) decoding the locations of polynucleotides in an array. In some embodiments, the capture probes are extended by a ligase. In some embodiments, the capture probes are extended by a polymerase. In some embodiments, the capture probes are extended by the addition of a single nucleotide. Some embodiments also include cleaving the primer binding sites and the barcodes from the capture probes before hybridizing the target nucleic acid to the capture probes. Some embodiments also include cleaving the primer binding sites and the barcodes from the capture probes after hybridizing the target nucleic acid to the capture probes.


Some embodiments include sequencing target polynucleotides derived from different biological sources on an array. In some such embodiments, polynucleotides comprising target nucleic acids can be prepared from a nucleic acid sample. Examples of polynucleotide samples include genomic DNA samples, cDNA samples, RNA samples, and amplicons from individuals. A polynucleotide can include a target nucleic acid, an index, and an index primer binding site adjacent to the index. The index can be used to indicate the source of a target nucleic acid as a certain nucleic acid sample. For example, polynucleotides prepared from different nucleic acid samples can include different indexes. Each index may have a position designed to bind a particular amplification primer. This index primer binding site can be used to sequence an index by extending a primer hybridized to the index primer binding site. In some embodiments, an index is a nucleic acid or region within a polynucleotide ranging from about 3-30 consecutive nucleotides. An index can be, for example, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides, or longer. Some aspects useful with methods and compositions provided herein are disclosed in U.S. 20180334711A1 and U.S. 20190085384A1 which are each incorporated by reference in its entirety. In some embodiments, a barcode or an index can include a unique molecular identifier (UMI).


Polynucleotides can be prepared by a variety of methods. In some embodiments, a nucleic acid sample comprising target nucleic acids can be tagmented with transposomes to obtain nucleic acid fragments with ends containing sequences from the transposomes. In some embodiments, the transposomes can include indexes and index primer binding sites, such that tagmentation adds the indexes and index primer binding sites to the nucleic acid fragments. For example, an input nucleic acid comprising a target nucleic acid can be contacted with a plurality of transposomes. The transposomes can fragment the input nucleic acid and attach adaptors to the ends of the nucleic acid fragments. Examples of tagmentation reactions are disclosed in U.S. Pat. No. 9,040,256, which is incorporated by reference in its entirety.


In some embodiments, polynucleotides comprising indexes can be prepared by adding adaptors to the ends of nucleic acid fragments comprising target nucleic acids, the adaptors can include the indexes and index primer binding sites. In some embodiments, polynucleotides comprising indexes can be prepared by amplifying nucleic acid fragments comprising target nucleic acids with primers comprising the indexes and index primer binding sites, such that the amplification products include the indexes and index primer binding sites.


In some embodiments, the target nucleic acids of the polynucleotides are hybridized to capture probes. In some embodiments, the capture probes are attached to beads. In some embodiments oligonucleotides comprise the capture probes. An oligonucleotide can include a capture probe, a barcode, and a barcode primer binding site. In some embodiments, the barcode can be sequenced by extending a primer hybridized to the primer binding site. In some embodiments, a barcode can include a nucleic acid sequence that can be used to identify a polynucleotide, such as a capture probe, within an array. The barcode can include a unique nucleotide sequence that is distinguishable from other barcodes. It can also be distinguishable from other nucleotide sequences within the polynucleotides and target nucleic acids by the barcode's sequence, and also by the barcode's location within the polynucleotide, for example by its location adjacent to the barcode primer binding site. For example, in some embodiments, the sequence of a barcode may be present more than once in plurality of nucleic acids, however, the barcode which is adjacent to the barcode primer binding site can be detected. A barcode can be of any desired sequence length sufficient to be unique nucleotide sequence within a plurality of barcodes in a population and/or within a plurality of polynucleotides and target nucleic acids that are being analyzed or interrogated. In some embodiments, a barcode is a nucleic acid or region within a polynucleotide ranging from about 6-30 consecutive nucleotides. A barcode can be, for example, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides, or longer. Suitable barcodes for some embodiments are disclosed in of U.S. Pat. No. 8,053,192, which is incorporated by reference in its entirety. In some embodiments, a barcode can distinguish a polynucleotide from another polynucleotide in an array, such that each barcode is different from another barcode. In some embodiments, a barcode can be used to identify the location of a bead in an array. In some embodiments, a bar can be used to identify a capture probe.


In some embodiments, the target nucleic acids of the polynucleotides are hybridized to capture probes to obtain hybridized beads. In some embodiments the hybridized beads can include an oligonucleotide containing a barcode, a barcode primer binding site, and a capture probe, the capture probe can be hybridized to a target nucleic acid of a polynucleotide, and the polynucleotide can include an index and an index primer binding site. The hybridized beads can be randomly distributed on to an array.


In some embodiments, sequence information of the target nucleic acid can be obtained by extending the capture probe. In some embodiments, the capture probe can be extended to include sequences complementary to the target nucleic acid. In some such embodiments, the extension can include polymerase extension. In some embodiments, the extension is a single base extension (SBE). In some embodiments, SBE can be used for detection of an allele, mutations or other features in target nucleic acids.


In some embodiments, the capture probe can be extended by ligation. For example, a locus specific oligonucleotide can hybridize to the target nucleic acid at a location adjacent to the site where the capture probe hybridizes to the target nucleic acid, the locus specific oligonucleotide can then be ligated to the capture probe. In some embodiments, the capture probe can be extended such that the extended capture probe incorporates sequences complementary to the index and the index primer binding site of the polynucleotide.


In some embodiments, the index is sequenced to determine the source of the target nucleic acid attached to the bead on the array. In some embodiments, the barcode is sequenced to decode the location of the bead on the array. In some embodiments, the barcode is sequenced to identify the capture probe attached to the bead on the array.


In some embodiments, the array is located on the surface of a flowcell. In some embodiments, the beads are adapted to be attached to the array. For example, the beads can include agents such as biotin, streptavidin, or a derivative thereof and the array comprises biotin, streptavidin, or a derivative thereof. In some embodiments, the beads and array are magnetic.


Some embodiments include preparing a plurality of polynucleotides from different nucleic acids samples, hybridizing each plurality of polynucleotides with a population of beads comprising capture probes to obtain hybridized beads, and randomly distributing the hybridized beads on an array. An example embodiment includes obtaining a first and a second population of beads, in which the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes, and barcode primer binding sites adjacent to the first barcodes, and the second population of beads comprise oligonucleotides comprising second capture probes, second barcodes, and barcode primer binding sites adjacent to the second barcodes. A first and a second plurality of polynucleotides are obtained in which the first plurality of polynucleotides comprises first target nucleic acids, wherein the plurality of first polynucleotides is in solution, and the second plurality of polynucleotides comprises second target nucleic acids, wherein the plurality of second polynucleotides is in solution. The first target nucleic acids are hybridized to the first capture probes to obtain hybridized first beads, and the second target nucleic acids are hybridized to the second capture probes to obtain hybridized second beads. The hybridized first beads and hybridized second bead on an array are randomly distributed on an array. The locations of the first and second beads on the array are decoded by sequencing the first and second barcodes to determine which beads were bound by each set of target nucleic acids. Nucleic acid sequence data is obtained for the first and second target nucleic acids by extending the first and second capture probes.


In some embodiments, the polynucleotides are indexed as described herein. For example, a first and a second population of beads are obtained, in which the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes, and barcode primer binding sites adjacent to the first barcodes, and the second population of beads comprise oligonucleotides comprising second capture probes, second barcodes, and barcode primer binding sites adjacent to the second barcodes. A first and a second plurality of polynucleotides are obtained, in which the first plurality of polynucleotides comprises first target nucleic acids, first indexes, and index primer binding sites adjacent to the first indexes, wherein the plurality of first polynucleotides is in solution, and the second plurality of polynucleotides comprises second target nucleic acids, second indexes, and second primer binding sites adjacent to the second indexes wherein the plurality of second polynucleotides is in solution. The first target nucleic acids are hybridized to the first capture probes to obtain hybridized first beads, and the second target nucleic acids are hybridized to the second capture probes to obtain hybridized second beads. The hybridized first beads and hybridized second bead are randomly distributed on an array. The locations of the first and second beads on the array are decoded by sequencing the first and second barcodes. The first and second capture probes are extended to obtain nucleic acid sequence data for the first and second target nucleic acids. A source for the nucleic acid sequence data of the first and second target nucleic acids is determined by sequencing the first and second indexes.


In some embodiments, the first plurality of polynucleotides comprises first indexes, and index primer binding sites adjacent to the first indexes; and the second plurality of polynucleotides comprises second target nucleic acids, second indexes, and index primer binding sites adjacent to the second indexes. In some embodiments, the first or second plurality of polynucleotides is obtained by tagmenting a nucleic acid sample with a plurality of transposomes. In some embodiments, the plurality of transposomes comprise the first or second indexes. Some embodiments also include adding adaptors to the tagmented nucleic acid sample, wherein the adaptors comprise the first or second indexes. Some embodiments also include amplifying the tagmented nucleic acid sample with primers comprising the first or second indexes. In some embodiments, extending the first and second capture probes incorporates sequences complementary to the first and second indexes and to the first and second index primer binding sites into the extended capture probes.


In some embodiments, the beads can be indexed. For example, a bead can include a nucleic acid comprising an index and an index primer binding site. In some embodiments, an oligonucleotide attached to a bead can include a barcode, a barcode primer binding site, an index, an index primer binding site. For example, a first population of beads can comprise first indexes, and index primer binding sites adjacent to the first indexes, and a second population of beads can comprise second indexes, and index primer binding sites adjacent to the second indexes. In some embodiments, the oligonucleotides of the first population of beads comprise the first indexes; and the oligonucleotides of the second population of beads comprise the second indexes.


In some embodiments, a first and a second population of beads is obtained, in which the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes and barcode primer binding sites adjacent to the first barcodes, and first indexes and index primer binding sites adjacent to the first indexes, and the second population of beads comprise oligonucleotides comprising second capture probes, second barcodes and barcode primer binding sites adjacent to the second barcodes, and second indexes and index primer binding sites adjacent to the second indexes. A first and a second plurality of polynucleotides are obtained, in which the first plurality of polynucleotides comprises first target nucleic acids, wherein the plurality of first polynucleotides is in solution, and the second plurality of polynucleotides comprises second target nucleic acids, wherein the plurality of second polynucleotides is in solution. The first target nucleic acids are hybridized to the first capture probes to obtain hybridized first beads, and the second target nucleic acids are hybridized to the second capture probes to obtain hybridized second beads. The hybridized first beads and hybridized second beads are randomly distributed on an array. The locations of the first and second beads on the array are decoded by sequencing the first and second barcodes. The first and second capture probes are extended to obtain nucleic acid sequence data for the first and second target nucleic acids. In some embodiments, the oligonucleotides of the first population of beads comprise the first indexes; and the oligonucleotides of the second population of beads comprise the second indexes.


In some embodiments, the first indexes are indicative of a source of the first target nucleic acids, and the second indexes are indicative of a source of the second target nucleic acids. In some embodiments, the first indexes are the same as one another, and the second indexes are the same as one another. Some embodiments also include sequencing the first and second indexes. In some embodiments, the first and second indexes comprises extending primers hybridized to the index primer binding sites. In some embodiments, the index binding primer sites are the same. In some embodiments, the first and second target nucleic acids are obtained from different nucleic acid samples. In some embodiments, the first and second target nucleic acids are obtained from genomic DNA.


In some embodiments, the first and second barcodes are indicative of a nucleic acid sequence of the first or second capture probes. In some embodiments, the first barcodes are different from one another, and the second barcodes are different from one another. In some embodiments, sequencing the first and second barcodes comprises extending primers hybridized to the barcode primer binding sites. In some embodiments, the barcode primer binding sites are the same.


In some embodiments, extending the first and second capture probes comprises polymerase extension. In some embodiments, extending the first and second capture probes comprises addition of a single nucleotide to a capture probe. Some embodiments also include ligating locus specific oligonucleotides to the extended capture probes. In some embodiments, extending the first and second capture probes comprises ligating locus specific oligonucleotides to the capture probes.


In some embodiments, hybridizing the first target nucleic acids to the first capture probes to obtain hybridized first beads, and hybridizing the second target nucleic acids to the second capture probes to obtain hybridized second beads is performed in solution.


In some embodiments, a flowcell comprises the array. In some embodiments, the array comprises a plurality of wells, wherein each well comprises one or more beads. In some embodiments, the first and second beads are adapted to be attached to the array. In some embodiments, the first and second beads comprise biotin, streptavidin, or a derivative thereof; and the array comprises biotin, streptavidin, or a derivative thereof. In some embodiments, the beads and array are magnetic.


An example embodiment of sequencing target nucleic acids on an array, in which polynucleotides comprising target nucleic also include an index is depicted in FIG. 9. A DNA sample comprising a target nucleic acid containing a single nucleotide polymorphism (star) undergoes tagmentation in which the DNA sample is fragmented, and amplification primer binding sites are added to each end of the fragments. The fragments are amplified by PCR with primers that bind to the amplification primer binding sites and include a sample index, and an index read primer binding site. The sample index and index read primer sequences are incorporated into the amplified products. Beads are prepared by attaching oligonucleotides to the beads. The oligonucleotides comprise a linker, a decode sequence (also known as a barcode sequence), a decode read primer binding site, and a capture probe. The amplified products and the beads are combined. The target nucleic acids are hybridized to the capture probes. The hybridization can occur in solution or on an array. The hybridized beads are randomly distributed on an array. The capture probe is extended with fully functional nucleotides (FFNs), and the extension is detected. The extended capture probe is extended further to incorporate sequences complementary to the sample index and index read primer binding site. The strand comprising the target nucleic acid is dissociated from the extended capture. The sample index of the extended capture probe is sequenced by extending a primer hybridized to the index read primer binding site. The decode sequence of the extended capture probe is sequenced by extending a primer hybridized to the decode read primer binding site. The source of the target nucleic acid is identified from the sequence of the sample index. The bead is decoded and the capture probe identified from the decode sequence.


Another example embodiment of sequencing target nucleic acids on an array, in which beads include an index is depicted in FIG. 10. Beads are prepared by attaching capture probe oligonucleotides and sample index probe oligonucleotides. The capture probe oligonucleotides include a linker, a decode sequence, a decode read primer binding site, and a capture probe. The sample index probe oligonucleotide includes a linker, a sample index, and an index read primer binding site. The beads are combined with fragmented nucleic acids comprising target nucleic acids, such as nucleic acids containing a single nucleotide polymorphisms of interest (star). The target nucleic acid hybridizes to the capture probe, either in solution or on an array. The capture probe is extended with FFNs. The extension is detected. The target nucleic acid is dissociated from the extended capture probe. The sample index is sequences by extending a primer hybridized to the index read primer binding site. The decode sequence of the extended capture probe is sequenced by extending a primer hybridized to the decode read primer binding site. The source of the target nucleic acid is identified from the sequence of the sample index. The bead is decoded and the capture probe identified from the decode sequence.


Certain Dual Probe Methods

Some embodiments include detecting a target nucleic acid with first and second capture probes. In some embodiments, a target nucleic acid includes a first portion capable of hybridizing to a first capture probe, and a second portion capable of hybridizing to a second capture probe. In some embodiments, a population of beads is obtained in which each bead comprises the first capture probe and the second capture. In some embodiments, one of the two capture probes is attached to the bead via a cleavable linker. In some embodiments, one of the capture probes comprises a detectable label, such as a fluorescent label. In some embodiments, the target nucleic acid hybridizes to the capture probes to generate a double-stranded nucleic acid comprising a single-stranded gap. The gap is filled, and the cleavable linker is cleaved. The extended capture probe is detected. In some embodiments, one of the capture probes comprises a detectable label, such as a fluorescent label. In some embodiments, a detectable label is incorporated into the extended capture probe during the gap-filling.


In some embodiments, the second capture probe is attached to the bead via a cleavable linker, and the second capture probe comprises a detectable label, such as a fluorescent label. Examples of cleavable linkers include linkers that can be cleaved by chemical means, by enzymes such as endonucleases, and by certain frequencies of light. In some embodiments, each bead also includes a first polynucleotide comprising a barcode indicative of the first or second capture probe, and a barcode primer binding site 3′ of the barcode. In some embodiments, the first capture probe comprises the first polynucleotide. In some embodiments, the first capture probe is distinct from the first polynucleotide.


In some embodiments, an end of the first capture probe is ligated to an end of the second capture probe. In some embodiments, the first capture probe is ligated to the second capture probe by hybridizing the target nucleic acid to the first and second capture probes of a bead of the population of beads to generate a double-stranded nucleic acid comprising a single-stranded gap between the first and second capture probes, and filing-in the gap between the first and second capture probes. In some embodiments, gap-filling can be performed with a polymerase and/or a ligase. In some embodiments, the cleavable linker is cleaved to generate a bead comprising first capture probe comprising the detectable label. In some embodiments, the population of beads is distributed on a substrate. In some embodiments, the population of beads is distributed on a substrate after the cleavable linker is cleaved. In some embodiments, the population of beads is distributed on a substrate before the cleavable linker is cleaved, or before ligating the first capture probe to the second capture probe. In some embodiments, the location of the bead comprising the first capture probe comprising the detectable label on the substrate is determined.


In some embodiments, decoding the location of the bead comprising the first capture probe comprising the detectable label on the substrate comprises decoding the location of the barcode of the bead comprising the first capture probe comprising the detectable label on the substrate. Some such embodiments can include hybridizing a barcode primer to the barcode primer site, and extending the hybridized barcode primer. In some embodiments, extending the hybridized barcode primer comprises at least one cycle of sequencing by synthesis. In some embodiments, decoding the location of the barcode of the bead comprises sequencing the barcode on the substrate.


In some embodiments, each bead comprises a second polynucleotide comprising an index indicative of the source of the target nucleic acid, and an index primer binding site 3′ of the index.


In some embodiments, the population of beads comprises first and second subpopulations of beads, each bead comprising a second polynucleotide comprising an index and an index primer binding site 3′ of the index, wherein the indexes of the first subpopulation are different from the indexes of the second subpopulation. In some embodiments, the nucleotide sequences of the indexes of the first subpopulation of beads comprise the same nucleotide sequence, and the nucleotide sequences of the indexes of the second subpopulation of beads comprise the same nucleotide sequence. In some embodiments, the nucleotide sequences of the index primer binding sites comprise the same nucleotide sequence.


In some embodiments, the ligating or cleaving step with the first subpopulation of beads is performed at different locations from the ligating or cleaving step with the second subpopulation of beads. In some embodiments, the different locations comprise different reaction volumes. In some embodiments, the different locations comprise different wells.


Some embodiments also include combining the first and second subpopulations of beads prior to distributing the population of beads on the substrate. In other embodiments, the first subpopulation of beads is distributed on the substrate before the second subpopulation of beads is distributed on the substrate.


In some embodiments, decoding the location of the bead comprising the first capture probe comprising the detectable label on the substrate comprises determining the location of the indexes of the beads comprising a detected capture probe. Some such embodiments include hybridizing a plurality of index primers to the index primer sites, and extending the hybridized index primers. In some embodiments, extending the hybridized index primers comprises at least one cycle of sequencing by synthesis. In some embodiments, decoding the location of the indexes of the beads comprises sequencing the indexes on the substrate.


In some embodiments, the substrate comprises a plurality of discrete sites. In some embodiments, the substrate comprises a plurality of wells. In some embodiments, the substrate comprises a plurality of channels. In some embodiments, a flowcell comprises the substrate. In some embodiments, the distributed population of beads comprise an array.


An embodiment is depicted in FIG. 6B left panel, in which a bead 200 comprises a first capture probe 300 attached to the bead via a cleavable linker 310. A second capture probe 320 is attached to the bead. A target nucleic acid 220 from a certain sample is hybridized to the first and second capture probes, and the first capture probe is extended to fill the gap between the first and second capture probes with nucleotides that include detectable labels 230. After the gap is filled, the target nucleic acid is removed, and the cleavable linker cleaved to generate an extended second capture probe comprising detectable labels attached to the bead. The bead can be distributed on an array on a substrate, and decoded. A polynucleotide comprising an index indicative of the sample attached to the bead, and a polynucleotide comprising a barcode indicative of the first or second capture probes are not shown. Decoding can include determining the location of the detectable label on the array; determining the barcode attached to the bead on the array; and/or determining the index attached to the bead on the array.


An embodiment id depicted in FIG. 6C in which a first capture probe is attached via its 5′ end to a bead 200. A second capture probe 360 is attached via its 3′ end and a cleavable linker 310 to the bead. The second capture probe includes detectable labels 230. A target nucleic acid 220 is hybridized to the first and second capture probes, the first capture probe is extended and ligated to the second capture probe. In some embodiments, the target nucleic acid can include structural variants (SV). The cleavable linker is cleaved to generate an extended first capture probe comprising detectable label and attached to the bead. The bead can be distributed on an array on a substrate, and decoded. A polynucleotide comprising an index indicative of the sample attached to the bead, and a polynucleotide comprising a barcode indicative of the first or second capture probes are not shown. Decoding can include determining the location of the detectable label on the array; determining the barcode attached to the bead on the array; and/or determining the index attached to the bead on the array. In the absence of a target nucleic acid, the second capture probe and detectable label is cleaved from the bead.


Certain Methods for the Preparation and Use of Indexed Beads

Some embodiments include preparing indexed beads. Some such embodiments can include providing a population of index polynucleotides, wherein each index polynucleotide comprises an index, an index primer binding site, and an anchor/adaptor. In some embodiments, the anchor/adaptor is capable of binding or hybridizing to an adaptor binding site attached to a bead.


Some embodiments include a method of preparing an indexed population of beads, comprising (a) obtaining a population of beads, wherein each bead comprises an adaptor, a capture probe, and a first polynucleotide comprising a barcode, and a barcode primer binding site; (b) obtaining a plurality of index polynucleotides, wherein each index polynucleotide comprises an index, and an index primer binding site; and (c) attaching the plurality of index polynucleotides to the population of beads via the adaptors, thereby obtaining an indexed population of beads. In some embodiments, (c) comprises extending the adaptors by polymerase extension. In some embodiments, each index polynucleotide comprises an adaptor binding site, and the attaching comprises hybridizing the adaptor binding sites to the adaptors. In some embodiments, (c) comprises ligating the index polynucleotides to the adaptors. In some embodiments, the attaching comprises hybridizing a splint polynucleotide to the adaptor and to the index polynucleotide. In some embodiments, (c) comprises attaching the plurality of index polynucleotides to the adaptors of the population of beads via a chemically reactive moiety. In some embodiments, the first polynucleotides of the population of beads comprise capture probes different from one another. In some embodiments, the index of each index polynucleotide is the same. Aspects useful for methods and compositions useful to join polynucleotides to one another by chemical ligation are disclosed in U.S. 20180127816 which is incorporated by reference in its entirety.


Some embodiments also include contacting the indexed population of beads with a plurality of nucleic acids comprising a target nucleic acid. Some embodiments also include mixing the indexed population of beads contacted with a plurality of nucleic acids comprising a target nucleic with an additional indexed population of beads, wherein the additional indexed population of beads comprises an index polynucleotide comprising an index different from the index of the indexed population of beads contacted with a plurality of nucleic acids.


In some embodiments, the first polynucleotide comprises the capture probe. Some embodiments also include contacting the indexed population of beads with a plurality of nucleic acids comprising a target nucleic acid. Some embodiments also include mixing the indexed population of beads contacted with a plurality of nucleic acids comprising a target nucleic with an additional indexed population of beads, wherein the additional indexed population of beads comprises an index polynucleotide comprising an index different from the index of the indexed population of beads contacted with a plurality of nucleic acids.


In some embodiments, the capture probe comprises a protein.


In some embodiments, the method is performed on a flow cell.


An example embodiment for preparing indexed beads is depicted in FIG. 11. An index polynucleotide comprises an anchor or adaptor, an index, and an index primer binding site. A plurality of populations of index polynucleotides are prepared and distributed into wells of a multiwell plate. Each well can contain a population of index polynucleotide comprising the same index. For example, a first well can contain a population of index polynucleotides comprising a first index, and a second well can contain a population of index polynucleotides comprising a second index. A plurality of beads can be distributed into the wells. Each bead can include a first polynucleotide comprising a capture probe, and a second polynucleotide comprising an indexing anchor such as a binding site capable of binding to the anchor or adaptor of an index polynucleotide. A single bead pool is added to each well. A single bead pool can include a population of beads in which the capture probes are different from one another. A single bead pool containing the same index can be used with a single sample, such that the source of subsequent products of manipulated nucleic acids of the sample can be identified to have originated from the single sample via identifying the associated index. As depicted in FIG. 11, a single indexed bead pool can be added to a single well in a multiwell plate, in which each well contains a single sample.


The index of the index polynucleotide can be incorporated into a bead by several different methods. In some embodiments, the index polynucleotide is hybridized to the bead via the anchor and second polynucleotide, and the second polynucleotide is extended thereby incorporating a complement of the index of the index polynucleotide into the extended second polynucleotide attached to the bead. In some embodiments, the index polynucleotide and the second polynucleotide are ligated together. In some embodiments, the index polynucleotide is attached to a polynucleotide attached to a bead via a chemical or enzymatic method. In some embodiments, the index polynucleotide is hybridized to a polynucleotide attached to a bead, and the index of the hybridized index polynucleotide is determined on an array.


Some embodiments include adding an index to a bead by chemical or enzymatic methods. An example embodiment for adding an index to a bead by chemical or enzymatic methods is depicted in FIG. 12A which depicts a bead comprising first and second polynucleotides. The first polynucleotide comprises a probe such as a capture probe; a code, such as a barcode; and a Primer A, such as a barcode primer binding site which can be used to determine the sequence of the barcode. The second polynucleotide comprises an index; and a Primer B, such as an index primer binding site which can be used to determine the sequence of the index. The second polynucleotide can be attached to the bead via a spacer and moieties X-Y which link the second polynucleotide to the spacer. In some embodiments, the capture probe can include a sequence primer binding site. In some embodiments, a blocking group can be added to the index polynucleotide. In some embodiments, the index primer binding site could also be a hairpin with a reversibly blocked 3′-terminus.


Some embodiments include adding an index to a bead by extending an adaptor attached to a bead. In some embodiments, an adaptor attached to a bead can be extended by ligation, or by polymerase extension. An example embodiment of extension by ligation is depicted in FIG. 12B which depicts a bead comprising first and second polynucleotides. The first polynucleotide comprises a probe such as a capture probe; a code, such as a barcode; and a Primer A, such as a barcode primer binding site which can be used to determine the sequence of the barcode. The second polynucleotide comprises an adaptor which can be extended by ligation to a third polynucleotide comprising an index and a Primer B with the use of a fourth polynucleotide comprising a Splint which is capable of hybridizing to both the second and third polynucleotides. In some embodiments, the capture probe can include a sequence primer binding site. In some embodiments, a blocking group can be added to the index polynucleotide. In some embodiments, the index primer binding site could also be a hairpin with a reversibly blocked 3′-terminus.


An example embodiment of extension by polymerase extension is depicted in FIG. 12C which depicts a bead comprising first and second polynucleotides. The first polynucleotide comprises a probe, Primer A, and code. The second polynucleotide comprises an adaptor. A third polynucleotide, such as an index polynucleotide comprises an adaptor binding site capable of binding to the adaptor, an index, and a Primer B. The third polynucleotide hybridizes to the second polynucleotide such that the second polynucleotide is extended by polymerase extension to incorporate the index into the extended adaptor. In some embodiments, the capture probe can include a sequence primer binding site. In some embodiments, a blocking group can be added to the index polynucleotide. In some embodiments, the index primer binding site could also be a hairpin with a reversibly blocked 3′-terminus.


Some embodiments include use of an index polynucleotide hybridized to a first poly nucleotide attached to a bead. Some embodiments include methods for detecting a target ligand, comprising: (a) obtaining a population of beads, wherein each bead comprises a capture probe, a first polynucleotide comprising a barcode, and a barcode primer binding site; (b) obtaining an index polynucleotide comprising an index, an index primer binding site, and an adaptor capable of binding to the barcode primer binding site; (c) specifically binding a target ligand to the capture probe; (d) hybridizing the index polynucleotide to the first polynucleotide via the adaptor; (e) detecting the target ligand on an array; and (f) determining the index and the barcode of the first polynucleotide. In some embodiments, (e) comprises distributing the population of beads on an array. In some embodiments, (f) comprises hybridizing an index primer to the index primer binding site, and determining the sequence of the index. In some embodiments, dehybridizing the index polynucleotide from the first polynucleotide; hybridizing a barcode primer to the barcode primer binding site; and extending the barcode primer to determine the sequence of the barcode. In some embodiments, the index polynucleotide further comprises a cleavable linker located between the adaptor and the index, and (f) comprises: (i) cleaving the cleavable linker; and (ii) extending the adaptor to determine the sequence of the barcode. In some embodiments, the capture probe comprises a protein. In some embodiments, the target ligand comprises a target nucleic acid. In some embodiments, the first polynucleotide comprises the capture probe. In some embodiments, (e) comprises extending the first polynucleotide hybridized to the target nucleic acid. In some embodiments, the extension comprises adding a detectable dideoxynucleotide. In some embodiments, the method is performed on a flow cell.


An example embodiment for use of an index polynucleotide hybridized to a first poly nucleotide attached to a bead is depicted in FIG. 13A and FIG. 13B. FIG. 13A depicts a bead comprising a first polynucleotide. The first polynucleotide comprises a probe, such as a capture probe; a Primer, such as a primer binding site; and a code, such as a barcode. An index polynucleotide comprises an index; a Primer 2, such as an index primer binding site; a Spacer, such as a cleavable spacer; and an adaptor capable of hybridizing to the a primer binding site of the first polynucleotide. A bead pool can be prepared with the index polynucleotide as depicted in FIG. 11. FIG. 13B, first panel, depicts a bead comprising the first polynucleotide in which a target nucleic acid is hybridized to the first polynucleotide via the capture probe, and the index polynucleotide is hybridized to the first polynucleotide via the primer binding site. FIG. 13B, second panel, depicts an extended first polynucleotide with a single fluorescent dideoxynucleotide (star) which can be detected on an array of beads. The index and barcode can be determined. FIG. 13B, third panel, depicts that the fluorescent dideoxynucleotide has been removed from the extended first polynucleotide. An index primer can be hybridized to the index primer binding site and extended, and the sequence of the index determined. FIG. 13B, fourth panel, depicts that the target nucleic acid has been dehybridized from the from the first polynucleotide. The index polynucleotide is cleaved at the cleavable linker, and the adaptor now corresponds to a barcode primer hybridized to the primer binding site which can be extended, and the sequence of the barcode determined.


Some embodiments include use of an index polynucleotide comprising a plurality of an index sequence and index primer binding site. In some such embodiments, the plurality of the index sequences can result in an increased signal at a location on an array when the index is sequenced. An example embodiment is depicted in FIG. 14. FIG. 14 depicts a bead comprising a first polynucleotide and a second polynucleotide. The first polynucleotide comprises: a code, such as a barcode; a Primer A, such as a barcode primer binding site; and a probe, such as a capture probe. The second polynucleotide comprises repeats of an index and index primer binding sites. In some embodiments, the repeats improve signal intensity on beads with low numbers of immobilized indexes. In some embodiments, the repeats can limit the amount of surface area occupied by primers without reducing a signal.


Some embodiments include enzymatic addition of an index polynucleotide to a bead. In some embodiments, an index polynucleotide can be attached to a bead by extending a polynucleotide attached to a bead via a reactive group. In some embodiments, the index polynucleotide can be attached to the bead prior to mixing a bead pool with other bead pools, for example, prior to loading the mixture on to an array. Some embodiments include a method for detecting a target ligand, comprising: (a) obtaining a population of beads, wherein each bead comprises a capture probe, a first polynucleotide comprising a barcode, and a barcode primer binding site, and a second polynucleotide; (b) obtaining an index polynucleotide comprising an index, an index primer binding site, and an adaptor; (c) specifically binding a target ligand to the capture probe; (d) attaching the index polynucleotide to the second polynucleotide via the adaptor; (e) detecting the target ligand on an array; and (f) determining the index and the barcode of the first polynucleotide. In some embodiments, the second polynucleotide comprises a barcode, and a barcode primer binding site. In some embodiments, (d) comprises adding a reactive moiety to the second polynucleotide, wherein the adaptor is capable of attaching to the reactive moiety. Aspects useful for methods and compositions useful to add a reactive moiety to a polynucleotide are disclosed in U.S. 20180127816 which is incorporated by reference in its entirety In some embodiments, (e) comprises distributing the population of beads on an array. In some embodiments, (f) comprises hybridizing an index primer to the index primer binding site, and determining the sequence of the index. In some embodiments, (f) comprises hybridizing a barcode primer to the barcode primer binding site, and determining the sequence of the barcode. In some embodiments, the capture probe comprises a protein. In some embodiments, the target ligand comprises a target nucleic acid. In some embodiments, the first polynucleotide comprises the capture probe. In some embodiments, (e) comprises extending the first polynucleotide hybridized to the target nucleic acid. In some embodiments, the extension comprises adding a detectable dideoxynucleotide. In some embodiments, the method is performed on a flow cell.


An example embodiment enzymatic addition of an index polynucleotide to a bead is depicted in FIG. 15A and FIG. 15B. FIG. 15A and FIG. 15B depict the hybridization of a target nucleic acid to a bead via a capture probe of a first polynucleotide, addition of an index polynucleotide to the bead via an second polynucleotide, and determination of a barcode and index attached to the bead. FIG. 15A, first panel, depicts a bead comprising first and second polynucleotides. The first polynucleotide comprises: a code, such as a barcode; a Primer, such as a barcode primer binding site; and a Probe, such as a capture probe. FIG. 15A, second panel, depicts hybridization of a target nucleic acid to the capture probe. FIG. 15A, third panel, depicts single base extension (SBE) of the first polynucleotide with a fluorescent dideoxynucleotide (star). FIG. 15A, fourth panel, depicts the addition of a reactive group (triangle) to the second polynucleotide, such as a reactive dideoxynucleotide. Addition can include use of a terminal deoxynucleotidyl transferase (TdT). FIG. 15B, first panel, depicts the addition of an index polynucleotide comprising an index, an index primer binding site, and an adaptor to the second polynucleotide via the reactive group and adaptor. The bead can be loaded on to an array, the location of the bead can be determined from the fluorescent dideoxynucleotide, and the fluorescent dideoxynucleotide can be removed. FIG. 15B, second panel, depicts hybridization of an index primer to the index primer binding site to determine the sequence of the index. The index primer and target nucleic acid can be dehybridized. FIG. 15B, third panel, depicts hybridization of barcode primers to barcode primer binding sites to determine the sequence of the barcodes.


Spacers

Some embodiments provided here include the use of spacers. A spacer can include a polynucleotide having a length greater than or equal to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 100, 200, 500 consecutive nucleotides, or a length within a range of any two of the foregoing numbers. In some embodiments, a spacer can be located between two polynucleotide elements to increase the efficiency of certain processes by decreasing steric hindrance. For example, a spacer can be useful to increase the efficiency of enzymatic processes, such as use of polymerases, ligases, and/or terminal transferases for a substrate such as polynucleotides attached to a bead. In some embodiments, a spacer can be located between any two elements including: a bead, such as an attached end of a polynucleotide to a bead, an index, an index primer binding site, a barcode, a barcode primer binding site, and a capture probe.


Kits and Systems

Some embodiments include kits and systems for decoding microfeatures, such as polynucleotides on an array. Some such kits and systems can include a substrate, such as chip, or fluidic cell having an array of polynucleotides randomly distributed on a surface of the substrate. The polynucleotides can include a primer binding site 3′ of a bar code. In some such embodiments, each polynucleotide can include a capture probe. In more such embodiments, each polynucleotide can be associated with a capture probe through a common element. For example, a polynucleotide and a capture probe can each be bound to the same microfeature, such as a bead. Some embodiments include a detector adapted to detect signals from reagents hybridized to the polynucleotides in the array; such reagents can include sequencing reagents such as nucleotides comprising detectable labels. Some embodiments include a detector adapted to detect signals that can result from the incorporation of a nucleotide into a polynucleotide, such as pyrophosphate, or changes in hydrogen ions.


Some embodiments include kits or systems comprising at least a first and a second subpopulation of beads. In some embodiments, each bead of a subpopulation can include a first polynucleotide comprising a capture probe, a barcode indicative of the capture probe of the same bead, and a barcode primer binding site 3′ of the barcode. In some embodiments each bead of a subpopulation can also include a second polynucleotide comprising an index and an index primer binding site 3′ of the index. In some embodiments, the index of a subpopulation of beads can be indicative of that particular subpopulation of beads. In some embodiments, the indexes of the first subpopulation are different from the indexes of the second subpopulation. In some embodiments of the kits and systems provided herein, a first volume comprises the first subpopulation of beads, and a second volume comprises the second subpopulation of beads.


In some embodiments, the capture probes of the first and the second subpopulations of beads each comprise different nucleotide sequences from one another. In some embodiments, the different capture probes of the first and second subpopulations of beads comprise the same nucleotide sequences. In some embodiments, the capture probes comprise a nucleotide sequence capable of hybridizing to a single nucleotide polymorphism (SNP) or complement thereof.


In some embodiments, the barcode primer binding sites comprise the same nucleotide sequence. Some embodiments also include a plurality of barcode primers capable of hybridizing to the barcode primer sites.


In some embodiments, the nucleotide sequences of the indexes of the first subpopulation of beads comprise the same nucleotide sequence, and the nucleotide sequences of the indexes of the second subpopulation of beads comprise the same nucleotide sequence. In some embodiments, the nucleotide sequences of the index primer binding sites comprise the same nucleotide sequence. Some embodiments also include a plurality of index primers capable of hybridizing to the index primer sites.


In some embodiments, the substrate comprises a plurality of discrete sites. In some embodiments, the substrate comprises a plurality of wells. In some embodiments, the substrate comprises a plurality of channels. In some embodiments, a flowcell comprises the substrate. In some embodiments, the substrate is adapted such that a combination of the first and the second subpopulations of beads form an array of beads on a surface of the substrate, and the array is capable of being sequenced in a plurality of sequencing by synthesis cycles.


In some embodiments, the first and second subpopulations of beads each comprises at least 50 capture probes comprising different nucleotide sequences. In some embodiments, the first and second subpopulations of beads each comprises at least 500 capture probes comprising different nucleotide sequences. In some embodiments, the first and second subpopulations of beads each comprises at least 5000 capture probes comprising different nucleotide sequences. In some embodiments, the first and second subpopulations of beads each comprises at least 50000 capture probes comprising different nucleotide sequences.


Some embodiments include at least 10 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation. Some embodiments include at least 100 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation. Some embodiments include at least 1000 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation. Some embodiments include at least 10000 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation.


Some embodiments include a kit comprising: an array of polynucleotides randomly distributed on a surface of a substrate, wherein each polynucleotide comprises a primer binding site 3′ of a barcode and is linked to a capture probe, wherein each polynucleotide comprises a different barcode and is linked to a different capture probe. In some embodiments, each polynucleotide is linked to the capture probe via a bead. In some embodiments, each polynucleotide comprises the capture probe In some embodiments, the substrate is planar. In some embodiments, the substrate comprises wells. In some embodiments, the polynucleotides are attached to beads. In some embodiments, the beads are distributed in the wells. In some embodiments, a flow cell comprises the array.


Some embodiments include kits and systems comprising a first and a second subpopulation of beads. In some embodiments, each bead comprises a capture probe which specifically binds to a target ligand. Examples of target ligands include nucleic acids, proteins, or other antigens. Examples of capture probes include nucleic acids, antibodies, and antigen-binding fragments of antibodies. In some embodiments, each bead comprises a first polynucleotide comprising a barcode indicative of the capture probe of the same bead, and a barcode primer binding site 3′ of the barcode. In some embodiments, each bead comprises a second polynucleotide comprising an index and an index primer binding site 3′ of the index. In some such embodiments, the indexes of the first subpopulation are different from the indexes of the second subpopulation. For example, the indexes of the first subpopulation of beads can be used to distinguish the first subpopulation of beads from the second subpopulation of beads. In some embodiments, the first subpopulation of beads is separate from the second subpopulation of beads. For example, a first volume comprises the first subpopulation of beads and a second volume comprises the second subpopulation of beads.


In some embodiments, the capture probe comprises a nucleic acid, and the target ligand comprises a target nucleic acid. In some such embodiments, the first polynucleotide comprises the capture probe. In some embodiments, the capture probes comprise a nucleotide sequence capable of hybridizing to a single nucleotide polymorphism (SNP) or complement thereof.


In some embodiments, the capture probe comprises an antibody or antigen binding fragment of an antibody.


In some embodiments, the capture probes of the first and the second subpopulations of beads each specifically bind to different target ligands from one another. For example, the capture probes of the first subpopulation of beads each specifically bind to different target ligands from one another; and the capture probes of the second subpopulation of beads each specifically bind to different target ligands from one another. In some embodiments, the different capture probes of the first and second subpopulations of beads specifically bind to the same target ligands. For example, the set of different capture probes of the first subpopulation of beads specifically bind to the same target ligands as the set of different capture probes of the first subpopulation of beads.


In some embodiments, the barcode primer binding sites comprise the same nucleotide sequence. Some embodiments also include a plurality of barcode primers capable of hybridizing to the barcode primer sites.


In some embodiments, the nucleotide sequences of the indexes of the first subpopulation of beads comprise the same nucleotide sequence, and/or the nucleotide sequences of the indexes of the second subpopulation of beads comprise the same nucleotide sequence. In some embodiments, the nucleotide sequences of the index primer binding sites comprise the same nucleotide sequence. Some embodiments also include a plurality of index primers capable of hybridizing to the index primer sites.


In some embodiments, the substrate comprises a plurality of discrete sites. In some embodiments, the substrate comprises a plurality of wells. In some embodiments, the substrate comprises a plurality of channels. In some embodiments, a flowcell comprises the substrate. In some embodiments, the substrate is adapted such that a combination of the first and the second subpopulations of beads form an array of beads on a surface of the substrate, and the array is capable of being sequenced in a plurality of sequencing by synthesis cycles.


In some embodiments, the first and second subpopulations of beads each comprises at least 50, 100, 500, 1000, or 5000 capture probes different from one another, or any number of capture probes between any two of the foregoing numbers. Some embodiments also include at least 5, 10, 20, 50, 100, 200, 500, 1000 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation, or any number of different subpopulations of beads between any two of the foregoing numbers.


Some embodiments include a kit comprising a first and a second subpopulation of beads, wherein each bead comprises: a capture probe which specifically binds to a target ligand, a first polynucleotide comprising a barcode indicative of the capture probe, and a barcode primer binding site 3′ of the barcode, and a second polynucleotide comprising an index and an index primer binding site 3′ of the index, wherein the indexes of the first subpopulation are different from the indexes of the second subpopulation, wherein a first volume comprises the first subpopulation of beads, and a second volume comprises the second subpopulation of beads. In some embodiments, the capture probe comprises a nucleic acid, and the target ligand comprises a target nucleic acid. In some embodiments, the first polynucleotide comprises the capture probe. In some embodiments, the capture probes of the first and the second subpopulations of beads each comprise different nucleotide sequences from one another. In some embodiments, the different capture probes of the first and second subpopulations of beads comprise the same nucleotide sequences. In some embodiments, the capture probe comprises an antibody or antigen binding fragment thereof. In some embodiments, the capture probes of the first and the second subpopulations of beads each specifically bind to different target ligands from one another. In some embodiments, the different capture probes of the first and second subpopulations of beads specifically bind to the same target ligands. In some embodiments, the barcode primer binding sites comprise the same nucleotide sequence. Some embodiments also include a plurality of barcode primers capable of hybridizing to the barcode primer sites. In some embodiments, the nucleotide sequences of the indexes of the first subpopulation of beads comprise the same nucleotide sequence, and the nucleotide sequences of the indexes of the second subpopulation of beads comprise the same nucleotide sequence. In some embodiments, the nucleotide sequences of the index primer binding sites comprise the same nucleotide sequence. Some embodiments also include a plurality of index primers capable of hybridizing to the index primer sites. In some embodiments, a flowcell comprises the substrate. In some embodiments, the substrate is adapted such that a combination of the first and the second subpopulations of beads form an array of beads on a surface of the substrate, and the array is capable of being sequenced in a plurality of sequencing by synthesis cycles. In some embodiments, the first and second subpopulations of beads each comprises at least 50 capture probes different from one another. In some embodiments, the first and second subpopulations of beads each comprises at least 500 capture probes different from one another. Some embodiments also include at least 10 different subpopulations of beads, each subpopulation comprising an index different from another subpopulation.


More embodiments of kits and systems provided herein can include a plurality of populations of beads comprising oligonucleotides attached to the beads, the oligonucleotides comprising indexes, index primer binding sites adjacent to the indexes, capture probes, barcodes, and barcode primer binding sites adjacent to the barcodes, wherein the indexes are different between the populations of beads. In some embodiments, the index primer binding sites are the same in the plurality of populations. In some embodiments, the barcodes are indicative of a nucleic acid sequence of the capture probes. In some embodiments, the barcodes are different in a population of beads. In some embodiments, the barcode primer binding sites are the same in the plurality of populations. In some embodiments, the beads comprise biotin, streptavidin, or a derivative thereof. In some embodiments, the beads are magnetic.


Some embodiments also include a reagent selected from: a locus specific oligonucleotide; a transposome for tagmenting a nucleic acid sample; a transposome comprising an index and an index primer binding site; an adaptor comprising an index and an index primer binding site; a primer capable of hybridizing to an index primer binding site or complement thereof and/or a primer capable of hybridizing to a barcode primer binding site or complement thereof. Some embodiments, also include an array, such as an array on the surface of a flowcell. Some embodiments include a detector adapted to detect signals from reagents hybridized to the polynucleotides in the array; such reagents can include sequencing reagents such as nucleotides comprising detectable labels. Some embodiments include a detector adapted to detect signals that can result from the incorporation of a nucleotide into a polynucleotide, such as pyrophosphate, or changes in hydrogen ions.


EXAMPLES
Example 1—Decoding an Array by Sequencing

A plurality of polynucleotides are synthesized, each polynucleotide comprises 5′ to 3′: a spacer, a unique barcode, a primer binding site, and a unique capture probe. The sequences of the barcode and capture probe are known; the sequence of the primer binding site is the same for each polynucleotide. Each polynucleotide is attached to a bead. The beads are randomly distributed into the wells of a chip. The bead array is decoded by hybridizing a primer to the primer binding site, extending the primer, and detecting the sequence of the barcode. The location of the barcode identifies the location of the associated capture probe.


Example 2—Decoding Barcodes on an Array by Sequencing

A nucleic acid library prepared from human genomic DNA was prepared. A subpopulation of beads was prepared. A first and second polynucleotide was attached to each bead. The first polynucleotide included a capture probe, a barcode primer binding site, and a barcode indicative of the capture probe. The second polynucleotide included an index and an index primer binding site.


A bead pool containing 18816 different code/probe types was loaded onto a HiSeq flow cell (FIG. 7A). After immobilization, 20 nucleotide long codes were sequenced using SBS chemistry and the identity of each bead was determined by aligning code sequences to a list of bead types. FIG. 7B is a histogram of the number of replicates for certain bead types in certain bins, and showed that 97.5% of the expected content is identified using a sequencing based decode process and that the vast majority of bead types are present at a level sufficient for genotyping studies.


Example 3—Genotyping Performance on HiSeq Using FFN Detection

To demonstrate genotyping performance on HiSeq using FFN detection, oligonucleotide target DNA was hybridized to a suspension of beads with conjugated to probe oligos. Beads were loaded onto a HiSeq flow cell. Probes bound to target DNA were extended by a single base using fluorescent nucleotides. The bead loaded flow cell was imaged to obtain genotyping bead intensities. Fluorescent nucleotides were cleaved and beads were decoded using SBS chemistry. Decode and genotyping reads were aligned to measure assay performance. Individual points were colored according to the expected genotype. FIG. 7C is a graph of C intensity vs T intensity, where each point is the average of all replicates for a given bead type and are colored according to expected genotype and showed that a single base probe extension with fluorescent nucleotides enables accurate genotyping.


Example 4—Multiplexing 12 Samples with a 10,368-Plex Bead Pool

This example demonstrates de-multiplexing multiple samples on a single flow cell, and in particular, the ability to multiplex 12 samples with a 10,368-plex bead pool. A single bead pool was separately hybridized to 12 different index sequences. After hybridization, samples were pooled and loaded on a HiSeq flow cell. Two separate reads were then performed, one to identify the sample based on the hybridized index and a separate read to identify the bead type based on the decode read. FIG. 8 is a histogram of the number of certain bead types in certain binds for a representative sample, and showed that the majority of bead types are present at a level sufficient for genotyping experiments for a given sample. The table in FIG. 7 is a summary of the consistency of de-multiplexing and decoding beads across 12 indexed samples pooled and loaded simultaneously demonstrating that sample representation is uniform and that the majority of probes are present across all samples.


Example 5—Large Parallel SNP Genotyping

This is an example workflow to genotype 384 samples in a single sequencing run. Polynucleotides comprising target nucleic acids which include single nucleotide polymorphism (SNP) of interest are prepared in a 384 well plate. Each well contains DNA from a different subject. The DNA is tagmented with transposomes which fragment the DNA, and add amplification primer binding sites to each end of the fragments. The fragments are amplified with primers that include an index and an index primer binding site to obtain amplified fragments in which at least one end of each amplified fragment includes the index and the index primer binding site. The index is different for each well such that polynucleotides derived from a certain well can be identified by a certain index. The prepared polynucleotides include an index, an index primer binding site, and a target nucleic acid.


A population of beads is added to each well. The population of beads include an oligonucleotide attached to the bead. The oligonucleotide includes a barcode, a barcode primer binding site, and a capture probe. The capture probe is 50 nucleotides in length and is specific for a particular target nucleic acid. The barcode can be used to identify the capture probe. The population of beads include different capture probes. The target nucleic acids hybridize with the capture probes in solution in each well to obtain hybridized beads. The hybridized beads from each well are pooled together and loaded on to a bead array on a flow cell. The hybridized beads are randomly distributed on the array.


On the array, the capture probes are extended by a single nucleotide to identify a SNP. The indexes are sequenced by extending a primer hybridized to the index primer binding sites to identify the source of the target nucleic acid. The barcodes are sequenced by extending a primer hybridized to the index primer binding sites to decode the location of the bead on the array. Particular SNPs are identified and are associate with a particular DNA sample from a particular subject.


The term “comprising” as used herein is synonymous with “including,” “containing,” or “characterized by,” and is inclusive or open-ended and does not exclude additional, unrecited elements or method steps.


The above description discloses several methods and materials of the present invention. This invention is susceptible to modifications in the methods and materials, as well as alterations in the fabrication methods and equipment. Such modifications will become apparent to those skilled in the art from a consideration of this disclosure or practice of the invention disclosed herein. Consequently, it is not intended that this invention be limited to the specific embodiments disclosed herein, but that it cover all modifications and alternatives coming within the true scope and spirit of the invention.


All references cited herein, including but not limited to published and unpublished applications, patents, and literature references, are incorporated herein by reference in their entirety and are hereby made a part of this specification. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.

Claims
  • 1. A method of sequencing target nucleic acids on an array, comprising: (a) obtaining a first and a second population of beads, wherein: the first population of beads comprise oligonucleotides comprising first capture probes, first barcodes, and barcode primer binding sites adjacent to the first barcodes, andthe second population of beads comprise oligonucleotides comprising second capture probes, second barcodes, and barcode primer binding sites adjacent to the second barcodes;(b) obtaining a first and a second plurality of polynucleotides, wherein: the first plurality of polynucleotides comprises first target nucleic acids, wherein the plurality of first polynucleotides is in solution, andthe second plurality of polynucleotides comprises second target nucleic acids, wherein the plurality of second polynucleotides is in solution;(c) hybridizing the first target nucleic acids to the first capture probes to obtain hybridized first beads, and hybridizing the second target nucleic acids to the second capture probes to obtain hybridized second beads;(d) randomly distributing the hybridized first beads and hybridized second bead on an array;(e) decoding the locations of the first and second beads on the array by sequencing the first and second barcodes; and(f) extending the first and second capture probes to obtain nucleic acid sequence data for the first and second target nucleic acids.
  • 2. The method of claim 1, wherein: the first plurality of polynucleotides comprises first indexes, and index primer binding sites adjacent to the first indexes; andthe second plurality of polynucleotides comprises second target nucleic acids, second indexes, and index primer binding sites adjacent to the second indexes.
  • 3. The method of claim 2, wherein the first or second plurality of polynucleotides is obtained by tagmenting a nucleic acid sample with a plurality of transposomes; and wherein the plurality of transposomes comprise the first or second indexes.
  • 4. (canceled)
  • 5. (canceled)
  • 6. (canceled)
  • 7. The method of claim 2, wherein extending the first and second capture probes incorporates sequences complementary to the first and second indexes and to the first and second index primer binding sites into the extended capture probes.
  • 8. The method of claim 1, wherein: the first population of beads comprise first indexes, and index primer binding sites adjacent to the first indexes, andthe second population of beads comprise second indexes, and index primer binding sites adjacent to the second indexes.
  • 9. The method of claim 8, wherein the oligonucleotides of the first population of beads comprise the first indexes; and the oligonucleotides of the second population of beads comprise the second indexes.
  • 10. The method of claim 2, wherein the first indexes are indicative of a source of the first target nucleic acids, and the second indexes are indicative of a source of the second target nucleic acids; and wherein the first indexes are the same as one another, and the second indexes are the same as one another.
  • 11. (canceled)
  • 12. The method of claim 2, further comprising sequencing the first and second indexes.
  • 13. (canceled)
  • 14. (canceled)
  • 15. The method of claim 1, wherein the first and second target nucleic acids are obtained from different nucleic acid samples.
  • 16. (canceled)
  • 17. The method of claim 1, wherein the first and second barcodes are indicative of a nucleic acid sequence of the first or second capture probes; and wherein the first barcodes are different from one another, and the second barcodes are different from one another.
  • 18. (canceled)
  • 19. The method of claim 1, wherein sequencing the first and second barcodes comprises extending primers hybridized to the barcode primer binding sites; and wherein the barcode primer binding sites are the same.
  • 20. (canceled)
  • 21. The method of claim 1, wherein extending the first and second capture probes comprises polymerase extension, or ligating locus specific oligonucleotides to the capture probes.
  • 22. (canceled)
  • 23. (canceled)
  • 24. (canceled)
  • 25. The method of claim 1, wherein step (c) is performed in solution.
  • 26. The method of claim 1, wherein the array is located on the surface of a flowcell; and wherein the first and second beads are adapted to be attached to the array.
  • 27.-66. (canceled)
  • 67. A method of preparing an indexed population of beads, comprising: (a) obtaining a population of beads, wherein each bead comprises an adaptor, a capture probe, and a first polynucleotide comprising a barcode, and a barcode primer binding site;(b) obtaining a plurality of index polynucleotides, wherein each index polynucleotide comprises an index, and an index primer binding site; and(c) attaching the plurality of index polynucleotides to the population of beads via the adaptors, thereby obtaining an indexed population of beads.
  • 68. The method of claim 67, wherein (c) comprises a step selected from (i) extending the adaptors by polymerase extension; (ii) ligating the index polynucleotides to the adaptors; or (iii) attaching the plurality of index polynucleotides to the adaptors of the population of beads via a chemically reactive moiety.
  • 69.-80. (canceled)
  • 81. A method for detecting a target ligand, comprising: (a) obtaining a population of beads, wherein each bead comprises a capture probe, a first polynucleotide comprising a barcode, and a barcode primer binding site;(b) obtaining an index polynucleotide comprising an index, an index primer binding site, and an adaptor capable of binding to the barcode primer binding site;(c) specifically binding a target ligand to the capture probe;(d) hybridizing the index polynucleotide to the first polynucleotide via the adaptor;(e) detecting the target ligand on an array; and(f) determining the sequences of the index of the first polynucleotide and the sequence of the barcode of the first polynucleotide.
  • 82. (canceled)
  • 83. The method of claim 81, wherein (f) comprises hybridizing an index primer to the index primer binding site, and determining the sequence of the index; dehybridizing the index polynucleotide from the first polynucleotide; hybridizing a barcode primer to the barcode primer binding site; and extending the barcode primer to determine the sequence of the barcode.
  • 84. (canceled)
  • 85. The method of claim 81, wherein the index polynucleotide further comprises a cleavable linker located between the adaptor and the index, and (f) comprises: (i) cleaving the cleavable linker; and(ii) extending the adaptor to determine the sequence of the barcode.
  • 86. (canceled)
  • 87. (canceled)
  • 88. (canceled)
  • 89. (canceled)
  • 90. (canceled)
  • 91. (canceled)
  • 92. A method for detecting a target ligand, comprising: (a) obtaining a population of beads, wherein each bead comprises a capture probe, a first polynucleotide comprising a barcode, and a barcode primer binding site, and a second polynucleotide;(b) obtaining an index polynucleotide comprising an index, an index primer binding site, and an adaptor;(c) specifically binding a target ligand to the capture probe;(d) attaching the index polynucleotide to the second polynucleotide via the adaptor by adding a reactive moiety to the second polynucleotide, wherein the adaptor is capable of attaching to the reactive moiety;(e) detecting the target ligand on an array; and(f) determining the sequence of the index of the first polynucleotide and the sequence of the barcode of the first polynucleotide.
  • 93. (canceled)
  • 94. (canceled)
  • 95. (canceled)
  • 96. (canceled)
  • 97. (canceled)
  • 98. (canceled)
  • 99. (canceled)
  • 100. (canceled)
  • 101. (canceled)
  • 102. (canceled)
  • 103. (canceled)
  • 104. (canceled)
  • 105. A method of detecting a target ligand on an array, comprising: (a) obtaining a first and a second population of beads, wherein each bead comprises: a capture probe, wherein the capture probe is capable of specifically binding to a target ligand,a nucleic acid encoding a barcode and a barcode primer binding site, wherein the barcode is indicative of the capture probe, anda nucleic acid encoding an index and an index primer binding site, wherein the index is indicative of the source of the bead from the first population or the second population, and(b) contacting the first population of beads with a first sample comprising a first target ligand, wherein the first target ligand specifically binds to a capture probe of the first population of beads, and thereby obtaining a target-bound first population of beads;(c) contacting the second population of beads with a second sample comprising a second target ligand, wherein the second target ligand specifically binds to a capture probe of the second population of beads, and thereby obtaining a target-bound second population of beads;(d) randomly distributing the target-bound first population of beads and the target-bound second population of beads on an array;(e) detecting the location of the beads comprising the first target ligand and the second target ligand on the array; and(f) determining the sequence of the index and of the barcode of the beads comprising the first target ligand and the second target ligand on the array.
  • 106. The method of claim 105, wherein the capture probe comprises a polynucleotide, or a protein.
  • 107. (canceled)
  • 108. (canceled)
  • 109. (canceled)
  • 110. The method of claim 105, wherein step (e) is performed after step (f).
  • 111. The method of claim 105, wherein the barcodes of the first population of beads comprise barcodes different from one another, and the barcodes of the second population of beads comprise barcodes different from one another; and wherein the indexes of the first population of beads are the same as one another, and the indexes of the second population of beads are the same as one another, and different from the indexes of the second population of beads.
  • 112. (canceled)
  • 113. The method of claim 105, wherein the array is located on the surface of a flowcell.
  • 114.-116. (canceled)
RELATED APPLICATIONS

This application claims priority to U.S. Prov. App. 62/909,014 filed Oct. 1, 2019 entitled “METHODS AND COMPOSITIONS FOR SEQUENCING NUCLEIC ACIDS ON AN ARRAY”; and U.S. Prov. App. 62/903,108 filed Sep. 20, 2019 entitled “METHODS AND COMPOSITIONS FOR HIGH-THROUGHPUT GENOTYPING ON ARRAYS USING INDEXES AND BARCODES” which are each incorporated by reference in its entirety.

Provisional Applications (2)
Number Date Country
62909014 Oct 2019 US
62903108 Sep 2019 US