The present disclosure relates to materials and methods for large-scale spatial transcriptomics. In particular, the disclosure provides materials and methods for determining the spatial location of a desired nucleic acid, such as RNA, within a tissue sample.
Currently used methods for determining the spatial location of gene expression in a tissue sample are limited by various factors, including poor resolution and limited ability to use large tissue samples. Accordingly, improved methods for determining the spatial location of nucleic acids in a tissue sample are needed, in particular methods that permit detection of the spatial location of a nucleic acid with high resolution and methods that allow for large tissue samples to be utilized.
In some aspects, provided herein are methods of producing systems for spatial detection of nucleic acid in a tissue sample. In some embodiments, methods for producing a system for spatial detection of nucleic acid in a tissue sample comprise providing a support comprising an array of surface probes and performing reactions directly on the support, thereby creating a system comprising spots of capture probes. In some embodiments, each surface probe on the support comprises a first anchor sequence, a spatial barcode, and a second anchor sequence. In some embodiments, the method comprises hybridizing a first complementary oligonucleotide to the first anchor sequence of each surface probe and hybridizing a second complementary oligonucleotide to the second anchor sequence of each surface probe. In some embodiments, the second oligonucleotide comprises a nucleic acid capture region (e.g. a capture domain) and a unique molecular identifier. In some embodiments, the method comprises performing an extension-ligation reaction on the support. The extension-ligation reaction performed directly on the support is also referred to herein as an “on-slide” reaction. In some embodiments, the extension-ligation reaction comprises extending the first complementary nucleotide with a sequence complementary to the spatial barcode of the surface probe, and ligating the extended first complementary nucleotide and the second complementary nucleotide together to form a contiguous capture oligonucleotide. The contiguous capture oligonucleotide comprises a nucleic acid capture region, a unique molecular identifier, a first anchor sequence, a spatial barcode, and a second anchor sequence.
In some embodiments, for each surface probe, the first anchor sequence, the second anchor sequence, and the spatial barcode each comprise 10-30 nucleotides. In some embodiments, for each surface probe the first anchor sequence comprises 20-30 nucleotides, the second anchor sequence comprises 10-20 nucleotides, and the spatial barcode comprises 15-25 nucleotides. In some embodiments, for each surface probe the first anchor sequence comprises 24 nucleotides, the second anchor sequence comprises 16 nucleotides, and the spatial barcode comprises 18 nucleotides. In some embodiments, for each surface probe the second anchor sequence comprises 40% to 60% guanosine and/or cytosine (G/C) bases. In some embodiments, the nucleic acid capture region comprises at least 10 deoxythymidine residues.
In some embodiments, performing the extension ligation reaction on the support comprises adding a DNA polymerase and a DNA ligase to the support under conditions such that the reverse complement of the spatial barcode sequence is synthesized and ligated to the first complementary nucleotide and to the second complementary nucleotide, thereby forming the continuous capture probe.
In some aspects, provided herein are systems for spatial detection of nucleic acid. In some embodiments, provided herein are systems for spatial detection of nucleic acid in a tissue sample. For example, in some embodiments provided herein are systems for spatial detection of RNA in a tissue sample.
In some embodiments, the system comprises a plurality of spots immobilized on a support. In some embodiments, each spot comprises a plurality of capture oligonucleotides. In some embodiments, each capture oligonucleotide comprises a nucleic acid capture region, a unique molecular identifier, a first anchor sequence, and a spatial barcode. In some embodiments, each capture oligonucleotide in a single spot comprises the same spatial barcode. In some embodiments, the spatial barcode for each distinct spot is unique. In some embodiments, each capture oligonucleotide further comprises a second anchor sequence.
In some embodiments, the support comprises a working surface area of at least 2 cm2. Accordingly, the systems and methods described herein facilitate spatial detection of nucleic acids in a large (e.g. greater than 2 cm2 area) tissue slice. In some embodiments, the support comprises a working surface area of at least 5 cm2. In some embodiments, the support comprises a working surface area of at least 10 cm2. In some embodiments, the working surface area is substantially circular in shape. In some embodiments, the working surface area is substantially rectangular in shape. In some embodiments, the working surface area is substantially square shaped.
In some embodiments, the working surface area of the support comprises at least 200 spots/mm2. In some embodiments, the working surface area of the support comprises at least 400 spots/mm2. In some embodiments, the working surface area of the support comprises at least 800 spots/mm2. In some embodiments, the nucleic acid capture region comprises at least 10 deoxythymidine residues.
In some aspects, provided herein are methods of making a system for spatial detection of nucleic acid in a tissue sample. In some embodiments, the method comprises providing a support comprising an array of surface probes, each surface probe comprising a first anchor sequence, a spatial barcode, and a second anchor sequence. In some embodiments, the method further comprises hybridizing a first complementary oligonucleotide to the first anchor sequence of each surface probe. In some embodiments, the method further comprises hybridizing a second complementary oligonucleotide to the second anchor sequence of each surface probe. In some embodiments, the second oligonucleotide further comprises a nucleic acid capture region and a unique molecular identifier. In some embodiments, the method further comprises extending the first complementary nucleotide with a sequence complementary to the spatial barcode of the surface probe, such that the first complementary nucleotide and the second complementary nucleotide form a contiguous capture oligonucleotide. The capture oligonucleotide comprises a nucleic acid capture region, a unique molecular identifier, a first anchor sequence, a spatial barcode, and a second anchor sequence.
In some aspects, provided herein are kits. In some embodiments, provided herein is a kit comprising a system as described herein.
In some embodiments, provided herein are methods for spatial detection of nucleic acid in a tissue sample. In embodiments, provided herein are methods for spatial detection of RNA in a tissue sample. In some embodiments, the method comprises contacting a system as described herein with a tissue sample.
In some embodiments, the method for spatial detection of RNA in a tissue sample, comprising contacting a system described herein with a tissue sample, such that RNA within the tissue sample to binds to the capture oligonucleotides. In some embodiments, the method further comprises reverse-transcribing the bound RNA to generate cDNA. In some embodiments, the method further comprises sequencing the cDNA. In some embodiments, the method further comprises correlating the spatial barcode for each sequenced cDNA molecule with the location of the spot on the support having a corresponding spatial barcode. In some embodiments, the method further comprises imaging the tissue and/or staining the tissue before or after sequencing the nucleic acid molecules. In some embodiments, the method further comprises determining the spatial location of the sequenced cDNA molecules within the tissue sample by correlating the location of the spot on the support with a corresponding location within the tissue sample.
In some embodiments, provided herein is a method for spatial detection of RNA in a tissue sample performed using a hybridized tissue sample. In some embodiments, the method comprises hybridizing a first probe and a second probe to a target RNA sequence in a tissue sample, wherein the first probe and the second probe each comprise an RNA binding region complementary to the target RNA sequence. In some embodiments, the method further comprises ligating the RNA binding region of the first probe to the RNA binding region of the second probe, thereby forming an extended probe that is hybridized to the target RNA sequence in the tissue sample. In some embodiments, the method comprises contacting the tissue sample with a system described herein, such that extended probes bind to the capture oligonucleotides, reverse-transcribing the bound extended probes to generate cDNA, and sequencing the cDNA. In some embodiments, the first probe further comprises a capture oligonucleotide binding region complementary to the nucleic acid capture domain of a capture oligonucleotide. In some embodiments, the second probe further comprises a sequencing handle. In some embodiments, the tissue sample is a fresh frozen sample or a formalin-fixed, paraffin-embedded (FFPE) tissue sample. In some embodiments, the tissue sample is an FFPE tissue sample, and the method further comprises deparaffinizing the tissue sample and decrosslinking RNA in the tissue sample prior to hybridizing the first probe and the second probe to the target RNA sequence.
In some embodiments, the method further comprises correlating a spatial barcode for each sequenced cDNA molecule with the location of the spot on the support having a corresponding spatial barcode. In some embodiments, the method further comprises imaging the tissue and/or staining the tissue before or after sequencing the nucleic acid molecules. In some embodiments, the method comprises imaging and/or staining the tissue before hybridizing the first and second probe to the target RNA sequence in the tissue. In some embodiments, the method comprises imaging and/or staining the tissue after hybridizing the first and second probe to the target RNA sequence in the tissue. In some embodiments, the method further comprises determining the spatial location of the sequenced cDNA molecules within the tissue sample by correlating the location of the spot on the support with a corresponding location within the tissue sample. In some embodiments, the tissue sample has a surface area of at least 2 cm2. In some embodiments, the tissue sample has a surface area of at least 5 cm2. In some embodiments, the tissue has a surface area of at least 10 cm2.
In some embodiments, provided herein is a method for spatial detection of RNA in a tissue sample, comprising obtaining a tissue sample having a surface area of at least 2 cm2, wherein the tissue sample is stabilized on an adhesive film, and transferring the tissue sample from the adhesive film to a system described herein. In some embodiments, the method comprises transferring the tissue sample from the adhesive film to a system described herein, such that RNA within the tissue sample to binds to the capture oligonucleotides. In some embodiments, the method further comprises reverse-transcribing the bound RNA to generate cDNA, and sequencing the cDNA. In some embodiments, transferring the tissue sample comprises mounting the stabilized tissue sample on the support and dissolving the adhesive film in hexane, thereby transferring the tissue sample from the adhesive film to the support. In some embodiments, the method further comprises correlating a spatial barcode for each sequenced cDNA molecule with the location of the spot on the support having a corresponding spatial barcode. In some embodiments, the method further comprises imaging the tissue and/or staining the tissue after transferring the tissue sample from the adhesive film to the support. In some embodiments, the tissue is imaged and/or stained before or after sequencing the cDNA. In some embodiments, the method further comprises determining the spatial location of the sequenced cDNA molecules within the tissue sample by correlating the location of the spot on the support with a corresponding location within the tissue sample. In some embodiments, the tissue sample has a surface area of at least 2 cm2. In some embodiments, the tissue has a surface area of at least 5 cm2. In some embodiments, the tissue has a surface area of at least 10 cm2.
Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments described herein, some preferred methods, compositions, devices, and materials are described herein. However, before the present materials and methods are described, it is to be understood that this invention is not limited to the particular molecules, compositions, methodologies or protocols herein described, as these may vary in accordance with routine experimentation and optimization. It is also to be understood that the terminology used in the description is for the purpose of describing the particular versions or embodiments only, and is not intended to limit the scope of the embodiments described herein.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. However, in case of conflict, the present specification, including definitions, will control. Accordingly, in the context of the embodiments described herein, the following definitions apply.
As used herein and in the appended claims, the singular forms “a”, “an” and “the” include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to “a peptide amphiphile” is a reference to one or more peptide amphiphiles and equivalents thereof known to those skilled in the art, and so forth.
As used herein, the term “comprise” and linguistic variations thereof denote the presence of recited feature(s), element(s), method step(s), etc. without the exclusion of the presence of additional feature(s), element(s), method step(s), etc. Conversely, the term “consisting of” and linguistic variations thereof, denotes the presence of recited feature(s), element(s), method step(s), etc. and excludes any unrecited feature(s), element(s), method step(s), etc., except for ordinarily-associated impurities. The phrase “consisting essentially of” denotes the recited feature(s), element(s), method step(s), etc. and any additional feature(s), element(s), method step(s), etc. that do not materially affect the basic nature of the composition, system, or method. Many embodiments herein are described using open “comprising” language. Such embodiments encompass multiple closed “consisting of” and/or “consisting essentially of” embodiments, which may alternatively be claimed or described using such language.
As used herein, the term “oligonucleotide” refers to a molecule comprising two or more nucleotides (e.g., deoxyribonucleotides or ribonucleotides). The terms “nucleotide” and “base” are used interchangeably when used in reference to a nucleic acid sequence, unless indicated otherwise herein. The term “nucleotide” or “base” is inclusive of naturally occurring and synthetic bases. “Oligonucleotides” are typically less than 200 residues long (e.g., between 2 and 100 nucleotides), however, as used herein, the term is also intended to encompass longer polynucleotide chains.
The architecture of a tissue determines its function and malfunction in health and disease. Therefore, technologies to image and measure the cells and molecules organized in tissues are fundamental to biomedical research and clinical pathology. Recent advances in the field of spatially resolved transcriptomics (ST) have enabled the sequencing of the transcriptome associated with specific areas of tissue sections. ST technologies have not spread beyond their labs of origin due to, for example, the need for complex instrumentation and expertise and the lack of access to key reagents, instruments, or well-documented protocols. One exception is the commercially available kit (Visium, 10× Genomics) which has been adopted by several institutions, but this commercial platform remains limited by its low resolution (55-μm spots with 100 μm center-to-center distances between them), small surface area (<0.5 cm2), and high cost (˜$25/mm2 without sequencing). In addition, the limitations of this commercial platform compound with the issues of standard freeze-sectioning techniques that are prone to section loss and damage. Taken together, these gaps in ST and histology technologies prevent spatiomolecular analyses across many samples, small or large, for detailed studies of temporal processes, patient cohorts, or spatial atlases in two or three dimensions. The systems for spatial detection of nucleic acid in a tissue sample and methods of making systems for spatial detection of nucleic acid in a tissue sample provided herein address these and other issues. In particular, the systems and methods described herein enable high resolution, high sensitivity, and affordable methods for spatial detection of nucleic acid in a tissue sample, including whole animal tissue samples.
The systems and methods of producing systems for spatial transcriptomics provided herein enable analysis of small-to-large samples ranging from a biopsy to whole-mount sections of human organs or adult rodents. Resulting data sets encompass large numbers of cells from a single experiment. For example, data sets can encompass ˜4-6M cells in a whole, adult mouse section or ˜1-2M cells in a whole mount human colon section. These large format arrays also enable high throughput ST profiling, enabling generation of 3D ST data using serial sectioning techniques, the creation of ST atlases across model organisms, and/or the cost-effective processing of fixed samples from large cohorts of patients. The systems and methods described herein use an adhesive film to support sections from tissues or larger samples such as whole-mount rodents or human organs, while preserving RNA for ST compatibility and dramatically increasing the quality and reproducibility of sectioning (little to no section loss).
In some aspects, provided herein are systems for spatial detection of nucleic acid in a tissue sample. In some aspects, provided herein are methods for producing systems for spatial detection of nucleic acid in a tissue sample. Spatial detection of nucleic acid in a tissue sample is also referred to herein as “spatial transcriptomics” or “ST”. The systems for spatial detection of nucleic acid in a tissue sample provided herein are also referred to herein as an “array”, a “custom array”, or an “ST array”. In some embodiments, the systems are used for spatial detection of RNA in a tissue sample. In some embodiments, the substrates may be used for spatial detection of RNA transcripts (e.g., mRNA) in a tissue sample.
The tissue sample can be any suitable tissue sample, including human tissues and tissues from non-human subjects (e.g. vertebrates).
In some embodiments, provided herein are systems for spatial detection of nucleic acids in a tissue sample. In some embodiments, the system comprises a plurality of spots immobilized on a support. In some embodiments, each spot comprises a plurality of capture oligonucleotides. In some embodiments, each capture oligonucleotide comprises a nucleic acid capture region, a unique molecular identifier, a first anchor sequence, and a spatial barcode. In some embodiments, each capture oligonucleotide further comprises a second anchor sequence. In some embodiments, the 5′ end of each capture oligonucleotide is proximal to the support, and the 3′ end of each capture oligonucleotide is furthest away from the support. In some embodiments, each capture oligonucleotide comprises, from 5′ end to 3′ end, a first anchor sequence, a spatial barcode, a second anchor sequence, a unique molecular identifier, and a nucleic acid capture region. In some embodiments, the capture oligonucleotides are immobilized to the support via hybridization with surface probes that are immobilized on the support.
In some embodiments, each capture oligonucleotide in a single spot comprises the same spatial barcode. In some embodiments, the spatial barcode for each distinct spot is unique. Accordingly, in some embodiments no two spots on the support comprise capture oligonucleotides comprising the same spatial barcode.
In some embodiments, the support comprises a working surface area of at least 1 cm2. The term “working surface area” as used herein refers to a surface area of the support (e.g. a section of the support) that is substantially occupied by spots. In other words, the “working surface area” refers to the area of the support that is densely populated with spots (e.g. contains at least 200 spots/mm2). The “working surface area” of the support comprises a portion of the total surface area of the support. For example, the support may comprise a total surface area that is larger than the working surface area, thereby providing regions on the support that can be touched without impacting the capture probes immobilized within the “working surface area” of the support. In some embodiments, the support comprises a working surface area of at least 1 cm2, 2 cm2, 3 cm2, 4 cm2, 5 cm2, 6 cm2, 7 cm2, 8 cm2, 9 cm2, 10 cm2, 11 cm2, 12 cm2, 13 cm2, 14 cm2, 15 cm2, 16 cm2, 17 cm2, 18 cm2, 19 cm2, or 20 cm2. The support may comprise any suitable shape, including square, rectangular, hexagonal, or circular. The working surface area of the support may be any suitable shape, including substantially square, substantially rectangular, substantially circular, etc. In some embodiments, the working surface area is sufficient large to enable spatial transcriptomic analysis of an equally large tissue sample. For example, in some embodiments the working surface area is at least 2 cm2 (e.g. at least 2 cm2, at least 3 cm2, at least 5 cm2, at least 7 cm2, at least 10 cm2, at least 12 cm2) and the tissue sample has a surface area of approximately the same size as the working surface area. In some embodiments, the working surface area of the support enables spatial transcriptomic analysis of a tissue sample (e.g. a tissue slice) having a surface area of at least 2 cm2 (e.g. at least 2 cm2, at least 3 cm2, at least 4 cm2, at least 5 cm2, at least 6 cm2, at least 7 cm2, at least 8 cm2, at least 9 cm2, at least 10 cm2, at least 11 cm2, at least 12 cm2, at least 13 cm2, at least 14 cm2, at least 15 cm2, at least 16 cm2, at least 17 cm2, at least 18 cm2, at least 19 cm2, or about 20 cm2.
The support may comprise any suitable material, including glass and/or plastics. The material may be porous or non-porous. In some embodiments, the support comprises a material selected from glass, silicon, poly-L-lysine coated materials, nitrocellulose, polystyrene, cyclic olefin copolymers (COCs), cyclic olefin polymers (COPs), polypropylene, polyethylene and polycarbonate. In some embodiments, the support comprises a solid material, such as glass and/or plastic. In other embodiments, the support comprises a gel.
In some embodiments, the working surface area of the support comprises at least 200 spots/mm2. In some embodiments, the working surface area of the support comprises at least 200 spots/mm2, at least 250 spots/mm2, at least 300 spots/mm2, at least 350 spots/mm2, at least 400 spots/mm2, at least 450 spots/mm2, at least 500 spots/mm2, at least 550 spots/mm2, at least 600 spots/mm2, at least 650 spots/mm2, at least 700 spots/mm2, at least 750 spots/mm2, or at least 800 spots/mm2.
In some embodiments, the support comprises at least 10,000 total spots. For example, the support may comprise at least 10,000, at least 15,000, at least 20,000, at least 25,000, at least 30,000, at least 35,000, at least 40,000, at least 45,000, or at least 50,000 total spots.
In some embodiments, each spot comprises at least 500 capture oligonucleotides. For example, each spot may comprise at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1100, at least 1200, at least 1300, at least 1400, at least 1500, at least 1600, at least 1700, at least 1800, at least 1900, at least 2000, at least 2100, at least 2200, at least 2300, at least 2400, at least 2500, at least 2600, at least 2700, at least 2800, at least 2900, or at least 3000 capture oligonucleotides.
In some embodiments, each capture oligonucleotide comprises a nucleic acid capture domain. The terms “nucleic acid capture domain”, “capture domain”, “nucleic acid capture region”, and “capture region” are used interchangeably herein. In some embodiments, the nucleic acid capture region comprises a poly-T oligonucleotide (e.g. a series of consecutive deoxythymidine residues linked by phosphodiester bonds). A poly-T oligonucleotide may also be referred to as oligo (dT) or an oligo (dT) tail. For example, in some embodiments, the capture domain comprises a poly-T oligonucleotide comprising at least 10 deoxythymidine residues. In some embodiments, the nucleic acid capture region comprises nucleotides which are analogous to poly-T and retain the functional property of binding to the poly-A tail of mRNA. For example, the capture domain may comprise a poly-U oligonucleotide.
In some embodiments, each capture oligonucleotide comprises a unique molecular identifier (UMI). Each UMI comprises a nucleotide sequence. The UMI may be a nucleotide sequence of any suitable length. In some embodiments, the UMI comprises 5-50 nucleotides. In some embodiments, the UMI comprises 5-40 nucleotides. In some embodiments, the UMI comprises 5-30 nucleotides. In some embodiments, the UMI comprises 5-25 nucleotides. In some embodiments, the UMI comprises 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 nucleotides. In some embodiments, the UMI comprises 17 nucleotides.
In some embodiments, each capture oligonucleotide comprises a spatial barcode. The spatial barcode may be an oligonucleotide of any suitable length. In some embodiments, the spatial barcode sequence of a capture oligonucleotide depends on the spatial barcode sequence of the surface probe to which the capture oligonucleotide was hybridized. For example, in some embodiments, the spatial barcode comprises 10-100 nucleotides. In some embodiments, the spatial barcode comprises 10-50 nucleotides. In some embodiments, the spatial barcode comprises 10-30 nucleotides. In some embodiments, the spatial barcode comprises 15-25 nucleotides. In some embodiments, the spatial barcode comprises 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, the spatial barcode comprises 18 nucleotides.
In some embodiments, each capture oligonucleotide comprises at least one anchor sequence. In some embodiments, the capture oligonucleotide comprises a first anchor sequence and a second anchor sequence. In some embodiments, the first anchor sequence is proximal to the support. In some embodiments, the first anchor sequence and the second anchor sequence are separated by the spatial barcode. The first anchor sequence and/or the second anchor sequence may comprise any suitable number of contiguous nucleotides.
In some embodiments, the plurality of capture oligonucleotides within each spot are immobilized on the support. The capture oligonucleotides may be immobilized on the support by hybridization with surface probes. For example, a support comprising an array of surface probes may be provided, and the capture oligonucleotides described herein may be immobilized to the surface probes. For example, a suitable method for generating the systems described herein is shown in
In some embodiments, the systems described herein are generated by providing a support comprising an array of surface probes. In some aspects, provided herein are methods of producing systems for spatial detection of nucleic acid in a tissue sample. In some embodiments, methods for producing a system for spatial detection of nucleic acid in a tissue sample comprise providing a support comprising an array of surface probes and performing reactions directly on the support, thereby creating a system comprising a plurality of spots, each spot containing a plurality of capture probes. In some embodiments, each surface probe on the support comprises a first anchor sequence, a spatial barcode, and a second anchor sequence. In some embodiments, the method comprises hybridizing a first complementary oligonucleotide to the first anchor sequence of each surface probe and hybridizing a second complementary oligonucleotide to the second anchor sequence of each surface probe. In some embodiments, the second oligonucleotide comprises a nucleic acid capture region (e.g. a capture domain) and a unique molecular identifier. In some embodiments, the method comprises performing an extension-ligation reaction on the support. The extension-ligation reaction performed directly on the support is also referred to herein as an “on-slide” reaction. In some embodiments, the extension-ligation reaction comprises extending the first complementary nucleotide with a sequence complementary to the spatial barcode of the surface probe, and ligating the extended first complementary nucleotide and the second complementary nucleotide together to form a contiguous capture oligonucleotide. The contiguous capture oligonucleotide comprises a nucleic acid capture region, a unique molecular identifier, a first anchor sequence, a spatial barcode, and a second anchor sequence.
In some embodiments, each surface probe comprises a first anchor sequence, a spatial barcode, and a second anchor sequence. A first complementary nucleotide may be hybridized to the first anchor sequence of a surface probe (for example, as shown in
In some embodiments, the first anchor sequence of a surface probe (e.g. a surface probe immobilized on the support) hybridizes to an anchor sequence of a first complementary oligonucleotide that ultimately forms a capture probe upon extension to be joined with a second complementary oligonucleotide, as described above. Any suitable size and base composition of the first anchor sequence of a surface probe may be used. In some embodiments, the first anchor sequence of a surface probe is selected for compatibility with a desired sequencing primer or system. In some embodiments, the first anchor sequence comprises 10-30 bases. In some embodiments, the first anchor sequence comprises 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 bases. In some embodiments, the first anchor sequence comprises 24 bases.
In some embodiments, the second anchor sequence of a surface probe hybridizes to an anchor sequence of a second complementary oligonucleotide that ultimately forms a capture probe, as described above. Any suitable size and composition of the second anchor sequence of a surface probe may be used. In some embodiments, the second anchor sequence comprises 10-30 bases. In some embodiments, the second anchor sequence comprises 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 bases. In some embodiments, the second anchor sequence comprises 16 bases.
In some embodiments, the spatial barcode sequence of a surface probe (and thus the spatial barcode sequence of a capture probe generated following hybridization to the surface probe) comprises 10-100 bases. In some embodiments, the spatial barcode sequence of a surface probe comprises 10-50 bases. In some embodiments, the spatial barcode sequence of a surface probe comprises 10-30 nucleotides. In some embodiments, the spatial barcode sequence of a surface probe comprises 15-25 nucleotides. In some embodiments, the spatial barcode sequence of a surface probe comprises 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, the spatial barcode sequence of a surface probe comprises 18 nucleotides.
In some embodiments, the first anchor sequence, the spatial barcode sequence, and the second anchor sequence are designed to avoid self-dimerization. Suitable commercial packages may be used to optimize the design of the first anchor sequence, spatial barcode sequence, and second anchor sequence of the surface probes. For example, the commercial package DNABarcodes26 may be used for design of optimal surface probe sequences (e.g. first anchor sequence, spatial barcode sequence, and second anchor sequence).
In some embodiments, for each surface probe, the first anchor sequence, the second anchor sequence, and the spatial barcode each comprise 10-30 nucleotides. In some embodiments, for each surface probe the first anchor sequence comprises 20-30 nucleotides, the second anchor sequence comprises 10-20 nucleotides, and the spatial barcode comprises 15-25 nucleotides. For example, in some embodiments for each surface probe the first anchor sequence, the second anchor sequence, and the spatial barcode each comprise 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, for each surface probe the first anchor sequence comprises 24 nucleotides, the second anchor sequence comprises 16 nucleotides, and the spatial barcode comprises 18 nucleotides.
In some embodiments, for each surface probe the second anchor sequence comprises 40% to 60% guanosine and/or cytosine (G/C) bases. For example, in some embodiments for each surface probe the second anchor sequence comprises about 40%, about 42.5%, about 45%, about 47.5%, about 50%, about 52.5%, about 55%, about 57.5%, or about 60% G/C bases. In some embodiments, the distribution of G/C bases is substantially uniform throughout the second anchor sequence. In some embodiments, the distribution of G/C bases is biased towards the 5′ end of the second anchor sequence. In some embodiments, the distribution of G/C bases is biased towards the 3′ end of the second anchor sequence.
In some embodiments, the nucleic acid capture region comprises at least 10 deoxythymidine residues.
In some embodiments, performing the extension ligation reaction on the support comprises adding a DNA polymerase and a DNA ligase to the support under conditions such that the reverse complement of the spatial barcode sequence is synthesized and ligated to the first complementary nucleotide and to the second complementary nucleotide, thereby forming the continuous capture probe.
In some aspects, provided herein are methods for spatial detection of nucleic acid in a tissue sample. The methods comprise contacting the tissue sample with a system as described herein. In some embodiments, the nucleic acid is RNA. In some embodiments, provided herein are methods for spatial detection of RNA in a tissue sample.
In some embodiments, provided herein is a method for spatial detection of RNA in a tissue sample comprising contacting a system described herein with a tissue sample, such that RNA within the tissue sample to binds to the capture oligonucleotides. For example, in some embodiments the method comprises contacting a system as described herein with a tissue sample, such that RNA within the tissue sample hybridizes to the capture domain of a capture probe, which capture probe is bound to the support (e.g. array surface), such as by hybridization to a surface probe immobilized on the surface of the support. For example, the poly-A tail of RNA molecules (e.g. mRNA) may bind to the poly-T (or functionally equivalent) domain of the capture oligonucleotides. This is shown in
In some embodiments, the method further comprises reverse-transcribing the bound RNA to generate cDNA. In some embodiments, the method further comprises sequencing the cDNA. For example, in some embodiments the method comprises capturing the cDNA, and sequencing the captured cDNA.
In some embodiments, methods for spatial detection of nucleic acid are performed using a hybridized tissue sample. In some embodiments, provided herein is a method for spatial detection of nucleic acid in a tissue sample comprising hybridizing the tissue sample with a suitable hybridization probe(s). In some embodiments, the hybridization probe(s) bind to target nucleic acid (e.g. target RNA) in the tissue sample, and also bind to the capture oligonucleotides of the array (e.g. the capture oligonucleotides bound to the surface of the array via hybridization to capture probes immobilized on the array surface). In some embodiments, the method for spatial detection of RNA in a tissue sample comprise hybridizing a first probe and a second probe to a target RNA sequence in a tissue sample. An exemplary embodiment involving a first probe and a second probe is shown, for example, in
In some embodiments, provided herein are methods for spatial detection of RNA in a tissue sample having a surface area of at least 2 cm2. In some embodiments, the method comprises obtaining a tissue sample having a surface area of at least 2 cm2, wherein the tissue sample is stabilized on an adhesive film. In some embodiments, the tissue has a surface area of at least 2 cm2, at least 3 cm2, at least 4 cm2, at least 5 cm2, at least 6 cm2, at least 7 cm2, at least 8 cm2, at least 9 cm2, at least 10 cm2, at least 11 cm2, at least 12 cm2, at least 13 cm2, at least 14 cm2, at least 15 cm2, at least 16 cm2, at least 17 cm2, at least 18 cm2, at least 19 cm2, or at least 20 cm2. For example, as shown in
An exemplary method for stabilizing a tissue sample (e.g. a tissue slice) on an adhesive film is shown in
In some embodiments, the method comprises transferring the tissue sample from the adhesive film to a system (e.g. array) as described herein. In some embodiments, transferring the tissue sample comprises mounting the stabilized tissue sample on the support (e.g. on the surface of the array) and dissolving the adhesive film in hexane, thereby transferring the tissue sample from the adhesive film to the support. In some embodiments, the method comprises transferring the tissue sample from the adhesive film to a system described herein, such that RNA within the tissue sample to binds to the capture oligonucleotides on the support (e.g. on the surface of the array). In some embodiments, the method comprises reverse-transcribing the bound RNA to generate cDNA, and sequencing the cDNA. In some embodiments, the method further comprising correlating a spatial barcode for each sequenced cDNA molecule with the location of the spot on the support having a corresponding spatial barcode. In some embodiments, the method further comprising imaging the tissue and/or staining the tissue after transferring the tissue sample from the adhesive film to the support.
In some embodiments, the method comprises transferring the tissue from the adhesive film to a system described herein, and hybridizing the tissue sample with one or more hybridization probes. Suitable methods and hybridization probes are described above. For example, in some embodiments the method comprises hybridizing the tissue sample with a first probe and a second probe that bind to a target RNA sequence in the tissue sample. For example, in some embodiments the first probe and the second probe each comprise an RNA binding region complementary to the target RNA sequence. In some embodiments, the method further comprises ligating the RNA binding region of the first probe to the RNA binding region of the second probe, thereby forming an extended probe that is hybridized to the target RNA sequence in the tissue sample. In some embodiments, the method comprises permitting extended probes to bind to capture oligonucleotides on the on the support, reverse-transcribing the bound extended probes to generate cDNA; and sequencing the cDNA. In some embodiments, permitting extended probes to bind to capture oligonucleotides on the support comprises digesting RNA, thereby releasing the extended probes from the target RNA and permitting binding of the extended probes to the capture oligonucleotides of the array.
For any of the methods described herein. cDNA may be generated by reverse-transcription. The cDNA generated is complementary to the bound RNA, and is therefore indicative of the RNA present in in a given cell at the time the tissue sample was obtained from the source organism. As shown in
Generally speaking, reverse transcription is performed by adding a reverse transcriptase and suitable additional reagents (e.g. dNTPS, buffers, RNAse inhibitors, etc.) and holding the mixture at a suitable temperature for a suitable duration of time. Any suitable reverse transcriptase enzyme may be used, such as M-MLV, MuLV, AMV, HIV, ArrayScript™, MultiScribe™, ThermoScript™, or Superscript® I, II, and III enzymes. Typically, reverse transcriptase reactions are performed between 37-55° C. The reaction time may be as little as 1, minute, or as much as 48 hours. In some embodiments, the reverse transcription reaction is carried out at 42° C. for about 2 hours. In some embodiments, a template switch oligonucleotide (TSO) may be added during reverse transcription. For example, a TSO may be used to add a common 5′ sequence to the cDNA molecules that may be used for subsequent cDNA amplification.
In some embodiments, the methods described herein further comprise sequencing the cDNA. In some embodiments, the method comprises sequencing the strand of cDNA generated directly from reverse transcription of the bound RNA. This strand is referred to as the “first strand” cDNA. In some embodiments, a strand of cDNA complementary to the first strand cDNA is generated, referred to as a “second strand” cDNA. In some embodiments, sequencing cDNA comprises sequencing the second strand and/or the first strand cDNA. In some embodiments, the method comprises releasing the cDNA from the support, and optionally amplifying the cDNA, prior to sequencing. Sequencing may be performed by any suitable method. Sequencing is generally performed using one or multiple amplification steps, such as polymerase chain reaction (PCR). In some embodiments, sequencing may be performed using next-generation sequencing methods, such as high-throughput sequencing, sequencing by synthesis (e.g. ILLUMINA technology), pyrosequencing, and the like.
In some embodiments, the methods described herein further comprise correlating the spatial barcode for each sequenced cDNA molecule with the location of the spot on the support having a corresponding spatial barcode. In some embodiments, the location of each spot on the support may be determined prior to contacting the tissue sample with the system. In some embodiments, the location of each spot on the support is known. For example, a map or layout of the support may be provided to a user of the system, such that the user knows the spatial location of each spot (e.g. the location of a given spatial barcode) on the support. Accordingly, following determining the sequence of a given cDNA molecule, the location of that molecule on the support may be correlated with the known location of a spot containing the corresponding spatial barcode. Furthermore, the location of a corresponding nucleic acid on the tissue sample can be determined. For example, the location on the support comprising a given spatial barcode can be aligned with a given area of the tissue that was in contact with that location on the support during cDNA synthesis. Accordingly, the sequence of the spatial barcode of a given sequenced cDNA molecule can be correlated with a corresponding spatial barcode on the support, the location of which can be used to determine a precise area of tissue that was in contact with that location on the support. Accordingly, RNA expression (e.g. mRNA expression, indicative of gene expression) in a specific location of a tissue sample can be determined.
In some embodiments, the methods described herein further comprise imaging and/or staining the tissue. For example, the tissue may be imaged and/or stained before and/or after contacting the tissue with the support. Imaging and/or staining the tissue may assist in determining the spatial location of a given nucleic acid within the tissue. For example, imaging and/or staining the tissue may assist in correlating the spot on the support having a given spatial barcode (e.g. a spatial barcode corresponding to a given sequenced cDNA molecule) with a location on the tissue that was in contact with that spot on the support during cDNA synthesis. As another example, imaging and/or staining the tissue may assist in highlighting various regions within the organism itself. For example, as shown in
The methods and substrates described herein may be used with any suitable tissue sample. The tissue may be fresh or frozen. In some embodiments, the tissue is a fresh-frozen tissue. The term “fresh-frozen” or “fresh frozen” when used in reference to a tissue refers to a tissue that was frozen after being obtained, without any fixation steps in-between. In some embodiments, the tissue may be fixed (e.g. formalin fixed). In some embodiments, the methods and systems described herein enable spatial detection of nucleic acid in a larger tissue sample than possible using previous spatial transcriptomics systems. For example, the tissue may have a surface area of at least 0.5 cm2. For example, the tissue may have a surface area of at least 0.5 cm2, 1 cm2, 2 cm2, 3 cm2, 4 cm2, 5 cm2, 6 cm2, 7 cm2, 8 cm2, 9 cm2, 10 cm2, 11 cm2, 12 cm2, 13 cm2, 14 cm2, 15 cm2, 16 cm2, 17 cm2, 18 cm2, 19 cm2, or 20 cm2. For example, as shown in
In some embodiments, the tissue is contacted with the system in a specific manner to facilitate use of a large slice (e.g. at least 1 cm2, 2 cm2, 3 cm2, 4 cm2, 5 cm2, 6 cm2, 7 cm2, 8 cm2, 9 cm2, 10 cm2, 11 cm2, 12 cm2, 13 cm2, 14 cm2, 15 cm2, 16 cm2, 17 cm2, 18 cm2, 19 cm2, or 20 cm2) without ripping, tearing, or other contortions that affect the integrity of the tissue sample itself. For example, the tissue may be frozen prior to contact with the system. For example, the tissue may be fixed in an appropriate medium (e.g. paraformaldehyde, formaldehyde, etc.). The medium may further comprise one or more reagents, such as sugars (e.g. sucrose) at an appropriate concentration (e.g. 1-20%). The fixed tissue may be frozen by any suitable method. In some embodiments, the fixed tissue is rapidly frozen, such as by using dry ice, hexane-dry ice, liquid nitrogen, etc. The tissue may be placed in an appropriate embedding medium. For example, in some embodiments the tissue is fixed, frozen, and subsequently placed in an embedding medium. In some embodiments, the embedding medium comprises SCEM. In some embodiments, the embedding medium comprises SCEM-L1. In some embodiments, the embedded medium containing the tissue is subsequently frozen. For example, the embedded tissue may be frozen again using dry ice, hexane-dry ice, liquid nitrogen, etc.
For any of the embodiments described herein, the tissue is sliced to an appropriate thickness. For example, the tissue may be sliced using a cryostat. Furthermore, the tissue may be stabilized by use of an adhesive film. For example, an adhesive film may be applied to the exposed surface of the tissue prior to slicing. Following application of the adhesive film to the exposed surface, the tissue is sliced, such that a single slice is produced wherein one surface of the tissue is in contact with the adhesive film and the other surface of the tissue is not (e.g. in other words, only one surface of the tissue is exposed). The adhesive film-stabilized tissue may subsequently be transferred onto the oligonucleotide array. The adhesive film may be removed from the tissue prior to performing spatial transcriptomics measurements.
In some embodiments, the tissue sample is an FFPE tissue sample, and the method further comprises deparaffinizing the tissue sample and decrosslinking RNA in the tissue sample prior to contacting the tissue sample with a system as described herein. In some embodiments, the tissue sample is an FFPE tissue sample, and the method comprises deparaffinizing the tissue sample and decrosslinking RNA in the tissue sample prior to hybridizing the first probe and the second probe to the target RNA sequence. Suitable deparaffinization techniques include, for example, xylene washes. Suitable decrosslinking techniques include, for example, incubation at suitable temperatures with a chelating buffer.
In some aspects, provided herein are kits. In some embodiments, provided herein are kits for use in methods of spatial detection of nucleic acid in a tissue sample. In some embodiments, the kit comprises a system as described herein. For example, in some embodiments the kit comprises a support containing a plurality of spots immobilized on the support, wherein each spot comprises a plurality of capture oligonucleotides. In some embodiments, each capture oligonucleotide comprises a nucleic acid capture region, a unique molecular identifier, a first anchor sequence, and a spatial barcode. In some embodiments, each capture oligonucleotide further comprises a second anchor sequence. In some embodiments, the 5′ end of each capture oligonucleotide is proximal to the support, and the 3′ end of each capture oligonucleotide is furthest away from the support. In some embodiments, each capture oligonucleotide comprises, from 5′ end to 3′ end, a first anchor sequence, a spatial barcode, a second anchor sequence, a unique molecular identifier, and a nucleic acid capture region. In some embodiments, the capture oligonucleotides are immobilized on the support by hybridization to surface probes, which are immobilized on the surface of the substrate.
In some embodiments, the kit comprises a means for preparing a system by a method as described herein. For example, in some embodiments, the kit comprises a support comprising an array of surface probes. In some embodiments, each surface probe comprises a first anchor sequence, a spatial barcode, and a second anchor sequence. The surface probes may be immobilized on the support. In some embodiments, such a kit may be used to generate a system for spatial detection of nucleic acid as described herein. For example, a first complementary nucleotide may be hybridized to the first anchor sequence of a surface probe (for example, as shown in
In some embodiments, the kit further comprises additional reagents for spatial detection of nucleic acid in a tissue sample. For example, the kit may further comprise additional reagents for generation of cDNA, imaging of the tissue sample, for staining the tissue sample, and/or sequencing of cDNA. For example, the kit may further comprise enzymes (e.g. reverse transcriptases, ligases, etc.), dNTPs, buffers, RNAse inhibitors, primers, probes, labels (e.g. fluorescent dyes), and the like. Components of the kits may be physically packaged together or separately. The kits can also comprise instructions for using the components of the kit. Instructions can be supplied with the kit or as a separate component. Instructions may be in paper form, or an electronic form. For example, instructions may be provided on a computer readable memory device or downloaded from an internet website, or as recorded presentation.
In some embodiments, further provided herein is an electronic system for executing one or more steps in a method for determining the spatial location of nucleic acid in a tissue sample as described herein. In some embodiments, the electronic system comprises software. In some embodiments, the software contains instructions for performing one or more steps in a method described herein. For example, software may be designed to execute a program for generating cDNA, imaging tissue, performing PCR, performing sequencing, and the like. In some embodiments, the electronic system includes a memory for storing data collected during one or more steps in a method as described herein. For example, the memory may store sequencing and/or imaging data collected by a method as described herein. In some embodiments, the electronic system includes a computer (e.g., a controller), which may comprise the software and/or memory component.
Agilent G3 microarray was customized and purchased from Agilent Technologies. The array featured a plurality of custom designed surface probes. Each surface probe comprises a first anchor sequence, a spatial barcode, and a second anchor sequence. Surface probes were immobilized to the surface of a solid support, such that the first anchor region was tethered to the support. The location of each cluster of surface probes, along with the corresponding spatial barcode for each cluster, was known.
Complementary nucleotides were hybridized to the surface probes as shown in
The resulting system comprised a plurality of spots, each spot containing a plurality of capture oligonucleotides. Each capture oligonucleotide within a given spot comprises the same spatial barcode. The spatial barcode for each spot is unique (i.e., no two spots comprise the same spatial barcode). Each capture oligonucleotide comprises a unique molecular identifier.
Tissue was placed upon the system, and the poly A tail of RNA (e.g. mRNA) binds to the polyT tail of the capture region of the capture oligonucleotides. cDNA can be synthesized as shown in
Sagittal slices were obtained from a whole mouse. The tissue was fixed, frozen, and embedded. An embedded tissue is shown in
Cryotape-stabilized slices were then transferred a system comprising a plurality of spots as described herein. The system was maintained under suitable hybridization conditions to allow the RNA within the slice to hybridize to capture probes. Following hybridization, reverse-transcription was performed to produce cDNA. The resulting cDNA was sequenced.
The number of unique molecular identifiers per spot, which is indicative of the number of capture probes per spot, was visualized (
As shown in
Current ST methods use conventional freeze-sectioning techniques of small (<1 cm2), mostly soft tissues. However, these sectioning techniques rely on cryomicrotomes that do not work for large or complex samples containing various tissue types such as whole-mouse sections. In addition, these sectioning techniques often result in tissue section damage, such as structural dislocation or abnormalities (e.g., tearing, folding), and are not amenable to robust serial sectioning. To circumvent these issues of reproducibility and scale across specimen types and sizes, various strategies were developed and used in the systems and methods described herein. First, the default blade holders and blade types of a cryomicrotome (Leica CM3050 S) and a cryomacrotome (Leica CM3600 XP) were modified and used to section standard (up to 1 cm2) and large-format (up to 12 cm2) specimens, respectively (
Transfer of Histological Sections from a Custom Adhesive Film to ST Arrays
To perform ST profiling on histological sections supported by the custom adhesive film, the section is first transferred to the ST array (i.e., glass slide carrying oligonucleotides). The use of adhesive film enables transfer of large-format sections (e.g., whole mount human organs or rodents), which typically cannot be obtained without the use of an adhesive film as support without extensive damage occurring to the tissue slice. In addition, the use of adhesive film enables the robust serial sectioning of small-to-large samples (e.g., 3D reconstruction of serial sections), which is not feasible with standard freeze-sectioning and manual transfer steps (e.g., forceps) without extensive section loss or damage.
Fresh-frozen sections supported on an adhesive film (for example, obtained as shown in
To enable scalable ST profiling, a large-format mRNA capture array was generated and employed. A commercially available oligonucleotide microarrays (Agilent) (
10-μm histological sections from mouse kidneys were used to benchmark the custom array described above against the commercial Visium kit (10× Genomics) using similar sequencing depth for both platforms. For ST using Visium, the manufacturer's experimental and computational pipelines were followed. For ST using the systems described herein, custom microarrays were built as described above (
ST profiling was performed on whole mouse sections. Whole mouse sections were transferred onto custom arrays generated as described above. (
As shown in Example 3, custom ST arrays were generated using on-slide probe assembly to include a spatial barcode for localization, UMIs for quantification, and an oligo (dT) for mRNA capture (
The ST array can be further optimized by modifying the DNA sequence composition of the probes covalently attached to the custom arrays (
Performance of sequence variants can be quantified. For example, to quantify the performance of sequence variants of Anchor 2, purified, total RNA samples can be used, which can be hybridized to custom arrays carrying Anchor 2 variants or the original Anchor 2 sequence as benchmark. For variants of the Anchor 2 sequence, the sequences of the SpatialBarcodes can be designed to avoid self-dimerization issues between Anchor 2, Anchor 1, and Spatial Barcodes. To create Spatial Barcodes, ˜1 million 18-mer sequences (hamming distance ≥3) are generated using the R package DNABarcodes26, which includes various metrics such as self-dimerization, GC %, Tm, and interactions with Anchors. To measure the effects of Anchor 2 sequence variants on the efficiency of on-slide probe assembly, resulting oligo species can be stripped from the array using KOH and analyzed by PAGE as shown above (
A bias was observed whereby Spatial Barcodes ending in T in their 3′ ends were the least represented in UMI counts per spot (
The on-slide gap-filling reaction steps on the array can also be optimized (
The methods and systems described herein can also be used to generate whole and targeted transcriptome sequencing readouts. The main steps for ST profiling of sections from fresh frozen tissues are H&E staining and imaging, permeabilization followed by cDNA synthesis and amplification, and sequencing library construction. cDNA synthesis and library construction steps can be optimized to generate high quality, whole or targeted transcriptome data. For example, cDNA synthesis and library construction steps can be optimized using a suitable organ, such as mouse kidney sections, as a test system. Permeabilization conditions shown to be suitable for use in a mouse kidney section are shown in
The conditions for on-slide cDNA synthesis and template switching can be optimized by varying the concentrations of the reverse transcriptase and template switching oligo) and the temperature (42-55° C. using one- or two-step reactions) and the length (30 min to overnight) of the reverse transcription reaction. The conditions for full-length cDNA amplification can also be optimized. In some embodiments, cDNAs are amplified by single-primer PCR amplification using conditions similar to what has been established for bulk and single-cell RNA-seq24,25. The ST methods described herein and RNA-seq methods differ in the length of the partial Illumina Read 1 sequence used in ST (24-mer) vs RNA-seq (33-mer). This length difference is due to the length limit of probes on the custom arrays provided herein (60 bases). The input of cDNA used can vary per single-primer amplification reactions for optimal complexity of resulting next-generation sequencing libraries and downstream sequencing saturation while mitigating the depth of sequencing needed (number of reads per spot). To estimate these metrics from ST data, the median numbers of genes detected per spot as a function of the mean number of reads per spot can be plotted.
Targeted transcriptome analysis can also be performed using the systems and methods described herein. For example, in an exemplary method a gene panel representing low, medium, and high levels of expression in a desired organ can be selected. Biotinylated probes targeting the selected genes can be designed and obtained. Amplified cDNA from the organ can be processed on the ST arrays described herein, hybridized with the biotinylated probes, and target cDNAs can be pulled down (e.g. using streptavidin beads or another biotin-binding agent). Enriched cDNAs can be processed for RNA-seq to quantify the enrichment and expression levels of the gene panel.
In an exemplary method, 500 genes expressed in mouse kidney can be selected to represent low, medium, and high levels of expressions using ST data (
The methods and systems described herein can be used with formalin-fixed, paraffin-embedded (FFPE) sections. Clinical biospecimens are often preserved as FFPE blocks of tissues or biopsies which highlights the need in biomedical research for spatial profiling methods capable to generate data from FFPE sections. For example, FFPE-compatible ST methods will increase tissue availability and allow for association between ST data and clinical outcomes across patient cohorts. However, FFPE sections typically contain mRNAs in lower quantity and of lesser quality than in fresh frozen counterparts due to RNA fragmentation and long-term storage. To address these challenges, probe hybridization-based methods can be used with the custom ST arrays described herein.
The main steps for ST profiling of sections from FFPE tissues are H&E staining and imaging, oligonucleotide probe hybridization and ligation, RNase treatment and permeabilization, and probe extension followed by next-generation sequencing library construction. In some embodiments, after H&E staining and imaging, FFPE sections can be placed onto custom ST arrays and processed as shown in
The systems and methods described herein can be used to simultaneously measure mRNA and protein levels and to sequence B and T cell receptors (VDJ-seq)29,30 from the same histological section. Measuring spatial mRNA and protein levels is valuable to, for example, annotate cell type information using antibodies targeting well-known cell surface markers or study the regulation of gene expression. Sequencing the receptors of lymphocytes in with spatial information will permit studies of immune repertoires, such as host responses to vaccination and infection or infiltrating lymphocytes in tumors.
ST methods can be combined with sequencing-based measurements of (i) protein levels using DNA-barcoded antibodies, and/or (ii) immune T and B cell receptor clones. To combine mRNA and protein readouts using sequencing, DNA-barcoded antibodies can be used. Suitable DNA-barcoded antibodies can be obtained from commercial sources, such as BioLegend. Each antibody carries an amplification primer and a barcode sequence followed by a poly(dA) sequence for capture by the oligo (dT) overhang on the array (
Methods to enrich B and T cell receptor sequences from full-length cDNAs obtained from tissue sections processed ST arrays can be used (
This approach was conducted using 139 probes targeting all mouse TRAV, TRBV, TRAC, and TRBV genes (127 genes for ˜17 kb in combined length). Using this custom probe set on total RNA purified from splenocytes, >70% of the reads matching T cell clonotypes were obtained by long read sequencing (Nanopore 2.0 chemistry for improved base calling accuracy), along with a V gene-usage distribution similar to that obtained by standard short read, TCR-seq methods (
The spatial analysis of biological tissues and organisms provides understanding of how cell localizations and states are coupled with biological outcomes. For example, cancer progression and remission due to treatment are intimately linked to the cellular and molecular composition and state of tumor lesions. The systems provided herein can be used to evaluate colon samples, including colon samples from patients having colon cancer. The size of the organ is compatible with ST profiling of whole-mount sections. H&E images and ST data from whole-mount sections of human colon specimens containing both normal and cancerous tissues can be obtained. In addition, single-nucleus (sn) RNA-seq data from a tissue section immediately adjacent to the section used for H&E and ST can be obtained.
Fresh colon samples can be obtained from patients undergoing surgical resection for colon cancer. A single, full thickness, en bloc ring of tissue containing the cancer lesion and the adjacent normal colon tissue and mesentery can be obtained (
It is understood that the foregoing detailed description is merely illustrative and are not to be taken as limitations upon the scope of the disclosure, which is defined solely by the appended claims and their equivalents.
Various changes and modifications to the disclosed embodiments will be apparent to those skilled in the art. Such changes and modifications, including without limitation those relating to the chemical structures, substituents, derivatives, intermediates, syntheses, compositions, formulations, or methods of use of the disclosure, may be made without departing from the spirit and scope thereof.
Any patents and publications referenced herein are herein incorporated by reference in their entireties.
Exemplary embodiments in accordance with the disclosure are set forth in the following clauses:
Clause 1: A method of producing a system for spatial detection of nucleic acid in a tissue sample, the method comprising:
This application claims priority to U.S. Provisional Patent Application No. 63/309,795, filed Feb. 14, 2022, the entire contents of which are incorporated herein by reference for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US23/13010 | 2/14/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63309795 | Feb 2022 | US |