METHODS, COMPOSITIONS, AND SYSTEMS FOR DETECTING EXOGENOUS NUCLEIC ACIDS

BACKGROUND

Cells within a tissue have differences in cell morphology and/or function due to varied analyte levels (e.g., gene and/or protein expression) within the different cells. The specific position of a cell within a tissue (e.g., the cell's position relative to neighboring cells or the cell's position relative to the tissue microenvironment) can affect, e.g., the cell's morphology, differentiation, fate, viability, proliferation, behavior, signaling, and cross-talk with other cells in the tissue.

Spatial heterogeneity has been previously studied using techniques that typically provide data for a handful of analytes in the context of intact tissue or a portion of a tissue (e.g., tissue section), or provide significant analyte data from individual, single cells, but fails to provide information regarding the position of the single cells from the originating biological sample (e.g., tissue).

Spatial analysis of an analyte within a biological sample may require determining the sequence of the analyte sequence or a complement thereof and the sequence of the spatial barcode or a complement thereof to identify the location of the analyte. The biological sample may be placed on a solid support to improve specificity and efficiency when being analyzed for identification or characterization of an analyte, such as DNA, RNA, protein, or other genetic material, within the sample.

Advances in cell and gene therapies represent promising technologies for disease treatment. Such therapies can involve delivery of therapeutic exogenous nucleic acids to subjects, for example via viral delivery vectors (e.g., lentiviral vectors, AAV vectors), or by introduction of exogenous sequences (e.g., chimeric antigen receptors) into cells for transplantation into subjects in need thereof.

Research and development, as well as animal studies and clinical trials in the field of cell and gene therapy, generally involve detecting presence of the delivered exogenous nucleic acids in complex biological samples, including clinical samples such as solid tissue samples and FFPE samples, as well as characterizing molecular and cellular responses to the delivered nucleic acids in such samples.

There exists a need for technologies for the highly sensitive detection of exogenously delivered nucleic acids and cellular responses thereto, in complex biological samples including clinical samples such as solid tissue samples and FFPE samples.

SUMMARY

The present disclosure provides methods, compositions, devices, and systems for determining the location and/or abundance of an analyte in a biological sample. Determining the spatial location and/or abundance of analytes (e.g., proteins, DNA, or RNA) within a biological sample leads to better understanding of spatial heterogeneity in various contexts, such as disease models. Described herein are methods for capturing probes and/or barcodes to a capture domain. In some instances, the techniques disclosed herein facilitate downstream processing, such as sequencing of the probes and/or barcodes bound to a capture domain.

In some examples, the methods, compositions, devices, and systems disclosed herein utilize RNA-templated ligation (RTL) for analyzing an analyte (e.g., RNA) in a biological sample. In some examples, RTL is used in combination with a “sandwich process,” wherein an analyte or analyte-derived molecule is transferred from a first substrate to a second substrate for further downstream processing. In some examples, analyte capture agents are used for analyzing an analyte (e.g., protein) in a biological sample. In some examples, the methods disclosed herein allow spatial analysis of two different types of analytes.

Provided herein are methods of detecting an exogenous nucleic acid in a biological sample, the method including (a) hybridizing a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide to an exogenous nucleic acid in a biological sample on a first substrate, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the exogenous nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain; (b) coupling the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide, thereby generating an exogenous connected probe; (c) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (d) releasing the exogenous connected probe from the exogenous nucleic acid when the biological sample is aligned with at least a portion of the array; and (e) hybridizing the exogenous capture probe binding domain of the exogenous connected probe to the capture domain of the capture probe.

In some embodiments, the exogenous nucleic acid comprises viral DNA or viral RNA. In some embodiments, the exogenous nucleic acid is an adeno-associated virus (AAV) nucleic acid, wherein the AAV is selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9. In some embodiments, the exogenous nucleic acid is from a genome integrated nucleic acid from a virus that integrates into a host genome, and wherein the virus that integrates into the host genome is selected from a retroviral vector, an arenavirus vector, a herpes virus vector, an Epstein-Barr Virus vector, or an adenovirus vector. In some embodiments, the exogenous nucleic acid comprises RNA from a reporter gene, wherein the reporter gene is selected from the group consisting of green fluorescent protein (GFP), Cerulean, tdTomato, and mCherry.

In some embodiments, the method further includes analyzing an analyte in the endogenous genome of the biological sample, wherein the biological sample is on the first substrate, the method including hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe binding domain; coupling the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe prior to aligning the first substrate with the second substrate; when the biological sample is aligned with at least a portion of the array, releasing the connected probe from the analyte; and hybridizing the connected probe to the second capture domain of a second capture probe on the array, wherein the second capture probe further comprises a second spatial barcode.

In some embodiments, the analyte comprises DNA or RNA.

In some embodiments, the method further includes determining (i) all or a part of a sequence of the connected probe, or a complement thereof, and (ii) the sequence of the second spatial barcode, or a complement thereof, to determine the location and/or abundance of the analyte in the biological sample.

In some embodiments, the coupling of the first probe oligonucleotide and the second probe oligonucleotide comprises ligating the first probe oligonucleotide and the second probe oligonucleotide.

In some embodiments, the method further includes determining location and/or abundance of a protein in the biological sample, the method including prior to aligning the first substrate with the second substrate, contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the protein, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and a capture handle sequence; when the biological sample is aligned with at least the portion of the array, hybridizing the capture handle sequence to a third capture domain of a third capture probe on the array, wherein the third capture probe further comprises a third spatial barcode; and determining (i) all or a part of a sequence of the capture agent barcode domain, or a complement thereof; and (ii) the sequence of the third spatial barcode, or a complement thereof, to determine the location and/or abundance of the protein in the biological sample.

In some embodiments, the releasing step comprises contacting the biological sample with a reagent medium comprising proteinase K or pepsin.

In some embodiments, the aligning includes (i) mounting the first substrate on a first member of a support device, the first member configured to retain the first substrate; (ii) mounting the second substrate on a second member of the support device; (iii) applying the reagent medium to the first substrate and/or the second substrate; and (iv) operating an alignment mechanism of the support device to move the first member and/or the second member such that at least a portion of the biological sample is aligned with at least a portion of the array, and such that the portion of the biological sample and the portion of the array contact the reagent medium.

In some embodiments, the method further includes determining (i) all or a part of the sequence of the exogenous connected probe, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof, and wherein the method further comprises using the determined sequences of (i) and (ii) to determine the location and/or abundance of the exogenous nucleic acid in the biological sample.

In some embodiments, the capture probe comprises a poly(T) sequence. In some embodiments, the capture probe further comprises one or more functional domains, a unique molecular identifier (UMI), a cleavage domain, and combinations thereof.

In some embodiments, the biological sample is a tissue sample. In some embodiments, the biological sample is a fixed tissue sample. In some embodiments, the fixed tissue sample is a formalin fixed paraffin embedded (FFPE) tissue sample. In some embodiments, the tissue sample is a fresh frozen tissue sample. In some embodiments, the tissue sample is fixed and stained prior to step (a).

Also provided herein is a method of detecting an exogenous nucleic acid in a biological sample, the method comprising: hybridizing a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide to an exogenous nucleic acid in a biological sample on a first substrate, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the exogenous nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain; coupling the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide, thereby generating an exogenous connected probe; aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; releasing the capture probe from the array when the biological sample is aligned with at least a portion of the array; and hybridizing the exogenous capture probe binding domain of the exogenous connected probe to the capture domain of the capture probe.

In some instances, disclosed herein are methods of detecting an exogenous nucleic acid in a biological sample, the methods comprising: (a) hybridizing a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide to an exogenous nucleic acid in a biological sample on a first substrate, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the exogenous nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain; (b) coupling the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide, thereby generating an exogenous connected probe; (c) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (d) when the biological sample is aligned with at least a portion of the array, (i) releasing the exogenous connected probe from the exogenous nucleic acid and (ii) migrating the exogenous connected probe from the biological sample to the array; and (e) hybridizing the exogenous capture probe binding domain of the exogenous connected probe to the capture domain of the capture probe.

In some embodiments, the exogenous nucleic acid comprises viral DNA or viral RNA. In some embodiments, the exogenous nucleic acid is an adeno-associated virus (AAV) nucleic acid. In some embodiments, the AAV is any one of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. In some embodiments, the AAV is a recombinant adeno-associated virus (rAAV). In some embodiments, the exogenous nucleic acid is from a genome integrated nucleic acid. In some embodiments, the genome integrated nucleic acid comprises a nucleotide sequence encoding a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)—CRISPR associated (Cas) (CRISPR-Cas) system guide RNA that hybridizes with the target host sequence. In some embodiments, the genome integrated nucleic acid is from a virus that integrates into a host (e.g., mammalian) genome. In some embodiments, the virus that integrates into the host genome is selected from a retroviral vector, an arenavirus vector, a herpes virus vector, an Epstein-Barr Virus vector, or an adenovirus vector. In some embodiments, the retroviral vector is a lentiviral vector.

In some embodiments, the methods further comprise detecting one or more additional exogenous nucleic acids in the biological sample. In some embodiments, the exogenous nucleic acid comprises RNA from a conserved region of the exogenous nucleic acid sequence. In some embodiments, the exogenous nucleic acid comprises RNA from a transgene. In some embodiments, the exogenous nucleic acid comprises RNA from a reporter gene. In some embodiments, the reporter gene is selected from the group consisting of green fluorescent protein (GFP), Cerulean, tdTomato, and mCherry. In some embodiments, the reporter gene comprises GFP.

In some embodiments, the methods further comprise analyzing an analyte in the endogenous genome of the biological sample, wherein the biological sample is on the first substrate, the method comprising: (a) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe binding domain; (b) coupling the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe prior to aligning the first substrate with the second substrate; (c) when the biological sample is aligned with at least a portion of the array, (i) releasing the connected probe from the analyte and (ii) migrating the connected probe from the biological sample to the array; and (d) hybridizing the connected probe to the capture domain of the capture probe. In some embodiments, the analyte comprises viral DNA or RNA.

In some embodiments, the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide form a contiguous exogenous nucleic acid sequence; and/or the first probe oligonucleotide and the second probe oligonucleotide are on a contiguous nucleic acid sequence. In some embodiments, the first exogenous probe oligonucleotide is on the 3′ end of the contiguous exogenous nucleic acid sequence; and/or the first probe oligonucleotide is on the 3′ end of the contiguous nucleic acid sequence. In some embodiments, the second exogenous probe oligonucleotide is on the 5′ end of the contiguous exogenous nucleic acid sequence; and/or the second probe oligonucleotide is on the 5′ end of the contiguous nucleic acid sequence.

In some embodiments, any one of the first sequences and second sequences abut one another. In some embodiments, any one of the first sequences and second sequences are at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more nucleotides away from one another. In some embodiments, the methods further comprise generating an extended first exogenous probe oligonucleotide, wherein the extended first exogenous probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first exogenous probe oligonucleotide and the sequence hybridized to the second exogenous probe oligonucleotide; and/or further comprise generating an extended first probe oligonucleotide, wherein the extended first probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first probe oligonucleotide and the sequence hybridized to the second probe oligonucleotide. In some embodiments, the methods further comprise generating an extended second exogenous probe oligonucleotide using a polymerase, wherein the extended second exogenous probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first exogenous probe oligonucleotide and the sequence hybridized to the second exogenous probe oligonucleotide; and/or further comprise generating an extended second probe oligonucleotide using a polymerase, wherein the extended second probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first probe oligonucleotide and the sequence hybridized to the second probe oligonucleotide.

In some embodiments, the coupling of the first probe oligonucleotide and the second probe oligonucleotide comprises ligating the first probe oligonucleotide and the second probe oligonucleotide. In some embodiments, the coupling of the first probe oligonucleotide and the second probe oligonucleotide comprises ligating via a ligase: (i) the first probe oligonucleotide and the extended second probe oligonucleotide; or (ii) the extended first probe oligonucleotide and the second probe oligonucleotide. In some embodiments, the ligase is selected from a Chlorella virus DNA ligase, a PBCV-1 DNA ligase, a splintR® ligase, a single stranded DNA ligase, or a T4 DNA ligase.

In some embodiments, the methods further comprise amplifying the connected probe prior to the releasing step. In some embodiments, the amplifying comprises rolling circle amplification. In some embodiments, the releasing step (d) comprises contacting the biological sample with a reagent medium comprising a permeabilization agent and an agent for releasing the connected probe, thereby permeabilizing the biological sample and releasing the connected probe from the analyte. In some embodiments, the methods further comprise analyzing a protein in the biological sample, the method comprising: (a) prior to aligning the first substrate with the second substrate, contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the protein, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and a capture handle sequence; (b) when the biological sample is aligned with at least the portion of the array, (i) releasing the capture agent barcode domain from the protein and (ii) passively or actively migrating the capture agent barcode domain from the biological sample to the array; and (c) coupling the capture handle sequence to the capture domain. In some embodiments, the coupling of the capture handle sequence to the capture domain comprises hybridization.

In some embodiments, the protein is an extracellular protein. In some embodiments, the protein is an intracellular protein. In some embodiments, the analyte binding moiety is an antibody. In some embodiments, the analyte capture agent comprises a linker. In some embodiments, the linker is a cleavable linker. In some embodiments, the cleavable linker is a photo-cleavable linker, a UV-cleavable linker, or an enzyme cleavable linker. In some embodiments, the cleavable linker is a photo-cleavable linker.

In some embodiments, the methods further comprise determining (i) all or a part of a sequence of the capture agent barcode domain, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the methods further comprise using the determined sequences of (i) and (ii) to determine the location and/or abundance of the protein in the biological sample. In some embodiments, the determining comprises sequencing (i) all or a part of the capture agent barcode domain, or a complement thereof, and (ii) the spatial barcode, or a complement thereof.

In some embodiments, the releasing step (b) comprises contacting the biological sample and the array with a reagent medium comprising a nuclease. In some embodiments, the releasing step (b) comprises contacting the biological sample and the array with a reagent medium comprising a permeabilization agent. In some embodiments, one or more of the releasing steps comprises contacting the biological sample with a reagent medium comprising a permeabilization agent and an agent for releasing the connected probe, thereby permeabilizing the biological sample and releasing the connected probe from the exogenous nucleic acid.

In some embodiments, the aligning comprises: (i) mounting the first substrate on a first member of a support device, the first member configured to retain the first substrate; (ii) mounting the second substrate on a second member of the support device; (iii) applying the reagent medium to the first substrate and/or the second substrate; and (iv) operating an alignment mechanism of the support device to move the first member and/or the second member such that at least a portion of the biological sample is aligned with at least a portion of the array, and such that the portion of the biological sample and the portion of the array contact the reagent medium. In some embodiments, the alignment mechanism is coupled to the first member, the second member, or both the first member and the second member. In some embodiments, the alignment mechanism comprises a linear actuator, optionally wherein: the linear actuator is configured to move the second member along an axis orthogonal to the plane of the first member and/or the second member, and/or the linear actuator is configured to move the first member along an axis orthogonal to the plane of the first member and/or the second member, and/or the linear actuator is configured to move the first member, the second member, or both the first member and the second member at a velocity of at least 0.1 mm/sec, and/or the linear actuator is configured to move the first member, the second member, or both the first member and the second member with an amount of force of at least 0.1 lbs.

In some embodiments, during the releasing step, a separation distance is maintained between the first substrate and the second substrate, optionally wherein the separation distance is less than 50 microns, optionally wherein the separation distance is between 2-25 microns, optionally wherein the separation distance is measured in a direction orthogonal to the surface of the first substrate that supports the biological sample, and/or at least the portion of the biological sample is vertically aligned with the at least portion of the array. In some embodiments, at least one of the first substrate and the second substrate further comprise a spacer, wherein after the first and second substrate being mounted on the support device, the spacer is disposed between the first substrate and second substrate and is configured to maintain the reagent medium within a chamber formed by the first substrate, the second substrate, and the spacer, and maintain a separation distance between the first substrate and the second substrate, the spacer positioned to at least partially surround an area on the first substrate on which the biological sample is disposed and/or the array disposed on the second substrate, wherein the area of the first substrate, the spacer, and the second substrate at least partially encloses a volume comprising the biological sample.

In some embodiments, the chamber comprises a partially or fully sealed chamber, and/or the second substrate comprises the spacer, and/or the first substrate comprises the spacer, and/or the applying the reagent medium to the first substrate and/or the second substrate comprises applying the reagent medium to a region of the spacer, the region outside an enclosed area of the second substrate, the enclosed area formed by the spacer.

In some embodiments, as the first substrate and/or the second substrate are moved via the alignment mechanism, the first substrate is at an angle relative to the second substrate such that a dropped side of the first substrate and a portion of the second substrate contact the reagent medium, optionally wherein: a dropped side of the first substrate urges the reagent medium toward the opposite direction, and/or the alignment mechanism further moves the first substrate and/or the second substrate to maintain an approximately parallel arrangement of the first substrate and the second substrate and a separation distance between the first substrate and the second substrate, optionally when the approximately parallel arrangement and the separation distance are maintained, the spacer fully encloses and surrounds the at least portion of the biological sample and the at least portion of the array, and the spacer forms the sides of the chamber which hold a volume of the reagent medium.

In some embodiments, the agent for releasing the connected probe comprises a nuclease. In some embodiments, the nuclease comprises an RNase, optionally wherein the RNase is selected from RNase A, RNase C, RNase H, or RNase I. In some embodiments, the permeabilization agent comprises a protease. In some embodiments, the protease is selected from trypsin, pepsin, elastase, or proteinase K. In some embodiments, the reagent medium further comprises a detergent. In some embodiments, the detergent is selected from sodium dodecyl sulfate (SDS), sarkosyl, saponin, a nonionic surfactant, or a nonionic detergent. In some embodiments, the reagent medium does not comprise sodium dodecyl sulfate (SDS) or sarkosyl. In some embodiments, the reagent medium further comprises polyethylene glycol (PEG). In some embodiments, the biological sample and the array are contacted with the reagent medium for about 1 to about 60 minutes. In some embodiments, the biological sample and the array are contacted with the reagent medium for about 30 minutes.

In some embodiments, the methods further comprise determining (i) all or a part of the sequence of the connected probe, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof, optionally wherein the method further comprises using the determined sequences of (i) and (ii) to determine the location and/or abundance of the exogenous nucleic acid in the biological sample. In some embodiments, the determining comprises sequencing (i) all or a part of the connected probe, or a complement thereof, and (ii) the spatial barcode, or a complement thereof. In some embodiments, the sequence of the connected probe comprises the sequence of the spatial barcode or the reverse complement thereof, and a sequence corresponding to the exogenous nucleic acid in the biological sample or reverse complement thereof. In some embodiments, the capture probe comprises a poly(T) sequence. In some embodiments, the capture probe comprises a sequence complementary to the first probe oligonucleotide or the second probe oligonucleotide. In some embodiments, the capture probe further comprises one or more functional domains, a unique molecular identifier (UMI), a cleavage domain, and combinations thereof.

In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a solid tissue sample. In some embodiments, the solid tissue sample is a tissue section. In some embodiments, the biological sample is a fixed tissue sample. In some embodiments, the fixed tissue sample is a formalin fixed paraffin embedded (FFPE) tissue sample, optionally wherein the FFPE tissue sample is an FFPE tissue section. In some embodiments, the FFPE tissue sample is deparaffinized and decrosslinked prior to step (a). In some embodiments, the fixed tissue sample is a formalin fixed paraffin embedded cell pellet. In some embodiments, the tissue sample is a fresh frozen tissue sample. In some embodiments, the tissue sample is fixed and stained prior to step (a).

Also provided herein are systems or kits for analyzing an analyte in a biological sample, the systems or the kits comprising: (a) a support device configured to retain a first substrate and a second substrate, wherein the biological sample is placed on the first substrate, and wherein the second substrate comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (b) (b1) a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide, wherein the first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of an exogenous nucleic acid, respectively, wherein the second exogenous probe oligonucleotide comprises a capture probe binding domain, and wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide are capable of being ligated together to form an exogenous connected probe; or (b2) a first probe oligonucleotide and a second probe oligonucleotide, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, wherein the second probe oligonucleotide comprises a capture probe binding domain, and wherein the first probe oligonucleotide and the second probe oligonucleotide are capable of being ligated together to form a connected probe; or (b3) a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and an capture handle sequence; (c) a reagent medium comprising a permeabilization agent and optionally an agent for releasing the connected probe; and (d) instructions for performing a method disclosed herein.

In some embodiments, the permeabilization agent is pepsin or proteinase K. In some embodiments, the agent for releasing the connected probe is an RNAse, optionally wherein the RNAse is selected from RNase A, RNase C, RNase H, or RNase I.

In some embodiments, the systems or kits comprise an alignment mechanism on the support device to align the first substrate and the second substrate. In some embodiments, the alignment mechanism comprises a linear actuator, wherein the first substrate comprises a first member and the second substrate comprises a second member, and optionally wherein: the linear actuator is configured to move the second member along an axis orthogonal to the plane of the first member and/or the second member, and/or the linear actuator is configured to move the first member along an axis orthogonal to the plane of the first member and/or the second member, and/or the linear actuator is configured to move the first member, the second member, or both the first member and the second member at a velocity of at least 0.1 mm/sec, and/or the linear actuator is configured to move the first member, the second member, or both the first member and the second member with an amount of force of at least 0.1 lbs.

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, patent application, or item of information was specifically and individually indicated to be incorporated by reference. To the extent publications, patents, patent applications, and items of information incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.

Where values are described in terms of ranges, it should be understood that the description includes the disclosure of all possible sub-ranges within such ranges, as well as specific numerical values that fall within such ranges irrespective of whether a specific numerical value or specific sub-range is expressly stated.

The term “each,” when used in reference to a collection of items, is intended to identify an individual item in the collection but does not necessarily refer to every item in the collection, unless expressly stated otherwise, or unless the context of the usage clearly indicates otherwise.

Various embodiments of the features of this disclosure are described herein. However, it should be understood that such embodiments are provided merely by way of example, and numerous variations, changes, and substitutions can occur to those skilled in the art without departing from the scope of this disclosure. It should also be understood that various alternatives to the specific embodiments described herein are also within the scope of this disclosure.

DESCRIPTION OF DRAWINGS

The following drawings illustrate certain embodiments of the features and advantages of this disclosure. These embodiments are not intended to limit the scope of the appended claims in any manner. Like reference symbols in the drawings indicate like elements.

FIG. 1A shows an exemplary sandwiching process where a first substrate (e.g., a slide), including a biological sample, and a second substrate (e.g., array slide) are brought into proximity with one another.

FIG. 1B shows a fully formed sandwich configuration creating a chamber formed from the one or more spacers, the first substrate, and the second substrate.

FIG. 2A shows a perspective view of an exemplary sample handling apparatus in a closed position.

FIG. 2B shows a perspective view of an exemplary sample handling apparatus in an open position.

FIG. 3A shows the first substrate angled over (superior to) the second substrate.

FIG. 3B shows that as the first substrate lowers, and/or as the second substrate rises, the dropped side of the first substrate may contact a drop of reagent medium.

FIG. 3C shows a full closure of the sandwich between the first substrate and the second substrate with one or more spacers contacting both the first substrate and the second substrate.

FIG. 4A shows a side view of the angled closure workflow.

FIG. 4B shows a top view of the angled closure workflow.

FIG. 5 is a schematic diagram showing an example of a barcoded capture probe, as described herein.

FIG. 6 shows a schematic illustrating a cleavable capture probe.

FIG. 7 shows exemplary capture domains on capture probes.

FIG. 8 shows an exemplary arrangement of barcoded features within an array.

FIG. 9A shows and exemplary workflow for performing a templated capture and producing a ligation product, and FIG. 9B shows an exemplary workflow for capturing a ligation product from FIG. 9A on a substrate.

FIG. 10 is a schematic diagram of an exemplary analyte capture agent.

FIG. 11 is a schematic diagram depicting an exemplary interaction between a feature-immobilized capture probe 1124 and an analyte capture agent 1126.

FIG. 12A shows an exemplary workflow for using ligated probes to capture intracellular analytes.

FIG. 12B shows an exemplary schematic illustrating the tissue sample sandwiched between a substrate and a spatially-barcoded capture probe array, wherein the ligated probes are transferred to the spatially-barcoded capture probe array.

FIG. 13A shows an exemplary workflow for using ligated probes and analyte capture agents to capture intracellular analytes.

FIG. 13B shows an exemplary schematic illustrating the tissue sample sandwiched between a substrate and a spatially-barcoded capture probe array, wherein the ligated probes and capture agent barcode domains are transferred to the spatially-barcoded capture probe array.

FIG. 14A shows an exemplary schematic illustrating the use of an RNase cleavable linker to release a capture agent barcode domain from an analyte binding moiety of an analyte capture agent.

FIG. 14B shows an exemplary schematic illustrating the use of a UV cleavable linker to release a capture agent barcode domain from an analyte binding moiety of an analyte capture agent.

FIG. 15A shows an exemplary workflow of spatial analysis assays using fresh frozen tissue samples.

FIG. 15B shows an exemplary workflow of spatial analysis assays using formalin-fixation and paraffin-embedded (FFPE) tissue samples.

DETAILED DESCRIPTION
I. Spatial Analysis Methods

Spatial analysis methodologies described herein can provide a vast amount of analyte and/or expression data for a variety of analytes within a biological sample at high spatial resolution, while retaining native spatial context. Spatial analysis methods can include, e.g., the use of a capture probe including a spatial barcode (e.g., a nucleic acid sequence that provides information as to the location or position of an analyte within a cell or a tissue sample (e.g., mammalian cell or a mammalian tissue sample) and a capture domain that is capable of binding to an analyte (e.g., a protein and/or a nucleic acid) produced by and/or present in a cell. Spatial analysis methods and compositions can also include the use of a capture probe having a capture domain that captures an intermediate agent for indirect detection of an analyte. For example, the intermediate agent can include a nucleic acid sequence (e.g., a barcode) associated with the intermediate agent. Detection of the intermediate agent is therefore indicative of the analyte in the cell or tissue sample.

Non-limiting aspects of spatial analysis methodologies and compositions are described in U.S. Pat. Nos. 11,447,807, 11,352,667, 11,168,350, 11,104,936, 11,008,608, 10,995,361, 10,913,975, 10,774,374, 10,724,078, 10,640,816, 10,494,662, 10,480,022, 10,364,457, 10,317,321, 10,059,990, 10,041,949, 10,030,261, 10,002,316, 9,879,313, 9,783,841, 9,727,810, 9,593,365, 8,951,726, 8,604,182, and 7,709,198; U.S. Patent Application Publication Nos. 2020/0239946, 2020/0080136, 2020/0277663, 2019/0330617, 2020/0256867, 2020/0224244, 2019/0085383, and 2013/0171621; PCT Publication Nos. WO2018/091676, WO2020/176788, WO2017/144338, and WO2016/057552; Non-patent literature references Rodriques et al., Science 363(6434):1463-1467, 2019; Lee et al., Nat. Protoc. 10(3):442-458, 2015; Trejo et al., PLoS ONE 14(2):e0212031, 2019; Chen et al., Science 348(6233):aaa6090, 2015; Gao et al., BMC Biol. 15:50, 2017; and Gupta et al., Nature Biotechnol. 36:1197-1202, 2018; the Visium Spatial Gene Expression Reagent Kits User Guide (e.g., Rev F, dated January 2022); and/or the Visium Spatial Gene Expression Reagent Kits—Tissue Optimization User Guide (e.g., Rev E, dated February 2022), both of which are available at the 10× Genomics Support Documentation website, and can be used herein in any combination, and each of which is incorporated herein by reference in their entireties. Further non-limiting aspects of spatial analysis methodologies and compositions are described herein.

Some general terminology that may be used in this disclosure can be found in Section (I)(b) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. Typically, a “barcode” is a label, or identifier, that conveys or is capable of conveying information (e.g., information about an analyte in a sample, a bead, and/or a capture probe). A barcode can be part of an analyte, or independent of an analyte. A barcode can be attached to an analyte. A particular barcode can be unique relative to other barcodes. For the purpose of this disclosure, an “analyte” can include any biological substance, structure, moiety, or component to be analyzed. The term “target” can similarly refer to an analyte of interest.

Analytes can be broadly classified into one of two groups: nucleic acid analytes, and non-nucleic acid analytes. Examples of non-nucleic acid analytes include, but are not limited to, lipids, carbohydrates, peptides, proteins, glycoproteins (N-linked or O-linked), lipoproteins, phosphoproteins, specific phosphorylated or acetylated variants of proteins, amidation variants of proteins, hydroxylation variants of proteins, methylation variants of proteins, ubiquitylation variants of proteins, sulfation variants of proteins, viral proteins (e.g., viral capsid, viral envelope, viral coat, viral accessory, viral glycoproteins, viral spike, etc.), extracellular and intracellular proteins, antibodies, and antigen binding fragments. In some embodiments, the analyte(s) can be localized to subcellular location(s), including, for example, organelles, e.g., mitochondria, Golgi apparatus, endoplasmic reticulum, chloroplasts, endocytic vesicles, exocytic vesicles, vacuoles, lysosomes, etc. In some embodiments, analyte(s) can be peptides or proteins, including without limitation antibodies and enzymes. Additional examples of analytes can be found in Section (I)(c) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. In some embodiments, an analyte can be detected indirectly, such as through detection of an intermediate agent, for example, a ligation product or an analyte capture agent (e.g., an oligonucleotide-conjugated antibody), such as those described herein.

A “biological sample” is typically obtained from the subject for analysis using any of a variety of techniques including, but not limited to, biopsy, surgery, and laser capture microscopy (LCM), and generally includes cells and/or other biological material from the subject. In some embodiments, the biological sample is a tissue sample. In some embodiments, the biological sample (e.g., tissue sample) is a tissue microarray (TMA). A tissue microarray contains multiple representative tissue samples—which can be from different tissues or organisms—assembled on a single histologic slide. The TMA can therefore allow for high throughput analysis of multiple specimens at the same time. Tissue microarrays are paraffin blocks produced by extracting cylindrical tissue cores from different paraffin donor blocks and re-embedding these into a single recipient (microarray) block at defined array coordinates.

In some embodiments, the biological sample, e.g., the tissue, is embedded in a matrix e.g., optimal cutting temperature (OCT) compound to facilitate sectioning. OCT compound is a formulation of clear, water-soluble glycols and resins, providing a solid matrix to encapsulate biological (e.g., tissue) specimens. In some embodiments, the sectioning is performed by cryosectioning, for example using a microtome. In some embodiments, the methods further comprise a thawing step, after the cryosectioning.

The biological sample can be from a mammal. In some instances, the biological sample is from a human, mouse, or rat. In addition to the subjects described above, the biological sample can be obtained from non-mammalian organisms (e.g., a plants, an insect, an arachnid, a nematode (e.g., Caenorhabditis elegans), a fungi, an amphibian, or a fish (e.g., zebrafish)). A biological sample can be obtained from a prokaryote such as a bacterium, e.g., Escherichia coli, Staphylococci or Mycoplasma pneumoniae; an archaea; a virus such as Hepatitis C virus or human immunodeficiency virus; or a viroid. A biological sample can be obtained from a eukaryote, such as a patient derived organoid (PDO) or patient derived xenograft (PDX). The biological sample can include organoids, a miniaturized and simplified version of an organ produced in vitro in three dimensions that shows realistic micro-anatomy. Organoids can be generated from one or more cells from a tissue, embryonic stem cells, and/or induced pluripotent stem cells, which can self-organize in three-dimensional culture owing to their self-renewal and differentiation capacities. In some embodiments, an organoid is a cerebral organoid, an intestinal organoid, a stomach organoid, a lingual organoid, a thyroid organoid, a thymic organoid, a testicular organoid, a hepatic organoid, a pancreatic organoid, an epithelial organoid, a lung organoid, a kidney organoid, a gastruloid, a cardiac organoid, or a retinal organoid. Subjects from which biological samples can be obtained can be healthy or asymptomatic individuals, individuals that have or are suspected of having a disease (e.g., cancer) or a pre-disposition to a disease, and/or individuals that are in need of therapy or suspected of needing therapy.

Biological samples can be derived from a homogeneous culture or population of the subjects or organisms mentioned herein or alternatively from a collection of several different organisms, for example, in a community or ecosystem.

Biological samples can include one or more diseased cells. A diseased cell can have altered metabolic properties, gene expression, protein expression, and/or morphologic features. Examples of diseases include inflammatory disorders, metabolic disorders, nervous system disorders, and cancer. Cancer cells can be derived from solid tumors, hematological malignancies, cell lines, or obtained as circulating tumor cells.

In some embodiments, the biological sample, e.g., the tissue sample, is fixed in a fixative including alcohol, for example methanol. In some embodiments, instead of methanol, acetone, or an acetone-methanol mixture can be used. In some embodiments, the fixation is performed after sectioning. In some instances, the biological sample is not fixed with paraformaldehyde (PFA). In some instances, when the biological sample is fixed with a fixative including an alcohol (e.g., methanol or acetone-methanol mixture), it is not decrosslinked afterward. In some preferred embodiments, the biological sample is fixed with a fixative including an alcohol (e.g., methanol or an acetone-methanol mixture) after freezing and/or sectioning. In some instances, the biological sample is flash-frozen, and then the biological sample is sectioned and fixed (e.g., using methanol, acetone, or an acetone-methanol mixture). In some instances when methanol, acetone, or an acetone-methanol mixture is used to fix the biological sample, the sample is not decrosslinked at a later step. In instances when the biological sample is frozen (e.g., flash frozen using liquid nitrogen and embedded in OCT) followed by sectioning and alcohol (e.g., methanol, acetone-methanol) fixation or acetone fixation, the biological sample is referred to as “fresh frozen”. In some embodiments, fixation of the biological sample e.g., using acetone and/or alcohol (e.g., methanol, acetone-methanol) is performed while the sample is on a substrate (e.g., mounted on a glass slide, such as a positively charged glass slide).

In some embodiments, the biological sample, e.g., the tissue sample, is fixed e.g., immediately after being harvested from a subject. In such embodiments, the fixative is preferably an aldehyde fixative, such as paraformaldehyde (PFA) or formalin. In some embodiments, the fixative induces crosslinks within the biological sample. In some embodiments, after fixing e.g., by formalin or PFA, the biological sample is dehydrated via sucrose gradient. In some instances, the fixed biological sample is treated with a sucrose gradient and then embedded in a matrix e.g., OCT compound. In some instances, the fixed biological sample is not treated with a sucrose gradient, but rather is embedded in a matrix e.g., OCT compound after fixation. In some embodiments when a fixed frozen tissue sample is treated with a sucrose gradient, it can be rehydrated with an ethanol gradient. In some embodiments, the PFA or formalin fixed biological sample, which can be optionally dehydrated via sucrose gradient and/or embedded in OCT compound, is then frozen e.g., for storage or shipment. In such instances, the biological sample is referred to as “fixed frozen”. In preferred embodiments, a fixed frozen biological sample is not treated with methanol. In preferred embodiments, a fixed frozen biological sample is not paraffin embedded. Thus, in preferred embodiments, a fixed frozen biological sample is not deparaffinized. In some embodiments, a fixed frozen biological sample is rehydrated in an ethanol gradient.

In some instances, the biological sample (e.g., a fixed frozen tissue sample) is treated with a citrate buffer. Citrate buffer can be used for antigen retrieval to decrosslink antigens and fixation medium in the biological sample. Thus, any suitable decrosslinking agent can be used in addition to or alternatively to citrate buffer. In some embodiments, for example, the biological sample (e.g., a fixed frozen tissue sample) is decrosslinked with Tris-EDTA (TE) buffer.

In any of the foregoing, the biological sample can further be stained, imaged, and/or destained. For example, in some embodiments, a fresh frozen tissue sample or fixed frozen tissue sample is stained (e.g., via eosin and/or hematoxylin), imaged, destained (e.g., via HCl), or a combination thereof. In some embodiments, when a fresh frozen tissue sample is fixed in methanol, it is treated with isopropanol prior to being stained (e.g., via eosin and/or hematoxylin), imaged, destained (e.g., via HCl), or a combination thereof. In some embodiments when a fixed frozen tissue sample is treated with a sucrose gradient, it can be rehydrated with an ethanol gradient before being stained, (e.g., via eosin and/or hematoxylin), imaged, destained (e.g., via HCl), decrosslinked (e.g., via Tris & EDTA (TE)) buffer or citrate buffer), or a combination thereof. In some embodiments, the biological sample can undergo further fixation (e.g., while mounted on a substrate), stained, imaged, and/or destained. For example, a fixed frozen biological sample may be subject to an additional fixing step (e.g., using PFA) before optional ethanol rehydration, staining, imaging, and/or destaining.

In any of the foregoing, the biological sample can be fixed using PAXgene. For example, the biological sample can be fixed using PAXgene in addition, or alternatively to, a fixative disclosed herein or known in the art (e.g., alcohol, acetone, acetone-alcohol, formalin, paraformaldehyde). PAXgene is a non-cross-linking mixture of different alcohols, acid and a soluble organic compound that preserves morphology and bio-molecules. It is a two-reagent fixative system in which tissue is firstly fixed in a solution containing methanol and acetic acid then stabilized in a solution containing ethanol. See, Ergin B. et al., J Proteome Res. 2010 Oct. 1; 9(10):5188-96; Kap M. et al., PLoS One.; 6(11):e27704 (2011); and Mathieson W. et al., Am J Clin Pathol.; 146(1):25-40 (2016), each of which are hereby incorporated by reference in their entirety, for a description and evaluation of PAXgene for tissue fixation. Thus, in some embodiments, when the biological sample, e.g., the tissue sample, is fixed in a fixative including alcohol, the fixative is PAXgene. In some embodiments, a fresh frozen tissue sample is fixed with PAXgene. In some embodiments, a fixed frozen tissue sample is fixed with PAXgene.

In some embodiments, the biological sample, e.g., the tissue sample is fixed, for example in methanol, acetone, acetone-methanol, PFA, PAXgene or is formalin-fixed and paraffin-embedded (FFPE). In some embodiments, the biological sample comprises intact cells. In some embodiments, the biological sample is a cell pellet, e.g., a fixed cell pellet, e.g., an FFPE cell pellet. FFPE samples are used in some instances in the RTL methods disclosed herein. A limitation of direct RNA capture for fixed samples is that the RNA integrity of fixed (e.g., FFPE) samples can be lower than a fresh sample, thereby making it more difficult to capture RNA directly, e.g., by capture of a common sequence such as a poly(A) tail of an mRNA molecule. However, by utilizing RTL probes that hybridize to RNA target sequences in the transcriptome, one can avoid a requirement for RNA analytes to have both a poly(A) tail and target sequences intact. Accordingly, RTL probes can be utilized to beneficially improve capture and spatial analysis of fixed samples. The biological sample, e.g., tissue sample, can be stained, and imaged prior, during, and/or after each step of the methods described herein. Any of the methods described herein or known in the art can be used to stain and/or image the biological sample. In some embodiments, the imaging occurs prior to destaining the sample. In some embodiments, the biological sample is stained using an H&E staining method. In some embodiments, the tissue sample is stained and imaged for about 10 minutes to about 2 hours (or any of the subranges of this range described herein). Additional time may be needed for staining and imaging of different types of biological samples.

The tissue sample can be obtained from any suitable location in a tissue or organ of a subject, e.g., a human subject. In some instances, the sample is a mouse sample. In some instances, the sample is a human sample. In some embodiments, the sample can be derived from skin, brain, breast, lung, liver, kidney, prostate, tonsil, thymus, testes, bone, lymph node, ovary, eye, heart, or spleen. In some instances, the sample is a human or mouse breast tissue sample. In some instances, the sample is a human or mouse brain tissue sample. In some instances, the sample is a human or mouse lung tissue sample. In some instances, the sample is a human or mouse tonsil tissue sample. In some instances, the sample is a human or mouse liver tissue sample. In some instances, the sample is a human or mouse bone, skin, kidney, thymus, testes, or prostate tissue sample. In some embodiments, the tissue sample is derived from normal or diseased tissue. In some embodiments, the sample is an embryo sample. The embryo sample can be a non-human embryo sample. In some instances, the sample is a mouse embryo sample.

Non-limiting examples of stains include histological stains (e.g., hematoxylin and/or eosin) and immunological stains (e.g., fluorescent stains). The biological sample can be stained using Can-Grunwald, Giemsa, hematoxylin and eosin (H&E), Jenner's, Leishman, Masson's trichrome, Papanicolaou, Romanowsky, silver, Sudan, Wright's, and/or Periodic Acid Schiff (PAS) staining techniques. In some instances, PAS staining is performed after formalin or acetone fixation. In some embodiments, a biological sample (e.g., a fixed and/or stained biological sample) can be imaged. Biological samples are also described in Section (I)(d) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

The following embodiments can be used with any of the methods described herein. In some embodiments, the biological sample is imaged. In some embodiments, the biological sample is visualized or imaged using bright field microscopy. In some embodiments, the biological sample is visualized or imaged using fluorescence microscopy. Additional methods of visualization and imaging are known in the art. Non-limiting examples of visualization and imaging include expansion microscopy, bright field microscopy, dark field microscopy, phase contrast microscopy, electron microscopy, fluorescence microscopy, reflection microscopy, interference microscopy and confocal microscopy. In some embodiments, the sample is stained and imaged prior to adding reagents for analyzing captured analytes as disclosed herein to the biological sample.

In some embodiments, the method includes staining the biological sample. In some embodiments, the staining includes the use of hematoxylin and eosin. In some embodiments, a biological sample can be stained using any number of biological stains, including but not limited to, acridine orange, Bismarck brown, carmine, coomassie blue, cresyl violet, DAPI, eosin, ethidium bromide, acid fuchsine, hematoxylin, Hoechst stains, iodine, methyl green, methylene blue, neutral red, Nile blue, Nile red, osmium tetroxide, propidium iodide, rhodamine, or safranin. In some instances, the biological sample can be stained using known staining techniques, including Can-Grunwald, Giemsa, hematoxylin and eosin (H&E), Jenner's, Leishman, Masson's trichrome, Papanicolaou, Romanowsky, silver, Sudan, Wright's, and/or Periodic Acid Schiff (PAS) staining techniques. PAS staining is typically performed after formalin or acetone fixation.

In some embodiments, the staining includes the use of a detectable label selected from the group consisting of a radioisotope, a fluorophore, a chemiluminescent compound, a bioluminescent compound, or a combination thereof.

In some embodiments, a biological sample is permeabilized with one or more permeabilization reagents. For example, permeabilization of a biological sample can facilitate analyte capture. Exemplary permeabilization agents and conditions are described in Section (I)(d)(ii)(13) or the Exemplary Embodiments Section of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. Briefly, in any of the methods described herein, the method includes a step of permeabilizing the biological sample. For example, the biological sample can be permeabilized to facilitate transfer of the extension products to the capture probes on the array. In some embodiments, the permeabilizing includes the use of an organic solvent (e.g., acetone, ethanol, and methanol), a detergent (e.g., saponin, Triton X-100™, Tween-20™, or sodium dodecyl sulfate (SDS)), an enzyme (an endopeptidase, an exopeptidase, a protease), or combinations thereof. In some embodiments, the permeabilizing includes the use of an endopeptidase, a protease, SDS, polyethylene glycol tert-octylphenyl ether, polysorbate 80, and polysorbate 20, N-lauroylsarcosine sodium salt solution, saponin, Triton X-100™, Tween-20™, or combinations thereof. In some embodiments, the endopeptidase is pepsin. In some embodiments, the endopeptidase is Proteinase K. Additional methods for sample permeabilization are described, for example, in Jamur et al., Method Mol. Biol. 588:63-66, 2010, the entire contents of which are incorporated herein by reference.

Array-based spatial analysis methods typically involve the transfer of one or more analytes from a biological sample to an array of features on a substrate, where each feature is associated with a unique spatial location on the array. Subsequent analysis of the transferred analytes include determining the identity of the analytes and the spatial location of the analytes within the biological sample. The spatial location of an analyte within the biological sample is determined based on the feature to which the analyte is bound (e.g., directly or indirectly) on the array, and the feature's relative spatial location within the array.

A “capture probe” refers to any molecule capable of capturing (directly or indirectly) and/or labelling an analyte (e.g., an analyte of interest) in a biological sample. In some embodiments, the capture probe is a nucleic acid or a polypeptide. In some embodiments, the capture probe includes a barcode (e.g., a spatial barcode and/or a unique molecular identifier (UMI) and a capture domain). In some instances, the capture probe includes a homopolymer sequence, such as a poly(T) sequence. In some embodiments, a capture probe can include a cleavage domain and/or a functional domain (e.g., a primer-binding site, such as for next-generation sequencing (NGS)). See, e.g., Section (II)(b) (e.g., subsections (i)-(vi)) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. Generation of capture probes can be achieved by any appropriate method, including those described in Section (II)(d)(ii) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

In some instances, hybridization between a capture probe and a nucleic acid analyte (or any other nucleic acid to nucleic acid interaction) occurs because the sequences of the two nucleic acids are substantially complementary to one another. The terms “substantial,” “substantially” complementary and the like, describe the relationship between nucleic acid sequences when at least 60% of the nucleotide residues of one nucleic acid sequence are complementary to nucleotide residues in another nucleic acid sequence. The complementary residues within a particular complementary nucleic acid sequence need not always be contiguous with each other, and can be interrupted by one or more non-complementary residues within the complementary nucleic acid sequence. In some embodiments, at least 60%, but less than 100%, of the residues of one of the two complementary nucleic acid sequences are complementary to residues in the other nucleic acid sequence. In some embodiments, at least 70%, 80%, 90%, 95% or 99% of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence. Sequences are said to be “substantially complementary” when at least 60% (e.g., at least 70%, at least 80%, or at least 90%) of the residues of one nucleic acid sequence are complementary to residues in the other nucleic acid sequence.

In some embodiments, the biological sample is on a first substrate and the substrate comprising the array of capture probes is a second substrate. During this process, one or more analytes or analyte derivatives (e.g., intermediate agents; e.g., ligation products) are released from the biological sample and migrate to the second substrate comprising an array of capture probes. In some embodiments, the release and migration of the analytes or analyte derivatives to the second substrate comprising the array of capture probes occurs in a manner that preserves the original spatial context of the analytes in the biological sample. This method can be referred to as a sandwiching process, which is described e.g., in U.S. Patent Application Pub. No. 2021/0189475 and PCT Pub. Nos. WO 2021/252747 A1, WO 2022/061152 A2, and WO 2022/140028 A1.

FIG. 1A shows an exemplary sandwiching process 100 where a first substrate (e.g., slide 103), including a biological sample 102, and a second substrate (e.g., array slide 104 including an array having spatially barcoded capture probes 106) are brought into proximity with one another. As shown in FIG. 1A a liquid reagent drop (e.g., permeabilization solution 105) is introduced on the second substrate in proximity to the capture probes 106 and in between the biological sample 102 and the second substrate (e.g., slide 104 including an array having spatially barcoded capture probes 106). The permeabilization solution 105 may release analytes or analyte derivatives (e.g., intermediate agents; e.g., ligation products) that can be captured by the capture probes of the array 106.

During the exemplary sandwiching process, the first substrate is aligned with the second substrate, such that at least a portion of the biological sample is aligned with at least a portion of the capture probes (e.g., aligned in a sandwich configuration). As shown, the second substrate (e.g., array slide 104) is in an inferior position to the first substrate (e.g., slide 103). In some embodiments, the first substrate (e.g., slide 103) may be positioned superior to the second substrate (e.g., slide 104). A reagent medium 105 within a gap between the first substrate (e.g., slide 103) and the second substrate (e.g., slide 104) creates a liquid interface between the two substrates. The reagent medium may be a permeabilization solution which permeabilizes and/or digests the biological sample 102. In some embodiments wherein the biological sample 102 has been pre-permeabilized, the reagent medium is not a permeabilization solution. In some embodiments, analytes (e.g., mRNA transcripts) and/or analyte derivatives (e.g., intermediate agents; e.g., ligation products) of the biological sample 102 may release from the biological sample, and actively or passively migrate (e.g., diffuse) across the gap toward the capture probes on the array 106. Alternatively, in certain embodiments, migration of the analyte or analyte derivative (e.g., intermediate agent; e.g., ligation product) from the biological sample is performed actively (e.g., electrophoretic, by applying an electric field to promote migration). Exemplary methods of electrophoretic migration are described in WO 2020/176788, and US. Patent Application Pub. No. 2021/0189475, each of which is hereby incorporated by reference.

As further shown, one or more spacers 110 may be positioned between the first substrate (e.g., slide 103) and the second substrate (e.g., array slide 104 including spatially barcoded capture probes 106). The one or more spacers 110 may be configured to maintain a separation distance between the first substrate and the second substrate. While the one or more spacers 110 is shown as disposed on the second substrate, the spacer may additionally or alternatively be disposed on the first substrate.

In some embodiments, the one or more spacers 110 is configured to maintain a separation distance between first and second substrates that is between about 2 microns and 1 mm (e.g., between about 2 microns and 800 microns, between about 2 microns and 700 microns, between about 2 microns and 600 microns, between about 2 microns and 500 microns, between about 2 microns and 400 microns, between about 2 microns and 300 microns, between about 2 microns and 200 microns, between about 2 microns and 100 microns, between about 2 microns and 25 microns, or between about 2 microns and 10 microns), measured in a direction orthogonal to the surface of first substrate that supports the biological sample. In some instances, the separation distance is about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 microns. In some embodiments, the separation distance is less than 50 microns. In some embodiments, the separation distance is less than 25 microns. In some embodiments, the separation distance is less than 20 microns. The separation distance may include a distance of at least 2 μm.

FIG. 1B shows a fully formed sandwich configuration 125 creating a chamber 150 formed from the one or more spacers 110, the first substrate (e.g., the slide 103), and the second substrate (e.g., the slide 104 including an array 106 having spatially barcoded capture probes) in accordance with some example implementations. In the example of FIG. 1B, the liquid reagent (e.g., the permeabilization solution 105) fills the volume of the chamber 150 and may create a permeabilization buffer that allows analytes (e.g., mRNA transcripts and/or other molecules) or analyte derivatives (e.g., intermediate agents; e.g., ligation products) to diffuse from the biological sample 102 toward the capture probes of the second substrate (e.g., slide 104). In some aspects, flow of the permeabilization buffer may deflect transcripts and/or molecules from the biological sample 102 and may affect diffusive transfer of analytes or analyte derivatives (e.g., intermediate agents; e.g., ligation products) for spatial analysis. A partially or fully sealed chamber 150 resulting from the one or more spacers 110, the first substrate, and the second substrate may reduce or prevent flow from undesirable convective movement of transcripts and/or molecules over the diffusive transfer from the biological sample 102 to the capture probes.

The sandwiching process methods described above can be implemented using a variety of hardware components. For example, the sandwiching process methods can be implemented using a sample holder (also referred to herein as a support device, a sample handling apparatus, and an array alignment device). Further details on support devices, sample holders, sample handling apparatuses, or systems for implementing a sandwiching process are described in, e.g., US. Patent Application Pub. No. 2021/0189475, and PCT Publ. No. WO 2022/061152 A2, each of which are incorporated by reference in their entirety.

In some embodiments of a sample holder, the sample holder can include a first member including a first retaining mechanism configured to retain a first substrate comprising a biological sample. The first retaining mechanism can be configured to retain the first substrate disposed in a first plane. The sample holder can further include a second member including a second retaining mechanism configured to retain a second substrate disposed in a second plane. The sample holder can further include an alignment mechanism connected to one or both of the first member and the second member. The alignment mechanism can be configured to align the first and second members along the first plane and/or the second plane such that the sample contacts at least a portion of the reagent medium when the first and second members are aligned and within a threshold distance along an axis orthogonal to the second plane. The adjustment mechanism may be configured to move the second member along the axis orthogonal to the second plane and/or move the first member along an axis orthogonal to the first plane.

In some embodiments, the adjustment mechanism includes a linear actuator. In some embodiments, the linear actuator is configured to move the second member along an axis orthogonal to the plane of the first member and/or the second member. In some embodiments, the linear actuator is configured to move the first member along an axis orthogonal to the plane of the first member and/or the second member. In some embodiments, the linear actuator is configured to move the first member, the second member, or both the first member and the second member at a velocity of at least 0.1 mm/sec. In some embodiments, the linear actuator is configured to move the first member, the second member, or both the first member and the second member with an amount of force of at least 0.1 lbs.

FIG. 2A is a perspective view of an example sample handling apparatus 200 in a closed position in accordance with some example implementations. As shown, the sample handling apparatus 200 includes a first member 204, a second member 210, optionally an image capture device 220, a first substrate 206, optionally a hinge 215, and optionally a mirror 216. The hinge 215 may be configured to allow the first member 204 to be positioned in an open or closed configuration by opening and/or closing the first member 204 in a clamshell manner along the hinge 215.

FIG. 2B is a perspective view of the example sample handling apparatus 200 in an open position in accordance with some example implementations. As shown, the sample handling apparatus 200 includes one or more first retaining mechanisms 208 configured to retain one or more first substrates 206. In the example of FIG. 2B, the first member 204 is configured to retain two first substrates 206, however the first member 204 may be configured to retain more or fewer first substrates 206.

In some aspects, when the sample handling apparatus 200 is in an open position (e.g., in FIG. 2B), the first substrate 206 and/or the second substrate 212 may be loaded and positioned within the sample handling apparatus 200 such as within the first member 204 and the second member 210, respectively. As noted, the hinge 215 may allow the first member 204 to close over the second member 210 and form a sandwich configuration.

In some aspects, after the first member 204 closes over the second member 210, an adjustment mechanism of the sample handling apparatus 200 may actuate the first member 204 and/or the second member 210 to form the sandwich configuration for the permeabilization step (e.g., bringing the first substrate 206 and the second substrate 212 closer to each other and within a threshold distance for the sandwich configuration). The adjustment mechanism may be configured to control a speed, an angle, a force, or the like of the sandwich configuration.

In some embodiments, the biological sample (e.g., sample 102 from FIG. 1A) may be aligned within the first member 204 (e.g., via the first retaining mechanism 208) prior to closing the first member 204 such that a desired region of interest of the sample is aligned with the barcoded array of the second substrate (e.g., the slide 104 from FIG. 1A), e.g., when the first and second substrates are aligned in the sandwich configuration. Such alignment may be accomplished manually (e.g., by a user) or automatically (e.g., via an automated alignment mechanism). After or before alignment, spacers may be applied to the first substrate 206 and/or the second substrate 212 to maintain a minimum spacing between the first substrate 206 and the second substrate 212 during sandwiching. In some aspects, the permeabilization solution (e.g., permeabilization solution 305) may be applied to the first substrate 206 and/or the second substrate 212. The first member 204 may then close over the second member 210 and form the sandwich configuration. Analytes or analyte derivatives (e.g., intermediate agents; e.g., ligation products) may be captured by the capture probes of the array and may be processed for spatial analysis.

In some embodiments, during the permeabilization step, the image capture device 220 may capture images of the overlap area between the biological sample and the capture probes on the array 106. If more than one first substrates 206 and/or second substrates 212 are present within the sample handling apparatus 200, the image capture device 220 may be configured to capture one or more images of one or more overlap areas.

Provided herein are methods for delivering a fluid to a biological sample disposed on an area of a first substrate and an array disposed on a second substrate. FIGS. 3A-3C depict a side view and a top view of an exemplary angled closure workflow 300 for sandwiching a first substrate (e.g., slide 303) having a biological sample 302 and a second substrate (e.g., slide 304 having capture probes 306) in accordance with some exemplary implementations.

FIG. 3A depicts the first substrate (e.g., the slide 303 including a biological sample 302) angled over (superior to) the second substrate (e.g., slide 304). As shown, reagent medium (e.g., permeabilization solution) 305 is located on the spacer 310 toward the right-hand side of the side view in FIG. 3A. While FIG. 3A depicts the reagent medium on the right hand side of side view, it should be understood that such depiction is not meant to be limiting as to the location of the reagent medium on the spacer.

FIG. 3B shows that as the first substrate lowers, and/or as the second substrate rises, the dropped side of the first substrate (e.g., a side of the slide 303 angled toward the second substrate) may contact the reagent medium 305. The dropped side of the first substrate may urge the reagent medium 305 toward the opposite direction (e.g., towards an opposite side of the spacer 310, towards an opposite side of the first substrate relative to the dropped side). For example, in the side view of FIG. 3B the reagent medium 305 may be urged from right to left as the sandwich is formed.

In some embodiments, the first substrate and/or the second substrate are further moved to achieve an approximately parallel arrangement of the first substrate and the second substrate.

FIG. 3C depicts a full closure of the sandwich between the first substrate and the second substrate with the spacer 310 contacting both the first substrate and the second substrate and maintaining a separation distance and optionally the approximately parallel arrangement between the two substrates. As shown in the top view of FIG. 3C, the spacer 310 fully encloses and surrounds the biological sample 302 and the capture probes 306, and the spacer 310 form the sides of chamber 350 which holds a volume of the reagent medium 305.

While FIG. 3C depicts the first substrate (e.g., the slide 303 including biological sample 302) angled over (superior to) the second substrate (e.g., slide 304) and the second substrate comprising the spacer 310, it should be understood that an exemplary angled closure workflow can include the second substrate angled over (superior to) the first substrate and the first substrate comprising the spacer 310.

It may be desirable that the reagent medium be free from air bubbles between the substrates to facilitate transfer of target analytes with spatial information. Additionally, air bubbles present between the substrates may obscure at least a portion of an image capture of a desired region of interest. Accordingly, it may be desirable to ensure or encourage suppression and/or elimination of air bubbles between the two substrates (e.g., slide 303 and slide 304) during a permeabilization step (e.g., step 104). In some aspects, it may be possible to reduce or eliminate bubble formation between the substrates using a variety of filling methods and/or closing methods. In some instances, the first substrate and the second substrate are arranged in an angled sandwich assembly as described herein. For example, during the sandwiching of the two substrates (e.g., the slide 303 and the slide 304), an angled closure workflow may be used to suppress or eliminate bubble formation.

FIG. 4A is a side view of the angled closure workflow 400 in accordance with some exemplary implementations. FIG. 4B is a top view of the angled closure workflow 400 in accordance with some exemplary implementations. As shown at 405, reagent medium 401 is positioned to the side of the substrate 402.

At step 410, the dropped side of the angled substrate 406 contacts the reagent medium 401 first. The contact of the substrate 406 with the reagent medium 401 may form a linear or low curvature flow front that fills uniformly with the slides closed.

At step 415, the substrate 406 is further lowered toward the substrate 402 (or the substrate 402 is raised up toward the substrate 406) and the dropped side of the substrate 406 may contact and may urge the reagent medium toward the side opposite the dropped side and creating a linear or low curvature flow front that may prevent or reduce bubble trapping between the substrates.

At step 420, the reagent medium 401 fills the gap between the substrate 406 and the substrate 402. The linear flow front of the liquid reagent may form by squeezing the 401 volume along the contact side of the substrate 402 and/or the substrate 406. Additionally, capillary flow may also contribute to filling the gap area.

In some embodiments, the reagent medium (e.g., 105 in FIG. 1A) comprises a permeabilization agent. In some embodiments, following initial contact between the biological sample and a permeabilization agent, the permeabilization agent can be removed from contact with the biological sample (e.g., by opening the sample holder). Suitable agents for this purpose include, but are not limited to, organic solvents (e.g., acetone, ethanol, and methanol), cross-linking agents (e.g., paraformaldehyde), detergents (e.g., saponin, Triton X-100™, Tween-20™, or sodium dodecyl sulfate (SDS)), and enzymes (e.g., trypsin, proteases (e.g., proteinase K). In some embodiments, the detergent is an anionic detergent (e.g., SDS or N-lauroylsarcosine sodium salt solution).

In some embodiments, the reagent medium comprises a lysis reagent. Lysis solutions can include ionic surfactants such as, for example, sarkosyl and sodium dodecyl sulfate (SDS). More generally, chemical lysis agents can include, without limitation, organic solvents, chelating agents, detergents, surfactants, and chaotropic agents. In some embodiments, the reagent medium comprises a protease. Exemplary proteases include, e.g., pepsin, trypsin, pepsin, elastase, and proteinase K. In some embodiments, the reagent medium comprises a nuclease. In some embodiments, the nuclease comprises an RNase. In some embodiments, the RNase is selected from RNase A, RNase C, RNase H, and RNase I. In some embodiments, the reagent medium comprises one or more of sodium dodecyl sulfate (SDS) or a sodium salt thereof, proteinase K, pepsin, N-lauroylsarcosine, and RNase.

In some embodiments, the reagent medium comprises polyethylene glycol (PEG). In some embodiments, the PEG is from about 2K to about 16K. In some embodiments, the PEG is PEG 2K, 3K, 4K, 5K, 6K, 7K, 8K, 9K, 10K, 11K, 12K, 13K, 14K, 15K, or 16K. In some embodiments, the PEG is present at a concentration from about 2% to 25%, from about 4% to about 23%, from about 6% to about 21%, or from about 8% to about 20% (v/v).

In certain embodiments a dried permeabilization reagent is applied or formed as a layer on the first substrate or the second substrate or both prior to contacting the biological sample with the array. For example, a permeabilization reagent can be deposited in solution on the first substrate or the second substrate or both and then dried.

In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium for about 1 minute, about 5 minutes, about 10 minutes, about 12 minutes, about 15 minutes, about 18 minutes, about 20 minutes, about 25 minutes, about 30 minutes, about 36 minutes, about 45 minutes, or about an hour. In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium for about 1-60 minutes.

There are at least two methods to associate a spatial barcode with one or more neighboring cells, such that the spatial barcode identifies the one or more cells, and/or contents of the one or more cells, as associated with a particular spatial location. One method is to promote analytes or analyte proxies (e.g., intermediate agents) out of a cell and towards a spatially-barcoded array (e.g., including spatially-barcoded capture probes). Another method is to cleave spatially-barcoded capture probes from an array and promote the spatially-barcoded capture probes towards and/or into or onto the biological sample.

In some cases, capture probes may be configured to prime, replicate, and consequently yield optionally barcoded extension products from a template (e.g., a DNA or RNA template, such as an analyte or an intermediate agent (e.g., a ligation product or an analyte capture agent), or a portion thereof), or derivatives thereof (see, e.g., Section (II)(b)(vii) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663 regarding extended capture probes). In some cases, capture probes may be configured to form ligation products with a template (e.g., a DNA or RNA template, such as an analyte or an intermediate agent, or portion thereof), thereby creating ligation products that serve as proxies for the template.

As used herein, an “extended capture probe” refers to a capture probe having additional nucleotides added to the terminus (e.g., 3′ or 5′ end) of the capture probe thereby extending the overall length of the capture probe. For example, an “extended 3′ end” indicates additional nucleotides were added to the most 3′ nucleotide of the capture probe to extend the length of the capture probe, for example, by polymerization reactions used to extend nucleic acid molecules including templated polymerization catalyzed by a polymerase (e.g., a DNA polymerase or a reverse transcriptase). In some embodiments, extending the capture probe includes adding to a 3′ end of a capture probe a nucleic acid sequence that is complementary to a nucleic acid sequence of an analyte or intermediate agent specifically bound to the capture domain of the capture probe. In some embodiments, the capture probe is extended by a reverse transcriptase. In some embodiments, the capture probe is extended using one or more DNA polymerases. In some embodiments, the extended capture probes include the sequence of the capture domain and the sequence of the spatial barcode of the capture probe, and the complementary sequence of the template used for extension of the capture probe.

In some embodiments, extended capture probes are amplified (e.g., in bulk solution or on the array) to yield quantities that are sufficient for downstream analysis, e.g., sequencing. In some embodiments, extended capture probes (e.g., DNA molecules) can act as templates for an amplification reaction (e.g., a polymerase chain reaction).

Additional variants of spatial analysis methods, including in some embodiments, an imaging step, are described in Section (II)(a) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. Analysis of captured analytes (and/or intermediate agents or portions thereof), for example, including sample removal, extension of capture probes using the captured analyte as a template, sequencing (e.g., of a cleaved extended capture probe and/or a cDNA molecule complementary to an extended capture probe), sequencing on the array (e.g., using, for example, in situ hybridization or in situ ligation approaches), temporal analysis, and/or proximity capture, is described in Section (II)(g) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. Some quality control measures are described in Section (II)(h) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

Spatial information can provide information of medical importance. For example, the methods described herein can allow for: identification of one or more biomarkers (e.g., diagnostic, prognostic, and/or for determination of efficacy of a treatment) of a disease or disorder; identification of a candidate drug target for treatment of a disease or disorder; identification (e.g., diagnosis) of a subject as having a disease or disorder; identification of stage and/or prognosis of a disease or disorder in a subject; identification of a subject as having an increased likelihood of developing a disease or disorder; monitoring of progression of a disease or disorder in a subject; determination of efficacy of a treatment of a disease or disorder in a subject; identification of a patient subpopulation for which a treatment is effective for a disease or disorder; modification of a treatment of a subject with a disease or disorder; selection of a subject for participation in a clinical trial; and/or selection of a treatment for a subject with a disease or disorder. Exemplary methods for identifying spatial information of biological and/or medical importance can be found in U.S. Patent Application Publication Nos. 2021/0140982, 2021/0198741, and 2021/0199660.

Spatial information can provide information of biological importance. For example, the methods described herein can allow for: identification of transcriptome and/or proteome expression profiles (e.g., in healthy and/or diseased tissue); identification of multiple analyte types in close proximity (e.g., nearest neighbor or proximity based analysis); determination of up- and/or down-regulated genes and/or proteins in diseased tissue; characterization of tumor microenvironments; characterization of tumor immune responses; characterization of cells types and their co-localization in healthy and diseased tissue; and identification of genetic variants within tissues (e.g., based on gene and/or protein expression profiles associated with specific disease or disorder biomarkers).

Typically, for spatial array-based methods, a substrate functions as a support for direct or indirect attachment of capture probes to features of the array. A “feature” is an entity that acts as a support or repository for various molecular entities used in spatial analysis. In some embodiments, some or all of the features in an array are functionalized for analyte capture. Exemplary substrates are described in Section (II)(c) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. Exemplary features and geometric attributes of an array can be found in Sections (II)(d)(i), (II)(d)(iii), and (II)(d)(iv) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

Generally, analytes and/or intermediate agents (or portions thereof) can be captured when contacting a biological sample with a substrate including capture probes (e.g., a substrate with capture probes embedded, spotted, printed, fabricated on the substrate, or a substrate with features (e.g., beads, wells) comprising capture probes). As used herein, “contact,” “contacted,” and/or “contacting,” a biological sample with a substrate refers to any contact (e.g., direct or indirect) such that capture probes can interact (e.g., bind covalently or non-covalently (e.g., hybridize)) with analytes from the biological sample. Capture can be achieved actively (e.g., using electrophoresis) or passively (e.g., using diffusion). Analyte capture is further described in Section (II)(e) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

FIG. 5 is a schematic diagram showing an exemplary capture probe, as described herein. As shown, the capture probe 502 is optionally coupled to a feature 501 by a cleavage domain 503, such as a disulfide linker. The capture probe can include a functional sequence 504 that is useful for subsequent processing. The functional sequence 504 can include all or a part of sequencer specific flow cell attachment sequence (e.g., a P5 or P7 sequence), all or a part of a sequencing primer sequence, (e.g., a R1 primer binding site, a R2 primer binding site), or combinations thereof. The capture probe can also include a spatial barcode 505. The capture probe can also include a unique molecular identifier (UMI) sequence 506. While FIG. 5 shows the spatial barcode 505 as being located upstream (5′) of UMI sequence 506, it is to be understood that capture probes wherein UMI sequence 506 is located upstream (5′) of the spatial barcode 505 is also suitable for use in any of the methods described herein. The capture probe can also include a capture domain 507 to facilitate capture of a target analyte. The capture domain can have a sequence complementary to a sequence of a nucleic acid analyte. The capture domain can have a sequence complementary to a connected probe described herein. The capture domain can have a sequence complementary to a capture handle sequence (also referred to as analyte capture sequence) present in an analyte capture agent. The capture domain can have a sequence complementary to a splint oligonucleotide. A splint oligonucleotide, in addition to having a sequence complementary to a capture domain of a capture probe, can have a sequence complementary to a sequence of a nucleic acid analyte, a portion of a connected probe described herein, a capture handle sequence described herein, and/or a methylated adaptor described herein.

FIG. 6 is a schematic illustrating a cleavable capture probe, wherein the cleaved capture probe can enter into a non-permeabilized cell and bind to analytes within the sample. The capture probe 601 contains a cleavage domain 602, a cell penetrating peptide 603, a reporter molecule 604, and a disulfide bond (—S—S—). 605 represents all other parts of a capture probe, for example a spatial barcode and a capture domain.

FIG. 7 is a schematic diagram of an exemplary multiplexed spatially-barcoded feature. In FIG. 7, the feature 701 can be coupled to spatially-barcoded capture probes, wherein the spatially-barcoded probes of a particular feature can possess the same spatial barcode, but have different capture domains designed to associate the spatial barcode of the feature with more than one target analyte. For example, a feature may include four different types of spatially-barcoded capture probes, each type of spatially-barcoded capture probe possessing the spatial barcode 702. One type of capture probe associated with the feature includes the spatial barcode 702 in combination with a poly(T) capture domain 703, designed to capture mRNA target analytes. A second type of capture probe associated with the feature includes the spatial barcode 702 in combination with a random N-mer capture domain 704 for gDNA analysis. A third type of capture probe associated with the feature includes the spatial barcode 702 in combination with a capture domain complementary to the analyte capture agent of interest 705. A fourth type of capture probe associated with the feature includes the spatial barcode 702 in combination with a capture probe that can specifically bind a nucleic acid molecule 706 that can function in a CRISPR assay (e.g., CRISPR/Cas9). While only four different capture probe-barcoded constructs are shown in FIG. 7, capture-probe barcoded constructs can be tailored for analyses of any given analyte associated with a nucleic acid and capable of binding with such a construct. For example, the schemes shown in FIG. 7 can also be used for concurrent analysis of other analytes disclosed herein, including, but not limited to: (a) mRNA, a lineage tracing construct, cell surface or intracellular proteins and metabolites, and gDNA; (b) mRNA, accessible chromatin (e.g., ATAC-seq, DNase-seq, and/or MNase-seq) cell surface or intracellular proteins and metabolites, and a perturbation agent (e.g., a CRISPR crRNA/sgRNA, TALEN, zinc finger nuclease, and/or antisense oligonucleotide as described herein); (c) mRNA, cell surface or intracellular proteins and/or metabolites, a barcoded labelling agent (e.g., the MHC multimers described herein), and a V(D)J sequence of an immune cell receptor (e.g., T-cell receptor). In some embodiments, a perturbation agent can be a small molecule, an antibody, a drug, an aptamer, a miRNA, a physical environmental (e.g., temperature change), or any other known perturbation agents.

The functional sequences can generally be selected for compatibility with any of a variety of different sequencing systems, e.g., Ion Torrent Proton or PGM, Illumina sequencing instruments, PacBio, Oxford Nanopore, etc., and the requirements thereof. In some embodiments, functional sequences can be selected for compatibility with non-commercialized sequencing systems. Examples of such sequencing systems and techniques, for which suitable functional sequences can be used, include (but are not limited to) Ion Torrent Proton or PGM sequencing, Illumina sequencing, PacBio SMRT sequencing, and Oxford Nanopore sequencing. Further, in some embodiments, functional sequences can be selected for compatibility with other sequencing systems, including non-commercialized sequencing systems.

In some embodiments, referring to FIG. 5, the spatial barcode 505 and functional sequences 504 are common to all of the probes attached to a given feature. In some embodiments, the UMI sequence 506 of a capture probe attached to a given feature is different from the UMI sequence of a different capture probe attached to the given feature.

FIG. 8 depicts an exemplary arrangement of barcoded features within an array. From left to right, FIG. 8 shows (L) a slide including six spatially-barcoded arrays, (C) an enlarged schematic of one of the six spatially-barcoded arrays, showing a grid of barcoded features in relation to a biological sample, and (R) an enlarged schematic of one section of an array, showing the specific identification of multiple features within the array (labelled as ID578, ID579, ID560, etc.).

In some embodiments, more than one analyte type (e.g., nucleic acids and proteins) from a biological sample can be detected (e.g., simultaneously or sequentially) using any appropriate multiplexing technique, such as those described in Section (IV) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

In some cases, spatial analysis can be performed by attaching and/or introducing a molecule (e.g., a peptide, a lipid, or a nucleic acid molecule) having a barcode (e.g., a spatial barcode) to a biological sample (e.g., to a cell in a biological sample). In some embodiments, a plurality of molecules (e.g., a plurality of nucleic acid molecules) having a plurality of barcodes (e.g., a plurality of spatial barcodes) are introduced to a biological sample (e.g., to a plurality of cells in a biological sample) for use in spatial analysis. In some embodiments, after attaching and/or introducing a molecule having a barcode to a biological sample, the biological sample can be physically separated (e.g., dissociated) into single cells or cell groups for analysis. Some such methods of spatial analysis are described in Section (III) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663.

In some cases, spatial analysis can be performed by detecting multiple oligonucleotides that hybridize to an analyte. In some instances, for example, spatial analysis can be performed using RNA-templated ligation (RTL). Methods of RTL have been described previously. See, e.g., Credle et al., Nucleic Acids Res. 2017 Aug. 21; 45(14):e128. Typically, RTL includes hybridization of two oligonucleotides to adjacent sequences on an analyte (e.g., an RNA molecule, such as an mRNA molecule). In some instances, the oligonucleotides are DNA molecules. In some instances, one of the oligonucleotides includes at least two ribonucleic acid bases at the 3′ end and/or the other oligonucleotide includes a phosphorylated nucleotide at the 5′ end. In some instances, one of the two oligonucleotides includes a capture domain (e.g., a poly(A) sequence, a non-homopolymeric sequence). After hybridization to the analyte, a ligase (e.g., a T4 RNA ligase (Rnl2), a PBCV-1 DNA Ligase or Chorella virus DNA Ligase, a single-stranded DNA ligase, or a T4 DNA ligase) ligates the two oligonucleotides together, creating a ligation product. In some instances, the two oligonucleotides hybridize to sequences that are not adjacent to one another. For example, hybridization of the two oligonucleotides creates a gap between the hybridized oligonucleotides. In some instances, a polymerase (e.g., a DNA polymerase) can extend one of the oligonucleotides prior to ligation. After ligation, the ligation product is released from the analyte. In some instances, the ligation product is released using an endonuclease (e.g., RNase H). In some instances, the ligation product is removed using heat. In some instances, the ligation product is removed using KOH. The released ligation product can then be captured by capture probes (e.g., instead of direct capture of an analyte) on an array, optionally amplified, and sequenced, thus determining the location and optionally the abundance of the analyte in the biological sample.

In some instances, one or both of the oligonucleotides may hybridize to genomic DNA (gDNA) which can lead to false positive sequencing data from ligation events on gDNA (off target) in addition to the desired (on target) ligation events on target nucleic acids, (e.g., mRNA). Thus, in some embodiments, the disclosed methods can include contacting the biological sample with a deoxyribonuclease (DNase). The DNase can be an endonuclease or exonuclease. In some embodiments, the DNase digests single- and/or double-stranded DNA. Suitable DNases include, without limitation, a DNase I and a DNase II. Use of a DNase as described can mitigate false positive sequencing data from off target gDNA ligation events.

A non-limiting example of templated ligation methods disclosed herein is depicted in FIG. 9A. After a biological sample is contacted with a substrate including a plurality of capture probes and contacted with (a) a first probe 901 having a target-hybridization sequence 903 and a primer sequence 902 and (b) a second probe 904 having a target-hybridization sequence 905 and a capture domain (e.g., a poly-A sequence) 906, the first probe 901 and a second probe 904 hybridize 910 to an analyte 907. A ligase 921 ligates 920 the first probe to the second probe thereby generating a ligation product 922. The ligation product is released 930 from the analyte 931 by digesting the analyte using an endoribonuclease 932. The sample is permeabilized 940 and the ligation product 941 is able to hybridize to a capture probe on the substrate. Methods and composition for spatial detection using templated ligation have been described in PCT Publ. No. WO 2021/133849 A1, U.S. Pat. Nos. 11,332,790 and 11,505,828, each of which is incorporated by reference in its entirety.

In some embodiments, as shown in FIG. 9B, the ligation product 9001 includes a capture probe capture domain 9002, which can bind to a capture probe 9003 (e.g., a capture probe immobilized, directly or indirectly, on a substrate 9004). In some embodiments, methods provided herein include contacting 9005 a biological sample with a substrate 9004, wherein the capture probe 9003 is affixed to the substrate (e.g., immobilized to the substrate, directly or indirectly). In some embodiments, the capture probe capture domain 9002 of the ligated product specifically binds to the capture domain 9006. The capture probe can also include a unique molecular identifier (UMI) 9007, a spatial barcode 9008, a functional sequence 9009, and a cleavage domain 9010.

In some embodiments, methods provided herein include permeabilization of the biological sample such that the capture probe can more easily capture the ligation products (i.e., compared to no permeabilization). In some embodiments, reverse transcription (RT) reagents can be added to permeabilized biological samples. Incubation with the RT reagents can extend the capture probes 9011 to produce spatially-barcoded full-length cDNA 9012 and 9013 from the captured ligation products.

In some embodiments, the extended ligation products can be denatured 9014 from the capture probe and transferred (e.g., to a clean tube) for amplification, and/or library construction. The spatially-barcoded, ligation products can be amplified 9015 via PCR prior to library construction. P59016, i59017, i79018, and P79019, and can be used as sample indexes. The amplicons can then be sequenced using paired-end sequencing using TruSeq Read 1 and TruSeq Read 2 as sequencing primer sites.

In some embodiments, detection of one or more analytes (e.g., protein analytes) can be performed using one or more analyte capture agents. As used herein, an “analyte capture agent” refers to an agent that interacts with an analyte (e.g., an analyte in a biological sample) and with a capture probe (e.g., a capture probe attached to a substrate or a feature) to identify the analyte. In some embodiments, the analyte capture agent includes: (i) an analyte binding moiety (e.g., that binds to an analyte), for example, an antibody or antigen-binding fragment thereof; (ii) analyte binding moiety barcode; and (iii) an analyte capture sequence. As used herein, the term “analyte binding moiety barcode” refers to a barcode that is associated with or otherwise identifies the analyte binding moiety. As used herein, the term “analyte capture sequence” refers to a region or moiety configured to hybridize to, bind to, couple to, or otherwise interact with a capture domain of a capture probe. In some cases, an analyte binding moiety barcode (or portion thereof) may be able to be removed (e.g., cleaved) from the analyte capture agent. Additional description of analyte capture agents can be found in Section (II)(b)(ix) of PCT Publication No. WO2020/176788 and/or Section (II)(b)(viii) U.S. Patent Application Publication No. 2020/0277663.

FIG. 10 is a schematic diagram of an exemplary analyte capture agent 1002 comprised of an analyte-binding moiety 1004 and an analyte-binding moiety barcode domain 1008. The exemplary analyte-binding moiety 1004 is a molecule capable of binding to an analyte 1006 and the analyte capture agent is capable of interacting with a spatially-barcoded capture probe. The analyte-binding moiety can bind to the analyte 1006 with high affinity and/or with high specificity. The analyte capture agent can include an analyte-binding moiety barcode domain 1008 which serves to identify the analyte binding moiety, and a capture domain which can hybridize to at least a portion or an entirety of a capture domain of a capture probe. The analyte-binding moiety 1004 can include a polypeptide and/or an aptamer. The analyte-binding moiety 1004 can include an antibody or antibody fragment (e.g., an antigen-binding fragment).

FIG. 11 is a schematic diagram depicting an exemplary interaction between a feature-immobilized capture probe 1124 and an analyte capture agent 1126. The feature-immobilized capture probe 1124 can include a spatial barcode 1108 as well as functional sequences 1106 and a UMI 1110, as described elsewhere herein. The capture probe can be affixed 1104 to a feature (e.g., bead) 1102. The capture probe can also include a capture domain 1112 that is capable of binding to an analyte capture agent 1126. The analyte-binding moiety barcode domain of the capture agent 1126 can include a functional sequence 1118, analyte binding moiety barcode 1116, and a capture handle (or analyte capture) sequence 1114 that is capable of binding (e.g., hybridizing) to the capture domain 1112 of the capture probe 1124. The analyte capture agent can also include a linker 1120 that allows the analyte-binding moiety barcode domain (e.g., including the functional sequence 1118, analyte binding barcode 1116, and analyte capture sequence 1114) to couple to the analyte binding moiety 1122. In some embodiments, the linker is a cleavable linker. In some embodiments, the cleavable linker is a photo-cleavable linker, a UV-cleavable linker, or an enzyme cleavable linker. In some instances, the cleavable linker is a disulfide linker. A disulfide linker can be cleaved by use of a reducing agent, such as dithiothreitol (DTT), Beta-mercaptoethanol (BME), or Tris (2-carboxyethyl) phosphine (TCEP).

During analysis of spatial information, sequence information for a spatial barcode associated with an analyte is obtained, and the sequence information can be used to provide information about the spatial distribution of the analyte in the biological sample. Various methods can be used to obtain the spatial information. In some embodiments, specific capture probes and the analytes they capture are associated with specific locations in an array of features on a substrate. For example, specific spatial barcodes can be associated with specific array locations prior to array fabrication, and the sequences of the spatial barcodes can be stored (e.g., in a database) along with specific array location information, so that each spatial barcode uniquely maps to a particular array location.

Alternatively, specific spatial barcodes can be deposited at predetermined locations in an array of features during fabrication such that at each location, only one type of spatial barcode is present so that spatial barcodes are uniquely associated with a single feature of the array. Where necessary, the arrays can be decoded using any of the methods described herein so that spatial barcodes are uniquely associated with array feature locations, and this mapping can be stored as described above.

When sequence information is obtained for capture probes and/or analytes during analysis of spatial information, the locations of the capture probes and/or analytes can be determined by referring to the stored information that uniquely associates each spatial barcode with an array feature location. In this manner, specific capture probes and captured analytes are associated with specific locations in the array of features. Each array feature location represents a position relative to a coordinate reference point (e.g., an array location, a fiducial marker) for the array. Accordingly, each feature location has an “address” or location in the coordinate space of the array.

Some exemplary spatial analysis workflows are described in the Exemplary Embodiments section of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. See, for example, the Exemplary embodiment starting with “In some non-limiting examples of the workflows described herein, the sample can be immersed . . . ” of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663. See also, e.g., the Visium Spatial Gene Expression Reagent Kits User Guide (e.g., Rev F, dated January 2022); and/or the Visium Spatial Gene Expression Reagent Kits—Tissue Optimization User Guide (e.g., Rev E, dated February 2022).

In some embodiments, spatial analysis can be performed using dedicated hardware and/or software, such as any of the systems described in Sections (II)(e)(ii) and/or (V) of PCT Publication No. WO2020/176788 and/or U.S. Patent Application Publication No. 2020/0277663, or any of one or more of the devices or methods described in Sections Control Slide for Imaging, Methods of Using Control Slides and Substrates for, Systems of Using Control Slides and Substrates for Imaging, and/or Sample and Array Alignment Devices and Methods, Informational labels of PCT Publication No. WO2020/123320.

Suitable systems for performing spatial analysis can include components such as a chamber (e.g., a flow cell or sealable, fluid-tight chamber) for containing a biological sample. The biological sample can be mounted for example, in a biological sample holder. One or more fluid chambers can be connected to the chamber and/or the sample holder via fluid conduits, and fluids can be delivered into the chamber and/or sample holder via fluidic pumps, vacuum sources, or other devices coupled to the fluid conduits that create a pressure gradient to drive fluid flow. One or more valves can also be connected to fluid conduits to regulate the flow of reagents from reservoirs to the chamber and/or sample holder.

The systems can optionally include a control unit that includes one or more electronic processors, an input interface, an output interface (such as a display), and a storage unit (e.g., a solid state storage medium such as, but not limited to, a magnetic, optical, or other solid state, persistent, writeable and/or re-writeable storage medium). The control unit can optionally be connected to one or more remote devices via a network. The control unit (and components thereof) can generally perform any of the steps and functions described herein. Where the system is connected to a remote device, the remote device (or devices) can perform any of the steps or features described herein. The systems can optionally include one or more detectors (e.g., CCD, CMOS) used to capture images. The systems can also optionally include one or more light sources (e.g., LED-based, diode-based, lasers) for illuminating a sample, a substrate with features, analytes from a biological sample captured on a substrate, and various control and calibration media.

The systems can optionally include software instructions encoded and/or implemented in one or more of tangible storage media and hardware components such as application specific integrated circuits. The software instructions, when executed by a control unit (and in particular, an electronic processor) or an integrated circuit, can cause the control unit, integrated circuit, or other component executing the software instructions to perform any of the method steps or functions described herein.

In some cases, the systems described herein can detect (e.g., register an image) the biological sample on the array. Exemplary methods to detect the biological sample on an array are described in PCT Publication No. WO2021/102003 and/or U.S. Patent Application Publication No. 2021/0150707, each of which is incorporated herein by reference in their entireties.

Prior to transferring analytes from the biological sample to the array of features on the substrate, the biological sample can be aligned with the array. Alignment of a biological sample and an array of features including capture probes can facilitate spatial analysis, which can be used to detect differences in analyte presence and/or level within different positions in the biological sample, for example, to generate a three-dimensional map of the analyte presence and/or level. Exemplary methods to generate a two- and/or three-dimensional map of the analyte presence and/or level are described in PCT Publication No. WO2020/053655 and spatial analysis methods are generally described in PCT Publication No. WO2021/102039 and/or U.S. Patent Application Publication No. 2021/0155982, each of which is incorporated herein by reference in their entireties.

In some cases, a map of analyte presence and/or level can be aligned to an image of a biological sample using one or more fiducial markers, e.g., objects placed in the field of view of an imaging system which appear in the image produced, as described in the Substrate Attributes Section, Control Slide for Imaging Section of PCT Publication Nos. WO2020/123320, WO 2021/102005, and/or U.S. Patent Application Publication No. 2021/0158522, each of which is incorporated herein by reference in their entireties. Fiducial markers can be used as a point of reference or measurement scale for alignment (e.g., to align a sample and an array, to align two substrates, to determine a location of a sample or array on a substrate relative to a fiducial marker) and/or for quantitative measurements of sizes and/or distances.

II. Methods, Compositions, Devices, and Systems for Capturing Analytes and Derivatives Thereof

(a) Introduction

Provided herein are methods, devices, compositions, and systems for analyzing (i) an exogenous nucleic acid and (ii) a nucleic acid and/or protein analyte in a biological sample. Detecting the presence of exogenous nucleic acids, e.g., a viral vector delivered to a biological sample often relies on detection of a moiety such as a fluorescent marker expressed from the viral vector. Expression of such a moiety has a few drawbacks. First, the size of a viral vector can have size limitations such that both a gene of interest and a marker cannot be efficiently expressed in one vector. Second, expression of a reporter marker such as a fluorescent protein not only can be toxic, but when it is expressed contiguously with a gene of interest, it can affect the normal expression and folding of the translated product of the gene of interest. Thus, methods are needed in which one does not necessarily need to rely on reporter gene/moiety expression to determine detection. The methods, compositions, and systems described herein address such a need. Using the spatial methods described herein, both exogenous (e.g., viral) gene and endogenous gene or protein expression and/or location can be determined simultaneously.

In some instances, the methods described herein include aligning (e.g., sandwiching) a first substrate having the biological sample with a second substrate that includes a plurality of capture probes, thereby “sandwiching” the biological sample between the two substrates. Upon interaction of the biological sample with the substrate having a plurality of probes (in either instance), the location and/or abundance of a nucleic acid or protein analyte in a biological sample can be determined, as provided herein. This method includes an advantage in that prior to analyte or analyte-derived molecule by the capture probe, most-if not all steps can be performed on a substrate that does not have capture probes, thereby providing a method that is cost effective.

The methods and systems provided herein can be applied to an analyte and/or an analyte-derived molecule(s). As used herein, an analyte derived molecule includes, without limitation, a connected probe (e.g., a ligation product) from an RNA-templated ligation (RTL) assay, a product of reverse transcription (e.g., an extended capture probe), and an analyte binding moiety barcode (e.g., a binding moiety barcode that identifies that analyte binding moiety (e.g., an antibody)). In some embodiments, the analyte or analyte derived molecules comprise RNA and/or DNA. In some embodiments, the analyte or analyte derived molecules comprise one or more proteins.

In some instances, the methods, devices, compositions, and systems disclosed herein provide efficient release of an analyte or analyte derived molecule from a biological sample so that it can be easily captured or detected using methods disclosed herein.

In some instances, the methods, devices, compositions, and systems disclosed herein allow for detection of analytes or analyte derived molecules from different biological samples using a single array comprising a plurality of capture probes. As such, in some instances, the methods, devices, compositions, and systems allow for serial capture of analytes or analyte derived molecules from multiple samples. The analytes or analyte derived molecules can then be demultiplexed using biological-sample-specific index sequences to identify it biological sample origin. Embodiments of the methods, devices, compositions, and systems disclosed herein are provided below.

(A) Exemplary Biological Samples

The biological sample as used herein can be any suitable biological sample described herein or known in the art. In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a solid tissue sample. In some embodiments, the biological sample is a tissue section. In some embodiments, the tissue sample is flash-frozen and sectioned. Any suitable methods described herein or known in the art can be used to flash-freeze and section the tissue sample. In some embodiments, the biological sample, e.g., the tissue, is flash-frozen using liquid nitrogen before sectioning. In some embodiments, the sectioning is performed using cryosectioning. In some embodiments, the methods further comprise a thawing step, after the cryosectioning. In some embodiments, the biological sample, e.g., the tissue sample is fixed, for example in methanol, acetone, PFA or is formalin-fixed and paraffin-embedded (FFPE). In some embodiments, the biological sample comprises intact cells. In some embodiments, the biological sample is a cell pellet, e.g., a fixed cell pellet, e.g., an FFPE cell pellet. FFPE samples are used in some instances in the RTL methods disclosed herein. A limitation of direct RNA capture for fixed samples is that the RNA integrity of fixed (e.g., FFPE) samples can be lower than a fresh sample, thereby making it more difficult to capture RNA directly, e.g., by capture of a common sequence such as a poly(A) tail of an mRNA molecule. However, by utilizing RTL probe oligonucleotides that hybridize to RNA target sequences in the transcriptome, one can avoid a requirement for RNA analytes to have both a poly(A) tail and intact/complete target sequences. Accordingly, RTL probes can be utilized to beneficially improve capture and spatial analysis of fixed samples.

The biological sample, e.g., tissue sample, can be stained, and imaged prior, during, and/or after each step of the methods described herein. Any of the methods described herein or known in the art can be used to stain and/or image the biological sample. In some embodiments, the imaging occurs prior to destaining the sample. In some embodiments, the biological sample is stained using an H&E staining method. In some embodiments, the tissue sample is stained and imaged for about 10 minutes to about 2 hours (or any of the subranges of this range described herein). Additional time may be needed for staining and imaging of different types of biological samples.

The tissue sample can be obtained from any suitable location in a tissue or organ of a subject, e.g., a human subject. In some instances, the sample is a human sample. In some instances, the sample is a human breast tissue sample. In some instances, the sample is a human brain tissue sample. In some embodiments, the sample is an embryo sample. The embryo sample can be a non-human embryo sample. In some instances, the sample is a mouse embryo sample.

(B) Exemplary First and Second Substrates

In some instances, the biological sample is placed (e.g., mounted or otherwise immobilized) on a first substrate. The first substrate can be any solid or semi-solid support upon which a biological sample can be mounted. In some instances, the first substrate is a slide. In some instances, the slide is a glass slide. In some embodiments, the substrate is made of glass, silicon, paper, hydrogel, polymer monoliths, or other material known in the art. In some embodiments, the first substrate is comprised of an inert material or matrix (e.g., glass slides) that has been functionalized by, for example, treating the substrate with a material comprising reactive groups which facilitate mounting of the biological sample.

In some embodiments, the first substrate does not comprise a plurality (e.g., array) of capture probes, each comprising a spatial barcode.

A substrate, e.g., a first substrate and/or a second substrate, can generally have any suitable form or format. For example, a substrate can be flat, curved, e.g., convexly or concavely curved. For example, a first substrate can be curved towards the area where the interaction between a biological sample, e.g., tissue sample, and a first substrate takes place. In some embodiments, a substrate is flat, e.g., planar, chip, or slide. A substrate can contain one or more patterned surfaces within the first substrate (e.g., channels, wells, projections, ridges, divots, etc.).

A substrate, e.g., a first substrate and/or second substrate, can be of any desired shape. For example, a substrate can be typically a thin, flat shape (e.g., a square or a rectangle). In some embodiments, a substrate structure has rounded corners (e.g., for increased safety or robustness). In some embodiments, a substrate structure has one or more cut-off corners (e.g., for use with a slide clamp or cross-table). In some embodiments, where the substrate structure is flat, the substrate structure can be any appropriate type of support having a flat surface (e.g., a chip or a slide such as a microscope slide).

First and/or second substrates can optionally include various structures such as, but not limited to, projections, ridges, and channels. A substrate can be micropatterned to limit lateral diffusion of analytes (e.g., to improve resolution of the spatial analysis). A substrate modified with such structures can be modified to allow association of analytes, features (e.g., beads), or probes at individual sites. For example, the sites where a substrate is modified with various structures can be contiguous or non-contiguous with other sites.

In some embodiments, the surface of a first and/or second substrate is modified to contain one or more wells, using techniques such as (but not limited to) stamping, microetching, or molding techniques. In some embodiments in which a first and/or second substrate includes one or more wells, the first substrate can be a concavity slide or cavity slide. For example, wells can be formed by one or more shallow depressions on the surface of the first and/or second substrate. In some embodiments, where a first and/or second substrate includes one or more wells, the wells can be formed by attaching a cassette (e.g., a cassette containing one or more chambers) to a surface of the first substrate structure.

In some embodiments where the first and/or second substrate is modified to contain one or more structures, including but not limited to, wells, projections, ridges, features, or markings, the structures can include physically altered sites. For example, a first and/or second substrate modified with various structures can include physical properties, including, but not limited to, physical configurations, magnetic or compressive forces, chemically functionalized sites, chemically altered sites, and/or electrostatically altered sites. In some embodiments where the first substrate is modified to contain various structures, including but not limited to wells, projections, ridges, features, or markings, the structures are applied in a pattern. Alternatively, the structures can be randomly distributed.

In some embodiments, a first substrate includes one or more markings on its surface, e.g., to provide guidance for aligning at least a portion of the biological sample with a plurality of capture probes on the second substrate during a sandwich process disclosed herein. For example, the first substrate can include a sample area indicator identifying the sample area. In some embodiments, during a sandwiching process described herein the sample area indicator on the first substrate is aligned with an area of the second substrate comprising a plurality of capture probes. In some embodiments, the first and/or second substrate can include a fiducial marker. In some embodiments, the first and/or second substrate does not comprise a fiducial marker. In some embodiments, the first substrate does not comprise a fiducial marker and the second substrate comprises a fiducial marker. Such markings can be made using techniques including, but not limited to, printing, etching, sand-blasting, and depositing on the surface.

In some embodiments, a substrate includes a plurality of beads. For example, a substrate can include a monolayer of beads where each bead occupies a unique position on the substrate. In some instances, the beads can be immobilized on the substrate. The beads can each contain a plurality of capture probes. In some instances, the capture probes on a particular bead have the same barcode, which is unique, and thus differs from the barcodes of capture probes on other beads in the plurality. Thus, the barcode contained by the capture probes on each bead can serve as a spatial barcode that is associated with a distinct position on the substrate.

In some embodiments, imaging can be performed using one or more fiducial markers, e.g., objects placed in the field of view of an imaging system which appear in the image produced. Fiducial markers are typically used as a point of reference or measurement scale. Fiducial markers can include, but are not limited to, detectable labels such as fluorescent, radioactive, chemiluminescent, and colorimetric labels. The use of fiducial markers to stabilize and orient biological samples is described, for example, in Carter et al., Applied Optics 46:421-427, 2007), the entire contents of which are incorporated herein by reference. In some embodiments, a fiducial marker can be a physical particle placed in the field of view (e.g., in an imaging system) which appears or can be located in the image produced (e.g., a nanoparticle, a microsphere, a nanosphere, a bead, a post, or any of the other exemplary physical particles described herein or known in the art).

In some embodiments, a fiducial marker can be present on a first substrate to provide orientation of the biological sample. In some embodiments, a microsphere can be coupled to a first substrate to aid in orientation of the biological sample. In some examples, a microsphere coupled to a first substrate can produce an optical signal (e.g., fluorescence). In some embodiments, a quantum dot can be coupled to the first substrate to aid in the orientation of the biological sample. In some examples, a quantum dot coupled to a first substrate can produce an optical signal.

In some embodiments, a fiducial marker can be an immobilized molecule with which a detectable signal molecule can interact to generate a signal. For example, a fiducial marker nucleic acid can be linked or coupled to a chemical moiety capable of fluorescing when subjected to light of a specific wavelength (or range of wavelengths). Although not required, it can be advantageous to use a fiducial marker that can be detected using the same conditions (e.g., imaging conditions) used to detect a labelled DNA (e.g., cDNA).

In some embodiments, a fiducial marker can be randomly placed in the field of view. For example, an oligonucleotide containing a fluorophore can be randomly printed, stamped, synthesized, or attached to a first substrate (e.g., a glass slide) at a random position on the first substrate. A tissue section can be contacted with the first substrate such that the oligonucleotide containing the fluorophore contacts, or is in proximity to, a cell from the tissue section or a component of the cell (e.g., mRNA, nuclei or DNA molecule). An image of the first substrate and the tissue section can be obtained, and the position of the fluorophore within the tissue section image can be determined (e.g., by reviewing an optical image of the tissue section overlaid with the fluorophore detection). In some embodiments, fiducial markers can be precisely placed in the field of view (e.g., at known locations on a first substrate). In this instance, a fiducial marker can be stamped, attached, etched, or synthesized on the first substrate and contacted with a biological sample. Typically, an image of the biological sample and the fiducial marker is taken, and the position of the fiducial marker on the first substrate can be confirmed by viewing the image.

In some embodiments, a fiducial marker can be an immobilized molecule (e.g., a physical particle) attached to the first substrate. For example, a fiducial marker can be a nanoparticle, e.g., a nanorod, a nanowire, a nanocube, a nanopyramid, or a spherical nanoparticle. In some examples, the nanoparticle can be made of a heavy metal (e.g., gold, titanium). In some embodiments, the nanoparticle can be made from diamond. In some embodiments, the fiducial marker can be visible by eye and without the aid of computerized imaging and/or microscopy.

A wide variety of different first substrates can be used for the foregoing purposes. In general, a first substrate can be any suitable support material. Exemplary first substrates include, but are not limited to, glass, modified and/or functionalized glass, hydrogels, films, membranes, plastics (including e.g., acrylics, polystyrene, copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon™, cyclic olefins, polyimides etc.), nylon, ceramics, resins, Zeonor, silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses, optical fiber bundles, and polymers, such as polystyrene, cyclic olefin copolymers (COCs), cyclic olefin polymers (COPs), polypropylene, polyethylene polycarbonate, or combinations thereof.

Among the examples of first substrate materials discussed above, polystyrene is a hydrophobic material suitable for binding negatively charged macromolecules because it normally contains few hydrophilic groups. For nucleic acids immobilized on glass slides, by increasing the hydrophobicity of the glass surface the nucleic acid immobilization can be increased. Such an enhancement can permit a relatively more densely packed formation (e.g., provide improved specificity and resolution).

In another example, a first substrate can be a flow cell. Flow cells can be formed of any of the foregoing materials, and can include channels that permit reagents, solvents, features, and analytes to pass through the flow cell. In some embodiments, a hydrogel embedded biological sample is assembled in a flow cell (e.g., the flow cell is utilized to introduce the hydrogel to the biological sample). In some embodiments, a hydrogel embedded biological sample is not assembled in a flow cell. In some embodiments, the hydrogel embedded biological sample can then be prepared and/or isometrically expanded as described herein.

Exemplary first substrates (e.g., a substrate having no capture probes) and/or the second substrate are described in Section (I) above and in WO 2020/123320, which is hereby incorporated by reference in its entirety.

(b) Detecting Exogenous Nucleic Acids Using RNA-Templated Ligation

In some embodiments, the methods, devices, compositions, and systems described herein utilize RNA-templated ligation to detect an analyte and/or an exogenous nucleic acid. As used herein, spatial “RNA-templated ligation,” or “RTL” or simply “templated ligation” is a process wherein individual probe oligonucleotides (e.g., a first probe oligonucleotide, a second probe oligonucleotide) in a probe pair hybridize to a first sequence and a second sequence of an analyte (e.g., an RNA molecule), respectively, in a biological sample (e.g., a tissue sample). The RTL probe oligonucleotides are then coupled (e.g., ligated) together, thereby creating a connected probe (e.g., a ligation product). RNA-templated ligation is disclosed in PCT Publ. No. WO 2021/133849 A1 and US Publ. No. US 2021/0285046 A1, each of which is incorporated by reference in its entirety.

An advantage to using RTL is that it allows for enhanced detection of analytes (e.g., low expressing analytes) because both probe oligonucleotides must hybridize to the analyte in order for the coupling (e.g., ligating) reaction to occur. As used herein, “coupling” refers to an interaction between two probe oligonucleotides that results in a single connected probe that comprises the two probe oligonucleotides. In some instances, coupling is achieved through ligation. In some instances, coupling is achieved through extension of one probe oligonucleotide to the second probe oligonucleotide followed by ligation. In some instances, coupling is achieved through hybridization (e.g., using a third probe oligonucleotide that hybridizes to each of the two probe oligonucleotides) followed by extension of one probe oligonucleotide or gap filling of the sequence between the two probe oligonucleotides using the third probe oligonucleotide as a template.

The connected probe (e.g., ligation product) that results from the coupling (e.g., ligation) of the two probe oligonucleotides can serve as a proxy for the target analyte, as such an analyte derived molecule. Further, it is appreciated that probe oligonucleotide pairs can be designed to cover any gene of interest. For example, a pair of probe oligonucleotides can be designed so that each analyte, e.g., a whole exome, a transcriptome, a genome, can conceivably be detected using a probe oligonucleotide pair.

Provided herein are methods, devices, compositions, and systems for detecting an exogenous nucleic acid in a biological sample by utilizing RTL. In some embodiments, a method of detecting an exogenous nucleic acid in a biological sample on a first substrate can include (a) hybridizing a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide to the exogenous nucleic acid, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the exogenous nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain; (b) coupling the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide, thereby generating an exogenous connected probe; (c) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (d) when the biological sample is aligned with at least a portion of the array, (i) releasing the exogenous connected probe from the exogenous nucleic acid and (ii) migrating the exogenous connected probe from the biological sample to the array; and (e) hybridizing the exogenous capture probe binding domain of the exogenous connected probe to the capture domain. In some embodiments, the method further comprises detecting one or more additional exogenous nucleic acids in the biological sample.

As used herein, an “exogenous nucleic acid” refers to a nucleic acid originating outside of an organism or subject. In some embodiments, an exogenous nucleic acid can become integrated into the endogenous genome of the host cell. An exogenous nucleic acid can include nucleic acids from a microorganism (e.g., a virus, a bacterium) that infects the cell, nucleic acids artificially introduced into the cell (e.g., plasmids or nucleic acids derived therefrom), or nucleic acids for gene editing (e.g., CRISPR-related RNA such as crRNA, or guide RNA). In some embodiments, the exogenous nucleic acid comprises viral DNA or viral RNA.

(A) Adeno-Associated Virus (AAV) and Other Non-Integrating Nucleic Acid Delivery Systems

In some embodiments, the exogenous nucleic acid is an adeno-associated virus (AAV) nucleic acid. Adeno-associated virus is the only viral vector system based on a nonpathogenic and replication-defective virus, wherein recombinant AAV virions have been successfully used to establish efficient and sustained gene transfer of both proliferating and terminally differentiated cells in a variety of tissues.

The AAV genome is a linear, single-stranded DNA molecule containing about 4681 nucleotides. The AAV genome generally comprises an internal nonrepeating genome flanked on each end by inverted terminal repeats (ITRs). The ITRs are approximately 145 base pairs (bp) in length. The ITRs have multiple functions, including as origins of DNA replication, and as packaging signals for the viral genome. The internal nonrepeated portion of the genome includes two large open reading frames, known as the AAV replication (rep) and capsid (cap) genes. The rep and cap genes code for viral proteins that allow the virus to replicate and package into a virion. In particular, a family of at least four viral proteins is expressed from the AAV rep region, Rep 78, Rep 68, Rep 52, and Rep 40, named according to their apparent molecular weight. The AAV cap region encodes at least three proteins, VP1, VP2, and VP3.

As used herein, the term “vector” means any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, virus, virion, etc., which is capable of replication when associated with the proper control elements and which can transfer gene sequences between cells. Thus, the term includes cloning and expression vehicles, as well as viral vectors. Similarly, “recombinant expression vector” refers to systems of polynucleotide(s) which operatively encode polypeptides expressible in eukaryotes or prokaryotes. Methods of expressing DNA sequences having eukaryotic or viral sequences in prokaryotes are well known in the art. Biologically functional viral and plasmid DNA vectors capable of expression and replication in a host are also well known in the art. Hosts can include microbial, yeast, insect and mammalian organisms.

The vectors or recombinant expression vectors can be easily manufactured, and the advantages of adenovirus (high titer, high infectivity, large capacity, lack of association with human malignancy) can be combined with the integration capability of AAV, making them particularly suitable for stable gene transfer which is useful in, for example, gene therapy approaches. A further advantage of the described AAV vectors is that, by virtue of containing AAV TR or ITR and D sequences that flank the gene of interest, it is expected that they integrate into cellular chromosomal DNA. Integration is important for stable gene transfer into cells. Another advantage of AAV vectors is that they are packaged efficiently into stable virus particles whether small or large polynucleotides are used. Still, another advantage of AAV vectors is that they are less cytotoxic than first generation adenovirus vectors since no adenovirus genes are expressed within transduced cells.

In some embodiments, the AAV-vector containing virions are “transfected”, which refers to the uptake of foreign DNA by a cell. That is, a cell “transfected” with exogenous DNA and the DNA is introduced inside the cell membrane. A number of transfection techniques are generally known in the art. See, e.g., Graham et al. (1973) Virology, 52:456, Sambrook et al. (1989) Molecular Cloning, a laboratory manual, Cold Spring Harbor Laboratories, New York, Davis et al. (1986) Basic Methods in Molecular Biology, Elsevier, and Chu et al. (1981) Gene 13:197, incorporated herein by reference. Such techniques can be used to introduce one or more exogenous DNA moieties, such as a nucleotide integration vector and other nucleic acid molecules, into suitable host cells.

The term “host cell” or “host” denotes, for example, mammalian cells, that can be, or have been, used as recipients of an AAV helper construct, an AAV vector plasmid, an accessory function vector, or other transfer DNA. In some embodiments, the term includes the progeny of the original cell which has been transfected. Thus, a “host cell” or “host” as used herein generally refers to a cell which has been transfected with an exogenous DNA sequence. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement as the original parent, due to natural, accidental, or deliberate mutation.

In some embodiments, AAV has been engineered to deliver genes of interest by deleting the internal nonrepeating portion of the AAV genome (e.g., the rep and cap genes) and inserting a heterologous gene between the ITRs. The heterologous gene is typically functionally or operatively linked to a heterologous promoter (constitutive, cell-specific, or inducible) capable of driving gene expression in the patient's target cells under appropriate conditions. Termination signals, such as polyadenylation sites, can also be included.

As used herein, the term “AAV vector” means a vector derived from an adeno-associated virus serotype, including without limitation, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, and mutated forms thereof. AAV vectors can have one or more of the AAV wild-type genes deleted in whole or part, the rep and/or cap genes, but retain functional flanking ITR sequences. In some embodiments, the AAV vector is derived from an adeno-associated virus serotype AAV1. Despite the high degree of homology, the different serotypes have tropisms for different tissues. The receptor for AAV1 is unknown; however, AAV1 is known to transduce skeletal and smooth muscle more efficiently than AAV2. Without being bound by theory, since most of the studies have been done with pseudotyped vectors in which the vector DNA flanked with AAV2 ITR is packaged into capsids of alternate serotypes, it is clear that the biological differences are related to the capsid rather than to the genomes. Recent evidence indicates that DNA expression cassettes packaged in AAV1 capsids are at least 1 log 10 more efficient at transducing cardiomyocytes than those packaged in AAV2 capsids.

Accordingly, as used herein, AAV refers to all serotypes of AAV (e.g., 1-9) and mutated forms thereof. Thus, it is routine in the art to use the ITR sequences from other serotypes of AAV since the ITRs of all AAV serotypes are expected to have similar structures and functions with regard to replication, integration, excision and transcriptional mechanisms. In some embodiments, the AAV is any one of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. In some embodiments, the AAV is a recombinant adeno-associated virus (rAAV). In some embodiments, typically, recombinant AAV (rAAV) virus is made by cotransfecting a plasmid containing the gene of interest flanked by the two AAV terminal repeats and/or an expression plasmid containing the wild-type AAV coding sequences without the terminal repeats.

(B) Genome Integrated Exogenous Nucleic Acids

In some embodiments, the exogenous nucleic acid is from a genome integrated nucleic acid. As used herein, a “genome integrated nucleic acid” refers to a nucleic acid that is inserted into the host genome. In some embodiments, the genome integrated nucleic acid can integrate into the host genome by direct gene integration methods or viral integration methods. A number of genome integration techniques are generally known in the art. See, e.g., Desfarges et al. Viruses: Essential Agents of Life. (2012), Tang et al. Nucleic Acids Res. (2020), incorporated herein by reference.

In some embodiments, the genome integrated nucleic acid can integrate into the host genome by direct gene integration. In some embodiments, the genome integrated nucleic acid comprises a nucleotide sequence encoding a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)—CRISPR associated (Cas) (CRISPR-Cas) system guide RNA that hybridizes with the target host sequence.

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)—CRISPR Associated (Cas) (CRISPR-Cas) System

As used herein, the term “CRISPR” refers to clustered regularly interspaced short palindromic repeats pathway, which regulates gene expression at a transcriptional level. As used herein, a “Cas effector” or “CRISPR-associated protein” can refer to an enzyme or protein that uses CRISPR sequences as a guide to recognize and cleave specific nucleic acid strands that are complementary to the CRISPR sequence. A gene-editing Cas effector can associate with a CRISPR RNA sequence to bind to, and alter DNA or RNA target sequences. In some embodiments, the CRISPR-Cas system comprises a gene-editing Cas effector. In some embodiments, the gene-editing Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, a gene-editing Cas effector can be a Cas9 endonuclease that makes a double-stranded break in a target DNA sequence. In some embodiments, a gene-editing Cas effector can be a Cas12a nuclease that also makes a double-stranded break in a target DNA sequence. In some embodiments, a gene-editing Cas effector can be a Cas13 nuclease which targets RNA. In some embodiments, a gene-editing Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the gene-editing Cas effector comprises a nuclease dead Cas9 (dCas9) protein. In some embodiments, the gene-editing Cas effector comprises a Cas13b protein. In some embodiments, the gene-editing Cas effector comprises a Cas13d protein.

As used herein, the term “gRNA” or “guide RNA” refers to the guide RNA sequences used to target specific genes for correction employing the CRISPR technique. Techniques of designing gRNAs and donor therapeutic polynucleotides for target specificity are well known in the art. For example, see e.g., Doench, J., et al. Nature biotechnology 2014; 32(12):1262-7 and Graham, D., et al. Genome Biol. 2015; 16: 260, incorporated herein by reference. The term “Single guide RNA” or “sgRNA” is a specific type of gRNA that combines tracrRNA (transactivating RNA), which binds to Cas9 to activate the complex to create the necessary strand breaks, and crRNA (CRISPR RNA), comprising complimentary nucleotides to the tracrRNA, into a single RNA construct. Exemplary methods of employing the CRISPR technique are described in WO 2017/091630, which is incorporated by reference in its entirety.

In some embodiments, the single guide RNA can recognize a target RNA, for example, by hybridizing to the target RNA. In some embodiments, the single guide RNA comprises a sequence that is complementary to the target RNA. In some embodiments, the sgRNA can include one or more modified nucleotides. In some embodiments, the sgRNA has a length that is about 10 nt (e.g., about 20 nt, about 30 nt, about 40 nt, about 50 nt, about 60 nt, about 70 nt, about 80 nt, about 90 nt, about 100 nt, about 120 nt, about 140 nt, about 160 nt, about 180 nt, about 200 nt, about 300 nt, about 400 nt, about 500 nt, about 600 nt, about 700 nt, about 800 nt, about 900 nt, about 1000 nt, or about 2000 nt).

In some embodiments, a single guide RNA can recognize a variety of RNA targets. For example, a target RNA can be messenger RNA (mRNA), ribosomal RNA (rRNA), signal recognition particle RNA (SRP RNA), transfer RNA (tRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), antisense RNA (aRNA), long noncoding RNA (lncRNA), microRNA (miRNA), piwi-interacting RNA (piRNA), small interfering RNA (siRNA), short hairpin RNA (shRNA), retrotransposon RNA, viral genome RNA, or viral noncoding RNA. In some embodiments, a target RNA can be an RNA involved in pathogenesis of conditions such as cancers, neurodegeneration, cutaneous conditions, endocrine conditions, intestinal diseases, infectious conditions, neurological conditions, liver diseases, heart disorders, or autoimmune diseases. In some embodiments, a target RNA can be a therapeutic target for conditions such as cancers, neurodegeneration, cutaneous conditions, endocrine conditions, intestinal diseases, infectious conditions, neurological conditions, liver diseases, heart disorders, or autoimmune diseases.

Viral Vector Delivery Systems

In some embodiments, an exogenous nucleic acid may be integrated into the genome of a host cell, wherein the integration may be in a specific location and orientation via homologous recombination (gene replacement), or it may be integrated in a random, non-specific location (gene augmentation). In yet further embodiments, the exogenous nucleic acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such polynucleotide segments or “episomes” encode sequences sufficient to permit maintenance and replication independent of or in synchronization with the host cell cycle. The manner in which the expression construct is delivered to a cell and where in the cell the exogenous nucleic acid remains is dependent on the type of expression construct employed.

In some embodiments, the genome integrated nucleic acid is from a virus that integrates into a host genome. In some embodiments, the virus that integrates into the host genome is selected from a retroviral vector, an arenavirus vector, a herpes virus vector, an Epstein-Barr Virus vector, or an adenovirus vector.

In some embodiments, an exogenous nucleic acid is included in a retroviral vector. Retroviruses may be chosen as gene delivery vectors due to their ability to integrate their genes into the host genome, transferring a large amount of foreign genetic material, infecting a broad spectrum of species and cell types and for being packaged in special cell-lines.

The retroviral genome contains three genes, gag, pol, and env that code for capsid proteins, polymerase enzyme, and envelope components, respectively. A sequence found upstream from the gag gene contains a signal for packaging of the genome into virions. Two long terminal repeat (LTR) sequences are present at the 5′ and 3′ ends of the viral genome. These contain strong promoter and enhancer sequences and are also required for integration in the host cell genome.

In order to construct a retroviral vector, a nucleic acid encoding a gene of interest is inserted into the viral genome in the place of certain viral sequences to produce a virus that is replication-defective. In order to produce virions, a packaging cell line is constructed containing the gag, pol, and/or env genes but without the LTR and/or packaging components. When a recombinant plasmid containing a cDNA, together with the retroviral LTR and packaging sequences is introduced into this cell line (by calcium phosphate precipitation for example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be packaged into viral particles, which are then secreted into the culture media. The media containing the recombinant retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral vectors can infect a broad variety of cell types. However, integration and stable expression require the division of host cells.

In some embodiments, the retroviral vector is a lentiviral vector. Lentiviruses are complex retroviruses, which, in addition to the common retroviral genes gag, pol, and env, contain other genes with regulatory or structural function. The higher complexity enables the virus to modulate its life cycle, as in the course of latent infection. Some examples of lentivirus include the Human Immunodeficiency Viruses (HTV 1, REV 2) and the Simian Immunodeficiency Virus (SIV). Lentiviral vectors have been generated by multiply attenuating the HIV virulence genes, for example, the genes env, vif, vpr, vpu and nef are deleted making the vector biologically safe.

Lentiviral vectors are known in the art, see, e.g., U.S. Pat. Nos. 6,013,516 and 5,994,136, both of which are incorporated herein by reference. In general, the vectors are plasmid-based or virus-based, and are configured to carry the essential sequences for incorporating exogenous nucleic acid, for selection and for transfer of the nucleic acid into a host cell. The gag, pol and env genes of the vectors of interest also are known in the art. Thus, the relevant genes are cloned into the selected vector and then used to transform the target cell of interest.

Recombinant lentivirus capable of infecting a non-dividing cell wherein a suitable host cell is transfected with two or more vectors carrying the packaging functions, namely gag, pol and env, as well as rev and tat is described in U.S. Pat. No. 5,994,136, incorporated herein by reference. This describes a first vector that can provide a nucleic acid encoding a viral gag and a pol gene and a second vector that can provide a nucleic acid encoding a viral env to produce a packaging cell. Introducing a vector providing a heterologous gene into that packaging cell yields a producer cell which releases infectious viral particles carrying the foreign gene of interest. In some embodiments, the env is an amphotropic envelope protein which allows transduction of cells of human and other species.

In some embodiments, an exogenous nucleic acid is included in a herpes virus. Because herpes simplex virus (HSV) is neurotropic, it has generated considerable interest in treating certain disorders. Moreover, the ability of HSV to establish latent infections in non-dividing cells without integrating into the host cell chromosome or otherwise altering the host cell's metabolism, along with the existence of a promoter that is active during latency makes HSV an attractive vector. And though much attention has focused on the neurotropic applications of HSV, this vector also can be exploited for other tissues given its wide host range.

Another factor that makes HSV an attractive vector is the size and organization of the genome. Because HSV is large, incorporation of multiple genes or expression cassettes is less problematic than in other smaller viral systems. In addition, the availability of different viral control sequences with varying performance (temporal, strength, etc.) makes it possible to control expression to a greater extent than in other systems. It also is an advantage that the virus has relatively few spliced messages, further easing genetic manipulations.

HSV also is relatively easy to manipulate and can be grown to high titers. Thus, delivery is less of a problem, both in terms of volumes needed to attain sufficient multiplicity of infection (MOI) and in a lessened need for repeat dosing. For a review of HSV as a gene therapy vector, see Glorioso et al., Annu. Rev. Microbiol., 1995; 49:675-710, incorporated herein by reference. Avirulent variants of HSV have been developed and are readily available for use in gene therapy contexts (see, e.g., U.S. Pat. No. 5,672,344, incorporated herein by reference).

In some embodiments, the exogenous nucleic acid comprises RNA from a conserved region of the exogenous nucleic acid sequence. In some embodiments, the exogenous nucleic acid comprises RNA from a conserved region of a viral vector. In some embodiments, the exogenous nucleic acid comprises RNA from a conserved region of an adeno-associated virus (AAV). In some embodiments, the exogenous nucleic acid comprises RNA from a conserved region of a lentivirus.

In some embodiments, the exogenous nucleic acid comprises RNA from a transgene. In some embodiments, the exogenous nucleic acid comprises RNA from a reporter gene. As used herein, a “reporter gene” refers to a gene that can be attached to a sequence of another gene of interest in a cell. Reporter genes include characteristics that are easily identified and measured such that they can be used as selectable markers. In some embodiments, reporter genes can be used as an indication of whether a specific gene has been taken up by or expressed in a cell or organism. In some embodiments, a reporter gene can include green fluorescent protein (GFP), yellow fluorescent protein (YFP), blue fluorescent protein (BFP), cerulean fluorescent protein, cyan fluorescent protein, or red fluorescent protein (RFP). In some embodiments, the reporter gene can include azurite, EBFP, cerulean, mTFP1, AmCyan1, EGFP, emerald, GFP, TagGFP, ZsGreen, mBanana, mCitrine, ZsYellow1, tdTomato, mOrange, TagRFP, AsRed2, JRed, mApple, mCherry, or mRuby. In some embodiments, the reporter gene is selected from the group consisting of green fluorescent protein (GFP), Cerulean, tdTomato, and mCherry. In some embodiments, the reporter gene comprises GFP.

In some embodiments, a method of detecting an exogenous nucleic acid in a biological sample on a first substrate can include (a) hybridizing a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide to the exogenous nucleic acid, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the exogenous nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain; (b) coupling the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide, thereby generating an exogenous connected probe; (c) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (d) when the biological sample is aligned with at least a portion of the array, (i) releasing the exogenous connected probe from the exogenous sequence and (ii) migrating the exogenous connected probe from the biological sample to the array; and (e) hybridizing the exogenous capture probe binding domain of the exogenous connected probe to the capture domain of the capture probe.

In some embodiments, the method further comprises analyzing an analyte in the endogenous genome of the biological sample, wherein the biological sample is on the first substrate, the method comprising: (a) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe binding domain; (b) coupling the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe prior to aligning the first substrate with the second substrate; (c) when the biological sample is aligned with at least a portion of the array, (i) releasing the connected probe from the analyte and (ii) migrating the connected probe from the biological sample to the array; and (d) hybridizing the connected probe to the capture domain.

In some embodiments, the analyte comprises DNA or RNA.

In some embodiments, the exogenous probe oligonucleotide pairs include sequences that are complementary or substantially complementary to an exogenous nucleic acid. For example, in some embodiments, each exogenous probe oligonucleotide includes a sequence that is complementary or substantially complementary to an exogenous nucleic acid (e.g., to a portion of the sequence of an exogenous nucleic acid). In some embodiments, each exogenous nucleic acid includes a first target region and a second target region. In some embodiments, the methods include providing a plurality of first exogenous probe oligonucleotides and a plurality of second exogenous probe oligonucleotides, wherein a pair of exogenous probe oligonucleotides for an exogenous nucleic acid comprises both a first and second exogenous probe oligonucleotide. In some embodiments, a first exogenous probe oligonucleotide hybridizes to a first target region of the exogenous nucleic acid, and the second exogenous probe oligonucleotide hybridizes to a second, adjacent or nearly adjacent target region of the exogenous nucleic acid.

In some instances, the exogenous probe oligonucleotides are DNA molecules. In some instances, the first exogenous probe oligonucleotide is a DNA molecule. In some instances, the second exogenous probe oligonucleotide is a DNA molecule. In some instances, the first exogenous probe oligonucleotide comprises at least two ribonucleic acid bases at the 3′ end. In some instances, the second exogenous probe oligonucleotide comprises a phosphorylated nucleotide at the 5′ end.

In some instances, the first and second target regions of an exogenous nucleic acid are directly adjacent to one another. In some embodiments, the complementary sequences to which the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide hybridize are 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, about 125, or about 150 nucleotides away from each other. Gaps between the exogenous probe oligonucleotides may first be filled prior to coupling (e.g., ligation), using, for example, dNTPs in combination with a polymerase such as polymerase mu, DNA polymerase, RNA polymerase, reverse transcriptase, VENT polymerase, Taq polymerase, and/or any combinations, derivatives, and variants (e.g., engineered mutants) thereof. In some embodiments, when the first and second exogenous probe oligonucleotides are separated from each other by one or more nucleotides, deoxyribonucleotides are used to extend and couple (e.g., ligate) the first and second exogenous probe oligonucleotides.

In some embodiments, the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide are on a contiguous exogenous nucleic acid sequence; and/or where the first probe oligonucleotide and the second probe oligonucleotide are on a contiguous nucleic acid sequence. In some embodiments, the first exogenous probe oligonucleotide is on the 3′ end of the contiguous exogenous nucleic acid sequence; and/or where the first probe oligonucleotide is on the 3′ end of the contiguous nucleic acid sequence. In some embodiments, the second exogenous probe oligonucleotide is on the 5′ end of the contiguous exogenous nucleic acid sequence; and/or the second probe oligonucleotide is on the 5′ end of the contiguous nucleic acid sequence. In some embodiments, any one of the first sequences and second sequences abut one another. In some embodiments, any one of the first sequences and second sequences are at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more nucleotides away from one another.

In some embodiments, the method further comprises generating an extended first exogenous probe oligonucleotide, wherein the extended first exogenous probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first exogenous probe oligonucleotide and the sequence hybridized to the second exogenous probe oligonucleotide; and/or further comprises generating an extended first probe oligonucleotide, wherein the extended first probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first probe oligonucleotide and the sequence hybridized to the second probe oligonucleotide.

In some embodiments, the method further comprises generating an extended second exogenous probe oligonucleotide using a polymerase, wherein the extended second exogenous probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first exogenous probe oligonucleotide and the sequence hybridized to the second exogenous probe oligonucleotide; and/or further comprises generating an extended second probe oligonucleotide using a polymerase, wherein the extended second probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first probe oligonucleotide and the sequence hybridized to the second probe oligonucleotide.

In some embodiments, the method further comprises hybridizing a third exogenous probe oligonucleotide to the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide; and/or further comprises hybridizing a third probe oligonucleotide to the first probe oligonucleotide and the second probe oligonucleotide. In some embodiments, the third exogenous probe oligonucleotide comprises: (i) a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, or 100% complementary to a portion of the first exogenous probe oligonucleotide that hybridizes to the third exogenous probe oligonucleotide; and (ii) a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, or 100% complementary to a portion of the second exogenous probe oligonucleotide that hybridizes to the third exogenous probe oligonucleotide; and/or wherein the third probe oligonucleotide comprises: (iii) a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, or 100% complementary to a portion of the first probe oligonucleotide that hybridizes to the third probe oligonucleotide; and (iv) a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%, or 100% complementary to a portion of the second probe oligonucleotide that hybridizes to the third probe oligonucleotide.

In some embodiments, the method further comprises analyzing a protein in the biological sample, wherein the method includes (a) prior to aligning the first substrate with the second substrate contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte protein, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and an capture handle sequence; (b) when the biological sample is aligned with at least the portion of the array, (i) releasing the capture agent barcode domain from the analyte protein and (ii) passively or actively migrating the capture agent barcode domain from the biological sample to the array; and (c) coupling the capture handle sequence to the capture domain of the capture probe.

In some instances, disclosed herein are methods for analyzing an analyte in a biological sample. In some instances, the biological sample has been transfected with, treated with, or otherwise has encountered an exogenous nucleic acid. The methods disclosed herein can be used to examine “endogenous” host genes as a result of expression of one or more exogenous nucleic acids. For instance, a user could overexpress a gene of interest using an AAV vector, and then can look at host gene expression for changes relative to treatment without AAV vector transfection.

In addition to the methods of detecting exogenous genes disclosed here, the methods can also include methods of detecting endogenous analytes (e.g., nucleic acids). In some instances, the methods of endogenous analyte detection are performed concurrently with methods of detecting exogenous gene expression (e.g., both RTL probe types are added at the same time). For instance, concurrently with methods of detecting exogenous nucleic acids, the methods disclosed herein include (a) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe binding domain; (b) coupling the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe (e.g., a ligation product) comprising the capture probe binding domain; (c) contacting the biological sample with a reagent medium comprising a permeabilization agent and an agent for releasing the connected probe (e.g., a ligation product), thereby (i) permeabilizing the biological sample and (ii) releasing the connected probe (e.g., a ligation product) from the analyte; and (d) hybridizing the capture probe binding domain of the connected probe (e.g., a ligation product) to a capture domain of a capture probe, wherein the capture probe comprises: (i) a spatial barcode and (ii) a capture domain.

Also provided herein are methods for analyzing an analyte in a biological sample on a first substrate including (a) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each include a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide includes a capture probe binding domain; (b) coupling (e.g., ligating) the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe (e.g., a ligation product) including the capture probe binding domain; (c) aligning the first substrate with a second substrate including an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array includes a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes: (i) a spatial barcode and (ii) a capture domain; (d) when the biological sample is aligned with at least a portion of the array, (i) releasing the connected probe (e.g., a ligation product) from the analyte and (ii) passively or actively migrating the connected probe (e.g., a ligation product) from the biological sample to the array; and (e) hybridizing the capture probe binding domain of the connected probe (e.g., a ligation product) to the capture domain.

In some embodiments, the process of transferring the connected probe (e.g., a ligation product) from the first substrate to the second substrate is referred to as a “sandwich” process. The sandwich process is described in PCT Patent Application Publication No. WO 2020/123320, which is incorporated by reference in its entirety. Described herein are methods in which an array with capture probes located on a substrate and a biological sample located on a different substrate, are contacted such that the array is in contact with the biological sample (e.g., the substrates are sandwiched together). In some embodiments, the array and the biological sample can be contacted (e.g., sandwiched), without the aid of a substrate holder. In some embodiments, the array and biological sample substrates can be placed in a substrate holder (e.g., an array alignment device) designed to align the biological sample and the array. For example, the substrate holder can have placeholders for two substrates. In some embodiments, an array including capture probes can be positioned on one side of the substrate holder (e.g., in a first substrate placeholder). In some embodiments, a biological sample can be placed on the adjacent side of the substrate holder in a second placeholder. In some embodiments, a hinge can be located between the two substrate placeholders that allows the substrate holder to close, e.g., make a sandwich between the two substrate placeholders. In some embodiments, when the substrate holder is closed the biological sample and the array with capture probes are contacted with one another under conditions sufficient to allow analytes present in the biological sample to interact with the capture probes of the array. For example, dried permeabilization reagents can be placed on the biological sample and rehydrated. A permeabilization solution can be flowed through the substrate holder to permeabilize the biological sample and allow analytes in the biological sample to interact with the capture probes. Additionally, the temperature of the substrates or permeabilization solution can be used to initiate or control the rate of permeabilization. For example, the substrate including the array, the substrate including the biological sample, or both substrates can be held at a low temperature to slow diffusion and permeabilization efficiency. Once sandwiched, in some embodiments, the substrates can be heated to initiate permeabilization and/or increase diffusion efficiency. Transcripts that are released from the permeabilized tissue can diffuse to the array and be captured by the capture probes. The sandwich can be opened, and further analysis (e.g., cDNA synthesis) can be performed on the array.

In some embodiments, the methods as disclosed herein include hybridizing of one or more probe oligonucleotide probe pairs (e.g., RTL probes) to adjacent or nearby sequences of a target analyte (e.g., RNA; e.g., mRNA) of interest. In some embodiments, the probe oligonucleotide pairs include sequences that are complementary or substantially complementary to an analyte. For example, in some embodiments, each probe oligonucleotide includes a sequence that is complementary or substantially complementary to an mRNA of interest (e.g., to a portion of the sequence of an mRNA of interest). In some embodiments, each target analyte includes a first target region and a second target region. In some embodiments, the methods include providing a plurality of first probe oligonucleotides and a plurality of second probe oligonucleotides, wherein a pair of probe oligonucleotides for a target analyte comprises both a first and second probe oligonucleotide. In some embodiments, a first probe oligonucleotide hybridizes to a first target region of the analyte, and the second probe oligonucleotide hybridizes to a second, adjacent or nearly adjacent target region of the analyte.

In some instances, the probe oligonucleotides are DNA molecules. In some instances, the first probe oligonucleotide is a DNA molecule. In some instances, the second probe oligonucleotide is a DNA molecule. In some instances, the first probe oligonucleotide comprises at least two ribonucleic acid bases at the 3′ end. In some instances, the second probe oligonucleotide comprises a phosphorylated nucleotide at the 5′ end.

RTL probes can be designed using methods known in the art. In some instances, probe pairs are designed to cover an entire transcriptome of a species (e.g., a mouse or a human). In some instances, RTL probes are designed to cover a subset of a transcriptome (e.g., a mouse or a human). In some instances, the methods disclosed herein utilize about 500, about 1000, about 2000, about 3000, about 4000, about 5000, about 6000, about 7000, about 8000, about 9000, about 10,000, about 15,000, about 20,000, or more probe pairs.

In some embodiments, one of the probe oligonucleotides of the pair of probe oligonucleotides for RTL includes a poly(A) sequence or a complement thereof. In some instances, the poly(A) sequence or a complement thereof is on the 5′ end of one of the probe oligonucleotides. In some instances, the poly(A) sequence or a complement thereof is on the 3′ end of one of the probe oligonucleotides. In some embodiments, one probe oligonucleotide of the pair of probe oligonucleotides for RTL includes a degenerate or UMI sequence. In some embodiments, the UMI sequence is specific to a particular target or set of targets. In some instances, the UMI sequence or a complement thereof is on the 5′ end of one of the probe oligonucleotides. In some instances, the UMI sequence or a complement thereof is on the 3′ end of one of the probe oligonucleotides.

In some instances, the first and second target regions of an analyte are directly adjacent to one another. In some embodiments, the complementary sequences to which the first probe oligonucleotide and the second probe oligonucleotide hybridize are 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, about 125, or about 150 nucleotides away from each other. Gaps between the probe oligonucleotides may first be filled prior to coupling (e.g., ligation), using, for example, dNTPs in combination with a polymerase such as polymerase mu, DNA polymerase, RNA polymerase, reverse transcriptase, VENT polymerase, Taq polymerase, and/or any combinations, derivatives, and variants (e.g., engineered mutants) thereof. In some embodiments, when the first and second probe oligonucleotides are separated from each other by one or more nucleotides, deoxyribonucleotides are used to extend and couple (e.g., ligate) the first and second probe oligonucleotides.

In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte on the same transcript. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte on the same exon. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte on different exons. In some instances, the first probe oligonucleotide and the second probe oligonucleotide hybridize to an analyte that is the result of a translocation event (e.g., in the setting of cancer). The methods provided herein make it possible to identify alternative splicing events, translocation events, and mutations that change the hybridization rate of one or both probe oligonucleotides (e.g., single nucleotide polymorphisms, insertions, deletions, point mutations).

In some embodiments, the first and/or second probe as disclosed herein includes at least two ribonucleic acid bases at the 3′ end; a functional sequence; a phosphorylated nucleotide at the 5′ end; and/or a capture probe binding domain. In some embodiments, the functional sequence is a primer sequence.

The “capture probe binding domain” is a sequence that is complementary to a particular capture domain present in a capture probe. In some embodiments, the capture probe binding domain includes a poly(A) sequence. In some embodiments, the capture probe binding domain includes a poly-uridine sequence, a poly-thymidine sequence, or a combination thereof. In some embodiments, the capture probe binding domain includes a random sequence (e.g., a random hexamer or octamer). In some embodiments, the capture probe binding domain is complementary to a capture domain in a capture probe that detects a particular target(s) of interest. In some embodiments, a capture probe binding domain blocking moiety that interacts with the capture probe binding domain is provided. In some embodiments, a capture probe binding domain blocking moiety includes a sequence that is complementary or substantially complementary to a capture probe binding domain. In some embodiments, a capture probe binding domain blocking moiety prevents the capture probe binding domain from binding the capture probe when present. In some embodiments, a capture probe binding domain blocking moiety is removed prior to binding the capture probe binding domain (e.g., present in a connected probe (e.g., a ligation product)) to a capture probe. In some embodiments, a capture probe binding domain blocking moiety comprises a poly-uridine sequence, a poly-thymidine sequence, or a combination thereof.

Hybridization of the probe oligonucleotides to the target analyte can occur at a target having a sequence that is 100% complementary to the probe oligonucleotide(s). In some embodiments, hybridization can occur at a target having a sequence that is at least (e.g. at least about) 80%, at least (e.g. at least about) 85%, at least (e.g. at least about) 90%, at least (e.g. at least about) 95%, at least (e.g. at least about) 96%, at least (e.g. at least about) 97%, at least (e.g. at least about) 98%, or at least (e.g. at least about) 99% complementary to the probe oligonucleotide(s). After hybridization, in some embodiments, the first probe oligonucleotide is extended. After hybridization, in some embodiments, the second probe oligonucleotide is extended. For example, in some instances a first probe oligonucleotide hybridizes to a target sequence upstream for a second oligonucleotide probe, whereas in other instances a first probe oligonucleotide hybridizes to a target sequence downstream of a second probe oligonucleotide.

In some embodiments, methods disclosed herein include a wash step after hybridizing the first and the second probe oligonucleotides. The wash step removes any unbound oligonucleotides and can be performed using any technique known in the art. In some embodiments, a pre-hybridization buffer is used to wash the sample. In some embodiments, a phosphate buffer is used. In some embodiments, multiple wash steps are performed to remove unbound oligonucleotides. For example, it is advantageous to decrease the amount of unhybridized probes present in a biological sample as they may interfere with downstream applications and methods.

In some embodiments, after hybridization of probe oligonucleotides (e.g., first and the second probe oligonucleotides) to the target analyte, the probe oligonucleotides (e.g., the first probe oligonucleotide and the second probe oligonucleotide) are coupled (e.g., ligated) together, creating a single connected probe (e.g., a ligation product) that is complementary to the target analyte. Ligation can be performed enzymatically or chemically, as described herein. For example, the first and second probe oligonucleotides are hybridized to the first and second target regions of the analyte, and the probe oligonucleotides are subjected to a nucleic acid reaction to ligate them together. For example, the probes may be subjected to an enzymatic ligation reaction using a ligase (e.g., T4 RNA ligase (Rnl2), a Chlorella virus DNA ligase, a PBCV-1 DNA ligase, a splintR® ligase, or a T4 DNA ligase). See, e.g., Zhang L., et al.; Archaeal RNA ligase from Thermococcus kodakarensis for template dependent ligation RNA Biol. 2017; 14(1): 36-44 for a description of KOD ligase. A skilled artisan will understand that various reagents, buffers, cofactors, etc. may be included in a ligation reaction depending on the ligase being used.

In some embodiments, the first probe oligonucleotide and the second probe oligonucleotides are a contiguous nucleic acid sequence. In some embodiments, the first probe oligonucleotide is on the 3′ end of the contiguous nucleic acid sequence. In some embodiments, the first probe oligonucleotide is on the 5′ end of the contiguous nucleic acid sequence. In some embodiments, the second probe oligonucleotide is on the 3′ end of the contiguous nucleic acid sequence. In some embodiments, the second probe oligonucleotide is on the 5′ end of the contiguous nucleic acid sequence.

In some embodiments, the method further includes hybridizing a third probe oligonucleotide to the first probe oligonucleotide and the second probe oligonucleotide such that the first probe oligonucleotide and the second probe oligonucleotide abut each other. In some embodiments, the third probe oligonucleotide comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% complementary to a portion of the first probe oligonucleotide that hybridizes to the third probe oligonucleotide. In some embodiments, the third probe oligonucleotide comprises a sequence that is 100% complementary to a portion of the first probe oligonucleotide that hybridizes to the third probe oligonucleotide. In some embodiments, the third probe oligonucleotide comprises a sequence that is at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% complementary to a portion of the second probe oligonucleotide that hybridizes to the third probe oligonucleotide. In some embodiments, the third probe oligonucleotide comprises a sequence that is 100% complementary to a portion of the second probe oligonucleotide that hybridizes to the third probe oligonucleotide.

In some embodiments, a method for identifying a location of an analyte in a biological sample exposed to different permeabilization conditions includes (a) contacting the biological sample with a substrate, wherein the substrate comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises a capture domain; (b) contacting the biological sample with a first probe oligonucleotide and a second probe oligonucleotide, wherein the first probe oligonucleotide and the second probe oligonucleotide are substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe-binding domain that is capable of binding to a capture domain of the capture probe; (c) hybridizing the first probe oligonucleotide and the second probe oligonucleotide to the first sequence and the second sequence of the analyte; (d) coupling (e.g., ligating) the first probe oligonucleotide and the second probe oligonucleotide, thereby creating a connected probe (e.g., a connected probe (e.g., a ligation product)) that is substantially complementary to the analyte; (e) releasing the connected probe (e.g., a ligation product) from the analyte; (f) hybridizing the capture probe-binding domain of the connected probe (e.g., a ligation product) to the hybridization domain of the capture probe; (g) hybridizing a padlock oligonucleotide to the connected probe (e.g., a ligation product) bound to the capture domain (e.g., such that the padlock oligonucleotide is circularized), wherein the padlock oligonucleotide comprises: (i) a first sequence that is substantially complementary to a first portion of the connected probe (e.g., a ligation product), (ii) a backbone sequence, and (iii) a second sequence that is substantially complementary to a second portion of the connected probe (e.g., a ligation product); and (i) ligating and amplifying the circularized padlock oligonucleotide (e.g., using rolling circle amplification using the circularized padlock oligonucleotide as a template), thereby creating an amplified circularized padlock oligonucleotide, and using the amplified circularized padlock oligonucleotide to identify the location of the analyte in the biological sample.

In some embodiments, the method further includes amplifying the connected probe (e.g., a ligation product) prior to the releasing step. In some embodiments, the entire connected probe (e.g., a ligation product) is amplified. In some embodiments, only part of the connected probe (e.g., a ligation product) is amplified. In some embodiments, amplification is isothermal. In some embodiments, amplification is not isothermal. Amplification can be performed using any of the methods described herein such as, but not limited to, a strand-displacement amplification reaction, a rolling circle amplification reaction, a ligase chain reaction, a transcription-mediated amplification reaction, an isothermal amplification reaction, and/or a 10 loop-mediated amplification reaction. In some embodiments, amplifying the connected probe (e.g., a ligation product) creates an amplified connected probe (e.g., a ligation product) that includes (i) all or part of sequence of the connected probe (e.g., a ligation product) specifically bound to the capture domain, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof.

In some embodiments, the method further includes determining (i) all or a part of the sequence of the connected probe (e.g., a ligation product), or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the method further includes using the determined sequences of (i) and (ii) to determine the location and/or abundance of the analyte in the biological sample.

In some embodiments, after coupling (e.g., ligation) of the first and second probe oligonucleotides to create a ligation product, the connected probe (e.g., a ligation product) is released from the analyte. To release the connected probe (e.g., a ligation product), an endoribonuclease (e.g., RNase A, RNase C, RNase H, or RNase I) is used. An endoribonuclease such as RNase H specifically cleaves RNA in RNA:DNA hybrids. In some embodiments, the connected probe (e.g., a ligation product) is released enzymatically. In some embodiments, an endoribonuclease is used to release the probe from the analyte. In some embodiments, the endoribonuclease is one or more of RNase H. In some embodiments, the RNase H is RNase H1 or RNase H2.

In some embodiments, the releasing of the connected probe (e.g., a ligation product) includes contacting the biological sample with a reagent medium comprising a permeabilization agent and an agent for releasing the connected probe (e.g., a ligation product), thereby permeabilizing the biological sample and releasing the connected probe (e.g., a ligation product) from the analyte. In some embodiments, the agent for releasing the connected probe (e.g., a ligation product) comprises a nuclease. In some embodiments, the nuclease is an endonuclease. In some embodiments, the nuclease is an exonuclease. In some embodiments, the nuclease includes an RNase. In some embodiments, the RNase is selected from RNase A, RNase C, RNase H, or RNase I.

In some embodiments, the reagent medium includes a wetting agent.

In some instances, after creation of the connected probe (e.g., a ligation product), the methods disclosed herein include simultaneous treatment of the biological sample with a permeabilization agent such as proteinase K (to permeabilize the biological sample) and a releasing agent such as an endonuclease such as RNase H (to release the connected probe (e.g., a ligation product) from the analyte). In some instances, the permeabilization step and releasing step occur at the same time. In some instances, the permeabilization step occurs before the releasing step. In some embodiments, the permeabilization agent comprises a protease. In some embodiments, the protease is selected from trypsin, pepsin, elastase, or Proteinase K. In some embodiments, the protease is an endopeptidase. Endopeptidases that can be used include but are not limited to trypsin, chymotrypsin, elastase, thermolysin, pepsin, clostripan, glutamyl endopeptidase (GluC), ArgC, peptidyl-asp endopeptidase (ApsN), endopeptidase LysC and endopeptidase LysN. In some embodiments, the endopeptidase is pepsin.

In some embodiments, the reagent medium further includes a detergent. In some embodiments, the detergent is selected from sodium dodecyl sulfate (SDS), sarkosyl, saponin, Triton X-100™, or Tween-20′. In some embodiments, the reagent medium includes less than 5 w/v % of a detergent selected from sodium dodecyl sulfate (SDS) and sarkosyl. In some embodiments, the reagent medium includes as least 5% w/v % of a detergent selected from SDS and sarkosyl. In some embodiments, the reagent medium does not include SDS or sarkosyl.

In some embodiments, the biological sample and the array are contacted with the reagent medium for about 1 to about 60 minutes (e.g., about 1 to about 55 minutes, about 1 to about 50 minutes, about 1 to about 45 minutes, about 1 to about 40 minutes, about 1 to about 35 minutes, about 1 to about 30 minutes, about 1 to about 25 minutes, about 1 to about 20 minutes, about 1 to about 15 minutes, about 1 to about 10 minutes, about 1 to about 5 minutes, about 5 to about 60 minutes, about 5 to about 55 minutes, about 5 to about 50 minutes, about 5 to about 45 minutes, about 5 to about 40 minutes, about 5 to about 35 minutes, about 5 to about 30 minutes, about 5 to about 25 minutes, about 5 to about 20 minutes, about 5 to about 15 minutes, about 5 to about 10 minutes, about 10 to about 60 minutes, about 10 to about 55 minutes, about 10 to about 50 minutes, about 10 to about 45 minutes, about 10 to about 40 minutes, about 10 to about 35 minutes, about 10 to about 30 minutes, about 10 to about 25 minutes, about 10 to about 20 minutes, about 10 to about 15 minutes, about 15 to about 60 minutes, about 15 to about 55 minutes, about 15 to about 50 minutes, about 15 to about 45 minutes, about 15 to about 40 minutes, about 15 to about 35 minutes, about 15 to about 30 minutes, about 15 to about 25 minutes, about 15 to about 20 minutes, about 20 to about 60 minutes, about 20 to about 55 minutes, about 20 to about 50 minutes, about 20 to about 45 minutes, about 20 to about 40 minutes, about 20 to about 35 minutes, about 20 to about 30 minutes, about 20 to about 25 minutes, about 25 to about 60 minutes, about 25 to about 55 minutes, about 25 to about 50 minutes, about 25 to about 45 minutes, about 25 to about 40 minutes, about 25 to about 35 minutes, about 25 to about 30 minutes, about 30 to about 60 minutes, about 30 to about 55 minutes, about 30 to about 50 minutes, about 30 to about 45 minutes, about 30 to about 40 minutes, about 30 to about 35 minutes, about 35 to about 60 minutes, about 35 to about 55 minutes, about 35 to about 50 minutes, about 35 to about 45 minutes, about 35 to about 40 minutes, about 40 to about 60 minutes, about 40 to about 55 minutes, about 40 to about 50 minutes, about 40 to about 45 minutes, about 45 to about 60 minutes, about 45 to about 55 minutes, about 45 to about 50 minutes, about 50 to about 60 minutes, about 50 to about 55 minutes, or about 55 to about 60 minutes). In some embodiments, the biological sample and the array are contacted with the reagent medium for about 30 minutes.

In some embodiments, the connected probe (e.g., a ligation product) includes a capture probe binding domain, which can hybridize to a capture probe (e.g., a capture probe immobilized, directly or indirectly, on a substrate). In some embodiments, the capture probe includes a spatial barcode and the capture domain. In some embodiments, the capture probe binding domain of the connected probe (e.g., a ligation product) specifically binds to the capture domain of the capture probe.

In some embodiments, methods provided herein include mounting a biological sample on a first substrate, then aligning the first substrate with a second substrate including an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array includes a plurality of capture probes. After hybridization of the connected probe (e.g., a ligation product) to the capture probe, downstream methods as disclosed herein can be performed.

In some embodiments, at least 50% of connected probes (e.g., a ligation products) released from the portion of the biological sample aligned with the portion of the array are captured by capture probes of the portion of the array. In some embodiments, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% of connected probe (e.g., a ligation products) are detected in features directly under the biological sample.

In some embodiments, the capture probe includes a poly(T) sequence. In some embodiments, the capture probe includes a sequence specific to the analyte. In some embodiments, the capture probe includes a functional domain. In some embodiments, the capture probe further includes one or more functional domains, a unique molecular identifier (UMI), a cleavage domain, and combinations thereof. In some embodiments, the capture probe binding domain includes a poly(A) sequence. In some embodiments, the capture probe binding domain includes a sequence complementary to a capture domain of a capture probe that detects a target analyte of interest. In some embodiments, the analyte is RNA. In some embodiments, the analyte is mRNA.

In some embodiments, the connected probe (e.g., a ligation product) (e.g., the analyte derived molecule) includes a capture probe binding domain, which can hybridize to a capture probe (e.g., a capture probe immobilized, directly or indirectly, on a substrate). Methods provided herein include contacting a biological sample with a substrate, wherein the capture probe is affixed to the substrate (e.g., immobilized to the substrate, directly or indirectly). After hybridization of the connected probe (e.g., a ligation product) to the capture probe, downstream methods as disclosed herein (e.g., sequencing, in situ analysis such as RCA) can be performed.

In some embodiments, the method further includes analyzing a different analyte in the biological sample. In some embodiments, the analysis of the different analyte includes (a) further contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents includes an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the different analyte, and wherein the capture agent barcode domain includes an analyte binding moiety barcode and an capture handle sequence that is complementary to a capture domain of a capture probe; and (b) hybridizing the analyte capture sequence to the capture domain.

An exemplary embodiment of a workflow for analysis of protein and RNA analytes is shown in FIG. 13A. As shown in FIG. 13A, a fixed tissue sample mounted on a first substrate (e.g., a slide-mounted tissue sample) is decrosslinked, followed by hybridization of probe pairs to nucleic acid analyte analytes. Also as shown in FIG. 13A, a first and second probe of a probe pair is connected, e.g., ligated. The sample is optionally washed (e.g., with a buffer), prior to incubation with an analyte capture agent (e.g., an antibody) that specifically binds a different analyte, e.g., a protein analyte. The analyte capture agent comprises a capture agent barcode domain. In some embodiments, the analyte capture agent is an antibody with an oligonucleotide tag, the oligonucleotide tag comprising a capture agent barcode domain. In some embodiments, the connected probes (e.g., the ligation products) and antibody oligonucleotide tags are released from the tissue under sandwich conditions as described herein. For the sandwich conditions, the tissue-mounted slide can be aligned with an array and permeabilized with a reagent medium in the sandwich configuration as described herein (see, e.g., FIG. 13B). In some embodiments, the reagent medium comprises RNase and a permeabilization agent (e.g., Proteinase K). Permeabilization releases the connected probe (e.g., a ligation product) and capture agent barcode domain, for capture onto a second substrate comprising an array with a plurality of capture probes (see, e.g., FIG. 13B). After capture of the connected probe and capture agent barcode domain, the tissue slide can be removed (e.g., the sandwich can be “opened” or “broken”).

In some embodiments, following opening of the sandwich, the capture probes can be extended, sequencing libraries can be prepared and sequenced, and the results can be analyzed computationally.

In some embodiments, the method further includes determining (i) all or part of the sequence of the capture agent barcode domain; and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the method further includes using the determined sequences of (i), and (ii) to analyze the different analyte in the biological sample. In some embodiments, the releasing step further releases the capture agent barcode domain from the different analyte. In some embodiments, the different analyte is a protein analyte. In some embodiments, the protein analyte is an extracellular protein. In some embodiments, the protein analyte is an intracellular protein.

In another embodiment of the present disclosure, a set of RTL probes is added to a biological sample, e.g., a biological sample mounted on a first substrate, where the RTL probes hybridize to exogenous nucleic acid molecules in the biological sample. After ligating the RTL probes in the biological sample (e.g., as described herein), the capture probe having—in some instances—a capture domain, a UMI, a spatial barcode, and a primer—is released (e.g., cleaved) and migrates to the biological sample. In some instances, only one substrate is used. In some instances, the release occurs with the addition of the first reagent medium. In some embodiments, when the first substrate is aligned with a second substrate comprising an array of capture probes such that at least a portion of the biological sample is aligned with at least a portion of the array, the biological sample (or portion thereof) and the array are contacted with a first reagent medium. In some embodiments, the capture probe is released from the array of capture probes (e.g., during the contacting with the first reagent medium). Then, the released (e.g., cleaved) capture probe is free to migrate to the biological sample. The released capture probe penetrates the biological sample and interacts with the ligation product formed from the RTL probes in the biological sample. For example, the capture domain (e.g., poly(T) tail) of the released capture probe may hybridize to the connected probe (e.g., ligation product) in the biological sample. Then, the biological sample is contacted with a second reagent medium. The second reagent medium may comprise an RNAse which degrades the exogenous nucleic acid molecule hybridized to the connected probe (e.g., ligation product), leaving a singled-stranded DNA molecule (e.g., the connected probe) hybridized to the released capture probe. The second reagent medium may permeabilize the biological sample, thus releasing the connected probe/capture probe hybridization product into the second reagent medium for collection and bulk processing. Such bulk processing may comprise, e.g., an extension step, generating a copy of a connected probe having components of the capture probe (e.g., primer, spatial barcode, capture domain) or complements thereof, and the connected probe. The extended product may be denatured for further evaluation (e.g., sequencing).

In some instances, release of the capture probes can be performed by different methods. For instance, in some embodiments, a capture probe comprises a cleavage domain. In some embodiments, the cleavage domain comprises nucleotides with photo-sensitive chemical bonds; an ultrasonic cleavage domain; one or more labile chemical bonds; a sequence that is recognized by one or more enzymes capable of cleaving a nucleic acid molecule; a poly(U) sequence capable of being cleaved by a mixture of Uracil DNA glycosylase (UDG) and the DNA glycosylase-lyase Endonuclease VIII; one or more disulfide bonds; or any combination thereof. In some instances, removal of capture probes from the second substrate can be performed physically (e.g., by light or heat), chemically, enzymatically, or any combination thereof.

In some embodiments, the capture probe is released from the surface of the substrate (e.g., array) by physical means. Methods for disrupting the interaction between nucleic acid molecules are known in the art. A straightforward method for releasing the DNA molecules (e.g., of stripping the array of capture probes) is to use a solution (e.g., a first reagent medium) that interferes with the bonds between nucleotides. In some embodiments, the capture probe is released by heating the first reagent medium to at least 85° C., e.g., at least 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99° C. In some embodiments, the first reagent medium includes salts, surfactants, etc. that can further destabilize the interaction between the array and the capture probe.

(d) Capturing Analytes for Spatial Detection Using Analyte Capture Agents

In some embodiments, the methods, compositions, devices, and systems provided herein utilize analyte capture agents for spatial detection concurrently with detection of exogenous nucleic acids. An “analyte capture agent” refers to a molecule that interacts with a target analyte (e.g., a protein) and with a capture probe. Such analyte capture agents can be used to identify the analyte. In some embodiments, the analyte capture agent can include an analyte binding moiety and a capture agent barcode domain. In some embodiments, the analyte capture agent includes a linker. In some embodiments, the linker is a cleavable linker. In some embodiments, the cleavable linker is a pH-sensitive cleavable linker, a photo-cleavable linker, a UV-cleavable linker, a disulfide linker, or an enzyme cleavable linker.

An analyte binding moiety is a molecule capable of binding to a specific analyte. In some embodiments, the analyte binding moiety comprises an antibody or antibody fragment (e.g., an antigen binding fragment of an antibody). In some embodiments, the analyte binding moiety comprises a polypeptide and/or an aptamer. In some embodiments, the analyte is a protein (e.g., a protein on a surface of a cell or an intracellular protein).

A capture agent barcode domain can include a capture handle sequence which can hybridize to at least a portion or an entirety of a capture domain of a capture probe. In some embodiments, the capture handle sequence is complementary to a portion or entirety of a capture domain of a capture probe. In some embodiments, the capture handle sequence includes a poly (A) tail. In some embodiments, the capture handle sequence includes a sequence capable of binding a poly (T) domain. In some embodiments, the capture agent barcode domain comprises an analyte binding moiety barcode and a capture handle sequence. The analyte binding moiety barcode refers to a barcode that is associated with or otherwise identifies the analyte binding moiety, and the capture handle sequence can hybridize to a capture probe. In some embodiments, the capture handle sequence specifically binds to the capture domain of the capture probe. Other embodiments of an analyte capture agent useful in spatial analyte detection are described herein.

Provided herein are methods for analyzing an analyte (e.g., protein) in a biological sample including (a) contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents includes an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte (e.g., protein), and wherein the capture agent barcode domain includes an analyte binding moiety barcode and an capture handle sequence; (b) contacting the biological sample with a reagent medium including an agent for releasing the capture agent barcode domain from the analyte binding moiety, thereby releasing the capture agent barcode domain from the analyte binding moiety; and (c) hybridizing the capture handle sequence to a capture domain of a capture probe, wherein the capture probe includes (i) a spatial barcode and (ii) a capture domain.

Also provided herein are methods for analyzing an analyte in a biological sample on a first substrate including (a) contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents includes an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte, and wherein the capture agent barcode domain includes an analyte binding moiety barcode and an capture handle sequence; (b) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array includes a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes (i) a spatial barcode and (ii) a capture domain; (c) when the biological sample is aligned with at least a portion of the array, (i) releasing the capture agent barcode domain from the analyte and (ii) passively or actively migrating the capture agent barcode domain from the biological sample to the array; and (d) coupling the capture handle sequence to the capture domain.

Also provided herein are methods for analyzing an analyte in a biological sample mounted on a first substrate including (a) contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents includes an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte, and wherein the capture agent barcode domain includes an analyte binding moiety barcode and an capture handle sequence; (b) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array includes a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes (i) a spatial barcode and (ii) a capture domain; (c) when the biological sample is aligned with at least a portion of the array, (i) releasing the capture agent barcode domain from the analyte and (ii) passively or actively migrating the capture agent barcode domain from the biological sample to the array; and (d) hybridizing the capture handle sequence to the capture domain.

It is appreciated that any of the methods of protein detection described in this section can be performed concurrently with both RTL detection of exogenous nucleic acids and RTL detection of endogenous nucleic acids.

In some embodiments, the process of transferring the connected probe (e.g., a ligation product) from the first substrate to the second substrate is referred to as a “sandwich process”. The sandwich process is described above and in PCT Patent Application Publication No. WO 2020/123320, which is incorporated by reference in its entirety.

In some embodiments, the method further includes determining (i) all or a part of a sequence of the capture agent barcode domain, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the method further includes using the determined sequences of (i) and (ii) to determine the location and abundance of the analyte in the biological sample.

In some embodiments, an analyte capture agent is introduced to a biological sample, wherein the analyte binding moiety specifically binds to a target analyte, and then the biological sample can be treated to release the capture agent barcode domain from the biological sample. In some embodiments, the capture agent barcode domain can then migrate and bind to a capture domain of a capture probe, and the capture agent barcode domain can be extended to generate a spatial barcode complement at the end of the capture agent barcode domain. In some embodiments, the spatially-tagged capture agent barcode domain can be denatured from the capture probe, and analyzed using methods described herein.

In some embodiments, the releasing includes contacting the biological sample and the array with a reagent medium including a nuclease. In some embodiments, the nuclease includes an RNase. In some embodiments, the RNase is selected from RNase A, RNase C, RNase H, and RNase I. In some embodiments, the reagent medium further includes a permeabilization agent. In some embodiments, the releasing further includes simultaneously permeabilizing the biological sample and releasing the capture agent barcode domain from the analyte. In some embodiments, the permeabilization agent further includes a protease. In some embodiments, the protease is selected from trypsin, pepsin, elastase, or Proteinase K.

In some embodiments, the capture agent barcode domain is released from the analyte binding moiety by using a different stimulus that can include, but is not limited to, a proteinase (e.g., Proteinase K), an RNase, and UV light.

In some embodiments, the reagent medium further includes a detergent. In some embodiments, the detergent is selected from sodium dodecyl sulfate (SDS), sarkosyl, saponin, Triton X-100™, or Tween-20™. In some embodiments, the reagent medium includes less than 5 w/v % of a detergent selected from sodium dodecyl sulfate (SDS) and sarkosyl. In some embodiments, the reagent medium includes as least 5% w/v % of a detergent selected from SDS and sarkosyl. In some embodiments, the reagent medium does not include SDS or sarkosyl.

Also provided herein are methods further including analyzing a different analyte in the biological sample. In some embodiments, the analysis of the different analyte includes (a) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the different analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the different analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe binding domain; (b) ligating the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe (e.g., a ligation product) comprising the capture probe binding domain; and (c) hybridizing the capture probe binding domain of the connected probe (e.g., a ligation product) to the capture domain.

In some embodiments, the method further includes determining (i) all or part of the sequence of the connected probe (e.g., a ligation product), or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the method further includes using the determined sequences of (i), and (ii) to analyze the different analyte in the biological sample. In some embodiments, the releasing step further releases the connected probe (e.g., a ligation product) from the different analyte. In some embodiments, the different analyte is RNA. In some embodiments, the different analyte is mRNA.

In some embodiments, the capture probe comprises a poly(T) sequence. In some embodiments, the capture probe comprises a sequence complementary to the capture handle sequence. In some embodiments, the capture probe comprises a functional domain. In some embodiments, the capture probe further comprises one or more functional domains, a unique molecular identifier (UMI), a cleavage domain, and combinations thereof.

In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a tissue section. In some embodiments, the tissue sample is a fixed tissue sample. In some embodiments, the fixed tissue sample is a formalin fixed paraffin embedded (FFPE) tissue sample. In some embodiments, the FFPE tissue is deparaffinized and decrosslinked prior to step (a) of any one of the methods provided herein. In some embodiments, the fixed tissue sample is a formalin fixed paraffin embedded cell pellet. In some embodiments, the tissue sample is a fresh tissue sample or a frozen tissue sample. In some embodiments, the tissue sample is fixed and stained prior to step (a) of any one of the methods provided herein.

In some instances, RTL is performed between two oligonucleotides that each are affixed to an analyte binding moiety (e.g., a protein-binding moiety). In some embodiments, provided herein is a method of determining a location of at least one analyte in a biological sample including: (a) hybridizing a first analyte-binding moiety to a first analyte in the biological sample, wherein the first analyte-binding moiety is bound to a first oligonucleotide, wherein the first oligonucleotide comprises: (i) a functional sequence; (ii) a first barcode; and (iii) a first bridge sequence; (b) hybridizing a second analyte-binding moiety to a second analyte in the biological sample, wherein the second analyte-binding moiety is bound to a second oligonucleotide; wherein the second oligonucleotide comprises: (i) capture probe binding domain sequence, (ii) a second barcode; and (ii) a second bridge sequence; (c) contacting the biological sample with a third oligonucleotide; (d) hybridizing the third oligonucleotide to the first bridge sequence of the first oligonucleotide and second bridge sequence of the second oligonucleotide; (e) ligating the first oligonucleotide and the second oligonucleotide, creating a connected probe (e.g., a ligation product); (f) contacting the biological sample with a substrate, wherein a capture probe is affixed to the substrate, wherein the capture probe comprises a spatial barcode and the capture domain; and (g) allowing the capture probe binding domain sequence of the second oligonucleotide to specifically bind to the capture domain. In some instances, the connected probe (e.g., a ligation product) is cleaved from the analyte binding moieties.

In some instances, two analytes (e.g., two different proteins) in close proximity in a biological sample are detected by a first analyte-binding moiety and a second analyte-binding moiety, respectively. In some embodiments, a first analyte-binding moiety and/or the second analyte-binding moiety is an analyte capture agent (e.g., any of the exemplary analyte capture agents described herein). In some embodiments, the first analyte-binding moiety and/or the second analyte-binding moiety is a first protein. In some embodiments, the first analyte-binding moiety and/or the second analyte-binding moiety is an antibody. For example, the antibody can include, without limitation, a monoclonal antibody, recombinant antibody, synthetic antibody, a single domain antibody, a single-chain variable fragment (scFv), and or an antigen-binding fragment (Fab). In some embodiments, the first analyte-binding moiety binds to a cell surface analyte (e.g., any of the exemplary cell surface analytes described herein). In some embodiments, binding of the analyte is performed metabolically. In some embodiments, binding of the analyte is performed enzymatically. In some embodiments, the methods include a secondary antibody that binds to a primary antibody, enhancing its detection.

In some embodiments, the first analyte-binding moiety and the second analyte-binding moiety each bind to the same analyte. In some embodiments, the first analyte-binding moiety and/or second analyte-binding moiety each bind to a different analyte. For example, in some embodiments, the first analyte-binding moiety binds to a first polypeptide and the second analyte-binding moiety binds to a second polypeptide.

In some embodiments of any of the methods of determining a location of at least one analyte in a biological sample, a first and/or a second oligonucleotide are bound (e.g., conjugated or otherwise attached using any of the methods described herein) to a first analyte-binding moiety and/or a second analyte-binding moiety, respectively.

In some embodiments of any of the methods of determining a location of at least one analyte in a biological sample as described herein, a second oligonucleotide is bound (e.g., conjugated or otherwise attached using any of the methods described herein) to a second analyte-binding moiety. For example, the second oligonucleotide can be covalently linked to the second analyte-binding moiety. In some embodiments, the second oligonucleotide is bound to the second analyte-binding moiety via its 5′ end. In some embodiments, the second oligonucleotide includes a free 3′ end. In some embodiments the second oligonucleotide is bound to the second analyte-binding moiety via its 3′ end. In some embodiments, the second oligonucleotide includes a free 5′ end.

In some embodiments, the oligonucleotides are bound to the first and/or second analyte-binding moieties via a linker (e.g., any of the exemplary linkers described herein). In some embodiments, the linker is a cleavable linker. In some embodiments, the linker is a linker with photo-sensitive chemical bonds (e.g., photo-cleavable linkers). In some embodiments, the linker is a cleavable linker that can undergo induced dissociation.

FIGS. 14A-14B show exemplary schematics illustrating methods to release the capture agent barcode domain from the analyte binding moiety of an analyte capture agent. FIG. 14A shows the use of an RNase cleavable linker to release a capture agent barcode domain from an analyte binding moiety of an analyte capture agent. In particular, a target (indicated by the circle in FIG. 14A interacts with an antibody. After an interaction is established, an enzyme (e.g., an RNAse) cleaves a linker, freeing a barcode from the antibody. The barcode, which is specific to the antibody, is captured on a slide comprising a plurality of probes. Similarly, FIG. 14B shows the use of a UV cleavable linker to release a capture agent barcode domain from an analyte binding moiety of an analyte capture agent.

In some embodiments, the oligonucleotides are bound (e.g., attached via any of the methods described herein) to an analyte-binding domain via a 5′ end.

In some embodiments, a barcode is used to identify the analyte-binding moiety to which it is bound. The barcode can be any of the exemplary barcodes described herein. In some embodiments, the first and/or second oligonucleotide include a capture probe binding domain sequence. For example, a capture probe binding domain sequence can be a poly(A) sequence when the capture domain sequence is a poly(T) sequence.

In some embodiments, a third oligonucleotide (e.g., a splint oligonucleotide) hybridizes to both the first and second oligonucleotides and enables ligation of the first oligonucleotide and the second oligonucleotide. In some embodiments, a ligase is used. In some aspects, the ligase includes a DNA ligase. In some aspects, the ligase includes a RNA ligase. In some aspects, the ligase includes T4 DNA ligase. In some embodiments, the ligase is a SplintR® ligase. In some embodiments, the ligase is a Chlorella virus DNA ligase. In some embodiments, the ligase is a PBCV-1 DNA ligase.

In another embodiment of the present disclosure, a set of analyte capture agents is added to a biological sample, e.g., a biological sample mounted on a first substrate, where the analyte capture agents bind to exogenous analytes (e.g., proteins) in the biological sample. In this method, the capture probe on the substrate having—in some instances—a capture domain, a UMI, a spatial barcode, and a primer—is released (e.g., cleaved) and migrates to the biological sample. In some instances, only one substrate is used. In some instances, the release occurs with the addition of the first reagent medium. In some embodiments, when the first substrate is aligned with a second substrate comprising an array of capture probes such that at least a portion of the biological sample is aligned with at least a portion of the array, the biological sample (or portion thereof) and the array are contacted with a first reagent medium. In some embodiments, the capture probe is released from the array of capture probes (e.g., during the contacting with the first reagent medium). Then, the released (e.g., cleaved) capture probe is free to migrate to the biological sample. The released capture probe penetrates the biological sample and interacts with (e.g., hybridizes to) the oligonucleotide (e.g., the capture handle sequence) in the analyte capture agent in the biological sample. For example, the capture domain (e.g., poly(T) tail) of the released capture probe may hybridize to the capture handle sequence in the biological sample. Then, in some instances, the biological sample is contacted with a second reagent medium. The second reagent medium may permeabilize the biological sample, thus releasing the capture handle sequence/capture probe hybridization product into the second reagent medium for collection and bulk processing. Such bulk processing may comprise, e.g., an extension step, generating a copy of a capture handle sequence having components of the capture probe (e.g., primer, spatial barcode, capture domain) or complements thereof, and the capture handle sequence. The extended product may be denatured for further evaluation (e.g., sequencing).

(d) Sandwich Processes

In some embodiments, one or more analytes from the biological sample are released from the biological sample and migrate to a substrate comprising an array of capture probes for attachment to the capture probes of the array. In some embodiments, the release and migration of the analytes to the substrate comprising the array of capture probes occurs in a manner that preserves the original spatial context of the analytes in the biological sample. In some embodiments, the biological sample is mounted on a first substrate and the substrate comprising the array of capture probes is a second substrate. In some embodiments, the alignment of the first substrate and the second substrate is facilitated by a sandwiching process. Accordingly, described herein are methods, compositions, devices, and systems for sandwiching together the first substrate as described herein with a second substrate having an array with capture probes.

In some embodiments, a method described herein includes a sandwiching process between a first substrate comprising a biological sample (e.g., a tissue section on a slide) and a second substrate comprising a spatially barcoded array, e.g., a slide that is populated with spatially-barcoded capture probes. During the sandwiching process, the first substrate is aligned with the second substrate, such that at least a portion of the biological sample is aligned with at least a portion of the array (e.g., aligned in a sandwich configuration). As shown, the second substrate is in a superior position to the first substrate. In some embodiments, the first substrate may be positioned superior to the second substrate. In some embodiments, the first and second substrates are aligned to maintain a gap or separation distance between the two substrates. When the first and second substrates are aligned, one or more analytes are released from the biological sample and actively or passively migrate to the array for capture. In some embodiments, the migration occurs while the aligned portions of the biological sample and the array are contacted with a reagent medium. The released one or more analytes may actively or passively migrate across the gap via the reagent medium toward the capture probes, and be captured by the capture probes.

In some embodiments, the separation distance between first and second substrates is maintained between 2 microns and 1 mm (e.g., between 2 microns and 800 microns, between 2 microns and 700 microns, between 2 microns and 600 microns, between 2 microns and 500 microns, between 2 microns and 400 microns, between 2 microns and 300 microns, between 2 microns and 200 microns, between 2 microns and 100 microns, between 2 microns and 25 microns, between 2 microns and 10 microns), measured in a direction orthogonal to the surface of first substrate that supports sample. In some embodiments, the separation distance 2307 between first and second substrates is less than 50 microns. In some instances, the distance is 2 microns. In some instances, the distance is 2.5 microns. In some instances, the distance is about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 microns. In some embodiments, second substrate is placed in direct contact with the sample on the first substrate ensuring no diffusive spatial resolution losses. In some embodiments, the separation distance is measured in a direction orthogonal to a surface of the first substrate that supports the biological sample.

In some embodiments, the sandwiching process may be facilitated by a device, sample holder, sample handling apparatus, or system described in, e.g., US. Patent Application Pub. No. 20210189475, WO 2021/252747, or WO 2022/061152.

In some embodiments, the first and second substrates are placed in a substrate holder (e.g., an array alignment device) configured to align the biological sample and the array. In some embodiments, the device comprises a sample holder. In some embodiments, the sample holder includes a first member and a second member that receive a first substrate and a second substrate, respectively. The device can include an alignment mechanism that is connected to at least one of the members and aligns the first and second members. Thus, the devices of the disclosure can advantageously align the first substrate and the second substrate and any samples, barcoded probes, or permeabilization reagents that may be on the surface of the first and second substrates.

In some embodiments, the sandwiching process comprises: mounting the first substrate on a first member of a support device, the first member configured to retain the first substrate; mounting the second substrate on a second member of the support device, the second member configured to retain the second substrate, applying a reagent medium to the first substrate and/or the second substrate, the reagent medium comprising a permeabilization agent, operating an alignment mechanism of the support device to move the first member and/or the second member such that a portion of the biological sample is aligned (e.g., vertically aligned) with a portion of the array of capture probes and within a threshold distance of the array of capture probes, and such that the portion of the biological sample and the capture probe contact the reagent medium, wherein the permeabilization agent releases the analyte from the biological sample.

In some embodiments of a sample holder, the sample holder can include a first member including a first retaining mechanism configured to retain a first substrate comprising a sample. The first retaining mechanism can be configured to retain the first substrate disposed in a first plane. The sample holder can further include a second member including a second retaining mechanism configured to retain a second substrate disposed in a second plane. The sample holder can further include an alignment mechanism connected to one or both of the first member and the second member. The alignment mechanism can be configured to align the first and second members along the first plane and/or the second plane such that the sample contacts at least a portion of the reagent medium when the first and second members are aligned and within a threshold distance along an axis orthogonal to the second plane. The alignment mechanism may be configured to move the second member along the axis orthogonal to the second plane and/or move the first member along an axis orthogonal to the first plane.

In some embodiments, the alignment mechanism includes a linear actuator. In some embodiments, the alignment mechanism includes one or more of a moving plate, a bushing, a shoulder screw, a motor bracket, and a linear actuator. The moving plate may be coupled to the first member or the second member. The alignment mechanism may, in some cases, include a first moving plate coupled to the first member and a second moving plate coupled to the second member. In some embodiments, the linear actuator is configured to move the second member along an axis orthogonal to the plane of the first member and/or the second member. For example, the moving plate may be coupled to the second member and adjust the separation distance along a z axis (e.g., orthogonal to the second substrate) by moving the moving plate up in a superior direction toward the first substrate. In some embodiments, the linear actuator is configured to move the first member along an axis orthogonal to the plane of the first member and/or the second member. The movement of the moving plate may be accomplished by the linear actuator configured to move the first member and/or the second member at a velocity. The velocity may be controlled by a controller communicatively coupled to the linear actuator. In some embodiments, the linear actuator is configured to move the first member, the second member, or both the first member and the second member at a velocity of at least 0.1 mm/sec (e.g., at least 0.1 mm/sec to 2 mm/sec). In some aspects, the velocity may be selected to reduce or minimize bubble generation or trapping within the reagent medium. In some embodiments, the linear actuator is configured to move the first member, the second member, or both the first member and the second member with an amount of force of at least 0.1 lbs (e.g., between 0.1-4.0 pounds of force).

In some aspects, the velocity of the moving plate (e.g., closing the sandwich) may affect bubble generation or trapping within the reagent medium. It may be advantageous to minimize bubble generation or trapping within the reagent medium during the “sandwiching” process, as bubbles can interfere with the migration of analytes through the reagent medium to the array. In some embodiments, the closing speed is selected to minimize bubble generation or trapping within the reagent medium. In some embodiments, the closing speed is selected to reduce the time it takes the flow front of the reagent medium from an initial point of contact with the first and second substrate to sweep across the sandwich area (also referred to herein as “closing time”). In some embodiments, the closing speed is selected to reduce the closing time to less than about 1100 milliseconds (ms). In some embodiments, the closing speed is selected to reduce the closing time to less than about 1000 ms. In some embodiments, the closing speed is selected to reduce the closing time to less than about 900 ms. In some embodiments, the closing speed is selected to reduce the closing time to less than about 750 ms. In some embodiments, the closing speed is selected to reduce the closing time to less than about 600 ms. In some embodiments, the closing speed is selected to reduce the closing time to about 550 ms or less. In some embodiments, the closing speed is selected to reduce the closing time to about 370 ms or less. In some embodiments, the closing speed is selected to reduce the closing time to about 200 ms or less. In some embodiments, the closing speed is selected to reduce the closing time to about 150 ms or less.

Analytes within a biological sample may be released through disruption (e.g., permeabilization, digestion, etc.) of the biological sample or may be released without disruption. Various methods of permeabilizing (e.g., any of the permeabilization reagents and/or conditions described herein) a biological sample are described herein, including for example including the use of various detergents, buffers, proteases, and/or nucleases for different periods of time and at various temperatures. Additionally, various methods of delivering fluids (e.g., a buffer, a permeabilization solution) to a biological sample are described herein including the use of a substrate holder (e.g., for sandwich assembly, sandwich configuration, as described herein) Provided herein are methods for delivering a fluid to a biological sample disposed on an area of a first substrate and an array disposed on a second substrate.

In some embodiments, the sandwich configuration described herein between a first substrate comprising a biological sample and a second substrate comprising a spatially barcoded array (e.g., slide with barcoded capture probes) may include a reagent medium (e.g., a liquid reagent medium, e.g., a permeabilization solution or other target molecule release and capture solution) to fill a gap. It may be desirable that the reagent medium be free from air bubbles between the slides to facilitate transfer of target molecules with spatial information. Additionally, air bubbles present between the slides may obscure at least a portion of an image capture of a desired region of interest. Accordingly, it may be desirable to ensure or encourage suppression and/or elimination of air bubbles between the two substrates during a permeabilization step.

In some aspects, it may be possible to reduce or eliminate bubble formation between the slides using a variety of filling methods and/or closing methods.

Workflows described herein may include contacting a drop of the reagent medium (e.g., liquid reagent medium, e.g., a permeabilization solution) disposed on a first substrate or a second substrate with at least a portion of the second substrate or first substrate, respectively. In some embodiments, the contacting comprises bringing the two substrates into proximity such that the sample on the first substrate is aligned with the barcode array of capture probes on the second substrate.

In some embodiments, the drop includes permeabilization reagents (e.g., any of the permeabilization reagents described herein). In some embodiments, the rate of permeabilization of the biological sample is modulated by delivering the permeabilization reagents (e.g., a fluid containing permeabilization reagents) at various temperatures.

In the exemplary sandwich workflows described herein, the reagent medium (e.g., liquid reagent medium, permeabilization solution 2305) may fill a gap (e.g., the gap 2307) between a first substrate (e.g., slide 2303) and a second substrate (e.g., slide 2304 with barcoded capture probes 2306) to warrant or enable transfer of target molecules with spatial information. Described herein are examples of filling methods that may suppress bubble formation and suppress undesirable flow of transcripts and/or target molecules or analytes. Robust fluidics in the sandwich making described herein may preserve spatial information by reducing or preventing deflection of molecules as they move from the tissue slide to the capture slide.

An exemplary sandwiching process where a first substrate (e.g., slide), including a biological sample (e.g., a tissue section), and a second substrate (e.g., slide including spatially barcoded capture probes) are brought into proximity with one another. In some instances, a liquid reagent drop (e.g., permeabilization solution) is introduced on the second substrate in proximity to the capture probes and in between the biological sample and the second substrate (e.g., slide including spatially barcoded capture probes). The permeabilization solution may release analytes that can be captured by the capture probes of the array. As further shown, one or more spacers may be positioned between the first substrate (e.g., slide) and the second substrate (e.g., slide including spatially barcoded capture probes). The one or more spacers may be configured to maintain a separation distance between the first substrate and the second substrate. While the one or more spacers is shown as disposed on the second substrate, the spacer may additionally or alternatively be disposed on the first substrate.

In some embodiments, the one or more spacers is configured to maintain a separation distance between first and second substrates that is between about 2 microns and 1 mm (e.g., between about 2 microns and 800 microns, between about 2 microns and 700 microns, between about 2 microns and 600 microns, between about 2 microns and 500 microns, between about 2 microns and 400 microns, between about 2 microns and 300 microns, between about 2 microns and 200 microns, between about 2 microns and 100 microns, between about 2 microns and 25 microns, or between about 2 microns and 10 microns), measured in a direction orthogonal to the surface of first substrate that supports the sample. In some instances, the separation distance is about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 microns. In some embodiments, the separation distance is less than 50 microns. In some embodiments, the separation distance is less than 25 microns. In some embodiments, the separation distance is less than 20 microns. The separation distance may include a distance of at least 2 μm.

In some instances, also provided herein is a fully formed sandwich configuration creating a chamber formed from the one or more spacers, the first substrate (e.g., the slide), and the second substrate (e.g., the slide including spatially barcoded capture probes) in accordance with some example implementations. In some instances, the liquid reagent (e.g., the permeabilization solution) fills the volume of the chamber and may create a permeabilization buffer that allows analytes, RTL ligation products, and analyte capture agents to diffuse from the biological sample toward the capture probes of the second substrate (e.g., slide). In some aspects, flow of the permeabilization buffer may deflect transcripts and/or molecules from the biological sample and may affect diffusive transfer of analytes for spatial analysis. A partially or fully sealed chamber resulting from the one or more spacers, the first substrate, and the second substrate may reduce or prevent flow from undesirable convective movement of transcripts and/or molecules over the diffusive transfer from the biological sample to the capture probes.

In some instances, the first substrate and the second substrate are arranged in an angled sandwich assembly as described herein. For example, during the sandwiching of the two substrates (e.g., the slide and the array slide), an angled closure workflow may be used to suppress or eliminate bubble formation.

Further details on angled closure workflows, and devices and systems for implementing an angled closure workflow, are described in WO 2021/252747 and WO 2022/061152, which are hereby incorporated by reference in their entirety.

Additional configurations for reducing or eliminating bubble formation, and/or for reducing unwanted fluid flow, are described in WO 2021/252747, which is hereby incorporated by reference in its entirety.

In some embodiments, the reagent medium comprises a permeabilization agent. Suitable agents for this purpose include, but are not limited to, organic solvents (e.g., acetone, ethanol, and methanol), cross-linking agents (e.g., paraformaldehyde), detergents (e.g., saponin, Triton X-100™, Tween-20™, or sodium dodecyl sulfate (SDS)), and enzymes (e.g., trypsin, proteases (e.g., proteinase K). In some embodiments, the detergent is an anionic detergent (e.g., SDS or N-lauroylsarcosine sodium salt solution). Exemplary permeabilization reagents are described in in PCT Patent Application Publication No. WO 2020/123320, which is incorporated by reference in its entirety.

In some embodiments, the reagent medium comprises a protease. Exemplary proteases include, e.g., pepsin, trypsin, pepsin, elastase, and proteinase K. Exemplary proteases are described in PCT Patent Application Publication No. WO 2020/123320, which is incorporated by reference in its entirety.

In some embodiments, the reagent medium comprises a detergent. Exemplary detergents include sodium dodecyl sulfate (SDS), sarkosyl, saponin, Triton X-100™, and Tween-20™. Exemplary detergents are described in PCT Patent Application Publication No. WO 2020/123320, which is incorporated by reference in its entirety.

In some embodiments, the reagent medium comprises a nuclease. In some embodiments, the nuclease comprises an RNase. In some embodiments, the RNase is selected from RNase A, RNase C, RNase H, and RNase I. In some embodiments, the reagent medium comprises one or more of sodium dodecyl sulfate (SDS), proteinase K, pepsin, N-lauroylsarcosine, RNAse, and a sodium salt thereof.

The sample holder is compatible with a variety of different schemes for contacting the aligned portions of the biological sample and array with the reagent medium to promote analyte capture. In some embodiments, the reagent medium is deposited directly on the second substrate (e.g., forming a reagent medium that includes the permeabilization reagent and the feature array), and/or directly on the first substrate. In some embodiments, the reagent medium is deposited on the first and/or second substrate, and then the first and second substrates aligned in the sandwich configuration such that the reagent medium contacts the aligned portions of the biological sample and array. In some embodiments, the reagent medium is introduced into the gap while the first and second substrates are aligned in the sandwich configuration.

In certain embodiments a dried permeabilization reagent is applied or formed as a layer on the first substrate or the second substrate or both prior to contacting the sample and the feature array. For example, a reagent can be deposited in solution on the first substrate or the second substrate or both and then dried. Drying methods include, but are not limited to, spin coating a thin solution of the reagent and then evaporating a solvent included in the reagent or the reagent itself. Alternatively, in other embodiments, the reagent can be applied in dried form directly onto the first substrate or the second substrate or both. In some embodiments, the coating process can be done in advance of the analytical workflow and the first substrate and the second substrate can be stored pre-coated. Alternatively, the coating process can be done as part of the analytical workflow. In some embodiments, the reagent is a permeabilization reagent. In some embodiments, the reagent is a permeabilization enzyme, a buffer, a detergent, or any combination thereof. In some embodiments, the permeabilization enzyme is pepsin. In some embodiments, the reagent is a dried reagent (e.g., a reagent free from moisture or liquid). In some instances, the substrate that includes the sample (e.g., a histological tissue section) is hydrated. The sample can be hydrated by contacting the sample with a reagent medium, e.g., a buffer that does not include a permeabilization reagent. In some embodiments, the hydration is performed while the first and second substrates are aligned in a sandwich configuration.

In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium for about 1 minute. In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium for about 5 minutes. In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium in the gap for about 1 minute, about 5 minutes, about 10 minutes, about 12 minutes, about 15 minutes, about 18 minutes, about 20 minutes, about 25 minutes, about 30 minutes, about 36 minutes, about 45 minutes, or about an hour. In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium for about 1-60 minutes. In some instances, the aligned portions of the biological sample and the array are in contact with the reagent medium for about 30 minutes.

In some embodiments, following initial contact between sample and a permeabilization agent, the permeabilization agent can be removed from contact with sample (e.g., by opening sample holder) before complete permeabilization of sample. For example, in some embodiments, only a portion of sample is permeabilized, and only a portion of the analytes in sample may be captured by feature array. In some instances, the reduced amount of analyte captured and available for detection can be offset by the reduction in lateral diffusion that results from incomplete permeabilization of sample. In general, the spatial resolution of the assay is determined by the extent of analyte diffusion in the transverse direction (e.g., orthogonal to the normal direction to the surface of sample). The larger the distance between the sample on the first substrate and the feature array on the second substrate, the greater the extent of diffusion in the transverse direction, and the concomitant loss of resolution. Analytes liberated from a portion of the sample closest to the feature array have a shorter diffusion path, and therefore do not diffuse as far laterally as analytes from portions of the sample farthest from the feature array. As a result, in some instances, incomplete permeabilization of the sample (by reducing the contact interval between the permeabilization agent and the sample) can be used to maintain adequate spatial resolution in the assay.

In some instances, the device is configured to control a temperature of the first and second substrates. In some embodiments, the temperature of the first and second members is lowered to a first temperature that is below room temperature (e.g., 25 degrees Celsius) (e.g., 20 degrees Celsius or lower, 15 degrees Celsius or lower, 10 degrees Celsius or lower, 5 degrees Celsius or lower, 4 degrees Celsius or lower, 3 degrees Celsius or lower, 2 degrees Celsius or lower, 1 degree Celsius or lower, 0 degrees Celsius or lower, −1 degrees Celsius or lower, −5 degrees Celsius or lower). In some embodiments, the device includes a temperature control system (e.g., heating and cooling conducting coils) to control the temperature of the sample holder. Alternatively, in other embodiments, the temperature of the sample holder is controlled externally (e.g., via refrigeration or a hotplate). In some embodiments, the first substrate is contacted with the second member which is at the first temperature, and the second substrate is contacted with the first member which is also at the first temperature, thereby lowering the temperature of the first substrate and the second substrate to a second temperature. In some embodiments, the second temperature is equivalent to the first temperature. In some embodiments, the first temperature is lower than room temperature (e.g., 25 degrees Celsius). In some embodiments, the second temperature ranges from about −10 degrees Celsius to about 4 degrees Celsius. In some embodiments, the second temperature is below room temperature (e.g., 25 degrees Celsius) (e.g., 20 degrees Celsius or lower, 15 degrees Celsius or lower, 10 degrees Celsius or lower, 5 degrees Celsius or lower, 4 degrees Celsius or lower, 3 degrees Celsius or lower, 2 degrees Celsius or lower, 1 degree Celsius or lower, 0 degrees Celsius or lower, −1 degrees Celsius or lower, −5 degrees Celsius or lower).

In an exemplary embodiment, the second substrate is contacted with the permeabilization reagent. In some embodiments, the permeabilization reagent is dried. In some embodiments, the permeabilization reagent is a gel or a liquid. Also in the exemplary embodiment, the biological sample is contacted with buffer. Both the first and second substrates are placed at lower temperature to slow down diffusion and permeabilization efficiency. Alternatively, in some embodiments, the sample can be contacted directly with a liquid permeabilization reagent without inducing an unwanted initiation of permeabilization due to the substrates being at the second temperature. In some embodiments, the low temperature slows down or prevents the initiation of permeabilization. In a second step, keeping the sample holder and substrates at a cold temperature (e.g., at the first or second temperatures) continues to slow down or prevent the permeabilization of the sample. In a third step, the sample holder (and consequently the first and second substrates) is heated up to initiate permeabilization. In some embodiments, the sample holder is heated up to a third temperature. In some embodiments, the third temperature is above room temperature (e.g., 25 degrees Celsius) (e.g., 30 degrees Celsius or higher, 35 degrees Celsius or higher, 40 degrees Celsius or higher, 50 degrees Celsius or higher, 60 degrees Celsius or higher). In some embodiments, analytes that are released from the permeabilized tissue of the sample diffuse to the surface of the second substrate and are captured on the array (e.g., barcoded probes) of the second substrate. In a fourth step, the first substrate and the second substrate are separated (e.g., pulled apart) and temperature control is stopped.

In some embodiments, where either the first substrate or substrate second (or both) includes wells, a permeabilization solution can be introduced into some or all of the wells, and then the sample and the feature array can be contacted by closing the sample holder to permeabilize the sample. In certain embodiments, a permeabilization solution can be soaked into a hydrogel film that is applied directly to the sample, and/or soaked into features (e.g., beads) of the array. When the first and second substrates are aligned in the sandwich configuration, the permeabilization solution promotes migration of analytes from the sample to the array.

In certain embodiments, different permeabilization agents or different concentrations of permeabilization agents can be infused into array features (e.g., beads) or into a hydrogel layer as described above. By locally varying the nature of the permeabilization reagent(s), the process of analyte capture from the sample can be spatially adjusted.

In some instances, migration of the analyte from the biological sample to the second substrate is passive (e.g., via diffusion). Alternatively, in certain embodiments, migration of the analyte from the biological sample is performed actively (e.g., electrophoretic, by applying an electric field to promote migration). In some instances, first and second substrates can include a conductive epoxy. Electrical wires from a power supply can connect to the conductive epoxy, thereby allowing a user to apply a current and generate an electric field between the first and second substrates. In some embodiments, electrophoretic migration results in higher analyte capture efficiency and better spatial fidelity of captured analytes (e.g., on a feature array) than random diffusion onto matched substrates without the application of an electric field (e.g., via manual alignment of the two substrates). Exemplary methods of electrophoretic migration, including those illustrated in FIGS. 7-11C, are described in WO 2020/176788, which is hereby incorporated by reference in its entirety.

Loss of spatial resolution can occur when analytes migrate from the sample to the feature array and a component of diffusive migration occurs in the transverse (e.g., lateral) direction, approximately parallel to the surface of the first substrate on which the sample is mounted. To address this loss of resolution, in some embodiments, a permeabilization agent deposited on or infused into a material with anisotropic diffusion can be applied to the sample or to the feature array. The first and second substrates are aligned by the sample holder and brought into contact. A permeabilization layer that includes a permeabilization solution infused into an anisotropic material is positioned on the second substrate.

In some embodiments, the feature array can be constructed atop a hydrogel layer infused with a permeabilization agent. The hydrogel layer can be mounted on the second substrate, or alternatively, the hydrogel layer itself may function as the second substrate. When the first and second substrates are aligned, the permeabilization agent diffuses out of the hydrogel layer and through or around the feature array to reach the sample. Analytes from the sample migrate to the feature array. Direct contact between the feature array and the sample helps to reduce lateral diffusion of the analytes, mitigating spatial resolution loss that would occur if the diffusive path of the analytes was longer.

Spatial analysis workflows can include a sandwiching process described herein. In some embodiments, the workflow includes provision of the first substrate comprising the biological sample. In some embodiments, the workflow includes, mounting the biological sample onto the first substrate. In some embodiments wherein the biological sample is a tissue sample, the workflow includes sectioning of the tissue sample (e.g., cryostat sectioning). In some embodiments, the workflow includes a fixation step. In some instances, the fixation step can include fixation with methanol. In some instances, the fixation step includes formalin (e.g., 2% formalin).

In some embodiments, the biological sample on the first substrate is stained using any of the methods described herein. In some instances, the biological sample is imaged, capturing the stain pattern created during the stain step. In some instances, the biological sample then is destained prior to the sandwiching process.

The biological sample can be stained using known staining techniques, including, without limitation, Can-Grunwald, Giemsa, hematoxylin and eosin (H&E), hematoxylin, Jenner's, Leishman, Masson's trichrome, Papanicolaou, Romanowsky, silver, Sudan, Wright's, and/or Periodic Acid Schiff (PAS) staining techniques. PAS staining is typically performed after formalin or acetone fixation. In some embodiments, the biological sample can be stained using a detectable label (e.g., radioisotopes, fluorophores, chemiluminescent compounds, bioluminescent compounds, and dyes) as described elsewhere herein. In some embodiments, a biological sample is stained using only one type of stain or one technique. In some embodiments, staining includes biological staining techniques such as H&E staining. In some embodiments, staining includes biological staining using hematoxylin. In some embodiments, staining includes identifying analytes using fluorescently-conjugated antibodies, e.g., by immunofluorescence. In some embodiments, a biological sample is stained using two or more different types of stains, or two or more different staining techniques. For example, a biological sample can be prepared by staining and imaging using one technique (e.g., H&E staining and brightfield imaging), followed by staining and imaging using another technique (e.g., IHC/IF staining and fluorescence microscopy) for the same biological sample. In some instances, a biological sample on the first substrate is stained.

In some instances, methods for immunofluorescence include a blocking step. The blocking step can include the use of blocking probes to decrease unspecific binding of the antibodies. The blocking step can optionally further include contacting the biological sample with a detergent. In some instances, the detergent can include Triton X-100™. The method can further include an antibody incubation step. In some embodiments, the antibody incubation step effects selective binding of the antibody to antigens of interest in the biological sample. In some embodiments, the antibody is conjugated to an oligonucleotide (e.g., an oligonucleotide-antibody conjugate as described herein). In some embodiments, the antibody is not conjugated to an oligonucleotide. In some embodiments, the method further comprises an antibody staining step. The antibody staining step can include a direct method of immunostaining in which a labelled antibody binds directly to the analyte being stained for. Alternatively, the antibody staining step can include an indirect method of immunostaining in which a first antibody binds to the analyte being stained for, and a second, labelled antibody binds to the first antibody. In some embodiments, the antibody staining step is performed prior to sandwich assembly. In some embodiments wherein an oligonucleotide-antibody conjugate is used in the antibody incubation step, the method does not comprise an antibody staining step.

In some instances, the methods include imaging the biological sample. In some instances, imaging occurs prior to sandwich assembly. In some instances, imaging occurs while the sandwich configuration is assembled. In some instances, imaging occurs during permeabilization of the biological sample. In some instances, images are captured using high resolution techniques (e.g., having 300 dots per square inch (dpi) or greater). For example, images can be captured using brightfield imaging (e.g., in the setting of hematoxylin or H&E stain), or using fluorescence microscopy to detect adhered labels. In some instances, high resolution images are captured temporally using e.g., confocal microscopy. In some instances, a low resolution image is captured. A low resolution image (e.g., images that are about 72 dpi and normally have an RGB color setting) can be captured at any point of the workflow, including but not limited to staining, destaining, permeabilization, sandwich assembly, and migration of the analytes. In some instances, a low resolution image is taken during permeabilization of the biological sample.

In some embodiments, the location of the one or more analytes in a biological sample are determined by immunofluorescence. In some embodiments, one or more detectable labels (e.g., fluorophore-labeled antibodies) bind to the one or more analytes that are captured (hybridized to) by a probe on the first slide and the location of the one or more analytes is determined by detecting the labels under suitable conditions. In some embodiments, one or more fluorophore-labeled antibodies are used to conjugate to a moiety that associates with a probe on the first slide or the analyte that is hybridized to the probe on the first slide. In some instances, the location(s) of the one or more analytes is determined by imaging the fluorophore-labeled antibodies when the fluorophores are excited by a light of a suitable wavelength. In some embodiments, the location of the one or more analytes in the biological sample is determined by correlating the immunofluorescence data to an image of the biological sample. In some instances, the tissue is imaged throughout the permeabilization step.

In some instances, the biological samples can be destained. In some instances, destaining occurs prior to permeabilization of the biological sample. By way of example only, H&E staining can be destained by washing the sample in HCl. In some instances, the hematoxylin of the H&E stain is destained by washing the sample in HCl. In some embodiments, destaining can include 1, 2, 3, or more washes in HCl. In some embodiments, destaining can include adding HCl to a downstream solution (e.g., permeabilization solution).

Between any of the methods disclosed herein, the methods can include a wash step (e.g., with SSC (e.g., 0.1×SSC)). Wash steps can be performed once or multiple times (e.g., 1×, 2×, 3×, between steps disclosed herein). In some instances, wash steps are performed for about 10 seconds, about 15 seconds, about 20 seconds, about 30 seconds, or about a minute. In some instances, three washes occur for 20 seconds each. In some instances, the wash step occurs before staining the sample, after destaining the sample, before permeabilization the sample, after permeabilization the sample, or any combination thereof.

In some instances, after the sandwiching process the first substrate and the second substrate are separated (e.g., such that they are no longer aligned in a sandwich configuration, also referred to herein as opening the sandwich). In some embodiments, subsequent analysis (e.g., cDNA synthesis, library preparation, and sequences) can be performed on the captured analytes after the first substrate and the second substrate are separated.

In some embodiments, the process of transferring the ligation product or methylated-adaptor-containing nucleic acid from the first substrate to the second substrate is referred to interchangeably herein as a “sandwich process,” “sandwiching process,” or “sandwiching”. The sandwich process is further described in PCT Patent Application Publication No. WO 2020/123320, WO 2021/252747, and WO 2022/061152, which are incorporated by reference in its entirety.

(e) Use of Multiplexed Sandwich Workflows

This disclosure also provides methods, compositions, devices, and systems for using a single capture probe to detect analytes from different biological samples (e.g., tissues) on different slides using serial sandwich processes. Thus, only one capture probe-containing array is necessary for the methods disclosed herein. In this way, as described herein, analytes from different samples or tissues can be captured serially and demultiplexed by sample-specific index sequences.

The methods include generating a connected probe (e.g., a ligation product) in multiple biological samples (e.g., a first sample, a second sample, a third sample, etc.). Generation of a connected probe (e.g., a ligation product) has been described above, and, the same methods are used herein to generate a connected probe (e.g., a ligation product) from analytes that are either protein analytes or nucleic acid (e.g., mRNA) analytes. That is, in some instances, the multiplexed sandwich maker methods disclosed herein can be used to detect protein analytes. In other instances, the multiplexed sandwich maker methods disclosed herein can be used to detect nucleic acid (e.g., mRNA) analytes.

The methods, compositions, devices, and systems include utilizing an analyte capture agent in multiple biological samples (e.g., a first sample, a second sample, a third sample, etc.). Using analyte capture agents for spatial detection has been described above, and, the same methods are used herein to use an analyte capture agent to identify analytes in a biological sample. In some embodiments, the multiplexed sandwich workflows disclosed herein can be used to detect protein analytes.

As discussed below, each connected probe (e.g., a ligation product) that is generated or analyte capture agent includes a sample index sequence, which is a nucleotide sequence that is associated with a particular sample of origin in the multiplex sandwich methods. After generation of each connected probe (e.g., a ligation product) or analyte capture agent, each sample is serially sandwiched to an array or slide having a plurality of capture probes that can detect and hybridize to a capture probe binding domain from the connected probe (e.g., a ligation product) or analyte capture agent. During the sandwiching process, the indexed connected probe or analyte capture agent actively or passively migrates from the sample to the array for capture by a capture probe. Then the sandwich is opened, and the next sample is sandwiched with the array. In some embodiments, the array is washed prior to sandwiching with the next sample. Additional samples or tissues (e.g., 2 or more) can then be sandwiched with the array or slide having a plurality of capture probes, wherein connected probes (e.g., ligation products) or analyte capture agents from the additional samples or tissues can be transferred to the array in a similar manner. Because each sample includes a unique sample index, the sample of origin for each connected probe (e.g., a ligation product) or analyte capture agent that is captured on the array can be identified. In addition, the location of the connected probe (e.g., a ligation product) can be identified. In some embodiments, the location of the analyte capture agent can be identified. In some instances, the location is identified using fiducial markers on the gene expression slide (e.g., spatial array) so that location of the ligation probe on the array mirrors the location of the sample on the sample slide. Exemplary fiducial markers are described in PCT Patent Application Publication No. WO 2020/123320, which is incorporated by reference in its entirety.

Such methods, compositions, devices, and systems allow for detection of analytes in multiple samples using only one gene expression slide and can be performed on any slide sample described herein. For example, in some instances, the biological sample is a formalin-fixed, paraffin-embedded (FFPE) sample, a frozen sample, or a fresh sample. In some instances, the biological sample is contacted with one or more stains. In some instances, the one or more stains include hematoxylin and eosin. In some instances, cell markers are detected using methods known in the art (e.g., using one or more optical labels such as fluorescent labels, radioactive labels, chemiluminescent labels, calorimetric labels, or colorimetric labels. In some instances, the biological sample is imaged before generating a connected probe (e.g., a ligation product) and before transferring the connected probe (e.g., a ligation product) to the gene expression slide.

The multiplex sandwich methods, compositions, devices, and systems described herein allow for detection of different types of samples and different analytes. For example in some instances, the samples used in the multiplex sandwich methods are from different species. In some instances, the samples used in the multiplex sandwich methods are from the same species but different individuals in the same species. In some instances, the samples used in the multiplex sandwich methods are from the same individual organism. In some instances, the samples are from different tissues or cell types. In some instances, the samples are from the same tissues or cell types. In some instances, the samples are from the same subject taken at different time points (e.g., before and after therapeutic treatment). It is appreciated that the samples can be from any source so long as ligated products having sample index sequences are unique to each sample are generated.

Multiple samples can be used in the methods described herein. For example, in some instances, at least two samples are used. In some instances, more than two samples (e.g., at least 3, 4, 5, 6, 7, 8, 9, 10, or more) samples are used in the methods disclosed herein. It is appreciated that each sample can be from different sources (e.g., different species, different organisms). In some embodiments, from each sample, the same gene is detected and identified. In some embodiments, for each sample, different genes are detected and identified.

In order to differentiate one sample from another, probe oligonucleotide for each sample in a multiplexed setting can include one or more unique sequences to identify the origin of the connected probe (e.g., a ligation product). In some instances, the unique sequence is a sample index sequence. In some instances, probe oligonucleotides for each sample include one or more (e.g., at least 1, 2, 3, 4, 5, or more) unique sample index sequence to identify the origin of the connected probe (e.g., a ligation product).

In some instances, the sample index is about 5 nucleotides to about 50 nucleotides long (e.g., about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50) nucleotides long. In some embodiments, the sample index is about 5-15 nucleotides long. In some embodiments, the sample index is about 10-12 nucleotides long. Both synthetic and/or naturally-occurring nucleotides can be used to generate a sample index sequence. It is appreciated that any sequence can be designed so long as it is unique among other sample index sequences and optionally that it can be distinguished from any sequence in the genome of the sample.

A sample index sequence can be located anywhere on the connected probe (e.g., a ligation product) so long as it does not affect (1) hybridization of the probe oligonucleotides to the analyte, (2) ligation of the probe oligonucleotides to generate the connected probe (e.g., a ligation product), and (3) hybridization of the capture probe binding domain to the capture probe on an array. For example, in some instances, the sample index sequence can be located on the first probe oligonucleotide (e.g., the left hand probe; e.g., as shown in FIG. 12A). In some instances, the sample index is located on the flap of the first probe oligonucleotide that does not hybridize to the analyte. In some instances, the sample index sequence can be located on the second probe oligonucleotide (e.g., the right hand probe; e.g., as shown in FIG. 12A). In some instances, the sample index is located on the flap of the second probe oligonucleotide that does not hybridize to the analyte.

(f) Systems and Kits

Also disclosed herein are systems and kits used for any one of the methods disclosed herein. In some instances, the system of kit is used for analyzing an analyte in a biological sample. In some instances, the system or kit includes a support device configured to retain a first substrate and a second substrate, wherein the biological sample is placed on the first substrate, and wherein the second substrate comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain. In some instances, the system or kit includes a first probe oligonucleotide and a second probe oligonucleotide, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, wherein the second probe oligonucleotide comprises a capture probe binding domain, and wherein the first probe oligonucleotide and the second probe oligonucleotide are capable of being ligated together to form a connected probe. In some instances, the system or kit includes a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and a capture handle sequence. In some instances, the system or kit further includes a reagent medium comprising a permeabilization agent and optionally an agent for releasing the connected probe. In some instances, the system or kit includes instructions for performing any one of the methods described herein.

In some instances, the permeabilization agent is pepsin or proteinase K. In some instances, the agent for releasing the connected probe is an RNAse, optionally wherein the RNAse is RNAse H.

In some instances, the system or kit further includes an alignment mechanism on the support device to align the first substrate and the second substrate. In some instances, the alignment mechanism comprises a linear actuator and the first substrate comprises a first member and the second substrate comprises a second member. The linear actuator can be configured to move the second member along an axis orthogonal to the plane of the first member and/or the second member. The linear actuator can be configured to move the first member along an axis orthogonal to the plane of the first member and/or the second member. The linear actuator can be configured to move the first member, the second member, or both the first member and the second member at a velocity of at least 0.1 mm/sec. Finally, in some instances, the linear actuator can be configured to move the first member, the second member, or both the first member and the second member with an amount of force of at least 0.1 lbs.

EXAMPLES
Example 1—Methods for Capturing Connected Probes with Sandwich Process

In a non-limiting example, mouse brain and mouse kidney FFPE sections on standard slides (for sandwich conditions) or gene expression (GEx) slides (for non-sandwich control conditions) were deparaffinized, H&E stained, and imaged. Next, the tissue samples were hematoxylin-destained with three HCl solution washes. The sections were then decrosslinked by incubating at 70° C. for 1 hour in TE pH 9.0. TE was removed and the tissues were incubation in 1×PBS-Tween for 15 minutes.

Individual probe oligonucleotides (e.g., a first probe oligonucleotide, a second probe oligonucleotide) of probe pairs were hybridized to a first sequence and a second sequence of an analyte (e.g., an RNA molecule), respectively, in the mouse brain tissue. The RTL probe oligonucleotides were then ligated together, thereby creating a connected probe (e.g., a ligation product) (FIG. 12A). The connected probe (e.g., a ligation product) included a capture probe binding domain. The probes were designed to hybridize to part of the mouse transcriptome (e.g., using 5000 total probe pairs) or to hybridize to each transcript in the mouse transcriptome.

After ligation of the RTL probe oligonucleotides, the connected probes were released from the tissue using various conditions: (A) sandwich process with RNase in the reagent medium; (B) sandwich process with RNAse+Proteinase K in the reagent medium; or (C) without sandwich methods (e.g., a control). For sandwich conditions A and B, the tissue mounted standard slides were aligned with a GEx slide and permeabilized in the sandwich configuration as described herein (see, e.g., FIG. 12B). For the non-sandwich condition C, the tissue mounted on the GEx slides were permeabilized directly on the GEx array. Following permeabilization, the capture probes were extended, sequencing libraries were prepared and sequenced, and the results were analyzed computationally.

Table 1 and Table 2 below show results of the mouse brain 5000 probe pair experiment.

TABLE 1

Fraction
Reads
Fraction

targeted
Mapped
Reads in
Fraction

Valid

reads
Confidently to
Spots Under
reads

Conditions
Barcodes
Valid UMIs
usable
Transcriptome
Tissue
unmapped

RNAse only
97.5%
100.0%
82.8%
92.0%
91.9%
4.8%

(30 mins)

mouse brain

RNAse +
97.6%
100.0%
86.1%
91.1%
96.4%
5.6%

Proteinase K

(30 mins)

mouse brain

RNAse +
97.0%
100.0%
83.6%
89.2%
96.0%
6.7%

Proteinase K +

Sarkosyl

(30 mins)

mouse brain

Control
97.1%
100.0%
63.8%
84.2%
77.9%
11.6%

mouse brain

(RTL but no

sandwich

methods)

TABLE 2

Median
Median

Median
Median
panel genes
panel UMI

panel genes
panel UMI
directed
counts

detected at
counts at
at 1000
at 1000

260 panel
250 reads
panel reads
panel reads

Conditions
reads per spot
per spot
per spot
per spot

RNAse only
90
123
134
201

(30 mins)

mouse brain

RNAse +
115
169
192
330

Proteinase K

(30 mins)

mouse brain

RNAse +
47
63
57
79

Proteinase K +

Sarkosyl

(30 mins)

mouse brain

Control
106
149
192
330

mouse brain

(RTL but no

sandwich

methods)

As shown in Table 1 and Table 2, the combination of RNAse+Proteinase K using sandwiching methods resulted in increased useable fraction targeted reads, increased reads in spots under tissues, and increased median panel genes and UMI counts.

Table 3 and Table 4 below show results of the captured ligated probes on mouse kidney in a setting of using probes designed to hybridize to part of the mouse transcriptome (e.g., using partial (e.g., n=5000) total probe pairs).

TABLE 3

Fraction
Reads
Fraction

targeted
Mapped
Reads in
Fraction

Valid

reads
Confidently to
Spots Under
reads

Conditions
Barcodes
Valid UMIs
usable
Transcriptome
Tissue
unmapped

RNAse only
96.3%
100.0%
79.3%
88.3%
91.7%
8.8%

(30 mins)

mouse kidney

RNAse +
97.9%
100.0%
84.8%
93.3%
92.8%
3.1%

Proteinase K

(30 mins)

mouse kidney

RNAse +
96.9%
100.0%
79.9%
91.6%
89.0%
4.9%

Proteinase K +

Sarkosyl

(30 mins)

mouse kidney

Control
96.4%
100.0%
65.3%
85.9%
78.3%
9.1%

mouse kidney

TABLE 4

Median
Median

Median
Median
panel genes
panel UMI

panel genes
panel UMI
directed
counts

detected at
counts at
at 1000
at 1000

260 panel
250 reads
panel reads
panel reads

Conditions
reads per spot
per spot
per spot
per spot

RNAse only
99
167
166
335

(30 mins)

mouse

kidney

RNAse +
127
208
283
642

Proteinase K

(30 mins)

mouse

kidney

RNAse +
79
121
129
226

Proteinase K +

Sarkosyl

(30 mins)

mouse

kidney

Control
103
164
203
411

mouse

kidney

As shown in Table 3 and Table 4, the combination of RNAse+Proteinase K using sandwiching methods resulted in increased useable fraction targeted reads, increased reads in spots under tissues, and increased median panel genes and UMI counts.

Example 2—Methods for Spatial Transcriptomics Analysis Utilizing Sandwich Process

In a non-limiting example, the sandwich process can be utilized for a downstream analytical step in spatial transcriptomics analysis workflows as described herein.

FIG. 15A shows an exemplary workflow of spatial analysis assays using fresh frozen tissue samples. For example, a tissue section is (a) fixed and stained (e.g., hematoxylin and eosin staining, fluorescent antibody staining); (b) imaged to evaluate the quality of the antibody staining; (c) destaining and unmounting the tissue section; (d) permeabilizing the tissue section with Proteinase K and sarkosyl prior to performing the sandwich process; and (e) performing a reverse transcription protocol and generating an analyte library.

FIG. 15B shows an exemplary workflow of spatial analysis assays using formalin-fixation and paraffin-embedded (FFPE) tissue samples. For example, a FFPE tissue section is (a) deparaffinized, wherein the paraffin-embedding material is removed; (b) fixed and stained (e.g., hematoxylin and eosin staining, fluorescent antibody staining); (b) imaged to evaluate the quality of the antibody staining; (c) destaining and unmounting the tissue section; (d) permeabilizing the tissue section with RNase, Proteinase K and sarkosyl prior to performing the sandwich process; and (e) performing a reverse transcription protocol and generating an analyte library.

Example 3—Detecting AAV Sequences Using RNA-Template Ligation

In a non-limiting example, an exogenous nucleic acid from an adeno-associated virus is detected in a biological sample on a first substrate. A first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide are hybridized to the AAV nucleic acid, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each include a sequence that is substantially complementary to a first sequence and a second sequence of the AAV nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain. The first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide are then ligated, thereby generating an exogenous connected probe.

The first substrate is aligned with a second substrate including an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array includes a plurality of capture probes, wherein a capture probe of the plurality of capture probes includes (i) a spatial barcode and (ii) a capture domain. Once the biological sample is aligned with at least a portion of the array, the exogenous connected probe is released from the AAV sequence and the exogenous connected probe is migrated from the biological sample to the array; and the exogenous capture probe binding domain of the exogenous connected probe hybridizes to the capture domain of the capture probe.

An analyte (e.g., endogenous nucleic acid) in the endogenous genome of the biological sample can also be analyzed, wherein a first probe oligonucleotide and a second probe oligonucleotide are hybridized to the analyte. The first probe oligonucleotide and the second probe oligonucleotide each include a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and the second probe oligonucleotide also includes a capture probe binding domain. The first probe oligonucleotide and the second probe oligonucleotide are then coupled, thereby generating a connected probe prior to aligning the first substrate with the second substrate. Once the biological sample is aligned with at least a portion of the array the connected probe is released from the analyte and the connected probe migrates from the biological sample to the array, wherein the connected probe hybridizes to the capture domain.

The use of both sets of RTL probes to detect AAV nucleic acids and endogenous nucleic acids allows the user to detect and analyze changes to exogenous and endogenous nucleic acid and/or protein expression.

Example 4—Detecting AAV Sequences Using Analyte Capture Agents

In a non-limiting example, an exogenous nucleic acid from an adeno-associated virus is detected in a biological sample on a first substrate. A first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide are hybridized to the AAV nucleic acid, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each include a sequence that is substantially complementary to a first sequence and a second sequence of the AAV nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain. The first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide are then coupled, thereby generating an exogenous connected probe.

A protein in the biological sample can also be analyzed, wherein the biological sample is contacted with a plurality of analyte capture agents. As described herein, an analyte capture agent of the plurality of analyte capture agents includes an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the protein. The capture agent barcode domain includes an analyte binding moiety barcode and a capture handle sequence. The biological sample is contacted with the plurality of analyte capture agents prior to aligning the first substrate with the second substrate. Once the biological sample is aligned with at least the portion of the array, the capture agent barcode domain is released from the protein and the capture agent barcode domain passively or actively migrates from the biological sample to the array. The capture handle sequence is then coupled to the capture domain.

The use of RTL probes to detect AAV nucleic acids and analyte capture agents to capture proteins in the biological sample allows the user to detect and analyze changes to exogenous nucleic acid and/or protein expression concurrently.

Additional Embodiments

In some embodiments, disclosed herein is a method of detecting an exogenous nucleic acid in a biological sample, the method comprising: (a) hybridizing a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide to an exogenous nucleic acid in a biological sample mounted on a first substrate, wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the exogenous nucleic acid, respectively, and wherein the second exogenous probe oligonucleotide comprises an exogenous capture probe binding domain; (b) coupling the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide, thereby generating an exogenous connected probe; (c) aligning the first substrate with a second substrate comprising an array, such that at least a portion of the biological sample is aligned with at least a portion of the array, wherein the array comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (d) when the biological sample is aligned with at least a portion of the array, (i) releasing the exogenous connected probe from the exogenous sequence and (ii) migrating the exogenous connected probe from the biological sample to the array; and (e) hybridizing the exogenous capture probe binding domain of the exogenous connected probe to the capture domain.

In some embodiments, the exogenous nucleic acid comprises viral DNA or viral RNA. In some embodiments, the exogenous nucleic acid is from an adeno-associated virus (AAV) nucleic acid. In some embodiments, the AAV is any one of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. In some embodiments, the AAV is a recombinant adeno-associated virus (rAAV).

In some embodiments, the exogenous nucleic acid is from a genome integrated nucleic acid. In some embodiments, the genome integrated nucleic acid comprises a nucleotide sequence encoding a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)—CRISPR associated (Cas) (CRISPR-Cas) system guide RNA that hybridizes with the target host sequence. In some embodiments, the genome integrated nucleic acid is from a virus that integrates into a host genome. In some embodiments, the virus that integrates into the host genome is selected from a retroviral vector, an arenavirus vector, a herpes virus vector, an Epstein-Barr Virus vector, or an adenovirus vector. In some embodiments, the retroviral vector is a lentiviral vector.

In some embodiments, the methods further include detecting one or more additional exogenous nucleic acids in the biological sample. In some embodiments, the exogenous nucleic acid comprises RNA from a conserved region of the exogenous nucleic acid sequence. In some embodiments, the exogenous nucleic acid comprises RNA from a transgene. In some embodiments, the exogenous nucleic acid comprises RNA from a reporter gene. In some embodiments, the reporter gene is selected from the group consisting of green fluorescent protein (GFP), Cerulean, tdTomato, and mCherry. In some embodiments, the reporter gene comprises GFP.

In some embodiments, the methods further include analyzing an analyte in the endogenous genome of the biological sample, wherein the biological sample is on the first substrate, the method comprising: (a) hybridizing a first probe oligonucleotide and a second probe oligonucleotide to the analyte, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, and wherein the second probe oligonucleotide comprises a capture probe binding domain; (b) coupling the first probe oligonucleotide and the second probe oligonucleotide, thereby generating a connected probe prior to aligning the first substrate with the second substrate; (c) when the biological sample is aligned with at least a portion of the array, (i) releasing the connected probe from the analyte and (ii) migrating the connected probe from the biological sample to the array; and (d) hybridizing the connected probe to the capture domain.

In some embodiments, the analyte comprises DNA or RNA.

In some embodiments, the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide are on a contiguous exogenous nucleic acid sequence; and/or the first probe oligonucleotide and the second probe oligonucleotide are on a contiguous nucleic acid sequence. In some embodiments, the first exogenous probe oligonucleotide is on the 3′ end of the contiguous exogenous nucleic acid sequence; and/or the first probe oligonucleotide is on the 3′ end of the contiguous nucleic acid sequence. In some embodiments, the second exogenous probe oligonucleotide is on the 5′ end of the contiguous exogenous nucleic acid sequence; and/or the second probe oligonucleotide is on the 5′ end of the contiguous nucleic acid sequence.

In some embodiments, the methods further include generating an extended first exogenous probe oligonucleotide, wherein the extended first exogenous probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first exogenous probe oligonucleotide and the sequence hybridized to the second exogenous probe oligonucleotide; and/or further comprising generating an extended first probe oligonucleotide, wherein the extended first probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first probe oligonucleotide and the sequence hybridized to the second probe oligonucleotide.

In some embodiments, the methods further include generating an extended second exogenous probe oligonucleotide using a polymerase, wherein the extended second exogenous probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first exogenous probe oligonucleotide and the sequence hybridized to the second exogenous probe oligonucleotide; and/or further comprising generating an extended second probe oligonucleotide using a polymerase, wherein the extended second probe oligonucleotide comprises a sequence complementary to a sequence between the sequence hybridized to the first probe oligonucleotide and the sequence hybridized to the second probe oligonucleotide.

In some embodiments, the methods further include amplifying the connected probe prior to the releasing step. In some embodiments, the amplifying comprises rolling circle amplification.

In some embodiments, the releasing step (d) comprises contacting the biological sample with a reagent medium comprising a permeabilization agent and an agent for releasing the connected probe, thereby permeabilizing the biological sample and releasing the connected probe from the analyte.

In some embodiments, the methods further include analyzing a protein in the biological sample, the method comprising: (a) prior to aligning the first substrate with the second substrate, contacting the biological sample with a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the protein, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and an capture handle sequence; (b) when the biological sample is aligned with at least the portion of the array, (i) releasing the capture agent barcode domain from the protein and (ii) passively or actively migrating the capture agent barcode domain from the biological sample to the array; and (c) coupling the capture handle sequence to the capture domain. In some embodiments, the coupling of the capture handle sequence to the capture domain comprises hybridization.

In some embodiments, the methods further include determining (i) all or a part of the capture agent barcode domain, or a complement thereof; and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the methods further include using the determined sequences of (i) and (ii) to determine the location and abundance of the protein in the biological sample. In some embodiments, the determining comprises sequencing (i) all or a part of the sequence of the capture agent barcode domain, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof.

In some embodiments, during the releasing step, a separation distance is maintained between the first substrate and the second substrate, optionally the separation distance is less than 50 microns, optionally the separation distance is between 2-25 microns, optionally the separation distance is measured in a direction orthogonal to the surface of the first substrate that supports the biological sample, and/or at least the portion of the biological sample is vertically aligned with the at least portion of the array.

In some embodiments, at least one of the first substrate and the second substrate further comprise a spacer, wherein after the first and second substrate being mounted on the support device, the spacer is disposed between the first substrate and second substrate and is configured to maintain the reagent medium within a chamber formed by the first substrate, the second substrate, and the spacer, and maintain a separation distance between the first substrate and the second substrate, the spacer positioned to at least partially surround an area on the first substrate on which the biological sample is disposed and/or the array disposed on the second substrate, wherein the area of the first substrate, the spacer, and the second substrate at least partially encloses a volume comprising the biological sample.

In some embodiments, the permeabilization agent comprises a protease. In some embodiments, the protease is selected from trypsin, pepsin, elastase, or proteinase K. In some embodiments, the reagent medium further comprises a detergent. In some embodiments, the detergent is selected from sodium dodecyl sulfate (SDS), sarkosyl, saponin, a nonionic surfactant, or a nonionic detergent. In some embodiments, the reagent medium does not comprise sodium dodcyl sulfate (SDS) or sarkosyl. In some embodiments, the reagent medium further comprises polyethylene glycol (PEG).

In some embodiments, the biological sample and the array are contacted with the reagent medium for about 1 to about 60 minutes. In some embodiments, the biological sample and the array are contacted with the reagent medium for about 30 minutes.

In some embodiments, the methods further include determining (i) all or a part of the sequence of the connected probe, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof, optionally wherein the method further comprises using the determined sequences of (i) and (ii) to determine the location and abundance of the exogenous nucleic acid in the biological sample. In some embodiments, the determining comprises sequencing (i) all or a part of the sequence of the connected probe, or a complement thereof, and (ii) the sequence of the spatial barcode, or a complement thereof. In some embodiments, the sequence of the connected probe comprises the sequence of the spatial barcode or the reverse complement thereof, and a sequence corresponding to the exogenous nucleic acid in the biological sample or reverse complement thereof.

In some embodiments, the capture probe comprises a poly(T) sequence. In some embodiments, the capture probe comprises a sequence complementary to the first probe oligonucleotide or the second probe oligonucleotide. In some embodiments, the capture probe further comprises one or more functional domains, a unique molecular identifier (UMI), a cleavage domain, and combinations thereof.

In some embodiments, also described herein are systems or kits for analyzing an analyte in a biological sample, the system or the kit comprising: (a) a support device configured to retain a first substrate and a second substrate, wherein the biological sample is placed on the first substrate, and wherein the second substrate comprises a plurality of capture probes, wherein a capture probe of the plurality of capture probes comprises: (i) a spatial barcode and (ii) a capture domain; (b) (b1) a first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide, wherein the first exogenous probe oligonucleotide and a second exogenous probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of an exogenous nucleic acid, respectively, wherein the second exogenous probe oligonucleotide comprises a capture probe binding domain, and wherein the first exogenous probe oligonucleotide and the second exogenous probe oligonucleotide are capable of being ligated together to form an exogenous connected probe; or (b2) a first probe oligonucleotide and a second probe oligonucleotide, wherein the first probe oligonucleotide and the second probe oligonucleotide each comprise a sequence that is substantially complementary to a first sequence and a second sequence of the analyte, respectively, wherein the second probe oligonucleotide comprises a capture probe binding domain, and wherein the first probe oligonucleotide and the second probe oligonucleotide are capable of being ligated together to form a connected probe; or (b3) a plurality of analyte capture agents, wherein an analyte capture agent of the plurality of analyte capture agents comprises an analyte binding moiety and a capture agent barcode domain, wherein the analyte binding moiety specifically binds to the analyte, and wherein the capture agent barcode domain comprises an analyte binding moiety barcode and an capture handle sequence; (c) a reagent medium comprising a permeabilization agent and optionally an agent for releasing the connected probe; and (d) instructions for performing the method of any one of the preceding claims.

In some embodiments, the systems or kits further include an alignment mechanism on the support device to align the first substrate and the second substrate. In some embodiments, the alignment mechanism comprises a linear actuator, wherein the first substrate comprises a first member and the second substrate comprises a second member, and optionally wherein: the linear actuator is configured to move the second member along an axis orthogonal to the plane of the first member and/or the second member, and/or the linear actuator is configured to move the first member along an axis orthogonal to the plane of the first member and/or the second member, and/or the linear actuator is configured to move the first member, the second member, or both the first member and the second member at a velocity of at least 0.1 mm/sec, and/or the linear actuator is configured to move the first member, the second member, or both the first member and the second member with an amount of force of at least 0.1 lbs.

METHODS, COMPOSITIONS, AND SYSTEMS FOR DETECTING EXOGENOUS NUCLEIC ACIDS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATIONS

Provisional Applications (1)