This application includes and incorporates by reference in its entirety a Sequence Listing XML in the required .xml format. The Sequence Listing XML file that has been electronically filed contains the information of the nucleotide and/or amino acid sequences disclosed in the patent application using the symbols and format in accordance with the requirements of 37 C.F.R. §§ 1.832 through 1.834.
The Sequence Listing XML filed herewith serves as the electronic copy required by § 1.834(b)(1).
The Sequence Listing XML is identified as follows: “KANVAS_002_SEQ_LIST.xml” (1,125 kilo bytes in size), which was created on Feb. 20, 2023.
This invention relates to methods for highly-multiplexed, rapid detection of nucleotides in samples, and constructs to be used in said methods.
Microbiota often form complex communities with each other and their environment, which can include eukaryotic cells. The spatial localization of these microbes can have effects on the ecosystem, though describing the functions of each bacterium is a challenge.
Spatial transcriptomics, a method to identify specific mRNA molecules in cells in their native biological context, can be a powerful tool but has thus far been largely developed for eukaryotic systems, leaving methods to profile the spatial properties of microbial communities untouched. To accurately and comprehensively profile the microbiome transcriptome, a method that has high target multiplexity, capable of labelling potentially millions of gene targets, can be required. The development of such a method could revolutionize our understanding of microbial community assembly and lead to new diagnostic and therapeutic applications.
In one aspect, a method for analyzing a sample, can include contacting at least one encoding probe with the sample to produce a first complex, adding at least two different DNA amplifiers to the first complex to produce a second complex, and adding emissive readout probes to the second complex. Each encoding probe can include a targeting sequence and an initiator sequence. Each DNA amplifier can include an initiator complimentary sequence and a readout sequence. Each emissive readout probe can include a label and a complimentary sequence to the readout sequence of a corresponding DNA amplifier.
In another aspect, a method for analyzing a sample can include generating a set of probes, wherein each probe includes:
In another aspect, a method for analyzing a cell can include:
In another aspect, a construct can include a targeting sequence that is complementary to a region of interest on a DNA/RNA sequence, a first initiator sequence, a second initiator sequence that is different from the first initiator sequence, a first amplifier sequence including a readout sequence on the 5′ end of the sequence, a second amplifier sequence including a readout sequence on the 3′ end of the sequence, wherein the second amplifier sequence is different from the first amplifier sequence, and an emissive readout sequence including a sequence complimentary to the readout sequence of the first and/or second amplifier sequences and a label on the 5′ and/or 3′ end of the complimentary sequence.
In another aspect, a construct can include a targeting sequence that is a region of interest on a nucleotide, a first initiator sequence, a second initiator sequence that is different from the first initiator sequence, a first amplifier sequence including a third initiator sequence, a second amplifier sequence including a fourth initiator sequence, a third amplifier sequence including a readout sequence on the 5′ end of the sequence, a fourth amplifier sequence including a readout sequence on the 3′ end of the sequence, wherein the first, second, third, and fourth amplifier sequences are different from each other, and an emissive readout sequence including a sequence complimentary to the readout sequence of the third and/or fourth amplifier sequences and a label on the 5′ and/or 3′ end of the complimentary sequence.
In another aspect, a library of constructs can include a plurality of barcoded probes, wherein each barcoded probe can include:
In another aspect, a library of constructs can include a plurality of barcoded probes, wherein each barcoded probe can include:
In another aspect, a method for analyzing a sample can include:
Other aspects, embodiments, and features as disclosed herein will be apparent from the following description, the drawings, and the claims.
It is to be appreciated that certain aspects, modes, embodiments, variations, and features of the present methods and compositions are described below in various levels of detail in order to provide a substantial understanding of the present disclosure.
Where values are described as ranges, endpoints are included. Furthermore, it will be understood that such disclosure includes the disclosure of all possible sub-ranges within such ranges, as well as specific numerical values that fall within such ranges irrespective of whether a specific numerical value or specific sub-range is expressly stated.
“5′-end” and “3′-end” refers to the directionality, e.g., the end-to-end orientation of a nucleotide polymer (e.g., DNA). The 5′-end of a polynucleotide is the end of the polynucleotide that has the fifth carbon.
The term “about,” as used herein, refers to +/−10% of a recited value.
“Complementary” refers to the topological compatibility or matching together of interacting surfaces of two nucleotides as understood by those of skill in the art. Thus, two sequences are “complementary” to one another if they are capable of hybridizing to one another to form a stable anti-parallel, double-stranded nucleic acid structure. A first nucleotide is complementary to a second nucleotide if the nucleotide sequence of the first nucleotide is substantially identical to the nucleotide sequence of the nucleotide binding partner of the second nucleotide, or if the first nucleotide can hybridize to the second nucleotide under stringent hybridization conditions. Thus, the nucleotide whose sequence is 5′-TATAC-3′ is complementary to a nucleotide whose sequence is 5′-GTATA-3′.
“Nucleotides,” “Nucleic acids,” “polynucleotide” or “oligonucleotide” refer to a polymeric-form of DNA and/or RNA (e.g., ribonucleotides, deoxyribonucleotides, or analogs thereof) of any length; e.g., a sequence of two or more ribonucleotides or deoxyribonucleotides. As used herein, the term “nucleotides” includes double- and single-stranded DNA, as well as double- and single-stranded RNA; it also includes modified and unmodified forms of a nucleotide (modifications to and of a nucleotide, for example, can include methylation, phosphorylation, and/or capping). In some embodiments, a nucleotide can be one of the following: a gene or gene fragment; genomic DNA; genomic DNA fragment; exon; intron; messenger RNA (mRNA); transfer RNA (tRNA); ribosomal RNA (rRNA); ribozyme; cDNA; recombinant nucleotide; branched nucleotide; plasmid; vector; isolated DNA of any sequence; isolated RNA of any sequence; any DNA described herein, any RNA described herein, primer or amplified copy of any of the foregoing.
In some embodiments, nucleotides can have any three-dimensional structure and may perform any function, known or unknown. The structure of nucleotides can also be referenced to by their 5′- or 3′-end or terminus, which indicates the directionality of the nucleotide sequence. Adjacent nucleotides in a single-strand of nucleotides are typically joined by a phosphodiester bond between their 3′ and 5′ carbons. However, different internucleotide linkages could also be used, such as linkages that include a methylene, phosphoramidate linkages, etc. This means that the respective 5′ and 3′ carbons can be exposed at either end of the nucleotide sequence, which may be called the 5′ and 3′ ends or termini. The 5′ and 3′ ends can also be called the phosphoryl (PO4) and hydroxyl (OH) ends, respectively, because of the chemical groups attached to those ends. The term “nucleotides” also refers to both double- and single-stranded molecules.
In some embodiments, nucleotides can include modified nucleotides, such as methylated nucleotides and nucleotide analogs (including nucleotides with non-natural bases, nucleotides with modified natural bases such as aza- or deaza-purines, etc.). If present, modifications to the nucleotide structure can be imparted before or after assembly of the nucleotide sequence.
In some embodiments, the sequence of nucleotides can be interrupted by non-nucleotide components. One or more ends of the nucleotides can be protected or otherwise modified to prevent that end from interacting in a particular way (e.g. forming a covalent bond) with other nucleotides.
In some embodiments, nucleotides can be composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); and thymine (T). Uracil (U) can also be present, for example, as a natural replacement for thymine when the nucleotide is RNA. Uracil can also be used in DNA. Thus, the term “sequence” refers to the alphabetical representation of nucleotides or any nucleic acid molecule, including natural and non-natural bases.
When used in terms of length, for example 20 nt, “nt” refers to nucleotide(s).
As used herein a “taxon” refers to a group of one or more populations of an organism or organisms. In some embodiments, a “taxon” refers to a phylum, a class, an order, a family, a genus, a species, or a train. In some embodiments, the disclosure includes providing a list of taxa of microorganisms. In some embodiments, the list of taxa of microorganisms is selected from a list of phyla, a list of classes, a list of orders, a list of families, a list of genera, or a list of species, of microorganisms.
In analysis of a sample, a species can be a target of interest. For example, a species can include a taxonomic species.
In the event of any term having an inconsistent definition between this application and a referenced document, the term is to be interpreted as defined herein.
Bacteria can form biofilms, aggregations of microbial consortia, that are encased in a complex, self-produced polymeric matrix and that adhere to biological and non-biological surfaces. Environmental microbes frequently reside in biofilm microbial communities with rich taxonomic diversity and exquisite spatial organization. Biofilms have been observed on the intestinal mucosa of colorectal cancer patients, even on tumor-free mucosa far distant from the tumors. Patients with familial polyposis also harbor colonic biofilms that include tumorigenic bacteria.
The local environment of individual microbes can have strong influences on their physiology, which in turn shapes the ecology of the community. The oral plaque microbiome has been shown to exhibit intricate spatial structure, which is thought to contribute to the metabolic interactions within the microbial community, and between the community and the surrounding environment. In other instances, biofilm formation has been shown to lead to decreased antimicrobial resistance and virulence.
Sequencing strategies have revealed extensive genomic information of microbial communities from a wide range of environments, ranging from human body sites to the global ocean, but at the expense of the spatial structure of these communities.
Imaging methods based on fluorescence in-situ hybridization (FISH) have enabled studies of the spatial organization of biofilms but suffer significant multiplexity limitations. Existing FISH strategies distinguish different taxa by conjugating each taxon-specific oligonucleotide probe with a unique fluorophore or a combination of fluorophores. The spectral overlap between commercially available fluorophores and the limited range of wavelength typically used in fluorescence imaging significantly limit the number of taxa that can be probed in a single experiment using current FISH-based strategies. The state-of-the-art method allows distinction of 15 taxa, which falls short of the diversity typically observed in natural biofilm communities.
Quantitative measurements of spatial organization in microbial communities are limited by existing image segmentation algorithms. Single cell segmentation will allow physical measurements of cell size, cell shape, cell-to-cell distance, and cellular adjacency network. Previous reports have used various coarse grained metrics to quantitatively dissect spatial organization of environmental microbial communities. However, microbes in environmental biofilms are typically densely packed, which reduces the contrast between intracellular space and cells. Furthermore, cells from different taxa typically contain different amounts of ribosome, leading to a high dynamic range of biofilm images. Both factors make single-cell segmentation challenging in images of environmental biofilms.
The FISH probes typically used in existing methods are limited in their taxonomic coverage. Due to the aforementioned multiplexity limit, most existing methods either (a) use probes for a limited number of taxa at low taxonomic levels (e.g., genus or species) or (b) use probes designed at high taxonomic levels (e.g., phylum or class). Using probes for a limited number of low level taxa risks missing many low-abundance taxa. On the other hand, high taxonomic level probes do not provide high phylogenetic resolution, and can suffer from incomplete coverage of species within the target taxon.
Accordingly, methods for detection without multiplexity limit and other constraints from the art are needed.
Deciphering what each cell within such communities is doing through gene expression and metabolic signatures represents the next frontier in understanding and interpreting microbial systems, with wide ranging applicability from clinical to agricultural domains (e.g., agricultural, clinical, pharmaceutical, biotechnological, medical, scientific, biotherapeutic, wastewater management domains). Here, we describe a novel technology for spatially-resolved multiplexed detection of gene expression within cells, which we refer to as Hybridization Chain Reaction (HCR) based High Phylogenetic Resolution (HiPR-Cycle). HiPR-Cycle uses HCR to extend DNA polymers upon encountering ‘initiator’ sequences attached to DNA probes which recognize target genes of interest within cells in situ. The HCR products that form at the site of detected transcripts bear High Phylogenetic Resolution microbiome mapping by Fluorescence in situ Hybridization (HiPR-FISH) readout probe binding sites, in numbers proportional to the size of the HCR product. Barcoding these products with, for example, 10 unique readout probes can be used to distinguish over 1000 distinct targets. Moreover, by physically amplifying the fluorescent signal of encoding probe binding events through HCR, the methods described herein are able to detect lowly expressed genes otherwise overlooked.
Hybridization Chain Reaction (HCR) is a method for the triggered hybridization of nucleic acid molecules starting from metastable hairpin monomers or other metastable nucleic acid structures. See, for example, Dirks, R. and Pierce, N. Proc. Natl. Acad. Sci. USA 101(43): 15275-15278 (2004), and U.S. Pat. Nos. 7,632,641; 8,105,778, 8,507,204, 10,450,599, and PCT Patent Publication WO 2021/221789, filed Mar. 4, 2021. The contents of the aforementioned disclosures are each incorporated herein by reference in their entireties.
High Phylogenetic Resolution microbiome mapping by Fluorescence in situ Hybridization (HiPR-FISH), is a versatile technology that uses binary encoding, spectral imaging, and machine learning based decoding to create micron-scale maps of the locations and identities of hundreds of microbial species in complex communities. See, for example, Shi, H. et al. “Highly multiplexed spatial mapping of microbial communities.” Nature vol. 588, 7839 (2020): 676-681, PCT Patent Publication WO 2019/173555, filed Mar. 7, 2019; PCT Patent Application No. PCT/US2022/080355, filed on Nov. 24, 2022, and U.S. application Ser. No. 18/058,171, filed on Nov. 24, 2022. The contents of the aforementioned disclosures are each incorporated herein by reference in their entireties.
HiPR-Cycle has three primary steps: 1) encoding probe hybridization, 2) hybridization chain reaction (HCR)-based amplification, and 3) readout probe hybridization.
HiPR-Cycle can be described as follows, for example, when used to identified microbial samples:
Fixed microbial cultures are pipetted onto a glass microscope slide and allowed to dry. The cell walls of microbes can then be digested by adding lysozyme to the plated sample. To prepare the cells for encoding probe hybridization, encoding buffer (with no DNA probes) can be added to the plated sample. The pre-encoding buffer can be aspirated and new encoding buffer containing HiPR-Cycle encoding probes specific to target transcripts can be added. These encoding probes possess specific initiator sequence(s). Samples are incubated with the encoding probes. Residual encoding probes or those binding to off-target sites can be removed with 37° C. washes. At this point, samples are ready for amplification. Each reaction requires the presence of two distinct DNA amplifier species, which will cross react to form long chains in the presence of an initiator sequence. Amplification reactions can be conducted for 2 to 24 hours at room temperature. After amplification, samples are washed and a readout hybridization can be conducted by adding emissive readout probes. Once done, the sample can be dried and, once dried, mountant is applied to the sample. At this stage the sample can be imaged via microscopy.
As disclosed herein, a variety of nucleotide probes may be used to analyze a sample (e.g., by determining one or more nucleotides present in the sample).
Accordingly, a method for analyzing a sample can include:
In some embodiments, more than one type of probe set (e.g., encoding probe, DNA amplifiers, and emissive readout probes) may be introduced to a sample. For example, there may be at least 2, at least 5, at least 10, at least 25, at least 50, at least 75, at least 100, at least 300, at least 1,000, at least 3,000, at least 10,000, or at least 30,000 distinguishable probe sets that are introduced to a sample. In some embodiments, the distinct probes are introduced simultaneously. In some embodiments, the distinct probes are introduced sequentially.
Encoding Probe Hybridization
In the methods described herein for analyzing a sample, the method can include contacting at least one encoding probe with the sample to produce a first complex, wherein each encoding probe includes a targeting sequence and an initiator sequence. This step may also be referred to as the “encoding probe hybridization” step. In here, at least one encoding probe is contacted with the sample to produce a first complex. The first complex can include the targeting sequence of the encoding probe hybridized to the nucleic acid target sequence (see, for example, step 2 of
In some embodiments, contacting the encoding probes with the sample is contacting the encoding probes with at least one nucleotide sequence of the sample. In some embodiments, contacting the encoding probes with the sample is hybridizing the encoding probe (e.g., via the targeting sequence present in the encoding probe) with a target sequence present in the sample.
In some embodiments, in order to contact the encoding probes with the sample, the sample can be digested or lysed so as to allow the encoding probes (and other probes described herein) to contact with the target sequence.
In some embodiments, to contact the at least one encoding probe with the sample to produce a first complex, encoding buffer is added to the sample. In some embodiments, the encoding buffer can include a denaturing/deionizing agent, a salt buffer, a detergent, a polyanionic polymer, a blocking agent, acids, or combinations thereof. In some embodiments, the encoding buffer can include more than one type of agent, for example, the encoding buffer can include two or more polyanionic polymers and/or two or more blocking agents. In some embodiments, the encoding buffer can include a denaturing/deionizing agent, a salt buffer, a detergent, two polyanionic polymers, two blocking agents, and an acid.
In some embodiments, the encoding buffer can include a denaturing/deionizing agent. In some embodiments, the denaturing/deionizing agent can be formamide, ethylene carbonate, or urea. In some embodiments, the encoding buffer can include about 10% (v/v) to about 50% (v/v), about 15% (v/v) to about 45% (v/v), about 20% (v/v) to about 40% (v/v), about 25% (v/v) to about 35% (v/v), about 10% (v/v), 15% (v/v), 20% (v/v), 25% (v/v), or 30% (v/v) of a denaturing/deionizing agent (e.g., formamide).
In some embodiments, the encoding buffer can include a salt buffer. In some embodiments, the salt buffer is saline sodium citrate (SSC), NaCl, or MgCl2. In some embodiments, the encoding buffer can include about 2× to about 20×, about 5× to about 10×, or about 5× of a salt buffer (e.g., saline sodium citrate (SSC)).
In some embodiments, the encoding buffer can include at least one polyanionic polymer. In some embodiments, the encoding buffer can include one polyanionic polymer. In some embodiments, the encoding buffer can include two polyanionic polymers. In some embodiments, the polyanionic polymer can be dextran sulfate, heparin, or polyglutamic acid. In some embodiments, the encoding buffer can include about 2.5% (v/v) to about 25% (v/v), about 5% (v/v) to about 15% (v/v), about 7.5% (v/v) to about 12.5% (v/v), about 5% (v/v), or about 10% (v/v) of a polyanionic polymer (e.g., dextran sulfate). In some embodiments, the encoding buffer can include about 20 μg/mL to about 80 μg/mL, about 30 μg/mL to about 70 μg/mL, about 40 μg/mL to about 60 μg/mL, or about 50 μg/mL of a polyanionic polymer (e.g., heparin).
In some embodiments, the encoding buffer can include a detergent. In some embodiments, the detergent can be Tween 20, Tween 80, sodium dodecyl sulfate (SDS), Triton X-100, Triton X-114, NP-40, Brij-35, Brij-58. N-Dodecyl-beta-maltoside, Octyl-beta-glucoside, octylthioglucoside (OTG). In some embodiments, the encoding buffer can include about 0.01% (v/v) to about 1.0% (v/v), about 0.05% (v/v) to about 0.5% (v/v), or about 0.1% (v/v), or about 0.05% (v/v) of detergent (e.g., Tween 20).
In some embodiments, the encoding buffer can include an acid. In these embodiments, the acid lowers the pH of the buffer. In some embodiments, the acid can be citric acid. In some embodiments, the encoding buffer can include about 1 mM to about 30 mM, about 5 mM to about 15 mM, about 5 mM to about 10 mM, about 7 mM to about 10 mM, or about 9 mM of an acid (e.g., citric acid).
In some embodiments, the encoding buffer can include at least one blocking agent. In some embodiments, the encoding buffer can include one blocking agent. In some embodiments, the blocking agents can be Denhardt's solution, bovine serum albumin (BSA), salmon sperm DNA, Ficoll, polyvinyl pyrrolidone (PVP), E. coli tRNA, casein solution, or random hexamers. In some embodiments, the encoding buffer can include about 0.1× to about 10×, about 0.5× to about 5×, about 1× to about 2×, or about 1× of a blocking agent (e.g., Denhardt's solution).
In some embodiments, the encoding buffer can include formamide, SSC, dextran sulfate, Tween 20, citric acid (pH 6), heparin, and Denhardt's solution. In some embodiments citric acid and/or heparin can be omitted from the encoding buffer composition. In some embodiments, the encoding buffer can include 30% formamide, 5×SSC, 10% dextran sulfate, 0.1% Tween 20, 9 mM citric acid (pH 6), 50 μg/mL heparin and 1×Denhardt's solution.
In some embodiments, the encoding buffer can include SSC, dextran sulfate, ethylene carbonate, SDS, and Denhardt's solution. In some embodiments, the encoding buffer can include 2×SSC, 10% dextran sulfate, 10% ethylene carbonate, 0.01% SDS, and 5×Denhardt's solution.
Amplification
Following the hybridization of the encoding probe with the target sequence to form a first complex, at least two different DNA amplifiers are added to the first complex to produce a second complex, wherein each DNA amplifier can include an initiator complimentary sequence and a readout sequence. In some embodiments, this step may be referred to as the “amplification” step. In here, each amplification step/reaction requires the presence of two different DNA amplifiers, which cross react to form long nucleotide chains in the presence of an initiator sequence (see, for example, steps 3 and 4 in
In some embodiments, prior to adding the two DNA amplifiers to the first complex, each DNA amplifier is briefly heated (e.g., at 95° C. for 2 minutes) to denature any unwanted structure, followed by a cooling period (e.g., to room temperature) where the DNA amplifier (e.g., hairpin structure) reforms.
In order to form the second complex (e.g., perform amplification step), amplification buffer is added to the sample. In some embodiments, the amplification buffer can include a salt buffer, a detergent a polyanionic polymer, a denaturing/deionizing reagent, or combinations thereof. In some embodiments, the amplification buffer can include a salt buffer, a detergent a polyanionic polymer, and a denaturing/deionizing reagent.
In some embodiments, the amplification buffer can include a salt buffer. In some embodiments, the salt buffer is saline sodium citrate (SSC), NaCl, or MgCl2. In some embodiments, the amplification buffer can include about 2× to about 20×, about 5× to about 10×, or about 5× of a salt buffer (e.g., saline sodium citrate (SSC)).
In some embodiments, the amplification buffer can include a detergent. In some embodiments, the detergent can be Tween 20, Tween 80, sodium dodecyl sulfate (SDS), Triton X-100, Triton X-114, NP-40, Brij-35, Brij-58. N-Dodecyl-beta-maltoside, Octyl-beta-glucoside, octylthioglucoside (OTG). In some embodiments, the amplification buffer can include about 0.01% (v/v) to about 1.0% (v/v), about 0.05% (v/v) to about 0.5% (v/v), or about 0.1% (v/v), or about 0.05% (v/v) of detergent (e.g., Tween 20).
In some embodiments, the amplification buffer can include at least one polyanionic polymer. In some embodiments, the amplification buffer can include one polyanionic polymer. In some embodiments, the amplification buffer can include two polyanionic polymers. In some embodiments, the polyanionic polymer can be dextran sulfate, heparin, or polyglutamic acid. In some embodiments, the amplification buffer can include about 2.5% (v/v) to about 25% (v/v), about 5% (v/v) to about 15% (v/v), about 7.5% (v/v) to about 12.5% (v/v), about 5% (v/v), or about 10% (v/v) of a polyanionic polymer (e.g., dextran sulfate). In some embodiments, the amplification buffer can include about 20 μg/mL to about 80 μg/mL, about 30 μg/mL to about 70 μg/mL, about 40 μg/mL to about 60 μg/mL, or about 50 μg/mL of a polyanionic polymer (e.g., heparin).
In some embodiments, the amplification buffer can include a denaturing/deionizing agent. In some embodiments, the denaturing/deionizing agent can be formamide, ethylene carbonate, or urea. In some embodiments, the amplification buffer can include about 10% (v/v) to about 50% (v/v), about 15% (v/v) to about 45% (v/v), about 20% (v/v) to about 40% (v/v), about 25% (v/v) to about 35% (v/v), about 10% (v/v), 15% (v/v), 20% (v/v), 25% (v/v), or 30% (v/v) of a denaturing/deionizing agent (e.g., formamide).
In some embodiments, the amplification buffer can include SSC, dextran sulfate, and Tween 20. In some embodiments, the amplification buffer can include 5×SSC, 10% dextran sulfate, and 0.1% Tween 20.
In some embodiments, the amplification reaction can be conducted at about 4° C. to about 37° C., or at room temperature, at 4° C. or at 37° C., depending on the properties of the amplifier probes. In some embodiments, the amplification reaction can be conducted for about 30 minutes to about 24 hours, or about 2 hours to about 12 hours, or about 2 hours to about 5 hours, or about 2 hours to about 4 hours, about 2 hours to about 3 hours, or about 3 hours, or about 2 hours.
In some embodiments, after the amplification reaction is completed, a washing step can be performed. In some embodiments, the washing step can be with a washing buffer comprising about 2×, 3×, 4×, or 5×SSCT (2×SSC+0.1% Tween 20). In some embodiments, the washing step can be conducted at about room temperature to about 48° C.
In some embodiments, contacting at least one encoding probe with the sample to produce a first complex and adding at least two different DNA amplifiers to the first complex to produce a second complex are performed at the same time. In some embodiments, the amplification step can be performed simultaneously with the readout probe hybridization step.
Readout Probe Hybridization
After the amplification step is complete, emissive readout probes are added to the second complex, wherein each emissive readout probe can include a label and a complimentary sequence to the readout sequence of a corresponding DNA amplifier. In some embodiments, this step may be referred to as the “readout probe hybridization” step. In here, the emissive readout probes hybridize to their complementary sequences present in the second complex (see, for example, step 5 of
In some embodiments, the readout probes are added so they achieve a final concentration of about 10 nM to about 20 μM, or about 10 nM to about 10 μM, or about 100 nM to about 1 μM, about 200 nM to about 500 nM, or about 200 nM, about 300 nM, about 400 nM, or about 500 nM for each readout probe. In some embodiments, the readout probes are added so they achieve a final concentration of about 400 nM.
In some embodiments, to hybridize the readout probes to the second complex, readout buffer is added to the sample. In some embodiments, the readout buffer can include a denaturing/deionizing agent, a salt buffer, a detergent, a polyanionic polymer, a blocking agent, or combinations thereof. In some embodiments, the readout buffer includes more than one type of agent, for example, the readout buffer can include two or more polyanionic polymers and/or two or more blocking agents.
In some embodiments, the readout buffer can include a denaturing/deionizing agent. In some embodiments, the denaturing/deionizing agent can be formamide, ethylene carbonate, or urea. In some embodiments, the readout buffer can include about 10% (v/v) to about 50% (v/v), about 15% (v/v) to about 45% (v/v), about 20% (v/v) to about 40% (v/v), about 25% (v/v) to about 35% (v/v), about 10% (v/v), 15% (v/v), 20% (v/v), 25% (v/v), or 30% (v/v) of a denaturing/deionizing agent (e.g., formamide or ethylene carbonate).
In some embodiments, the readout buffer can include a salt buffer. In some embodiments, the salt buffer is saline sodium citrate (SSC), NaCl, or MgCl2. In some embodiments, the readout buffer can include about 2× to about 20×, about 5× to about 10×, about 5×, or about 2× of a salt buffer (e.g., saline sodium citrate (SSC)).
In some embodiments, the readout buffer can include at least one polyanionic polymer. In some embodiments, the readout buffer can include one polyanionic polymer. In some embodiments, the polyanionic polymer can be dextran sulfate, heparin, or polyglutamic acid. In some embodiments, the readout buffer can include about 2.5% (v/v) to about 25% (v/v), about 5% (v/v) to about 15% (v/v), about 7.5% (v/v) to about 12.5% (v/v), about 5% (v/v), or about 10% (v/v) of a poly anionic polymer (e.g., dextran sulfate).
In some embodiments, the readout buffer can include a detergent. In some embodiments, the detergent can be Tween 20, Tween 80, sodium dodecyl sulfate (SDS), Triton X-100, Triton X-114, NP-40, Brij-35, Brij-58. N-Dodecyl-beta-maltoside, Octyl-beta-glucoside, octylthioglucoside (OTG). In some embodiments, the encoding buffer can include about 0.005 (v/v) to about 1.0% (v/v), about 0.01% (v/v) to about 0.05% (v/v), about 0.05% (v/v) to about 0.5% (v/v), or about 0.1% (v/v), about 0.01% (v/v), or about 0.05% (v/v) of detergent (e.g., SDS).
In some embodiments, the readout buffer can include at least one blocking agent. In some embodiments, the readout buffer can include one blocking agent. In some embodiments, the blocking agents can be Denhardt's solution, bovine serum albumin (BSA), salmon sperm DNA, Ficoll, polyvinyl pyrrolidone (PVP), E. coli tRNA, casein solution, or random hexamers. In some embodiments, the readout buffer can include about 0.1× to about 10×, about 0.5× to about 5×, about 1× to about 2×, or about 1× of a blocking agent (e.g., Denhardt's solution).
In some embodiments, the readout buffer can include SSC, Denhardt's solution, dextran sulfate, ethylene carbonate, and SDS. In some embodiments, the readout buffer can include 2×SSC, 5×Denhardt's solution, 10% (v/v) dextran sulfate, 10% (v/v) ethylene carbonate, and 0.01% (v/v) SDS.
In some embodiments, the readout buffer can include SSC, Denhardt's solution, dextran sulfate, formamide, and SDS. In some embodiments, the readout buffer can include 2×SSC, 5×Denhardt's solution, 10% (v/v) dextran sulfate, 10% (v/v) formamide, and 0.01% (v/v) SDS.
In some embodiments, the readout hybridization reaction can be conducted at about 4° C. to about 37° C., or at room temperature, at 4° C. or at 37° C., depending on the properties of the amplifier probes In some embodiments, the readout hybridization reaction can be conducted for about 2 hours to about 24 hours, or about 2 hours to about 12 hours, or about 2 hours to about 5 hours, or about 2 hours to about 4 hours, about 2 hours to about 3 hours, or about 3 hours, or about 2 hours.
In some embodiments, after each reaction and before proceeding to the next one, the samples or probes are washed with a “wash buffer.”
In some embodiments, the wash buffer can include a denaturing/deionizing agent, a salt buffer, a detergent, a polyanionic polymer, acids, a pH stabilizer, a chelating agent, or combinations thereof. In some embodiments, the wash buffer can include more than one type of agent, for example, the wash buffer can include two or more detergents. In some embodiments, the wash buffer can include a denaturing/deionizing agent, a salt buffer, a detergent, a polyanionic polymer, and an acid. In some embodiments, the wash buffer can include a salt buffer and a detergent. In some embodiments, the wash buffer can include a salt buffer, a pH stabilizer, and a chelating agent.
In some embodiments, the wash buffer can include a denaturing/deionizing agent. In some embodiments, the denaturing/deionizing agent can be formamide, ethylene carbonate, or urea. In some embodiments, the wash buffer can include about 10% (v/v) to about 50% (v/v), about 15% (v/v) to about 45% (v/v), about 20% (v/v) to about 40% (v/v), about 25% (v/v) to about 35% (v/v), about 10% (v/v), 15% (v/v), 20% (v/v), 25% (v/v), or 30% (v/v) of a denaturing/deionizing agent (e.g., formamide).
In some embodiments, the wash buffer can include a salt buffer. In some embodiments, the salt buffer is saline sodium citrate (SSC), NaCl, or MgCl2. In some embodiments, the wash buffer can include about 2× to about 20×, about 5× to about 10×, or about 5× of a salt buffer (e.g., saline sodium citrate (SSC)).
In some embodiments, the wash buffer can include a polyanionic polymer. In some embodiments, the polyanionic polymer can be dextran sulfate, heparin, or polyglutamic acid. In some embodiments, the wash buffer can include about 2.5% (v/v) to about 25% (v/v), about 5% (v/v) to about 15% (v/v), about 7.5% (v/v) to about 12.5% (v/v), about 5% (v/v), or about 10% (v/v) of a polyanionic polymer (e.g., dextran sulfate). In some embodiments, the wash buffer can include about 20 μg/mL to about 80 μg/mL, about 30 μg/mL to about 70 μg/mL, about 40 μg/mL to about 60 μg/mL, or about 50 μg/mL of a polyanionic polymer (e.g., heparin).
In some embodiments, the wash buffer can include a detergent. In some embodiments, the detergent can be Tween 20, Tween 80, sodium dodecyl sulfate (SDS), Triton X-100, Triton X-114, NP-40, Brij-35, Brij-58. N-Dodecyl-beta-maltoside, Octyl-beta-glucoside, octylthioglucoside (OTG). In some embodiments, the wash buffer can include about 0.01% (v/v) to about 1.0% (v/v), about 0.05% (v/v) to about 0.5% (v/v), or about 0.1% (v/v), or about 0.05% (v/v) of detergent (e.g., Tween 20).
In some embodiments, the wash buffer can include an acid. In these embodiments, the acid lowers the pH of the buffer. In some embodiments, the acid can be citric acid. In some embodiments, the wash buffer can include about 1 mM to about 30 mM, about 5 mM to about 15 mM, about 5 mM to about 10 mM, about 7 mM to about 10 mM, or about 9 mM of an acid (e.g., citric acid).
In some embodiments, the wash buffer can include a pH stabilizer. In some embodiments, the pH stabilizer can be at least one of tris-HCl, citric acid, SSC, 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), sucrose/EDTA/Tris-HCl (SET), potassium phosphate, tris(hydroxymethyl)methylamino]propanesulfonic acid (TAPS), NaOH, 3-(N-morpholino)propanesulfonic acid (MOPS), Tricine, Bicine, sodium pyrophosphate, piperazine-N,N′-bis(2-ethanesulfonic acid) (PIPES), SSPE. In some embodiments, the pH stabilizer can be tris-HCl. In some embodiments, the wash buffer can include about 5 mM to about 30 mM, about 10 mM to about 20 mM, about 10 mM, or about 20 mM of a pH stabilizer (e.g., tris-HCl).
In some embodiments, the wash buffer can include a chelating agent. In some embodiments, the chelating agent is at least one of EDTA, Ethylene glycol tetraacetic acid (EGTA), Salicylic acid, Triethanolamine (TEA), or Dimercaptopropanol. In some embodiments, the chelating agent is EDTA. In some embodiments, the was buffer can include about 1 mM to about 10 mM, about 2 mM to about 5 mM, or about 5 mM of a chelating agent (e.g., EDTA).
In some embodiments, the wash buffer can include formamide, SSC, Tween 20, citric acid (pH 6), and heparin. In some embodiments citric acid and/or heparin can be omitted from the wash buffer composition. In some embodiments, the wash buffer can include 30% formamide, 5×SSC, 0.1% Tween 20, optional 9 mM citric acid (pH 6), and optional 50 μg/mL heparin.
In some embodiments, the wash buffer can include SSC and Tween 20. In some embodiments, the wash buffer can include 5×SSC and 0.1% Tween 20. In some embodiments, the wash buffer can include NaCl, tris-HCl, and EDTA. In some embodiments, the wash buffer can include 215 mM NaCl, 20 mM tris-EDTA, and 5 mM EDTA.
In another aspect, a method for analyzing a sample can include:
Sample
In some embodiments, the sample is at least one of a cell, a cell suspension, a tissue biopsy, a tissue specimen, urine, stool, blood, serum, plasma, bone biopsies, bone marrow, respiratory specimens, sputum, induced sputum, tracheal aspirates, bronchoalveolar lavage fluid, sweat, saliva, tears, ocular fluid, cerebral spinal fluid, pericardial fluid, pleural fluid, peritoneal fluid, placenta, amnion, pus, nasal swabs, nasopharyngeal swabs, oropharyngeal swabs, ocular swabs, skin swabs, wound swabs, mucosal swabs, buccal swabs, vaginal swabs, vulvar swabs, nails, nail scrapings, hair follicles, corneal scrapings, gavage fluids, gargle fluids, abscess fluids, wastewater, or plant biopsies.
In some embodiments, the sample is a cell. In some embodiments, the cell is a bacterial cell. In some embodiments, the cell is a eukaryotic cell. In some embodiments the eukaryotic cell is a unicellular organism including protozoa, chromista, algae, or fungi. In some embodiments the eukaryotic cell is part of a multicellular organism from chromista, plantae, fungi, or animalia. In some embodiments the sample is a tissue composed of cells. In some embodiments the cell contains foreign DNA/RNA from viruses, plasmids, and bacteria.
In some embodiments, the sample can include a plurality of cells. In some embodiments, each cell in the plurality of cells can include a specific targeting sequence, which may or may not be the same from the other targeting sequences.
In some embodiments, the sample is a human oral microbiome sample. In some embodiments, the sample is a whole organism.
In some embodiments, the sample is obtained from a patient diagnosed with, or suspected to be suffering from an infection, disease, or disorder. In some embodiments, the patient has been diagnosed with, or is suspected to be suffering from a bacterial, viral, fungal, or parasitic infection. In some embodiments, the infection includes, but is not limited to, Acute Flaccid Myelitis, Anaplasmosis, Anthrax, Babesiosis, Botulism, Brucellosis, Campylobacteriosis, Carbapenem-resistant Infection (CRE/CRPA), Chancroid, Chickenpox, Chikungunya Virus Infection (Chikungunya), Chlamydia, Ciguatera (Harmful Algae Blooms (HABs)), Clostridium difficile Infection, Clostridium perfringens (Epsilon Toxin), Coccidioidomycosis fungal infection (Valley fever), COVID-19 (Coronavirus Disease 2019), Creutzfeldt-Jacob Disease, transmissible spongiform encephalopathy (CJD), Cryptosporidiosis (Crypto), Cyclosporiasis, Dengue, 1,2,3,4 (Dengue Fever), Diphtheria, E. coli infection, Shiga toxin-producing (STEC), Eastern Equine Encephalitis (EEE), Ebola Hemorrhagic Fever (Ebola), Ehrlichiosis, Encephalitis, Arboviral or parainfectious, Enterovirus Infection, D68 (EV-D68), Enterovirus Infection, Non-Polio (Non-Polio Enterovirus), Giardiasis (Giardia), Glanders, Gonococcal Infection (Gonorrhea), Granuloma inguinale, Haemophilus Influenza disease, Type B (Hib or H-flu), Hantavirus Pulmonary Syndrome (HPS), Hemolytic Uremic Syndrome (HUS), Hepatitis (A, B, C, D, and/or E), Herpes Herpes Zoster, zoster VZV (Shingles), Histoplasmosis infection (Histoplasmosis), Human Immunodeficiency Virus/AIDS (HIV/AIDS), Human Papillomavirus (HPV), Influenza (Flu), Lead Poisoning, Legionellosis (Legionnaires Disease), Leishmaniasis, Leprosy (Hansens Disease), Leptospirosis, Listeriosis (Listeria), Lyme Disease, Lymphogranuloma venereum infection (LGV), Malaria, Measles, Melioidosis, Meningitis, Viral (Meningitis, viral), Meningococcal Disease, Bacterial (Meningitis, bacterial), Middle East Respiratory Syndrome Coronavirus (MERS-CoV), Mononucleosis, Multisystem Inflammatory Syndrome in Children (MIS-C), Mumps, Norovirus, Paralytic Shellfish Poisoning (Paralytic Shellfish Poisoning, Ciguatera), Pediculosis (Lice, Head and Body Lice), Pelvic Inflammatory Disease (PID), Pertussis (Whooping Cough), Plague; Bubonic, Septicemic, Pneumonic (Plague), Pneumococcal Disease (Pneumonia), Poliomyelitis (Polio), Powassan, Psittacosis (Parrot Fever), Phthiriasis (Crabs; Pubic Lice Infestation), Pustular Rash diseases (Small pox, monkeypox, cowpox), Q-Fever, Rabies, Ricin Poisoning, Rickettsiosis (Rocky Mountain Spotted Fever), Rubella, Salmonellosis gastroenteritis (Salmonella), Scabies Infestation (Scabies), Scombroid, Septic Shock (Sepsis), Severe Acute Respiratory Syndrome (SARS), Shigellosis gastroenteritis (Shigella), Smallpox, Staphylococcal Infection, Methicillin-resistant (MRSA), Staphylococcal Food Poisoning, Enterotoxin-B Poisoning (Staph Food Poisoning), Staphylococcal Infection, Vancomycin Intermediate (VISA), Staphylococcal Infection, Vancomycin Resistant (VRSA), Streptococcal Disease, Group A (invasive) (Strep A (invasive)), Streptococcal Disease, Group B (Strep-B), Streptococcal Toxic-Shock Syndrome, STSS, Toxic Shock (STSS, TSS), Syphilis, primary, secondary, early latent, late latent, congenital, Tetanus, Toxoplasmosis, Trichomoniasis (Trichomonas infection), Trichinosis Infection (Trichinosis), Tuberculosis (Latent) (LTBI), Tuberculosis (TB), Tularemia (Rabbit fever), Typhus, Typhoid Fever, Group D, Vaginosis, bacterial (Yeast Infection), Vaping-Associated Lung Injury (e-Cigarette Associated Lung Injury), Varicella (Chickenpox), Vibrio cholerae (Cholera), Vibriosis (Vibrio), Viral Hemorrhagic Fever (Ebola, Lassa, Marburg), West Nile Virus, Yellow Fever, Yersenia (Yersinia), or Zika Virus Infection (Zika).
In some embodiments, when the sample is obtained from a patient, the patient has been diagnosed with, or is suspected to be suffering from an infection caused by a bacterium selected from the group consisting of: Acinetobacter, Actinomyces, Aerococcus, Bacteroides, Bartonella, Brucella, Bordetella, Burkholderia, Campylobacter, Chlamydia, Citrobacter, Clostridium, Corynebacterium, Edwardsiella, Elizabethkingia, Enterobacter, Enterococcus, Escherichia, Fusobacterium, Haemophilus, Helicobacter, Klebsiella, Legionella, Leptospira, Listeria, Morganella, Mycobacterium, Mycoplasma, Neisseria, Pantoea, Prevotella, Proteus, Providencia, Pseudomonas, Raoultella, Salmonella, Serratia, Shigella, Staphylococcus, Stenotrophomonas, Streptococcus, Ureaplasma, and Vibrio.
In some embodiments, when the sample is obtained from a patient, the patient has been diagnosed with, or is suspected to be suffering from an infection caused by a virus selected from the group consisting of: bacteriophage, RNA bacteriophage (e.g., MS2, AP205, PP7 and Qβ), Infectious Haematopoietic Necrosis Virus, Parvovirus, Herpes Simplex Virus, Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Measles virus, Mumps virus, Rubella virus, HIV, Influenza virus, Rhinovirus, Rotavirus A, Rotavirus B, Rotavirus C, Respiratory Syncytial Virus (RSV), Varicella zoster, and Poliovirus, Norovirus, Zika virus, Denge Virus, Rabies Virus, Newcastle Disease Virus, and White Spot Syndrome Virus.
In some embodiments, when the sample is obtained from a patient, the patient has been diagnosed with, or is suspected to be suffering from an infection caused by a parasite selected from the group consisting of: Plasmodium, Trypanosoma, Toxoplasma, Giardia, Leishmania, Cryptosporidium, helminthic parasites: Trichuris spp., Enterobius spp., Ascaris spp., Ancylostoma spp. and Necatro spp., Strongyloides spp., Dracunculus spp., Onchocerca spp. and Wuchereria spp., Taenia spp., Echinococcus spp., and Diphyllobothrium spp., Fasciola spp., and Schistosoma spp.
Encoding Probes
Encoding probes are probes that bind directly to a target or targeting sequence and contain either 1 or 2 branches extending away from the hybridization site. The branches can either correspond to the readout sequences, initiator sequences, and/or sequences that comprise at least one site for secondary hybridization events. Encoding probes, for example, are designed to target bacterial ribosomal RNA (rRNA) and messenger RNA (mRNA) targets.
For example, rRNA-probes can contain (5′ to 3′):
mRNA-probes contain (5′ to 3′):
In some embodiments, each encoding probe can include a targeting sequence and at least one sequence that comprise at least one site for secondary hybridization events. In some embodiments, each encoding probe can include a targeting sequence and at least one sequence that comprises multiple (e.g., two or more) sites for secondary hybridization events. In some embodiments, each encoding probe can include a targeting sequence and at least one initiator sequence. In some embodiments, each encoding probe can include a targeting sequence and an initiator sequence.
Primer Sequences
In some embodiments, the primer sequence can include about 10 to about 30, about 15 to about 25, about 18 to about 23, about 15, 16, 17, 18, 19, 20, 21, 22, or 23 nucleotides long.
Targeting Sequence
In some embodiments, the targeting sequence targets at least one of messenger RNA (mRNA), micro RNA (miRNA), long non coding RNA (lncRNA), ribosomal RNA (rRNA), small interfering RNA (siRNA), transfer RNA (tRNA), Crispr RNA (crRNA), trans-activating cirspr RNA (tracrRNA), mitochondria RNA, Intronic RNA, viral mRNA, viral genomic RNA, environmental RNA, double-stranded RNA (dsRNA), small nuclear RNA (snRNA), small nucleolar (snoRNA), piwi-interacting RNA (piRNA), genomic DNA, synthetic DNA, DNA, plasmid DNA, a plasmid, viral DNA, retroviral DNA, environmental DNA, extracellular DNA, a protein, a small molecule, or an antigenic target. In some embodiments, the target is mRNA. In some embodiments, the target is rRNA. In some embodiments, the target is mRNA and rRNA.
In some embodiments, the targeting sequence of the encoding probe is substantially complementary to a specific target sequence. By “substantially complementary” it is meant that the nucleic acid fragment is capable of hybridizing to at least one nucleic acid strand or duplex even if less than all nucleobases do not base pair with a counterpart nucleobase. In some embodiments, a “substantially complementary” nucleic add contains at least one sequence in which about 70%, about 71%, about 72%, about 73%, about 74%, about 75%, about 76%, about 77%, about 77%, 8%, about 79%, about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, to about 100%, and any range therein, of the nucleobase sequence is capable of basepairing with at least one single or double stranded nucleic acid molecule during hybridization.
In some embodiments, the targeting sequence is designed to have a predicted melting temperature of between about 55° C. and about 65° C. In some embodiments, the predicted melting temperature of the targeting sequence is 55° C., 56° C., 57° C., 58° C., 59° C., 60° C., 61° C., 62° C., 63° C., 64° C. or 65° C. In some embodiments, the targeting sequence can have a GC content of about 55%, 60%, 65% or 70%.
In some embodiments, the targeting sequence can include about 10 to about 35, about 15 to about 30, about 18 to about 30, about 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 nucleotides long.
In some embodiments, the targeting sequence of an encoding probe is designed using publicly-available sequence data. In some embodiments, the targeting sequence of an encoding probe design is designed using custom catalogues of the target/sample. In some embodiments, the targeting sequence of an encoding probe is designed using a database that is relevant for a system. In a specific embodiment, the system is the gut microbiome. In some embodiments, the targeting sequence of an encoding probe is designed using a database that is relevant for a disease or infection.
Initiator Sequence
In some embodiments, the encoding probe can include the initiator sequence on the 5′ end and/or the 3′ end. In some embodiments, the encoding probe can include an initiator sequence on the 5′ end. In some embodiments, the encoding probe can include an initiator sequence on the 3′ end. In some embodiments, the encoding probe can include an initiator sequence on the 5′ end and an initiator sequence on the 3′ end. In some embodiments, the two initiator sequences have different sequences. In some embodiments, the two initiator sequences have the same sequence. In some embodiments, the encoding probe can include the at least one sequence that comprise at least one site for secondary hybridization events on the 5′ end and/or the 3′ end.
In some embodiments, the initiator sequence is about 10 to about 30, about 15 to about 25, about 18 to about 23, about 15, 16, 17, 18, 19, 20, 21, 22, or 23 nucleotides long. In some embodiments, the initiator sequence is substantially complementary to the toehold sequence of the DNA amplifier.
In some embodiments, an encoding probe can include two fractional encoding probes that have neighboring target regions. The two fractional encoding probes bind to the target to colocalize a full initiator. The colocalized full initiator is required to initiate the hybridization chain reaction by a corresponding amplifier. In some embodiments, there is an energetically unfavorable junction between the two duplexes. In some embodiments, by configuring the fractional initiators to bind to overlapping regions of the amplifier, the duplex can relax into an energetically more favorable conformation, increasing the affinity between the colocalized full initiator and the amplifier. In some embodiments the affinity between the two encoding probes and the target can be increased by configuring the target-binding regions of the two encoding probes to bind to overlapping regions of the target so as to permit the junction between the molecules to relax to an energetically favorable conformation. In some embodiments, the two fractional encoding probes have about the same nucleotide length. In some embodiments, the two fractional encoding probes have different nucleotide lengths, for example one fractional encoding probe may have about 25% nucleotide length and the other the 75% nucleotide length.
The encoding probes, and other probes described herein, may be introduced into the sample (e.g., cell) using any suitable method. In some cases, the sample may be sufficiently permeabilized such that the probes may be introduced into the sample by flowing a fluid containing the probes around the sample (e.g., cells). In some cases, the samples (e.g., cells) may be sufficiently permeabilized as part of a fixation process. In some embodiments, samples (e.g., cells) may be permeabilized by exposure to certain chemicals such as ethanol, methanol, Triton, or the like. In some embodiments, techniques such as electroporation or microinjection may be used to introduce the probes into a sample (e.g., cell).
DNA Amplifier Sequences
“DNA amplifiers,” “amplifiers,” and “amplifier sequences” are used interchangeably when referring to the HiPR-Cycle method described herein.
Amplifier sequences are metastable hairpin sequences that come in pairs (###_H1 and ###_H2), the design is based on HCR amplifier probes and contains a readout-complementary sequencing at the 5′-end (in the case of ###_H1) or 3′-end (###_H2), adjacent to the initiator sequence. Amplifier sequences are stored in a high salt buffer (e.g., 120 mM NaCl), and are heated (e.g., 95° C. for 1.5 min) and annealed (e.g., room temperature for 30 min) prior to addition to the sample.
In some embodiments the amplifiers are stored in high salt buffer, such as, 100 mM, 120 mM, 200 mM, 250 mM, 500 mM, 750 mM, or 1 M NaCl. In some embodiments, the amplifier sequences are heated at high temperatures (e.g., about 95° C. to 100° C.) for a short period of time (e.g., 1, 2, 3, 4 or 5 minutes) followed by a cooling period of about 15 min to 1 hour, e.g., 30 min, to room temperature.
In some embodiments, at least two amplifier probes (one pair) are used for at least one readout probe. In some embodiments, at least two amplifier probes (one pair) are used for multiple (e.g., two or more) readout probes. In some embodiments, at least two amplifier probes (one pair) are used for each readout probe. For example, each amplifier probe can have:
In other examples, each amplifier probe can have:
In other examples, each amplifier probe can have:
The readout complementary sequence of the amplifier probe/DNA amplifier is a nucleotide sequence that is about 10 to about 25, about 15 to about 20, about 15, 16, 17, 18, 19, or 20 nucleotides long and has a nucleotide sequence that is the complement of the emissive readout probe sequence. In some embodiments, the readout sequence present in the amplifier probe is also known as a “landing pad sequence.” In some embodiments, the readout complementary sequence present in the amplifier probe is also known as a “landing pad sequence.”
Each of the optional first and second spacer sequences of the amplifier probe/DNA amplifier is about 1 to 5, about 1, 2, 3, 4, or 5 nucleotides long.
The toehold sequence of the amplifier probe/DNA amplifier is a nucleotide sequence that is about 10 to about 30, about 15 to about 25, about 18 to about 23, about 15, 16, 17, 18, 19, 20, 21, 22, or 23 nucleotides long and has a nucleotide sequence that is the complement to the initiator sequence of the encoding probe.
The stem sequence of the amplifier probe/DNA amplifier is a nucleotide sequence that is about 5 to about 15, about 7 to about 10, about 5, 6, 7, 8, 9, or 10 nucleotides long and has a nucleotide sequence that is a complement to its other stem.
The loop sequence of the amplifier probe/DNA amplifier is a nucleotide sequence that is about 5 to about 15, about 7 to about 10, about 5, 6, 7, 8, 9, or 10 nucleotides long and has a nucleotide sequence that is a complement to the toehold sequence of its pair DNA amplifier.
The stem-complimentary sequence of the amplifier probe/DNA amplifier is a nucleotide sequence that is about 5 to about 15, about 7 to about 10, about 5, 6, 7, 8, 9, or 10 nucleotides long and has a nucleotide sequence that is a complement to its other stem.
In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a readout sequence (R.1), a toehold sequence (T.1), a stem sequence (S.1), a loop sequence (L.1), and a complement stem sequence (cS.1). In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a readout sequence (R.1), a first spacer sequence (Sp.1-1), a toehold sequence (T.1), a stem sequence (S.1), a loop sequence (L.1), and a complement stem sequence (cS.1). In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a readout sequence (R.1), a first spacer sequence (Sp.1-1), a toehold sequence (T.1), a stem sequence (S.1), a second spacer sequence (Sp.1-2), a loop sequence (L.1), and a complement stem sequence (cS.1).
In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a stem sequence (S.2), a loop sequence (L.2), a complement stem sequence (cS.2), a toehold sequence (T.2), and a readout sequence (R.2).
In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a stem sequence (S.2), a loop sequence (L.2), a complement stem sequence (cS.2), a toehold sequence (T.2), a first spacer sequence (Sp.2-1), and a readout sequence (R.2). In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a stem sequence (S.2), a loop sequence (L.2), a second spacer sequence (Sp. 2-2), a complement stem sequence (cS.2), a toehold sequence (T.2), a first spacer sequence (Sp.2-1), and a readout sequence (R.2).
In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a toehold sequence (T.1), a stem sequence (S.1), a loop sequence (L.1), a complement stem sequence (cS.1), and a readout sequence (R.1). In some embodiments, one of the two DNA amplifiers can include, from 5′ to 3′, a readout sequence (R.2), a stem sequence (S.2), a loop sequence (L.2), a complement stem sequence (cS.2), and a toehold sequence (T.2).
In some embodiments, the DNA amplifiers can further include a first spacer sequence and/or a second spacer sequence. In some embodiments, the DNA amplifiers further can include a first spacer sequence. In some embodiments, the DNA amplifiers further can include a second spacer sequence. In some embodiments, the DNA amplifiers further can include a first spacer sequence and a second spacer sequence. In some embodiments, the first spacer sequence is on the 3′ end of the readout sequence and to the 5′ end of the toehold sequence of the DNA amplifier. In some embodiments, the first spacer sequence is on the 3′ end of the toehold sequence and to the 5′ end of the readout sequence of the DNA amplifier. In some embodiments, the first spacer sequence is 1, 2, 3, 4, or 5 nucleotides long. In some embodiments, the first spacer sequence is a random string of three nucleotides. In some embodiments, the second spacer sequence is on the 3′ end of the stem sequence and to the 5′ end of the loop sequence complementary to the initiator of the DNA amplifier. In some embodiments, the second spacer sequence is on the 3′ end of the loop sequence complementary to the initiator and to the 5′ end of the stem sequence of the DNA amplifier. In some embodiments, the second spacer sequence is 1, 2, 3, 4, or 5 nucleotides long. In some embodiments, the second spacer sequence is a random string of three nucleotides.
In some embodiments, the readout sequence of the DNA amplifier can include 15 to 30 nucleotides, or 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides. In some embodiments, the readout sequence of each DNA amplifier is the same sequence. In some embodiments, the readout sequence of each DNA amplifier is the different. In some embodiments, the readout sequence of DNA amplifier has a 50% or less sequence identity to the other the readout sequence of DNA amplifier.
In some embodiments, the toehold sequence (T.1) is a sequence complementary to the loop sequence (L.2) of the other DNA amplifier. In some embodiments, the loop sequence (L.1) is a sequence complementary to the toehold sequence (T.2) of the other DNA amplifier. In some embodiments, the toehold sequence is 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, or 17 nucleotides long. In some embodiments, the loop sequence is 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, or 17 nucleotides long. In some embodiments, the stem region and its complementary sequence are each 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or 21 nucleotides long.
In some embodiments, the method can include adding four DNA amplifiers.
In some embodiments, one of the four DNA amplifiers can include, from 5′ to 3′ a amplifier initiator sequence (HI.1), a toehold sequence (T.1), a stem sequence (S.1), a loop sequence (L.1), and a complement stem sequence (cS.1). In some embodiments, one of the four DNA amplifiers can include, from 5′ to 3′ a stem sequence (S.2), a loop sequence (L.2), complement stem sequence (cS.2), a toehold sequence (T.2), and an amplifier initiator sequence (HI.2). In some embodiments, one of the four DNA amplifiers can include, from 5′ to 3′, a readout sequence (R.1-2), a toehold sequence (T.1-2), a stem sequence (S.1-2), a loop sequence (L.1-2), and a complement stem sequence (cS.1-2). In some embodiments, one of the four DNA amplifiers can include, from 5′ to 3′, a stem sequence (S.2-1), a loop sequence (L.2-1), a complement stem sequence (cS.2-1), a toehold sequence (T.2-1), and a readout sequence (R.2-1).
In some embodiments, the four DNA amplifiers can further include a first and/or second spacer sequence, wherein the first and/or second spacer sequence is about 1 to 5, about 1, 2, 3, 4, or 5 nucleotides long.
In some embodiments, the amplifier initiator sequence (HI.1) is a sequence complementary to the loop sequence (L.1-2 or L.2-1) of one of the other DNA amplifiers can include the readout sequence. In some embodiments, the toehold sequence (T.1) is a sequence complementary to the loop sequence (L.2) of the other DNA amplifier can include the amplifier initiator sequence. In some embodiments, the loop sequence (L.1) is a sequence complementary to the toehold sequence (T.2) of the other DNA amplifier can include the amplifier initiator sequence. In some embodiments, the amplifier initiator sequence is unique so that its sequence is not complementary to any other sequence. In this instance, the initiator sequence is different from the rest of the sequences so that it does not prematurely trigger the amplification reaction.
Emissive Readout Probes
Emissive readouts probes are 15-20 nucleotide-long oligonucleotides bound with one of ten fluorescent dyes at the 5′- and/or 3′-end. In HiPR-Cycle, these sequences bind to the amplifier complexes that form. They can be added during or after the amplification step.
Readout probes (15-20 nt) can be designed as follows:
In some embodiments, the emissive readout sequence is about 10 to about 50, about 15 to about 50, about 15 to about 45, about 15 to about 35, about 15 to about 30, about 18 to about 24, about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 nucleotides long.
In some embodiments, the emissive readout probe can include a label on the 5′ or 3′ end. In some embodiments, the emissive readout probe can include a label on the 5′ end and a label on the 3′ end. In some embodiments, the labels are the same. In some embodiments, the labels are different.
In some embodiments, the label is a fluorescent entity (fluorophore) or phosphorescent entity. In some embodiments, the label is a cyanine dye (e.g., Cy2, Cy3, Cy3B, Cy5, Cy5.5, Cy7, etc.), Alexa Fluor dye, Atto dye, photo switchable dye, photoactivatable dye, fluorescent dye, metal nanoparticle, semiconductor nanoparticle or “quantum dots”, fluorescent protein such as GFP (Green Fluorescent Protein), or photoactivatable fluorescent protein, such as PAGFP, PSCFP, PSCFP2, Dendra, Dendra2, EosFP, tdEos, mEos2, mEos3, PAmCherry, PAtagRFP, mMaple, mMaple2, and mMaple3.
In some embodiments, the label is Alexa Fluor 350, Alexa Fluor 405, Alexa Fluor 430, Alexa Fluor 488, Alexa Fluor 514, Alexa Fluor 532, Alexa Fluor 546, Alexa Fluor 555, Alexa Fluor 561, Alexa Fluor 568, Alexa Fluor 594, Alexa Fluor 610, Alexa Fluor 633, Alexa Fluor 635, Alexa Fluor 647, Alexa Fluor 647-R-phycoerythrin, Alexa Fluor 660, Alexa Fluor 680, Alexa Fluor 680-allophycocyanin, Alexa Fluor 700, Alexa Fluor 750, Alexa Fluor 790, Alexa Fluor Plus 405, Alexa Fluor Plus 488, Alexa Fluor Plus 555, Alexa Fluor Plus 594, Alexa Fluor Plus 647, Alexa Fluor Plus 680, Alexa Fluor Plus 750, Alexa Fluor Plus 800, Pacific Blue, Pacific Green, Rhodamine Red X, DyLight 485-LS, DyLight-510-LS, DyLight 515-LS, DyLight 521-LS, Hydroxycoumarin, methoxycoumarin, Cy2, FAM, Fluorescein FITC, R-phycoerythrin (PE), Tamara, Cy3.5 581, Rox, Red 613, Texas Red, Cy5, Cy5.5, Cy7, Allophycocyanin, ATTO 430LS, ATTO 490LS, ATTO 390, ATTO 425, Cyan 500 NHS-Ester, ATTO 465, ATTO 488, ATTO 495, ATTO Rho110, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 643, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740.
In some embodiments, the label is imaged using widefield microscopy, point scanning confocal microscopy, spinning disk confocal microscopy, lattice lightsheet microscopy, or light field microscopy.
In some embodiments, the detection strategy used is channel, spectral, channel and fluorescence lifetime, or spectral and fluorescence lifetime.
In some embodiments, the labels used in the present methods are imaged using a microscope. In some embodiments, the microscope is a confocal microscope. In some embodiments, the microscope is a fluorescence microscope. In some embodiments, the microscope is a light-sheet microscope. In some embodiments, the microscope is a super-resolution microscope.
In some embodiments, the sample is on an analyzing platform, wherein the analyzing platform is a microscope slide, at least one chamber, at least one microfluidic device, at least one well, at least one plate, or at least one filter membrane.
Increasing Sample Detection
In some embodiments, when it is necessary to increase the multiplexity of the method, the HiPR-Cycle method described herein can be performed multiple times/rounds, by physically expanding the sample, and/or by using branched amplification.
HiPR-Cycle has the ability to perform measurements at high multiplexity with barcoding and spectral readouts. This allows to theoretically detect 2d−1 targets where d is the number of dyes used in an assay. The targets can be given barcodes (based on the encoding sequences) of d bits. The addition of more rounds allows the barcode to be extended. For R rounds, the maximum target multiplexity becomes 2d*R−1 and allows for dR-bit barcodes. For example, for a 10-bit system (using 10 dyes), one such code may be 0010011101. In the 10-bit system with two rounds of HiPR-Cycle, 20-bit barcodes can be used, and achieve 1,048,575 targets. For example, a single gene could be labeled as 0010011101 in round one, and 0100010101 in round two, making its complete barcode 00100111010100010101. This is just a single code of the >1M available codes, which comes from concatenating the barcodes determined in the first and second round of imaging. Since HiPR-Cycle's multiplexing capabilities increase exponentially with both the number of distinguishable fluorescent dyes and the number of rounds, billions of potential targets are available (e.g. 3 rounds using 10 bits per round leads to 1.07 billion targets, as does 2 rounds using 15 bits per round).
In some embodiments, when there are two colors and two rounds, one can encode a gene as 00+01 and another as 01+01, one may have a 0101 gene, but there may be an issue preventing binding in the first case and it will be incorrectly read as 0001. Therefore, a practical maximum would then be (2(D)-1)R.
In some embodiments, multiple rounds can be performed by (1) repeating HiPR-Cycle, in its entirety, twice (or more) using two (or more) different sets of n-bit (e.g., 10-bit) encoding probes; (2) repeating HiPR-Cycle amplification/readout twice (or more) using two different sets of n-bit (e.g., 10-bit) encoding probes; (3) bleaching readout probes; (4) chemical or restriction enzyme or CRISPR cleaving of readout probes; or (5) DNAse cleaving of probes.
In some embodiments, multiple rounds can be performed by repeating HiPR-Cycle, in its entirety, twice (or more) using two (or more) different sets of n-bit (e.g., 10-bit) encoding probes. In these embodiments, the encoding probes are encoded with between 1 and 10 initiators from a selection of 10 initiators. Each initiator corresponds to a unique set of amplifiers which all together hybridize a set of up 10 emissive readout probes. HiPR-Cycle is then performed (including imaging). Encoding probes and readout probes are physically removed from the sample using a stripping buffer, then another round of HiPR-Cycle is performed in its entirety. In some embodiments, the stripping buffer can include formamide and SSC. In some embodiments, the stripping buffer can include about 40% to about 70%, about 40%, 50%, 60%, or 70% formamide. In some embodiments, the stripping buffer can include about 2× to about 10×, about 2×, 5×, or 10×SSC. In some embodiments, the stripping buffer can include 60% formamide and 2×SSC.
In some embodiments, multiple rounds can be performed by repeating HiPR-Cycle amplification/readout twice (or more) using two different sets of n-bit (e.g., 10-bit) encoding probes. In these embodiments, the encoding probes are encoded with between 1 and 20 initiators from a selection of 20 initiators. Each initiator corresponds to a unique set of amplifiers which all together hybridize a set of up 10 emissive readout probes. In this particular embodiment, only amplifiers/readout probes corresponding to a unique color are used in each round. For example, in the case where there are only 3 readout probes (red, blue, green), but want to use 2 rounds of imaging. There could be a code such as 110010 where the first three digits are each a single color and the last three digits are each a single color. Here only the first three digits can be read in a single round, and the last three digits can be read in a single round. Then the amplification/readout steps are performed. The probes are then stripped using stripping buffer. The stripping buffer removes the amplifier and readout probes but does not remove the encoding probes. A second amplifier/readout is then performed with a unique set of amplifier probes.
In some embodiments, multiple rounds can be performed by bleaching readout probes. In these embodiments, the encoding probes can contain many initiators, and many corresponding amplifier and readout probes. Further, two readout probes may have the same fluorophore but different sequences. In this embodiment, HiPR-Cycle is performed and all of the targets are amplified. Readout probes are collected into sets, each set has unique fluorescent dyes and sequences. Readout probes are then added and imaging is done according to the methods described herein. A bleaching buffer can then be placed on the sample and high intensity/exposure laser (e.g., 647 nm at 100% intensity for 1 sec) can be used to bleach probes. The bleaching buffer can then be removed, the sample washed, and the next set of readout probes is added to the bleaching buffer. In this embodiment, the bleaching buffer can include SSC and VRC. In some embodiments, the bleaching buffer can include about 0.1× to about 5×, about 0.5× to about 2.5×, about 1× to about 2×, or about 2×. In some embodiments, the bleaching buffer can include about 0.5 mM to about 5 mM, about 1 mM to about 3 mM, or about 2 mM Vanadyl ribonucleoside complex (VRC). In some embodiments, the bleaching buffer can include 2×SSC and 2 mM VRC. Vanadyl ribonucleoside complex (VRC) is a potent inhibitor of various ribonucleases. This complex is compatible with cell fractionation methods as well as sucrose-gradient centrifugations. The 200 mM stock solution is reconstituted to a green-black clear solution by incubating the sealed vial at 65° C. Once open, the entire sample should be aliquoted into smaller samples and frozen. The vanadyl ribonucleoside complex should be added to all buffers to a final concentration of 10 mM. The buffers should not contain EDTA since one equivalent will totally dissociate the complex. Use of the vanadyl complex is not recommended in cell-free translation systems and with reverse transcriptase. The vanadyl complex can be used in the selective degradation of DNA while preserving RNA since pancreatic deoxyribonuclease I is not inhibited. Removal of the vanadyl ribonucleoside complex from the RNA can be accomplished by adding 10 equivalents of EDTA before ethanol precipitation.
In some embodiments, multiple rounds can be performed by chemical or restriction enzyme or CRISPR cleaving of readouts, which is similar to the bleaching probe method. In here, after a round of imaging the readout sequences are “cut” (with bound readout probes) off of the amplifiers and washed away.
In some embodiments, multiple rounds can be performed by a DNAse method, where after imaging, DNAse is added to the sample to remove all encoding, amplifier, and readout probes. Another round of HiPR-Cycle is then performed in its entirety.
In some embodiments, when it is necessary to increase the ability to identify and quantify targets in a sample, physically expanding the sample can be used in conjunction with the HiPR-Cycle methods described herein. In here, combining the imaging methods described herein with physical expansion allows for samples/cells/molecules to increase the physical distance between them. Further, covalently embedding targets of a sample within a gel matrix makes it possible to “clear” (e.g., digest/remove) unwanted biomolecules (proteins, lipids, etc.) that could contribute to light scattering and background autofluorescence. In these embodiments, a sample is embedded in a hydrogel (e.g., polyacrylamide or polyacrylamide-based gels) and HiPR-Cycle is then performed on the sample-embedded gel.
In some embodiments, when it is necessary to increase the multiplexity of the method, the HiPR-Cycle method described herein can be performed using branched amplification. Intermediates located between the encoding probes and amplifiers can result in an increase (e.g., 4-100 more sites) of initiator sites available per encoding probe compared to other HiPR-Cycle methods described herein. The intermediates contain sequences that can be hybridized to the encoding probes and to the amplifiers. To produce the intermediates, a first intermediate probe is hybridized to the encoding probe, then, a second intermediate probe is hybridized to the first intermediate probe. The second intermediate probe contains multiple binding/complementary sequences for initiator probes to hybridize to. The initiator probes contain (initiator) sequences upon which the amplifiers can bind to. The rest of the HiPR-Cycle method can then be performed as described herein.
Accordingly, in some embodiments, the HiPR-Cycle method can be utilized with sets containing first intermediate probes, second intermediate probes, and/or initiator probes. In some embodiments, the first intermediate probe includes a sequence complementary to a sequence present in an encoding probe and at least one handle sequence (e.g., 1-5 handle sequences). In some embodiments, the second intermediate probe includes a sequence complementary to the at least one handle sequence of the first intermediate probe and at least one initiator landing (or presenting) sequence. In some embodiments, the initiator probes include a sequence complementary to the at least one initiator landing (or presenting) sequence and one initiator sequence complementary to an initiator sequence present in an amplifier. In some embodiments, when branched amplification is utilized, the number of initiators sites/sequences available per encoding probe is from about 4 to about 100, about 6 to about 75, about 10 to about 50, about 15 to about 35, or about 9 to about 18.
HiPR-Swap
Another aspect of the disclosure is directed to a method of analyzing a sample by performing HiPR-Cycle with multiple imaging rounds exchanging emissive readout probes which are referred to herein as HiPR-Swap.
HiPR-Swap uses DNA exchange as a method to quickly, specifically, carefully replace readout probes without disturbing encoding and/or amplifier probes. See, for example, PCT Patent Application PCT/US2022/080355 and U.S. application Ser. No. 18/058,171, filed on Nov. 24, 2022. The contents of the aforementioned disclosures are each incorporated herein by reference in their entireties.
Accordingly, a method for analyzing a sample can include:
Landing Pad Sequences
In the HiPR-Swap method, readout and amplifier probes are designed such that the “landing pad” is shorter than or equal to in length to the readout probe. The landing pad being shorter than the readout probe creates a single-stranded overhang of the readout probe, as it extends past the end of the landing pad. After a readout probe is bound, an exchange probe can be added to the sample. The exchange probe can be constructed to be of equal length and a perfect reverse complement to the readout probe. In some embodiments, the exchange probe may contain locked nucleic acids to increase the stability of the exchange-readout pair. When added, the exchange probe seeds a hybridization to the exposed area of the readout probe. Over a short period of time the exchange probe completely hybridizes to the readout probe, thereby removing it from its complementary sequence where it can be washed away. Importantly, orthogonal readout and exchange probes can be added simultaneously to reduce assay time.
In some embodiments, the readout sequence present in the amplifier probe is referred to as a “landing pad.” In some embodiments, the landing pad includes a sequence that is complementary to the emissive readout sequence. In some embodiments, when HiPR-Swap is being utilized, the landing pad includes a sequence that is complementary to the emissive readout sequence.
In some embodiments, each landing pad sequence is about 10 to about 50, about 15 to about 50, about 15 to about 40, about 10 to about 30, about 15 to about 25, about 18 to about 23, about 15, 16, 17, 18, 19, 20, 21, 22, or 23 nucleotides long. In some embodiments, each landing pad sequence is substantially complementary to the first and/or second emissive readout sequences.
Exchange Probes
Exchange probes are each about 10-50 or 15-50 nucleotide-long oligonucleotides. In some embodiments, each exchange probe includes a 100% complementary sequence to a respective emissive readout probe sequence. In some embodiments, the emissive readout probe sequence is an emissive readout probe as described herein.
In some embodiments, the exchange sequence is about 10 to about 50, about 15 to about 50, about 15 to about 45, about 15 to about 35, about 15 to about 30, about 18 to about 24, about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 nucleotides long.
In some embodiments, the encoding and/or amplifier probes contain locked nucleic acids to stabilize the exchange reaction.
In some embodiments, adding an exchange probe to a sample, hybridizing the exchange probe to a first emissive readout probe, and removing a third complex from the sample are performed in the same step. In some embodiments, adding an exchange probe to the sample, hybridizing the exchange probe to the first emissive readout probe, and removing the third complex from the sample are performed sequentially. In some embodiments, adding an exchange probe to the sample, hybridizing the exchange probe to the first emissive readout probe, removing the third complex from the sample, and adding the second emissive readout probe are performed in the same step. In some embodiments, adding an exchange probe to the sample, hybridizing the exchange probe to the first emissive readout probe, removing the third complex from the sample, and adding the second emissive readout probe are performed sequentially.
In some embodiments, hybridizing the exchange probe to the first or second emissive readout probes results in de-hybridization of the first or second emissive readout probe from the readout (landing pad) sequence. In some embodiments, the step is achieved from about 30 seconds to about 1 hour. In some embodiments, the step is achieved within 30 seconds, 1 minute, 5 minutes, 10 minutes, 12 minutes, 15 minutes, 30 minutes, 45 minutes, or 1 hour. In some embodiments, the step is achieved within 1 hour. In some embodiments, the step is achieved overnight.
Another aspect, a construct can include:
In some embodiments, the region of interest on a nucleotide is at least one of messenger RNA (mRNA), micro RNA (miRNA), long non coding RNA (lncRNA), ribosomal RNA (rRNA), small interfering RNA (siRNA), transfer RNA (tRNA), Crispr RNA (crRNA), trans-activating cirspr RNA (tracrRNA), mitochondria RNA, Intronic RNA, viral mRNA, viral genomic RNA, environmental RNA, double-stranded RNA (dsRNA), small nuclear RNA (snRNA), small nucleolar (snoRNA), piwi-interacting RNA (piRNA), genomic DNA, synthetic DNA, DNA, plasmid DNA, a plasmid, viral DNA, retroviral DNA, environmental DNA, extracellular DNA, a protein, a small molecule, or an antigen.
In some embodiments, the region of interest on a nucleotide is mRNA. In some embodiments, the region of interest on a nucleotide is rRNA. In some embodiments, the region of interest on a nucleotide is mRNA and rRNA.
In some embodiments, the first initiator sequence is to the 5′ end of the targeting sequence. In some embodiments, the second initiator sequence is to the 3′ end of the targeting sequence.
In some embodiments, the first amplifier can include, from 5′ to 3′, a readout sequence (R.1), a toehold sequence (T.1), a stem sequence (S.1), a loop sequence (L.1), and a complement stem sequence (cS.1). In some embodiments, the first amplifier can include, from 5′ to 3′, a readout sequence (R.1), a first spacer sequence (Sp.1-1), a toehold sequence (T.1), a stem sequence (S.1), a loop sequence (L.1), and a complement stem sequence (cS.1). In some embodiments, the first amplifier can include, from 5′ to 3′, a readout sequence (R.1), a first spacer sequence (Sp.1-1), a toehold sequence (T.1), a stem sequence (S.1), a second spacer sequence (Sp.1-2), a loop sequence (L.1), and a complement stem sequence (cS.1).
In some embodiments, the second amplifier can include, from 5′ to 3′, a stem sequence (S.2), a loop sequence (L.2), a complement stem sequence (cS.2), a toehold sequence (T.2), and a readout sequence (R.2). In some embodiments, the second amplifier can include, from 5′ to 3′, a stem sequence (S.2), a loop sequence (L.2), a complement stem sequence (cS.2), a toehold sequence (T.2), a first spacer sequence (Sp.2-1), and a readout sequence (R.2). In some embodiments, the second amplifier can include, from 5′ to 3′, a stem sequence (S.2), a loop sequence (L.2), a second spacer sequence (Sp. 2-2), a complement stem sequence (cS.2), a toehold sequence (T.2), a first spacer sequence (Sp.2-1), and a readout sequence (R.2).
In some embodiments, each amplifier can further include a first and/or second spacer sequence, wherein the first and/or second spacer sequence is about 1 to 5 nucleotides long or about 1, 2, 3, 4, or 5 nucleotides long.
In some embodiments, the toehold sequence (T.1) of the first amplifier is a sequence complementary to the loop sequence (L.2) of the second amplifier.
In some embodiments, the loop sequence (L.1) of the first amplifier is a sequence complementary to the toehold sequence (T.2) of the second amplifier.
In some embodiments, the first and second amplifier have the same readout sequence. In some embodiments, the first and second amplifier have different readout sequences. In some embodiments, the readout sequence present in the amplifier probe is referred to as a “landing pad.” In some embodiments, the landing pad includes a sequence that is complementary to the emissive readout sequence.
In some embodiments, the emissive readout sequence can include a sequence complimentary to the readout sequence of the first amplifier sequence. In some embodiments, the emissive readout sequence can include a sequence complimentary to the readout sequence of the second amplifier sequence. In some embodiments, the emissive readout sequence can include a label on the 5′ end of the complimentary sequence. In some embodiments, the emissive readout sequence can include a label on the 3′ end of the complimentary sequence. In some embodiments, the emissive readout sequence can include a label on the 5′ end and 3′ end of the complimentary sequence.
In some embodiments, the label is a fluorescent entity (fluorophore) or phosphorescent entity. In some embodiments, the label is a cyanine dye (e.g., Cy2, Cy3, Cy3B, Cy5, Cy5.5, Cy7, etc.), Alexa Fluor dye, Atto dye, photo switchable dye, photoactivatable dye, fluorescent dye, metal nanoparticle, semiconductor nanoparticle or “quantum dots”, fluorescent protein such as GFP (Green Fluorescent Protein), or photoactivatable fluorescent protein, such as PAGFP, PSCFP, PSCFP2, Dendra, Dendra2, EosFP, tdEos, mEos2, mEos3, PAmCherry, PAtagRFP, mMaple, mMaple2, and mMaple3.
In some embodiments, the label is Alexa Fluor 350, Alexa Fluor 405, Alexa Fluor 430, Alexa Fluor 488, Alexa Fluor 514, Alexa Fluor 532, Alexa Fluor 546, Alexa Fluor 555, Alexa Fluor 561, Alexa Fluor 568, Alexa Fluor 594, Alexa Fluor 610, Alexa Fluor 633, Alexa Fluor 635, Alexa Fluor 647, Alexa Fluor 647-R-phycoerythrin, Alexa Fluor 660, Alexa Fluor 680, Alexa Fluor 680-allophycocyanin, Alexa Fluor 700, Alexa Fluor 750, Alexa Fluor 790, Alexa Fluor Plus 405, Alexa Fluor Plus 488, Alexa Fluor Plus 555, Alexa Fluor Plus 594, Alexa Fluor Plus 647, Alexa Fluor Plus 680, Alexa Fluor Plus 750, Alexa Fluor Plus 800, Pacific Blue, Pacific Green, Rhodamine Red X, DyLight 485-LS, DyLight-510-LS, DyLight 515-LS, DyLight 521-LS, Hydroxycoumarin, methoxycoumarin, Cy2, FAM, Fluorescein FITC, R-phycoerythrin (PE), Tamara, Cy3.5 581, Rox, Red 613, Texas Red, Cy5, Cy5.5, Cy7, Allophycocyanin, ATTO 430LS, ATTO 490LS, ATTO 390, ATTO 425, Cyan 500 NHS-Ester, ATTO 465, ATTO 488, ATTO 495, ATTO Rho110, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 643, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740.
In some embodiments, a construct can include:
In some embodiments, a construct described herein further includes at least one exchange probe and at least a second emissive readout probe, as described herein.
In some embodiments, a construct described herein further includes at least one first intermediate probe, at least one second intermediate probe, and at least one initiator (landing) probe, as described herein.
In another aspect, a library of constructs can include a plurality of barcoded probes, wherein each barcoded probe can include:
In another aspect, a library of constructs can include a plurality of barcoded probes, wherein each barcoded probe can include:
In some embodiments, a library of constructs can further include at least one exchange probe and at least a second emissive readout probe, as described herein.
In some embodiments, a library of constructs can further include at least one first intermediate probe, at least one second intermediate probe, and at least one initiator (landing) probe, as described herein.
The encoding probes used in the methods described herein, constructs and libraries described herein use barcoded probes. The barcoded probes represent a probe/sequence that is specific to a sample or target sequence in the sample with a unique code.
In some embodiments, the barcoded probes include the encoding probes, DNA amplifiers, and readout sequences described herein.
In some embodiments, each sample or target in the sample to be identified is assigned a unique n-bit binary code selected from a plurality of unique n-bit binary codes, where n is an integer greater than 1.
A “binary code” refers to a representation of target sequence in a sample using a string made up of a plurality of “0” and “1” from the binary number system. The binary code is made up of a pattern of n binary digits (n-bits), where n is an integer representing the number of labels used. The bigger the number n, the greater number of targets can be represented using the binary code. For example, a binary code of eight bits (an 8-bit binary code, using 8 different labels) can represent up to 255 (28-1) possible targets. (One is subtracted from the total possible number of codes because no target sequence is assigned a code of all zeros “00000000.” A code of all zeros would mean no decoding sequence, and thus no label, is attached. In other words, there are no non-labeled target sequences.) Similarly, a binary code of ten bits (a 10-bit binary code) can represent up to 1023 (210-1) possible target sequences. In some embodiments a binary code may be translated into and represented by a decimal number. For example, the 10-bit binary code “0001100001” can also be represented as the decimal number “97.”
Each digit in a unique binary code represents whether a readout probe and the fluorophore corresponding to that readout probe are present for the selected species. In some embodiments, each digit in the binary code corresponds to a Readout probe (from Readout probe 1 (R1) through Readout probe n (Rn) in an n-bit coding scheme). In a specific embodiment, the n is 10 and the digits of an n-bit code correspond to R1 through R10. In some embodiments, the fluorophores that correspond to R1 through Rn are determined arbitrarily. For example, n is 10, and R1 corresponds to an Alexa 488 fluorophore, R2 corresponds to an Alexa 546 fluorophore, R3 corresponds to a 6-ROX (6-Carboxy-X-Rhodamine, or Rhodamine Red X) fluorophore, R4 corresponds to a PacificGreen fluorophore, R5 corresponds to a PacificBlue fluorophore, R6 corresponds to an Alexa 610 fluorophore, R7 corresponds to an Alexa 647 fluorophore, R8 corresponds to a DyLight-510-LS fluorophore, R9 corresponds to an Alexa 405 fluorophore, and R10 corresponds to an Alex532 fluorophore. In some embodiments, other labels/fluorophores are used in the n-bit encoding system.
In some embodiments, the n-bit binary code is selected from the group consisting of 2-bit binary code, 3-bit binary code, 4-bit binary code, 5-bit binary code, 6-bit binary code, 7-bit binary code, 8-bit binary code, 9-bit binary code, 10-bit binary code, 11-bit binary code, 12-bit binary code, 13-bit binary code, 14-bit binary code, 15-bit binary code, 16-bit binary code, 17-bit binary code, 18-bit binary code, 19-bit binary code, 20-bit binary code, 21-bit binary code, 22-bit binary code, 23-bit binary code, 24-bit binary code, 25-bit binary code, 26-bit binary code, 27-bit binary code, 28 bit binary code, 29-bit binary code, and 30-bit binary code.
The methods and constructs described herein have significant advantages of those currently available in the art.
For example, HiPR-Cycle has extremely high target multiplexity (N):
N=2n−1,
HiPR-Cycle can also be performed in imaging rounds. This significant increases the potential for target multiplexity to:
N=2n*r−1,
The examples given above are essential for looking at gene expression in environmental microbiome samples. Even though a single bacteria strain has hundreds of genes on average, a collection of bacteria with S different strains will have potentially tens of thousands of different genes.
Because of the aforementioned, HiPR-Cycle is much cheaper than other methods targeting the same number of mRNAs (or other molecules) because it uses less reagent volume and reduces the number of fluorescently conjugated probes (the main driver of cost outside of encoding probes). For example, using two rounds of HiPR-Cycle and seven readout probes once could achieve 8000 targets, roughly the same number as 80 cycles of other methods that use 80-240 readout probes.
HiPR-Cycle is faster than other methods known in the art. HiPR-Cycle can be performed in 6-24 hours for a single round, and additional rounds could take an additional 1-12 hours. Currently, other methods can take days to a week for complete imaging or 24 hours per round. Because of this, HiPR-Cycle can be used to optically section tissue and resolve gene expression in 3D structures with higher z-ranges than with a wide field epifluorescence microscope. Thus, HiPR-Cycle could be used to look at gene expression in tissues in three dimensions.
HiPR-Cycle amplification can amplify signals of mRNA molecules. This is critical when rRNA and mRNA are to be measured simultaneously, as the rRNA signal is very high across cells. Without wishing to be bound to theory, the HiPR-Cycle amplification strategy, decreases amplification times by including emissive readout probes as described herein.
In some embodiments, a support vector machine is trained on reference data to predict the barcode of single cells in the synthetic communities and environmental samples. In a specific embodiment, the support vector machine is Support Vector Regression (SVR) from Python package. As used herein, the term “support-vector machine” (SVM) refers to a supervised learning model with associated learning algorithms that analyze data used for classification and regression analysis. Given a set of training examples, each marked as belonging to one or the other of two categories, an SVM training algorithm builds a model that assigns new examples to one category or the other, making it a non-probabilistic binary linear classifier. An SVM model is a representation of the examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall.
In some embodiments, the reference spectra are obtained through a brute force approach involving the measurement of the spectra of all possible barcodes using barcoded test E. coli cells. In some embodiments, the n-bit binary encoding is a 10-bit binary encoding and tire reference spectra are obtained through measuring 1023 reference spectra.
In some embodiments the reference spectra are obtained by simulation of all possible spectra. In some embodiments, the simulated spectral data can be used as reference examples for the support vector machine. In some embodiments, the spectra corresponding to individual n-bit binary codes are simulated by adding together the measured spectra of each individual fluorophore (e.g., the reference spectrum for 0000010011 is generated by adding the spectra of R1, R2, and R5; or the reference spectrum for 1010010100 is generated by adding the spectra of R3, R5, R8 and R10). In some embodiments, the spectra corresponding to individual n-bit binary codes are simulated by adding the measured spectra of each individual fluorophore weighted by the relative contribution to the emission signal of each fluorophore. In some embodiments, the relative contribution of each fluorophore is calculated using a Forster Resonant Energy Transfer (FRET) model.
In some embodiments, specimens can be imaged using any one of the listed microscopy techniques and can include superresolution methods (e.g., Airy scan) to detect signals. A single or multiple field(s) of view can be acquired for each specimen. With multiple channels or excitations being performed with samples remaining in the same position for image acquisition. Image files (including metadata) can be saved (e.g., .czi or .nd2 filetypes). Then, data can be imported into a custom script. An optional noise reduction technique can be used to increase the signal-to-noise ratio in images. A generic or whole-cell stain (e.g., 16S rRNA, Eub, DAPI, etc.) can used to determine the boundaries of each bacterium. The channels across which the whole-cell stain is to be used can be integrated into a single image. A segmentation algorithm (e.g., trained U-net, HiPR-FISH-based segmentation, watershed algorithm, etc.) can be used to determine pixels belonging to bacteria and those belonging to the background. The algorithm can then be extended to determine the boundaries of adjacent bacteria. Individual masks can be generated for each bacterium, which receive a unique identifying label and a physical location identifier (e.g., X,Y,Z cartesian coordinates in the volumetric field of view). The channels corresponding to 16S rRNA imaging can be used to generate spectra and taxonomic barcodes using the HiPR-FISH analysis pipeline. Each bacterium identified can then be associated with a taxonomic ID. The channels corresponding to mRNA targeting probes can be integrated into a single image and a spot-detection algorithm can be used to generate a list of potential transcripts, providing each with a unique identifier and a physical location ID. Each ambiguous transcript can be assigned to a specific bacterium using the limits of the segmented mask, generated above. For each spot, a spectrum can be generated across all channels relevant to mRNA detection. The spectra can be compared to a library of spectra generated for combinations of fluorophores. A machine learning method (e.g. UMAP), can be used to generate the barcode of the spot from the trained data of the spectral library. Error correction can be performed to address potential issues; for example, colocalized transcripts could generate a signal that can be deconvoluted. Each bacterium can be assigned a list of identified transcripts. A downstream analysis can then be performed. For this downstream analysis, data structures including a matrix for each bacterial taxa identified can list, for example, each bacterium (columns) and each gene (rows), with the entries representing the number of transcripts for each gene in the cell. Then, physical interaction networks can show the proximity of each taxonomic group and, within each group, each cell state.
In another aspect, a method for analyzing a cell can include:
Deciphering what each cell within taxonomically distinct microbes in a wide variety of samples and specimens is doing through gene expression and metabolic signatures represents the next frontier in understanding and interpreting microbial systems, with wide ranging applicability from clinical to agricultural domains. Here, we describe a novel technology for spatially-resolved multiplexed detection of gene expression within cells, called HiPR-Cycle. HiPR-Cycle uses hybridization chain reactions (HCR) to extend DNA polymers upon encountering ‘initiator’ sequences attached to DNA probes that recognize target genes of interest within cells in situ. The HCR products that form at the site of detected transcripts bear emissive readout probe binding sites, in numbers proportional to the size of the HCR product. Barcoding these HCR products with, for example, 10 readout probes can be used to distinguish >1000 distinct targets. Moreover, by physically amplifying the fluorescent signal of encoding probe binding events through HCR, we are able to detect lowly expressed genes, otherwise overlooked.
Materials
30% probe hybridization buffer: 30% formamide, 5× sodium chloride sodium citrate (SCC), 10% dextran sulfate, 0.1% Tween 20, optional 9 mM citric acid (pH 6.0), optional 50 μg/mL heparin, and optional 1×Denhardt's solution.
30% probe wash buffer: 30% formamide, 5× sodium chloride sodium citrate (SCC), 0.1% Tween 20, optional 9 mM citric acid (pH 6.0), and optional 50 μg/mL heparin.
Amplification buffer: 5× sodium chloride sodium citrate (SCC), 10% dextran sulfate, and 0.1% Tween 20.
Method
We placed samples on a slide and allowed to thoroughly dry. Then, added lysozyme to digest cell walls for 15 min at 37° C. and washed with 1×PBS for 10 min at room temperature. Then, added 30% formamide encoding buffer without probes to samples for 30 min at 37° C. followed by addition of 30% formamide encoding buffer with probes (400 nM) to samples for 3 hours at 37° C. and washed with wash buffer for 5 min at 37° C. We repeated this step two more times then washed with 5×SSC for 5 min at room temperature. The amplifier snap-cool procedure was started as follows: placed each amplifier in its own tube of strip tube and heated to 95° C. for 2 minutes in PCR block, removed and cooled at room temperature for 30 mins. After the washing, the amplification buffer (without probes) was added to the sample for 30 mins at room temperature. The snap-cool amplifiers were added to amplification buffer and placed onto samples. The sample was placed in a covered box to allow for amplification (overnight at room temperature). Then, we removed amplification buffer, washed with 5×SSC for 5 min at room temperature and repeated the step three more times. We then added HiPR readout buffer (2×SSC, 5×Denhardt's solution, 10% dextran sulfate, 10% ethylene carbonate, 0.01% SDS, 400 nM readout probes) for 1 hour at room temperature, washed with HiPR wash buffer (215 mM NaCl, 20 mM Tris-HCl pH 8.0, 10 mM EDTA) for 5 min at room temperature, and repeated this step two more times. We then rinsed with 5×SSCT and allowed to dry. We then added mountant and placed a coverslip over samples then proceeded to imaging.
Imaging was done with a Zeiss i880 confocal microscope with Zen Black (Zeiss) to take the images. For each laser excitation, photons were collected from the excitation wavelength up to about 690 nm in wavelength bins that were 8.9 nm wide. For instance, for 633 nm excitation photons were collected into 6 bins (633-642, 642-651, 651-660, 660-669, 669-678, 678-687 nm). For each image, 5 separate excitations were performed and about 90 channels are collected. Channels were selected or merged as needed to illustrate the success/failure of the assay. The laser settings were in accordance with Table 1.
1All used a scanning repeat of 1 s, Master Gain of 800, Digital Offset of 0, and Digital Gain of 1.
A 2000×2000 pixel image was typically taken. Other settings were also used, for example zoom in 2× and take a 1000×1000 image (so the resolution is the same), in this case, the pixel dwell time is doubled to 16.8 μsec.
The amplifier probes used in the following examples are shown in Table 2.
The readout probes used in the following examples are shown in Table 3.
As a proof of concept, we performed validation experiments with E. coli with GFP/ampR plasmid. The validation experiments included: Target fixed GFP+/− E. coli with mRNA encoding probes and a two bit encoding scheme leading to excitation of either Alexa 405 or Alexa 647.
As shown in
These experiments showed that HiPR-Cycle probe intensity correlated to protein expression and that genes can be specifically barcoded. There was also transcript abundance scales in channels with multiple barcodes. In contrast, the GFP− cells have low barcode intensity. Further, confocal imaging in HiPR-Cycle revealed distinct spectra for different barcodes.
The encoding probes used in this example are shown in Table 4 Amplifier probes 1-4 (SEQ ID NO: 21-24), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
As shown in
Here, HiPR-FISH encoding probes designed to target the E. coli 16S and 23S rRNA segments (with direct R2 readout probes) were added in the encoding probe mixture with HiPR-Cycle encoding probes for GFP transcripts (with indirect R7 and R9 probes through hybridization chain reaction). The encoding was performed for 3 hours at 37° C., while the amplification was performed overnight using two sets of amplifier probes. A readout hybridization was performed with all ten readout probes for one hour at room temperature after amplification.
These experiments showed that there was no change in mRNA encoding with or without the inclusion of rRNA HiPR-FISH probes. Importantly, a high intensity signal was detected in the appropriate emission channel (570 nm after excitation with 561 nm laser), whereas it was undetectable when rRNA probes were not used.
As a further extension of the use of rRNA probes, we compared HiPR-FISH and HiPR-Cycle probes to detect rRNA. Here, we designed HiPR-Cycle probes that were identical to the previously used HiPR-FISH rRNA probes for E. coli, except we replaced the flanking readout regions of these probes with flanking initiators to trigger the amplifiers.
The encoding probes used in this example are shown in Table 5 Amplifier probes 1-2 and 5-6 (SEQ ID NO: 21-22 and 85-86), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
As shown in
Imaging was performed using the 405 nm, 488 nm, and 633 nm lasers. Captured images from specific fields of view were contrasted equivalently. In the image, a single emission channel (414 nm) is shown for a field of view from each condition after stimulation with a 405 nm laser.
Amplifier probes 1-4 (SEQ ID NO: 21-24), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example. Encoding probes 1-20 (SEQ ID NO: 1-20), as shown in Table 4, were used in this example.
We also performed experiments with E. coli with GFP/ampR plasmid. The experiments included: Target fixed GFP+/− E. coli with mRNA encoding probes and a three bit encoding scheme leading to excitation of Alexa 405, Alexa 561, or Alexa 633.
All E. coli were barcoded with a single set of HiPR-Cycle probes targeting GFP transcripts. In here, only one readout probe, Readout Probe 2, (R2) should have bound to the amplifiers used in this sample. As shown in
As expected GFP and R2 were highly detected across cells, while Readout Probe 7 (R7) and Readout Probe 9 (R9) were less abundant. Consistent with the design, only R2 showed heightened signal across cells. In this pure sample (R2 only) the majority of cells were correctly classified, suggesting limited background signal from other probes.
We also developed a synthetic mixture of uniquely barcoded cells. Sample 1 (Red): all cells barcoded with R7 readout probes=0001000001. Sample 2 (Blue): all cells barcoded with R9 readout probes=0100000001. Sample 3 (magenta): all cells barcoded with R2 readout probes=0000000011. We then mixed these samples 1:1:1. As shown in
Signal thresholding identifies uniquely barcoded cells in mixtures. As expected only a subset of cells were positive for each readout fluorophore. Consistent with our mixture, only a subset of cells exhibit heightened signal for each readout fluorophore.
HiPR-Cycle barcoded cells can be resolved in mixtures. As shown in
Encoding probes 1-20 (SEQ ID NO: 1-20), as shown in Table 4, and encoding probes 71-80 as shown in Table 6 below were used in this example. Amplifier probes 1-6 (SEQ ID NO: 21-24 and 85-86), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
One of the primary goals of the HiPR-Cycle technology is to detect endogenous bacterial gene expression. Because of the small cell size and sparsity of transcripts of most endogenous genes in bacteria, detecting measuring gene expression with fluorescence-based imaging poses a significant challenge. We therefore sought to use HiPR-Cycle to detect gene expression from an inducible gene endogenous to E. coli.
LacZ is a well-studied Beta-D-Galactosidase gene whose expression can be induced by the presence of galactose or a galactose mimic, Isopropyl ß-D-1-thiogalactopyranoside (IPTG), within the bacterial culture media. To examine the ability to detect LacZ expression with HiPR-Cycle, we grew E. coli cultures in the presence of IPTG at varying concentrations. Specifically E. coli were grown in media containing 0 mM, 0.1 mM, or 1 mM IPTG for 90 minutes before being collected for HiPR-Cycle.
To detect LacZ transcripts, we designed 32 HiPR-Cycle encoding probes targeting the mRNA sequence of LacZ and performed HiPR-Cycle on bacteria from each of the three culture conditions. As can be seen in
Encoding probes 58-70 (SEQ ID NO: 72-84), as shown in Table 5, and encoding probes 81-112, as shown in Table 7 below, were used in this example Amplifier probes 7-10 (SEQ ID NO: 129-132), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
Another important application for HiPR-Cycle is to observe how changes in environmental conditions change bacterial gene expression, including how the bacteria respond to stresses.
To test the ability of HiPR-Cycle to detect stress response in bacteria, we created a stress response panel (all genes encoded with same initiator) to detect bacterial stress response to heat. The panel included the following genes: ibpA, ipbB, hslJ, hslR, hspQ, yedK, rpoS, recA, and rssB (each with 7 to 12 probes per gene), each pooled at equimolar proportion.
To generate a stress response, E. coli (ATCC 25922) were cultured in tryptic soy broth. Prior to reaching the logarithmic phase of growth E. coli were moved from a 37° C. incubator to a 53° C. water bath for a prescribed amount of time (15 to 60 minutes); a negative control remained at 37° C. for the entirety of the experiment. At the conclusion of the exposure, the bacteria were immediately fixed in 2% formaldehyde.
To detect stress response and bacterial taxonomy, 400 nM of each encoding probe in the stress response panel (corresponding to readout probe 9) and 400 nM of a 16S/23S rRNA panel (corresponding to readout probe 2) were used, respectively. The results of the experiment are shown in
Encoding probes 46-57 (SEQ ID NO: 60-71), as shown in Table 5, and encoding probes 113-221, as shown in Table 8 below, were used in this example Amplifier probes 1-2 and 11-14 (SEQ ID NO: 21-22 and 242-245), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
A concern in using HiPR-Cycle to detect transcripts may be false initialization of amplifier chains from, for example, encoding probes that are incorrectly hybridized to targets, endogenous sequences with homology to initiators, or amplifiers that move out of the hairpin state to the “unraveled” state and initiate a reaction.
If the aforementioned problems are present, an option to solve this is using split initiators as primary probes. The probes split the initiator sequence into two separate probes with neighboring encoding regions. In order for initiation of the amplifiers to occur, both encoding probes forming the initiating complex must be bound as neighbors (see
Another key application of HiPR-Cycle is the ability to perform measurements at high multiplexity with barcoding and spectral readouts. This allows us to theoretically detect 2d−1 targets where d is the number of dyes used in an assay. The targets can be given barcodes (based on the encoding sequences) of d bits.
The addition of more rounds allows the barcode to be extended. For R rounds, the target multiplexity becomes (2d−1)R and allows for dR-bit barcodes.
For example, for a 10-bit system (using 10 dyes), one such code may be 0010011101. In a second round using the 10-bit system, the same code could be extended with additional bits, e.g. 1101011110. Thus, the full 2*10-bit code would include 20 bit, and in this example would be 00100111011101011110. In a 10-bit system with two rounds of HiPR-Cycle, we could use 20-bit barcodes and achieve 1,046,529 targets.
There are several ways that multiple rounds can be achieved.
One method includes repeating HiPR-Cycle, in its entirety, twice (or more) using two (or more) different sets of encoding probes, which could be accomplished as follows:
Another method includes repeating HiPR-Cycle amplification/readout twice (or more) using two different sets of encoding probes. which could be accomplished as follows:
Another method includes bleaching readout probes, which could be accomplished as follows:
Procedure for Stripping Probes
The entire HiPR-Cycle procedure (fixation, digestion, encoding, amplification+readout, imaging) can be performed. Then, probes can be stripped (performed on 37° C. heat block with parafilm covering it) as follows: Cover glass is gently removed. 2×SSC is added to the samples and aspirated to remove imaging buffer. Stripping buffer (60% formamide) is added to the samples. Samples are incubated for 20 minutes. Stripping buffer is aspirated. 1×PBS is added, incubate for 15 minutes. Aspirate 1×PBS and replace with fresh 1×PBS for 15 minutes. Again, aspirate 1×PBS and replace with fresh 1×PBS for 15 minutes. Aspirate 1×PBS and add 2×SSC. Two optional steps can be (A) image samples again to ensure the signal is gone and (B) repeat amplification+readout steps to ensure the encoding probes are gone. Then, HiPR-Cycle is repeated.
In this experiment, we showed that additional rounds of amplification can be performed to generate brighter signals.
Method
We cultured E. coli in the presence of cAMP and IPTG to generate high LacZ expression. The cell suspensions were fixed with 2% formaldehyde (90 minutes at room temperature), washed, and stored in 50% ethanol at −20° C.
For the experiment, cell suspensions were deposited on glass slides and treated with lysozyme (10 mg/mL) for 30 minutes at 37° C. to digest the cell wall. The slide was then washed with PBS (15 min., room temperature) and a pre-encoding buffer was added for 30 minutes (37° C.). A hybridization buffer with LacZ encoding probes (200 nM) and Eubacterium probes (1 μM) was added to the slides and incubated for 16 hours at 37° C. Following the encoding probe hybridization, cells were washed (wash buffer, 48° C. for 15 minutes; 5×SSCT, room temperature for 5 minutes) and a pre-amplification buffer was added for 30 minutes at room temperature.
Amplifier probes and corresponding readout probes were added to the slide for 5 hours at 30° C. The original amplification buffer was removed and a new amplification buffer with new amplifiers and readout probes that could expand off of the old product was added to the samples for 16 hours at 30° C. At the conclusion of amplification, the slides were washed in 2×SSC+Tween 20 at 42° C. for 15 minutes and mounted in ProLong Antifade.
Slides were imaged using a Zeiss i880 confocal in lambda mode with lasers set for 405 nm, 488 nm, 514 nm, 561 nm, and 633 nm excitation modes.
Results
Amplification for two separate rounds was detected by different colors, as shown in
Encoding probes 222-254, as shown in Table 9 below, were used in this example. Amplifier probes 15-18 (SEQ ID NO: 279-282), as shown in Table 2, were used in this example. Readout probes 1 and 9-10 (SEQ ID NO: 25 and 33-34), as shown in Table 3, were used in this example.
Another key application of HiPR-Cycle is the ability to perform measurements at high multiplexity with barcoding and spectral readouts, which allows us to theoretically detect 2d−1 targets where d is the number of dyes used in an assay. The targets can be given barcodes (based on the encoding sequences) of d bits.
HiPR-Cycle boosts signals for specific targets, if amplification is not sufficient, further rounds can be used to amplify off of amplifiers. For n rounds, this requires n amplifier pairs and if the signal from a single round of a single target is some coefficient S, then the theoretical signal amplification from n rounds is Sn.
For example, take the case of two rounds of amplification and a single target. An encoding probe will include initiator A1, the first set of amplifiers H1,1 and H2,1 will be triggered by A1. Importantly, rather than a readout sequence H1,1 will have an overhanging amplifier A2. A2 will trigger a second set of amplifiers H1,2 and H2,2 which contain readout landing pads. The process is shown above. An example of the design is shown in
In this experiment, we show that additional rounds of amplification can be performed to generate brighter signals using an exponentially growing (branched) amplification strategy.
Method
We cultured E. coli in the presence of cAMP and IPTG to generate high LacZ expression. The cell suspensions were fixed with 2% formaldehyde (90 minutes at room temperature), washed, and stored in 50% ethanol at −20° C.
For the experiment, cell suspensions were deposited on glass slides and treated with lysozyme (10 mg/mL) for 30 minutes at 37° C. to digest the cell wall. The slide was then washed with PBS (15 min., room temperature) and a pre-encoding buffer was added for 30 minutes (37° C.). A hybridization buffer with LacZ encoding probes (200 nM) and Eubacterium probes (1 μM) was added to the slides and incubated for 16 hours at 37° C. Following the encoding probe hybridization, cells were washed (wash buffer, 48° C. for 15 minutes; 5×SSCT, room temperature for 5 minutes) and a pre-amplification buffer was added for 30 minutes at room temperature.
First-stage amplifier probes were added to the slide for 5 hours at 30° C. The original amplification buffer was removed and a new amplification buffer with second-stage amplifiers and readout probes that could expand off of the old product was added to the samples for 16 hours at 30° C. At the conclusion of amplification, the slides were washed in 2×SSC+Tween 20 at 42° C. for 15 minutes and mounted in ProLong Antifade. Slides were imaged using a Zeiss i880 confocal in lambda mode with lasers set for 405 nm, 488 nm, 514 nm, 561 nm, and 633 nm excitation modes.
Results
As shown in
Encoding probes 222-254 (SEQ ID NO: 246-278), as shown in Table 9, were used in this example. Amplifier probes 19-22 (SEQ ID NO: 283-286), as shown in Table 2, were used in this example. Readout probes 9-10 (SEQ ID NO: 33-34), as shown in Table 3, were used in this example.
Improvements to imaging resolution can be boosted, not only by increasing the magnification on a microscope, but also by physically expanding the observed specimen. This technique has been termed expansion microscopy and is compatible with single molecule FISH methods. For imaging applications in bacteria and other microorganisms, combining sample expansion approaches with RNA detection could enhance the ability to identify and quantify molecules, such as RNA, within cells by increasing the physical distance between them. Moreover, covalently embedding target molecules of a sample within a gel matrix makes it possible to “clear” (digest/remove) other biomolecules (proteins, lipids, etc.) which would otherwise contribute to light scattering and background autofluorescence.
To embed a sample within a gel in a way that preserves the RNA target locations, we can first chemically modify nucleic acids with LabelX (described below), which adds an acryloyl group to guanine nucleotides. This modification enables target RNA molecules to be incorporated into the polyacrylamide gel matrix. To embed the labeled samples within a gel matrix, we can perfuse (incubate) our preserved/fixed specimen with “monomer” solution (Stock X for expandable gels, or Stock Z for non-expandable gels, described below). We then can add initiator reagents to induce the polymerization of polyacrylamide and formation of a gel. Before expansion, proteins within the specimen can be digested to facilitate clearing of unwanted biomolecules from the specimen and enable isotropic expansion of the sample material. The entire matrix and embedded specimen is then expanded by adding water to the gel. Because small molecules, such DNA probes or amplifiers, can freely diffuse in and out of the gel, HiPR-Cycle can be directly performed on the sample to target RNA molecules directly integrated into the gel matrix before or after the expansion process.
Reagents
LabelX Solution
Label-IT amine is resuspended at 1 mg/mL using the commercial resuspension buffer and mixed. Resuspended Label-ITis then reacted with the AcX/DMSO stock solution at equal mass ratio (e.g. 10 μL of AcX/DMSO stock (at 10 mg/ml) added to 100 μL of Label-IT solution. The reaction is carried out overnight at RT with gentle agitation.
MelphaX
MelphaX can be used instead of LabelX solution to label DNA and RNA within cells for matrix integration. Melphalan (Cayman Chemicals, 16665) is dissolved in anhydrous DMSO (sigma) to 2.5 mg/ml in anhydrous DMSO (Sigma). Acryloyl-X, SE (Thermo Fisher, 20770) is dissolved in anhydrous DMSO to 10 mg/mL. To create MelphaX, Melphalan stock is combined Acryloyl-X 4:1 respectively, SE stock and incubated overnight at room temperature with shaking to make MelphaX (2 mg/ml). Aliquots can be stored at −20° C. in a desiccated environment. Working solution is prepared by diluting MelphaX stock to 1 mg/ml by MOPS buffer (20 mM, pH 7.7)
HiPR-Cycle Reagents
Prepping Bacterial Cells for Embedding
Starting with 100 μL fixed frozen stocks of log-phase growth E. coli, pellet cells (10000×G for 5 mins), resuspend in 100 μL 10 mg/mL Lysozyme and incubate at 37° C. for 30 min to 12 hrs, wash cells 2× with 1×PBS, wash cells 1× with 20 mM MOPS pH 7.7, pellet cells, re-suspend in 100 μL LabelX or MelphaX Solution and incubate at 20-37° C. overnight.
Polymer Synthesis (Gelation):
Before gelation prepare gel casting chamber by placing two coverslides on glass slide with a square gap between them (note sides can remain exposed to air). Then, pellet and wash labeled cells 2× with 1×PBS, pellet and re-suspend cells in 50 μL monomer solution, incubate at RT for 1 minute before proceeding.
Initiate polymerization by adding APS and TEMED to 0.2% w/w final Immediately pipette solution into gel cast and add a coverslip to the top. Transfer specimen (in casting setup) to a humidified incubator set to 22-42° C. (e.g., 37° C.). Wait 1-2 hrs for gelation to occur.
As an alternative to performing preparation and polymer synthesis (gelation) of bacteria in solution, the above steps can be performed on (fixed) bacteria plated on a flat surface (e.g. glass slide). A benefit of this approach is that bacteria will be embedded within the gel in the same plane (closest to the glass surface). To perform steps on a glass slide, a silicone gasket with small chambers can be placed on the glass to contain bacteria and reaction volumes. Fixed bacteria are placed into a well and allowed to dry. Lysozyme treatment and subsequent washes are performed on the dried bacteria within the well. After washes, the bacteria are allowed to dry on the glass and the silicone gasket is removed. A gel casting chamber is then assembled as described, with the dried bacteria in the center. The gelation mix is prepared as described above, but in the absence of the bacteria/sample. Acting quickly, the gelation mix is pipetted onto the specimen located within the casting chamber and the sample is moved to a humidified incubator set to 22-42° C. (for example at 37° C.) and allowed to solidify for 1-2 hrs.
Non Expanding Gelation: To embed samples within non-expandable polyacrylamide gels, replace Stock X (described above) can be used instead of Stock Z (described above) during polymer synthesis steps.
Proteinase K Digestion
Dilute proteinase K (final concentration between 1 U/mL and 200 U/mL; optimal 8 U/mL) in digestion buffer, once gels have solidified, carefully remove coverslip lid and chamber walls, place microscope slide with gel on top into petri-dish containing enough volume (about 5 mL) of digestion buffer to cover gel, leave in digestion buffer for 3-24 hrs (e.g., 12 hrs) at 37° C. Note: After digestion, gels can be stored in 1×PBS, or other saline solutions (such as 5×SSC) until expansion.
HiPR-Cycle on Gelled Samples:
Gently transfer gel into 24-well glass bottom plate, or any other suitable container with a lid. Smaller containers reduce reagent requirements. Containers with glass bottoms are desirable because they enable direct imaging without the need to move gels.
Add encoding buffer (without probes) to samples within wells (30 min at 37° C.). Add 300 μL encoding buffer (with probes) to samples and incubate for 2 to 24 hours at 25-45° C. (for example 3 hrs at 37° C.). Wash gels 3× with excess volume of wash buffer (e.g., 500 μL for 24-well plates; 5 mins each at room temp). Wash with 5×SSCT (5 min at RT).
Start amplifier snap-cool procedure. Place each amplifier oligo (about 5 μL/sample) in its own tube of strip tube. Heat to 95° C. for 2 minutes on a PCR block. Remove and let cool at RT for 30 min. Pool snap-cooled amplifier. Then, add amplification buffer (without probes to sample). 30 min at RT.
Combined Amplification and Readout Probe Binding.
To gels within 24-well dish add the following Mixture:
Place the sample in a covered box to allow for amplification. Allow amplification to proceed for 2-24 hrs at 20-40° C. (e.g., 12 hrs at 25° C.). Remove amp buffer. Wash 3× with 5×SSCT for 5 min at RT.
Expansion and Imaging
At this point the samples can be imaged (pre and/or post expansion). To expand the samples, wash with 0.05×SSCT for 10 minutes.
The expansion factor can be tuned by altering the salt concentration; imaging can also be performed in regular PBS for about 2× expansion.
We examined whether E. coli cells expand uniformly when embedded in a swellable gel matrix (
As described in Example 9 and Example 9.1, multiple rounds of HiPR-Cycle can be performed directly on gel-embedded specimen. In the basic approach, both encoding and amplifier probes would be stripped out of the gel embedded specimen using high formamide washes solutions (described in Example 9). The fluorophore bleaching approach (also described in Example 9) could similarly be applied to gel-embedded specimen.
Alternatively, integrating nucleic acids directly into a gel matrix (either expandable or non-swelling) offers another approach for performing multiple rounds of HiPR-Cycle. Here encoding probe hybridization can be performed on the specimen prior to treatment with LabelX or MelphaX, which would chemically modify encoding probes directly, enabling their integration into the gel matrix with their locations relative to target genes preserved. In this scheme, because the encoding probes are covalently integrated within the gel matrix, they would be resistant to any stripping solutions applied to the specimen. Therefore, only one round of encoding hybridization would be needed, and multiple rounds of amplification and stripping could be used to sequentially create and image amplified signals.
We have shown that rather than using a stripping buffer with a high concentration of formamide, a hypotonic solution (e.g. water) can be used to break down amplification products and remove HiPR-Cycle signal for a target gene. Finally, by re-performing the HiPR-Cycle amplification reaction we were able to restore the fluorescent signal (
Encoding probes 58-70 (SEQ ID NO: 72-84), as shown in Table 5, were used in this example. Amplifier probes 1-2 (SEQ ID NO: 21-22), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
This example shows that we are able to use the rRNA of bacteria as indicators of different conditions using different barcodes/colors. For example, and E. coli exposed two separate drugs can be encoded as 100 when exposed to drug 1 and 010 when exposed to drug 2. The two samples can then be mixed together in a single assay to examine gene expression changes from different conditions. This technique is valuable for screening purposes because it allows the user to highly multiplex the imaging and sample processing steps after encoding.
Methods
E. coli cells were separately grown and harvested under three separate heat stress conditions. Two samples were considered non-heat stressed and were grown at either 30° C. (Sample 1) or 37° C. (Sample 2) and harvested at mid-log growth phase (˜0D600=0.5). Sample 3 was grown at 30° C. until mid-log phase growth, at which point it was heated to 46° C. for 5 minutes to induce heat stress response and then collected. We separately performed HiPR-Cycle encoding probe hybridization on cells from each sample, targeting the same three transcript species within each sample: 16S/23S rRNA, atpD mRNA and clpB mRNA. In order to label cells by sample, we used 16s rRNA encoding probes with a distinct initiator to encode cells from each condition. Sample 1 (no HS 30° C.) cells were labeled so that they would fluoresce upon 488 nm excitation, Sample 2 (no HS 37° C.) with 633 nm, and Sample 3 (5′ HS 46° C.) with 405 nm. The same encoding probes targeting atpD (561 nm excitation) and clpB (514 nm excitation) mRNAs were used for all three samples. After encoding hybridization, cells from each sample were mixed at approximately equal proportions and the sample mixture was plated on glass slides. HiPR-Cycle amplification was performed on the mixture of cells followed by imaging. Consistent with sample-based rRNA encoding, E. coli (ATCC 25922) cells only exhibit fluorescence in one of the three possible channels, with no cells showing blended signals (
Encoding probes 255-348, as shown in Table 10 below, were used in this example. Amplifier probes 1-2, 17-18 and 23-28 (SEQ ID NO: 21-22, 281-282 and 381-386), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
In here, we show that HiPR-Cycle can be used to visualize and quantify gene expression in multiple taxa in a single field of view. While we have shown that we are capable of detecting multiple transcripts in a single taxa, this is the first evidence that a combined assay of HiPR-FISH and HiPR-Cycle probes identifies mRNA and rRNA in specific and broad use cases. An additional valuable piece of evidence in this experiment is the ability to detect antimicrobial resistant genes, as shown by the detection of bla mRNA in K. pneumoniae.
Method
To validate HiPR-Cycle's ability to detect multiple genes across multiple taxa, we performed HiPR-Cycle in a synthetic mixed community. The following bacteria under the following conditions were used: (1) E. coli (ATCC 25922) cultured in exponential growth phase and exposed to a large temperature shock (+16° C.) for 5 minutes; (2) carbapenem-resistant Klebsiella pneumoniae (ATCC BAA-1705) cultured under standard conditions; (3) Pseudomonas aeruginosa (ATCC 10145) cultured to the point it formed a biofilm. The three taxa were fixed in 2% formaldehyde and mixed in equal volumes. We targeted the rRNA of each taxa using HiPR-FISH probes containing a distinct readout for each taxa (Readout probe 4 for E. coli, Readout probe 6 for K. pneumoniae, and Readout probe 8 for P. aeruginosa.). We targeted several mRNAs including specific genes (bla(4) in K. pneumoniae with Readout probe 9, clpB in heat-shocked E. coli with Readout probe 1) and broad genes (me with Readout probe 7 and rho with Readout probe 5).
Encoding probes were added at a concentration of 400 nm and we performed encoding for 3 hours, followed by an overnight amplification/readout at 30° C. with amplifiers at a concentration of 200 nm and readouts at a concentration of 400 nm. Confocal imaging was performed using excitation wavelengths of 405 nm, 488 nm, 514 nm, 561 nm, and 633 nm.
Encoding probes 349-513, as shown in Table 11 below, were used in this example. Amplifier probes 11-14, 17-18, and 23-24 (SEQ ID NO: 242-245, 281-282, and 381-382), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
This example shows that HiPR-Cycle is easily adaptable to profile mammalian gene expression in FFPE tissue. This can have a wide range of applications including and beyond microbiome studies and microbiology. Importantly, the expression of the targeted gene, muc2, seems to be isolated to a small fraction cells which aligns with what is known about colonic goblet cells, which highly express this gene.
Method
We examined the possibility to detect gene expression of host-related genes while simultaneously profiling the taxa of several bacteria present in the healthy mouse colon. Encoding probes were designed to target the expression of mucin transcripts (muc2), which are highly expressed in mucus-producing goblet cells present in the mouse gastrointestinal tract. Additionally, HiPR-FISH encoding probes were used to detect several species of bacteria. A specimen (formalin-fixed paraffin-embedded mouse colon slice on a microscope slide) was first washed in xylene and ethanol in ethanol before HiPR-Cycle was performed. Encoding hybridization was performed at 37° C. for 24 hours, followed by an amplification/readout step performed at 30° C. for 15 hours. After amplification: (1) off target signal was removed with heated (42° C.) washes, (2) nuclei were stained with DAPI (1:50000 in 5×SSC), and (3) the tissue was cleared with Vector Laboratories' TrueVIEW Autofluorescence Quenching Kit. Mounted slides were then imaged on the confocal microscope.
We detected high muc2 gene expression is a small fraction of cells, which agreed with past reviews and work showing goblet cell fractions do not exceed 1 in 6 intestinal epithelial cells (Kim Y S, Ho S B. Intestinal goblet cells and mucins in health and disease: recent insights and progress. Curr Gastroenterol Rep. 2010; 12(5):319-330) (
Encoding probes 514-531, as shown in Table 11 below, were used in this example. Amplifier probes 17-18 (SEQ ID NO: 281-282), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example
This experiment allowed us to show the ability to detect the expression of multiple genes in tissue using HiPR-Cycle, for example, to detect uncommon cell types like Gcg-expressing enteroendocrine cells, as shown in
Method
Fresh frozen colon tissue was embedded in Tissue-Tek O.C.T. compound and sectioned at a thickness of 10 microns at −19° C. onto Ultrastick glass slides. Following sectioning, sections were covered with 2% formaldehyde and incubated in a chemical fume hood for 90 minutes at room temperature. Following fixation, samples were rinsed with 1×PBS, three times to remove the fixative. Specimens were placed in mailers with 70% ethanol and chilled to 4° C. for four hours to permeabilize the cell membrane.
After fixation, we added 10 μg/ml lysozyme to digest bacterial cell walls and incubated the sections for 30 minutes at 37° C.; sections were then washed with 1×PBS for 15 minutes at room temperature. We then added pre-encoding buffer to samples for 30 minutes at 37° C. Encoding buffer containing probes for Gcg (readout probe 10 [R10]), Gsn (readout probe 1 and 3 [R1+R3]), Aqp4 (readout probe 2 [R2]), Pam (readout probe 4 [R4]), Krt8 (readout probe 5 [R5]), Prdx1 (readout probe 6 [R6]), Col3a (readout probe 7 [R7]), Atp5a1 (readout probe 8 [R8]), and Kif3a (readout probe 9 [R9]) (at a concentration of 400 nM per gene pool) were added to specimens. The specimens were incubated overnight (16 hours) at 37° C.
The following day, specimens were washed in HiPR-FISH wash buffer for 15 minutes at 48° C. followed by 5×SSC+Tween 20 (5 minutes, room temperature) to remove unbound encoding probes. A pre-amplification buffer was added to specimens for 30 minutes at room temperature. During this incubation, amplifier probes were annealed by heating to 95° C. for 2 minutes and allowed to cool to room temperature for 30 minutes. An amplification buffer was then prepared by adding ten pairs of amplifier probes (90 nM each; stocks of probes at 9 μM). Amplification proceeded at 30° C., overnight (20 hours).
The following day, specimens were washed with 2×SSC+Tween 20 at 42° C. for 15 minutes. We then performed a readout step, where amplification buffer with all 10 readout probes (400 nM each) was added to specimens and incubated for 90 minutes at room temperature in the dark. Specimens were again washed with 2×SSC+Tween 20 at 42° C. for 15 minutes. Mammalian tissue was then cleared using TruView (Vector Laboratories) according to manufacturer instructions. Following the manufacturer-described wash step, specimens were incubated in 5×SSC+DAPI (1:1,000,000 dilution) for two minutes. The specimens were then mounted in Prolong Antifade.
Slides were imaged using a Zeiss i880 confocal in lambda mode with lasers set for 405 nm, 488 nm, 514 nm, 561 nm, and 633 nm excitation modes.
Encoding probes 222 and 570-756, as shown in Table 13 below, were used in this example. Amplifier probes 7-10, 17-18, and 27-36 (SEQ ID NO: 129-132, 281-282, 385-386, and 795-802), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
In this experiment we show the ability to detect the expression of multiple genes in relation to the microbial taxa (
Method
Fresh frozen colon tissue was embedded in Tissue-Tek O.C.T. compounds and sectioned at a thickness of 10 microns at −19° C. onto Ultrastick glass slides. Following sectioning, sections were covered with 2% formaldehyde and incubated in a chemical fume hood for 90 minutes at room temperature. Following fixation, samples were rinsed with 1×PBS, three times to remove the fixative. Specimens were placed in mailers with 70% ethanol and chilled to 4° C. for four hours to permeabilize the cell membrane.
After fixation, we added 10 μg/ml lysozyme to digest bacterial cell walls and incubated the sections for 30 minutes at 37° C.; sections were then washed with 1×PBS for 15 minutes at room temperature. We then added pre-encoding buffer to samples for 30 minutes at 37° C. An encoding buffer was synthesized to include probes for: mRNA from Ceacam20 (readout probe 3 [R3]), Myh11 (readout probe 10 [R10]), Lypd8 (readout probe 1 [R1]), Cd52 (readout probe 2 [R2]), Ubc (readout probe 7 [R7]), and Acta2 (readout probe 9 [R9]) (at a concentration of 400 nM per gene pool); 16S rRNA from Duncaniella (readout probe 1 [R1]), Bacteroides (readout probe 2 [R2]), Turicibacter (readout probe 3 [R3]), Akkermansia (readout probe 6 [R6]), Ruminococcus (readout probe 7 [R7]), Enterococcus (readout probe 8 [R8]), Anaeroplasma (readout probe 10 [R10]), and broad Eubacterium (readout probe 9 [R9]) (at a concentration of 200 nM per pool). The encoding probes were added to specimens. The specimens were incubated overnight (16 hours) at 37° C.
The following day, specimens were washed in HiPR-FISH wash buffer for 15 minutes at 48° C. followed by 5×SSC+Tween 20 (5 minutes, room temperature) to remove unbound encoding probes. A pre-amplification buffer was added to specimens for 30 minutes at room temperature. During this incubation, amplifier probes were annealed by heating to 95° C. for 2 minutes and allowed to cool to room temperature for 30 minutes. An amplification buffer was prepared by adding ten pairs of amplifier probes (90 nM each; stocks of probes at 9 μM) and readout probes (400 nM each) and incubated overnight at 30° C. at room temperature in the dark. Specimens were again washed with 2×SSC+Tween 20 at 42° C. for 15 minutes. Mammalian tissue was then cleared using TruView (Vector Laboratories) according to manufacturer instructions. Following the manufacturer-described wash step, specimens were incubated in 5×SSC+DAPI (1:1,000,000 dilution) for two minutes. The specimens were then mounted in Prolong Antifade.
Slides were imaged using a Zeiss widefield epifluorescence microscope (20Z air objective) with channels (excitation plus filters) set for dyes present in the experiment.
Encoding probes 757-928, as shown in Table 14 below, were used in this example. Amplifier probes 7-10, 29-30, and 39-44 (SEQ ID NO: 129-132, 795-796, 805-806, and 979-982), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
In here, we show the ability to create spatial maps illustrating the distance between host cells and microbes (
Method
Fresh frozen colon tissue was embedded in Tissue-Tek O.C.T. compound and sectioned at a thickness of 10 microns at −19° C. onto Ultrastick glass slides. Following sectioning, sections were covered with 2% formaldehyde and incubated in a chemical fume hood for 90 minutes at room temperature. Following fixation, samples were rinsed with 1×PBS, three times to remove the fixative. Specimens were placed in mailers with 70% ethanol and chilled to 4° C. for four hours to permeabilize the cell membrane.
After fixation, we added 10 μg/ml lysozyme to digest bacterial cell walls and incubated the sections for 30 minutes at 37° C.; sections were then washed with 1×PBS for 15 minutes at room temperature. We then added pre-encoding buffer to samples for 30 minutes at 37° C. An encoding buffer was synthesized to include probes for: mRNA from Hif1a (R3), Muc2 (R10), Sprr2a1 (R1), Epcam (R2), Mki67 (R7), and Acta2 (R9) (at a concentration of 400 nM per gene pool); 16S rRNA from Duncaniella (R1), Bacteroides (R2), Turicibacter (R3), Akkermansia (R6), Ruminococcus (R7), Enterococcus (R8), Anaeroplasma (R10), and broad Eubacterium (R9) (at a concentration of 200 nM per pool). The encoding probes were added to specimens. The specimens were incubated overnight (16 hours) at 37° C.
The following day, specimens were washed in HiPR-FISH wash buffer for 15 minutes at 48° C. followed by 5×SSC+Tween 20 (5 minutes, room temperature) to remove unbound encoding probes. A pre-amplification buffer was added to specimens for 30 minutes at room temperature. During this incubation, amplifier probes were annealed by heating to 95° C. for 2 minutes and allowed to cool to room temperature for 30 minutes. An amplification buffer was prepared by adding ten pairs of amplifier probes (90 nM each; stocks of probes at 9 μM) and readout probes (400 nM each) and incubated overnight at 30° C. at room temperature in the dark. Specimens were again washed with 2×SSC+Tween 20 at 42° C. for 15 minutes. Mammalian tissue was then cleared using TruView (Vector Laboratories) according to manufacturer instructions. Following the manufacturer-described wash step, specimens were incubated in 5×SSC+DAPI (1:1,000,000 dilution) for two minutes. The specimens were then mounted in Prolong Antifade.
Slides were imaged using a Zeiss i880 confocal in lambda mode with lasers set for 405 nm, 488 nm, 514 nm, 561 nm, and 633 nm excitation modes.
We processed the images (five matched .CZI files) using our standard HiPR-FISH pipeline to segment bacterial cells (using the maximum projection of all 95 collected channels), determine their spectral signature, and convert the spectra to a barcode to reveal the taxonomic identity (here, genus). Cell types were determined by examining the intensity corresponding to label transcripts (e.g. examining channels corresponding to high Alexa Fluor 488 signal. we determined cells containing R1-tagged transcripts and hence those with high Sprr2a1 expression).
We assigned a cell type to each identified mammalian cell and determined the centroid for each mammalian cell and each microbial cell. The distances between each mammalian cell and microbial cell were determined and used to assemble a heat map, as well as examine broader trends of association between cell types/states and microbes.
Encoding probes 757-796, 905-928 (SEQ ID NO: 807-846, 955-978), as shown in Table 14, and encoding probes 983-1084, as shown in Table 15 below, were used in this example. Amplifier probes 7-10, 17-18, 27-30, and 39-40 (SEQ ID NO: 129-132, 281-282, 385-386, 795-796, and 805-806), as shown in Table 2, were used in this example. Readout probes 1-10 (SEQ ID NO: 25-34), as shown in Table 3, were used in this example.
Here, as shown in
Method
Fixed suspensions of cells were prepared separately as follows:
E. coli (ATCC 25922) were cultured in suspension at 37° C. in a shaker for multiple passages. In the last passage, cells were incubated at 30° C. for one hour and then the tube containing cells were placed in a water bath at 46° C. for 5 minutes to induce heat shock. Following the shock, cells were placed on ice for 30 seconds and then immediately fixed in an equal volume of 2% formaldehyde for 90 minutes at room temperature. Pelleted cells were rinsed with 1×PBS and stored in 70% ethanol at −20° C.
Candida albicans cells were cultured on YM media plates for several passages at 30° C. Cells were pelleted and fixed in 2% formaldehyde for 60 minutes at room temperature in a rotator. Pelleted cells were rinsed with 1×PBS and resuspended in ice-cold Buffer B (1×PBS with 1.2 M sorbitol and 100 mM of potassium phosphate dibasic). 5 μL of zymolyase (per 1 mL of cell suspension) was added to the suspension and mixed by vortexing. The cells were incubated for 30 minutes at 30° C. to enable cell wall digestion. Cells were then washed with ice-cold Buffer B and stored in 70% ethanol at −20° C.
Mouse 3T3 fibroblast cells were cultured in Complete Growth Medium (DMEM+10% bovine calf serum+1× Penicillin and Streptomycin) in Petri dishes at 37° C. (5% CO2). At collection, adherent cells were released from the plate using a Trypsin-EDTA solution and incubated for several minutes. Cells were then washed in 1×PBS before being fixed in 3.7% formaldehyde for 10 minutes at room temperature. Fixed stocks were washed in 1×PBS and resuspended in 70% ethanol at −20° C.
A mixture of fixed 3T3 cells, C. albicans cells, and heat-shocked E. coli were mixed in a 1:1:5 volume ratio and deposited on Ultrastick glass slides. We added 10 μg/ml lysozyme to digest bacterial cell walls and incubated the sections for 30 minutes at 37° C.; sections were then washed with 1×PBS for 15 minutes at room temperature. We then added pre-encoding buffer to samples for 30 minutes at 37° C. An encoding buffer was synthesized to include probes for: mRNA from C. albicans-specific ALS10 (R6), E. coli-specific clpB (R2), and murine-specific Vtn (R2) and Col1a1 (R6) (at a concentration of 400 nM per gene pool); 16S rRNA from E. coli (R4) and 18S rRNA from C. albicans (R1) (at a concentration of 2 μM per pool). The encoding probes were added to specimens. The specimens were incubated overnight (16 hours) at 37° C.
The following day, specimens were washed in HiPR-FISH wash buffer for 15 minutes at 48° C. followed by 5×SSC+Tween 20 (5 minutes, room temperature) to remove unbound encoding probes. A pre-amplification buffer was added to specimens for 30 minutes at room temperature. During this incubation, amplifier probes were annealed by heating to 95° C. for 2 minutes and allowed to cool to room temperature for 30 minutes. An amplification buffer was prepared by adding ten pairs of amplifier probes (60 nM each; stocks of probes at 3 μM) and readout probes (400 nM each) and incubated overnight at 30° C. at room temperature in the dark. Specimens were again washed with 2×SSC+Tween 20 at 42° C. for 15 minutes.
Encoding probes 287-311 (SEQ ID NO: 319-343), as shown in Table 10, and encoding probes 1031-1130, as shown in Table 16 below, were used in this example Amplifier probes 41-42 and 45-46 (SEQ ID NO: 979-980 and 1185-1186), as shown in Table 2, were used in this example. Readout probes 1-2 and 5-6 (SEQ ID NO: 25-26 and 29-30), as shown in Table 3, were used in this example.
Here we demonstrate that after amplification readout probes could be unbound and bound without disturbing the amplified structures.
Method
E. coli cultures (ATCC 25922) were established by incubating bacteria in suspension in tryptic soy media at 37° C. in a shaker. For one batch, on the final passage, media included 1 mM IPTG and cAMP to induce LacZ expression. For another batch, in the last passage, cells were incubated at 30° C. for one hour and then the tube containing cells were placed in a water bath at 46° C. for 5 minutes to induce heat shock. Following the shock, cells were placed on ice for 30 seconds. In both cases, suspensions were then immediately fixed in an equal volume of 2% formaldehyde for 90 minutes at room temperature. Pelleted cells were rinsed with 1×PBS and stored in 50% ethanol at −20° C.
Suspensions were mixed in roughly equal concentrations and deposited on glass coverslip. Lysozyme (10 mg/mL) was deposited on the coverslip for 30 minutes at 37° C. Cells were then washed with 1×PBS at room temperature for 15 minutes. The coverslip was subsequently dried by dunking in 100% ethanol. An encoding buffer containing gene encoding probes for LacZ and clpB (200 nM) and Eubacterium (200 nM) was deposited on cells and incubated overnight at 37° C. The following day the coverslip was washed with HiPR-FISH wash buffer at 48° C. for 15 minutes and the coverslip was again dried with amplification 100% ethanol.
A pre-amplification buffer (i.e. containing no probes) was added to the specimens and incubated for 30 minutes at 30° C. Amplifier probes corresponding to encoding probe initiators were annealed and added to amplification buffer. The pre-amplification buffer was aspirated and replaced with amplification buffer and incubated at 30° C. for overnight.
The following day, the coverslip was washed in 2×SSCT at 42° C. for 15 minutes. The coverslip was then mounted on an FCS2 (bioptechs) flow cell and attached to an Aria flow system (Fluigent) to deliver buffers while the setup was on the confocal microscope (Zeiss i880).
First, a readout buffer containing readout probes 9-11 (each at 400 nM) was flowed onto the cells and incubated for 1 hour at 37° C. The cells were then washed with HiPR-FISH wash buffer for 15 minutes at 42° C. Finally, 2×SSC solution was flowed onto the cells and they were imaged.
In the second round, a readout/exchange buffer containing readout probe 12 (400 nM) and exchange probes 1 and 2 (10 μM) was flowed onto the cells and incubated for 1 hour at 37° C. The cells were then washed with HiPR-FISH wash buffer for 15 minutes at 42° C. Finally, 2×SSC solution was flowed onto the cells and they were imaged.
In the third round, a readout/exchange buffer containing readout probe 11 and 10 (400 nM) and exchange probe 3 (10 μM) was flowed onto the cells and incubated for 1 hour at 37° C. The cells were then washed with HiPR-FISH wash buffer for 15 minutes at 42° C. Finally, 2×SSC solution was flowed onto the cells and they were imaged.
Results
As seen in
Encoding probes 287-311 (SEQ ID NO: 319-343), as shown in Table 10, 1031-1034 (SEQ ID NO: 1085-1088), and encoding probes 1131-1178, as shown in Table 17 below, were used in this example. Table 17 also contains the additional readout probes and exchange probes used in this example. Amplifier probes 17-18 and 47-50 (SEQ ID NO: 281-282 and 1240-1243), as shown in Table 2, were used in this example. Readout probes 9-10 (SEQ ID NO: 33-34), as shown in Table 3, were used in this example.
Here, we demonstrate the ability of HiPR-Cycle to measure molecular targets that extended beyond nucleic acids (
Fixed GFP-expressing and non-GFP-expressing E. coli were mixed in a 1:1 ratio. The fixed E. coli stock was deposited onto a glass slide and lysozyme (10 μL of 10 mg/mL) was added to the slide and incubated at 37° C. for 15 minutes to digest the cell wall. The cells were washed twice with 1×PBS for 10 minutes at room temperature. Blocking buffer (5% bovine serum albumin (BSA) in PBS) was added to the slide for one hour at room temperature. Following blocking, we performed primary protein hybridization overnight at 4° C. with at a 1:500 dilution from stock. On the following day, slides were washed five times (5 minutes at room temperature, each) with PBST (PBS+0.1% Tween 20). A secondary antibody protein hybridization was performed with an initiator-conjugated protein for one hour at room temperature. At completion, we washed the slides with PBST three times (5 minutes at room temperature, each). Finally, the slides were re-fixed in 4% formaldehyde (Image-IT) for 10 minutes at room temperature and rinsed with 1×PBS.
Slides were then treated through the standard HiPR-Cycle assay. HiPR-Cycle encoding buffer containing probes for 16S rRNA and GFP mRNA (each pool at 80 nM and barcoded uniquely), was added to slides and incubated for 3 hours at 37° C. Samples were then washed with HiPR-Cycle wash buffer (5 minutes at 37° C., three times) and once with 5×SSC+Tween 20 for 5 minutes. A pre-amplification was performed (adding amplification buffer without HiPR-Cycle amplifier probes) for 30 minutes at room temperature, before adding amplification buffer with amplifier and readout probes corresponding to targets for amplification, and incubating at 30° C. overnight. Finally, slides were washed with 2×SSC+Tween 20 and incubated for 15 min at 37° C. before mourning with Prolong Antifade and a coverslip.
Slides were imaged using a Zeiss i880 confocal in the Airy Scan (super-resolution) mode with lasers set for 488 nm, 561 nm, and 633 nm excitation modes.
Encoding probes 71-80 (SEQ ID NO: 87-96), as shown in Table 6, and encoding probes 1179-1190, as shown in Table 18 below, were used in this example. Table 18 also contains the initiator sequence used to conjugate with the protein. Amplifier probes 7-8 (SEQ ID NO: 129-130), as shown in Table 2, were used in this example. Readout probes 7 and 9 (SEQ ID NO: 31 and 33), as shown in Table 3, were used in this example.
Here, we demonstrate the ability of HiPR-Cycle to measure molecular targets that extended beyond nucleic acids in mammalian cell types, as shown in
Mouse 3T3 fibroblast cells were cultured in Complete Growth Medium (DMEM+10% bovine calf serum+1× Penicillin and Streptomycin) in Petri dishes at 37° C. (5% CO2). At collection, adherent cells were released from the plate using a Trypsin-EDTA solution and incubated for several minutes. Cells were then washed in 1×PBS before being fixed in 3.7% formaldehyde for 10 minutes at room temperature. Fixed stocks were washed in 1×PBS and resuspended in 70% ethanol at −20° C.
The fixed 3T3 cells were deposited onto a glass slide and rinsed twice with 1×PBS. The cells were then permeabilized by adding Permeabilization Buffer (1×PBS with 0.1% Triton X-100). The slides were incubated for one hour at room temperature and then placed at 4° C. overnight. On the following day, the slides were washed with 1×PBS, twice, at room temperature. Blocking buffer (5% BSA in PBS) was deposited on the cells for one hour at room temperature. At the conclusion of blocking, the primary protein buffer was deposited on the cells and the slides were stored at 4° C., overnight. On the following day, specimens were washed with PBST (1×PBS with 0.1% Tween 20) for five minutes at room temperature (repeated four times). A secondary protein stain solution containing an initiator-conjugated protein (conc. 1 μg/mL in blocking buffer) was prepared and deposited on the cells. The slides were incubated for one hour at room temperature. Slides were washed with PBST for five minutes at room temperature (repeat two times).
Because no RNA molecules were targeted in this assay, we proceed to the amplification step. Samples were then washed once with 5×SSC+Tween 20 for 5 minutes. A pre-amplification was performed (adding amplification buffer without HiPR-Cycle amplifier probes) for 30 minutes at room temperature, before adding amplification buffer with amplifier and readout probes corresponding to targets for amplification, and incubating at 30° C. overnight. Slides were washed with 2×SSC+Tween 20 and incubated for 15 minutes at 42° C. Nuclei were stained with 5×SSC containing DAPI (20 ng/mL) and incubated for 5 minutes at room temperature in the dark. Prolong Antifade was added to each well with a coverslip to mount the samples.
Slides were imaged using a Zeiss i880 confocal in the lambda mode with lasers set for 488 nm, 561 nm, and 633 nm excitation modes.
Amplifier probes 45-46 (SEQ ID NO: 1185-1186), as shown in Table 2, were used in this example. Readout probes 1 and 6 (SEQ ID NO: 25 and 30), as shown in Table 3, were used in this example. The initiator sequence use in this example has the following sequence: GCTCGACGTTCCTTTGCAACA/3AmMO/(SEQ ID NO: 1257).
Details of one or more embodiments are set forth in the accompanying drawings and description. Other features, objects, and advantages will be apparent from the description, drawings, and claims. Although a number of embodiments of the invention have been described, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. It should also be understood that the appended drawings are not necessarily to scale, presenting a somewhat simplified representation of various features and basic principles of the invention.
This application claims the benefit of U.S. Provisional Application No. 63/312,253, filed on Feb. 21, 2022, U.S. Provisional Application No. 63/317,180, filed on Mar. 7, 2022 and U.S. Provisional Application No. 63/339,291, filed on May 6, 2022, each of which is incorporated by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
63312253 | Feb 2022 | US | |
63317180 | Mar 2022 | US | |
63339291 | May 2022 | US |