This application claims the benefit of European Patent Application No. 22167058.1, filed Apr. 7, 2022, the entire contents of which are incorporated herein by reference in its entirety.
The present invention relates to a method for sequencing and in situ localization of rolonies comprising a Spatial Unique Molecular Identifier (SUMI) combined with target capture of RNA or DNA by hybridization and/or amplification from tissue sections for high density spatial analysis by in vitro sequencing. A modification of the sequencing method allows localizing protein and metabolite molecules.
Padlock oligonucleotides have proven to be very successful in polymerizing short portion of nucleic acids to which it has been hybridized to. Most padlock approaches begin by reverse transcribing the target into cDNA.
Padlock methods are for example disclosed in “Highly multiplexed subcellular RNA sequencing in situ” by Lee et al., Science. 2014 Mar. 21; 343(6177): 1360-1363. doi:10.1126/science.1250212 or “Efficient In Situ Detection of mRNAs using the Chlorella virus DNA ligase for Padlock Probe Ligation” by Nils Schneider and Matthias Meier; Feb. 5, 2020—Cold Spring Harbor Laboratory Press.
A comprehensive assay for targeted multiplex amplification of human DNA sequences is published by Sujatha Krishnakumar et al.; PNAS sent for review Feb. 19, 2008.
Further, WO2017143155A2 discloses multiplex alteration of cells using a pooled nucleic acid library and analysis thereof and WO2018045181A1 discloses Methods of generating libraries of nucleic acid sequences for detection via fluorescent in situ sequencing.
The published Padlock methods allow sequencing of DNA or RNA, but do not give any spatial information within a cell and tissue location the sequenced DNA or RNA origins from.
Microscopy imaging that allow for multiple mRNAs to be resolved at a single cell level provides valuable information regarding transcript amount and localization, which is a crucial factor for understanding tissue heterogeneity, the molecular development and treatment of diseases.
Fluorescence in situ hybridization (FISH)-based methods allow for transcripts to be directly labelled in tissue sections and for spatial information to be captured. However, the numbers of probes that can be used is limited and overlap of fluorescence signals is often an issue. Moreover, the optical resolution of confocal microscopy limits often are reached and therefore the amount of probes that can be detected concomitantly is reduced. SeqFISH+, is an sequential fluorescent in situ hybridization approach, that does not use probes already labelled with fluorophores but rather uses transcript-specific ones that contain barcode sequences which serve as target sites for fluorescently labelled secondary probes. The various target-specific probes are identified using secondary probes that bind to these barcode sites during sequential rounds of probing. By limiting the amount of probes that are detected by the secondary probes a limited amount are fluorescing and therefore the signal can be discernible. Multiple separated images are collected and aggregated computationally to create a composite high-resolution image without requiring high resolution instrument microscope.
However, although these approaches allow for the evaluation of several genes simultaneously, the sequence information of the transcript is not captured. Other methods based on single-cell RNA sequencing (scRNA-seq) can profile whole transcriptomes and capture the sequence information. However, the original location at the tissue or single cell level is often also missing. A method where both sequence and spatial information is be captured at a resolution approaching the single-cell remains a difficult challenge. Some approaches have used FISSEQ and BaristaSeq (another gap-filling padlock based approach to achieve that task with a limited read-length of about 15 bases).
Recently in situ genome sequencing (IGS) has been described as a method to simultaneously sequence and image genomes within a sample. This method describes a workflow to localize unique molecular identifiers (UMIs) by short read in situ sequencing followed by amplicon dissociation, PCR and ex situ sequencing of amplicons associated to genomic sequences with UMIs by paired-end sequencing published by A. C. Payne et al., Science 10.1126/science.aay3446 (2020), first online release 31 Dec. 2020.
Very recently a “Method combining targeted RNA or c-DNA in vitro sequencing using padlock oligonucleotides comprising a Spatial Unique Molecular Identifier (SUMI) and in situ sequencing” was described (EP22154712.8). This method describes spatial identification of a target sequence incorporated into the padlock by a combination of in situ sequencing (SUMI) and in vitro sequencing (SUMI and target sequence). As the target sequence and the SUMI sequence are part of the same padlock and the resulting rolony, the density of the target information is limited by the number of rolonies which can be sequenced in situ within the area of a cell. The following invention is overcoming this limitation.
Object of the invention is a method for obtaining the spatial location and sequence information of a target sequence in a sample comprising at least one RNA or single stranded DNA comprising the steps:
Obtaining a spatial location requires a solid surface like a tissue or an immobilized sample. The method of the invention is therefore preferable directed to obtaining the spatial location and sequence information of a target sequence on the surface of a sample, wherein a fourth oligonucleotide is provided at dedicated special locations on the surface of the sample.
The method of the invention can be further used to obtain the spatial location of proteins or metabolites in a sample. In this variant, the third oligonucleotide may comprise an antigen recognizing moity capable of binding proteins.
The method of the invention and its embodiments are further explained referring to the drawings.
All embodiments and variants of the method to obtain the spatial location and sequence information of a target sequence in a sample comprising at least one RNA or single stranded DNA strand can also be applied in the method for spatial single cell protein expression.
Preferably, the spatial unique molecular identifier (SUMI) comprises 2-500 bp.
In a first embodiment of the invention, the second oligonucleotide is hybridized adjacent to the first oligonucleotide, allowing direct ligation of the first and second oligonucleotide.
In a second embodiment of the invention, the second oligonucleotide is hybridized upstream to the first oligonucleotide thereby creating a gap of 2 to 100 nucleotides between the first and second oligonucleotide and obtaining the third oligonucleotide by filling the gap between the first oligonucleotide and the second oligonucleotide with nucleotides complementary to the RNA or single DNA strand.
The target sequence includes at least the nucleic acids of the hybridized the 3′ end of the first oligonucleotide and the hybridized 5′ end of the second oligonucleotide as defined in step a and b of the object of the invention for single stranded DNA and RNA targets, but may also include the sequence of the region of the oligonucleotides filling the gap after hybridization to the RNA or single stranded DNA.
In a further embodiment of the method, the fourth oligonucleotide may be generated by rolling circle amplification of a circular oligonucleotide. The fourth oligonucleotide may also comprise a sequence allowing the extended third oligonucleotide to be segmented by a restriction enzyme of chemically.
In the present invention, the extended target SUMI sequences may be amplified by a generic PCR before the collection of the extended target SUMI sequences or extended barcode tag SUMI sequences in step g) and before or after determining the spatial location of the rolonies by in situ sequencing or sequential fluorescent in situ hybridization of the SUMI.
As an alternative embodiment to the generic PCR amplification, the extended target SUMI sequence or extended barcode tag SUMI sequence may become part of a padlock probe itself. Here the 5′ end and the 3′ end of the extended target SUMI sequence or extended barcode tag SUMI sequence would bind to a complementary region 5′ and 3′ to the SUMI sequence of the SUMI rolony, thereby forming a padlock. After padlock gap fill of the SUMI sequence and ligation, the target sequence would be linked to the SUMI sequence forming a circle. The padlock probes for which the gap has been filled and ligated to form a circular template (the padlock can also be filled but ligated only further in the process) are used to code for the SUMI in the gap fill portion of the padlocks. Finally, the circularized padlock probes may be used as a template for rolling circle amplification (RCA) to generate a DNA strand used for sequencing as defined in step h.
In the present invention, the workflow can be modified to allow spatial localization by SUMI sequencing for other classes of biomolecules. Here, an oligonucleotide would be linked to the biomolecule binder. The oligonucleotide would contain a sequence (barcode-tag) coding for the binder of the biomolecule (e.G. specific antibody as binder for specific protein). After linking the SUMI sequence with the barcode-tag sequence of the SUMI rolony, followed by in situ and in vitro sequencing spatial multiomics results for the biomolecules linked to the binder will be obtained as the result of step i).
In the present invention before step d) the formation of the rolony as the fourth oligonucleotide can be initiated by an external force (e.g. light or heat) which can be linked to a potential digital pathology imaging process.
In the present invention after step d) the cells may be harvested and subjected to single cell sequencing analysis.
The current invention describes a method overcoming spatial resolution constrains from situ sequencing or sequential in situ hybridization by using SUMI rolonies as a template for spatial localization of target nucleic acids to be identified by in vitro sequencing (SUMI and target).
The method of the invention may comprise that the first sequencing step is performed after incorporating the SUMI sequence into the extended third oligonucleotide.
Preferable, the third oligonucleotide comprises a barcode-tag sequence to which an antigen recognizing moity is linked. For protein localization the third oligonucleotide may include a barcode-tag next to the first and second PCR handle. The barcode-tag is coding for the protein. The third oligonucleotide is linked to an antibody which is binding to the protein.
Accordingly, the third oligonucleotide comprises an antigen recognizing moity capable of binding proteins. Preferable, the third oligonucleotide comprises a barcode-tag sequence to which the an antigen recognizing moity is linked.
The term “antigen recognizing moiety” refers to any kind of antibody or fragmented antibody or fragmented antibody derivatives, directed against markers expressed on the cells of the cell sample. The term relates to fully intact antibodies, fragmented antibody or fragmented antibody derivatives, e.g., Fab, Fab¢, F(ab¢)2, sdAb, scFv, di-scFv, nanobodies. Such fragmented antibody derivatives may be synthesized by recombinant procedures including covalent and non-covalent conjugates containing these kind of molecules. Further examples of antigen recognizing moieties are peptide/MHC-complexes targeting TCR molecules, cell adhesion receptor molecules, receptors for costimulatory molecules, artificial engineered binding molecules, e.g., peptides or aptamers which target, e.g., cell surface molecules. Such antigen recognizing moieties antibody directed may be against antigen expressed by the biological specimens (target cells) intracellular, like IL2, FoxP3, CD154, or extracellular, like CD3, CD14, CD4, CD8, CD25, CD34, CD56, and CD133.
As shown in
In one embodiment of the invention, the generic sequences may not directly follow the SUMI sequences and may be located at variable distance from the SUMI sequence.
In another embodiment of the invention, the primer region to initiate rolling circle amplification may be identical to the generic sequences.
In a further embodiment the oligonucleotide which contains the SUMI may not be circular when provided onto the tissue section, but circularization may happen directly on the tissue.
In one embodiment of the present invention target capture may start with DNA as a target or Proximity Ligation. The respective design adjustments of the linear oligonucleotides needs to be considered.
As shown in
In a further embodiment of the invention the circular oligonucleotide which contains a Spatial Unique Molecular Identifier (SUMI) as shown in
The general steps of the invention are shown in
The subcellular location of the target sequence including identified mutations is revealed by linking the target sequence/SUMI obtained in vitro to the SUMI sequence from in situ sequencing. As hundreds of unique SUMI rolonies can spatially be resolved in one cell and each rolony provides thousands of concatemerized SUMI sequences as templates, hundreds of thousands target captured mRNA sequences can theoretically be spatially identified with subcelluar resolution inside one cell, and therefore providing a High Density (HD) method.
In the second embodiment of the invention the subcellular location of proteins and metabolites will be revealed by the sequencing workflow. The linear oligonucleotide designs as shown in
As a variant to the first embodiment a circular oligonucleotide for a hybridization-based SUMI decoding principle for the spatial localization of the rolonies is shown in
In a further embodiment, the method is limited to tissue areas of interest. Tissue areas of interest are identified by classical imaging technologies as microscopy. In order to focus the method of in situ sequencing to the areas of interest, the rolony formation shall be controlled by an external force (as light or heat). As rolonies serve as the sequencing template, without rolonies no sequencing will take place. Polymerization and initiation of rolony formation may be inhibited by blocking the polymerase or by blocking the primer. The blocking principle may be removed by an external force as light or heat which can conceptually be directed by the imaging technologies.
As a further embodiment, after SUMI decoding (e.G. by in situ sequencing) the tissue section may be digested and the individual cells are isolated. The rolony containing cells are sorted and eventually subjected to single cell sequencing. Sorting of the rolony containing cells may be accomplished by the increased nucleic acid content as a result of the rolling circle amplification or by fluorescent intensity derived from hybridization probes directed against the rolony sequence. As the SUMI sequence from in situ sequencing may also be identified by single cell sequencing, the information content from single cell sequencing may be linked to the spatial location via the SUMI derived from in situ sequencing.
In this embodiment, specific rolonies can be generated from circular oligonucleotides by using specific primers corresponding to targeted gene (
In a further embodiment, as visualized in
Samples to be analysed with the disclosed method may originate from any specimen, like whole animals, organs, tissues slices, cell aggregates, or single cells of invertebrates, (e.g., Caenorhabditis elegans, Drosophila melanogaster), vertebrates (e.g., Danio rerio, Xenopus laevis) and mammalians (e.g., Mus musculus, Homo sapiens). A biological sample may have the form of a tissues slice, cell aggregate, suspension cells, or adherent cells. The cells may be living or dead.
In yet another variant of the method according to the invention, the sample is converted into isolated cells which are then immobilized by trapping in microcavities or by adherence.
The cells of the sample may then be further subjected to single cell sequencing.
The spatial information of the fourth oligomers like rolonies i.e. their location on the sample is determined for example by an imaging step.
Imaging may be performed for example with techniques are known as “Multi Epitope Ligand Cartography”, “Chip-based Cytometry” or “Multiomics, described for example, in EP 0810428, EP1181525, EP 1136822 or EP1224472. In this technology, cells are immobilized and contacted with antibodies coupled to fluorescent moiety. The antibodies are recognized by the respective antigens on the biological specimen (for example on a cell surface) and after removing the unbound marker and exciting the fluorescent moieties, the location of the antigen is detected by the fluorescence emission of the fluorescent moieties. In certain variants, instead of antibodies coupled to fluorescent moieties, antibodies coupled to moieties detectable for MALDI-Imaging or CyTOF can be used. The person skilled in the art is aware how to modify the technique based on fluorescent moiety to work with these detection moieties. The location of the target moieties is achieved by a digital imaging device with a sufficient resolution and sensitivity in for the wavelength of the fluorescence radiation. The digital imaging device may be used with or without optical enlargement for example with a fluorescence microscope. The resulting images are stored on an appropriate storing device like a hard drive, for example in RAW, TIF, JPEG, or HDF5 format.
Number | Date | Country | Kind |
---|---|---|---|
22167058.1 | Apr 2022 | EP | regional |