The present invention is directed to a method for identifying cDNA, DNA or RNA of target cells from a cell population by single cell indexing using a color-coded composition comprising a solid particle which comprises dyes having different emission spectra as additional information.
Single cell sequencing is currently attracting the attention of scientists in different research areas who are interested in unravelling the cellular heterogeneity of tissues. In addition, an analysis on the single cell level can help to better understand the association of phenotype and function of a cell by deciphering the transcriptome of different subtypes of a given cell population.
In recent years, powerful technologies have come onto the market that allow the analysis of single cells' transcriptomes on a high-throughput scale. Many of these techniques are based on microfluidic separation of one single cell and one single bead into one droplet. In the droplet, bead-specific oligos that are bound to the beads capture the mRNA after cell lysis and become cell-specific barcodes after the reverse transcription reaction.
Some of the available technologies also allow single cells to be selectively isolated from the cell population, e.g. by laser detection of antibody staining and subsequent sorting of the marked cells.
However, so far it is not possible to selectively isolate single cells from a heterogeneous cell population on a high-throughput level and to assign the sequencing results of the single cells to one of the different cell types.
In a different field of technology, it is known to identify genetic information obtained from a single cell by conjugating the cell with a polynucleotide as barcode. These methods involve synthesizing a library comprising this barcode which can be sequenced in order to identify a single cell.
The biology and the necessary hardware to isolate cells are, for example, disclosed in U.S. Pat. Nos. 9,388,456 or 9,695,468. However, this technology is focused on single, isolated cells rather than cells in the context of a complex mixture of cells.
It was therefore an object of the invention to provide a conjugate which enables the identification of DNA and RNA molecules of single cells, combined with their phenotype as detectable by binders.
First object of the invention is therefore a method for identifying nucleic acids of a target cell from a cell population comprising
The target cells/the cell population and the color-coded compositions are provided as a mixture of various subpopulations. The isolation step is conducted by choosing at best one target cell and one color-coded composition according to pre-selected properties into one compartment.
To this end, the pre-selected physical properties of the target cell and the color-coded composition (bead) are used to select and isolate a certain cell subpopulation together with a certain bead population. For example, if the pre-selected physical property of the target cell is the presence of a CD4 marker and the pre-selected physical property of the color-coded composition (bead) is blue color, all cells and beads having these properties will be sorted into compartments in a 1:1 ratio. Cells having, for example, CD5 markers and red beads will not be selected/isolated. It is of course possible to pre-select a plurality of physical properties for both cells and beads as long as a 1:1 relation of the selected properties is maintained. For example, it is possible to pre-select 5 different physical properties, which enables sorting 5 pairs of cells and beads into compartments in a preferable 1:1 ratio.
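The pairing rule described above can be sketched in Python as follows. This is an illustration only: the names `Cell`, `Bead`, `PAIRING` and `sort_into_compartments`, as well as the first-come pairing order, are hypothetical and are not part of the disclosed method or of any instrument software.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    cell_id: str
    marker: str  # pre-selected property of the cell, e.g. "CD4"


@dataclass(frozen=True)
class Bead:
    bead_id: str
    color: str  # pre-selected property of the bead, e.g. "blue"


# Hypothetical 1:1 relation between selected cell and bead properties.
PAIRING = {"CD4": "blue"}


def sort_into_compartments(cells, beads, pairing=PAIRING):
    """Pair each selected cell with one bead of the matching color.

    Cells and beads whose property is not part of `pairing` are skipped.
    Every returned compartment holds exactly one cell and one bead.
    """
    beads_by_color = {}
    for bead in beads:
        if bead.color in pairing.values():
            beads_by_color.setdefault(bead.color, deque()).append(bead)

    compartments = []
    for cell in cells:
        color = pairing.get(cell.marker)
        if color is None:
            continue  # e.g. CD5 cells are not selected
        queue = beads_by_color.get(color)
        if not queue:
            continue  # no matching bead left; the cell stays unpaired
        compartments.append((cell, queue.popleft()))
    return compartments


cells = [Cell("c1", "CD4"), Cell("c2", "CD5"), Cell("c3", "CD4")]
beads = [Bead("b1", "red"), Bead("b2", "blue"), Bead("b3", "blue")]
compartments = sort_into_compartments(cells, beads)
```

Extending `PAIRING` with further entries (for example five marker/color pairs) would correspond to pre-selecting a plurality of properties while keeping the 1:1 relation.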
The pre-selected physical property of the target cells may be selected from the group consisting of shape, size, granularity, organelle composition, ion composition, sugar composition, lipid composition and protein composition.
If the protein composition is used as pre-selected physical property, at least one intracellular or extracellular protein is marked by fluorescence staining. The term protein composition refers to protein expression and post-translational modification.
The pre-selected physical properties of the color-coded composition are defined by the solid particle and may be selected from the group consisting of size, granularity, charge, magnetic moment, one or more colors and one or more intensities of at least one color.
The target moiety to be detected with the method of the invention can be on any biological specimen, like tissue slices, cell aggregates, suspension cells, or adherent cells.
The color-coded compositions comprise a solid particle conjugated to an oligonucleotide.
In a first variant of the method of the invention, the color-coded composition has a composition according to one of the general formulas (Ia) or (Ib)
X-(P-C-B-BR)n (Ia)
X-(P-B-C-BR)n (Ib)
wherein
X is a solid particle,
In another variant of the method, the color-coded composition has a composition according to one of the general formulas (IIa), (IIb), (IIc), (IId), (IIe) or (IIf)
X-(P-C-B-U-BR)n (IIa)
X-(P-C-U-B-BR)n (IIb)
X-(P-B-C-U-BR)n (IIc)
X-(P-B-U-C-BR)n (IId)
X-(P-U-B-C-BR)n (IIe)
X-(P-U-C-B-BR)n (IIf)
wherein
In the present application, oligonucleotides C (color specific barcode), B (bead specific barcode) and U (unique molecular identifier) are referred to as “barcode” since they allow identifying a single target by their unique sequence.
The moieties C, B and U carrying a barcode may comprise the same or different oligonucleotide sequences with the respective, disclosed number of nucleotide residues. As nucleotide residues, the naturally occurring cytosine (C), adenine (A), guanine (G) and thymine (T) are preferred. By randomly polymerizing these units, a library of oligonucleotides with different sequences can be obtained. For example, a library of randomly generated oligonucleotides comprising 10 nucleotide residues will have 4^10 = 1,048,576 members.
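The library size follows from four possible residues at each of ten positions. A minimal Python sketch of this arithmetic and of drawing one random member is given below; the helper names `library_size` and `random_barcode` are hypothetical, and the pseudo-random generator merely stands in for random polymerization.

```python
import random

ALPHABET = "ACGT"  # the four preferred nucleotide residues


def library_size(length: int) -> int:
    """Number of distinct sequences of the given length over A, C, G, T."""
    return len(ALPHABET) ** length


def random_barcode(length: int, rng: random.Random) -> str:
    """Draw one sequence uniformly at random (illustrative stand-in only)."""
    return "".join(rng.choice(ALPHABET) for _ in range(length))


size = library_size(10)  # 4 ** 10
barcode = random_barcode(10, random.Random(0))
```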
The oligonucleotide sequences P (PCRhandle), C (color specific barcode), B (bead specific barcode), U (unique molecular identifier) and BR (binding region) are bound to each other either directly or via further oligonucleotide units as spacer units. The spacer units may be the same or different oligonucleotides, each comprising 0 to 30 nucleotide residues. Preferably, the spacer units are non-specific oligonucleotides.
In a preferred embodiment, one or more spacer units comprise 0 (zero) nucleotide residues, i.e. the oligonucleotides P (PCRhandle), C (color specific barcode), B (bead specific barcode), U (unique molecular identifier) and BR (binding region) are directly bound to each other.
Oligonucleotide P (PCRhandle) may comprise 4 to 30 nucleotide residues and serves as a binding region for primers in subsequent amplification reactions.
Oligonucleotide C (color specific barcode) may comprise 1 to 8 nucleotide residues and allows the identification of the cell or cell type.
Oligonucleotide B (bead specific barcode) may comprise 8 to 30 nucleotide residues and serves as a cell specific barcode, allowing sequencing information to be assigned to the cell of origin.
Oligonucleotide U (unique molecular identifier) may comprise 5 to 15 nucleotide residues and serves as an identifier for each single nucleic acid molecule in the target cell.
Oligonucleotide BR (binding region) may comprise 3 to 30 nucleotide residues and serves as a binding region for nucleic acid molecules of interest of the target cell.
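The segment layout and the stated residue ranges can be illustrated with a short Python sketch for the order P-C-B-U-BR with zero-length spacers. The functions `assemble` and `split`, the example sequences and the poly-T binding region are assumptions made for illustration; they are not taken from the disclosure.

```python
# Allowed residue counts per segment, as stated in the description.
LENGTH_RANGES = {"P": (4, 30), "C": (1, 8), "B": (8, 30), "U": (5, 15), "BR": (3, 30)}
ORDER = ("P", "C", "B", "U", "BR")  # segment order P-C-B-U-BR, no spacers


def assemble(segments, order=ORDER):
    """Validate each segment and concatenate them in the given order."""
    for name in order:
        seq = segments[name]
        low, high = LENGTH_RANGES[name]
        if not low <= len(seq) <= high:
            raise ValueError(f"{name} must have {low} to {high} residues")
        if set(seq) - set("ACGT"):
            raise ValueError(f"{name} contains residues other than A, C, G, T")
    return "".join(segments[name] for name in order)


def split(oligo, lengths, order=ORDER):
    """Cut an oligonucleotide back into segments of known lengths."""
    if len(oligo) != sum(lengths[name] for name in order):
        raise ValueError("oligonucleotide length does not match segment lengths")
    parts, pos = {}, 0
    for name in order:
        parts[name] = oligo[pos:pos + lengths[name]]
        pos += lengths[name]
    return parts


# Hypothetical example sequences; the poly-T binding region is an assumption.
segments = {
    "P": "ACGTACGTAC",
    "C": "AC",
    "B": "GATTACAGATTA",
    "U": "CCGGTTAA",
    "BR": "T" * 18,
}
oligo = assemble(segments)
recovered = split(oligo, {name: len(seq) for name, seq in segments.items()})
```

Passing a different `order` tuple, such as `("P", "B", "C", "U", "BR")`, would model another of the listed segment orders in the same way.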
The technique to produce oligonucleotides and libraries thereof is well known to a person skilled in the art, as well as the technologies to amplify isolated oligonucleotides to obtain larger amounts thereof. U.S. Pat. No. 9,388,465 summarizes these technologies.
The term “solid particle” refers to any material which is not or not readily soluble in aqueous systems usually used for cell handling. The term does not necessarily refer to a certain hardness or a composition/material.
Solid particles X as used in the present invention may be manufactured from any material as long as the solubility in aqueous systems is so low that the particle remains observable or detectable during the method of the invention. For example, solid particle X may comprise polystyrene or polydextran, both optionally chemically modified with reactive groups to bind dyes or oligonucleotides as spacer units or the PCRhandle P. Suitable reactive groups are for example amino or carboxylic groups.
Solid particles useful for the present invention may be prepared with methods known to the skilled person or as described in the literature. For example, they can be prepared by incorporating dyes into pre-formed polymer beads by swelling the particles in organic solvent mixtures containing dyes, either at room temperature (U.S. Pat. No. 6,514,295 B1) or at elevated temperatures (U.S. Pat. No. 7,507,588 B2). A further method involves shifting of phase equilibria due to water addition to force hydrophobic dyes into the polymer phase (U.S. Pat. No. 6,964,747 B2). Solid particles X can also be prepared by polymerization of monomer mixtures including dye labeled monomers (J. Am. Chem. Soc. 2004, 126, 21, 6562-6563) or by physical entrapment of hydrophobic dyes during particle formation by polymerization (U.S. Pat. No. 5,073,498).
In another embodiment of the invention, solid particles may comprise multiple (like 5-50) subunits linked via magnetic force, electrostatic interaction or chemical linkage, which can be covalent or non-covalent. These subunits may be released from each other upon droplet formation, e.g. by chemical or enzymatically induced cleavage.
The size of the solid particle is of minor importance and may be between 1 and 200 μm.
Preferably, the solid particles X comprise at least two dyes having different emission spectra with a difference in emission maxima of at least 10 nm, more preferably at least two dyes having different emission spectra with a difference in emission maxima of at least 20 nm. While an increasing number of different dyes improves the quality and amount of information, in practical use, 2 to 10 different dyes are sufficient. One dye is sufficient if only the concentration is used as selection criterion.
The concentration and the difference in emission maxima of the dyes are preferably selected in a way that discrimination of at least 30, preferably at least 50 different solid particles is possible.
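How dye number and concentration combine into distinguishable particles can be estimated with a small Python sketch. It rests on a simplifying assumption that is not stated in the description: each dye can be loaded at a fixed number of resolvable intensity levels, independently of the other dyes. The emission maxima used here and the names `min_separation_nm` and `enumerate_codes` are hypothetical.

```python
from itertools import combinations, product


def min_separation_nm(emission_maxima_nm):
    """Smallest pairwise difference between emission maxima, in nm."""
    return min(abs(a - b) for a, b in combinations(emission_maxima_nm, 2))


def enumerate_codes(n_dyes, levels):
    """All color codes when every dye takes one of `levels` intensities.

    Assumes the intensity levels are resolvable and independent per dye.
    """
    return list(product(range(1, levels + 1), repeat=n_dyes))


emission_maxima_nm = [520, 580]  # hypothetical values for two dyes
separation = min_separation_nm(emission_maxima_nm)
codes = enumerate_codes(n_dyes=len(emission_maxima_nm), levels=6)
meets_minimum = separation >= 10 and len(codes) >= 30
```

Under this assumption, two dyes at six levels each give 36 codes, and eight levels give 64, so the thresholds of 30 and 50 distinguishable particles would be reachable with two dyes.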
Useful dyes are for example protein-based, such as phycobiliproteins, polymeric, such as polyfluorenes, small organic molecule dyes, such as xanthenes, like fluorescein, or rhodamines, cyanines, oxazines, coumarins, acridines, oxadiazoles, pyrenes, pyrromethenes, or metallo-organic complexes, such as Ru, Eu, Pt complexes. Besides single molecule entities, clusters of fluorescent proteins or small organic molecule dyes, as well as nanoparticles, such as quantum dots, upconverting nanoparticles, gold nanoparticles, dyed polymer nanoparticles can also be used as fluorescent moieties.
In a further embodiment of the method, the nucleic acid of a target cell to be identified is single-stranded, the complementary strand of the nucleic acid molecule is obtained and coupled to the BR units of the color-coded composition, thereby forming a second conjugate, and the second conjugate is sequenced, thereby identifying the target cell.
Single-stranded nucleic acids are for example RNA, denatured DNA or nucleic acid molecules attached to the target cells during the sample preparation procedure. One example of the latter is antibody-oligonucleotide conjugates which are used to label the target cells.
The term “determining the sequence of the first/second conjugate” relates to any method known in the art of nucleic acid sequencing and may comprise amplification steps and/or generating a library. In any case, the sequence of the conjugate is obtained, thereby identifying the target cell.
The necessary techniques for coupling strands of nucleic acids with the BR units of the color-coded composition and subsequent sequencing are not of particular relevance for the invention and are known to the person skilled in the art.
In a variant of the methods according to the invention, the at least one target cell is isolated from the cell population with at least one color-coded composition into one compartment by placing the at least one target cell and the at least one color-coded composition into one aqueous droplet surrounded by a fluid immiscible with water.
It is further possible that the target cells belonging to the same cell type or phenotype or cells binding to the same antibodies/analyte are provided with color-coded compositions having the same solid particle X.
In the methods of the invention, the color-coded compositions may be provided with at least 2 different solid particles X in order to provide at least 2 different cell types with color-coded compositions having different solid particles X.
The cell type of the target cells may be identified by sequencing C (color specific barcode) of the conjugate. In a further variant, prior to isolating at least one target cell from the cell population with at least one color-coded composition according to the invention into one compartment, the cell type of the target cell is determined by fluorescence staining.
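The roles of the three barcodes in evaluating sequencing results can be sketched in Python as follows: C is mapped to a cell type, B groups reads by cell of origin, and U collapses duplicate reads of the same molecule. The mapping `COLOR_BARCODE_TO_CELL_TYPE`, the function `count_molecules`, the read tuples and the gene names are hypothetical illustrations and not data from the disclosure.

```python
from collections import defaultdict

# Hypothetical assignment of color specific barcodes C to cell types.
COLOR_BARCODE_TO_CELL_TYPE = {"AC": "CD4 T cell", "GT": "CD8 T cell"}


def count_molecules(reads):
    """Count unique molecules per gene for every (cell type, cell) pair.

    `reads` is an iterable of (C, B, U, gene) tuples. Reads sharing C, B,
    U and gene are treated as copies of one molecule and counted once.
    Reads with an unknown C barcode are ignored.
    """
    umis = defaultdict(set)
    for c, b, u, gene in reads:
        cell_type = COLOR_BARCODE_TO_CELL_TYPE.get(c)
        if cell_type is None:
            continue
        umis[(cell_type, b, gene)].add(u)

    counts = defaultdict(dict)
    for (cell_type, b, gene), seen in umis.items():
        counts[(cell_type, b)][gene] = len(seen)
    return dict(counts)


reads = [
    ("AC", "GATTACAGATTA", "AAAAACCC", "GAPDH"),
    ("AC", "GATTACAGATTA", "AAAAACCC", "GAPDH"),  # amplification duplicate
    ("AC", "GATTACAGATTA", "AAAAAGGG", "GAPDH"),
    ("GT", "TTGGCCAATTGG", "CCCCCAAA", "CD8A"),
    ("TT", "CCAATTGGCCAA", "GGGGGTTT", "GAPDH"),  # unknown C, ignored
]
counts = count_molecules(reads)
```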
The method of the invention can be used for various applications in research, diagnostics and cell therapy. The method of the invention is especially useful for identifying nucleic acids of a target cell from a cell population. Analytes may be used to identify and measure biomarkers or therapeutic targets.
The following are hypothetical processes according to the method of the invention. The process step of isolating at least one target cell from the cell population with at least one color-coded composition according to the invention into one compartment shall be performed on a MACSQuant Tyto machine (obtainable from Miltenyi Biotec B.V. & Co. KG, in the following referred to as “Tyto”) equipped with a MEMS valve positioned in a disposable cartridge capable of placing at least one target cell and at least one color-coded composition into one aqueous droplet surrounded by a fluid immiscible with water. Such valves/cartridges are described in PCT/US 19/27577.
Case 1: Analysing Gene Expression in Different Subsets of Immune Cells from Blood by Single-Cell-Sequencing
Process Steps:
Case 2: Analyze the T Cell Receptor (TCR) Repertoire of Different TIL Subpopulations by Single-Cell-Sequencing
Process Steps:
Number | Date | Country | Kind |
---|---|---|---|
19209395.3 | Nov 2019 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/081851 | 11/12/2020 | WO |