The application contains a Sequence Listing which has been submitted electronically in .XML format and is hereby incorporated by reference in its entirety. Said .XML copy, created on Jan. 23, 2023, is named “4006-22178.xml” and is 14,845 bytes in size. The sequence listing contained in this .XML file is part of the specification and is hereby incorporated by reference herein in its entirety.
The present invention relates to a method for labeling cells with barcodes, and a kit for use in the method. The present invention also relates to a method for multiplexed analysis of cell samples.
Comprehensive gene expression analysis is an essential technology for analyzing properties of cells and tissues. Comparing gene expression profiles in different cells and tissues allows to reveal the diversity of cell types present, properties of various cells in different tissues, and their ratios in populations. In the conventional bulk analysis, numerous cells are put together to extract mRNAs from mixed samples and to be subjected to expression analysis, which means seeing the average of the gene expression in their cells. However, biological tissues are composed of complex and diverse cell populations. Therefore, it is difficult to grasp the cell conditions in the tissue and accurately understand the groups of expressed genes and their functions by the bulk analysis which uses the whole tissue. In addition, even cultured cell lines which are considered to be homogeneous can strictly be heterogeneous cell populations. Therefore, the bulk RNA-seq analysis is unable to accurately grasp the changes in gene expression that fluctuate according to, for example, the cell cycle or other modalities.
To overcome such a problem, the SINGLE CELL RNA-seq analysis (scRNA-seq analysis) is a very useful technique (NPL 1). The scRNA-seq includes capturing SINGLE CELLS (“SINGLE CELL” herein is used as a “cell group containing a plurality of single cells, or a single cell”, unless otherwise specified), and creating an RNA-seq library from each cell for transcriptome analysis. Nevertheless, to use a large number of single cells as samples requires a huge amount of labor and cost, and the scRNA-seq was not an easily applicable technique until recently. However, techniques such as the droplet scRNA-seq in which a microfluidic/microdroplet reaction system is used, or an scRNA-seq in which a microwell is used have currently been developed, and equipment for these techniques has been developed and commercially available (e.g., BD Rhapsody system (NPL 2), and 10× chromium (NPL 3)). Progression of techniques in such a SINGLE CELL analysis has allowed to prepare a scRNA-seq library from 10,000 cells or more, leading to significantly reduce the analyzing cost per cell (NPL 4).
In spite of the fact that the cost per cell can be reduced, the analyzing cost as a whole is still high when analyzing a plurality of samples or performing time series analysis at a plurality of points of time. In addition, when performing the scRNA-seq experiment on a plurality of samples at different points of time and comparing them, analysis errors in each experiment batch (i.e., batch effects) may occur (NPL 5). Therefore, if a plurality of samples can be combined together and simultaneously processed in one library preparation, not only the analysis cost is significantly reduced, but also the batch effects can be lowered to perform a more precise scRNA-seq analysis. A technique for combining a plurality of samples together and analyzing them in a single experimental run is referred to as “sample multiplexing”.
In sample multiplexing methods, a plurality of cell populations are labeled in advance with barcoded DNA tags having corresponding respective independent sequences, and then mixed together to perform SINGLE CELL analysis. From the RNA-seq data thus obtained, the origin of each cell can be traced based on the barcoded DNA sequences. The droplet scRNA-seq analysis or scRNA-seq analysis using microwells can cause a plurality of cells to be captured in one compartment, and such a multiple cell contamination error makes data interpretation difficult. Nevertheless, if different samples can be distinguished by different DNA tags, errors caused by multiplets can be reduced.
Regarding a multiplexing method using this principle, for example, a method of using an antibody for cell labeling bound to a barcoded oligo DNA as an antibody that recognizes a specific cell surface protein has been reported (NPL 6). However, an issue on this cell hashing method is on the assumption that the cell surface protein is expressed in all cells in the samples to be analyzed. Cell hashtag antibodies are antibody cocktails to be used for the purpose of comprehensively labeling cells originating from adult tissues (e.g., TotalSeq® anti-mouse hashtag antibody, BioLegend, Inc.). In a case of hashtags, antibodies to CD45 and MHC Class I are used (the former is for detecting cells of the immune system, and the latter is generally for detecting differentiated cells). However, for example, pluripotent stem cells, which are often used as an embryogenesis model, do not express CD45 or MHC Class I, and therefore the above method cannot be applicable (NPLs 7 and 8). Also, the above method has been found to be not applicable to some non-embryonic cells such as chondrocytes.
In addition to that, as a labeling technique for multiplexing, a transient barcoding method in which barcoded DNA is introduced into a cell (NPL 11), a CellTag indexing method in which lentivirus is used for introduction of barcoded DNA introduction (NPL 12), and the like have been developed. However, these methods are difficult to be applied to cells other than cultured cells, and therefore, the versatility is limited. Further, the ClickTag multiplexing method in which click chemistry and NHS-ester chemistry are combined (NPL 13) is a technique for attaching a barcoded DNA tag to a cell protein; however, the labeling efficiency of viable cells is poor, and this method can only be applied to methanol fixed cells. The CellPlex method (3′ CellPlex Kit, 10× Genomics) uses a reagent that introduces a DNA tag into lipid in the cell membrane for universal cell labeling (NPL 14), but it is difficult to be applied to methanol fixed cells.
PTL 1: WO 2020/010366 A1
The present inventors have made intensive studies to solve the above problems, and as a result, have developed a novel cell labeling technique for multiplexing scRNA-seq analysis that does not depend on specific cell surface antigens, and therefore have conceived the present invention. The method of the present invention has characteristics, one of which is including a step of directly or indirectly bringing a cell surface protein of a cell into contact with a modified barcode. In one aspect, the present invention is a universal surface biotinylation (USB) method. In another aspect, the method of the present invention is a method for directly binding a barcode to a cell surface protein of a cell (single-step method).
The present invention includes, but not limited to, the following aspects.
[1]
A method for labeling a cell with a barcode, including:
A method for labeling a cell with a barcode, including:
The method according to [2], wherein the method including: (ia) for the cell group containing a plurality of single cells, biotinylating the cell surface protein;
The method according to [2] or [3], including biotinylating an amino group, a sulfhydryl group, or a carboxyl group of the cell surface protein in (i).
[5]
The method according to any one of [2] to [4], in which a reagent used for biotinylation in (i) is selected from the group consisting of sulfo-N-hydroxysuccinimide-biotin, N-hydroxysuccinimide-biotin, and pentafluorophenyl-biotin,
The method according to [5], in which the reagent used for biotinylation in (i) has been cryopreserved.
[7]
The method according to any one of [2] to [6], in which the biotin binding substance is selected from the group consisting of streptavidin, avidin, and anti-biotin antibody.
[8]
The method according to [7], in which the biotin binding substance is streptavidin.
[9]
The method according to any one of [1] to [8], in which the barcode is oligo DNA.
[10]
The method according to any one of [1] to [9], wherein the method is for labeling a viable cell.
[11]
The method according to any one of [1] to [9], wherein the method is for labeling a fixed cell.
[12]
A kit for use in a method for labeling a cell with a barcode, including a reagent for biotinylating a cell surface protein of the cell, and
The kit according to [12], in which the reagent for biotinylating the cell surface protein of the cell is sulfo-N-hydroxysuccinimide-biotin.
[14]
A method for multiplexed analysis of a cell sample, including:
The method according to [14], including: (ia) for the cell group containing a plurality of single cells, biotinylating the cell surface protein;
The method according to [15], in which the analysis is cellular RNA analysis.
[17]
A method for labeling a cell with a barcode, including:
The method according to [17], in which the compound that binds to the amino group of the cell surface protein of the cell is selected from the group consisting of sulfo-N-hydroxysuccinimide, N-hydroxysuccinimide, and pentafluorophenyl.
[19]
The method according to [18], in which the compound that binds to the amino group of the cell surface protein of the cell has been cryopreserved.
[20]
The method according to any one of [17] to [19], in which the barcode is oligo DNA.
[21]
The method according to [17] to [20], wherein the method is for labeling a viable cell.
[22]
The method according to any one of [17] to [21], wherein the method is for labeling a fixed cell.
[23]
A kit for use in a method for labeling a cell with a barcode, including a barcode modified with a compound that binds to an amino group of a cell surface protein of the cell.
[24]
The kit according to [23], in which the compound that binds to the amino group of the cell surface protein of the cell is sulfo-N-hydroxysuccinimide.
[25]
A method for multiplexed analysis of a cell sample, including:
The method according to [25], in which the analysis is cellular RNA analysis.
Since the method for labeling a cell with a barcode of the present invention targets a cell surface protein of the cell in both the USB method and the single-step method, it is possible to label cells in any state as long as the cell surface protein exists. For example, it does not depend on the expression of cadherin 1 (CDH1), and therefore both undifferentiated cells and differentiated cells can be targeted. In addition, the type of cells to be labeled is not limited, and the method can be applied to all cell types. Further, it can also be applied to cells fixed with methanol or the like.
In the USB method, biotinylation is not dependent on specific cell surface proteins and can be carried out on any cell surface proteins. The method of the present invention, which uses biotinylation on cell surface proteins, is a universal labeling technique capable of comprehensively labeling cells. The technique of the present invention (hereinafter, referred to as “USB method”) is highly versatile and applicable to any type of cell.
The method of the present invention enables labeling of all cell types by using biotin that can be introduced into any cell surface protein. The USB method does not give any adverse effect on the growth or colony formation ability of the tested cells. Further, it has been revealed in an experiment using the differentiation process of mouse embryonic stem(ES) cells as a model that when sample multiplexing and applying the USB method to the SINGLE CELL analysis, various differentiated cells are successfully captured without adverse effects on detection ability for genes or transcriptome acquisition on each cell. The USB method can be used for many cell types, including human iPS-derived chondrocytes and rat lung-derived cells, to which conventional cell hashing methods have not been successfully applied so far. The USB method can theoretically be applied to all cell types, and is convenient, highly efficient and cost effective method for multiplexing scRNA-seq.
The single-ste+p method includes a step of bringing into contact with a barcode modified with a compound that binds to an amino group of a cell surface protein of the cell. Since the single-step method also targets a cell surface protein of the cell as in the USB method, it is possible to label all cell types. The single-step method does not give any adverse effect on the growth or colony formation ability of the tested cells. Further, it has been confirmed in Examples herein that when applying the single-step method to the SINGLE CELL analysis, various differentiated cells are successfully captured without adverse effects on detection ability for genes or transcriptome acquisition on each cell.
The present invention can be used for disease analysis, such as performing scRNA-seq with cancer tissues. The present invention can also be used for confirming whether iPS cells have been differentiated into cell types of interest, in transplantation therapies with iPS cells used. The present invention can further be used to verify the effects of agents on patient-derived iPS cells, when screening candidates of therapeutic drugs for genetic diseases.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
The present invention non-limitingly includes the following aspects. Unless otherwise stated herein, technical and scientific terms used herein have the same meanings as commonly understood by those skilled in the art. The substances, materials and examples disclosed herein are merely exemplary, and not intended to be limiting. When mentioning “in an aspect” herein, it means it is not limited to the aspect, in other words, is not limiting.
In an aspect, the present invention relates to a method for labeling a cell with a barcode. Non-limitingly, the method of the present invention includes:
The method of the present invention has characteristics, one of which is directly or indirectly bringing a cell surface protein of a cell into contact with a modified barcode.
In one aspect, the present invention is a universal surface biotinylation (USB) method. In the USB method, cell surface proteins of a cell are indirectly (via biotin-biotin binding substance) brought into contact with a barcode. In the USB method, the cell surface proteins are biotinylated, and the barcode is modified with the biotin binding substance. In the USB method, the “modified barcode” means a barcode to which a biotin binding substance is added, in other words, a barcoded biotin binding substance.
In another aspect, the method of the present invention is a method for directly binding a barcode to cell surface proteins of a cell (single-step method (or direct method)). In the single-step method, cell surface proteins of a cell are directly brought into contact with a modified barcode. In the single-step method, a barcode is modified with a compound that binds to an amino group of a cell surface protein of the cell. In the single-step method, the “modified barcode” means a barcode modified with a compound that binds to an amino group of a cell surface protein of the cell.
1. Method for Labeling Cell with Barcode (1) (USB Method)
In an aspect, the present invention relates to a method for labeling a cell with a barcode. Non-limitingly, the method includes:
The method includes biotinylating cell surface proteins present on a cell surface. Biotinylation is applicable to any cell surface protein and capable of comprehensively labeling cells without dependent on cell type. In other words, since labeling does not depend on a specific cell surface protein, the cell type to be the subject of the method is not limited. All cells, including animal cells, plant cells, and cultured cells are to be the subject.
First, the method includes biotinylating cell surface proteins of the cell.
Biotinylation of proteins may be performed on a cell group containing a plurality of single cells or a single cell; alternatively, cell surface proteins of a plurality of cells may be biotinylated followed by separating the biotinylated cells into cell groups containing a plurality of single cells, or into single cells. (ii) Before the step of bringing into contact with the barcoded biotin binding substance, the cells are required to be separated into cell groups containing a plurality of single cells, or into single cells. Preferably, biotinylation of proteins is performed on the cell group containing a plurality of single cells or on a single cell.
The “cell group containing a plurality of single cells” are, for example, cell groups of mass of single cells which properties or origin are common, such as a cell group consisting of a plurality of single cells having common properties, and a cell group consisting of a plurality of single cells obtained from the common organ.
In an aspect, the method includes:
As the method for separating the mass of cells containing a plurality of cells (e.g., organ and the like) into cell groups containing a plurality of single cells, or into single cells, any known method can be used without particular limitation. Depending on the mass of the type, the cell type contained in the mass of cells, the cell group containing a plurality of single cells to be obtained (to be labeled) and to be separated, or the type of single cell, those skilled in the art can use a known method. The same applies to the case of separating after biotinylation.
Biotin is a water-soluble vitamin classified as vitamin B, and also referred to as vitamin B7. Biotinylation means the process of binding biotin to proteins or macromolecules.
Examples of biotinylation include a method of chemically binding an amino group, a sulfhydryl group, a carboxyl group, or an aldehyde group (oxidation of sugar chain) to a functional group on a protein, using a biotin labeling reagent. Many types of biotin labeling reagents are known, and those skilled in the art can select an adequate biotin labeling reagent based on the functional group to be used, the hydrophilicity of the labeling reagent, presence or absence of the cleavage after labeling reaction, or the like.
Amino groups (mainly, primary amines: —NH2) are present at a side chain or an amino terminus of lysine residues contained in many proteins. Since, in particular, the ϵ-amino group of a lysine residue is highly reactive, and in many cases, lysine residues are present on the surface of protein tertiary structure, the amino groups are functional groups that are particularly easy to be used for biotinylation. The representative of amino group-reactive biotin labeling reagents is biotin with an N-hydroxysuccinimide (abbreviation: NHS)-ester. NHS-esters react with amino groups to form amide bonds. NHS-ester biotin includes sulfo-NHS-ester biotin that has a sulfonic acid group, and NHS-ester biotin that does not have a sulfonic acid group. The former includes, for example, sulfo-N-hydroxysuccinimide-biotin, and the latter includes, for example, N-hydroxysuccinimide-biotin.
In addition, those with a long spacer between biotin and NHS-ester can also be used. The length of a spacer is non-limitingly 0 to 300 Å, and preferably 10 to 50 Å. Alternatively, those with polyethylene glycol (PEG) introduced in a spacer part (e.g., NHS-PEG4-biotin) can also be used. Alternatively, those in which biotin can appropriately be cut off can also be used. In some cases, the bond between biotin and a labeled protein can be cleaved by a spacer.
Additionally, pentafluorophenyl-biotin, biotinylated isothiocyanate, and the like can be used for biotinylation of amino groups. Pentafluorophenyl-biotin and biotinylated isothiocyanate can bind to both primary amines and secondary amines. Sulfhydryl groups (—SH) of cysteine residues can also be used for biotinylation. In particular, when an amino group is present at an active site of a protein, and there is a concern that the protein is inactivated due to labeling of the amino group, or the like, the sulfhydryl group may be biotinylated. However, when performing labeling reaction, the sulfhydryl group needs to be in a reduced state, in other words, it needs not to form a disulfide bond (S-S bond) by oxidation. When any sulfhydryl group in a reduced state is not present, the disulfide bond is cleaved by a reducing agent. When the cell surface protein does not contain cysteine residues, lysine residues may be modified to generate a sulfhydryl group for use. As the biotin labeling reagent that is reactive to a sulfhydryl group, for example, biotin with a maleimide group, biotin with a bromoacetamide group, or the like can be used.
A carboxyl group (—COOH) is present at a carboxyl terminus, as well as at a side chain of aspartic acid or glutamic acid. When targeting a carboxyl group, biotin with an amino group (—NH2) or hydrazide-derivatized biotin is reacted via a crosslinker with a carbodiimide group to form an amide bond. As the crosslinker with a carbodiimide group, for example, 1-ethyl-3-[3-dimethylaminopropyl] carbodiimide hydrochloride (EDC) or the like can be used.
Cis-diol of sialic acid on a sugar chain generates an aldehyde group (—CHO) by mild oxidative cleavage using sodium periodate (NaIO4), for example. The aldehyde group may be reacted with hydrazide to form hydrazone bond. Mildly oxidizing the glycoprotein to generate an aldehyde group can lead hydrazide-derivatized biotin to be labeled.
Alternatively, using photoreactive biotin compounds, which non-specifically react upon exposure ultraviolet (UV) light, can also lead to non-specific biotinylation. An example thereof is use of a photoreactive biotin labeling reagent having an arylazide group. Arylazide groups are activated upon ultraviolet light irradiation, and an aryl nitrene (half-life is 10−4 seconds) non-specifically reacts with a high electron density region of double bonds or hydrogen bonds present in the vicinity, or with an amino group or a sulfhydryl group. The use of photoreactive biotin compounds is useful in a case where a non-specific labeling is required, or the like.
In an aspect, an amino group, a sulfhydryl group, or a carboxyl group of a cell surface protein is biotinylated in (i).
In an aspect, an amino group of a cell surface protein is biotinylate. In an aspect, a reagent used for biotinylation in (i) (biotin labeling reagent, reagent for biotinylating a cell surface protein of a cell) is selected from the group consisting of sulfo-N-hydroxysuccinimide-biotin, N-hydroxysuccinimide-biotin, and pentafluorophenyl-biotin.
The binding of biotin with sulfo-N-hydroxysuccinimide, N-hydroxysuccinimide, or pentafluorophenyl can include a spacer. The length of a spacer is non-limitingly 0 to 300 Å, and preferably 10 to 50 Å.
In an aspect,
In an aspect, the concentration of the reagent used for biotinylation can be appropriately applied according to the cell type used. The concentration of S-NHS-biotin in Examples was 10 μg/mL; however, it can be set to, for example, 1 to 50 μg/mL according to the cell type used.
In an aspect of the method, the reagent used for biotinylation in (i) may be those that has been cryopreserved. For example, as sulfo-N-hydroxysuccinimide-biotin, those pre-adjusted and cryopreserved can be used at an appropriate time.
The method above biotinylates cell surface proteins by step (i), followed by (ii) bringing each cell group or each single cell into contact with a barcoded biotin binding substance.
The “biotin binding substance” is not particularly limited as long as it has biotin binding properties. Preferred is a substance that less affects the cell growth and survival, or does not affect the cell growth and survival, due to the binding to cell surface proteins via biotin.
Examples of the biotin binding substance is non-limitingly include streptavidin, avidin, and anti-biotin antibody. In an aspect, the biotin binding substance is selected from the group consisting of streptavidin, avidin, and anti-biotin antibody. In an aspect, the biotin binding substance is streptavidin.
The “barcode” is a substance that includes information for identifying each cell group or each single cell. The type of the barcode is not particularly limited, as long as the barcode is a substance that can be added to a biotin binding substance, and includes information for identifying each cell group or each single cell. Examples of the barcode non-limitingly includes oligo DNA and oligo RNA. In an aspect, the barcode is oligo DNA.
The “barcoded biotin binding substance” is a substance in which a barcode (e.g., oligo DNA) is added to a biotin binding substance (e.g., streptavidin). To the barcoded biotin binding substance, labeling for detection can further be added. For example, by adding a fluorescent substance to the barcoded biotin binding substance, labeled cells can be analyzed by flow cytometry. Alternatively, by adding an epitope containing a specific protein or the like to the barcoded biotin binding substance and detecting with fluorescence or the like using an antibody that binds to the epitope or the like, immunofluorescence analysis can be performed.
The method of bringing each cell group or each single cell into contact with the barcoded biotin binding substance is not particularly limited. For example, it is performed by suspending each cell group or each single cell to which biotin is added, in a solution containing the barcoded biotin binding substance.
In an aspect of the method above, cells to be labeled may be viable cells ex vivo or in vivo. Alternatively, the cells to be labeled may be cells that have been fixed prior to biotinylation. In an aspect of the method above, the cells to be labeled may be cells that have been fixed and stored prior to biotinylation. The fixation and storage of cells can be performed by a known method according to the cell type. For example, cells in which methanol-fixed samples is stored at a low temperature (e.g., −80° C.) can be used. Alternatively, the cells may be those that have been cryopreserved prior to biotinylation. The cryopreservation of cells can be performed by a known method according to the cell type.
The method above may be any of in vivo, ex vivo, and in vitro. Preferred is a method in vitro.
In an aspect, the present invention also relates to a kit for use in a method for labeling cells with barcodes. Non-limitingly, the kit includes:
In an aspect, the kit can be used in “1. Method for Labeling Cell with Barcode (1)” or “3. Method for Multiplexed Analysis of Cell Samples (1)”
The “reagent for biotinylating a cell surface protein of a cell (reagent used for biotinylation, biotin labeling reagent)”, the “barcode”, the “biotin binding substance”, and the “barcoded biotin binding substance” are as described in “1. Method for Labeling Cell with Barcode (1)”.
In an aspect, the reagent for biotinylating the cell surface protein of the cell is sulfo-N-hydroxysuccinimide-biotin.
In an aspect, the “barcode” is oligo DNA. In an aspect, the “biotin binding substance” is streptavidin.
The present invention also relates to a method for multiplexed analysis of cell samples. Non-limitingly, the method includes:
The method for multiplexed analysis of cell samples described above includes, after step (ii),
(iii) mixing all or a part of the cell groups or single cells barcode-labeled in (ii), and
In the method, since labeling is performed with a barcode that can identify each cell group or each single cell, after labeling, a mixture of all or a part of the cell groups or single cells barcode-labeled can be collectively analyzed. In other words, even if analysis is collectively performed, it is possible to identify which results are from which cell group or single cell, by a barcode that can identify each cell group or each single cell.
The analysis is not limited as long as it is means for analyzing cells. Examples to be analyzed include the cell content (e.g., nucleic acids such as DNA and RNA). In an aspect, the analysis is cellular RNA analysis.
Flow cytometry, immunofluorescence analysis or the like can be performed by directly or indirectly (secondarily) labeling the barcoded biotin binding substance with fluorescence or the like.
Alternatively, by using a uniform manifold approximation and projection (UMAP) method or the like based on data obtained by scRNA-seq analysis, cells with similar nature can be classified into a plurality of clusters. UMAP is an analysis tool that performs dimension reduction (dimension compression) on a given point sequence and visualizes points that are “close” to each other in a high dimension by disposing them “close” to each other in a low dimension as well. Differences in properties of each sample can be analyzed, for example, by comparing the difference between the cell counts of the samples contained in each cluster, classified by identifying samples that are origins using barcodes different for each sample as an index.
Also, the mixture of all or a part of the cell groups or single cells barcode-labeled can be individually analyzed by collectively sequencing and distinguishing which sequence is for which cell group or each single cell by barcode identification.
In addition to RNA analysis, application to genomic DNA analysis is also conceivable. For example, it becomes possible to create cell division lineage for each cell by genomic DNA sequencing in single cells to detect DNA base sequence mutation for each single cell. Sample multiplexing by adding barcodes when genomic DNA sequencing in single cells makes it possible to significantly reduce analysis costs for creating cell division lineage from single cell groups derived from a plurality of tissues. In addition, for example, the quantitative detection of mitochondrial DNA by DNA sequencing in single cells makes it possible to trace variation in the number of mitochondria in cell groups. In this case, sample multiplexing by adding barcodes makes it possible to reduce analysis costs.
In an aspect, the method for multiplexed analysis of the cell samples includes:
The present inventors have newly developed a simple yet reliable method for labeling cells for multiplexing in scRNA-seq analysis. Conventional cell hashing methods target specific proteins that are widely expressed on the surface of the cell tested, and are therefore difficult to be used when such proteins are not widely expressed in samples. In contrast, the USB method does not require such specific proteins, and overcomes disadvantages of existing technologies by using a universal label that can target any protein.
Multiplex reagents that are currently available, use antibodies against cell surface proteins such as CD298 and/or MHC Class I antigens or CD45. These antigens are thought to be universally expressed in adult tissues or the immune system. However, embryonic cells do not express these proteins, and thus the current cell hashing method is not applicable. Contrary to this, the USB method has applicability to all cell surface proteins, and can be used for multiplexing regardless of cell type (as long as cell surface proteins exist). In fact, to the multiplexing of rat lung cells, which was not able to be studied by the Ab method that uses chondrocytes derived from iPS cells of mouse, monkey or human, and CD45 or MHC antibodies, the USB method was successfully applied (
Conventional labeling techniques for multiplexing, such as the transient barcoding method that introduces barcoded DNA into cells, and CellTag indexing method that uses a lentivirus for barcoded DNA introduction, are difficult to be applied to cells other than cultured cells, and therefore, its versatility is limited. In contrast, the USB method is highly versatile and applicable to any type of cell.
The USB method is also applicable to animals other than mouse or human. For other species. the information on cell surface proteins or antibodies against them may not be readily available, and thus the conventional Ab method is difficult to be applied thereto. However, the USB method has been proved to be applicable to monkeys and rats, and is theoretically applicable to cells of any species.
Further, the ClickTag multiplexing method in which click chemistry and NHS-ester are combined (NPL 13) is a technique for attaching a barcoded DNA tag to a cell protein, as is the USB method. However, the labeling efficiency of viable cells is poor, and this method can only be applied to methanol fixed cells. The CellPlex method (3′ CellPlex Kit, 10× Genomics) uses a reagent that introduces a DNA tag into lipid in the cell membrane for universal cell labeling (NPL 14), but it is difficult to be applied to methanol fixed cells. Contrary to this, the USB method is also applicable to fixed cells, and enabling more flexible sample multiplexing.
As described in the section “Materials and Methods” in EXAMPLES, all components used in the USB method are commercially available and relatively inexpensive. The stock solution of S-NHS-biotin is stable at −80° C. for more than 1 year, and a vial containing 50 mg of S-NHS-biotin is sufficient for labeling up to 10,000 samples. 10 micrograms of DNA-barcoded streptavidin can be used for no less than 150 multiplexing experiments. Since 10 types of DNA-barcoded streptavidin are commercially available, 10-plex analysis can be performed at once. Streptavidin to which arbitrary barcoded oligo DNA is added can be custom-made, and thus 10-plex multiplexing experiments or more can also be performed. That is, the USB method is a highly cost-effective method, which makes SINGLE CELL multiplexing analysis more affordable, and the USB method is a universal labeling method that is applicable to both viable cells and fixed cells, and that makes all kinds of multiplexed scRNA-seq analysis greatly easy.
Comparing to the single-step method, the USB method is characterized by the fact that it is easy to be newly introduced because existing reagents can be used, the cost per sample is lower than the single-step method, and the like.
The method above may be any of in vivo, ex vivo, and in vitro. Preferred is a method in vitro.
4. Method for Directly Labeling Cell with Barcode (2) (Single-Step Method).
In an aspect, the present invention relates to a method for labeling a cell with a barcode. Non-limitingly, the method includes:
The “cell group containing a plurality of single cells”, “cell”, “barcode”, “cell surface protein”, and the like are as described in “1. Method for Labeling Cell with Barcode (1)”. By bringing a cell into contact with a barcode modified with a compound that binds to an amino group of a cell surface protein of the cell, the compound that binds to an amino group of surface proteins in the modified barcode binds to an amino group of cell surface proteins, and then the cell is labeled with the barcode.
In an aspect, the compound that binds to the amino group of the cell surface protein of the cell is selected from the group consisting of sulfo-N-hydroxysuccinimide, N-hydroxysuccinimide, and pentafluorophenyl.
Amino groups (mainly, primary amines: —NH2) are present at a side chain or an amino terminus of lysine residues contained in many proteins. Since, in particular, the ϵ-amino group of a lysine residue is highly reactive, and in many cases, lysine residues are present on the surface of protein tertiary structure, the amino groups are functional groups that are particularly easy to be used for protein modification. The representative of the compound that binds to the amino group of the cell surface protein of the cell is N-hydroxysuccinimide (abbreviation: NHS)-ester. An NHS-ester barcode reacts with the amino group of the cell surface protein to form an amide bond. NHS-esters includes sulfo-NHS-ester that has a sulfonic acid group, and NHS-ester that does not have a sulfonic acid group. The former includes, for example, sulfo-N-hydroxysuccinimide (sulfo-NHS), and the latter includes, for example, N-hydroxysuccinimide (NHS).
In addition, a spacer can be provided between the barcode and NHS-ester. The length of a spacer is non-limitingly 0 to 300 Å, and preferably 10 to 50 Å. Alternatively, those with polyethylene glycol (PEG) introduced in a spacer part (e.g., NHS-PEG4) can also be used.
The method for binding a barcode and NHS-ester is not particularly limited, and can be performed by any method.
In an aspect, a carbodiimide compound, such as 1-ethyl-3-[3-dimethylaminopropyl] carbodiimide hydrochloride (EDC), N,N′-dicyclohexylcarbodiimide or the like is used. Incorporation of sulfo-NHS or NHS into EDC binding protocol makes reaction efficiency to be raised, and a dry-stable (amine-reactive) intermediate is generated. Specifically, first, oligo DNA (barcode) modified with a carboxyl group (—COOH) at the 5′ end is prepared, for example. The addition of a carboxyl group to oligo DNA can be performed by a known method. For example, it can be non-limitingly performed by using 10-carboxy-decyl-(2-cyanoethyl) (N,N-diisopropyl)-phosphoramidite, or N-hydroxysuccinimide ester (e.g., reagent, 5′-Carboxy-Modifier C10 (Glen Research). Next, NHS is coupled to-COOH of 5′-COOH-DNA by EDC to form an extremely highly stable NHS-ester relative to O-acylisourea intermediate. DNA that has been “(sulfo-) NHS-esterified” (activated state) form a stable bond with the amino group in the cell surface proteins. This allows cells to be labeled by the (sulfo-) NHS-esterified DNA (modified barcode).
Alternatively, the “(sulfo-) NHS-esterified” DNA may be made by the following method.
For example, the “(sulfo-) NHS-esterified” DNA is induced by reacting —NH2— modified oligo DNA with disuccinimidyl suberate (DSS) (Thermo Fisher Scientific™, 21655 and the like) or bis(succinimidyl) suberate (BS3) (Thermo Fisher Scientific™, 21580, and the like). Specifically, first, oligo DNA (barcode) modified with an amino group (—NH2) at the 5′ end is prepared, for example. The addition of —NH2 to DNA can be performed by a known method. For example, it can be non-limitingly performed by using 6-(4-monomethoxytritylamino) hexyl-(2-cyanoethy)-(N,N-diisopropyl)-phosphoramidite (e.g., reagent, 5′-Amino-Modifier C6 (Glen Research)). Next, DNA with —NH2 added at the 5′ end is reacted with DSS or BS3. DSS is a reagent that have two NHS-esters; one NHS-ester is coupled to —NH2 by reacting it with DNA to creates a state in which the other is exposed, and the activated NHS-ester is added to DNA terminus. BS3 is DSS with a sulfo group added, and as is DSS, one NHS-ester is coupled to —NH2 by reacting it with DNA to creates a state in which the other is exposed, and the activated NHS-ester is added to DNA terminus. In this method, once two DNAs and one DSS or BS3 bind, they can no longer bind to cells afterwards. Therefore, adjustment of reaction conditions (reaction time, and the like) is needed so that only one DNA binds to DSS or BS3.
In addition, pentafluorophenyl, isothiocyanate, and the like also modify a barcode, and can be used for binding to the amino groups of the cell surface proteins. Pentafluorophenyl and isothiocyanate can bind to both primary amines and secondary amines.
Note that 6-FAM (6-carboxyfluorescein), biotin, and the like may be bound to a portion of the barcode that has not been NHS-ester modified, to use for detection or the like other than multiplexed analysis or the like. For example, in a case where oligo DNA is NHS-ester modified at the 5′ end, 6-FAM or biotin may be added to the 3′ end of the DNA.
In an aspect, the compound that binds to an amino group of cell surface proteins of the cell has been cryopreserved.
In an aspect, the barcode is oligo DNA.
In an aspect, the method is for labeling a viable cell. In an aspect, the method is for labeling a fixed cell. The “viable cell”, “fixed cell”, and the like are as described in “1. Method for Labeling Cell with Barcode (1)”.
The method above may be any of in vivo, ex vivo, and in vitro. Preferred is a method in vitro.
The method above may be any of in vivo, ex vivo, and in vitro. Preferred is a method in vitro.
In addition, unless there are any technical problems, the content described in “1. Method for Labeling Cell with Barcode (1) (USB Method)” also applies to “4. Method for Directly Labeling Cell with Barcode (2) (Single-Step Method)” in this section.
The single-step method directly brings cell surface proteins of a cell into contact with a modified barcode, which allows the working process to be simplified, and generally the working time is shorter than that of the USB method (2 hours of process time). A shorter working time can reduce influence on cells. Since the number of centrifugation operations is reduced, loss of the cells due to the operations can be reduced.
In an aspect, the present invention also relates to a kit for use in a method for labeling cells with barcodes. The kit non-limitingly includes a barcode modified with a compound that binds to an amino group of a cell surface protein of the cell.
In an aspect, the kit can be used in “4. Method for Directly Labeling Cell with Barcode (2)” or “6. Method for Multiplexed Analysis of Cell Samples (2)”.
The “method for labeling cells with barcodes”, “barcode modified with a compound that binds to an amino group of a cell surface protein of the cell”, and the like are as described in “4. Method for Directly Labeling Cell with Barcode (2)”.
In an aspect, the compound that binds to an amino group of a cell surface protein of the cell is sulfo-N-hydroxysuccinimide.
In addition, unless there are any technical problems, the content described in “2. Kit (1)” applies to Kit (2) in this section.
The present invention also relates to a method for multiplexed analysis of cell samples. Non-limitingly, the method includes:
In an aspect, the analysis is cellular RNA analysis.
The method above may be any of in vivo, ex vivo, and in vitro. Preferred is a method in vitro.
In addition, unless there are any technical problems, the content described in “3. Method for Multiplexed Analysis of Cell Samples (1)” also applies to Method for Multiplexed Analysis of Cell Samples (2) in this section.
The present invention will be described in detail by reference to Examples; however, the present invention is not limited to the following Examples. Those skilled in the art can easily modify or change the present invention based on the description herein, and such modification and changes are included in the technical scope of the present invention.
Unless otherwise clearly stated, the following materials and methods were used in EXAMPLES.
R1 ES cells were provided from Dr. Sado (Kindai University). EB3 ES cells (AES0139) (NPLs 15 and 16) were provided from Cell Engineering Division (Cell Bank) of Riken BioResource Research Center.
Undifferentiated ES cells were maintained according to the method described in NPL 17 with some modifications. Briefly speaking, ES cells were cultured in Glasgow's Minimum Essential Medium (GMEM; Merck KGaA, G5154) to which 14% Knockout Serum Replacement (KSR; Thermo Fisher Scientific™, 10828028), 1% fetal bovine serum (FBS), 1,000 U/mL leukemia inhibitory factor (LIF), 0.11 mg/mL sodium pyruvate (Natalai Tesque, 29806-12), 0.1 mM 2-mercaptoethanol (Thermo Fisher Scientific™, 21985023), 1×non-essential amino acid (Thermo Fisher Scientific™, 11140076) and 1×GlutaMAX supplement (Thermo Fisher Scientific™, 350061) were added, on mouse embryonic fibroblast feeder cells treated with mitomycin C. For passage, 0.25% trypsin (FUJIFILM Corporation, 201-16945) was used.
After removing the feeder cells, 1×104 ES cells were seeded in 100 μL differentiation medium (GMEM supplemented with 15% FBS, 0.11 mg/mL sodium pyruvate, 0.1 mM 2-mercaptoethanol, 1×non-essential amino acid, and 1×GlutaMAX supplement) in a U-bottom 96-well low cell attachment plate (Nunclon Sphera 96-Well U-Shaped-Bottom Microplate; Thermo Fisher Scientific™, 174925), and cultured for 24 hours. Spheroids made in the plate were transferred to a 90 mm, low cell attachment culture dish (Nunclon Sphera 90 mm dish, 174945), and cultured for further 4 days to form embryoid bodies (EBs). The EBs were washed three times with PBS, dissociated into SINGLE CELLs (group of single cells) using 0.25% trypsin, the cells were seeded on a gelatin-treated culture dish, and cultured for 7 days to induce further differentiation.
3. Preparation of SINGLE CELLs for scRNA-seq Analysis
ES cells were dispersed into SINGLE CELLs, cultured for 1 hour on the gelatin-treated culture dish, and the feeder cells were removed. Suspension cells were collected as undifferentiated ES cells.
Differentiated ES cells were washed three times with PBS, treated with 0.25% trypsin, and then dispersed into SINGLE CELLs. Dead cells contained in a cell suspension solution were removed by Percoll centrifugation method. Briefly speaking, the cells were suspended in a 25% Percoll PLUS solution (Cytiva, Inc., 17544501) which concentration was adjusted by using RPMI 1640 (FUJIFILM Corporation, 183-02165) added with 5% FBS and 10 mM HEPES (Natalai Tesque, 17557-94), and layered on a 65% Percoll PLUS solution. After centrifugation at 1,000 g for 20 minutes, the interphase was gathered and washed with differentiation medium. The cell suspension solution was filtered through 40 μm cell strainer, and trypan blue-unstained cell percentage was measured as cell viability.
A S-NHS-biotin stock solution was prepared by dissolving EZ-Link sulfo-NHS-biotin (Thermo Fisher Scientific™, 21217) in PBS at a concentration of 10 mg/mL, and dispensed and stored at Δ80° C. until use.
Single cells containing 0.5-2×106 SINGLE CELLs prepared in “3. Preparation of SINGLE CELLs for scRNA-seq Analysis” were washed once with ice-cold PBS added with 1% FBS. The washed cells were suspended in 500 μL ice-cold NHS-biotin working solution in PBS added with 1% FBS, and incubated on ice for 10 minutes. The concentration of S-NHS-biotin in EXAMPLES was 10 μg/mL.
After incubation on ice, 3 mL ice-cold PBS containing 3% FBS was added to the reaction, and the cells were centrifuged at 4° C., 300 g for 5 minutes. The cells were further washed twice with Cell Staining Buffer (BioLegend, Inc., 420201), and then kept on ice until barcoded oligo DNA binding in the next step.
5. Antibody Labeling with Anti-CDH1 Antibody (Control Experiment)
Single cells containing 0.5-2×106 SINGLE CELLs prepared in “3. Preparation of SINGLE CELLs for scRNA-seq Analysis” were washed once with ice-cold Cell Staining Buffer, suspended in 50 μL ice-cold blocking solution (0.05 μg/mL TruStain FcX PLUS antibody [BioLegend, Inc., 156603] in Cell Staining Buffer), and incubated on ice for 10 minutes. An ice-cold primary antibody solution (0.05 μg/mL biotinylated CDC324 [E-cadherin] rat monoclonal antibody [Thermo Fisher Scientific™, 13-3249-82]) was mixed with the cell suspension solution, and the cell was incubated on ice for 30 minutes. Next, the cells were washed three times with ice-cold Cell Staining Buffer, and then kept on ice until binding of barcoded DNA.
As an isotype control, biotinylated rat IgG1 isotype control (Thermo Fisher Scientific™, 13-4301-82) was used.
By using 5 types of TotalSeq® PE streptavidin (BioLegend, Inc., 405251, 405253, 405255, 405257, 405259) or TotalSeq® anti-biotin antibody (BioLegend, Inc., 409008), the biotin-labeled cells prepared in “4. USB Labeling” or the antibody-labeled cells prepared in “5. Antibody Labeling with Anti-CDH1 Antibody (Control Experiment)” were allowed to bind to barcoded oligo DNA as follows. The information on the barcoded oligo DNA used is shown in the table below.
The cells were suspended in 100 μL solution containing 0.6 μg/mL ice-cold TotalSeq® PE streptavidin or 2 μg/mL TotalSeq® anti-biotin antibody, diluted with Cell Staining Buffer, incubated on ice for 30 minutes, and then washed three times. The cell suspension solution was filtered through 40 μm cell strainer, the same number of cells from each sample was gathered in a single tube and used for cell trapping and cDNA synthesis using a BD Rhapsody™ apparatus.
First, by the same method as in the above 1-3, SINGLE CELL suspension solutions of undifferentiated ES cells and differentiated ES cells were prepared.
Specifically, healthy F344 rat lungs were cut into 0.5 mm2 with razor blade, digested in a Liberase solution [RPMI-1640 (Natalai Tesque)] added with 10% FBS, 10 mM HEPES pH 7.2-7.4, 0.25 mg/ml Liberase™ (F. Hoffmann-La Roche Ltd.), and 2000 U/mL DNase I (Merck KGaA) for 60 minutes at 37° C. to prepare a SINGLE CELL suspension solution. Dead cells were removed by Percoll. Alternatively, growth plate cartilage from proximal tibia was collected from 3-week-old mice and cut finely into 1 to 2 mm pieces. Next, the cut pieces were dispersed into SINGLE CELLs (group of single cells) with Liberse™ (F. Hoffmann-La Roche Ltd.) solution for 120 to 210 minutes while continuously agitating.
Reagents for cell staining were biotinylated CDC324 (E-cadherin) rat monoclonal antibody (Thermo Fisher Scientific™, 13-3249-82), TotalSeq® PE streptavidin (BioLegend, Inc., 405251), PerCP-Cy5.5 anti-rat CD45 (Bio-Rad Laboratories, Inc., clone OX-1), PE anti-rat CD31 (Bio-Rad Laboratories, Inc., Clone TLD-3A12), streptavidin-APC (BioLegend, Inc., 405207), APC rat anti-mouse CD45 (BD Biosciences, 559864), and PE anti-mouse H-2 antibody (BioLegend, Inc., 125505).
The stained cells were analyzed by flow cytometry using EC800 Cell Analyzer (Sony Corporation), Cyto FLEX flow cytometer (Beckman Coulter, Inc.), or FACS Arria II flow cytometer (BD Biosciences).
All animal experimentation was reviewed and approved by the Animal Committee in Kyoto University, and the Animal Care and Use Committee of Tokyo University of Science (approval numbers: S17034, S18029, S19024, and S20019).
Cells cultured in gelatin-treated culture dish were fixed with 4% paraformaldehyde in PBS overnight at 4° C., and then washed three times with washing buffer solution (PBS containing 0.1% Triton X-100). The cells were incubated in a blocking buffer solution (PBS containing 1% BSA and 0.1% Triton X-100), and then incubated overnight with a primary antibody diluted in the blocking buffer solution.
After washing with the washing buffer solution, the cells were incubated for 1 hour at room temperature with a secondary antibody diluted in the blocking buffer solution. Next, the cells were mounted with a solution adjusted to 2% DABCO (Sigma Aldrich Co. LLC, D2522) with PBS containing 40% glycerol. Images were captured with LEICA AF6500 fluorescent imaging system (Leica Microsystems).
The antibodies and dilution ratios used in EXAMPLES were as follows.
Primary antibody-biotinylated CDC324 (E-cadherin: CDH1) rat monoclonal antibody (Thermo Fisher Scientific™, 13-3249-82) (1:200) and anti-OCT4 rabbit polyclonal antibody (Abcam plc, ab19857) (1:400);
Secondary antibody-Alexa Fluor 488 anti-rat donkey IgG (Thermo Fisher Scientific™, A21208) (1:500) and Alexa Fluor 555 anti-rabbit donkey IgG (Thermo Fisher Scientific™, A31572) (1:500).
Undifferentiated ES cells were treated with 100 μg/mL S-NHS-biotin (10-fold the concentration used for multiplex scRNA-seq) and TotalSeq® PE streptavidin, and 3×103 cells were seeded on a 3.5 cm, gelatin-treated culture dish. Concurrently, the same number of untreated cells were seeded. After 5-day culture, the cells were stained with hematoxylin, and the colony formation ratios of untreated control and treated sample were examined.
10. scRNA-seq
For preparation of a scRNA-seq library, a part of BD Rhapsody Express Single-cell Analysis system (BD Biosciences) and Targeted mRNA and AbSeq Amplification Kit (BD Biosciences, 633771) was used.
Cells (1.6×104) labeled with barcoded oligo DNA were trapped in a microwell cartridge, and cell entrapping beads to which mRNA and barcoded oligo DNA derived from SINGLE CELLs were bound were collected to be used for cDNA synthesis.
The cDNA synthesis was performed according to the BD Rhapsody Express manual. cDNA and barcoded oligo DNA were amplified according to TAS-seq protocol (NPL 10). Briefly speaking, by treating with 0.75 U/μL Terminal deoxynucleotidyl transferase (TdT, Enzymatics, Inc., P7070L) solution (containing 1 mM dCTP [GE healthcare technologies Inc., 28406512] and 0.05 mM ddCTP [GE healthcare technologies Inc., 27206101] in 1×TdT buffer solution [Thermo Fisher Scientific™, 16314015]) for 30 minutes while mixing at 1,200 rpm, a poly-C tail was added to the cell entrapping beads to which cDNA and barcoded oligo DNA were bound. For synthesizing a second-strand cDNA sequence, poly-C tail cDNA beads were treated by 1×KAPA HiFi HS ReadyMix (F. Hoffmann-La Roche Ltd., 7958935001) using 5′-BDWTAv2-9G primer at a program of [98° C. for 20 seconds, 47° C. for 1 minute, and 72° C. for 2 minutes]×16 cycles.
Next, 1×KAPA HiFi HS ReadyMix using 3 primers (5′-BDWTAv2, Universal Oligo-long, and TotalSeq-ADT-oligo 1) was added to the reaction, and the whole transcriptome was amplified using a program of [98° C. for 20 seconds, 63° C. for 20 seconds, and 72° C. for 5 minutes]×7 cycles. The amplified cDNA and barcoded oligo DNA were fractionated by size sorting using AMPure XP beads (Beckman Coulter, Inc., A63881).
The cDNA was further treated by 1×KAPA HiFi HS ReadyMix using 2 primers (5′-BDWTAv2 and Universal Oligo-long) using a program of [98° C. for 20 seconds, 65° C. for 20 seconds, and 72° C. for 5 minutes]×5 cycles. The barcoded oligo DNA was further amplified using 2 primers (TotalSeq-ADT-oligo 2 and Universal Oligo-long) using a program of [98° C. for 20 seconds, 65° C. for 20 seconds, and 72° C. for 5 minutes]×12 cycles.
The sequences of the primers are as shown in Table below.
NH2-C12 is bound to the 5′ ends of SEQ ID NOs: 8 and 9.
The sequences of the amplified cDNA and barcoded oligo DNA were purified using AMPure XP beads, and the size distribution and yields were checked by a bioanalyzer (Agilent Technologies, Inc.) using High Sensitivity DNA Kit (Agilent Technologies, Inc., 5067-4626). Sequence data were collected using NovaSeq 6000 S4 flow cell (Illumina, Inc.) at Immuno GeneTeqs, Inc.
Sequence data that have been demultiplexed and DBEC-treated were imported to R (4.0.2), and the cells determined to be “doublets” or “not detected” from tag count analysis and the cells with a small number of sequence reads were deleted. Also, genes with a small number of reads, and genes that were not found in a sufficient number of cells were deleted. Seurat package (4.0.5) was used for subsequent analysis, and parameters were modified according to the tutorial workflow (NPL 18). The tutorial workflow is typically “creating Seurat object by using expression table. After performing quality check of the data, normalizing to extract characteristic genes. Performing dimensional compression to form clusters of cells which are the same cell type and conditions. By cluster analysis, classifying cells in the dataset and labeling. In order to determine the cell type and annotate, confirming the marker gene and expression variable genes per cluster”.
In the present EXAMPLE, cells in which mitochondrial genes accounts for more than 10% were excluded. Next, using the count matrix, LogNormalize, which is a default, global scaling normalization method, was performed. As the extraction method, vst was used. The “vst” basically detects gene in which the variance of the expression level is greater than that of the average expression level obtained in all cells. First, a straight line is fitted to the relationship of log (variance) and log (mean) using local polynomial regression (loess). As a result, the feature amount can be standardized without removing unexpected variations. Next, using the observed mean and expected variance (given by the fitted line), the feature amount is standardized. After clipping to the maximum value, the variance of the feature amount to a standardized value is calculated. In order to reduce the impact of technical outliers, the standardized value is clipped to a maximum value. The top 2,000 highly variable genes were calculated by FindVariableFeatures (command to identify a feature amount (feature, e.g., gene) which is an outlier on a “mean variability plot”).
After scaling the data, PCA was performed. Based on the Euclidean distances in a space occupied by the first 30 identified principal components, K-nearest neighbor (KNN) graph was constructed, and Louvain algorithm was applied for cluster identification. The first clustering yielded 11 clusters. In addition, UMAP was performed using 2,000 highly variable genes and 30 principal components to obtain 11 subclusters. Marker genes for each cluster were identified using FindAllMarker (command to identify a gene enriched in each cluster formed, in this EXAMPLE, Wilcoxon Rank Sum test was applied as default) (min.pct=0.25, logfc.threshold=0.25, set.only.pos=TRUE). The expression of E-cadherin gene (Cdh1) was plotted by FeaturePlot (command to visualize “feature” on a dimension reduced plot).
All raw FASTQ sequence files was registered to DNA Databank of Japan (DDBJ) under the Accession numbers of DRR333192 to DRR333193.
The present inventors have developed a novel cell labeling technique for multiplexing scRNA-seq using S-NHS-biotin named universal surface biotinylation (USB) method (
In this EXAMPLE, as a proof-of-concept experiment, mouse ES cells were used to compare two methods for analysis using undifferentiated cells and differentiated cells to be analyzed. Almost all mouse ES cells express adhesion molecules, E-cadherin (CDH1) (NPL 9) in the undifferentiated state; however, the number of cells that do not express CDH1 increases as the differentiation proceeds (
SINGLE CELLs were prepared from undifferentiated mouse ES cells (R1 line) and cells obtained by differentiation of RI cells by embryoid body formation, and used in the following experiments.
The cell samples were divided in two; one half was treated by the Ab method, in other words, using a biotin-conjugated anti-CDH1 antibody, and the other half was added with biotin to cell surface proteins using the USB method. Since barcoded DNA-labeled streptavidin used was coupled to fluorochrome phycoerythrin (PE), flow cytometry analysis was performed to examine labeling efficiency (
When using antibody labeling, the percentage of PE-positive cells in the undifferentiated ES cells was 87.8%, whereas the percentage of PE-positive cells in the differentiated ES cells was about 10%. When immunostaining was performed by using anti-CDH1 antibodies, almost all cells in the undifferentiated ES cell samples were CDH1-positive (
In contrast, when using the USB method, the percentages of PE-positive cells in the undifferentiated ES cells and differentiated cell samples were 99.6% and 96.4%, respectively (
This EXAMPLE revealed that the operation of the USB method does not affect cell viability.
A series of experimental operations by the USB method has a possibility to compromise cell states and growth. To examine this possibility, ES cells were treated by the USB method, and seeded on a culture dish to measure colony formation ability (
The USB method of the present invention can comprehensively bind DNA barcodes to most various cells through biotin-streptavidin binding, and its experimental operations does not cause the cell viability to be compromised. In this EXAMPLE, examined were whether the USB technique can be used for multiplexing in scRNA-seq analysis, and whether the expression profile of the cells are affected by the experimental operations.
R1 ES cells in the undifferentiated and differentiated states were treated by either the USB method or the Ab method. Further, undifferentiated and differentiated cells of another ES cell line, EB3, were separately labeled using the USB method. Equal amounts of these 6 samples were mixed, and approximately 16,000 SINGLE CELLs were trapped using the BD Rhapsody™ system. Some of these cells (about 4,000 cells) were used for cDNA synthesis and amplification for the whole transcriptome analysis, as well as amplification of barcoded oligo DNA tags according to the TAS-seq protocol (NPL 10). When estimating the size distribution and yields of the amplified cDNA and barcoded oligo DNA, appropriate amplification was confirmed (
After removing dead cells, doublet cells and cells for which no tag could be identified, the remaining 2,089 cells were subjected to informatics analysis (Table 3).
Data obtained from the undifferentiated and differentiated states of R1 ES cells labeled by either the USB method or the Ab method are shown (
Clusters 0, 1, and 3 were considered to correspond to pluripotent cell, naive cells, and primed pluripotent cells, respectively, and to be cells in the undifferentiated state, from the expressed genes (
It is worth noting that cells in these clusters labeled by two different methods fell into the same clusters (
Clusters other than 0, 1, and 3 such as vascular smooth muscle cells (cluster 2), dopamine neurons (cluster 4), epithelial cells (cluster 5), and neural cells (cluster 7) were identified as differentiated cells, and cell clusters thought of as macrophages (cluster 8), glial cells (cluster 9), and vascular endothelial cells (cluster 10) were detected. Mouse embryonic fibroblast (MEF) feeder cells which was not able to completely be removed were detected as cluster 6.
The cells labeled by the USB method were revealed to be present in all the clusters from the analysis of the cell samples belonging to each cluster (
In the differentiated cell clusters 2, 4, and 7, the numbers of cells labeled by the Ab method were reduced compared to the cells labeled by the UBS method (
This is considered that the expression of Cdh1 genes were suppressed in these differentiated cells, leading to a decrease in labeling efficiency by the Ab method using the CDH1 antibody. In Cdh1-positive clusters (clusters 0, 1, 3, and 5), almost the same numbers of cells were detected between the samples labeled by the USB method and the Ab method. However, in Cdh1-negative clusters (clusters 2, 4, 6, 7, 8, 9, and 10), the numbers of cells labeled by the Ab method were clearly lower than those by the USB method (
This trend was observed in the EB3 line, and the USB method was shown to be capable of applying to different ES cell lines (
CDH1-negative cells are thought not to have been labeled by the anti-CDH1 antibody, and should not be detected. Nevertheless, in the antibody-treated cells, the Cdh1-negative cells were clearly included, albeit in a clearly reduced number (Table 5,
Since the amplification by scRNA-seq is thought to be more sensitive than that by flow cytometry, some of the cells that express no CDH1 (or express very tiny amount) may be classified as positive cells, due to the non-specific binding of streptavidin and/or isotype control antibody. However, in cell labeling for multiplexing scRNA-seq, it is very important for all cells in the samples to be evenly labeled, and the tiny amount of non-specific binding by the Ab method itself is not considered to impact on the results of multiplexing.
In this EXAMPLE, sulfo-NHS-esterification of barcoded oligo DNA was performed. The materials used are as follows.
For a verification experiment for cell labeling, 6-FAM modification at the 3′ end was also performed. (6-FAM modification is not required for sample multiplexing of scRNA-seq.)
The methods used in the EXAMPLE are as follows.
The sulfo-NHS-esterified oligo DNA obtained above was stored at −80° C. until used for subsequent examples.
In this EXAMPLE, cell were labeled using sulfo-NHS-esterified oligo DNA (NEO) (single-step method) and the labeling efficiency was verified.
All operations and reaction of this EXAMPLE were performed at 4° C. or on ice.
The above labeled cells were used for SINGLE CELL RNA-seq analysis (scRNA-seq analysis).
Further, the labeling efficiency of cells was verified as follows.
The suspension solution of the cells labeled by using 6-FAM modified, sulfo-NHS-esterified oligo DNA (NEO) was passed through a Cell strainer, and then flow cytometry analysis was performed.
The results are shown in
Undifferentiated ES cells cultured in naive pluripotent stem cell maintenance medium (2i medium), undifferentiated ES cells cultured in normal ES cell medium (normal ESC medium), differentiated ES cells cultured in LIF(−) serum medium for 5 days (Day 5 differentiated), and differentiated ES cells cultured in LIF(−) serum medium for 10 days (Day 10 differentiated) were each labeled with different barcode tags by the single-step method, and then multiplexed scRNA-seq analysis was performed (
9 cell types were detected by clustering with UMAP. Cell types and marker genes (top representative 20) of the 9 cell types are as follows.
The results are shown in
In the 2i medium, many naive pluripotent cells were confirmed, and in the normal ESC medium. primed pluripotent cells were the most. In Day 5 differentiated. cardiovascular cells accounted for the majority, and in Day 10 differentiated, differentiated cells were detected, including mainly vascular smooth muscle cells.
Number | Date | Country | Kind |
---|---|---|---|
2022-016576 | Feb 2022 | JP | national |
The present application is a continuation application of International Application No. PCT/JP2023/004476, filed on Feb. 2, 2023, which claims priority under 35 U.S.C. 119 (a) to Patent Application No. 2022-016576, filed in the Japan on Feb. 4, 2022, all of which are hereby expressly incorporated by reference into the present application.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2023/004476 | Feb 2023 | WO |
Child | 18793020 | US |