This sequence listing entitled as COMPOSITION AND THE USE OF CELL LYSIS REAGENTS.XML, created on Jul. 7, 2023 and having a size of 15,306 bytes is incorporated hereby by reference pursuant to 37 C.F.R. section 1.52(e) (5).
The present invention relates to cell lysis reagents for increasing the gene and transcript capture during copy DNA synthesis.
Gene expression analysis constitutes one of the most common techniques in cell and molecular biology. To evaluate the gene expression, the cells must be first disrupted to release their intracellular nucleic acids (e.g., RNA), which can then be captured, amplified and analysed according to the needs of a study. To ensure efficient RNA release, the cells are typically exposed to lysis reagents such as detergents (surfactants), which preferentially bind to cell membrane and solubilize lipids.
In gene expression and transcriptomics studies cell lysis is typically performed under conditions that minimally affect the activity of nucleic acid processing enzymes. Finding the right balance between the efficient cell lysis and compatibility with subsequent enzymatic steps often relies on extensive trial-and-error efforts. So far, the non-ionic detergents belonging to the TRITON, TWEEN and IGEPAL class have been used most extensively in cell and molecular biology assays, where cell lysis and nucleic acid analysis are conducted without a nucleic acid purification step. However, the “lysis” efficiency of non-ionic detergents for releasing the intracellular nucleic acids for subsequent analysis, remains poorly characterized. The large variety of chemical structures and properties makes it difficult to establish the general rules for the efficiency of cell lysis and nucleic acid release [1]. Typically, ionic detergents provide more efficient means for cell membrane solubilization as compared to the non-ionic or zwitterionic ones [2], yet the use of ionic detergents (e.g., SDS) is undesirable because they tend to inhibit enzymatic reactions (e.g., by denaturation of proteins). In situations where a given enzymatic reaction and cell lysis are performed in the same reaction mix, the use of enzyme-compatible cell lysis reagents is required. Relevant to this invention, for the enzymatic reactions involving reverse transcription (RT) reaction, polymerase chain reaction (PCR), and nucleic acid amplification and synthesis, it is desirable to use non-inhibitory lysis reagents that can efficiently disrupt the cells and release intracellular nucleic acids. Such non-inhibitory lysis reagents (detergents) often belong to the TRITONS, TERGITOLS, TWEENS and BRIJ class. It has been shown that in many cases the optimal hydrophobic chain of non-ionic detergents is ˜18 and that hydrophilic head length can effect cell lysis efficiency [3]. The size of the surfactant head group can be a determining factor for the efficacy by which a surfactant is able to disrupt a cell membrane [4]. Many non-ionic surfactants show fairly high (3×103/M) membrane partition coefficient. For example, Triton X-100 can be regarded as a “strong” detergent because the large headgroups affect the lipid packing and may lead to a more efficient lipid membrane disruption. In contrast, “weak” detergents typically have small headgroups. The tritons, CmEOn with n=7 to 8, and the alkyl maltosides are “strong” detergents, whereas the alkyl glucosides and C12EOn with n=3-6 are “weak” detergents [5].
The extent of the surface activity of a detergent is mainly determined by the hydrophilic-hydrophobic balance of the molecule [6]. At concentrations just below the critical micelle concentration (CMC) individual detergent molecules or small aggregates causes the membrane to become more permeable. When the bilayer becomes saturated with detergent, detergent-lipid micelles may appear in the aqueous medium, and membrane solubilization will take place. Usually mixed micelles occur only at or near the CMC of the pure detergent. At detergent concentrations close to or above the CMC range, rapid breakdown of the membrane typically occurs [7]. Loss of cell viability is typically recorded below CMC [8], before full membrane solubilization.
When detergent is applied to membranes, solubilization typically occurs by three successive stages: detergent incorporation, lamellar-micellar phase transition (formation of mixed micelles) and total solubilization (separation of protein and lipid mixed micelles). The insoluble fraction that remains after progression through these events is typically enriched in sphingolipids, cholesterol and raft proteins [9]. Even though increasing the temperature increases the lipid solubilization, up to 30% of lipids may remain insoluble under these change in environment [10]. As a result, cell lysis and thus RNA release using only non-ionic detergent might be limited because it may generate detergent resistant cell membranes and prevent efficient release of mRNA molecules or restrict RT enzyme access to some mRNA molecules.
The use of cholesterol-binding detergents and compounds often facilitates membrane solubilization by non-ionic detergents [11-13]. In particular, cholesterol-binding molecules tend to disrupt protein-lipid interactions [11], facilitate protein solubilization by non-ionic detergent [10, 14, 15] and by doing so improve the overall cell lysis efficiency. However, whether the use of cholesterol-binding detergents (e.g., saponin, methyl-β-cyclodextrin, digitonin, etc.) is compatible with reverse transcription (RT) reaction and whether their use alone or in combination with non-ionic detergents improves copy DNA (cDNA) yields, remains so far unknown and uncharacterized.
Herein, we reveal that the use of cholesterol-binding compounds may improve cDNA yields when used alone or in combination with non-ionic detergents. Specifically, we reveal that the use of triterpene glycosides (also known as saponins) alone, or in combination with non-ionic detergents, is compatible with single-cell RNA-Seq protocols, does not destabilize water-in-oil emulsions at elevated temperatures (>37° C.), and increases the diversity of RNA species captured from lysed-cells during the reverse transcription reaction.
Many state-of-the-art molecular biology protocols for single-cell nucleic acid analysis rely on a step where single-cells, or a mixture of cells, are lysed to release their intracellular nucleic acids [16-30]. Some lysis reagents (e.g., SDS, guanidinium isothiocyanate), despite being highly efficient in lysing the cells, cannot be employed because their presence in the cell lysate may inhibit subsequent enzymatic reactions. For example, it is well known for a person experienced in the field that even small traces of SDS (<0.01%) may inhibit the reverse transcription and/or PCR reactions [31]. Therefore, for many molecular biology protocols that are based on nucleic acid analysis it is highly desirable to employ such cell lysis reagents that would efficiently lyse the cells to release their intracellular RNAs, and would not inhibit subsequent RT and/or PCR reactions. In addition, it is highly desirable to employ such lysis reagents that would improve intracellular RNA release and/or increase RNA capture and/or cDNA yields. As we reveal in this invention, the use of triterpene glycosides and in particular the use of saponin for cell lysis, solves the unmet need by improving intracellular RNA release upon cell lysis and increasing cDNA yields. Moreover, the disclosed composition improves the transcript capture of individual cells when performing single cell RNA-Seq experiments. The cell lysis reagents disclosed in this invention may be used directly in the RT, RT-PCR or PCR reactions. As a result, the use of triterpene glycosides with or without non-ionic detergents may improve analytical sensitivity of a large variety of assays that rely on efficient single-cell lysis, and subsequent nucleic acid amplification and analysis. Experienced in the art will be able to adapt the disclosed lysis reagents and composition for different molecular biology protocols involving single-cells, or mixture of cells. The non-limiting examples of assays that may benefit from the disclosed invention include RT, PCR, RT-PCR, qPCR, RT-qPCR sequencing and a large variety of single-cell-omic assays [16-30] amongst other. Finally, another important aspect of the invention relates to the fact that the use of triterpene glycosides and in particular the use of saponin, with or without non-ionic detergents, does not compromise the emulsion stability during elevated temperatures (>37° C.), when the water droplets are suspended in the fluorinated carrier oil comprising fluorosurfactant.
According the present invention, there is provided a composition of cell lysis reagent comprising a triterpene glycoside amphiphile that preferentially binds to fatty acids, phospholipids, glycolipids, and/or sterols and provided that said triterpene glycoside amphiphile does not lower the activity of the enzymatic reaction needed to synthesize DNA by more than 10%. The invention further provides that the cell lysis reagent does not inhibit the copy DNA synthesis by the reverse transcriptase. In one aspect, the triterpene glycoside is saponin.
In a second aspect, the composition further comprising triterpene glycoside amphiphile used in combination with the non-ionic amphiphile belonging to the class of detergents comprising of n-alkyl-glucopyranosides, n-alkyl-maltosides, n-alkyl-thioglucopyranosides, ethoxylated nonylphenols (known as different types of Tergitol), octylphenoxypolyethoxy ethanols (known as different types of Triton and Igepals), polyoxyethylene ethers (known as different types of Brij) and alkyl polyethylene glycol ethers, genapols.
In a third aspect, the triterpene glycoside amphiphile belongs to the class of detergents that bind the cholesterol such as CHAPSO/CHAPS, Digitonin, DC-cholesterol, Cholesteryl-PEG600, CHOBIMALT, CHAPSTEROL, Saponin, or cyclodextrin. Further, the non-ionic amphiphile(s) is one of, TERGITOL 15-5-7, TERGITOL 15-S-9, TERGITOL 15-S-30, TERGITOL 15-S-40, TERGITOL NP7, NP40, TRITON X100, GENAPOL C100, IGEPAL CA-630, IGEPAL CA-720, BRIJ58, BRIJ35, and TWEEN20.
In a fourth aspect, the concentration of the amphiphile(s) is in the range of 0.00001% to 10% (w/v), but more preferably in the range of 0.001% to 1% (w/v).
In a fifth aspect, according to the present invention, it is provides a composition of a reaction mixture that comprises cell lysis reagents, triterpene glycoside and non-ionic amphiphiles, that preferentially bind to fatty acids, phospholipids, glycolipids, and/or sterols and that do not inhibit the enzymatic reaction needed to synthesize DNA, and further provided that the reagent concentrations are 0.001% to 1% (w/v) and reagents necessary for DNA synthesis, and further that cell lysis and DNA synthesis reactions are performed in the same reaction mix.
In a sixth aspect, the present invention provides a method of cell lysis and DNA synthesis, the method comprising adding a cell(s), cell lysis reagents, comprising triterpene glycoside with or without a non-ionic amphiphile, and DNA synthesis reagents to a reaction mixture; lysing the cell(s) to release the desired component from the cell, wherein the desired component is nucleic acid; and amplifying the nucleic acid.
In a seventh aspect, DNA synthesis reagents comprises reverse transcription (RT) reagents in the method.
In an eighth aspect, DNA synthesis reagents comprises PCR reagents in the method.
In a ninth aspect, the reaction mixture of the method is loaded in a microreactor such as micro-well, nano-well, water-in-oil droplet, or any other compartment that has volume below 10 μl and above 1 pl.
In a tenth aspect, the reaction mixture of the method is loaded on a microfluidic device or other device, assembly or instrument capable of forming a water-in-oil droplet.
In an eleventh aspect, the present invention provides a method of cell lysis and DNA synthesis on single-cells comprising adding cells, DNA barcodes, cell lysis reagents comprising triterpene glycoside with or without of non-ionic amphiphiles, and DNA synthesis reagents in to water-in-oil droplets suspended in the fluorinated carrier oil, further comprising a fluorosurfactant; lysing the cell(s) to release the desired component from the cell, wherein the desired component is nucleic acid; and amplifying the nucleic acid.
In yet another aspect, the method provides encapsulating a plurality of cells and a plurality of DNA barcoding beads in a plurality of droplets together with the reaction composition, wherein each bead of the plurality of beads comprises a molecular tag, and some droplets comprise cells and beads, and further provided that the plurality of droplets comprises cell lysis reagents, RT reaction reagents, and DNA synthesis reagents and no more than one cell and no more than one bead, and the molecular tag of the bead in one droplet is distinguishable from the molecular tags of the beads in the other droplets; lysing the cells; and converting RNA into cDNA using molecular tags at RT primers.
In a further aspect of the method, it is provided that the molecular tag is released from the bead and the molecular tag is attached to the nucleic acid in the droplet released by lysed cell. The molecular tag is a barcoding DNA oligonucleotide, which allows the nucleic acid in one droplet to be uniquely identified from among the nucleic acids of the other droplets.
In aspects relating to the provided methods, it is provided that nucleic acid amplification is chosen from one of: polymerase chain reaction (PCR), loop-mediated isothermal amplification (LAMP), isothermal amplification, linear amplification, reverse transcription, replication, polymerase chain reaction (PCR), quantitative PCR, real-time PCR, digital PCR, reverse transcriptase PCR (RT-PCR), multiplex RT-PCR, and any combination thereof.
In a final aspect, the present invention provides a kit for cell lysis comprising a triterpene glycoside, a kit for cell lysis comprising triterpene glycoside and a non-ionic amphiphile, and a kit for DNA synthesis reaction comprising a reaction composition for cell lysis and DNA synthesis.
To assist understanding of the present disclosure and to show how embodiments may be put into effect, reference is made by way of example to the accompanying drawings described below. It should be noted that in the schematics provided various aspects are not drawn to scale.
Throughout the text the terms of ‘comprising’ and ‘containing’ have been used interchangeably and have the same meaning.
Articles such as “a,” “an,” and “the” include the singular and the plural reference unless the context clearly indicates otherwise. For example, a reference to “a cell” includes one cell and two or more of such cells.
The terms “detergent”, “surfactant”, “amphiphile” are used herein interchangeably and have the same meaning.
In the context of this inventions, if the presence of amphiphile in the reaction mix reduces the catalytic activity of DNA polymerase by less than 10%, it is shall be considered as non-inhibitory (absence of inhibition).
The present invention generally relates to the composition and methods of lysing cells for the purposed of nucleic acid processing and analysis. In particular, in one aspect it relates to a method of lysing a mixture of cells or single cells for the purpose of converting the cellular RNA into cDNA in the presence of lysis reagents. Certain aspects are generally directed to methods for performing reverse transcription (RT) and/or polymerase chain reaction (PCR) in the presence of cell lysis reagents.
The methods of the present invention may also comprise lysing the single-cell, or more than one cell, in the presence of nucleic acid processing enzymes, and that upon the cell lysis the nucleic acid processing enzyme retains most of its catalytic activity. In some scenarios, the presence of lysis reagents in the reaction mix may provide stabilizing effect, meaning that enzyme will retain catalytic activity under given conditions for a longer period of time as compared to the conditions where the lysis reagents are absent.
The present invention also relates to the composition and methods of lysing cells under conditions that do not substantially inhibit reverse transcription reaction so that RNA molecules released from lysed cells can be captured and converted to cDNA by an RT enzyme, in the presence of lysis reagents.
The disclosed invention may be relevant to a large variety of methods and techniques that involve processing of nucleic acids from lysed cells. Efficient cell lysis is particularly important in assays that aim to evaluate the transcriptional activity of single-cells [16-30]. For example, assays that are used to evaluate the gene expression of a cell, require a lysis step in order to release the intracellular nucleic acid molecules and make them accessible to biochemical reactions. Similarly, the assays that are used to evaluate RNA content of cells or tissues, require a lysis step in order to release the RNA molecules encoded by the said cells and make them accessible to biochemical reactions.
The present invention also relates to certain amphiphiles and their combination thereof to improve cell lysis and increase the diversity of RNA species that may be captured and amplified by RT and PCR. In particular, the use of triterpene glycosides and more specifically the use of saponins may improve cell lysis efficiency and as a result ensure better RNA capture as compared to the commonly used lysis reagents in the state-of-the-art methods.
In some scenario the use of saponin in combination with non-ionic detergent(s) may provide higher cDNA yields as compared to reaction conditions where the same non-ionic detergent or saponin are used alone.
Using the amphiphiles disclosed in this invention more than 50% of the total RNA must be released from the lysed mammalian cell.
Using the amphiphiles disclosed in this invention more than 5% of all mRNA molecules must be captured from the lysed mammalian cell during RT reaction.
In particular, the present invention involves lysis reagents belonging to triterpene glycosides, ethoxylated nonylphenols (known as different types of Tergitol non-ionic detergents), octylphenoxypolyethoxy ethanols (known as different types of Triton and Igepal non-ionic detergents), polyoxyethylene ethers (known as different types of Brij non-ionic detergents) and alkyl polyethylene glycol ethers, or any combination thereof, to achieve efficient cell lysis and obtain increased cDNA yields upon RT reaction. Moreover, said lysis reagents or combination thereof may be used to achieve a better capture of transcripts in single-cells.
Because the biological entities, such as cells, are efficiently lysed in the presence of triterpene glycosides, their intracellular RNA is released and becomes readily accessible to nucleic acid processing enzymes. Therefore, using the triterpene glycoside alone or in mixture with non-ionic detergents, one can release more RNA from lysed cells and thus obtain higher yields of cDNA and capture higher diversity of RNA molecules.
The present invention also simplifies the use of laboratory protocols that involve RNA capture and cDNA synthesis. Because disclosed lysis reagents are compatible with nucleic acid processing enzymes (e.g., RNA polymerase, DNA polymerase, RT enzyme), the cell lysis and nucleic acid analysis (e.g., reverse transcription) can be performed in the same reaction mix, without a need to purify the nucleic acids. This simplifies the use of laboratory protocols that involve RNA capture and cDNA synthesis.
Another important feature of the invention is that the disclosed composition of cell lysis reagents does not destabilize emulsion droplets at elevated temperatures. Therefore, a plurality of single cells isolated in a plurality of microfluidic droplets can be efficiently lysed and their RNA converted to cDNA without a negative effect on droplet stability. As a result, the composition of the cell lysis reagents described herein have broad applicability in high-throughput analytical methods where efficient cell lysis is required under conditions compatible with reverse transcriptase enzymes.
The disclosed composition of cell lysis reagents in combination with reagents necessary for DNA synthesis can be used for a method of cell lysis and DNA synthesis in the same reaction mix, as shown in
The present invention is also directed to a method comprising a step of encapsulating cells, lysis reagents, RT reaction mix and barcoding beads in microfluidic droplets. In particular, the present invention involves lysis reagents that do not compromise water-in-oil droplet stability during cell encapsulation, cell lysis, reverse transcription reaction and incubation at high temperatures (>80° C.) for extended periods of time (>10 min). As detailed below the droplets are made on a microfluidics device using fluorinated liquid as a carrier oil and which is supplemented with a fluorinated surfactant.
The methods of the present invention may also comprise a step of encapsulating the cells. In particular, it may comprise a step of encapsulating a plurality of cells into a plurality of microfluidic droplets with lysis reagents. Those of ordinary skill in the art will be aware of techniques for encapsulating cells within microfluidic droplets [32, 33]; see also for example, U.S. Pat. Nos. 7,708,949, 8,337,778, 8,765,485 each incorporated herein by reference. The cells preferably are encapsulated at a density such that on average one droplet would contain one cell or less. In a non-limiting example, the lysis reagents are encapsulated with cells and hydrogel beads carrying DNA barcodes.
In one set of embodiments, the cells are loaded in microfluidic droplets such that, on average, each droplet has less than 1 cell in it. For example, the average loading rate may be less than about 1 cell/droplet, less than about 0.8 cells/droplet, less than about 0.6 cells/droplet, less than about 0.4 cells/droplet, less than about 0.3 cells/droplet, less than about 0.2 cells/droplet, less than about 0.2 cells/droplet, less than about 0.1 cells/droplet, less than about 0.05 cells/droplet, less than about 0.01 cells/droplet.
The present invention makes use of such droplets to lyse encapsulated cells in order to release the nucleic acids from the said cells, and so that the nucleic acids can be processed further using DNA polymerization reaction.
In some embodiments the methods of the invention use droplets comprising cells and lysis reagents to disrupt the cell membrane and release nucleic acids. In other embodiments, the methods may comprise the steps of lysing the cells by increasing the temperature in the presence of lysis reagents.
In some cases, the nucleic acids that are released upon exposure to cell lysis reagents of this invention may be subjected to further processing, for example, by including suitable reagents specific to the nucleic acid processing method. Examples of nucleic processing known to those of ordinary skill in the art include, but are not limited to, reverse transcriptase, primer extension reaction, polymerase chain reaction, multiple displacement amplification, poly(A) tailing, nucleic acid hydrolysis, ligation to an oligonucleotide, modification of 5′- or 3′-end, or both. As in one non-limiting example of the invention that is shown in
Also provided by the present invention is a kit for improved lysis of the cells and release of intracellular nucleic acids, comprising amphiphiles at a concentration between 0.01-50% (w/v). The kit may comprise triterpene glycoside (e.g., saponin). The kit may comprise non-ionic detergents such as ethoxylated nonylphenols (commonly known as different types of Tergitols), octylphenoxypolyethoxy ethanols (commonly known as different types of Tritons and Igepals), polyoxyethylene ethers (commonly known as different types of Brijs) and alkyl polyethylene glycol ethers, at concentration 0.01-50% (w/v). The kit may comprise a mixture of triterpene glycoside and non-ionic detergent(s) and can be supplied in a separate container (tube) or supplemented in the reaction mix. For example, a mixture comprising triterpene glycoside and non-ionic detergent(s) can be supplemented in RT reaction buffer. The said lysis reagents can be part of a kit that is used to achieve increased cDNA yields, and/or to achieve a better capture of transcripts from individual cells. The kit may optionally comprise other reagents such as reverse transcription enzyme, buffer, oligonucleotides, dNTPs, microfluidic chip and/or microfluidics consumables (e.g., tubing, needles, etc.). The kit may further comprise instructions for performance of the methods described herein. Finally, the kit and compositions can be adapted to single-cell RNA-Seq assays on hundreds of thousands of isolated cells and molecular species, in a massively parallel fashion.
As discussed below the cell lysis may be performed within water-in-oil droplet. However, it should be understood that other compartments may be used to perform cell lysis. Those experienced in the field will be able to choose the desirable reaction format and volume for conducting cell lysis. For example, single-cells, or a mixture of cells, can be lysed in the laboratory tubes, well(s) of a microtiter plate, micro- or nano-wells, in microreactors, cages or other compartments that have reaction volumes smaller than 1 millilitre but larger than 1 picolitre (pl), and more preferably cell lysis is performed in the compartments that have volumes larger than 1 pl and smaller than 100 μl and even more preferably having volumes larger than 10 pl and smaller than 1 μl.
The barcoded hydrogel (bead) that is co-encapsulated into the droplet with a cell, lysis reagents and reaction mix may comprise a reagent for use in the processing of the nucleic acid, for example a molecular tag or label, such as a nucleic acid tag or a barcode, an enzyme, an antibody, a lytic reagent, a DNA primer, or a dye. In a preferred embodiment of the methods of the invention the bead comprises an oligonucleotide comprising a barcode.
In particular, a barcode is a user-defined DNA sequence preferably longer than 4 nucleotides but shorter than 100 nucleotides and more preferably in the range of 6-70 nucleotides and even more preferably in the range of 8-16 nucleotides long.
In the methods of the invention discussed herein, where nucleic acid from a plurality of cells is being barcoded, the diversity of unique barcode sequences is at least 100, and more preferably more than 1000, and more preferably more than 10,000 and more preferably more than 100,000 and even more preferably more than 1,000,000 but less than 10{circumflex over ( )}12.
In a preferred embodiment the barcode is comprised in a barcoding DNA oligonucleotide which is 12 to 300 nucleotides in length, preferably 20 to 150 nucleotides in length, and more preferably 30 to 120 nucleotides in length. The barcoding DNA oligonucleotide may further comprise one or more of (i) a unique molecular identifier (UMI), wherein the UMI is a random nucleotide sequence longer than 4 nucleotides but shorter than 50 nucleotides, and preferably in the range of 4-12 nucleotides and still more preferably 8 to 16 nucleotides in length; (ii) a cell barcode preferably longer than 4 but shorter than 100 nucleotides, and more preferably 6 to 70 nucleotides in length, (iii) a sequence able to specifically bind to a region of interest in the nucleic acid (e.g. a poly-T sequence or a gene specific sequence); (iv) an overhang, (e.g. a sticky end) or a blunt end; and (v) an adapter sequence (e.g. a PCR adapter and/or a sequencing adapter and/or hybridization adapter).
The beads used for co-encapsulation may be solid particles comprising a molecular tag [34] or squishy particles (hydrogel beads) comprising a molecular tag [17]. The bead may be hard or soft, it can be made of organic or inorganic material, it can be a solid particle, a hydrogel particle, a hydrogel bead, or composite hydrogel bead. It may also comprise polyacrylamide, agarose, polystyrene and/or poly-N-isopropylacrylamide. Preferably the particle has a size of in the range of 1-100 μm and preferably in the range of 10-80 μm, and more preferably in the range of 20-70 μm, and more preferably approximately 60 μm.
It is preferred that the reagent, and in particular the molecular tag or barcoding DNA oligonucleotide referred to herein, is covalently attached to the bead. However, this is not essential and in other examples, the bead may comprise a short single stranded nucleotide stub to which the reagent base-pairs.
Where the attachment is covalent, it may comprise a cleavable linker, such as a photocleavable linkers, a chemically cleavable linker, or an enzymatically cleavable linker, such that the reagent and in particular the molecular tag or barcoding DNA oligonucleotide referred to herein can be released from the bead by exposing the plurality of droplets to illumination (preferably below 450 nm wavelength), to a chemical agent (e.g. a reducing agent), or an enzyme (e.g. endonuclease), respectively, to release the molecular tag covalently attached to the bead when the bead is in the droplet.
However, it should be noted that the methods described herein do not always involve a step of releasing the reagent (e.g. the molecular tag or barcoding DNA oligonucleotide) from the bead. In particular, the reagent may remain attached to the bead inside the droplet, while the cell is lysed inside the droplet to release the nucleic acids and allow it to come into contact with the reagent. In other embodiments the bead may comprise more than one reagent. In an example of such an embodiment, the bead may comprise two molecular tags, e.g. one barcoding DNA oligonucleotide that is to be attached to DNA released from the lysed cell, and one barcoding DNA oligonucleotide that is to be attached to cDNA produced with a reverse transcriptase reaction with the RNA released from the lysed cell. In such an embodiment it may be desirable for the barcoded DNA to become attached by the barcoding DNA oligonucleotide to the bead, while the barcoded RNA is untethered, or vice versa.
The cells are co-encapsulated in a droplet. The droplet may be a microfluidic droplet, i.e. a droplet generated with a microfluidic device. Preferably the droplet is an oil-in-water droplet generated using a microfluidic device.
Such microfluidic droplets have particular utility in high throughput methods involving a large number of samples of biological species and in particular a large number of cells. Accordingly, the method of the present invention can be readily scaled to attach barcodes to a large number of cells and samples.
One non-limiting example of the invention is shown in
In some embodiments, the encapsulation conditions are chosen such that droplets contain one hydrogel bead. The presence of empty droplets and/or droplets with single hydrogel beads but without cells, and/or droplets with cells but without hydrogel beads, do not substantially affect performance. The co-encapsulated cells and hydrogel beads may be collected in the form of an emulsion and processed according to the aim of the particular application. For example, in one particular embodiment, the RNA of lysed cells is converted into barcoded complimentary DNA upon reverse transcription or other DNA polymerization reaction with barcoded oligonucleotides i.e., introduced with a hydrogel bead.
In one set of embodiments, the nucleic acid molecules may be “barcoded” or include unique sequences attached that can be used to distinguish nucleic acids in a droplet from those in another droplet, or even when the nucleic acids are pooled together. The barcoded oligonucleotides having unique sequences are delivered to individual droplets by hydrogel beads attached thereto and the nucleic acids containing within the droplets are “barcoded” during enzymatic reaction by barcoded oligonucleotides. The barcodes are used to distinguish nucleic acids, e.g., originating from different cells. The barcoded oligonucleotides attached to nucleic acid molecules within a droplet may be distinguishable from oligonucleotide tags in other droplets. Barcoding nucleic acid molecules is important when pooling the nucleic acids from different droplets.
In another set of embodiments, the barcoding DNA oligonucleotide are introduced into the droplets by a hydrogel bead or a solid particle and then released from the bead once the bead is loaded into a droplet. When the hydrogel beads are loaded within the droplets at a density such that one droplet on average contains no more than 1 bead, then once the barcoding DNA oligonucleotide are released from the bead, then most or all of the droplets will contain one unique barcoding DNA oligonucleotide, thus allowing each droplet (and the nucleic acids contained therein) to be uniquely labelled and identified.
It should be understood that the barcoding DNA oligonucleotide introduced into the droplets by a bead (hydrogel bead or a solid particle) may remain attached to the bead once the bead is loaded into a droplet. When the barcoding DNA oligonucleotide remains attached the bead, the nucleic acids (released by lysed cells) will be captured by the said barcoding DNA oligonucleotide attached to the bead.
It should be understood that the barcoding oligonucleotides are initially attached to hydrogel beads in order to facilitate the introduction of only one unique barcoded oligonucleotide type to each droplet. After introducing a hydrogel bead with barcoding oligonucleotides to droplet and releasing them, a droplet will contain a plurality (typically in other range of 10{circumflex over ( )}3-10{circumflex over ( )}9) of barcoded oligonucleotides containing the same unique barcode. Therefore, each barcoding oligonucleotide present within a droplet will be distinguishable from the barcoding oligonucleotide present in the other droplets.
To release barcoding oligonucleotides from the hydrogel-bead a light at approximately 400 nm wavelength range may be applied to cleave the photolabile bond and release barcoding oligonucleotides from the hydrogel. However, it should be understood that this is an example only, and that other methods of cleavage or release can also be used, for example by melting the hydrogel bead with reducing agent [18]. In one set of embodiments, agarose particles containing oligonucleotides may be used, and the oligonucleotides may be released by heating the agarose, e.g., until the agarose at least partially liquefies or softens. However, in some other embodiments, cleavage may be nonessential.
In another set of embodiments, the droplets may be broken to release the barcoded nucleic acids and other contents. The barcoded nucleic acids may then be pooled together and since the barcoded nucleic acids molecules are labelled with different barcoded oligonucleotide tags, the nucleic acids from one droplet (i.e., from one cell) can be distinguished from those from other droplets (i.e. from other cell) by the sequencing of barcoded oligonucleotide tags.
In another aspect, the present invention provides systems and methods for the massively parallel barcoding of DNA or RNA from large numbers of cells. This process may rely on the co-encapsulation of cells along with barcoded nucleic acids, or other suitable oligonucleotide tags attached to hydrogel or polymer beads, together with lysis reagents and/or other reagents that may be used for RNA and/or DNA capture, and/or extension and/or amplification.
In one set of embodiments, the nucleic acid contents of each cell may be labelled with a unique barcode and may allow for hundreds, thousands, or millions of cells to be barcoded in a single experiment for the purpose of determining the cell composition in a population or for screening cell populations.
In yet another aspect, the present invention provides systems and methods for the massively parallel capture, barcoding and quantification of nucleic acid molecules from a large number of single cells, for the purpose of profiling cell populations, characterizing their transcriptome, characterizing their genome, or other purposes.
After purification and optional DNA amplification, the base composition and barcode identity of cellular nucleic acids may be determined, for instance, by sequencing. Alternatively, in some embodiments, DNA oligonucleotides introduced with hydrogel beads may serve as primers for amplification of region of interest in the genomic DNA.
In one non-limiting embodiment, the 3′ end of a barcoded oligonucleotide is terminated with a poly-T sequence that may be used to capture cellular mRNA. The 3′ end of poly-T oligonucleotide may serve as a reverse transcription primer and may be extended to create cDNA. In one set of embodiments, the 3′ end of the barcoded primers may terminate with a random DNA sequence that can be used to capture the RNA or DNA of a cell lysate. In another embodiment, the 3′ end of the barcoded primers may terminate with a specific DNA sequence, e.g., that can be used to capture DNA or RNA species (“genes”) of interest, or to hybridize to a DNA probe that is delivered into the droplets in addition to the hydrogel beads, together with the enzyme reagents. In another embodiment, a hydrogel bead may carry a number of different primers to target several genes of interest. Analytical techniques can be used to analyze, for example, genomes, transcriptomes, epigenomes, single nucleotide polymorphisms, specific gene expression levels, non-coding RNA, etc. However, the invention should not be limited to only these applications.
In some cases, systems and methods revealed here are related to barcoding the specific set of genes (e.g., tens, or hundreds or even thousands of genes) of individual cells with a unique barcode and prepare genetic material of hundreds, thousands, or even hundreds of thousands or more of individual cells in a single experiment. For example, in some applications it may be desirable to analyze a sub-set of genes of interest, for example between tens to hundreds of genes, rather than whole-transcriptome or whole-genome sequencing.
In some cases, systems and methods revealed here are related to barcoding the DNA fragments arising from individual cells with a unique barcode, and prepare barcoded genetic material from hundreds, thousands, or even hundreds of thousands or more of individual cells in a single experiment.
Some embodiments of the invention may be used to quantify protein abundance in single cells in parallel to RNA or DNA, for example, by first treating cells with DNA-tagged antibodies as discussed in references [19, 20]. After sequencing, the data may be split according to the barcodes and provide information about the molecule count, origin of nucleic acids and/or proteins of interest.
The above discussions are non-limiting examples of various embodiments of the present invention. However, other embodiments are also possible. Accordingly, more generally, various aspects of the invention are directed to various systems and methods for barcoding nucleic acids within microfluidic droplet carrying a cell, cell lysis reagents and other assay reagents and a hydrogel bead carrying barcoded oligonucleotides, as discussed below.
In one aspect, the present invention is generally directed to systems and methods for barcoding nucleic acids within a plurality of microfluidic droplets having an average diameter of the droplet of less than 1000 μm, and more preferably having an average diameter of approximately 100 μm.
The barcoding oligonucleotides may be of any suitable length or comprise any suitable number of nucleotides. The oligonucleotide tags may comprise DNA, RNA, and/or other nucleic acids such as PNA, and/or combinations of these and/or other nucleic acids. In some cases, the oligonucleotide tag is single stranded, although it may be double stranded in other cases. For example, the oligonucleotide tag may have a length of at least about 10 nt, at least about 30 nt, at least about 50 nt, at least about 100 nt, at least about 300 nt, at least about 500 nt, at least about 1000 nt, etc. The length of the barcoding oligonucleotide may vary depending on the application and could be either single-stranded or double-stranded.
As described above, the barcoding oligonucleotides may contain a variety of sequences. For example, the oligonucleotides may contain one or more primer sequences, one or more unique or “barcode” sequences, random sequences, degenerative sequences, one or more promoter sequences, one or more spacer sequences, or the like. Other examples include barcoding oligonucleotides may include a poly-A tail, enzyme recognition sequences, or the like. The oligonucleotide tag may also contain, in some embodiments one or more cleavable spacers, e.g., photocleavable linker. The oligonucleotide tag may be attached to a particle chemically (e.g., via a linker) or physically (e.g., without necessarily requiring a linker).
In addition, in one set of embodiments, a plurality of hydrogel beads, for instance, containing barcoding oligonucleotide tags on their surface may be loaded to droplets, e.g., such that, on average, each droplet contains one hydrogel bead, or less in some cases. After being added to the droplet, the barcoding oligonucleotides may be released from the bead, e.g., using light, reducing agents, or other suitable techniques, to allow the oligonucleotides to become present in solution, i.e., within the interior of the droplet. In such fashion, by loading of the hydrogel beads into the droplets in some embodiments, then releasing the oligonucleotides within the droplet, e.g., to interact with nucleotides or other species, such as is discussed herein.
In one set of embodiments, plurality of droplets is formed in a way that majority of individual droplets would contain no more than one cell and a hydrogel bead comprising barcoding oligonucleotides as described above. A wide variety of different techniques for forming droplets are known to those of ordinary skill in the art. For example, a junction of channels may be used to create the droplets. The junction may be, for instance, a T-junction, a Y-junction, a channel-within-a-channel junction (e.g., in a coaxial arrangement, or comprising an inner channel and an outer channel surrounding at least a portion of the inner channel), a cross (or “X”) junction, a flow-focusing junction, or any other suitable junction for creating droplets. See, for example, [37], WO 2004/091763, WO 2004/002627, each of which is incorporated herein by reference.
A relatively large number of droplets may be created, e.g., at least about 100, at least about 500, at least about 1,000, at least about 3,000, at least about 5,000, at least about 10,000, at least about 30,000, at least about 50,000, at least about 100,000 droplets, at least about 300,000 droplets, at least about 500,000 droplets, at least about 1,000,000 droplets, etc.
In yet another set of embodiments, the nucleic acids released from the lysed cells may be tagged with the barcoding oligonucleotide and/or amplified using PCR (polymerase chain reaction) or other suitable amplification techniques. In one set of embodiments, nucleic acid amplification may be performed within the droplets. Those of ordinary skill in the art will be aware of suitable PCR techniques and variations, such as assembly PCR or polymerase cycling assembly, which may be used in some embodiments to produce an amplified nucleic acid. In addition, in some cases, suitable primers may be used to initiate polymerization, or other primers known to those of ordinary skill in the art.
In some cases, the droplets may be broken, or otherwise fused. A wide variety of methods for bursting droplets are available to those of ordinary skill in the art, and the exact method chosen is not critical. For example, droplets contained in a carrying fluid may be disrupted using techniques such as mechanical disruption or ultrasound. Droplets may also be disrupted using chemical agents or surfactants, for example, 1H,1H,2H,2H-perfluorooctanol.
Once droplets are broken the nucleic acids (labelled with barcoding oligonucleotides) may be pooled or combined together, amplified, sequenced, etc. Because nucleic acids from different droplets are tagged with barcoding oligonucleotides the nucleic acid molecules can be computationally deconvoluted.
In one set of embodiments, the microfluidics chip is manufactured from an elastomeric polymer such as polydimethylsiloxane (“PDMS”), polytetrafluoroethylene (“PTFE” or Teflon®), or the like [38]. Other examples of potentially suitable polymers include, but are not limited to, polyethylene terephthalate (PET), polyacrylate, polymethacrylate, polycarbonate, polystyrene, polyethylene, polypropylene, polyvinylchloride, cyclic olefin copolymer, polytetrafluoroethylene, a fluorinated polymer, a silicone such as polydimethylsiloxane, polyvinylidene chloride, bis-benzocyclobutene, a polyimide, a fluorinated derivative of a polyimide, or the like. The device may also be formed from composite materials, for example, a composite of a polymer and a semiconductor material. In a non-limiting example, the PDMS polymer is used which is sold under the trademark Sylgard by Dow Chemical Co., Midland, MI, and particularly Sylgard 182, Sylgard 184, and Sylgard 186.
In a still further aspect, the present invention may provide kits for use in performing the methods described herein. The kits may comprise the lysis reagents, including assay reagents, the beads described herein. In addition, the kits may include one or more microfluidic chips for loading the droplets with cells, lysis and assay reagents and hydrogel beads, as well as microfluidic consumables and appropriate reagents. It should be understood that lysis reagents can be supplied with hydrogel beads or assay reagents or separately.
The following are intended as examples only and do not limit the present disclosure.
The detailed composition and reaction conditions are provided below. Briefly, cell lysis and RT reaction were carried out in bulk with MCF7 and K562 cells. Samples containing 0.02%-0.6% of different cell lysis reagent were evaluated. After the RT reaction, cDNA was pass through a column to remove cell debris and gDNA, purified with 1.2× AMPure, eluted in 20 μl water, amplified by 12-cycles of PCR, purified reaction product with 1.2× AMPure and analyzed on BioAnalyzer chip.
1. Preparation of Detergents:
Different sets of detergents were prepared in DEPC-water. Each set was tested separately. The cDNA yield for each detergent was compared to the cDNA yield obtained with 0.1% NP40 final concentration.
2. Preparation of Cells
K-562 cells (CCL-243, ATCC) were washed 3 times with ice cold PBS+0.04% BSA, counted, diluted to 2.2 million cells per 1 mL and stored on ice.
3. Preparation of RT Reaction Mixture:
“1.54×RT mixture” was prepared for 8 reactions.
4. Setting Up RT Reaction:
5. cDNA Amplification:
Results
The yield of cDNA using different types of cell lysis reagents, or their combinations is reported in Table 5 below. The “Relative lysis efficiency” was estimated by first quantifying the cDNA yield (ng/μl) obtained at a given lysis condition and then dividing the said cDNA yield by the cDNA yield obtained when using 0.1% NP40.
This example illustrates certain techniques for barcoding nucleic acids using lysis reagents revealed in this invention. These examples are better understood together with
The hydrogel bead used in this example contained barcoding DNA sequence which schematically is indicated in
As one non-limiting example, the fluids are delivered into the microfluidics device. The device has four inlets and one outlet. The inlets are used to introduce i) cells, ii) hydrogel beads, iii) lysis reagent and RT reaction mix and iv) carrier oil. Importantly, lysis reagents can be introduced along with hydrogel beads or along with RT reaction mix. The cells, hydrogel beads, and assay reagents can be introduced into a microfluidics chip through either inlet. Droplet generation occurs at or downstream the flow-focusing junction and co-encapsulated cells and hydrogel beads are then collected at the outlet. The flow rate of each inlet can be adjusted in order to obtain optimal flow conditions. The flow rates of all phases can be adjusted independently between 1 and 10,000 ul/h, depending on the particular application.
Once encapsulated, the nucleic acid molecules released from lysed cells can be processed further such as the mRNA molecules can be converted to cDNA using reverse transcription (RT). During the RT step the barcoding oligonucleotides might anneal to poly(A) part of mRNA molecules via 3′ end poly(T) tail, and get extended to cDNA by RT enzyme.
In one particular example the RT reaction mixture was prepared as indicated in Table 6. 150 μl of RT reaction mix was infused into a microfluidics chip alongside with DNA barcoding hydrogel beads and cell suspension in order to perform cell lysis and nucleic acid barcoding.
Droplets, carrying single cells co-encapsulated with hydrogel beads and assay reagents, may be collected off-chip and retain integrity. Droplets collected off-chip can be incubated at desirable temperature for an extended period of time. For example, collected droplets can be incubated at 50° C. for 60 min followed by 15 min incubation at 85° C. in order to perform RT reaction.
To release the barcoded nucleic acid (DNA or RNA) molecules into a bulk solution, droplets can be broken by chemical or physical techniques. Typically, the emulsion droplets are broken using perfluoro-octanol. The water-in-oil droplets in this particular example were broken by adding 10% perfluoro-octanol (Sigma-Aldrich, 370533) onto the collected emulsion. The barcoded nucleic acid molecules (e.g., cDNA) present in aqueous phase of a broken emulsion can be amplified or further processed for sequencing. In one non-limiting example revealed here the aqueous phase of broken emulsion was spun down through Zymo Spin-IC column, the flow-through fraction was collected, purified 2-times with 0.8× AMPure magnetic beads (Beckman Coulter, A63880) and then the barcoded-cDNA was amplified by PCR. Tables 7 and 8 indicate the PCR reaction mixture and cycling conditions for amplifying the barcoded cDNA.
The amplified cDNA can be further processed to construct sequencing library. For example, the barcoded-cDNA amplified by PCR was purified 2-times with 0.6× AMPure magnetic beads, eluted into 20 μl of water and fragmented as follows (Table 9):
Samples were vortexed and spin-down and DNA fragmentation was carried out at 37° C. for 8 min, followed by 30 min at 65° C. After reaction double size selection (0.6×-0.8×SPRI) was performed. Purified fragmented DNA was eluted in 17.5 μL of water and ligated to adapter using following reaction mix (Table 10):
Ligation was carried out at 20° C. for 15 min (lid at 30° C.). After reaction total volume was brought to 100 μL, 0.8× ampure was performed and ligation product was eluted in 40 μL.
Next indexing PCR was conducted by preparing the PCR reaction mix and cycling conditions indicated in Table 11 and 12 below:
The amplified DNA library was purified using double size selection (0.6-0.8× AMPure magnetic beads) and eluted in 15 μL. The DNA library quality was then verified on Agilent BioAnalyzer HS DNA chip (Agilent, 5067-4626). Once DNA library is generated the primary sequence of nucleic acids, including the barcode and unique molecular identifiers can be determined by sequencing. It should be understood that the above example can be used to prepare and analyse, as non-limiting examples, genomes, transcriptomes, epigenomes, single nucleotide polymorphisms, specific gene expression levels, non-coding RNA, entire genes or their sections, etc.
Sequencing of DNA libraries was performed on NextSeq Illumina instrument using NextSeq 2000 P3 reagents (50 Cycles) and following cycling numbers: Read 1-16 cycles, Read 2-58 cycles, i5 read—8 cycles, i7 read—6 cycles. The sequencing data was processed using STARsolo. According to the obtained metrics the single-cell transcriptomics libraries prepared using cholesterol-binding detergent or in combination with non-ionic detergents gave higher percentage of reads mapped to genes and higher fraction of reads mapped to cells. At similar sequencing depth, using saponin alone or in combination with non-ionic detergents generated higher UMI count and gene counts than regular conditions libraries (
This is another example illustrating the advantage of the cell lysis reagents disclosed in this invention for obtaining improved barcoding of nucleic acids of single cells. In an exemplary embodiment, the 5 different types of cells [A549 (CCL-185), HeLa (CCL-2), MDA-MB-231 (HTB-26), PC3 (CRL-1435), K562 (CCL-243)] were mixed at equal proportion and a mixture of cells encapsulated in microfluidic droplets alongside with DNA barcoding hydrogel beads and assay/lysis reagents. In yet another exemplary embodiment, the 5 different types of cells, namely A375, Ker-Tomato, Caki-1, Jurkat, BxPC3 cells were mixed together at equal proportion and a mixture of cells encapsulated in microfluidic droplets along with DNA barcoding hydrogel beads and assay/lysis reagents. The lysis/RT conditions tested comprised different types of lysis agents that are listed in Table 13. The RT reaction mixture was prepared as described above (Table 6).
Following single cell isolation in water-in-oil droplets they were lysed by the lysis agent(s) supplemented in a reaction mixture to release the inner content such as mRNA. The barcoding oligonucleotides attached to hydrogel bead were released by means of photo-illumination (e.g. exposing for 20 s using 405 nm LED from Droplet Genomics). The hydrogel bead used in this example contained barcoding DNA sequence, which schematically is indicated in
To pool the barcoded nucleic acid molecules, droplets were broken by adding 10% perfluoro-octanol onto the post-RT emulsion. The aqueous phase of a broken emulsion was applied through Zymo Spin-IC column, the flow-through fraction was collected and purified 2-times with 0.8× AMPure magnetic beads, and eluted in 20 μl of water. The barcoded-cDNA was then amplified by PCR. Tables 7 and 8 indicate the PCR reaction composition and cycling conditions for amplifying the barcoded cDNA. The barcoded-cDNA amplified by PCR was purified 2-times with 0.6× AMPure magnetic beads, eluted into 20 μl of water and fragmented by assembling reaction indicated in Table 9. Fragmentation was carried out at 37° C. for 8 min, followed by 30 min at 65° C. Next, fragmented DNA was purified with SPRI beads (e.g. using double size selection 0.6-0.8×). Purified fragmented-DNA was eluted in 17.5 μL of water and ligated to adapter by assembling a reaction mix indicated in Table 10. Ligation was carried out at 20° C. for 15 min, purified with 0.8× AMPure magnetic beads and eluted in 40 μL. Finally, indexing PCR was conducted by preparing the PCR reaction mix and cycling conditions indicated in Tables 11 and 12. The amplified DNA library was purified using double size selection (0.6-0.8× AMPure magnetic beads (Beckman Coulter, A63880)) and eluted in 15 μL.
The DNA library quality was then verified on Agilent BioAnalyzer HS DNA chip (Agilent, 5067-4626) and sequenced on NextSeq Illumina instrument using NextSeq 2000 using following cycling numbers: Read 1-16 cycles, Read 2-58 cycles, i5 read—8 cycles, i7 read—6 cycles.
The sequencing data was processed using STARsolo. The solo-in-drops pipeline (github.com/jsimonas/solo-in-drops) was used to obtain prepare raw sequencing data for STARsolo to obtain cells×genes matrices and BAM files. For further analysis filtered matrices outputted by STARsolo default filtering parameters were used. Scanpy package and custom functions from github.com/AllonMKlein/Pfirschke et al 2021 were used for filtering, normalization, selection of high variable genes using Fano factor, visualization with UMAP and clustering with spectral clustering. Doublets were identified using Scrublet package, batch correction were done using harmomy-py package. Differentially expressed gene analysis between different lysis conditions was done by performing two-sided Mann-Whitney U test and p-values were corrected with p-values correction with Benjamini-Hochberg false discovery rate method and family-wise error rate of 0.05. Significantly expressed genes were then sorted based on gene expression log 2-fold change. Gene Set Enrichment Analysis was performed using gseapy package.
To generate sequencing saturation curves BAM files from STARsolo output were used to downsample the sequencing data to a series of different raw read number depth. The downsampling factors used were as follows—0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0. Average UMI and gene counts per cell were generated at each sequencing depth and plotted against average reads per cell at different sequencing depths. Gene biotypes were extracted from GRCh38 reference genome gtf file.
The results presented in
This application claims priority to the U.S. provisional application No. 63/359,854 filed on Jul. 10, 2022.
Number | Date | Country | |
---|---|---|---|
63359854 | Jul 2022 | US |