Bead-based single cell RNA sequencing technologies (such as Dropseq, Indrops, Seqwell) allow users to rapidly profile the transcriptomes of thousands of individual cells. These methods work by marking the RNA transcripts originating from the same cell with a unique nucleotide-based cellular barcode. By using this barcode, it is possible to determine which profiled RNA transcripts originated from the same cell. However, these methods do not allow researchers to determine from which specific cell each transcript originated. Therefore, bead-based single cell RNA sequencing methods cannot be used to couple transcriptome information to any previously obtained single cell phenotypic data (e.g. cell size, protein expression etc.).
Thus, there is a need for improved single-cell RNA analysis tools.
Disclosed herein are methods, systems, and apparatus for creating a large number (e.g. millions) of microparticles in solution, each of which emits light with unique spectral characteristics. These spectral characteristics, which are generated through the process of optical resonance, allow each particle (a resonator) to act as an optical barcode. In this disclosure, we discuss how the nucleotide-based cellular barcodes used in many single-cell RNA sequencing technologies could be incorporated onto these resonators to create collections of individual particles, each with their own cellular barcode and, simultaneously, their own optical barcode. The methodology we describe establishes a mapping between a set of optical barcodes and nucleotide-based cellular barcodes. By reading out these optical barcodes during the sequencing process, it is then possible to identify from which cell individual RNA transcripts originated.
Many commercially available high throughput single cell RNA sequencing methods can only determine which RNA transcripts came from the same cell. However, they are unable to tell from which individual cell the transcript originated. Therefore, any phenotypic characteristics of the cell cannot be associated with transcriptomic information obtained using these methods.
For example, if a researcher has 10,000 cells, and she measures the fluorescence intensity of a protein of interest in each of these cells, she can obtain quantitative data, reflective of the protein's expression levels in individual cells. However, if she then performs high throughput single cell sequencing using current, commercially available methods she will be able to deduce the transcriptomes of individual cells, but not which transcript is associated with a particular value of fluorescence intensity.
In this disclosure, we show how this challenge can be overcome by creating microparticles each of which possess a unique optical barcode. These barcodes can easily be read using a specific form of microscopy. Furthermore, we disclose how these microparticles can be adorned with nucleotide-based cellular barcodes in a way that establishes a mapping between each particle's optical and cellular barcode. The result is a particle that possesses a nucleotide barcode that can be read optically at speeds of one thousand barcode per second (kilohertz) or faster. We expect these particles to have broad application across the field of single cell analysis.
Thus in one embodiment the invention provides an apparatus for capturing biological material. The apparatus includes: an optically readable capture particle (ORCP) including: one or more optically readable particles (ORPs) each including an optical barcode to identify the ORCP; and a plurality of biological capture sites associated with the one or more ORPs, each of the plurality of biological capture sites including a cellular barcode to identify the ORCP.
In various embodiments of the apparatus, the one or more ORPs may comprise a resonator and gain medium.
In various other embodiments of the apparatus, the one or more ORPs may comprise a microsphere.
In certain other embodiments of the apparatus, the microsphere may be doped with the gain medium.
In still other embodiments of apparatus, the gain medium may include a fluorescent material.
In yet other embodiments of the apparatus, the fluorescent material may be at least one of a fluorescent dye, a quantum dot, or a protein.
In certain other embodiments of the apparatus, the microsphere may include a polystyrene microsphere.
In various other embodiments of the apparatus, the microsphere may have a diameter of at least 3 μm.
In yet other embodiments of the apparatus, the optical barcode may include an emission spectrum having at least one peak.
In still other embodiments of the apparatus, the ORCP may include a plurality of gain media and a plurality of resonators, wherein each of the plurality of gain media includes a different emission spectrum.
In certain other embodiments of the apparatus, the one or more ORPs may include a semiconductor particle which includes the resonator and the gain medium.
In yet other embodiments of the apparatus, the semiconductor particle may be contained within a transparent coating.
In certain other embodiments of the apparatus, the one or more ORPs may include a plurality of semiconductor particles contained within the transparent coating.
In still other embodiments of the apparatus, each of the plurality of semiconductor particles may include a gain medium having a different emission spectrum from the other of the plurality of semiconductor particles.
In various other embodiments of the apparatus, each of the plurality of biological capture sites may include a nucleotide strand configured to capture RNA.
In certain other embodiments of the apparatus, the nucleotide strand may include an oligonucleotide cellular barcode sequence.
In another embodiment the invention provides a method of capturing biological material, including: combining an ORCP and a cell within an aqueous environment, the ORCP comprising: one or more optically readable particles (ORPs) each comprising an optical barcode to identify the ORCP, and a plurality of biological capture sites coupled with the one or more ORPs, each of the plurality of biological capture sites including a cellular barcode to identify the ORCP, and each of the plurality of biological capture sites including a capture site for capture of biological material; reading the optical barcode of the ORCP; identifying a phenotypic property of the cell; capturing contents of the cell such that they interact with the plurality of biological capture sites of the ORCP; and processing the ORCP to identify the contents of the cell associated with the plurality of biological capture sites.
In various embodiments of the method, the one or more ORPs may include a resonator and a gain medium.
In certain embodiments of the method, the one or more ORPs may include a microsphere.
In yet other embodiments of the method, the microsphere may be doped with the gain medium.
In still other embodiments of the method, the gain medium may include a fluorescent material including at least one of a fluorescent dye, a quantum dot, or a protein.
In various embodiments of the method, the microsphere may include a polystyrene microsphere having a diameter of at least 3 μm.
In certain embodiments of the method, the one or more ORCP may include a semiconductor particle.
In various embodiments of the method, the semiconductor particle may include the resonator and the gain medium.
In yet other embodiments of the method, the semiconductor particle may be contained within a transparent coating.
In various other embodiments of the method, the ORCP may include a plurality of semiconductor particles contained within the transparent coating.
In yet other embodiments of the method, each of the plurality of semiconductor particles may include a gain medium having a different emission spectrum from the other of the plurality of semiconductor particles.
In certain embodiments of the method, the optical barcode may include an emission spectrum having at least one peak.
In some embodiments of the method, each of the plurality of biological capture sites may include a nucleotide strand configured to capture RNA.
In various embodiments of the method, the nucleotide strand may include an oligonucleotide cellular barcode sequence.
In certain embodiments of the method, combining an ORCP and a cell within an aqueous environment may further include: combining the ORCP and the cell within an aqueous droplet that is immersed in oil.
In some embodiments of the method, processing the ORCP to identify the contents of the cell associated with the plurality of biological capture sites may further include: identifying the cellular barcode associated with the plurality of biological capture sites.
In still other embodiments of the method, reading the optical barcode of the ORCP may further include: directing a light source at the ORCP; detecting a return light spectrum emitted by the ORCP based on directing the light source at the ORCP; and determining the optical barcode by analyzing the return light spectrum to identify at least one peak within the return light spectrum.
In yet other embodiments of the method, identifying a phenotypic property of the cell may further include: identifying an observable property related to the cell relating to at least one of protein quantification, cell cycle information, gene expression, cell location, cell mass, and intercellular interactions.
In certain other embodiments of the method, identifying a phenotypic property of the cell may further include: directing a fluorescent excitation source at the cell; detecting fluorescent emission light from a fluorescent reporter associated with the cell based on directing the fluorescent excitation source at the cell; and identifying the phenotypic property of the cell based on detecting the fluorescent emission light.
In further embodiments of the method, combining an ORCP and a cell within an aqueous environment may further include: providing a plurality of ORCPs on a surface; placing the cell adjacent the plurality of ORCPs; identifying a location of each of the plurality of ORCPs relative to the cell by reading the optical barcode of each of the plurality of ORCPs; identifying the phenotypic property of the cell; releasing cellular contents from the cell; and processing each of the plurality of ORCPs to identify the contents of the cell associated with the plurality of biological capture sites associated with each of the plurality of ORCPs.
In still another embodiment the invention provides an apparatus for capturing biological material, including: a plurality of optically readable capture particles (ORCPs), each ORCP including a plurality of optically readable particles (ORPs) and a plurality of biological capture sites, each of the plurality of ORPs including an optical barcode to identify the ORCP; and a plurality of biological capture sites coupled to each of the plurality of ORPs, each of the plurality of biological capture sites including a cellular barcode to identify the biological capture site.
In yet another embodiment the invention provides an apparatus for capturing biological material including an optically readable capture particle (ORCP) including: a plurality of optically readable particles (ORPs) and a plurality of oligonucleotide-based cellular barcodes in which an association has been established between an optical barcode of the plurality of ORPs and the oligonucleotide-based cellular barcode, wherein: knowledge of a sequence of the oligonucleotide-based cellular barcode enables a determination of the optical barcode of the plurality of ORPs, or knowledge of the optical barcode of the plurality of ORPs enables a determination of the oligonucleotide-based cellular barcode.
In other embodiments of the apparatus, each of the plurality of ORCPs may include a plurality of resonators wherein each resonator has a different size, and each of the plurality of ORPs may have a different optical barcode based on the different size.
In still other embodiments of the apparatus, each of the plurality of ORPs may include a plurality of resonators and a plurality of gain media and has a same gain medium.
In certain other embodiments of the apparatus, the plurality of ORPs and the plurality of biological capture sites may be embedded within a hydrogel bead.
In various other embodiments of the apparatus, the plurality of ORPs may include a resonator and gain medium.
In yet other embodiments of the apparatus, the plurality of ORPs and the plurality of biological capture sites may be embedded within the hydrogel bead using a microfluidic device, and the microfluidic device may form an emulsion of aqueous based droplets containing the plurality of ORPs and the plurality of biological capture sites, and the droplets may be subsequently cured into hydrogel beads.
In various embodiments, a method of associating an optical barcode with a cellular barcode for an apparatus for capturing biological material may include: forming the cellular barcode through a plurality of rounds of split-and-pool synthesis procedures wherein a known subsection of the cellular barcode associated with an ORP of the one or more ORPs is added during each round; and recording an identity of the optical barcode of the one or more ORPs of the ORCP during each of the plurality of rounds and associating the identity with the added subsection of the cellular barcode.
The foregoing and other aspects and advantages of the invention will appear from the following description. In the description, reference is made to the accompanying drawings which form a part hereof, and in which there is shown by way of illustration preferred embodiments of the invention. Such embodiments do not necessarily represent the full scope of the invention, however, and reference is made therefore to the claims herein for interpreting the scope of the invention.
Further objects, features and advantages of the present disclosure will become apparent from the following detailed description taken in conjunction with the accompanying figures showing illustrative embodiments of the present disclosure, in which:
Complex tissues exhibit an enormous amount of phenotypic heterogeneity. Historically, these individual cells were categorized into groups based on only a few observable properties (e.g. size, shape, presence of stainable markers). Recently, the advent of single cell nucleic acid sequencing technologies has made it possible to quantitatively determine the transcriptome of individual cells, allowing researchers to better appreciate these intercellular differences. Such analysis has become fundamental to our understanding of the signatures of disease at the single cell level.
One high throughput method to perform single cell RNA sequencing involves the co-compartmentalization of individual cells with individual RNA capture microbeads. In one representative design (WO 2015/164212) each of these beads is lined with an RNA capture strand including a PCR amplification region, an oligo(dT) region for mRNA capture, a unique molecular identifier (to calibrate readouts for amplification noise), and a cellular barcode including a specific nucleotide sequence that is designed to be unique to each bead. Ideally there are a large number (e.g. at least several thousand) of beads each with different cellular barcode sequences. In many high throughput, single cell RNA sequencing protocols, each bead is randomly compartmentalized with just a single cell. The cell is then lysed and its RNA is captured by the RNA capture strands attached to the bead. Reverse transcription is then performed on the captured RNA, forming a cDNA library of the captured RNA. All beads are then pooled for in vitro amplification and high throughput sequencing.
Following sequencing analysis, it is possible to determine which RNA transcripts originated from the same cell by analyzing the cellular barcode sequence that was transferred, by reverse transcription, from the RNA capture strand to the cDNA. Unfortunately, while this tells us which transcripts originated from the same cell, it does not tell us which particular cell it came from, since the capture beads were randomly assigned to each cell. This means that current bead-based high throughput RNA sequencing technology does not allow the obtained single transcriptome analysis to be coupled to previously collected data regarding each cell's phenotype (e.g. size, protein expression levels, spatial location etc.). Others have recognized and stated this as a clear limitation of this family of methods (e.g. Zilionis et al. Nature Protocols 12, 44-73, 2017). Overcoming this limitation would allow researchers to couple genotype with phenotype at the single cell level.
One possible solution is to optically encode each RNA capture bead in such a way that the cellular barcode nucleotide sequence can be associated with the optical emission of each bead. Such a method would map each permutation of cellular barcode nucleotides to a unique optical property of the bead, essentially allowing the nucleotide-based cellular barcode sequence to be read optically. Any data obtained from subsequent sequencing could then be associated with the optical property of the bead, which in turn could be linked to phenotypic traits of the sequenced cell.
A potential optical encoding method would be to label each bead separately by using different fluorescent dyes at a range of concentrations so as to modulate their emission spectra in both intensity and center wavelength. However, commercial fluorescence-based systems that rely on this method can create only a few hundred unique optical barcodes; this number is too few to reliably perform high throughput sequencing, which often requires at least several thousand RNA capture beads. This fundamental problem stems from the fact that dyes, fluorescent probes, and even quantum dots have broad emission spectra, making unique optical identification in an unambiguous manner a challenging problem, due to spectral overlap.
Unlike the relatively broadband emission spectra of the aforementioned molecules, an optical resonator, when coupled with an appropriate gain medium, generally has an emission spectrum with a narrow full width at half maximum in the spectral domain, making it ideal for applications in which significant multiplexing is needed. Furthermore, the peak emission wavelength(s) can be coarsely tuned by varying the gain medium or finely tuned by making slight alterations to the size of the cavity responsible for the optical resonance, e.g. while using the same gain medium.
In this patent description, we introduce a method to fabricate RNA capture particles in such a way that their cellular barcode sequence can be mapped to the particle's optical emission spectrum in an unambiguous manner. This allows the nucleotide-based cellular barcode to be deduced by determining the particle's optical emission spectrum, essentially creating an optically readable capture particle (ORCP). In order to achieve a large number of unique emission spectra, we propose either physically coupling a miniature optical resonator along with a gain medium to each RNA capture particle, or using the resonator itself as the RNA capture particle.
An ORCP may include two functional components that are combined into a single physical entity. The first component is generally an RNA capture particle such as those described in WO 2015/164212 and WO 2016/040476. The second component is one or more optically readable particles (ORPs). One embodiment of an optically readable particle includes a micron-scale resonator, which, when optically pumped by an external light source, emits a unique spectral signature that can be measured by a spectrometer. The combination of these two functional parts possesses an unambiguous oligonucleotide cellular barcode as well as an unambiguous optical barcode. Embodiments of the disclosed invention are applicable to particle- or bead-based assays in which cellular material is captured on individual beads. While one particular application includes mapping of a single cell's transcriptome to prior phenotypic information regarding that same cell, in various embodiments the ORCPs may also be used with multiplexed bead-based assays to alternative single-cell profiling platforms such as DNA sequencing, ATAC-seq, and ChIP-seq.
In various embodiments, the invention includes RNA capture microparticles containing both a nucleotide-based cellular barcode and an optical barcode, split-and-pool tools to fabricate such optically-encoded, RNA-capture microparticles, microscopy systems designed to read out the optical barcodes of the capture particles, and/or single cell sequencing apparatus based on optical barcoding readout.
In one embodiment, a resonator particle may be formed from a polystyrene microsphere 120 (the cavity) doped with a fluorescent material such as a fluorescent dye, quantum dots, or protein (the gain medium 130). In each case, the gain medium should have an absorption spectrum spanning wavelengths over which the medium will absorb excitation light. Similarly, it should possess an optical emission spectrum. This encompasses a significant number of materials. Specific examples of fluorescent dyes that could act as a gain media include, but are not limited to, fluorescein isothiocyanate and tetramethylrhodamine. Examples of potential quantum dot gain media include, but are not limited to, PbS quantum dots, CsPbBr3, CH3NH3PbCl3. Examples of protein gain media include, but are not limited, to yellow fluorescent protein and green fluorescent protein (GFP). Typically, the gain medium should be chosen so as to minimize spectral crosstalk with other fluorescent sources that might be used during analysis. For example, if sequencing data is to be obtained from GFP expressing cells, a GFP-based gain medium would be less suitable for cellular identification since optical spillover from the ORCP could influence measurement of the cells' native GFP expression. To prevent ORCP emission spectra interfering with most common cell dyes, most versatility is afforded by using an optical gain medium for the ORCP that is active in the infrared. However, the relatively high cost to performance ratio of detectors at this wavelength make visible gain media more inviting to use when possible. In general, a single resonator will be coupled to a single gain medium. However, the spectral signature of an ORCP that allows it to be uniquely identifiable can provide additional multiplexing ability if it consists of multiple resonators coupled to multiple gain media.
Normally, optical illumination of such a particle by an external light source would produce broad spectral, fluorescence emission, as shown in an exemplary fluorescence emission spectrum 200 in
where the corresponding wavelength is given 2π/k, and l is the polar mode number, a is the microsphere radius, ns and nr are the refractive indices of the microsphere and the surrounding medium respectively, aw are the q-th zeroes of the Airy function with q denoting the possible radial modes. χ=1 for TE modes and χ=1/nr for TM modes.
One of the advantages of using resonant wavelength based barcoding strategies is that it maps discrete (and therefore simple to detect and analyze) values of emission wavelengths to variables n and a that can be continuously varied to generate a diverse set of barcodes. Nevertheless, given a particular resonator and gain material, there are limitations on its size that render it useful for barcoding. One theoretical upper bound on resonator size is the point at which the free spectral range (FSR), which scales approximately as,
becomes too small to be able to distinguish separate peaks, which happens when the spacing is on the order of the emission linewidth. This linewidth is governed by resonator loss due to outcoupled radiation and, when laser emission is occurring, by optical gain generated by the gain medium. While there exist theoretical expressions that describe this linewidth, in practice it is best measured empirically since resonator loss can be significantly affected by scattering and absorption by inherent features of the material used. For a polystyrene microsphere of diameter approximately 10 μm, the energy required to observe a spectrum with sufficient signal to noise ratio of peaks is on the order of 1-10 nJ.
This behavior is observed because the microsphere acts as an optical resonator: polystyrene's relatively high refractive index (˜1.6) confines light within its interior, causing more photons, produced by stimulated emission, to build up within specific optical modes, which are known as whispering gallery modes. The resulting emission from the resonator includes one or more spectrally narrow peaks whose center wavelength is determined by the dimensions of the polystyrene microsphere. Under certain conditions, this resonance phenomenon can even lead to lasing from the microresonator.
A polydisperse collection of such microspheres containing fluorescent emitters may be used as a collection of unique optical barcodes, where the barcode of each microsphere is determined in part by a size (e.g. radius) of the microsphere. Using a variety of different gain media along with microspheres of different sizes, a large number of unique barcodes can be formed, each of which can be read by optically pumping the particle and analyzing its spectral output. Our work has shown that a polydisperse set of beads in a size range of 8 μm to 12 μm and a single fluorescent dye can result in approximately 2,000 distinguishable optical emission spectra (Humar et al., Lab Chip, 2017, 17, 2777). Therefore, by combining multiple particles together, in various embodiments it is possible to form many millions of unique optical barcodes (for example, with 3 different resonators joined together, this would permit formation of C(2000, 3)≈109 different optical barcodes).
In another embodiment, a set of direct-bandgap semiconductor resonator particles with polydisperse sizes are used. In this embodiment, the semiconductor material itself serves as both the gain medium and the microresonator. Interband transitions within the semiconductor may lead to optical gain, and resonance may occur by confinement within the semiconductor both due to its relatively high refractive index (typically >2.5) and its geometric shape (e.g. toroidal, discoid, cuboid, spherical etc.). Suitable semiconductor materials include III-V groups such as InAlGaAs, InGaAsP, and GaN, II-VI groups, and organic semiconductors etc. A polydisperse set of resonator particles can be fabricated by lithographic means (e.g. optical lithography, electron beam lithography, interference lithography etc.) and etched using standard semiconductor fabrication processes. One advantage of many semiconductor materials is their relatively high refractive index, which can allow for the production of narrow peaks in their emission spectra even at diameters smaller than a few microns and thicknesses of a few hundreds of nanometers in a microdisk geometry. Optionally, a transparent coating of material with refractive index lower than that of the semiconductor material can be used to prevent coupling of the optical modes of the particle to the surrounding environment (including other particles). An example of such a coating would be SiO2 although other materials such as Si3N4 are also possible.
Attaching the RNA capture strands to the resonator particle can be performed by a variety of procedures. In some embodiments, a method similar to that disclosed in publication numbers WO 2015/164212 and WO 2016/040476 (which detail techniques to fabricate RNA capture particles) may be used.
In the next step 410, this collection is split into a number of subcollections 420, 422, and 424. For example, if the starting number of resonator particles lined with DNA stubs is 100,000, we might split this collection into 384 subcollections, each of which reside in the well of a 384 well plate. Following this splitting process, approximately 260 particles should reside within each well. To create an associative map between the optical barcode of an ORCP and its oligonucleotide cellular barcode, the splitting process, albeit random, should be performed in a known manner. Therefore, it is necessary to determine which particles were placed in which well.
During this splitting process 410, the optical barcodes are read out and recorded. Once the identities (defined by the emission spectra) of the particles which reside in each well have been determined, an oligonucleotide strand that represents part of the cellular barcode sequence, can be added to each well. A different oligonucleotide sequence, 430, 432, and 434, is added to each well. Next, these strands are joined to the particles' existing DNA stubs to form optical particles with DNA barcodes 440, 442, and 444. At this point, however, there would only be 384 different oligonucleotide barcodes. To achieve a more diverse set of cellular barcodes, these oligonucleotide-tagged resonator particles can then be pooled 450 into a large collection 460, and the procedure repeated 470 for splitting with optical barcode readout 410. After N rounds of this procedure, there would be approximately 384N different nucleotide barcodes. By performing spectral readout of each particle to determine into which well it was deposited, the optical barcode can be associated with the oligonucleotide-based cellular barcode. Therefore, by measuring the output emission spectrum from the RNA capture resonator particle, the nucleotide barcode can be deduced and vice versa.
Following this exemplary procedure, a large collection of ORCPs can be formed. Ideally, each of these particles will have a unique cellular barcode sequence as well as a unique optically readable spectral barcode. Furthermore, by performing a spectral readout of the optical barcode at each stage of the modified split-and-pool synthesis, a mapping between the cellular barcode and the optical barcode is known. In order to ensure there are no repeated optical barcodes, the apparatus may employ a sorting arrangement that is configured to remove any particles with duplicated spectra from the original collection of fabricated particles.
In one exemplary embodiment for the splitting and optical readout process 410, the splitting process could be performed by spectral flow cytometry.
Alternatively, this optical barcode readout could be performed by spectral imaging cytometry.
Optical reporters are used heavily in biological assays to quantitatively measure protein expression. While measurements of the reporters' output can be performed at a single cell level, for example, by using imaging cytometry or flow cytometry, it is not currently possible to associate the results of these assays with single cell RNA sequencing data obtained via high throughput, bead-based methods. The ORCPs disclosed in this invention offer a way to make this association.
Following the reading of both the fluorescent probe and the ORCPs' optical barcodes, the cells may then be lysed, and the contents of each individual cell may be captured by the RNA capture strands of the particular RNA capture laser particle with which the cell is compartmentalized. A cDNA library representing the captured RNA transcripts is then formed, amplified, and sequenced. Since each cDNA strand contains the cellular barcode sequence, it is possible to determine which strands originated from the same cell. Because the RNA capture particles are fabricated with a known mapping between the particle's nucleotide-based cellular barcode and its optical barcode, sequencing of the cellular barcode may also allow for association of this cellular barcode with its associated optical barcode. Since, prior to sequencing the optical barcodes may be read and associated with a particular fluorescent reporter intensity, the single cell RNA content of a particular cell can thus be associated with the phenotypic information gleaned from analysis of the fluorescent reporter.
In droplet-based single cell sequencing methods, individual cells are encapsulated within aqueous droplets along with single RNA capture beads. However, this technique can be inefficient. For example, in a method such as Dropseq (Macosko et al. Cell 161(5), 1202-1214, 2015), the capture rate can be lower than 5% since the likelihood of a single cell being encapsulated with a single RNA capture bead follows Poisson statistics. Furthermore, if two cells are encapsulated with a single bead, the technique will incorrectly determine that the RNA from the two different cells originated from the same cell. If a single cell is encapsulated with two separate beads, the technique will incorrectly determine that the RNA from just that cell came from two separate cells.
If ORCPs are used instead of traditional RNA capture beads, the identity of the RNA capture particles that have been compartmentalized with each cell can be identified by an optical system that pumps the resonator and thus reads its optical barcode (as shown for example in
Similarly, events in which two or more cells are compartmentalized inside the same droplet can be eliminated from subsequent analysis by identifying the RNA capture resonator with which they were co-encapsulated. Events in which this is deemed to occur could be discarded from the final analysis. Therefore, theoretically, the error rate of transcripts from multiple cells being incorrectly identified as coming from just a single cell could be made to approach zero.
One key deficiency in many high throughput sequencing technologies is that they do not preserve spatial information regarding the origin of the cell. The use of laser RNA capture particles can be used to overcome this deficiency. Recently, Stahl et al. (WO 2016/162309 A1 and Science 353(6294), 78-82, 2016) determined the spatial origin of RNA from fixed tissue samples by permeabilizing cells in the tissue and allowing the contained RNA to diffuse onto capture strands lining an adjacent glass slide. However, the spatial resolution of their method was low (on the order of 100 microns) blurring spatial variations in RNA that take place at the single cell level.
The use of ORCPs disclosed in this invention can be used to overcome this problem by attaching these particles to a glass slide at a density dependent on the desired sampling density. In one exemplary embodiment, this attachment might be accomplished by conjugating laser particles coated in streptavidin with a biotin coated slide.
In the preferred embodiment of this invention, the ORCPs that are used may include semiconductor-based resonators. Since their size can be relatively small, a greater spatial resolution of the location of captured RNA can be obtained.
Upon RNA capturing, reverse transcription reagents are then added to the slide with the attached RNA laser capture particles, generating a cDNA library. The tissue sample can then be removed, and the RNA, cDNA hybrids are cleaved from the slide for subsequent in vitro amplification and next-generation sequencing. Following sequencing of the cDNA library generated from the RNA captured on the ORCPs, the position of origin of the initially captured RNA can be deduced. This is performed by associating the cellular barcode, which was transferred to the cDNA strand during reverse transcription of the RNA oligonucleotide capture sequence, with the particle's optical barcode.
Since both the diffusion distance of the RNA released from the tissue and the size of the ORCP can be significantly smaller than the dimensions of a single cell, the method disclosed in this invention can be used to sequence RNA at a spatial resolution smaller than that of a single cell. This invention thus allows cellular and subcellular transcriptome analysis. RNA sequencing data can then be digitally overlaid with image of the tissue section to label each cell (or even each part of the cell) with the RNA transcripts that originated from that location. Such a tool would be of great use in diagnostic pathology.
Turning to
In some embodiments, computing device 910 and/or server 920 can be any suitable computing device or combination of devices, such as a desktop computer, a laptop computer, a smartphone, a tablet computer, a wearable computer, a server computer, a virtual machine being executed by a physical computing device, etc. As described herein, system for capturing and analyzing biological material 904 can present information indicative of an output of system for capturing and analyzing biological material 904, such as an identity of the biological material in the sample, to a user (e.g., researcher and/or physician).
In some embodiments, communication network 906 can be any suitable communication network or combination of communication networks. For example, communication network 906 can include a Wi-Fi network (which can include one or more wireless routers, one or more switches, etc.), a peer-to-peer network (e.g., a Bluetooth network), a cellular network (e.g., a 3G network, a 4G network, a 5G network, etc., complying with any suitable standard, such as CDMA, GSM, LTE, LTE Advanced, WiMAX, etc.), a wired network, etc. In some embodiments, communication network 906 can be a local area network, a wide area network, a public network (e.g., the Internet), a private or semi-private network (e.g., a corporate or university intranet), any other suitable type of network, or any suitable combination of networks. Communications links shown in
In some embodiments, communications systems 1008 can include any suitable hardware, firmware, and/or software for communicating information over communication network 906 and/or any other suitable communication networks. For example, communications systems 1008 can include one or more transceivers, one or more communication chips and/or chip sets, etc. In a more particular example, communications systems 1008 can include hardware, firmware and/or software that can be used to establish a Wi-Fi connection, a Bluetooth connection, a cellular connection, an Ethernet connection, etc.
In some embodiments, memory 1010 can include any suitable storage device or devices that can be used to store instructions, values, etc., that can be used, for example, by processor 1002 to present content using display 1004, to communicate with server 920 via communications system(s) 1008, etc. Memory 1010 can include any suitable volatile memory, non-volatile memory, storage, or any suitable combination thereof. For example, memory 1010 can include RAM, ROM, EEPROM, one or more flash drives, one or more hard disks, one or more solid state drives, one or more optical drives, etc. In some embodiments, memory 1010 can have encoded thereon a computer program for controlling operation of computing device 910. In such embodiments, processor 1002 can execute at least a portion of the computer program to present content (e.g., images, user interfaces, graphics, tables, etc.), receive content from server 920, transmit information to server 920, etc.
In some embodiments, server 920 can include a processor 1012, a display 1014, one or more inputs 1016, one or more communications systems 1018, and/or memory 1020. In some embodiments, processor 1012 can be any suitable hardware processor or combination of processors, such as a central processing unit, a graphics processing unit, etc. In some embodiments, display 1014 can include any suitable display devices, such as a computer monitor, a touchscreen, a television, etc. In some embodiments, inputs 1016 can include any suitable input devices and/or sensors that can be used to receive user input, such as a keyboard, a mouse, a touchscreen, a microphone, etc.
In some embodiments, communications systems 1018 can include any suitable hardware, firmware, and/or software for communicating information over communication network 906 and/or any other suitable communication networks. For example, communications systems 1018 can include one or more transceivers, one or more communication chips and/or chip sets, etc. In a more particular example, communications systems 1018 can include hardware, firmware and/or software that can be used to establish a Wi-Fi connection, a Bluetooth connection, a cellular connection, an Ethernet connection, etc.
In some embodiments, memory 1020 can include any suitable storage device or devices that can be used to store instructions, values, etc., that can be used, for example, by processor 1012 to present content using display 1014, to communicate with one or more computing devices 910, etc. Memory 1020 can include any suitable volatile memory, non-volatile memory, storage, or any suitable combination thereof. For example, memory 1020 can include RAM, ROM, EEPROM, one or more flash drives, one or more hard disks, one or more solid state drives, one or more optical drives, etc. In some embodiments, memory 1020 can have encoded thereon a server program for controlling operation of server 920. In such embodiments, processor 1012 can execute at least a portion of the server program to transmit information and/or content (e.g., information regarding the virtual lens, the desired intensity pattern, the phase mask, any data collected from a sample that is illuminated, a user interface, etc.) to one or more computing devices 910, receive information and/or content from one or more computing devices 910, receive instructions from one or more devices (e.g., a personal computer, a laptop computer, a tablet computer, a smartphone, etc.), etc.
In some embodiments, any suitable computer readable media can be used for storing instructions for performing the functions and/or processes described herein. For example, in some embodiments, computer readable media can be transitory or non-transitory. For example, non-transitory computer readable media can include media such as magnetic media (such as hard disks, floppy disks, etc.), optical media (such as compact discs, digital video discs, Blu-ray discs, etc.), semiconductor media (such as RAM, Flash memory, electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), etc.), any suitable media that is not fleeting or devoid of any semblance of permanence during transmission, and/or any suitable tangible media. As another example, transitory computer readable media can include signals on networks, in wires, conductors, optical fibers, circuits, or any suitable media that is fleeting and devoid of any semblance of permanence during transmission, and/or any suitable intangible media.
The apparatus and methods disclosed herein are not limited to the capture of RNA from single cells. In another embodiment, ‘accessible chromatin in single cells using sequencing’ (scATAC-seq) (Sapathy et al. Nature Biotechnology 37, 925-936 (2019)) is used with ORCPs to measure chromatin expression and associate that with some cellular/nuclear phenotypic property. chromatin is instead captured by the capture sites of the ORCP. This can be accomplished by isolating cells or cell nuclei from a cell suspension and performing a bulk transposition with transposase Tn5. This enzyme cuts and ligates adapter sequences to the nuclear chromatin, in open, accessible regions of DNA. The single cells/nuclei are then isolated with ORCPs, for example, in a water-in-oil droplets (
Each of the following references is incorporated herein by reference in its entirety:
It will be apparent to those skilled in the art that numerous changes and modifications can be made in the specific embodiments of the invention described above without departing from the scope of the invention. Accordingly, the whole of the foregoing description is to be interpreted in an illustrative and not in a limitative sense.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 62/748,849 filed on Oct. 22, 2018, and entitled “Multiplexed Single-Cell Analysis Using Optically-Encoded RNA Capture Particles,” which is incorporated by reference herein in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US19/57320 | 10/22/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62748849 | Oct 2018 | US |