The volume of genetic and gene expression information that is available for both bulk populations and individual cells has grown to the point where it has become unwieldy for investigators. For example, cellular gene expression data can include gene expression data for thousands of genes (e.g., 10,000-30,000 or more genes), can now be measured for individual cells, and thousand cells can be measured per sample. This has presented a tremendous technical problem in the art of visualizing, analyzing, exploring, and making sense of cellular gene expression data.
For example, with conventional approaches to using computers to facilitate visualization of cellular gene expression data, the visualization is a final end point and the visualization is reached as a result of a user manually writing scripts using the R programming language, which requires the user to have knowledge of different libraries in order to perform data input, reformatting, manipulations, calculations and graphing. These scripts usually have to be customized for particular data sets, and their creation requires expert knowledge of the programming language, existing libraries, and required inputs to produce results. Moreover, such conventional approaches prevent the deep exploration of heterogeneous cell populations.
As a solution to this technical problem, the inventors disclose the application of computer technology using innovative scatterplot displays across various dimensions of cellular expression data including cell (or cell population) view scatterplots in which cells are visualized as individual data points (e.g., gene vs gene scatterplots of cells) and gene view scatterplots in which genes are visualized as individual data points (e.g., cell population vs cell population scatterplots of genes). Gating can be performed within these scatterplots to create cell populations and gene sets respectively that can serve as biologically-relevant dimensions to be added as new data objects in a workspace for use in augmenting the cellular gene expression data and opening new avenues for meaningful investigation. By way of contrast, performing such analysis in an isolated, siloed manner on the basis of individual genes quickly becomes unwieldy, whereas the ability to pivot between cell view scatterplots and gene view scatterplots allows users to find biologically-relevant grouping of genes that can then be further investigated as synthetic parameters of a cell view scatterplot.
As noted above, with conventional visualization systems in the art, the visualization serves as an endpoint in the process and cannot serve as a starting point for further creating further visualization refinements for further investigations. As an example, samples of immune cells from metastatic melanoma patients may contain T cells, and conventional visualization systems in the art would be able only to identify this subset within the immune cells. However, the innovative computer systems described herein allow for deep exploration and analysis of the T cell subset to identify multiple subsets within T cells, for example, T cells which are “exhausted”, track this state to individual genes which can then be targeted to reverse this exhaustion, activate T cells, and thus possibly stimulate an immune response to eradicate the metastases, as explained in greater detail below with reference to example embodiments.
Accordingly, through the innovative visualization techniques described herein, computer technology can be applied to cellular gene expression data to find new relationships between cells and genes and create new associative data structures within the cellular gene expression data that represents these relationships.
Through these and other features, example embodiments of the invention provide significant technical advances in the applied bioinformatics arts.
Processor 102 can take the form of any processor suitable for performing the operations described herein. For example, the CPU of a laptop or workstation would be suitable for use as processor 102. It should be understood that processor 102 may comprise multiple processors, including distributed processors that communicate with each other over a network to carry out tasks described herein (e.g., cloud computing processing resources). Memory 104 can take the form of any computer memory suitable for cooperating with processor 102 in the execution of the tasks described herein. It should be understood that memory 104 may take the form of multiple memory devices, including memory that is distributed across a network. Similarly, database 106 can take the form of any data repository accessible to the processor 102 (e.g., a file system on a computer, relational database, etc.), and it should be understood that database 106 may take the form of multiple distributed databases (e.g., cloud storage). Display 108 can take the form of a computer monitor or screen that is capable of generating the visualizations described herein.
The innovative data analysis and visualization techniques described herein can be performed on cellular gene expression data 112. Cellular gene expression data 112 can be generated by next-generation sequencing (e.g. for the measurement of RNA-Sequencing (RNASeq) and single cell RNA sequencing (scRNA-Seq) among other sequencing approaches). However, this is only an example, and other techniques for generating cellular gene expression data 112 may be employed. Additional examples include polymerase chain reaction approaches including digital droplet and reverse transcriptase. Still more examples include RNA measurement by flow cytometry, and microarrays, among others, that produce data files which contain the quantification of DNA and/or RNA, or through software programs that process the raw read data (primary and secondary analysis) to generate the gene expression data files. Yet another example is gene expression data derived from stochastic labeling of a sample. Examples of stochastically labeled gene expression data can be found in U.S. Pat. Nos. 9,567,645 and 8,835,358 and in patent application Ser. No. 15/715,028, filed Sep. 25, 2017, and entitled “Measurement of Protein Expression Using Reagents with Barcoded Oligonucleotide Sequences”, the entire disclosures of each of which are incorporated by reference.
Furthermore, the innovative data analysis and visualization techniques described herein can be applied to data generated from single cells by a variety of means. Single cell analysis may include the stochastic labeling of nucleic acids or proteins or any combination of proteins and nucleic acids. As an example, the innovative data analysis and visualization techniques described herein may be used to analyze quantitative features of protein density or gene expression or any combination thereof. The innovative data analysis and visualization techniques described herein can also provide for an improved visualization of quantitative data generated by stochastic labeling of a variety of proteins or nucleic acids in single cells. Quantitative values of biological populations (nucleic acids, proteins etc.) in individual cells may be compared with other individual cells, or between cell types or even between methods of generating the data. For example gene expression values may be visualized as a function of the method used of generating the data set. The innovative data analysis and visualization techniques described herein can also be used to compare quantitative data generated by a variety of means described here in ways that provide for the visualization of quantitative biological population data independent of the method of generating such data.
This data 112 can be characterized as a large multi-parameter data set which poses special technical challenges in terms of difficulty in creating meaningful visualizations, particularly when considered with respect to underlying biology so that biologically-relevant information is meaningfully presented to users in a visual manner. For example, the cellular gene expression data may comprise data for large numbers of individual cells and cell populations, with parameters for each cell or cell population that may stretch into 10,000-30,000 or more parameters. Cellular gene expression data 112 can be read out of files in database 106 and loaded into memory 104 as a plurality of data structures 116 to be manipulated by processor 102 during execution of an analysis and visualization program 114. Program 114 may comprise processor-executable computer code in the form of a plurality of processor-executable instructions that are resident on a non-transitory computer-readable storage medium such as memory 104.
For a first axis, the processor selects a gene in the cellular gene expression data 112 based on user input (step 402). For example, the processor can respond to user input to select a gene column in table 200.
For a second axis, the processor selects another gene in the cellular gene expression data 112 based on user input (step 406). For example, the processor can respond to user input to select another gene column in table 200.
At step 410, the process parses the list to find the maximum values for each selected gene (the highest counts). These maximum values are then used by the processor to define the appropriate scales for the X-axis and Y-axis in the scatterplot (step 412). For example, if the maximum value for the X-axis gene is 10, the X-axis scale can be from 0-10. At step 414, the processor draws the scatterplot based on the cell list and the defined scales using the cell's associated count values as X,Y coordinates in the scatterplot. The result is a scatterplot 300 as shown by
Returning to
Also, the cell view mode of
Furthermore, when in cell view mode, it is expected that for many cell populations there will be large numbers of cells where the expression of the selected genes for those cells is at the zero level. This leads to a large clustering of dots 308 at the zero levels 310 and 312 on the X-axis and Y-axis respectively of scatterplot 300 shown by
As shown by the bottom scatterplot in
The cell view scatterplot can also display cell information for selected parameters in the cellular gene expression data 112 other than genes. As indicated above, the cellular gene expression data 112 may include parameters from dimensionality reduction such as those resulting from tSNE, LDA, PCA, etc. and quality control parameters (above threshold, parameters relating to ribosomal RNA (rRNA) abundance, etc.) These parameters can be presented as options on the parameter selection menus 500 and 506.
The cell view user interface of
Through scale controls 1002, the user can adjust the X-axis of the scatterplot in a variety of ways. For example, the X-axis scale can be defined to exhibit a liner scale or some other scale (such as a log 2 scale) (see control 1004). Also, through min/max controls 1006, the user can define the minimum and maximum boundaries on the X-axis. For example, through these controls, the minimum value can be defined to be a value greater than zero, which would remove the zero spike from the histogram and re-present a newly scaled histogram where the distribution of non-zero values across the X-axis can be more clearly seen. Sliders 1008 can provide users with easy control over the transformation variables. Also, it should be understood that additional transformation options can be provided, including user-supplied transforms whose variables can be adjusted by user input (see, e.g., US Pat App Pub 2016/0328249 entitled “Plugin Interface and Framework for Integrating External Algorithms with Sample Data Analysis Software”, the entire disclosure of which is incorporated herein by reference).
A particularly innovative and powerful aspect of the inventive system disclosed herein is the ability to pivot the scatterplot display from a cell view mode to a gene view mode.
With this pivot, with reference to cellular gene expression data 112 such as table 200 from
Returning to
The diagonal where y=x is where genes 1106 are positioned if those genes are equally expressed in both cell populations according to the metric defined via 1120. Thus, the distance of a gene 1106 away from this diagonal in either direction indicates the extent of differential expression of the subject gene 1106 as between two selected cell populations. Given that it is expected that most genes 1106 will not be expressed (or will be only lightly expressed) in many cell populations, it is expected that there will typically be large cluster of genes 1106 in the lower left quadrant of scatterplot 1100.
Furthermore, since users can create multiple, hierarchically-related cell populations that exist as data objects in the workspace as described in connection with
Another powerful and innovative aspect of the gene view mode described herein is an ability for users to gate genes while in the gene view mode to thereby create gene sets.
Gene set controls 1710 (shown in greater detail in
Another powerful and innovative aspect of the visualizations provided by program 116 include an ability to overlap a third dimension on the cell view and/or gene view scatterplots. For example, color coding (e.g., a heatmap) can be applied to the cells 308 or genes 1106 in a scatterplot to provide another dimension to the data presentation. An example of this is shown by
Furthermore, according to another aspect of the disclosed system, reports can be created in a report editor by dragging cell populations from the workspace into the report editor (see
Example Use Case:
As indicated above, the disclosed system provides a powerful mechanism for investigating cellular gene expression data 112 by switching between the cell view mode and gene view mode (or vice versa) while performing gating in those two viewing modes to focus on data of interest.
At step 2104, the processor selects cell view axis parameters in response to user input (e.g., selection of parameters such as genes or gene sets). At step 2106, the processor generates a cell view scatterplot data structure from the cellular gene expression data 112 based on the axis parameters selected at step 2104. This step may involve selecting the columns corresponding to the selected parameters in table 200 to obtain a list of cells and their associated values for each selected parameter. At step 2108, the processor generates the cell view scatterplot 300 from the data within the cell view scatterplot data structure created at step 2106 for presentation to the user.
At step 2110, the processor receives a gate specification with respect to the cell view scatterplot in response to input from a user. This gating creates a cell population (step 2112), where the created cell population gets saved as a new data structure in the workspace. At this point, the user can choose whether to (1) work with a new sample (see step 2114 with progression back to step 2100), (2) define one or more new axis parameters with respect to the current sample while in the cell view mode (see step 2116 with progression back to step 2104), (3) define a new gate with respect to the current sample while in the cell view mode (see step 2118 with progression back to step 2110), or (4) switch to the gene view mode (see step 2120 with progression to step 2122).
At step 2122, the processor pivots the cellular genetic expression data 112 as discussed above. Then, at step 2124, the processor selects cell populations from the workspace in response to user input. At step 2126, the processor generates a gene view scatterplot data structure from the pivoted cellular gene expression data based on the cell populations selected at step 2124. This step may involve selecting the pivoted columns corresponding to the selected cell populations in a pivoted version of table 200 to obtain a list of genes and their associated metrics for each selected cell population. At step 2128, the processor generates the gene view scatterplot 1100 from the data within the gene view scatterplot data structure created at step 2126 for presentation to the user.
At step 2130, the processor receives a gate specification with respect to the gene view scatterplot in response to input from a user. This gating creates a gene set (step 2132), where the created gene set gets saved as a new synthetic parameter in the workspace for association with the cellular genetic expression data. The cellular genetic expression data can thus be augmented with new data values corresponding to the gene set created at step 2132. At this point, the user can choose whether to (1) work with a new sample (see step 2134 with progression back to step 2100), (2) define one or more new cell populations with respect to the current sample while in the gene view mode (see step 2136 with progression back to step 2124), (3) define a new gate with respect to the current sample while in the gene view mode (see step 2138 with progression back to step 2130), or (4) switch to the cell view mode (see step 2140 with progression to step 2104).
Thus,
As an example, a powerful and innovative mode of operation is shown by the example process flow of
For example, through the tools provided herein, a user might be able to analyze cell populations to identify differentially expressed gene sets that are correlated to better chances for survival with respect to a particular Cancer X (which we can label as “Survival Gene Set”). At the same time, a user might be able to analyze cell populations to identify differentially expressed gene sets that are correlated to poor chances for survival with respect to Cancer X (which we can label as “Not Survival Gene Set”). Further still, a user might be able to analyze cell populations to identify differentially expressed gene sets that are correlated to responding well to Therapy Y for Cancer X (which we can label “Therapy Responsive Gene Set”). Then, these gene sets can be used as synthetic parameters in the cell view mode of the system to find cell populations in patients that are genetically predisposed to respond well to treatment by Therapy Y in order to survive Cancer X. For example, the cell view scatterplot can use the Survival Gene Set as one axis parameter (e.g., X-axis parameter) and the Not Survival Gene Set as the other axis parameter (e.g., Y-axis parameter), while using the Therapy Responsive Gene Set as the third dimensional overlay. The resultant scatterplot can show cell populations that will correlate well with both survival and therapy responsiveness (as well as cell populations that do not correlate well with survival or therapy responsiveness).
While the invention has been described above in relation to its example embodiments, various modifications may be made thereto that still fall within the invention's scope. For example, while the example scatterplots shown herein present the X-axis as a horizontal axis and the Y-axis as a vertical axis, it should be understood that some practitioners may find it desirable to tilt the scatterplots. An example would a scenario where the diagonal y=x is deemed biologically important. In such a case, the scatterplot might be tilted so that the y=x diagonal is presented as a horizontal or vertical line rather than a 45 degree line to thereby help focus users on how far data might lie away from the y=x line. Accordingly, it should be understood that these and other modifications to the invention will be recognizable upon review of the teachings herein.
This patent application claims priority to U.S. provisional patent application Ser. No. 62/433,930, filed Dec. 14, 2016, and entitled “Applied Computer Technology for Management, Synthesis, Visualization, and Exploration of Parameters in Large Multi-Parameter Data Sets”, the entire disclosure of which is incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4845653 | Conrad et al. | Jul 1989 | A |
5627040 | Bierre et al. | May 1997 | A |
5739000 | Bierre et al. | Apr 1998 | A |
5795727 | Bierre et al. | Aug 1998 | A |
5962238 | Sizto et al. | Oct 1999 | A |
6014904 | Lock | Jan 2000 | A |
6221592 | Schwartz et al. | Apr 2001 | B1 |
6560546 | Shenk et al. | May 2003 | B1 |
6769030 | Bournas | Jul 2004 | B1 |
6944338 | Lock et al. | Sep 2005 | B2 |
7010582 | Cheng et al. | Mar 2006 | B1 |
7194531 | Donker et al. | Mar 2007 | B2 |
7277938 | Duimovich et al. | Oct 2007 | B2 |
7356598 | Giroir et al. | Apr 2008 | B1 |
7472342 | Haut et al. | Dec 2008 | B2 |
7492372 | Abshear | Feb 2009 | B2 |
8835358 | Fodor et al. | Sep 2014 | B2 |
9567645 | Fan et al. | Feb 2017 | B2 |
9762598 | Jagpal et al. | Sep 2017 | B1 |
20020067358 | Casara et al. | Jun 2002 | A1 |
20030009470 | Leary | Jan 2003 | A1 |
20030078703 | Potts et al. | Apr 2003 | A1 |
20030088657 | Eggers | May 2003 | A1 |
20040019690 | Cardno et al. | Jan 2004 | A1 |
20040061713 | Jennings | Apr 2004 | A1 |
20040161767 | Baldwin et al. | Aug 2004 | A1 |
20040242216 | Boutsikakis | Dec 2004 | A1 |
20040250118 | Andreev et al. | Dec 2004 | A1 |
20050038608 | Chandra et al. | Feb 2005 | A1 |
20050239125 | Hodge | Oct 2005 | A1 |
20050247114 | Kahn et al. | Nov 2005 | A1 |
20050272085 | Hodge | Dec 2005 | A1 |
20060014192 | Hodge | Jan 2006 | A1 |
20060063264 | Turner et al. | Mar 2006 | A1 |
20060148063 | Fauzzi et al. | Jul 2006 | A1 |
20070014305 | Assad | Jan 2007 | A1 |
20070031823 | Bentwich | Feb 2007 | A1 |
20070041395 | Boucek | Feb 2007 | A1 |
20070128633 | Zozulya et al. | Jun 2007 | A1 |
20070219728 | Papageorgiou et al. | Sep 2007 | A1 |
20080097917 | Dicks et al. | Apr 2008 | A1 |
20080109175 | Michalak | May 2008 | A1 |
20080154513 | Kovatchev | Jun 2008 | A1 |
20080212643 | McGahhey et al. | Sep 2008 | A1 |
20080263468 | Cappione et al. | Oct 2008 | A1 |
20090070841 | Buga et al. | Mar 2009 | A1 |
20090192363 | Case | Jul 2009 | A1 |
20090204557 | Zhang | Aug 2009 | A1 |
20090246782 | Kelso et al. | Oct 2009 | A1 |
20090307757 | Groten | Dec 2009 | A1 |
20100042351 | Covey et al. | Feb 2010 | A1 |
20100043047 | Archer et al. | Feb 2010 | A1 |
20100070459 | Zigon et al. | Mar 2010 | A1 |
20100070904 | Zigon et al. | Mar 2010 | A1 |
20100161561 | Moore et al. | Jun 2010 | A1 |
20100254581 | Neeser et al. | Oct 2010 | A1 |
20110066385 | Rajwa et al. | Mar 2011 | A1 |
20110099497 | Fok et al. | Apr 2011 | A1 |
20110191899 | Ainley et al. | Aug 2011 | A1 |
20110282870 | Herzenberg et al. | Nov 2011 | A1 |
20120029832 | Dodgson | Feb 2012 | A1 |
20120140641 | Reese et al. | Jun 2012 | A1 |
20120179779 | Awasthi | Jul 2012 | A1 |
20120214190 | Hou | Aug 2012 | A1 |
20120215481 | Covey et al. | Aug 2012 | A1 |
20120239297 | Yokota et al. | Sep 2012 | A1 |
20120245889 | Zhu et al. | Sep 2012 | A1 |
20130091135 | Yokoi et al. | Apr 2013 | A1 |
20130117298 | Ray | May 2013 | A1 |
20130177933 | Malisauskas | Jul 2013 | A1 |
20130197894 | Sablinski | Aug 2013 | A1 |
20130226813 | Voltz | Aug 2013 | A1 |
20130289925 | Jiang et al. | Oct 2013 | A1 |
20140072189 | Jena et al. | Mar 2014 | A1 |
20140154789 | Polwart et al. | Jun 2014 | A1 |
20140164564 | Hoofnagle et al. | Jun 2014 | A1 |
20140213468 | Ehrenkranz et al. | Jul 2014 | A1 |
20140216128 | Trotter | Aug 2014 | A1 |
20140222866 | Joneja | Aug 2014 | A1 |
20150120883 | Gurtowski | Apr 2015 | A1 |
20150295972 | Hagan | Oct 2015 | A1 |
20150363563 | Hallwachs | Dec 2015 | A1 |
20160122341 | Vakalopoulos et al. | May 2016 | A1 |
20160130574 | Sadekova et al. | May 2016 | A1 |
20160170980 | Stadnisky et al. | Jun 2016 | A1 |
20160243251 | Blainey et al. | Aug 2016 | A1 |
20160328249 | Simm et al. | Nov 2016 | A1 |
20160337786 | Kafle et al. | Nov 2016 | A1 |
20160362408 | Vakalopoulos et al. | Dec 2016 | A1 |
20160370350 | Rajwa et al. | Dec 2016 | A1 |
20170102310 | Xu | Apr 2017 | A1 |
20180010134 | Sharp et al. | Jan 2018 | A1 |
20180340890 | Roederer et al. | Nov 2018 | A1 |
Number | Date | Country |
---|---|---|
62-31815 | Feb 1987 | JP |
63-259442 | Oct 1988 | JP |
05-209821 | Aug 1993 | JP |
2001-511546 | Aug 2001 | JP |
2005-352771 | Dec 2005 | JP |
2006-230333 | Sep 2006 | JP |
2012-505460 | Mar 2012 | JP |
WO 04072866 | Aug 2004 | WO |
WO 13143533 | Oct 2013 | WO |
WO 14022787 | Feb 2014 | WO |
Entry |
---|
Archival Cytometry Standard, Oct. 13, 2010, International Society for Advancement of Cytometry Candidate Recommendation (draft), version 1000929, downloaded from http://flowcyt.sf.net/acs/latest.pdf. |
International Search Report and Written Opinion dated Jun. 16, 2016 in International application No. PCT/US15/65045. |
Amir, El-ad David et al. “viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia.” Nature biotechnology 31.6 (2013): 545-552. |
Bauer et al. (eds.), Clinical Flow Cytometry: Principles and Applications, Williams & Wilkins (1993). |
Hsiao et al. “Mapping cell populations in flow cytometry data for cross-sample comparision using the Friedman-Rafsky test statistic as a distance meaure: FCM Cross-Sample Comparision.” Cytometry, Part A, vol. 89, No. 1, pp. 71-88, Aug. 14, 2015. |
International Search Report for International Application No. PCT/US2017/065987 dated Feb. 23, 2018. |
International Search Report for International Application No. PCT/US2018/034199 dated Jul. 26, 2018. |
Jaroszeski et al. (eds.), Flow Cytometry Protocols, Methods in Molecular Biology No. 91, Humana Press (1998). |
Landy et al. (eds.), Clinical Flow Cytometry, Annals of the New York Academy of Sciences vol. 677 (1993). |
Macosko, Evan Z et al. “Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets.” Cell 161.5 (2015): 1202-1214. |
Newell et. al. Cytometry by Time-of-Flight Shows Combinatorial Cytokine Expression and Virus-Specific Cell Niches within a Continuum of CD8+ T Cell Phenotypes Immunity, 2012. |
Ormerod (ed.), Flow Cytometry: A Practical Approach, Oxford Univ. Press (1994). |
Pawley (ed.), Handbook of Biological Confocal Microscopy, 2nd Edition, Plenum Press (1989). |
R. E. Bellman; Rand Corporation (1957). Dynamic Programming. Princeton University Press. Republished: Richard Ernest Bellman (2003). Dynamic Programming. Courier Dover Publications. & Richard Ernest Bellman (1961). Adaptive Control Processes: a guided tour. Princeton University Press.]. |
Roederer et. al. The genetic architecture of the human immune system: a bioresource for autoimmunity and disease pathogenesis. Cell. Apr. 9, 2015;161(2):387-403. doi: 10.1016/j.cell.2015.02.046. Epub Mar. 12, 2015. |
Roederer et al. “Frequency difference gating: A multivariate method for identifying subsets that differ between samples.” Cytometry. vol. 45, No. 1, pp. 56-64, Aug. 24, 2001. |
Roderer et al. “Probability binning comparison: a metric for quantitating multivariate distribution differences.” Cytometry, vol. 45, No. 1, pp. 47-55, Aug. 24, 2001. |
Shapiro, Howard. Practical Flow Cytometry, 4th ed., Wiley-Liss (2003). |
Shekhar, Karthik et al. “Automatic classification of cellular expression by nonlinear stochastic embedding (ACCENSE).” Proceedings of the National Academy of Sciences 111.1 (2014): 202-207. |
Supplementary European Search Report for Application No. EP 15 86 6701 dated Jun. 21, 2018. |
Tirosh, Itay et al. “Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq.” Science 352.6282 (2016): 189-196. |
Van den Bulcke, T. et al. SynTREN: A Generator of Synthetic Gene Expression Data for Design and Analysis of Structure Learning Algorithms, BMC Bioinformatics, Jan. 26, 2006; vol. 7, No. 43; pp. 1-12. |
Van der Maaten, Laurens, and Geoffrey Hinton. “Visualizing data using t-SNE.” Journal of Machine Learning Research 9.2579-2605 (2008): 85. |
Van Der Maaten, Laurens, Eric Postma, and Jaap Van den Herik. “Dimensionality reduction: a comparative.” J Mach Learn Res 10 (2009): 66-71. |
DeTomaso et al., Aug. 23, 2016, FastProject: a tool for low-dimensional analysis of single-cell RNA-Seq data, BMC Bioinformatics, 17(1):1-12. |
Kiselev et al., Aug. 20, 2016, 9. Seurat: analysis of single cell RNA-seq data, https://scrnaseq-course.cog.sanger.ac.uk/website/seurat-chapter.html, retrieved on Jun. 26, 2020, 37 pp. |
Melamed et al., May 2, 2007, GenePattern: Multiplot (v2), https://www.genepattern.org/modules/docs/Multiplot/2, retrieved on Jun. 25, 2020, 11 pp. |
Roederer, 2001, Probability binning comparison: a metric for quantitating univariate distribution differences, Cytometry, 45:37-46. |
Number | Date | Country | |
---|---|---|---|
20180165414 A1 | Jun 2018 | US |
Number | Date | Country | |
---|---|---|---|
62433930 | Dec 2016 | US |