Claims
- 1. A method of analyzing biological samples for determinable properties using a computer system comprising:
capturing an image of said sample into a computer system; using said computer system, placing tile outlines over said image according to a placement process; using said computer system, analyzing said image by scoring detectable characteristics of said image within one or more of said tile outlines; and using said computer system, preparing an output from scored detectable characteristics of said image.
- 2. The method according to claim 1 wherein said sample comprises one or more of:
a thin section from a tissue biopsy; a dense cellular monolayer prepared from disaggregated cells; or a smear preparation.
- 3. The method according to claim 1 wherein said image is produced using an extended focus process.
- 4. The method according to claim 1 wherein said image is a two-dimensional image.
- 5. The method according to claim 1 wherein said placement process comprises one or more of:
placing tile outlines such that outlines are abutting; placing tile outlines in a regular grid over said image. placing tile outlines such that outlines are not necessarily abutting; or placing tile outlines such that tile outlines do not necessarily cover said image.
- 6. The method according to claim 1 wherein said placement process comprises an iterative search of said image comprising:
searching said image to find a region of said image providing a desired signal strength of a detectable signal; placing a tile outline over said region, thereby defining a tiled area and a remaining area; and on said remaining area, iteratively continuing said searching and said placing until a stop condition is reached.
- 7. The method according to claim 6 wherein said detectable signal is the total fluorescence intensity of a cellular nuclear stain.
- 8. The method according to claim 6 wherein said searching comprises searching for a tile outline region that produces a highest value of said detectable signal.
- 9. The method according to claim 6 wherein said stop condition comprises determining that a placed tile has a signal value with a predefined relationship to another value.
- 10. The method according to claim 9 wherein said another value is derived from one or more values found for one or more previously placed tiles.
- 11. The method according to claim 1 wherein said output further comprises:
an estimation of gene copy number.
- 12. The method according to claim 1 wherein said output further comprises:
detection of gene amplification.
- 13. The methods according to claim 8 wherein said signal value is measured using fluorescence in situ hybridization probes and/or DAPI.
- 14. The method according to claim 1 wherein said tile outlines are one or more of:
roughly rectangular in shape; roughly polygonal in shape; or roughly circular in shape
- 15. The method according to claim 1 wherein said tile outlines are selected to have an area roughly equal to or slightly larger than a largest cross-sectional area of a largest expected cell in said sample.
- 16. The method according to claim 15 wherein:
said largest expected cell is a tumor cell.
- 17. The method according to claim 1 wherein said analyzing further comprises:
detecting two or more signal values in a placed tile outline; and calculating a value using a ratio of said two or more signal values.
- 18. A method of analyzing a biological sample image comprising:
determining a plurality of subareas over said image; in each subarea, computing a ratio from detectable signals; computing an original histogram of said ratios; computing a normal-corrected histogram of said ratios; and from said normal-corrected histogram, estimating a ratio value for one or more cells in said sample.
- 19. The method according to claim 18 wherein said ratio comprises one or more of:
a first count divided by a second count; a first signal value divided by a second signal value; or a test value divided by a control value.
- 20. The method according to claim 18 wherein said analyzing further comprises:
determining one or more numerical results of said sample.
- 21. The method according to claim 18 wherein said analyzing further comprises:
in each subarea, computing a subarea ratio from said detectable signals; computing a sample histogram of said subarea ratios; determining a normalized reference histogram for said subarea ratios; subtracting said normalized reference histogram from said sample histogram to produce a corrected histogram of said subarea ratios; and estimating a ratio value of said sample from said corrected histogram.
- 22. The method according to claim 21 wherein said computing comprises converting per-subarea data to a sample histogram by allocating subarea data to generally equal-width buckets.
- 23. The method according to claim 22 wherein said estimating comprises detecting a notable shoulder or a second peak to the offset from a normal peak of said sample histogram.
- 24. The method according to claim 21 wherein said determining comprises:
fitting a normalized reference histogram to a normal peak in a mixed tumor sample histogram, and wherein said fitting comprises:
proportionately adjusting counts of reference histogram buckets so that a normalized reference histogram matches said sample histogram in an unamplified region as closely as possible.
- 25. The method according to claim 22 wherein said corrected histogram comprises
a third histogram where a count of every bucket is a difference between corresponding counts of said sample histogram and said reference histogram and wherein if said difference for any bucket is negative, that bucket's value is set to zero.
- 26. The method according to claim 21 further comprising:
estimating an amplified ratio R directly from said corrected histogram by a method comprising:
for each histogram bucket i, letting pi be the proportion of the count remaining after subtracting said normalized reference; estimating the ratio R by R=Σ(piciRi)/Σ(pici);
where i indexes subareas; t indicates test values; c indicates control values; and Ri indicates a ratio ti/ci of values of a single cell (or tile), with Ri set to 1 if ci=0; and further comprising:
verifying an estimated tumor ratio by calculating an amplified tumor proportion; and not reporting the ratio R as verified unless said amplified tumor proportion exceeds a minimum threshold further wherein said amplified tumor proportion P is estimated as P=Σ(pici)/Σci.
- 27. The method according to claim 21 wherein a best shape of a normalized reference histogram is varied from sample to sample.
- 28. The method according to claim 18 wherein said plurality of subareas comprise a plurality of areas containing separated cells.
- 29. The method according to claim 18 wherein said plurality of subareas comprise a plurality of outlines placed in a regular grid.
- 30. The method according to claim 18 wherein said plurality of subareas comprise a plurality of targeted outlines placed by a placement method.
- 31. The method according to claim 1 wherein said analyzing further comprises:
estimating a tumor proportion and a tumor ratio by simultaneous equations.
- 32. The method according to claim 31 wherein said simultaneous equations comprise:
- 33. The method according to claim 32 wherein said squares of per-tile spot counts comprise:
- 34. The method according to claim 33 wherein P and R are determined form the formulas:
- 35. The method according to claim 34 wherein if an estimate of P is >1.0, R=RO is output.
- 36. The method according to claim 34 wherein if an estimate of P is <0.1, then P=0.1 is used to compute R; and
if R is computed to be negative, R=RO is output.
- 37. The method according to claim 44 wherein said plurality of subareas comprise a plurality of areas containing separated cells.
- 38. The method according to claim 44 wherein said plurality of subareas comprise a plurality of outlines placed in a regular grid.
- 39. The method according to claim 44 wherein said plurality of subareas comprise a plurality of targeted outlines placed by a placement method.
- 40. The method according to claim 1 wherein said analyzing further comprises:
using an expectation maximization method to estimate an output from said scored detectable characteristics.
- 41. The method according to claim 1 wherein said analyzing further comprises:
using an expectation maximization method to estimate a tumor proportion and a tumor ratio of said sample.
- 42. A method of analyzing a biological sample image comprising:
determining a plurality of subareas over said image; in each subarea, determining a data set of one or more detectable characteristics; and using an expectation maximization method to estimate an output from said scored detectable characteristics.
- 43. The method according to claim 40 wherein said analyzing further comprises:
using a set of per tile scored detectable characteristic data pairs (ti,ci) representing test and control detectable values in an expectation maximization method.
- 44. The method according to claim 42 wherein said analyzing further comprises:
providing plausible initial starting values to said expectation maximization method, said starting values describing a bivariate probability distribution of data sets in unamplified subareas and describing a bivariate probability distribution of data sets in amplified subareas; comparing a an unamplified subarea's data set with each of said bivariate probability distributions to determine a relative likelihood that said subarea data set was generated by the first probability distribution and a relative likelihood that said subarea data set was generated by the second probability distribution; using said pairs of relative likelihoods for a plurality of subareas as weighting factors in a re-estimation of the parameters of the two generating bivariate probability distributions; using said pairs of relative likelihoods for a plurality of subareas to estimate the relative proportions of each component distribution; iterating the process until the bivariate probability distribution parameters have converged to stable values; after convergence of expectation maximization method, computing a ratio implied by each of the two bivariate probability distributions by dividing each test count mean distribution by the corresponding control count mean; reporting a higher ratio as a Tumor Ratio; and reporting a relative proportion of a corresponding distribution as a Tumor Proportion.
- 45. The method according to claim 44 further wherein:
each bivariate probability distribution used in the expectation maximization method is a product of a univariate Poisson distribution for test values and a univariate Poisson distribution for control values.
- 46. The method according to claim 44 further wherein:
spot counts of any subarea with either a test value of zero or a control value of zero are not used.
- 47. The method according to claim 46 further wherein:
estimation of each univariate Poisson distribution is modified to take account of deliberate exclusion from the set of observed data of any subarea with either a test count value of zero or a control count value of zero; and each univariate Poisson distribution is modified using a Monte Carlo method to generate correction factors between an underlying Poisson mean and the corresponding observed mean when subareas with zero values are excluded.
- 48. The method according to claim 44 wherein said analyzing further comprises:
fitting data with a single bivariate distribution using known statistical techniques.
- 49. The method according to claim 44 further comprising:
comparing a goodness of fit of the single bivariate distribution with the goodness of fit of the mixture of two bivariate distributions by computing the joint likelihood of the data set of all subareas if generated by the single bivariate distribution, and the joint likelihood of the data set of all subareas if generated by the mixture of two bivariate distributions; and if the single bivariate distribution has higher joint likelihood, then reporting the overall ratio RO instead of the higher ratio.
- 50. The method according to claim 44 further comprising:
constraining the fitting process by requiring a ratio of one bivariate distribution to be identically 1.0 after every iteration.
- 51. The method according to claim 44 wherein said plurality of subareas comprise a plurality of areas containing separated cells.
- 52. The method according to claim 44 wherein said plurality of subareas comprise a plurality of outlines placed in a regular grid.
- 53. The method according to claim 44 wherein said plurality of subareas comprise a plurality of targeted outlines placed by a placement method.
- 54. A system for analyzing biologic samples comprising:
an information processor for handling digital data; data storage for storing digital data, including captured image data; a logic module able to analyze said captured image data to estimate observable features of said data; and a set of user interfaces allowing a user to request analysis and/or set analysis parameters and/or review analyzed image data.
- 55. The system of claim 54 wherein said set of user interfaces further comprise:
a gallery review interface.
- 56. The system of claim 54 wherein said set of user interfaces further comprise:
one or more analysis option interfaces.
- 57. The system of claim 56 further wherein said one or more processing option indication comprise:
an indication to perform grid subarea placement; and an indication to perform targeted subarea placement.
- 58. The system of claim 54 further comprising:
an image capture camera operationally connected to said information processor. a light source. one or more light filters. a microscope. a slide handling unit.
- 59. The system of claim 54 further comprising:
one or more subarea placement rule sets stored in said data storage.
- 60. The system of claim 54 further comprising:
one or more analysis logic routines stored in said data storage.
- 61. A system for analyzing biologic samples comprising:
means for capturing digital image data from one or more biologic samples; means for storing digital image data; means for interacting with a user to receive user instructions and user review of image data; and means for logically analyzing said captured digital image data to estimate one or more observable parameters; and means for outputting estimated observable parameters.
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority from U.S. provisional patent application No. 60/349,318, filed Jan. 15, 2002, which is incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60349318 |
Jan 2002 |
US |