Claims
- 1. A computer-implemented method for time-aligning at least two chromatography-mass spectrometry data sets, each comprising a plurality of mass chromatograms, said method comprising:
a) computing a distance function between said data sets in dependence on at least two mass chromatograms from each data set; and b) aligning said data sets by minimizing said distance function to obtain aligned data sets.
- 2. The method of claim 1, wherein one of said data sets is a reference data set and one of said data sets is a test data set, and wherein said test data set is aligned to said reference data set.
- 3. The method of claim 1, wherein said data sets are liquid chromatography-mass spectrometry data sets.
- 4. The method of claim 1, wherein said distance function is computed in dependence on between about 200 and about 400 mass chromatograms from each data set.
- 5. The method of claim 1, further comprising selecting said at least two mass chromatograms according to a selection criterion.
- 6. The method of claim 1, wherein said distance function is computed in dependence on a chromatogram-dependent weighting factor.
- 7. The method of claim 6, wherein said chromatogram-dependent weighting factor is a function of at least one of a peak number, an intensity threshold, and a signal-to-noise ratio.
- 8. A plurality of chromatography-mass spectrometry data sets aligned according to the method of claim 1.
- 9. A program storage device accessible by a processor, tangibly embodying a program of instructions executable by said processor to perform method steps for a method for time-aligning chromatography-mass spectrometry data sets, each comprising a plurality of mass chromatograms, said method steps comprising:
a) computing a distance function between said data sets in dependence on at least two mass chromatograms from each data set; and b) aligning said data sets by minimizing said distance function to obtain aligned data sets.
- 10. A method for comparing at least two samples, comprising:
a) performing chromatography-mass spectrometry on each sample to obtain at least two data sets, each comprising a plurality of mass chromatograms; b) computing a distance function between two selected data sets in dependence on at least two mass chromatograms from each selected data set; c) aligning said selected data sets by minimizing said distance function to obtain aligned selected data sets; and d) comparing said aligned selected data sets.
- 11. The method of claim 10, wherein one of said selected data sets is a reference data set and another of said selected data sets is a test data set, and wherein said test data set is aligned to said reference data set.
- 12. The method of claim 10, wherein said chromatography-mass spectrometry is liquid chromatography-mass spectrometry.
- 13. The method of claim 10, further comprising aligning two additional data sets, wherein at least one of said additional data sets differs from said selected data sets.
- 14. The method of claim 10, further comprising selecting said at least two mass chromatograms according to a selection criterion.
- 15. The method of claim 14, wherein said selection criterion is a user-provided selection criterion.
- 16. The method of claim 14, wherein said selection criterion comprises an intensity threshold.
- 17. The method of claim 14, wherein said selection criterion comprises a number of chromatograms.
- 18. The method of claim 14, wherein said selection criterion comprises an orthogonality metric.
- 19. The method of claim 14, wherein said selection criterion comprises a retention time range.
- 20. The method of claim 10, wherein said distance function is computed in dependence on between about 200 and about 400 mass chromatograms.
- 21. The method of claim 10, wherein said distance function is computed in dependence on between about 200 and about 300 mass chromatograms.
- 22. The method of claim 10, wherein said distance function is computed in dependence on about 200 mass chromatograms.
- 23. The method of claim 10, wherein said distance function is computed in dependence on a weighting factor.
- 24. The method of claim 23, wherein said weighting factor is a chromatogram-dependent weighting factor.
- 25. The method of claim 24, wherein said chromatogram-dependent weighting factor is a function of at least one of a peak number, an intensity threshold, and a signal-to-noise ratio.
- 26. The method of claim 10, further comprising identifying features that differentiate said aligned selected data sets.
- 27. A plurality of samples compared according to the method of claim 10.
- 28. A method for identifying a biomarker differentiating two cohorts, comprising:
a) comparing at least two samples according to the method of claim 10, at least one each of said samples representing a different one of said two cohorts; and b) identifying a biomarker in dependence on said comparison.
- 29. A biomarker identified by the method of claim 28.
- 30. A diagnostic method comprising detecting a biomarker identified by the method of claim 28.
- 31. A computer-implemented method for time-aligning at least two two-dimensional chromatography-mass spectrometry data sets, comprising:
a) selecting peaks in said data sets; b) identifying potentially corresponding peaks from said selected peaks; and c) performing a locally-weighted regression smoothing on said potentially corresponding peaks to obtain aligned data sets.
- 32. The method of claim 31, wherein one of said data sets is a reference data set and one of said data sets is a test data set, and wherein said test data set is aligned to said reference data set.
- 33. The method of claim 31, wherein said data sets are liquid chromatography-mass spectrometry data sets.
- 34. The method of claim 31, wherein said locally-weighted regression smoothing is a robust locally-weighted regression smoothing.
- 35. The method of claim 34, wherein said robust locally-weighted regression smoothing comprises robust LOESS.
- 36. The method of claim 31, wherein said peaks are selected automatically.
- 37. The method of claim 31, wherein said locally-weighted regression smoothing is performed in dependence on a span.
- 38. A plurality of chromatography-mass spectrometry data sets aligned according to the method of claim 31.
- 39. A program storage device accessible by a processor, tangibly embodying a program of instructions executable by said processor to perform method steps for a method for time-aligning two-dimensional chromatography-mass spectrometry data sets, said method steps comprising:
a) selecting peaks in said data sets; b) identifying potentially corresponding peaks from said selected peaks; and c) performing a locally-weighted regression smoothing on said potentially corresponding peaks to obtain aligned data sets.
- 40. A method for comparing at least two samples, comprising:
a) performing chromatography-mass spectrometry on each sample to obtain at least two two-dimensional data sets; b) selecting peaks in two selected data sets; c) identifying potentially corresponding peaks from said selected peaks; d) performing a locally-weighted regression smoothing on said potentially corresponding peaks to obtain aligned selected data sets; and e) comparing said aligned selected data sets.
- 41. The method of claim 40, wherein one of said selected data sets is a reference data set and another of said selected data sets is a test data set, and wherein said test data set is aligned to said reference data set.
- 42. The method of claim 40, wherein said chromatography-mass spectrometry is liquid chromatography-mass spectrometry.
- 43. The method of claim 40, further comprising aligning two additional data sets, wherein at least one of said additional data sets differs from said selected data sets.
- 44. The method of claim 40, further comprising identifying features that differentiate said aligned selected data sets.
- 45. A plurality of samples compared according to the method of claim 40.
- 46. A method for identifying a biomarker differentiating two cohorts, comprising:
a) comparing at least two samples according to the method of claim 40, at least one each of said samples representing a different one of said two cohorts; and b) identifying a biomarker in dependence on said comparison.
- 47. A biomarker identified by the method of claim 40.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional Application No. 60/379,003, “Methods for Time-Aligning of Liquid Chromatography-Mass Spectrometry Data for Differential Phenotyping,” filed May 9, 2002, which is incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60379003 |
May 2002 |
US |