Claims
- 1. A method of screening an array of samples which comprises obtaining a Raman spectrum of each sample and determining which, if any, of the spectra share a spectral feature.
- 2. The method of claim 1 wherein the spectral feature is unique to a particular form of a compound-of-interest.
- 3. A method of screening an array of samples for the presence of a particular form of a compound-of-interest, which comprises obtaining a Raman spectrum of each sample.
- 4. A method of screening an array of samples for the absence of a particular form of a compound-of-interest, which comprises obtaining a Raman spectrum of each sample.
- 5. The method of claim 1, 2, or 3 wherein the particular form is a solid form.
- 6. The method of claim 5 wherein the solid form is a crystalline or amorphous form.
- 7. The method of claim 1, 2, or 3 wherein the particular form is a hydrated form.
- 8. A system for detecting similarities among a plurality of samples, which comprises:
a) a device for obtaining a spectrum for each sample; and
b) a computer configured to analyze each of the spectra and to generate a plurality of bins, wherein each bin corresponds to samples sharing at least one spectral feature.
- 9. The system of claim 8 wherein the device is an infrared spectrometer, near infrared spectrometer, NMR spectrometer, X-ray diffractometer, neutron diffractometer, light microscope, electron microscope, second harmonic generator, circular dichroism spectrometer, linear dichroism spectrometer, differential scanning calorimeter, thermal gravimetric analyzer, or melting point analyzer.
- 10. The system of claim 8 wherein the device is a Raman spectrometer.
- 11. The system of claim 8 wherein the computer is further configured to generate a binary spectral representation for a spectrum that reflects the presence or absence of a spectral feature.
- 12. The system of claim 8 wherein the computer is configured to mutually compare a plurality of spectra and generate a hierarchical clustering dendrogram.
- 13. The system of claim 8 wherein the computer is configured to cluster the plurality of spectra.
- 14. The system of claim 13 wherein the computer is configured to cluster the plurality of spectra in accordance with iterative k-means clustering.
- 15. The system of claim 8 wherein the computer is configured to cluster the plurality of spectra such that if a majority of spectra obtained from a single sample are assigned to a particular bin, then all spectra from that sample are assigned to that bin.
- 16. The system of claim 8 wherein the computer is configured to assign newly obtained spectra to at least one of the plurality of bins.
- 17. The system of claim 8 wherein the computer is configured to modify, in response to an analysis of newly obtained spectra, at least one of the plurality of bins.
- 18. The system of claim 8 wherein the computer is configured to add, in response to an analysis of newly obtained spectra, at least one bin to the plurality of bins.
- 19. The system of claim 8 wherein the computer is configured to generate a similarity matrix representing the similarity between at least two of the plurality of samples.
- 20. The system of claim 19 wherein the computer is further configured to sort the samples such that they are arranged to reflect their similarity.
- 21. The system of claim 19 wherein the computer is further configured to sort the similarity matrix such that a diagonal in the matrix represents samples exhibiting the greatest similarity.
- 22. A method of detecting similarities among a plurality of samples, which comprises:
a) collecting a spectrum for each of the plurality of samples;
b) calculating a similarity metric between the spectrum of one sample and that of at least one other of the plurality;
c) clustering, based on the similarity metric, the spectra into bins, each bin containing similar spectra; and
d) presenting the clustered spectra with similar spectra located close to each other.
- 23. The method of claim 22 wherein the spectra are preprocessed after they are collected.
- 24. The method of claim 23 wherein the positions of one or more spectral peaks in the preprocessed spectra are used to generate real value vectors.
- 25. The method of claim 24 wherein binary spectra are generated from the vectors.
- 26. The method of claim 22 wherein the spectra are Raman spectra.
- 27. A method of analyzing a plurality of samples, which comprises:
a) analyzing the samples with a spectrometer to produce spectral data;
b) under processor control, identifying similarities between the spectra; and
c) grouping the spectra into bins of similarity.
- 28. The method of claim 27 wherein the spectrometer is a Raman spectrometer.
- 29. A database containing a plurality of spectral samples organized into a plurality of bins, the bins corresponding to a hierarchical organization of the plurality of spectral samples based on pair-wise similarity scores calculated in accordance with a similarity metric.
- 30. The database of claim 29 wherein the similarity metric is a Tanimoto coefficient, Tversky index, Euclidean distance, or Hamming distance.
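By way of illustration only, the steps recited in claims 11, 15, 19, 22, and 30 (binary spectral representation, pair-wise similarity, clustering into bins, and majority assignment) might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the function names, the 10-unit bin width, the 0.5 similarity threshold, and the peak positions are all hypothetical, and single-linkage merging is used here merely as one simple form of hierarchical clustering. Only the Tanimoto coefficient is taken from the claims.

```python
from collections import Counter

def binarize(peaks, lo=0.0, hi=2000.0, width=10.0):
    """Binary spectrum: the set of fixed-width bins that contain a peak.
    The range and bin width are assumptions chosen for illustration."""
    return frozenset(int((p - lo) // width) for p in peaks if lo <= p < hi)

def tanimoto(a, b):
    """Tanimoto coefficient of two binary spectra stored as sets of 'on' bits."""
    union = len(a | b)
    return len(a & b) / union if union else 1.0

def similarity_matrix(spectra):
    """Pair-wise similarity scores between all spectra."""
    return [[tanimoto(a, b) for b in spectra] for a in spectra]

def cluster(spectra, threshold=0.5):
    """Single-linkage agglomerative grouping: two bins are merged whenever
    any pair of their members has similarity >= threshold."""
    sim = similarity_matrix(spectra)
    bins = [[i] for i in range(len(spectra))]
    merged = True
    while merged:
        merged = False
        for x in range(len(bins)):
            for y in range(x + 1, len(bins)):
                if any(sim[i][j] >= threshold for i in bins[x] for j in bins[y]):
                    bins[x] += bins.pop(y)
                    merged = True
                    break
            if merged:
                break
    return [sorted(b) for b in bins]

def majority_bin(labels):
    """Bin held by a strict majority of one sample's spectra, else None."""
    label, count = Counter(labels).most_common(1)[0]
    return label if count > len(labels) / 2 else None

# Hypothetical peak positions for four spectra (arbitrary units).
peaks = [
    [100.0, 520.0, 1001.0],
    [102.0, 523.0, 1004.0],
    [300.0, 750.0, 1600.0],
    [305.0, 751.0, 1450.0],
]
spectra = [binarize(p) for p in peaks]
bins = cluster(spectra, threshold=0.5)
print(bins)
```

With these hypothetical inputs the first two spectra fall into one bin and the last two into another. A Tversky index, Euclidean distance, or Hamming distance could be substituted for `tanimoto`; the distance measures would need the comparison reversed, since smaller values indicate greater similarity.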
Parent Case Info
[0001] This application claims priority to U.S. provisional application Nos. 60/318,152, 60/318,157, and 60/318,138, each of which was filed on Sep. 7, 2001, and each of which is incorporated herein in its entirety.
Provisional Applications (3)
| Number | Date | Country |
| 60318152 | Sep 2001 | US |
| 60318157 | Sep 2001 | US |
| 60318138 | Sep 2001 | US |