Claims
- 1. A method of analyzing DNA sequencing eletropherograms comprising the steps of:
obtaining a signal produced by a sequence of electrophorectically separated DNA fragments; and utilizing an expert system including a knowledge base and an inference engine to determine a plurality of base-calls from said signal.
- 2. The method of claim 1 further comprising the step of assigning corresponding numerical confidences to said base-calls.
- 3. The method of claim 1 further comprising the step of using rules to modify said base estimates.
- 4. The method of claim 1 wherein said inference engine utilizes forward-chaining reasoning.
- 5. The method of claim 1 wherein said inference engine utilizes backward-chaining reasoning.
- 6. The method of claim 1 wherein said step of assigning is based on decision trees.
- 7. The method of claim 1 wherein said step of obtaining comprises obtaining a real-time signal.
- 8. The method of claim 1 wherein said step of obtaining comprises obtaining a stored signal.
- 9. The method of claim 1 wherein said inference engine executes at least one rule for performing color separation.
- 10. The method of claim 1 wherein said inference engine executes at least one rule for subtracting a baseline.
- 11. The method of claim 1 wherein said inference engine executes at least one rule for detecting peaks based on a noise threshold.
- 12. The method of claim 1 wherein said inference engine executes at least one rule for assigning an initial base to each peak which exceeds a height threshold.
- 13. The method of claim 1 further comprising the step of estimating an expected height and width of peaks containing a single base for all migration times.
- 14. The method of claim 1 further comprising the step of assigning an initial number of bases to each peak based on a comparison of said peaks characteristics to an expected singleton peak.
- 15. The method of claim 1 wherein said inference engine executes at least one rule for determining final base-calls.
- 16. The method of claim 1 wherein said inference engine executes at least one rule for determining call amplitudes.
- 17. The method of claim 1 wherein said inference engine executes at least one rule for determining call confidences.
- 18. The method of claim 1 wherein said inference engine executes at least one rule for determining call positions.
- 19. The method of claim 1 wherein said inference engine executes at least one rule for determining dye mobility shifts.
- 20. The method of claim 1 wherein said inference engine executes at least one rule for determining dye spectra.
- 21. The method of claim 1 wherein said inference engine executes at least one rule for determining dye traces.
- 22. The method of claim 1 wherein said inference engine executes at least one rule for determining individual peak locations.
- 23. The method of claim 1 wherein said inference engine executes at least one rule for determining a peak-containing region.
- 24. The method of claim 1 wherein said inference engine executes at least one rule for determining signal derivatives.
- 25. The method of claim 1 wherein said inference engine executes at least one rule for estimating singleton peak properties.
- 26. The method of claim 1 wherein said inference engine executes at least one rule for determining initial base-calls.
- 27. The method of claim 1 wherein said inference engine executes at least one rule for measuring properties of a plurality of peaks to estimate a number of base-calls.
- 28. The method of claim 27 wherein said rule for measuring properties of a plurality of peaks to estimate a number of base calls comprises the steps of:
determining a height for a singleton peak; determining a full width at half height for said singleton peak; determining base spacing; and calculating a number of base-calls by subtracting said full width at half height for said singleton peak from a measured width of a selected one of said plurality of peaks at said half height of said singleton peak to obtain a first value, dividing said first value by said base spacing to obtain a second value, and adding 1 to said second value.
- 29. The method of claim 1 wherein said inference engine executes at least one rule for determining normalized peak skyline, height or width.
- 30. The method of claim 1 wherein said inference engine executes at least one rule for determining a number of strong minima in a second derivative of said signal.
- 31. The method of claim 1 wherein said inference engine executes at least one rule for determining a peak is unusually wide for a predetermined number of bases.
- 32. The method of claim 1 wherein said inference engine executes at least one rule for determining a peak area is small for a predetermined number of bases.
- 33. The method of claim 1 wherein said inference engine executes at least one rule for determining a peak height threshold for one base-call.
- 34. The method of claim 1 wherein said inference engine executes at least one rule for determining a peak area threshold for one base-call.
- 35. The method of claim 1 wherein said inference engine executes at least one rule for determining whether singleton peak widths are accurate.
- 36. The method of claim 1 wherein said inference engine executes at least one rule for subdividing peaks using derivatives.
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuing application of U.S. patent application Ser. No. 09/711,449 filed Nov. 13, 2000, which is a continuing application of U.S. patent application Ser. No. 09/291,679 filed Apr. 14, 1999 which claims benefit under 35 U.S.C. §119(e) of U.S. Provisional Patent Application No. 60/081,990 filed Apr. 16, 1998 entitled EXPERT SYSTEM FOR ANALYSIS OF DNA SEQUENCING ELECTROPHEROGRAMS.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
[0002] Part of the work leading to this invention was made with United States Government funds under Human Genome Project Grant Number DE-FG02-90ER 60985.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60081990 |
Apr 1998 |
US |
Continuations (2)
|
Number |
Date |
Country |
Parent |
09711449 |
Nov 2000 |
US |
Child |
10151795 |
May 2002 |
US |
Parent |
09291679 |
Apr 1999 |
US |
Child |
09711449 |
Nov 2000 |
US |