Claims
- 1. A method for detecting DNA variation in a sample DNA sequence, comprising:
aligning trace data of the sample DNA sequence to trace data of a reference DNA sequence to produce an aligned sample DNA sequence, whereby the trace data of the reference DNA sequence and the trace data of the sample DNA sequence are traces of data for each base in a DNA sequence and whereby the trace data of the reference DNA sequence and the trace data of the sample DNA sequence includes frame numbers; and inputting the trace data of the bases of both the reference DNA sequence and the aligned sample DNA sequence for a particular frame number into a non-linear mathematical function of an anti-correlation calculation scheme for all the frame numbers to produce minimal values at the particular frame number for DNA base trace data of the aligned sample DNA sequence which are not a variation as compared to the reference DNA sequence and to produce values above the minimal values at the particular frame number for DNA base trace data of the aligned sample DNA sequence which are a variation as compared to the reference DNA sequence.
- 2. The method of claim 1, further including comparing the trace data of a reference DNA sequence to trace data of the sample DNA sequence before alignment to determine if there are any Contigs present, whereby a Contig is at least two matches of sequence data between reference DNA sequence data and sample DNA sequence data.
- 3. The method of claim 2, wherein matching parameters used to determine if there are any Contigs present are fragment size and fragment number.
- 4. The method of claim 3, whereby default values for fragment and fragment number indicate that if at least 5 fragments having 12 bases match, the reference and sample sequences are in the same Contig.
- 5. The method of claim 1, wherein during a aligning of the reference DNA sequence trace data to the sample DNA sequence trace data, a general alignment is performed by finding identical bases contained in the reference and sample DNA sequences and then the sample DNA sequence trace data is adjusted to match the sample DNA sequence trace data to the reference DNA sequence trace data.
- 6. The method of claim 5, wherein a more accurate alignment is performed after the general alignment has been performed.
- 7. The method of claim 1, wherein the non-linear mathematical function includes a multiple of a first base trace data from the reference sequence and a second base trace data from the sample sequence, whereby the second base trace data is not of the same base as the base of the first base trace data.
- 8. The method of claim 1, wherein the non-linear mathematical function can be any of the following; a product, division, power, log, exponential operation, trigonometry functions, integration, and deviation, and any type of nonlinear function combination of a product, division, power, log, exponential operation, trigonometry functions, integration, and deviation.
- 9. The method of claim 1, wherein outputs of the non-linear equation are plotted in an electropherogram format to produce a plot which shows the variation of the sample sequence as a well defined peak.
- 10. The method of claim 1, further including normalizing the trace data of the reference DNA sequence and the trace data of the sample DNA sequence after alignment of the trace data of the reference DNA sequence and the trace data of the sample DNA sequence to create normalized trace data for the reference DNA sequence and the sample DNA sequence.
- 11. The method of claim 1, further including comparing intensity of base trace data of the reference DNA sequence relative to intensity of base peak trace data of the sample DNA sequence prior to alignment and ignoring all base peak trace data of the sample DNA sequence which does not show a drop in intensity as compared to the same trace data of the same base peak of the reference DNA sequence, whereby the base peak trace data of the sample DNA sequence which is not ignored indicates a variation in the sample DNA sequence and can be used to confirm a variation found after the alignment process and calculation of the of the non-linear mathematical function an anti-correlation calculation scheme.
- 12. The method of claim 1, further including a scoring system to provide confidence in results of the method.
- 13. The method of claim 1, further including performing the method in a first direction along the sample DNA sequence and performing the method in a reverse direction to the first direction along the sample DNA sequence.
- 14. The method of claim 1, further including comparing the trace data of a reference DNA sequence to trace data of the sample DNA sequence before alignment to determine if there are any Contigs present, whereby a Contig is at least two matches of sequence data between reference DNA sequence data and sample DNA sequence data; wherein during a aligning of the reference DNA sequence trace data to the sample DNA sequence trace data, a general alignment is performed by finding identical bases contained in the reference and sample DNA sequences and then the sample DNA sequence trace data is adjusted to match the sample DNA sequence trace data to the reference DNA sequence trace data; wherein a more accurate alignment is performed after the general alignment has been performed; wherein outputs of the non-linear equation are plotted in an electropherogram format to produce a plot which shows the variation of the sample sequence as a well defined peak; and further including normalizing the trace data of the reference DNA sequence and the trace data of the sample DNA sequence after alignment of the trace data of the reference DNA sequence and the trace data of the sample DNA sequence to create normalized trace data for the reference DNA sequence and the sample DNA sequence.
- 15. A method for detecting DNA variation in a sample DNA sequence, comprising:
comparing intensity of base peak trace data of a reference DNA sequence relative to intensity of base peak trace data of a sample DNA sequence and ignoring all base peak trace data of the sample DNA sequence which does not show a drop in intensity as compared to the same trace data of the same base peak of the reference DNA sequence, whereby the base peak trace data of the sample DNA sequence which is not ignored indicates a variation in the sample DNA sequence.
- 16. A method for detecting insertion and deletion in a sample DNA sequence, comprising:
aligning trace data of a sample DNA sequence to trace data of the reference DNA sequence to produce an aligned sample DNA sequence, whereby the trace data of the reference DNA sequence and the trace data of the sample DNA sequence include frame numbers; chose pairs of data points from any of the trace data for different positions along the trace data of the reference DNA sequence or the sample DNA sequence before alignment, whereby the number of frame numbers between the chosen pairs is the same for each chosen pair; locate the chosen pairs of data points chosen from either the reference DNA sequence or the sample DNA sequence in the aligned sample DNA sequence; measure the number of frame numbers between the located chosen pairs of data points in the aligned sample DNA sequence; and find any dramatic increase and decrease of number of frame number from the measured number of frame numbers of the aligned sample sequence as compared to the same number of frame numbers used when choosing the paired data points from either the reference DNA sequence or the sample DNA sequence before alignment, whereby a dramatic increase is an insertion and a dramatic decrease is a deletion if the chosen points are from the reference DNA sequence and whereby a dramatic increase is a deletion and a dramatic decrease is an insertion if the chosen points are from the sample DNA sequence before alignment.
- 17. The method of claim 16, wherein the same number of frame numbers used when choosing the paired data points of the sample sequence before alignment is ten frame numbers.
- 18. The method of claim 16, wherein the difference between the measured number of frame numbers of the aligned sample sequence and the same number of frame numbers used when choosing the paired data points of the sample sequence before alignment is plotted to show any dramatic increase and decrease in the frame number after alignment to indicate a deletion and insertion.
- 19. The method of claim 16, wherein during a aligning of the reference DNA sequence trace data to the sample DNA sequence trace data, a general alignment is performed by finding identical bases contained in the reference and sample DNA sequences and then the sample DNA sequence trace data is adjusted to match the sample DNA sequence trace data to the reference DNA sequence trace data.
- 20. The method of claim 19, wherein a more accurate alignment is performed after the general alignment has been performed.
- 21. The method of claim 16, wherein said sample DNA sequence qualifies as a heterozygous insertion and deletion; and wherein a correction process is used during alignment of the reference and sample DNA sequences in areas of the sample DNA sequence which are complicated to align in order to remove trace data of the sample DNA sequence which is not required for alignment.
- 22. The method of claim 21, said correction process is a subtraction method between the reference and sample DNA sequences.
- 23. The method of claim 21, wherein an intensity ratio is used for the subtraction process; wherein the intensity ratio is peak intensity value of a base of the sample DNA sequence divided by the peak intensity value of the same base in the reference DNA sequence; and wherein subtraction using the intensity ratio is used where multiple peaks are shown at the same frame number in the sample DNA sequence.
- 24. The method of claim 21, said correction process is a mathematical function in an anti-correlation calculation scheme using the reference and sample DNA sequences.
- 25. The method of claim 21, further including normalizing the trace data of the reference DNA sequence and the trace data of the sample DNA sequence after alignment of the trace data of the reference DNA sequence and the trace data of the sample DNA sequence to create normalized trace data for the reference DNA sequence and the sample DNA sequence.
- 26. The method of claim 21, wherein during a aligning of the reference DNA sequence trace data to the sample DNA sequence trace data, a general alignment is performed by finding identical bases contained in the reference and sample DNA sequences and then the sample DNA sequence trace data is adjusted to match the sample DNA sequence trace data to the reference DNA sequence trace data.
- 27. The method of claim 26, wherein a more accurate alignment is performed after the general alignment has been performed.
Parent Case Info
[0001] This application claims the benefit of and hereby incorporates by reference U.S. Provisional Application No. 60/384,280 filed May 30, 2002.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60384280 |
May 2002 |
US |