Claims
- 1. A method of resolving a mixture comprising DNA of more than one individual into genotype profiles for individuals in the mixture comprising:
(a) obtaining quantitative allele peak data at a first locus; (b) solving a best fit mass ratio coefficient vector using the quantitative allele peak data obtained in step (a) for allele combinations that can be contributed by the individuals; (c) calculating residuals for the allele combinations of step (b); (d) selecting an allele combination for the individuals at the first locus having the smallest residual, wherein the smallest residual does not cluster with the second smallest residual, wherein the allele combination selected comprises the genotype profiles of the individuals.
- 2. The method of claim 1 wherein the quantitative allele peak data are measurements of peak heights.
- 3. The method of claim 1 wherein the quantitative allele peak data are measurements of peak areas.
- 4. The method of claim 1 wherein the quantitative allele peak data are measurements of optical densities.
- 5. The method of claim 1 wherein the quantitative allele peak data have a known theoretical relationship to the masses of alleles present at the first locus in the mixture and wherein the quantitative allele peak data approximately satisfy the theoretical relationship.
- 6. The method of claim 1 wherein the residuals are a sum of squared differences between components of a quantitative allele peak data vector and components of a predicted allele peak data vector are calculated using the best-fit mass ratio coefficient.
- 7. The method of claim 1 wherein the residuals are squares of norms of vector differences between a quantitative allele peak data vector and predicted allele peak data vectors.
- 8. The method of claim 1 wherein the quantitative allele peak data are measured following an amplification reaction.
- 9. The method of claim 1 wherein the first locus harbors short tandem repeats (STRs).
- 10. The method of claim 1 wherein the first locus is selected from the group consisting of CSF1PO, FGA, TH01, TPOX, VWA, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, and D21S11.
- 11. The method of claim 1 wherein the steps of obtaining, solving, calculating, and selecting are repeated for a second locus.
- 12. The method of claim 1 further comprising the step of:
(e) matching one of the genotype profiles with information about a subject.
- 13. The method of claim 12 wherein the step of matching is performed by:
obtaining a profile of the subject; and comparing the profile of the subject to the genotype profiles of the individuals in the mixture.
- 14. The method of claim 12 wherein the step of matching is performed by searching for a match to at least one of the genotype profiles of the individuals in the mixture against the contents of a database of profiles of individuals.
- 15. The method of claim 14 wherein the database is a convicted offenders DNA database.
- 16. The method of claim 14 wherein the database is a forensic database.
- 17. The method of claim 14 wherein the database is implemented using any version of the Combined DNA Index System (CODIS) software.
- 18. The method of claim 11I wherein genotype profiles of the first locus and the second locus are compiled to form a composite genotype profile.
- 19. The method of claim 1 further comprising the step of:
(e) comparing a profile of one individual whose DNA is known to be in the mixture to the genotype profiles of the individuals selected in step (d).
- 20. A method of analyzing quantitative allele peak data from a sample comprising DNA of more than one individual into genotype profiles for individuals in the sample comprising:
(a) solving a best fit mass ratio coefficient vector using the allele peak data for allele combinations at a first locus that can be contributed by the individuals; (b) calculating residuals for the allele combinations of step (a); and (c) selecting an allele combination for the individuals at the first locus having the smallest residual, wherein the smallest residual does not cluster with the second smallest residual, wherein the allele combination selected comprises the genotype profiles of the individuals.
- 21. The method of claim 20 wherein the allele peak data is obtained for a locus selected from the group consisting of CSF1PO, FGA, TH01, TPOX, VWA, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, and D21S11.
- 22. The method of claim 20 wherein the steps of, solving, calculating and selecting are repeated for a second locus.
- 23. The method of claim 22 wherein genotype profiles of the first locus and the second locus are compiled to form a composite genotype profile.
- 24. A method of remotely accessing a software application in a secure manner for resolving a mixture of DNA comprising the steps of:
hosting said software application on a secure server; accessing said software application from a client remotely via a network; protecting the secure server and the client via a firewall; transmitting DNA mixture data to the secure server; and receiving analysis results from the secure server at the client.
- 25. The method of claim 34 wherein said network is a public IP network.
- 26. The method of claim 34 wherein said network is a private wide area network.
- 27. The method of claim 34 wherein said DNA mixture data is encrypted for transmission.
- 28. The method of claim 34 wherein said secure server uses an encrypted communications protocol.
- 29. The method of claim 34 wherein said DNA mixture data is compressed for transmission.
- 30. The method of claim 34 that further comprises the step of accessing a national DNA database via a secure database server.
- 31. A method of generating genotype profiles for individuals who contribute DNA to a sample comprising DNA of more than one individual comprising.
(a) obtaining quantitative allele peak data for a set of more than one loci in the sample; (b) separately assigning the quantitative allele peak data for each locus of the set of loci to allele combinations that can comprise the genotype profiles of the individuals at each locus of the set of loci; (c) separately computing a residual error and a mass ratio for the allele combinations that can comprise the genotype profiles of the individuals at each locus of the set of loci; and (d) selecting the allele combinations for each locus of the set of loci, wherein the mass ratio for the allele combinations selected are consistent, wherein the residual error for the allele combinations selected is the smallest or second smallest residual error, and wherein the allele combinations selected comprise the genotype profiles of the individuals who contribute DNA to the sample.
- 32. The method of claim 31 wherein the quantitative allele peak data are measurements of peak heights.
- 33. The method of claim 31 wherein the quantitative allele peak data are measurements of peak areas.
- 34. The method of claim 31 wherein the quantitative allele peak data are measurements of optical density.
- 35. The method of claim 31 wherein the quantitative allele peak data have a known theoretical relationship to the masses of alleles present in the sample and wherein the quantitative allele peak data approximately satisfy the theoretical relationship.
- 36. The method of claim 31 wherein the residual error for the allele combinations is a sum of the squared differences between components of a quantitative allele peak data vector and components of a predicted allele peak data vector are calculated using the mass ratio.
- 37. The method of claim 41 wherein the residual error for the allele combinations is a norm of vector difference between a quantitative allele peak data vector and a predicted allele peak data vector.
- 38. The method of claim 31 wherein at least one locus of the set of loci is selected from the group consisting of: CSF1PO, FGA, TH01, TPOX, VWA, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, and D21S11.
- 39 The method of claim 31 where at least one locus of the set of loci harbor short tandem repeats (STRs).
- 40. The method of claim 31 wherein at least one locus of the set of loci comprises three alleles.
- 41. The method of claim 31 wherein at least one locus of the set of loci comprises four alleles.
- 42. The method of claim 31 wherein each locus of the set of loci comprise either three or four alleles.
- 43. The method of claim 31 wherein step (d) further comprises use of least squares analysis.
- 44. A method of analyzing least square deconvolution output data wherein the data include a mass ratio and residual for allele combinations at a first locus in a set of loci in a sample comprising DNA of two individuals comprising:
(a) preliminarily selecting either a genotype combination for the two individuals having a residual that is smallest, wherein the smallest residual does not cluster with the second smallest residual, or preliminarily selecting more than one genotype combination for the two individuals, wherein the more than one genotype combination comprises residuals that are smallest and that cluster; and (b) determining the genotype combination for the two individuals from that preliminarily selected, wherein the genotype combination determined has a mass ratio consistent with that of a second locus determined for the sample.
- 45. The method of claim 44 wherein the step of determining is further verified by checking against a known profile of one of the two individuals with DNA in the mixture.
Government Interests
[0001] The U.S. Government retains certain rights in this invention due to funding provided by contract J-FBI-98-083 awarded by the Federal Bureau of Investigation.