Claims
- 1. A method for generating a custom database of sequences comprising:
a) providing a database of sequences; b) providing at least one sequence region in the database having a highly conserved start sequence and a highly conserved end sequence; c) providing at least one validation condition for said sequence region; d) comparing at least one selected input sequence to said at least one validation condition to determine whether the input sequence is a valid input sequence; and e) adding valid input sequences to the custom database.
- 2. The method of claim 1, wherein said selected input sequence includes characters constituting wildcards and wherein said at least one validation condition comprises in the input sequence and a threshold for allowable wildcards when adding a sequence.
- 3. The method of claim 2, wherein said at least one validation condition comprises a threshold for an allowable number of wildcards.
- 4. The method of claim 1, wherein said at least one validation condition comprises a threshold for the number of characters in a character run in the input sequence.
- 5. The method of claim 1, wherein said at least one validation condition comprises the presence of the highly conserved start sequence and a highly conserved end sequence in the input sequence.
- 6. The method of claim 1, including the step of obtaining the at least one input sequence of step d) from an external database.
- 7. The method of claim 6, wherein said external database is selected from the group of GenBank and TIGR.
- 8. The method of claim 6, wherein said external database comprises GenBank.
- 9. The method of claim 1 including the step of performing selected biological identification techniques to identify the at least one selected input sequence and the step of adding the at least one input sequence of step d) from the input sequence identified by the selected biological identification techniques.
- 10. The method of claim 1, comprising the step of identifying the selected input sequence as an invalid sequence if the input sequence fails to meet the at least one validation condition.
- 11. A method for generating a custom database of sequences comprising:
a) providing a first database of existing sequences; b) comparing a selected isolated sequence to the existing sequences in the database; c) identifying the isolated sequence as a new sequence if the isolated sequence is different from the existing sequences in the first database; d) comparing the new sequence with an external database of sequences to identify the new sequence as an identified new sequence when the new sequence is the same as one of the sequences in the external database; e) comparing the identified new sequence with selected validation criteria to determine whether the identified new sequence is a valid new sequence for the first database of sequences; and f) updating the first database of sequences to include the identified new sequence if the identified new sequence is a valid new sequence.
- 12. The method of claim 11 including the step of identifying the isolated sequence as an existing sequence if the isolated sequence is the same as one of the existing sequences in the first database.
- 13. The method of claim 11 wherein the isolated sequence is compared to selected input validation criteria to determine whether the isolated sequence is a proper sequence for comparison to the first database of existing sequences.
- 14. The method of claim 13 including the step of identifying the isolated sequence as an improper sequence if the isolated sequence fails to meet the selected input validation criteria.
- 15. The method of claim 13 including the step of identifying the isolated sequence as an existing sequence if the isolated sequence is the same as one of the existing sequences in the final database.
- 16. The method of claim 11 wherein the step of comparing the new sequence with the external database of sequences includes the step of designating the new sequence to be an unknown sequence if the new sequence is different from the sequences of the external database.
- 17. The method of claim 16 including the step of performing selected biological identification techniques on a sample containing the unknown sequence to identify the unknown sequence as the identified new sequence if the sample containing the unknown sequence is identifiable from the biological identification techniques.
- 18. The method of claim 11 wherein the external database includes GenBank.
- 19. The method of claim 11 wherein the external database is selected from the group of GenBank and TIGR.
- 20. The method of claim 1, 2, 3, 4, 5, 6, 7, or 8 wherein the step of providing a database of sequences includes the step of providing the database of sequences for the identification of Mycobacterium.
- 21. The method of claim 1, wherein said at least one input sequence of step d) is obtained through sequencing of at least one region within the genome of identified Mycobacterium isolates.
- 22. The method of claim 21, wherein said at least one region within the genome is the ITS region and is amplified using a primer set comprising
- 23. The method of claim 21, wherein said at least one region within the genome is the 16S rRNA gene region and is amplified using a primer set comprising
- 24. The method of claim 1, wherein said at least one sequence region of step b) is the 16S rRNA gene comprising the highly conserved start sequence GTCGAACGG (SEQ ID NO: 1) and the highly conserved end sequence GGCCAACTACGT (SEQ ID NO: 2).
- 25. The method of claim 1, wherein said at least one sequence region of step b) is the ITS region located between the 16S and 23S genes of the ribosomal gene cluster comprising the highly conserved start sequence CACCTCCTTTCT (SEQ ID NO: 3) and the end sequence GGGGTGTGG (SEQ ID NO: 4).
- 26. The method of claim 1, wherein said at least one sequence region of step b) include the ITS region located between the 16S and 23S genes of the ribosomal gene cluster comprising the highly conserved start sequence CACCTCCTTTCT (SEQ ID NO: 3) and the end sequence GGGGTGTGG (SEQ ID NO: 4) and the 16S rRNA gene comprising the highly conserved start sequence GTCGAACGG (SEQ ID NO: 1) and the highly conserved end sequence GGCCAACTACGT (SEQ ID NO: 2).
- 27. The custom database generated by the method of claim 1.
- 28. The custom database generated by the method of claim 20.
- 29. A method of searching a custom database of sequences to identify an unknown sample comprising:
a) obtaining a unknown sequence from said unknown sample; b) selecting custom database sequence regions of the database to be searched; c) validating the unknown sequence against selected custom database validation conditions; d) returning an error message if said unknown sequence fails the validation conditions; e) comparing the unknown sequence to the selected database sequence regions; f) computing similarity scores for each selected region of said unknown sequence relative to the custom database sequence regions to determine the similarity thereof if the unknown sequence is valid; and g) sorting the similarity scores from highest to lowest.
- 30. The method of claim 29, wherein the unknown sample is from the genus Mycobacterium.
- 31. The method of claim 30, wherein said sequence from said unknown sample is obtained by amplification of the ITS region with a primer set comprising
- 32. The method of claim 20, wherein said sequence from said unknown sample is obtained by amplification of the 16S rRNA region with a primer set comprising
- 33. A method for identifying a sample as M. tuberculosis or M. bovis in a biological sample comprising:
a) obtaining a sample suspected of containing M. tuberculosis or M. bovis; b) amplifying a nucleic acid comprising the pcnA gene region from said sample; c) mixing the amplified nucleic acid of step b) with a M. tuberculosis probe and with a M. bovis probe such that hybridization occurs and forms polynucleotide complexes; d) subjecting formed complexes to denaturing high performance liquid chromatography; and e) analyzing the peak pattern of the eluates to determine whether said sample is M. tuberculosis or M. bovis.
- 34. The method of claim 33 wherein said M. tuberculosis probe comprises SEQ ID NO: 19.
- 35. The method of claim 33 wherein said M. tuberculosis probe comprises SEQ ID NO: 21.
- 36. The method of claim 33 wherein said M. bovis probe comprises SEQ ID NO: 20.
- 37. A method for determining the PZA resistance status of a Mycobacterium in a biological sample comprising:
a) obtaining a sample suspected of containing M. tuberculosis or M. bovis; b) amplifying a nucleic acid comprising the pcnA gene region from said sample; c) mixing the amplified nucleic acid of step b) with a M. tuberculosis probe and with a M. bovis probe such that hybridization occurs and forms polynucleotide complexes; d) subjecting formed complexes to denaturing high performance liquid chromatography; and e) analyzing the peak pattern of the eluates to determine the PZA resistance status of said Mycobacterium sample.
- 38. The method of claim 37 wherein said M. tuberculosis probe comprises SEQ ID NO: 19.
- 39. The method of claim 37 wherein said M. tuberculosis probe comprises SEQ ID NO: 21.
- 40. The method of claim 37 wherein said M. bovis probe comprises SEQ ID NO: 20.
- 41. A method for determining the PZA resistance status of a Mycobacterium and identifying a sample as M. tuberculosis or M. bovis in a biological sample comprising:
a) obtaining a sample suspected of containing M. tuberculosis or M. bovis; b) amplifying a nucleic acid comprising the pcnA gene region from said sample; c) mixing the amplified nucleic acid of step b) with a M. tuberculosis probe and with a M. bovis probe such that hybridization occurs and forms polynucleotide complexes; d) subjecting formed complexes to denaturing high performance liquid chromatography; and e) analyzing the peak pattern of the eluates to determine the PZA resistance status of said Mycobacterium sample and whether said sample is M. tuberculosis or M. bovis.
- 42. The method of claim 37 wherein said M. tuberculosis probe comprises SEQ ID NO: 19.
- 43. The method of claim 37 wherein said M. tuberculosis probe comprises SEQ ID NO: 21.
- 44. The method of claim 37 wherein said M. bovis probe comprises SEQ ID NO: 20.
Parent Case Info
[0001] This invention claims priority under 35 U.S.C. §119 (e) to U.S. Provisional Application No. 60/381,015 filed May 15, 2002. The entire disclosure of the above-identified application is incorporated by reference herein.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60381015 |
May 2002 |
US |