Claims
- 1. A method for identifying oligonucleotide sequences suitable for the amplification of a unique sequence within a genomic region of interest, said method comprising the steps of:
executing a first process on a digital computer to identify repeat sequences that occur within said genomic region of interest; executing a second process on a digital computer to compare repeat sequence-free subsequences within said genomic region of interest to a nucleotide sequence database, whereby nucleotide sequences within said nucleotide sequence database that are substantially similar to said repeat sequence-free subsequences are identified; executing a third process on a digital computer to identify oligonucleotide sequences that are suitable for use as primers in an amplification reaction to amplify a product within any of said repeat sequence-free subsequences for which a defined number of substantially similar sequences are identified in said nucleotide sequence database; and outputting said oligonucleotide sequences.
- 2. The method of claim 1, wherein said genomic region is from a human genome.
- 3. The method of claim 1, wherein said number of substantially similar sequences is zero.
- 4. The method of claim 1, wherein said oligonucleotide sequences are outputted by displaying the sequences on a computer screen or on a computer printout.
- 5. The method of claim 1, wherein said oligonucleotide sequences are outputted by executing a fourth process on a digital computer to direct the synthesis of oligonucleotide primers comprising said oligonucleotide sequences.
- 6. The method of claim 5, wherein said computer directs the synthesis of said oligonucleotide primers by ordering said synthesis from an external source.
- 7. The method of claim 5, wherein said computer is in communication with an oligonucleotide synthesizer, and wherein said computer directs the synthesis of said oligonucleotide primers by said synthesizer.
- 8. The method of claim 1, wherein said substantially similar sequences are at least about 50% identical to said repeat sequence-free subsequences.
- 9. The method of claim 1, wherein said substantially similar sequences are at least about 70% identical to said repeat sequence-free subsequences.
- 10. The method of claim 1, wherein said substantially similar sequences are at least about 90% identical to said repeat sequence-free subsequences.
- 11. The method of claim 1, wherein said first process is executed using Repeat Masker software.
- 12. The method of claim 1, wherein said second process is executed using a BLAST algorithm.
- 13. The method of claim 1, wherein said third process is executed using Primer3 software.
- 14. The method of claim 5, further comprising producing an amplification product using said oligonucleotide primers.
- 15. The method of claim 14, wherein said amplification product is a FISH probe.
- 16. The method of claim 15, wherein said FISH probe is fluorescently labeled.
- 17. The method of claim 14, wherein said amplification product is an array CGH target.
- 18. A method for identifying oligonucleotide sequences suitable for the amplification of a unique sequence within a genomic region of interest, said method comprising the steps of:
analyzing a genomic nucleotide sequence that encompasses said genomic region of interest to identify repeat sequences within said genomic region; comparing at least one repeat sequence-free subsequence within said genomic nucleotide sequence to a nucleotide sequence database to identify sequences within said database that are substantially similar to said repeat sequence-free subsequence; for at least one of said repeat sequence-free subsequences for which a defined number of substantially similar sequences are identified within said nucleotide sequence database, selecting oligonucleotide sequences that are suitable for use as primers in an amplification reaction to amplify a product within said repeat sequence-free subsequence.
- 19. The method of claim 18, wherein said genomic region is from a human genome.
- 20. The method of claim 18, wherein said defined number of substantially similar sequences is zero.
- 21. The method of claim 18, further comprising displaying said oligonucleotide sequences on a computer screen or on a computer printout.
- 22. The method of claim 18, further comprising directing the synthesis of oligonucleotide primers comprising said oligonucleotide sequences.
- 23. The method of claim 22, wherein said synthesis is directed by ordering the synthesis of said primers from an external source.
- 24. The method of claim 18, wherein said substantially similar sequences are at least about 50% identical to said repeat sequence-free subsequences.
- 25. The method of claim 18, wherein said substantially similar sequences are at least about 70% identical to said repeat sequence-free subsequences.
- 26. The method of claim 18, wherein said substantially similar sequences are at least about 90% identical to said repeat sequence-free subsequences.
- 27. The method of claim 18, wherein the identification of repeat sequences within said genomic region is performed using Repeat Masker software.
- 28. The method of claim 18, wherein the comparison of said at least one repeat sequence-free subsequence with said genome database is performed using a BLAST algorithm.
- 29. The method of claim 18, wherein said oligonucleotide sequences are selected using Primer3 software.
- 30. The method of claim 22, further comprising generating an amplification product using said oligonucleotide primers.
- 31. The method of claim 30, wherein said amplification product is a FISH probe.
- 32. The method of claim 31, wherein said FISH probe is fluorescently labeled.
- 33. The method of claim 30, wherein said amplification product is an array CGH target.
- 34. A computer program product designing and outputting oligonucleotide sequences suitable for use as primers to amplify unique sequences within a genomic region of interest, said computer program product comprising:
a storage structure having computer program code embodied therein, said computer program code comprising: computer program code for causing a computer to analyze a nucleotide sequence encompassing said genomic region of interest to identify repeat sequences within said nucleotide sequence; computer program code for causing a computer to, for each subsequence of said nucleotide sequence that does not contain any of said repeat sequences, compare said subsequence against a nucleotide sequence database to identify nucleotide sequences within said database that are substantially similar to said subsequence; computer program code for causing a computer to, for each of said subsequences for which a defined number of substantially similar sequences are found in said database, identify oligonucleotide sequences suitable for use as primers in an amplification reaction to amplify a product within said subsequence; and computer program code for outputting said oligonucleotide sequences.
- 35. The method of claim 34, wherein said defined number of substantially similar sequences is zero.
- 36. The method of claim 34, wherein said substantially similar sequences are at least about 50% identical to said subsequences.
- 37. The method of claim 34, wherein said substantially similar sequences are at least about 70% identical to said subsequences.
- 38. The method of claim 34, wherein said substantially similar sequences are at least about 90% identical to said subsequences.
- 39. A method for identifying genes within a genomic region of interest, said method comprising the steps of:
executing a first process on a digital computer to identify repeat sequences that occur within said genomic region of interest; executing a second process on a digital computer to compare repeat sequence-free subsequences within said genomic region of interest to a nucleotide sequence database, whereby nucleotide sequences within said nucleotide sequence database that are substantially similar to said repeat sequence-free subsequences are identified; executing a third process on a digital computer to select repeat sequence-free subsequences having no substantially similar sequences to identify a repeat sequence-free subsequence may represent a gene family. identify oligonucleotide sequences that are suitable for use as primers in an amplification reaction to amplify a product within any of said repeat sequence-free subsequences for which a defined number of substantially similar sequences are identified in said nucleotide sequence database; and outputting said oligonucleotide sequences.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT
[0001] This invention was made with Government support under Grant No. CA58207, awarded by the National Institutes of Health. The Government has certain rights in this invention.