Claims
- 1. A method for creating a unique DNA genome database, comprising the steps of:
providing available genomic sequence data in a first database enumerating regions of identity between genomic sequences in the first data base and other genomic sequences in the first database as if the first database was also a query database, the enumerating being done on a computer having a processor, memory, input/output mechanisms; and removing from the first database, genomic sequences that are nearly identical to a region of a longer genomic sequence, whereby a unique DNA genome database is created.
- 2. A system for creating a unique DNA genome database, comprising;
means for providing available genomic sequence data in a first database means for enumerating regions of identity between genomic sequences in the first data base and other genomic sequences in the first database as if the first database was also a query database, the enumerating being done on a computer having a processor, memory, input/output mechanisms; and means for removing from the first database, genomic sequences that are nearly identical to a region of a longer genomic sequence, whereby a unique DNA genome database is created.
- 3. A computer program for finding near identities in a DNA sequence database, comprising:
a first code mechanism for comparing a DNA sequence on a query database to DNA sequences on a data database, wherein a tag array (designated as Qtags) is generated for each of the DNA sequences on the query database and wherein a tag array (designated Dtags) is generated for each of the DNA sequences on the data database; and a second code mechanism for comparing each Qtag to each Dtag, using a comparison model, wherein near identities of sequences in the two databases are identified.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a divisional of U.S. patent application Ser. No. 09/772,813, filed Jan. 30, 2001, which claims the benefit of U.S. Provisional application No. 60/179,309 (filed Jan. 31, 2000), the contents of which are incorporated by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60179309 |
Jan 2000 |
US |
Divisions (1)
|
Number |
Date |
Country |
Parent |
09772813 |
Jan 2001 |
US |
Child |
10762900 |
Jan 2004 |
US |