Claims
- 1. A computer readable medium bearing sequences of instructions for searching a database, said sequences of instructions comprising:
- selecting a database having at least one document from a plurality of databases;
- creating a plurality of subdocuments of approximately equal length from said database;
- representing the occurrence of a plurality of terms in each of said subdocuments by an encoded bit string;
- combining a plurality of said bit strings, wherein said combination represents a search result from said database.
- 2. The computer readable medium of claim 1, wherein said sequence of instructions further comprise:
- creating at least one subdocument start boundary and a corresponding number of subdocument end boundaries in a document such that a subdocument comprises that part of said document between each start boundary and corresponding end boundary; and
- assigning an identifier to each subdocument that identifies a location of said subdocument in said database.
- 3. The computer readable medium of claim 2, wherein said sequence of instructions further comprise:
- parsing each subdocument into a plurality of terms; and
- generating a term list that associates each term with said subdocument that includes that term.
- 4. The computer readable medium of claim 3, wherein said sequence of instructions further comprise:
- selecting a plurality of terms; and
- linking said terms by a logical operator.
- 5. The computer readable medium of claim 4, wherein said logical operator includes one of the set of operators consisting of union, intersection, inversion or testing for the existence of an element in the set.
- 6. The computer readable medium of claim 1, further comprising generating a comparison list indicating the relation between a first encoded bit string and a second encoded bit string.
- 7. The computer readable medium of claim 6, wherein said comparison list indicates the intersection between said first and said second encoded bit strings.
- 8. The computer readable medium of claim 7, wherein said comparison list indicates the union between said first and said second encoded bit strings.
Parent Case Info
This application is a divisional of patent application Ser. No. 08/900,562, filed Jul. 25, 1997, now U.S. Pat. No. 5,893,094.
US Referenced Citations (21)
Non-Patent Literature Citations (2)
Entry |
S Kwong et al., Using Multiple Huffman Trees for Compressing Chinese Character Fonts, Paper No. 95-70 (Sep. 6, 1995). |
Data Compression (internet posting), no date. |
Divisions (1)
|
Number |
Date |
Country |
Parent |
900562 |
Jul 1997 |
|