Claims
- 1. A method of storing data, comprising: creating a plurality of subdocuments of approximately equal length from a database; representing the occurrence of a plurality of terms in each of said subdocuments by an encoded bit string; combining a plurality of said bit strings, wherein said combination represents a search result from said database.
- 2. The method of claim 1, further comprising generating a comparison list indicating the relation between a first encoded bit string and a second encoded bit string.
- 3. The method of claim 2, wherein said comparison list indicates the intersection between said first and said second encoded bit strings.
- 4. The method of claim 2, wherein said comparison list indicates the union between said first and said second encoded bit strings.
- 5. An apparatus for storing data, comprising: a computer coupled to a disk storage unit, said disk storage unit stores a database, said computer creates a plurality of subdocuments of approximately equal length from said database; said computer represents the occurrence of a plurality of terms in each of said subdocuments by an encoded bit string; said computer combines a plurality of said encoded bit strings; and said computer stores said plurality of said encoded bit strings.
- 6. The apparatus of claim 5, wherein said processor further generates a comparison list that indicates the relation between a first and a second encoded bit string.
- 7. The apparatus of claim 5, wherein said comparison list indicates the intersection between said first and said second encoded bit string.
- 8. The apparatus of claim 5, wherein said comparison list indicates the union between said first and said second encoded bit string.
- 9. A method of retrieving data from a database, comprising the steps of:creating a plurality of subdocuments from a database; representing the occurrence of at least one term in said subdocuments by an encoded bit string; identifying said subdocuments containing said bit string; and retrieving said subdocuments containing said bit string.
- 10. The method of claim 9, wherein said encoded bit string represents the number of sequential subdocuments that contain the occurrence of said term.
- 11. The method of claim 9 further comprising the step of storing said subdocuments containing said bit string.
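The steps recited in the claims above can be illustrated with a minimal sketch. It is not the patented implementation: the word-count splitting rule, the use of a Python integer as the bit string, the run-length layout, and every function name below are assumptions made for illustration only. The claims do not fix a particular encoding; the run-length form shown here is one possible reading of claim 10.

```python
from typing import Dict, List


def split_into_subdocuments(words: List[str], size: int) -> List[List[str]]:
    """Cut a word sequence into consecutive chunks of `size` words (assumed rule)."""
    return [words[i:i + size] for i in range(0, len(words), size)]


def build_bit_strings(subdocs: List[List[str]]) -> Dict[str, int]:
    """Map each term to an integer whose bit i is set when subdocument i contains it."""
    bit_strings: Dict[str, int] = {}
    for index, subdoc in enumerate(subdocs):
        for term in set(subdoc):
            bit_strings[term] = bit_strings.get(term, 0) | (1 << index)
    return bit_strings


def run_lengths(bits: int, count: int) -> List[int]:
    """Encode the first `count` bits as alternating run lengths, starting with a zero run."""
    runs: List[int] = []
    current, length = 0, 0
    for i in range(count):
        bit = (bits >> i) & 1
        if bit == current:
            length += 1
        else:
            runs.append(length)
            current, length = bit, 1
    runs.append(length)
    return runs


def matching_subdocuments(bits: int, count: int) -> List[int]:
    """List the indices of the subdocuments whose bit is set."""
    return [i for i in range(count) if (bits >> i) & 1]


# Hypothetical database text: twelve words, split into four subdocuments of three words.
words = "the cat sat on the mat the dog sat on the log".split()
subdocs = split_into_subdocuments(words, 3)
terms = build_bit_strings(subdocs)

both = terms["sat"] & terms["the"]    # intersection: subdocuments containing both terms
either = terms["sat"] | terms["on"]   # union: subdocuments containing either term

print(matching_subdocuments(both, len(subdocs)))
print(matching_subdocuments(either, len(subdocs)))
print(run_lengths(terms["the"], len(subdocs)))
```

In this sketch the intersection and union of two bit strings are single bitwise operations, and the combined bit string identifies which subdocuments to retrieve. The run-length form records how many sequential subdocuments do or do not contain a term, which is compact when a term occurs in long consecutive stretches.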
CROSS REFERENCE TO RELATED APPLICATIONS
This is a divisional of U.S. patent application Ser. No. 09/203,408, filed Dec. 2, 1998, now U.S. Pat. No. 6,112,204, which is a division of U.S. patent application Ser. No. 08/900,562, filed Jul. 25, 1997, which issued on Apr. 6, 1999 as U.S. Pat. No. 5,893,094.