Claims
- 1. A method of structuring data in a data-mining-ready format, wherein said data has been previously organized in a bit-Sequential (bSQ) format that comprises a plurality of binary files identified by a bit position, said method comprising the steps of:
dividing each of said plurality of binary files into first quadrants;
recording the count of 1-bits for each first quadrant on a first level;
dividing each of said first quadrants into new quadrants;
recording the count of 1-bits for each of said new quadrants on a new level; and
repeating the two steps immediately above until all of said new quadrants comprise a pure-1 quadrant or a pure-0 quadrant to form a basic tree structure.
- 2. The method of claim 1, wherein said basic tree structure has a complement that provides the 0-bit counts for each of said first quadrants and said new quadrants.
- 3. The method of claim 1, further comprising the step of masking said basic tree structure.
- 4. The method of claim 3, wherein said step of masking comprises using a 3-value logic to represent said pure-1 quadrants, said pure-0 quadrants, and a non-pure quadrant.
- 5. The method of claim 1, further comprising constructing a value tree structure from said basic tree structure.
- 6. The method of claim 5, further comprising constructing a tuple tree structure from said value tree structure.
- 7. The method of claim 1, wherein said data comprises two-dimensional spatial data.
- 8. The method of claim 7, wherein said basic tree structure comprises a lossless format of said two-dimensional spatial data.
- 9. The method of claim 1, wherein said data comprises bioinformatics data.
- 10. The method of claim 9, wherein said bioinformatics data comprises microarray data.
- 11. The method of claim 1, wherein said data comprises nanotechnology data.
- 12. The method of claim 1, wherein said data comprises data representing an integrated circuit.
- 13. A system for structuring data in a data-mining-ready format, wherein said data has been previously organized in a bit-Sequential (bSQ) format that comprises a plurality of binary files identified by a bit position, said system comprising:
a computer system and a set of computer readable instructions, wherein said set of instructions include directing said computer system to:
divide each of said plurality of binary files into first quadrants;
record the count of 1-bits for each first quadrant on a first level;
divide each of said first quadrants into new quadrants;
record the count of 1-bits for each of said new quadrants on a new level; and
repeat recursively until all of said new quadrants comprise a pure-1 or pure-0 quadrant to form a basic tree structure.
- 14. The system of claim 13, wherein said set of instructions further include directing said computer system to determine a complement of said basic tree structure that provides the 0-bit counts for each of said first quadrants and said new quadrants.
- 15. The system of claim 13, wherein said set of instructions further include directing said computer system to mask said basic tree structure.
- 16. The system of claim 15, wherein the masking of said basic tree structure comprises using a 3-value logic to represent said pure-1 quadrants, said pure-0 quadrants, and a non-pure quadrant.
- 17. The system of claim 13, wherein said set of instructions further include directing said computer system to construct a value tree structure from said basic tree structure.
- 18. The system of claim 17, wherein said set of instructions further include directing said computer system to construct a tuple tree structure from said basic tree structure.
- 19. The system of claim 13, wherein said data comprises two-dimensional spatial data.
- 20. The system of claim 19, wherein said basic tree structure comprises a lossless format of said two-dimensional spatial data.
- 21. The system of claim 19, wherein said data comprises bioinformatics data.
- 22. The system of claim 21, wherein said data comprises microarray data.
- 23. The system of claim 13, wherein said data comprises nanotechnology data.
- 24. The system of claim 23, wherein said data comprises data representing an integrated circuit.
- 25. A system for formatting data, wherein said data is in the form of an n-dimensional array of binary data, said binary data comprising a plurality of bits that are identified by a bit position within the n-dimensional array, the system comprising:
a computer system and a set of computer readable instructions, wherein said set of instructions include directing said computer system to:
create one file for each bit position of said binary data wherein the bit position identification is maintained; and
store the bit with the corresponding bit position identification from said binary data within the created file.
- 26. The system of claim 25, wherein said data comprises two-dimensional spatial data.
- 27. The system of claim 25, wherein said data comprises bioinformatics data.
- 28. The system of claim 27, wherein said bioinformatics data comprises microarray data.
- 29. The system of claim 25, wherein said data comprises nanotechnology data.
- 30. The system of claim 25, wherein said data comprises data representing an integrated circuit.
- 31. A method of formatting data, wherein said data is in the form of an n-dimensional array of binary data, said binary data comprising a plurality of bits that are identified by a bit position within the n-dimensional array, said method comprising the steps of:
creating one file for each bit position of said binary data while maintaining the bit position identification; and
storing the bit with the corresponding bit position identification from said binary data within the created file.
- 32. The method of claim 31, wherein said data comprises two-dimensional spatial data.
- 33. The method of claim 31, wherein said data comprises bioinformatics data.
- 34. The method of claim 33, wherein said bioinformatics data comprises microarray data.
- 35. The method of claim 31, wherein said data comprises nanotechnology data.
- 36. The method of claim 31, wherein said data comprises data representing an integrated circuit.
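By way of non-limiting illustration only, the formatting steps of claim 31 and the tree-building, complement, and masking steps of claims 1 to 4 might be sketched in Python as follows. The sketch is not part of the claims. Every name in it (`to_bsq`, `Node`, `build_tree`, `complement`, `mask`) is hypothetical. It assumes in-memory lists in place of bSQ files, a square two-dimensional plane whose side is a power of two, a quadrant order of upper-left, upper-right, lower-left, lower-right, and the labels "1", "0", and "m" for the 3-value logic.

```python
from typing import List, Optional

Grid = List[List[int]]


def to_bsq(pixels: Grid, bits: int = 8) -> List[Grid]:
    """Split an array of integers into one binary plane per bit position.

    Plane 0 holds the most significant bit. Each plane stands in for one
    bSQ file (assumption: kept in memory rather than written to disk).
    """
    return [
        [[(value >> (bits - 1 - b)) & 1 for value in row] for row in pixels]
        for b in range(bits)
    ]


class Node:
    """One quadrant: its 1-bit count, its side length, and its sub-quadrants."""

    def __init__(self, count: int, size: int, children: Optional[list] = None):
        self.count = count
        self.size = size
        self.children = children or []  # empty for pure-1 and pure-0 quadrants


def build_tree(plane: Grid, row: int = 0, col: int = 0,
               size: Optional[int] = None) -> Node:
    """Record 1-bit counts quadrant by quadrant until every quadrant is pure."""
    if size is None:
        size = len(plane)  # assumption: square plane, side a power of two
    count = sum(plane[r][c]
                for r in range(row, row + size)
                for c in range(col, col + size))
    if count == 0 or count == size * size:
        return Node(count, size)  # pure quadrant: stop dividing
    half = size // 2
    # Assumed quadrant order: upper-left, upper-right, lower-left, lower-right.
    offsets = ((0, 0), (0, half), (half, 0), (half, half))
    children = [build_tree(plane, row + dr, col + dc, half) for dr, dc in offsets]
    return Node(count, size, children)


def complement(node: Node) -> Node:
    """Return a tree of the same shape holding 0-bit counts instead."""
    zeros = node.size * node.size - node.count
    return Node(zeros, node.size, [complement(c) for c in node.children])


def mask(node: Node):
    """Replace counts with three values: "1" (pure-1), "0" (pure-0), "m" (mixed)."""
    if node.count == 0:
        return "0"
    if node.count == node.size * node.size:
        return "1"
    return ("m", [mask(c) for c in node.children])
```

In this sketch a single-bit quadrant is always pure, so the recursion terminates on any plane that satisfies the stated size assumption.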
CLAIM TO PRIORITY
[0001] The present application claims priority to U.S. Provisional Patent Application No. 60/234,050, filed Sep. 20, 2000, and entitled “System and Method for Imagery Organization, Compression, and Data Mining” and to U.S. Provisional Patent Application No. 60/237,778, filed Oct. 4, 2000, and entitled “System and Method for Imagery Organization, Compression, and Data Mining.” Both of the identified United States provisional patent applications are hereby incorporated by reference.
Provisional Applications (2)
| Number | Date | Country |
|---|---|---|
| 60234050 | Sep 2000 | US |
| 60237778 | Oct 2000 | US |