Claims
- 1. A decision tree system for use in data mining utilizing a storage module and an object oriented linking module for linking said decision tree system and said storage module; comprising:
an object oriented pattern recognition algorithms module for pattern recognition, comprising:
a decision tree system including
an object oriented module to read said data, an object oriented module to sort said data if sorting is necessary, an object oriented module to determine the best manner to split said data according to some criterion, and an object oriented module to split said data.
- 2. The decision tree system of claim 1 wherein said object oriented module to determine the best manner to split said data is based on tests on single attributes of said data.
- 3. The decision tree system of claim 1 wherein said object oriented module to determine the best manner to split said data is based on a OC1 algorithm.
- 4. The decision tree system of claim 1 wherein said object oriented module to determine the best manner to split said data is based on a CART-LC algorithm.
- 5. The decision tree system of claim 1 wherein said object oriented module to determine the manner to best split said data is based on an evolutionary algorithm.
- 6. The decision tree system of claim 1 wherein said criterion is the Gini index.
- 7. The decision tree system of claim 1 wherein said criterion is the information gain.
- 8. The decision tree system of claim 1 wherein said criterion is the information ratio.
- 9. The decision tree system of claim 1 wherein said criterion is the twoing rule.
- 10. A decision tree system for use in a parallel object oriented data mining system, comprising:
a parallel object oriented module to read said data, said data containing data items with features, a parallel object oriented module to sort said data if sorting is necessary, a parallel object oriented module to determine the best manner to split said data into subsets according to some criterion, a parallel object oriented module to split said data, a storage module to store the features for each data item, a parallel object oriented linking module for linking said decision tree system and said storage module.
- 11. The decision tree system of claim 10 wherein said parallel object oriented module to determine the best manner to split said data is based on tests on single attributes of said data.
- 12. The decision tree system of claim 10 wherein said parallel object oriented module to determine the best manner to split said data is based on a OC1 algorithm.
- 13. The decision tree system of claim 10 wherein said parallel object oriented module to determine the best manner to split said data is based on a CART-LC algorithm.
- 14. The decision tree system of claim 10 wherein said parallel object oriented module to determine the manner to best split said data is based on an evolutionary algorithm.
- 15. The decision tree system of claim 10 wherein said criterion is the Gini index.
- 16. The decision tree system of claim 10 wherein said criterion is the information gain.
- 17. The decision tree system of claim 10 wherein said criterion is the information ratio.
- 18. The decision tree system of claim 10 wherein said criterion is the twoing rule.
- 19. A decision tree method for use in a method of data mining data files wherein said data files contain objects having relevant features, comprising the steps of:
recognizing patterns among said objects based upon said features, creating a decision tree system, reading said data using an object oriented module, sorting said data using an object oriented module if sorting is necessary, determining the best manner to split said data into subsets according to some criterion using an object oriented module, and splitting said data using an object oriented module.
- 20. The decision tree method of claim 19 wherein said parallel object oriented module to determine the best manner to split said data is based on tests on single attributes of said data.
- 21. The decision tree method of claim 19 wherein said parallel object oriented module to determine the best manner to split said data is based on a OC1 algorithm.
- 22. The decision tree method of claim 19 wherein said parallel object oriented module to determine the best manner to split said data is based on a CART-LC algorithm.
- 23. The decision tree method of claim 19 wherein said parallel object oriented module to determine the manner to best split said data is based on an evolutionary algorithm.
- 24. The decision tree method of claim 19 wherein said criterion is the Gini index.
- 25. The decision tree method of claim 19 wherein said criterion is the information gain.
- 26. The decision tree method of claim 19 wherein said criterion is the information ratio.
- 27. The decision tree method of claim 19 wherein said criterion is the twoing rule.
- 28. A decision tree method for use in a method of data mining that includes the steps of reading and displaying data files, said data files containing objects having relevant features, identifying said objects in said data files, and extracting relevant features for each of said objects, comprising the steps of:
recognizing patterns among said objects based upon said features. creating a decision tree by
reading said data, sorting said data if sorting is necessary, determining the best manner to split said data into subsets according to some criterion, and splitting said data.
- 29. The decision tree method of claim 28 wherein said step of determining the best manner to split said data is based on tests on single attributes of said data.
- 30. The decision tree method of claim 28 wherein said step of determining the best manner to split said data is based on a OC1 algorithm.
- 31. The decision tree method of claim 1 wherein said step of determining the best manner to split said data is based on a CART-LC algorithm.
- 32. The decision tree method of claim 28 wherein said step of determining the manner to best split said data is based on an evolutionary algorithm.
- 33. The decision tree method of claim 28 wherein said criterion is the Gini index.
- 34. The decision tree method of claim 28 wherein said criterion is the information gain.
- 35. The decision tree method of claim 28 wherein said criterion is the information ratio.
- 36. The decision tree method of claim 28 wherein said criterion is the twoing rule.
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] Related subject matter is disclosed and claimed in the following commonly owned, copending, U.S. patent applications, “PARALLEL OBJECT-ORIENTED DATA MINING SYSTEM,” by Chandrika Kamath and Erick Cantu-Paz, patent application number 09/xxxxxx, filed xxxxxxx, 2001, and, “PARALLEL OBJECT-ORIENTED, DENOISING SYSTEM USING WAVELET MULTIRESOLUTION ANALYSIS,” by Chandrika Kamath, Chuck H. Baldwin, Imola K. Fodor, and Nu A. Tang, patent application number 09/xxxxxx, filed xxxxxxx, 2001, which are hereby incorporated by reference in their entirety.
Government Interests
[0002] The United States Government has rights in this invention pursuant to Contract No. W-7405-ENG-48 between the United States Department of Energy and the University of California for the operation of Lawrence Livermore National Laboratory.