Claims
- 1. A computer readable medium having computer readable program code means embodied therein, the computer program code means comprising:
a computer readable program code that records in a dictionary database dictionary information which is used for processing of determining whether a predetermined data element is contained in data to be analyzed and links a data element and category information representing at least one category to which the data element belongs; a computer readable program code that receives a designation of the category; and a computer readable program code that extracts a data element linked to category information representing the designated category by referring to the dictionary database and sets the extracted data element as the predetermined data element to be used for determination in the processing.
- 2. The medium according to claim 1, comprising
a computer readable program code that extracts a candidate data element contained in the data to be analyzed in accordance with a predetermined rule when it is determined by the processing that the predetermined data element is contained in the data to be analyzed, totalizes an extraction frequency of the candidate data element, and records in the database dictionary information which links the candidate data element and category information representing the extraction frequency of the candidate data element.
- 3. The medium according to claim 1, comprising
a computer readable program code that extracts a candidate data element contained in the data to be analyzed in accordance with a predetermined rule when it is determined by the processing that the predetermined data element is contained in the data to be analyzed, extracts time information added to the data to be analyzed, and records in the database dictionary information which links the candidate data element and category information representing the extracted time information.
- 4. The medium according to claim 1, wherein
the category information has a structure obtained by hierarchically combining a plurality of categories, and the designation of the category represents the hierarchical combination of the plurality of categories.
- 5. A data analysis system which executes processing of determining whether a predetermined data element is contained in data to be analyzed, comprising:
a recording unit that records in a dictionary database dictionary information which links a data element and category information representing at least one category to which the data element belongs; a category designating unit that receives a designation of the category; and an extracting unit that extracts a data element linked to category information representing the designated category by referring to the dictionary database and sets the extracted data element as the predetermined data element to be used for determination in the processing.
- 6. The system according to claim 5, comprising
a totalizing unit that extracts a candidate data element contained in the data to be analyzed in accordance with a predetermined rule when it is determined by the processing that the predetermined data element is contained in the data to be analyzed, totalizes an extraction frequency of the candidate data element, and records in the database dictionary information which links the candidate data element and category information representing the extraction frequency of the candidate data element.
- 7. The system according to claim 5, comprising
a totalizing unit that extracts a candidate data element contained in the data to be analyzed in accordance with a predetermined rule when it is determined by the processing that the predetermined data element is contained in the data to be analyzed, extracts time information added to the data to be analyzed, and records in the database dictionary information which links the candidate data element and category information representing the extracted time information.
- 8. The system according to claim 5, wherein
the category information has a structure obtained by hierarchically combining a plurality of categories, and the designation of the category represents the hierarchical combination of the plurality of categories.
- 9. A data analysis method of executing processing of determining whether a predetermined data element is contained in data to be analyzed, comprising:
recording in a dictionary database dictionary information which links a data element and category information representing at least one category to which the data element belongs; receiving a designation of the category; and extracting a data element linked to category information representing the designated category by referring to the dictionary database and setting the extracted data element as the predetermined data element to be used for determination in the processing.
- 10. The method according to claim 9, comprising
extracting a candidate data element contained in the data to be analyzed in accordance with a predetermined rule when it is determined by the processing that the predetermined data element is contained in the data to be analyzed, totalizing an extraction frequency of the candidate data element, and recording in the database dictionary information which links the candidate data element and category information representing the extraction frequency of the candidate data element.
- 11. The method according to claim 9, comprising
extracting a candidate data element contained in the data to be analyzed in accordance with a predetermined rule when it is determined by the processing that the predetermined data element is contained in the data to be analyzed, extracting time information added to the data to be analyzed, and recording in the database dictionary information which links the candidate data element and category information representing the extracted time information.
- 12. The method according to claim 9, wherein
the category information has a structure obtained by hierarchically combining a plurality of categories, and the designation of the category represents the hierarchical combination of the plurality of categories.
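For illustration only, the flow recited in claims 1, 2 and 4 (and their system and method counterparts) can be sketched as follows. This is a minimal, non-limiting sketch and not the claimed implementation. The names `DictionaryDatabase`, `record`, `extract`, `contains_predetermined_element` and `totalize_candidates` are hypothetical. The sketch also assumes that a hierarchical category is a tuple of category names, that the determination is a plain substring test, and that a regular expression stands in for the "predetermined rule".

```python
import re
from collections import Counter, defaultdict


class DictionaryDatabase:
    """Hypothetical in-memory stand-in for the dictionary database."""

    def __init__(self):
        # data element -> set of category paths, e.g. ("finance", "bank")
        self._categories = defaultdict(set)

    def record(self, element, category_path):
        """Link a data element to one (possibly hierarchical) category."""
        self._categories[element].add(tuple(category_path))

    def extract(self, designated_path):
        """Return elements linked to the designated category or a sub-category of it."""
        prefix = tuple(designated_path)
        return {
            element
            for element, paths in self._categories.items()
            if any(path[:len(prefix)] == prefix for path in paths)
        }


def contains_predetermined_element(data, predetermined_elements):
    """Determination processing (assumed here to be a substring test)."""
    return any(element in data for element in predetermined_elements)


def totalize_candidates(documents, predetermined_elements, rule=r"[A-Z][a-z]+"):
    """Count candidate elements in documents that contain a predetermined element.

    The regular expression is an assumed example of a 'predetermined rule'.
    """
    counts = Counter()
    for text in documents:
        if contains_predetermined_element(text, predetermined_elements):
            counts.update(re.findall(rule, text))
    return counts


# Example usage with made-up data.
db = DictionaryDatabase()
db.record("loan", ("finance", "bank"))
db.record("stock", ("finance", "securities"))
db.record("goal", ("sports", "soccer"))

# Designating the category "finance" selects the elements used for determination.
targets = db.extract(("finance",))

# Totalize candidates, then record each with its frequency as category information.
counts = totalize_candidates(["Acme approved a loan", "Acme scored a goal"], targets)
for candidate, frequency in counts.items():
    db.record(candidate, ("frequency", str(frequency)))
```

Representing a category as a tuple lets a designation such as `("finance",)` match every sub-category beneath it, while `("finance", "bank")` narrows the selection to one branch of the hierarchy.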
Priority Claims (2)
Number | Date | Country | Kind
2001-241131 | Aug 2001 | JP |
2002-214324 | Jul 2002 | JP |
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is based upon and claims the benefit of priority from the prior Japanese Patent Applications No. 2001-241131, filed Aug. 8, 2001; and No. 2002-214324, filed Jul. 23, 2002, the entire contents of both of which are incorporated herein by reference.