BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a functional block diagram of an information extracting apparatus according to a first embodiment of the present invention;
FIG. 2 is an example of a description in a knowledge dictionary shown in FIG. 1;
FIG. 3 is an example of 4W1H-plus-predicate information extracted by an element extracting unit shown in FIG. 1;
FIG. 4 is a schematic for explaining an example in which a supplementary-information obtaining unit shown in FIG. 1 supplements the 4W1H-plus-predicate information from attribute information;
FIG. 5 is a schematic for explaining a definition of document;
FIG. 6 is a schematic for explaining an example in which the supplementary-information obtaining unit extracts information from other parts of text for information supplement;
FIG. 7 is a schematic for explaining an example in which the supplementary-information obtaining unit extracts information from other parts of the text and a document property for information supplement;
FIG. 8 is an output example of each extracted data shown in FIGS. 3, 4, 6, and 7;
FIG. 9 is a flowchart of a 4W1H-plus-predicate information extraction process according to the first embodiment;
FIG. 10 is a flowchart of an analysis process;
FIG. 11 is another flowchart of the 4W1H-plus-predicate information extraction process;
FIG. 12 is a functional block diagram of an information extracting apparatus according to a second embodiment of the present invention;
FIG. 13 is a schematic for explaining conversion examples in which an obtained extraction element is converted into an RDF/XML syntax and an RDF graph by a converter shown in FIG. 12;
FIG. 14 is a functional block diagram of an information extracting apparatus according to a third embodiment of the present invention;
FIG. 15 is a schematic for explaining a document-relationship specifying rule applied to specify an inter-document relationship by a document-relationship specifying unit shown in FIG. 14;
FIG. 16 is another example of a description in the knowledge dictionary shown in FIG. 14;
FIG. 17 is a schematic for explaining extraction of an inter-document relationship in an email document group by the information extracting apparatus shown in FIG. 14;
FIG. 18 is a schematic for explaining extraction of 4W1H-plus-predicate information from a document B shown in FIG. 17;
FIG. 19 is a schematic for explaining extraction of 4W1H-plus-predicate information from documents B and C shown in FIG. 17;
FIG. 20 is a schematic for explaining reconstruction of elements from documents A, B, and C in FIG. 17 by an element reconstructing unit shown in FIG. 14;
FIG. 21 is a flowchart of an information extraction process according to the third embodiment;
FIG. 22 is a flowchart of a document relationship-specifying process;
FIG. 23 is a flowchart of a process in which the element reconstructing unit reconstructs 4W1H-plus-predicate information;
FIG. 24 is a schematic for explaining conversion examples in which 4W1H-plus-predicate information is converted into an RDF syntax and an RDF graph by a converter of an information extracting apparatus according to a fourth embodiment of the present invention;
FIG. 25 is a block diagram of a hardware configuration of the information extracting apparatus according to the embodiments;
FIG. 26 is still another example of a description in the knowledge dictionary;
FIG. 27 is an example of 4W1H-plus-predicate information extracted from an English sentence;
FIG. 28 is an example of a document property;
FIG. 29 is a schematic for explaining an example in which the supplementary-information obtaining unit extracts information from the document property for information supplement; and
FIG. 30 is an output example of each data extracted from Example 1 and Example 2 shown in FIGS. 27 and 29.