Claims
- 1. A structured-text cataloging method for a text searching system, in which a set of texts is searched for specific text contents, comprising the following steps:a structure-index creating step of creating a structure index, by sequentially superposing logical structures of a plurality of texts to be cataloged in said structure index; wherein said structure index has a tree-like structure composed of a plurality of metanodes; wherein a context identifier that uniquely identifies one of said metanodes is assigned to each metanode of said structure index; wherein a group of structure elements having the same position of appearance and the same element type for a plurality of texts are represented by a single metanode; wherein the tree-like structures of two of said texts to be cataloged are superposed on each other in said structure index by: comparing nodes of one of said tree-like structures of said two texts with nodes of the other of said tree-like structures of said two texts; regarding a root node in said one of said tree-like structures as a counterpart that mutually corresponds to a root node in said other of said tree-like structures; regarding a non-root node in said one of said tree-like structures of said two texts as a counterpart that mutually corresponds to a non-root node in said other of said tree-like structures if the non-root node in said one of said tree-like structures has a directly superordinate node mutually corresponding to a directly superordinate node of the non-root node in said other of said tree-like structures, if the non-root node in said one of said tree-like structures is the same type of node as the non-root node in said other of said tree-like structures, and if the non-root node in said one of said tree-like structures has the same appearance order number as the non-root node in said other of said tree-like structures, wherein the appearance order number indicates a position in a normal order of an array of brother nodes of the same type of node found by counting said brother nodes, starting from the head of said array; and representing two nodes mutually corresponding nodes by a single metanode in said structure index.
- 2. A structured-text cataloging method for a text searching system, in which a set of texts is searched for specific text contents, comprising the following steps:a structure-index creating step of creating a structure index, by sequentially superposing logical structures of a plurality of texts to be cataloged in said structure index; wherein said structure index has a tree-like structure composed of a plurality of metanodes; wherein a context identifier that uniquely identifies one of said metanodes is assigned to each metanode of said structure index; wherein a group of structure elements having the same position of appearance and the same element type for a plurality of texts are represented by a single metanode; and wherein the tree-like structures of two of said texts to be cataloged are superposed on each other in said structure index by: comparing nodes of one of said tree-like structures of said two texts with nodes of the other of said tree-like structures of said two texts; regarding a root node in said one of said tree-like structures as a counterpart that mutually corresponds to a root node in said other of said tree-like structures; regarding a non-root node in said one of said tree-like structures of said two texts as a counterpart that mutually corresponds to a non-root node in said other of said tree-like structures if the non-root node in said one of said tree-like structures has a directly superordinate node mutually corresponding to a directly superordinate node of the non-root node in said other of said tree-like structures, if the non-root node in said one of said tree-like structures is the same type of node as the non-root node in said other of said tree-like structures, and if the non-root node in said one of said tree-like structures has the same appearance order number as the non-root node in said other of said tree-like structures, wherein said appearance order number is a number indicating a position in a reversed order of an array of brother nodes having the same type of node found by counting said brother nodes, starting from the end of said array; and representing two nodes mutually corresponding nodes by a single metanode in said structure index.
Priority Claims (1)
Number |
Date |
Country |
Kind |
9-041855 |
Feb 1997 |
JP |
|
Parent Case Info
The above-referenced patent application is a continuation of U.S. Ser. No. 09/814,692, filed Mar. 15, 2001, now U.S. Pat No. 6,389,413 which is a continuation application of U.S. application Ser. No. 09/589,226, filed on Jun. 8, 2000 (now U.S. Pat. No. 6,226,632), which is a continuation application of U.S. Ser. No. 09/028,513, filed Feb. 23, 1998 (now U.S. Pat. No. 6,105,022), from which priority is claimed under 35 U.S.C. §120.
US Referenced Citations (12)
Number |
Name |
Date |
Kind |
5519694 |
Brewer et al. |
May 1996 |
A |
5557789 |
Mase et al. |
Sep 1996 |
A |
5666645 |
Thomas et al. |
Sep 1997 |
A |
5717925 |
Harper et al. |
Feb 1998 |
A |
5813009 |
Johnson et al. |
Sep 1998 |
A |
5895446 |
Takeda et al. |
Apr 1999 |
A |
5950214 |
Rivette et al. |
Sep 1999 |
A |
5956705 |
Stevens et al. |
Sep 1999 |
A |
5956734 |
Schmuck et al. |
Sep 1999 |
A |
5970490 |
Morgenstern |
Oct 1999 |
A |
6105022 |
Takahashi et al. |
Aug 2000 |
A |
6226632 |
Takahashi et al. |
May 2001 |
B1 |
Foreign Referenced Citations (2)
Number |
Date |
Country |
8-147311 |
Jun 1996 |
JP |
8-194718 |
Jul 1996 |
JP |
Non-Patent Literature Citations (5)
Entry |
Published material concerning Livelink Search, a product of Open Text Corporation, printed from the Internet (no data available). |
International Standard ISO 8879, Information Processing—Text and Office Systems—Standard Generalized Markup Language (SGML), First Edition, 1986, pp. 1-155. |
Overlapping B+trees for temporal data by Manolopoulos et al, Information Technology 1990, proceedings of the 5th Jerusalem Conference, pp. 248-253. |
Multi-mode indices for effective image retrieval in multi systems by Cha et al, IEEE Intern'l Conference Multimedia computing systems, pp. 152-159. |
New access index for fast execution of conjuctive queries over text data by Yang et al, Ohio University, Database Engineering and Applications, pp. 248-253. |
Continuations (3)
|
Number |
Date |
Country |
Parent |
09/814692 |
Mar 2001 |
US |
Child |
10/095569 |
|
US |
Parent |
09/589226 |
Jun 2000 |
US |
Child |
09/814692 |
|
US |
Parent |
09/028513 |
Feb 1998 |
US |
Child |
09/589226 |
|
US |