Claims
- 1. A method for generating a layout model to define objects of a document image by an apparatus having an input unit, an area generation unit, a layout model generation unit, a storage unit and a display unit, comprising the steps of:
- analyzing an inputted document image with said area generation unit to extract separators to separate the objects of the document in order to segment said document image into a plurality of areas and create a tree structure in accordance with said areas and said separators;
- displaying said document image on said display unit together with a schematic representation of said tree structure;
- correcting said tree structure, if required, with said area generation unit by manipulation of said areas;
- generating a layout model, having nodes assigned, wherein the nodes have no attribute data associated therewith, wherein each node corresponds to each of respective said areas displayed in said schematic representation, by said layout model generation unit in order to display said layout model on said display unit; and
- assigning a previously inputted macroparameter to each of said nodes with no attribute data associated therewith, wherein said macroparameter rearranges said nodes when said layout model does not correspond to the document image.
- 2. A document image layout model generation apparatus for generating a layout model to define objects of a document image, comprising:
- an automatic area segmentation unit for analyzing an inputted document image to extract separators to separate the objects of the document image and to segment said document image into a tree structure in accordance with said separators;
- means for displaying said tree structure on a display unit together with said document image;
- an area structure modification unit for modifying said tree structure by manipulation of elements of said tree structure; and
- a layout model generation unit for completing a layout model having nodes assigned, wherein the nodes have no attribute data associated therewith, wherein each node corresponds to each of a respective area of said tree structure for display on said display unit, and for setting a previously inputted macroparameter which rearranges said nodes when said layout model does not correspond to the document image.
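As an informal illustration of the claimed flow (area tree, then a layout model whose nodes start without attribute data, then macroparameter assignment), the following is a minimal sketch and not the patented implementation. All names here (`Area`, `LayoutNode`, `generate_layout_model`, `assign_macroparameter`, and the `top_to_bottom` / `left_to_right` macroparameters) are hypothetical. The claims do not define how a macroparameter rearranges nodes, so the sketch assumes it is simply an ordering rule over each node's children.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# (left, top, right, bottom) in hypothetical pixel coordinates
Box = Tuple[int, int, int, int]


@dataclass
class Area:
    """One segmented area of the document image (a node of the area tree)."""
    name: str
    box: Box
    children: List["Area"] = field(default_factory=list)


@dataclass
class LayoutNode:
    """Layout-model node; it starts with no attribute data, as in the claims."""
    area_name: str
    box: Box
    attributes: Dict[str, str] = field(default_factory=dict)
    macroparameter: Optional[str] = None
    children: List["LayoutNode"] = field(default_factory=list)


def generate_layout_model(area: Area) -> LayoutNode:
    """Create one attribute-free layout node per area, mirroring the tree."""
    return LayoutNode(
        area.name,
        area.box,
        children=[generate_layout_model(c) for c in area.children],
    )


# Assumed macroparameters: each one maps to a sort key over a node's box.
ORDERINGS = {
    "top_to_bottom": lambda n: (n.box[1], n.box[0]),
    "left_to_right": lambda n: (n.box[0], n.box[1]),
}


def assign_macroparameter(node: LayoutNode, macro: str) -> None:
    """Assign the macroparameter to every node and reorder children by it."""
    node.macroparameter = macro
    node.children.sort(key=ORDERINGS[macro])
    for child in node.children:
        assign_macroparameter(child, macro)


# Area tree as it might come out of segmentation, deliberately out of order.
page = Area("page", (0, 0, 100, 100), [
    Area("body", (0, 20, 100, 100), [
        Area("column_2", (50, 20, 100, 100)),
        Area("column_1", (0, 20, 50, 100)),
    ]),
    Area("title", (0, 0, 100, 20)),
])

model = generate_layout_model(page)
assign_macroparameter(model, "top_to_bottom")
top_level = [c.area_name for c in model.children]
body_columns = [c.area_name for c in model.children[1].children]
```

In this sketch the layout model is built as a separate tree rather than by annotating the area tree, so that correcting the areas and editing the model remain independent steps.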
Priority Claims (1)
Number | Date | Country | Kind |
3-333778 | Dec 1991 | JPX | |
Parent Case Info
This application is a continuation of application Ser. No. 07/956,702, filed Jan. 5, 1992, now abandoned.
US Referenced Citations (3)
Number | Name | Date | Kind |
5073953 | Westdijk | Dec 1991 | |
5185813 | Tsujimoto | Feb 1993 | |
5379373 | Hayashi et al. | Jan 1995 | |
Non-Patent Literature Citations (3)
Entry |
Tsujimoto et al., "Understanding Multi-articled Documents", Proceedings of the 10th International Conference on Pattern Recognition, (IEEE 1990), pp. 551-556. |
Casey, "Intelligent Forms Processing", IBM Systems Journal, vol. 29 No. 3 (1990), pp. 435-450. |
Fletcher, "A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 10 No. 6 (Nov. 1988), pp. 910-918. |
Continuations (1)
 | Number | Date | Country |
Parent | 956702 | Oct 1992 | |