We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
An Integrated Approach to Document Decomposition and Structural Analysis.
- Authors
Niyogi, Debashish; Srihari, Sargur N.
- Abstract
A document image is a visual representation of a paper document, such as a journal article page, a cover page of facsimile transmission, office correspondence, an application form, etc. Document image understanding as a research endeavor consists of developing processes for faking a document through various representations, from scanned image to semantic representation. This article describes document decomposition and structural analysis, which constitutes one of the major processes involved in document image understanding. The current state of the art and future directions in the areas of document segmentation, layout analysis, and logical block grouping are indicated. A system that performs decomposition and structural analysis (including logical grouping and read-order determination) on complex multi-articled documents is presented. This system uses bottom-up segmentation techniques to identify the block structure of a document, and layout rules to classify and group these blocks into logical units that represent meaningful subdivisions of the document. Experimental results showing the efficiency of this approach are presented and discussed.
- Subjects
STRUCTURAL analysis (Engineering); STRUCTURAL engineering; DOCUMENT imaging systems; IMAGING systems; DATA transmission systems; IMAGE transmission
- Publication
International Journal of Imaging Systems & Technology, 1996, Vol 7, Issue 4, p330
- ISSN
0899-9457
- Publication type
Article
- DOI
10.1002/(SICI)1098-1098(199624)7:4<330::AID-IMA8>3.0.CO;2-9