A Multi-functional Approach for Document Layout Analysis
نویسنده
چکیده
The important pre-requisites in document layout analysis are identifying number of text lines, number of columns and segmentation of textual and non-textual regions. The literature reveals two major procedures viz. global and local approaches used for extraction of text lines. The examples of global approaches are projection profile and Hough transform, which have serious problem with multicolumn layouts. The local approach is connected component linking which consume lot of computational time, or requires complete page layout analysis as input prior to the reliable identification of text lines. In this paper, we propose a new and efficient method for segmentation and identification of number of text lines based on image dilation and region labeling for a machine printed binary document image containing text or/and non-text, single or multi-column layouts. The proposed method segments the individual text lines present in the document, reports the number of columns in the documents, detects and segments the number of nontextual regions present in the given document image. The flexibility of this method further permit’s to work on text line features, which are invariably used in the later phase of image analysis for higher level interpretation and matching. The experimental results are encouraging and indicate that better accuracy could be achieved by using this method.
منابع مشابه
An integrated approach to document decomposition and structural analysis
A document image is a visual representation of a paper document, such as a journal article page, a cover page of facsimile transmission, ooce correspondence, an application form, etc. Document image understanding as a research endeavor consists of developing processes for taking a document through various representations: from scanned image to semantic representation. This paper describes docum...
متن کاملAn Integrated Approach for Automatic Semantic Structure Extraction in Document Images
In this paper we present an integrated approach for semantic structure extraction in document images. Document images are initially processed to extract both their layout and logical structures on the base of geometrical and spatial information. Then, textual content of logical components is employed for automatic semantic labeling of layout structures. To support the whole process different ma...
متن کاملText Block Recognition in Multi-Oriented Handwritten Documents
Automatic detection of text blocks is an important step before applying OCR or word-spotting techniques to document images. Our approach focusses on handwritten (historical) documents and uses the Gabor Transformation to facilitate this task. Apart from the main text, which often consists of rectangular shaped text blocks, marginalia are of special interest here. These areas are generally uncon...
متن کاملDiscovering Knowledge through Multi-modal Association Rule Mining for Document Image Analysis
The paper introduces a descriptive data mining method to discover knowledge for the task of automatic categorization in document image analysis. We argue that a document image is a multi-modal unit of analysis whose semantics is deduced from a combination of textual content, layout structure and logical structure. So, the method considers simultaneously different modalities of document represen...
متن کاملA Local-to-Global Approach to Complex Document Layout Analysis
Document layout analysis is concerned about the decomposition of raster representation of a document into several regions which contain homogeneous entities. This paper describes a new approach to segment documents with complex layout and degraded image quality. The approach uses a local-to-global strategy which can be adapted to a variety of documents. The system was tested on different Englis...
متن کامل