Directional Stroke Width Transform to Separate Text and Graphics in City Maps
Abstract
City maps are among the most complex documents found in the real world. In these maps, text labels overlap graphics and appear in a variety of fonts, styles, and orientations. Text and graphic colours are usually not predefined, because they vary between map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the regions shared by text and graphic lines have similar features; hence, separating text from graphic lines is a challenging task in document analysis. Generally, current commercial OCR systems cannot recognize these text labels efficiently in city map processing. In this paper, we propose an image decomposition approach based on the stroke width feature to extract text labels from city maps. In our approach, we assign to each pixel of the image a local stroke width, based on its minimum distance to the borders along four directions. This mapping generates a representation suitable for distinguishing text pixels from non-text pixels. The experimental results on several varieties of city maps are promising.
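The per-pixel mapping described in the abstract can be illustrated with a minimal sketch. This is one possible reading of the abstract, not the authors' implementation: it assumes a binarized image (1 for ink, 0 for background), measures the length of the foreground run through each pixel along four directions (horizontal, vertical, and the two diagonals), and keeps the smallest run as the local stroke width. The names `DIRECTIONS`, `run_length`, and `directional_stroke_width` are hypothetical.

```python
# Four scan directions as (row step, column step):
# horizontal, vertical, main diagonal, anti-diagonal (assumed choice).
DIRECTIONS = [(0, 1), (1, 0), (1, 1), (1, -1)]


def run_length(img, r, c, dr, dc):
    """Count foreground pixels in the run through (r, c) along +/-(dr, dc)."""
    rows, cols = len(img), len(img[0])
    length = 1  # the pixel itself
    for sign in (1, -1):
        rr, cc = r + sign * dr, c + sign * dc
        # Walk until the image border or a background pixel is reached.
        while 0 <= rr < rows and 0 <= cc < cols and img[rr][cc]:
            length += 1
            rr += sign * dr
            cc += sign * dc
    return length


def directional_stroke_width(img):
    """Map each foreground pixel to its smallest directional run length.

    Background pixels are mapped to 0. Diagonal runs are counted in
    pixels, not in Euclidean distance (a simplification of this sketch).
    """
    rows, cols = len(img), len(img[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if img[r][c]:
                out[r][c] = min(
                    run_length(img, r, c, dr, dc) for dr, dc in DIRECTIONS
                )
    return out


# A long horizontal bar, two pixels thick, on a 5 x 7 background.
bar = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
widths = directional_stroke_width(bar)
```

Taking the minimum over directions means a long line is characterized by its thickness rather than its length. In this sketch, separating text from graphics would still require a later step that groups or thresholds pixels by the resulting values; the abstract does not specify that step.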
Similar resources
Text Detection in Natural Scenes with Stroke Width Transform
My project aims at detecting text segments in an image of a natural scene, by using an enhanced version of the Stroke Width Transform [1]. The application receives an RGB image to search in, and returns a new image where the discovered text segments are marked. Due to the features of the SWT, the resulting system is able to detect text regardless of its scale, direction, font and language.
Text/Graphics Separation in Maps
The separation of overlapping text and graphics is a challenging problem in document image analysis. This paper proposes a specific method of detecting and extracting characters that are touching graphics. It is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of graphics. It combines line continuation with the feature line ...
Detection and Extraction of Text Connected to Graphics in Maps
The separation of text from graphics has been challenging researchers for many years. The difficulty arises when there is text connected to graphics. This paper proposes a specific method of detecting and extracting graphics-connected characters. The proposed method is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of grap...
Scene Text Detection Based on Robust Stroke Width Transform and Deep Belief Network
Text detection in natural scene images is an open and challenging problem due to the significant variations of the appearance of the text itself and its interaction with the context. In this paper, we present a novel text detection method combining two main ingredients: the robust extension of Stroke Width Transform (SWT) and the Deep Belief Network (DBN) based discrimination of text objects fr...
Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform
A slew of semantic image content analysis techniques are specialized in extracting text embedded in images since it is a vital source of semantic information. A robust text detection step is the basic requirement for a scheme designed to extract text information from images. Text detection is still a challenging issue due to unconstrained color, sizes, alignments of characters, lighting and als...
Journal title:
Journal of Computer and Robotics, Volume 7, Issue 2, pages 1-7