Directional Stroke Width Transform to Separate Text and Graphics in City Maps

Authors

Ali Ghafari-Beranghar

Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

Ehsanollah Kabir

Department of Electrical and Computer Engineering, Tarbiat Modarres University, Tehran, Iran

Kaveh Kangarloo

Department of Electrical Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran

Abstract

City maps are among the more complex documents encountered in practice. In these maps, text labels overlap with graphics and appear in a variety of fonts, styles, and orientations. Text and graphic colours are usually not predefined, because they vary between map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the regions shared by text and graphic lines have similar features; hence, separating text from graphic lines is a challenging task in document analysis. In general, current commercial OCR systems cannot recognize these text labels efficiently in city map processing. In this paper, we propose an image decomposition approach based on the stroke width feature to extract text labels from city maps. In our approach, we assign each pixel of the image a local stroke width based on the minimum distance to the borders in four directions. This mapping yields a representation suitable for distinguishing text pixels from non-text pixels. Experimental results on several varieties of city maps are promising.
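The per-pixel mapping described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a binarized image (foreground = ink), and it assumes that "minimum distance from borders in four directions" can be approximated by the shortest run of foreground pixels passing through a pixel along the horizontal, vertical, and two diagonal orientations. The names `DIRECTIONS`, `run_length`, and `directional_stroke_width` are hypothetical.

```python
import numpy as np

# Assumed scan orientations: horizontal, vertical, and the two diagonals.
DIRECTIONS = ((0, 1), (1, 0), (1, 1), (1, -1))


def run_length(mask, r, c, dr, dc):
    """Count foreground pixels in the run through (r, c) along +/-(dr, dc)."""
    rows, cols = mask.shape
    length = 1  # the pixel itself
    for sign in (1, -1):
        rr, cc = r + sign * dr, c + sign * dc
        # Walk until the image edge or a background pixel is reached.
        while 0 <= rr < rows and 0 <= cc < cols and mask[rr, cc]:
            length += 1
            rr += sign * dr
            cc += sign * dc
    return length


def directional_stroke_width(mask):
    """Map each foreground pixel to its shortest run over the four orientations.

    Background pixels are left at 0. This is an illustrative approximation
    of a local stroke width, not the method from the paper.
    """
    mask = np.asarray(mask, dtype=bool)
    widths = np.zeros(mask.shape, dtype=int)
    for r, c in zip(*np.nonzero(mask)):
        widths[r, c] = min(
            run_length(mask, r, c, dr, dc) for dr, dc in DIRECTIONS
        )
    return widths


# A 7x7 image with a horizontal band three pixels thick (rows 2 to 4).
band = np.zeros((7, 7), dtype=bool)
band[2:5, :] = True
band_widths = directional_stroke_width(band)

# A 7x7 image with a vertical line one pixel thick (column 3).
line = np.zeros((7, 7), dtype=bool)
line[:, 3] = True
line_widths = directional_stroke_width(line)
```

Under these assumptions, pixels in the middle of the thick band receive a larger value than pixels on the thin line, which is the kind of contrast a stroke-width representation relies on. Note that diagonal runs are counted in pixels rather than Euclidean length, and that values near the image edge are truncated; a practical version would need to handle both.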


Similar resources


Text Detection in Natural Scenes with Stroke Width Transform

My project aims at detecting text segments in an image of a natural scene, by using an enhanced version of the Stroke Width Transform [1]. The application receives an RGB image to search in, and returns a new image where the discovered text segments are marked. Due to the features of the SWT, the resulting system is able to detect text regardless of its scale, direction, font and language.


Text/Graphics Separation in Maps

The separation of overlapping text and graphics is a challenging problem in document image analysis. This paper proposes a specific method of detecting and extracting characters that are touching graphics. It is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of graphics. It combines line continuation with the feature line ...


Detection and Extraction of Text Connected to Graphics in Maps

The separation of text from graphics has been challenging researchers for many years. The difficulty arises when there is text connected to graphics. This paper proposes a specific method of detecting and extracting graphics-connected characters. The proposed method is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of grap...


Scene Text Detection Based on Robust Stroke Width Transform and Deep Belief Network

Text detection in natural scene images is an open and challenging problem due to the significant variations of the appearance of the text itself and its interaction with the context. In this paper, we present a novel text detection method combining two main ingredients: the robust extension of Stroke Width Transform (SWT) and the Deep Belief Network (DBN) based discrimination of text objects fr...


Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform

A slew of semantic image content analysis techniques are specialized in extracting text embedded in images since it is a vital source of semantic information. A robust text detection step is the basic requirement for a scheme designed to extract text information from images. Text detection is still a challenging issue due to unconstrained color, sizes, alignments of characters, lighting and als...




Journal title:
Journal of Computer and Robotics

Volume 7, Issue 2, pages 1-7
