FASText: Efficient Unconstrained Scene Text Detector
Abstract
Observing that text in virtually any script is formed of strokes, we propose a novel, easy-to-implement stroke detector which is significantly faster and produces significantly fewer false detections than the detectors commonly used in scene text localization. First, stroke-specific keypoints are efficiently detected. Text fragments are subsequently extracted by local thresholding guided by keypoint properties. Classification based on effectively calculated features then eliminates non-text segmentations. The stroke-specific keypoints produce 2 times fewer segmentations and still detect 25% more characters than the commonly exploited MSER detector, and the process is 4 times faster. After a novel efficient classification step, the number of segmentations is 7 times lower than with the standard method, and the process is still almost 3 times faster. All stages of the proposed pipeline are scale- and rotation-invariant and support a wide variety of scripts (Latin, Hebrew, Chinese, etc.) and fonts. When the proposed detector is plugged into a scene text localization and recognition pipeline, state-of-the-art text localization accuracy is maintained whilst the processing time is significantly reduced.
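The thresholding stage mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a dark stroke on a lighter background, a keypoint that already carries a suitable threshold, and a plain 4-connected flood fill. The function name `segment_from_keypoint` and its parameters are hypothetical.

```python
from collections import deque


def segment_from_keypoint(image, seed, threshold):
    """Collect the 4-connected pixels with intensity <= threshold,
    starting from the keypoint position `seed` given as (row, col).

    Assumption: the stroke is darker than the background, so the
    threshold acts as an upper bound on stroke intensity.
    """
    rows, cols = len(image), len(image[0])
    r0, c0 = seed
    if image[r0][c0] > threshold:
        return set()  # the seed itself is not part of a dark stroke

    region = {seed}
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and (nr, nc) not in region and image[nr][nc] <= threshold:
                region.add((nr, nc))
                queue.append((nr, nc))
    return region


# Toy grayscale image: an L-shaped dark stroke and a separate dark bar.
image = [
    [200, 200, 200, 200, 200],
    [200,  30, 200,  40, 200],
    [200,  35, 200,  40, 200],
    [200,  30,  32, 200, 200],
    [200, 200, 200, 200, 200],
]
stroke = segment_from_keypoint(image, seed=(1, 1), threshold=100)
print(sorted(stroke))
```

Only the pixels connected to the seed are returned, so the separate dark bar in the fourth column is left for another keypoint. In a full pipeline each such segmentation would then be passed to a text/non-text classifier.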
Similar resources
ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene
Arbitrary-oriented text detection in the wild is a very challenging task, due to the aspect ratio, scale, orientation, and illumination variations. In this paper, we propose a novel method, namely Arbitrary-oriented Text (or ArbText for short) detector, for efficient text detection in unconstrained natural scene images. Specifically, we first adopt the circle anchors rather than the rectangular...
TextBoxes++: A Single-Shot Oriented Scene Text Detector
Scene text detection is an important step of a scene text recognition system and also a challenging problem. Different from general object detection, the main challenges of scene text detection lie in the arbitrary orientations, small sizes, and significantly variant aspect ratios of text in natural images. In this paper, we present an end-to-end trainable fast scene text detector, named TextBoxes++,...
A Real-Time Scene Text to Speech System
The system is based on an efficient end-to-end real-time scene text localization and recognition method [1,2,3]. Individual characters are detected as Class-Specific Extremal Regions (CSERs) [4]. An efficient sequential classifier selects only ERs with locally maximal probability p(region|character), with complexity linear in the number of image pixels. The stability requirement of MSERs [5] is...
A robust arbitrary text detection system for natural scene images
Text detection in real-world images captured in unconstrained environments is an important yet challenging computer vision problem due to a great variety of appearances, cluttered backgrounds, and character orientations. In this paper, we present a robust system based on the concepts of Mutual Direction Symmetry (MDS), Mutual Magnitude Symmetry (MMS) and Gradient Vector Symmetry (GVS) propert...
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
An end-to-end method for multi-language scene text localization, recognition and script identification is proposed. The approach is based on a set of convolutional neural nets. The method, called E2E-MLT, achieves state-of-the-art performance for both joint localization and script identification in natural images and in cropped word script identification. E2E-MLT is the first published multi-lan...