Contextual Text Block Detection Towards Scene Text Understanding

نویسندگان

چکیده

Most existing scene text detectors focus on detecting characters or words that only capture partial messages due to missing contextual information. For a better understanding of in scenes, it is more desired detect blocks (CTBs) which consist one multiple integral units (e.g., characters, words, phrases) natural reading order and transmit certain complete messages. This paper presents detection, new setup detects CTBs for texts scenes. We formulate the by dual detection task first then groups them into CTB. To this end, we design novel clustering technique treats as tokens (belonging same CTB) an ordered token sequence. In addition, create two datasets SCUT-CTW-Context ReCTS-Context facilitate future research, where each CTB well annotated sequence units. Further, introduce three metrics measure local accuracy, continuity, global accuracy. Extensive experiments show our method accurately effectively facilitates downstream tasks such classification translation. The project available at https://sg-vilab.github.io/publication/xue2022contextual/ .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Natural Scene Text Understanding

In a society driven by visual information and with the drastic expansion of low-priced cameras, vision techniques are more and more considered and text recognition is nowadays a fast changing field, which is included in a large spectrum, named text understanding. Previously, text recognition was dealing with documents only; those which were acquired with flatbed, sheet-fed or mounted imaging de...

متن کامل

Scene Text Understanding

The task is character recognition from real world images such as sign boards. This report presents a work in progrss along with the some interesting results. Sparse autoencoders have been used to learn a codebook of basis functions, and the settings for obtaining the best results on this task is studied

متن کامل

Understanding Text in Scene Images

With the rapid growth of camera-based mobile devices, applications that answer questions such as, “What does this sign say?" are becoming increasingly popular. This is related to the problem of optical character recognition (OCR) where the task is to recognize text occurring in images. The OCR problem has a long history in the computer vision community. However, the success of OCR systems is la...

متن کامل

Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning

OF THE DISSERTATION Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning

متن کامل

Towards Interactive Text Understanding

This position paper argues for an interactive approach to text understanding. The proposed model extends an existing semantics-based text authoring system by using the input text as a source of information to assist the user in re-authoring its content. The approach permits a reliable deep semantic analysis by combining automatic information extraction with a minimal amount of human intervention.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-19815-1_22