Contextual Text Block Detection Towards Scene Text Understanding
نویسندگان
چکیده
Most existing scene text detectors focus on detecting characters or words that only capture partial messages due to missing contextual information. For a better understanding of in scenes, it is more desired detect blocks (CTBs) which consist one multiple integral units (e.g., characters, words, phrases) natural reading order and transmit certain complete messages. This paper presents detection, new setup detects CTBs for texts scenes. We formulate the by dual detection task first then groups them into CTB. To this end, we design novel clustering technique treats as tokens (belonging same CTB) an ordered token sequence. In addition, create two datasets SCUT-CTW-Context ReCTS-Context facilitate future research, where each CTB well annotated sequence units. Further, introduce three metrics measure local accuracy, continuity, global accuracy. Extensive experiments show our method accurately effectively facilitates downstream tasks such classification translation. The project available at https://sg-vilab.github.io/publication/xue2022contextual/ .
منابع مشابه
Natural Scene Text Understanding
In a society driven by visual information and with the drastic expansion of low-priced cameras, vision techniques are more and more considered and text recognition is nowadays a fast changing field, which is included in a large spectrum, named text understanding. Previously, text recognition was dealing with documents only; those which were acquired with flatbed, sheet-fed or mounted imaging de...
متن کاملScene Text Understanding
The task is character recognition from real world images such as sign boards. This report presents a work in progrss along with the some interesting results. Sparse autoencoders have been used to learn a codebook of basis functions, and the settings for obtaining the best results on this task is studied
متن کاملUnderstanding Text in Scene Images
With the rapid growth of camera-based mobile devices, applications that answer questions such as, “What does this sign say?" are becoming increasingly popular. This is related to the problem of optical character recognition (OCR) where the task is to recognize text occurring in images. The OCR problem has a long history in the computer vision community. However, the success of OCR systems is la...
متن کاملTowards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning
OF THE DISSERTATION Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning
متن کاملTowards Interactive Text Understanding
This position paper argues for an interactive approach to text understanding. The proposed model extends an existing semantics-based text authoring system by using the input text as a source of information to assist the user in re-authoring its content. The approach permits a reliable deep semantic analysis by combining automatic information extraction with a minimal amount of human intervention.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-19815-1_22