Towards open-set text recognition via label-to-prototype learning

نویسندگان

چکیده

• Formulating a new open-set task requires spotting and cognizing novel characters. Proposing framework that handles characters without retraining. fast rectification technique for text recognition. Scene recognition is popular research topic which also extensively utilized in the industry. Although many methods have achieved satisfactory performance close-set challenges, these lose feasibility scenarios, where collecting data or retraining models could yield high cost. For example, annotating samples foreign languages can be expensive, whereas model each time when “novel” character discovered from historical documents costs both resources. In this paper, we introduce formulate demands capability to spot recognize A label-to-prototype learning proposed as baseline task. Specifically, introduces generalizable mapping function build prototypes (class centers) seen unseen classes. An predictor then reject according prototypes. The implementation of rejection over out-of-set allows automatic unknown incoming stream. Extensive experiments show our method achieves promising on variety zero-shot, close-set, datasets.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Label embedding for text recognition

The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields (CRF). This paper explores a new paradigm that departs from this bottom-up view. In our approach, every label from a lexicon is embedded to an Euclidean vector space. We refer to this...

متن کامل

Towards Open-Text Semantic Parsing via Multi-Task Learning of Structured Embeddings

Open-text (or open-domain) semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR). Unfortunately, large scale systems cannot be easily machine-learned due to lack of directly supervised data. We propose here a method that learns to assign MRs to a wide range of text (using a dictionary of more than 70,000 words, whic...

متن کامل

RODRIGUEZ-SERRANO, PERRONNIN: LABEL EMBEDDING FOR TEXT RECOGNITION 1 Label embedding for text recognition

The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields (CRF). This paper explores a new paradigm that departs from this bottom-up view. We propose to embed word labels and word images into a common Euclidean space. Given a word image to b...

متن کامل

Learning a Neural-network-based Representation for Open Set Recognition

Open set recognition problems exist in many domains. For example in security, new malware classes emerge regularly; therefore malware classi€cation systems need to identify instances from unknown classes in addition to discriminating between known classes. In this paper we present a neural network based representation for addressing the open set recognition problem. In this representation insta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition

سال: 2023

ISSN: ['1873-5142', '0031-3203']

DOI: https://doi.org/10.1016/j.patcog.2022.109109