Similarity-based Text Recognition by Deeply Supervised Siamese Network

نویسندگان

  • Ehsan Hosseini-Asl
  • Angshuman Guha
چکیده

In this paper, we propose a new text recognition model based on measuring the visual similarity of text and predicting the content of the unlabeled texts. First a Siamese network is trained with deep supervision on a labeled training dataset. This network projects texts into a similarity manifold. The Deeply Supervised Siamese network learns visual similarity of texts. Then a K-nearest neighbor classifier is used to predict unlabeled text based on similarity distance to labeled texts. The performance of the model is evaluated on three datasets of machine-print and hand-written text combined. We demonstrate that the model reduces the cost of human estimation by 50%− 85%. The error of the system is less than 0.1%. The results also demonstrate that the predicted labels are sometimes better than human labels e.g. spelling correction.

منابع مشابه

Image Classification using Transfer Learning from Siamese Networks based on Text Metadata Similarity

Convolutional neural networks learn about underlying image representations just by optimizing to a supervised classification. This project attempts to learn better features from images by training a network based on the similarity of pairs of images. Similarity of images will be computed based on the similarity of the text associated with the images as metadata, specifically captions for MSCOCO...

متن کامل

Off-Topic Spoken Response Detection Using Siamese Convolutional Neural Networks

In this study, we developed an off-topic response detection system to be used in the context of the automated scoring of nonnative English speakers’ spontaneous speech. Based on transcriptions generated from an ASR system trained on non-native speakers’ speech and various semantic similarity features, the system classified each test response as an on-topic or off-topic response. The recent succ...

متن کامل

Similarity Learning Based Query Modeling for Keyword Search

In this paper, we propose a novel approach for query modeling using neural networks for posteriorgram based keyword search (KWS). We aim to help the conventional large vocabulary continuous speech recognition (LVCSR) based KWS systems, especially on out-of-vocabulary (OOV) terms by converting the task into a template matching problem, just like the query-by-example retrieval tasks. For this, we...

متن کامل

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

A deep scattering spectrum - Deep Siamese network pipeline for unsupervised acoustic modeling

Recent work has explored deep architectures for learning acoustic features in an unsupervised or weakly-supervised way for phone recognition. Here we investigate the role of the input features, and in particular we test whether standard mel-scaled filterbanks could be replaced by inherently richer representations, such as derived from an analytic scattering spectrum. We use a Siamese network us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:
  • CoRR

دوره abs/1511.04397  شماره 

صفحات  -

تاریخ انتشار 2015