End-to-End page-Level assessment of handwritten text recognition
نویسندگان
چکیده
The evaluation of Handwritten Text Recognition (HTR) systems has traditionally used metrics based on the edit distance between HTR and ground truth (GT) transcripts, at both character word levels. This is very adequate when experimental protocol assumes that GT text lines are same, which allows distances to be independently computed each given line. Driven by recent advances in pattern recognition, increasingly face end-to-end page-level transcription a document, where precision locating different their corresponding reading order (RO) play key role. In such case, standard do not take into account inconsistencies might appear. this paper, problem evaluating page level introduced detail. We analyse convenience using two-fold evaluation, accuracy RO goodness considered separately. Different alternatives proposed, analysed empirically compared through partially simulated real, full experiments. Results support validity proposed approach. An important conclusion an can adequately achieved just two simple well-known metrics: Word Error Rate (WER), takes sequentiality account, here re-formulated Bag Words (bWER), ignores order. While latter directly accurately assess intrinsic recognition errors, difference gracefully correlates with Normalised Spearman's Foot Rule Distance (NSFD), metric explicitly measures errors associated layout analysis flaws.
منابع مشابه
Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model
When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recogni...
متن کاملSEE: Towards Semi-Supervised End-to-End Scene Text Recognition
Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In recent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have been proposed. In this paper we present SEE, a step towards semi-supervised neural networks for scene text detection and recognition, that can be optimized end-t...
متن کاملEnd-to-End Text Recognition with Hybrid HMM Maxout Models
The problem of detecting and recognizing text in natural scenes has proved to be more challenging than its counterpart in documents, with most of the previous work focusing on a single part of the problem. In this work, we propose new solutions to the character and word recognition problems and then show how to combine these solutions in an end-to-end text-recognition system. We do so by levera...
متن کاملEnd-to-end Video-level Representation Learning for Action Recognition
From the frame/clip-level feature learning to the videolevel representation building, deep learning methods in action recognition have developed rapidly in recent years. However, current methods suffer from the confusion caused by partial observation training, or without end-to-end learning, or restricted to single temporal scale modeling and so on. In this paper, we build upon two-stream ConvN...
متن کاملJoint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other hand, automatic line segmentation algorithms are prone to errors, compromising the subsequent recognition. In this paper, we propose a modification of the popular and efficient Multi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2023
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2023.109695