Using BLSTM for Interpretation of 2D Languages - Case of Handwritten Mathematical Expressions

نویسندگان

  • Ting Zhang
  • Harold Mouchère
  • Christian Viard-Gaudin
چکیده

In this work, we study how to extend the capability of BLSTM networks to process data which are not only text strings but graphical two-dimensional languages such as handwritten mathematical expressions. The proposed solution aims at transforming the mathematical expression description into a sequence including at the same time symbol labels and relationship labels, so that classical supervised sequence labeling with recurrent neural networks can be applied. For simple one-dimensional (1-D) expression, we use the Right label to segment one symbol from the next one, as with the standard blank label for regular text. For genuine twodimensional (2-D) expressions, we introduce additional specific labels assigned to each of the different possible spatial relationships that exist between sub-expressions. As a result, BLSTM network is able to perform at the same time the symbol recognition task and the segmentation task, which is a new perspective for the mathematical expression domain. MOTS-CLÉS : Reconnaissance d’expressions mathématiques, écriture manuscrite, réseau récurrent, BLSTM.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spotting handwritten words and REGEX using a two stage BLSTM-HMM architecture

In this article, we propose a hybrid model for spotting words and regular expressions (REGEX) in handwritten documents. The model is made of the state-of-the-art BLSTM (Bidirectional Long Short Time Memory) neural network for recognizing and segmenting characters, coupled with a HMM to build line models able to spot the desired sequences. Experiments on the Rimes database show very promising re...

متن کامل

A hybrid classifier for handwritten mathematical expression recognition

In this paper we propose a hybrid symbol classifier within a global framework for online handwritten mathematical expression recognition. The proposed architecture aims at handling mathematical expression recognition as a simultaneous optimization of symbol segmentation, symbol recognition, and 2D structure recognition under the restriction of a mathematical expression grammar. To improve the c...

متن کامل

A Hybrid BLSTM-HMM for Spotting Regular Expressions

This article concerns the spotting of regular expressions (REGEX) in handwritten documents using a hybrid model. Spotting REGEX in a document image allow to consider further extraction tasks such as document categorization or named entities extraction. Our model combines state of the art BLSTM recurrent neural network for character recognition and segmentation with a HMM model able to spot the ...

متن کامل

Multimedia and Data Management

Despite the recent advances in handwriting recognition, handwritten twodimensional (2D) languages are still a challenge. Electrical schemas, chemical equations and mathematical expressions are examples of such 2D languages. In this case, the recognition problem is particularly difficult due to the two dimensional layout of the language. The main goal of our work is to study the application of t...

متن کامل

Recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden Markov models

This paper describes a formal model for the recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden Markov models. Hidden Markov models are used to recognize mathematical symbols, and a stochastic context-free grammar is used to model the relation between these symbols. This formal model makes possible to use classic algorithms for parsin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Document Numérique

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2016