A Graph-Based Framework for Structured Prediction Tasks in Sanskrit

نویسندگان

چکیده

We propose a framework using energy-based models for multiple structured prediction tasks in Sanskrit. Ours is an arc-factored model, similar to the graph-based parsing approaches, and we consider of word segmentation, morphological parsing, dependency syntactic linearization, prosodification, “prosody-level” task introduce this work. search-based framework, which expects graph as input, where relevant linguistic information encoded nodes, edges are then used indicate association between these nodes. Typically, state-of-the-art morphosyntactic morphologically rich languages still rely on hand-crafted features their performance. But here, automate learning feature function. The function so learned, along with search space construct, encode consider. This enables us substantially reduce training data requirements low 10%, compared neural models. Our experiments Czech Sanskrit show language-agnostic nature train highly competitive both languages. Moreover, our incorporate language-specific constraints prune filter candidates during inference. obtain significant improvements by incorporating into model. In all discuss Sanskrit, either achieve results or ours only data-driven solution those tasks.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

HC-Search: A Learning Framework for Search-based Structured Prediction

Structured prediction is the problem of learning a function that maps structured inputs to structured outputs. Prototypical examples of structured prediction include part-ofspeech tagging and semantic segmentation of images. Inspired by the recent successes of search-based structured prediction, we introduce a new framework for structured prediction called HC-Search. Given a structured input, t...

متن کامل

Reducing Labeling Effort for Structured Prediction Tasks

A common obstacle preventing the rapid deployment of supervised machine learning algorithms is the lack of labeled training data. This is particularly expensive to obtain for structured prediction tasks, where each training instance may have multiple, interacting labels, all of which must be correctly annotated for the instance to be of use to the learner. Traditional active learning addresses ...

متن کامل

a framework for identifying and prioritizing factors affecting customers’ online shopping behavior in iran

the purpose of this study is identifying effective factors which make customers shop online in iran and investigating the importance of discovered factors in online customers’ decision. in the identifying phase, to discover the factors affecting online shopping behavior of customers in iran, the derived reference model summarizing antecedents of online shopping proposed by change et al. was us...

15 صفحه اول

Graph-Based Posterior Regularization for Semi-Supervised Structured Prediction

We present a flexible formulation of semisupervised learning for structured models, which seamlessly incorporates graphbased and more general supervision by extending the posterior regularization (PR) framework. Our extension allows for any regularizer that is a convex, differentiable function of the appropriate marginals. We show that surprisingly, non-linearity of such regularization does not...

متن کامل

Structured Prediction Theory Based on Factor Graph Complexity

We present a general theoretical analysis of structured prediction with a series of new results. We give new data-dependent margin guarantees for structured prediction for a very wide family of loss functions and a general family of hypotheses, with an arbitrary factor graph decomposition. These are the tightest margin bounds known for both standard multi-class and general structured prediction...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computational Linguistics

سال: 2021

ISSN: ['1530-9312', '0891-2017']

DOI: https://doi.org/10.1162/coli_a_00390