A Graph-Based Framework for Structured Prediction Tasks in Sanskrit
نویسندگان
چکیده
We propose a framework using energy-based models for multiple structured prediction tasks in Sanskrit. Ours is an arc-factored model, similar to the graph-based parsing approaches, and we consider of word segmentation, morphological parsing, dependency syntactic linearization, prosodification, “prosody-level” task introduce this work. search-based framework, which expects graph as input, where relevant linguistic information encoded nodes, edges are then used indicate association between these nodes. Typically, state-of-the-art morphosyntactic morphologically rich languages still rely on hand-crafted features their performance. But here, automate learning feature function. The function so learned, along with search space construct, encode consider. This enables us substantially reduce training data requirements low 10%, compared neural models. Our experiments Czech Sanskrit show language-agnostic nature train highly competitive both languages. Moreover, our incorporate language-specific constraints prune filter candidates during inference. obtain significant improvements by incorporating into model. In all discuss Sanskrit, either achieve results or ours only data-driven solution those tasks.
منابع مشابه
HC-Search: A Learning Framework for Search-based Structured Prediction
Structured prediction is the problem of learning a function that maps structured inputs to structured outputs. Prototypical examples of structured prediction include part-ofspeech tagging and semantic segmentation of images. Inspired by the recent successes of search-based structured prediction, we introduce a new framework for structured prediction called HC-Search. Given a structured input, t...
متن کاملReducing Labeling Effort for Structured Prediction Tasks
A common obstacle preventing the rapid deployment of supervised machine learning algorithms is the lack of labeled training data. This is particularly expensive to obtain for structured prediction tasks, where each training instance may have multiple, interacting labels, all of which must be correctly annotated for the instance to be of use to the learner. Traditional active learning addresses ...
متن کاملa framework for identifying and prioritizing factors affecting customers’ online shopping behavior in iran
the purpose of this study is identifying effective factors which make customers shop online in iran and investigating the importance of discovered factors in online customers’ decision. in the identifying phase, to discover the factors affecting online shopping behavior of customers in iran, the derived reference model summarizing antecedents of online shopping proposed by change et al. was us...
15 صفحه اولGraph-Based Posterior Regularization for Semi-Supervised Structured Prediction
We present a flexible formulation of semisupervised learning for structured models, which seamlessly incorporates graphbased and more general supervision by extending the posterior regularization (PR) framework. Our extension allows for any regularizer that is a convex, differentiable function of the appropriate marginals. We show that surprisingly, non-linearity of such regularization does not...
متن کاملStructured Prediction Theory Based on Factor Graph Complexity
We present a general theoretical analysis of structured prediction with a series of new results. We give new data-dependent margin guarantees for structured prediction for a very wide family of loss functions and a general family of hypotheses, with an arbitrary factor graph decomposition. These are the tightest margin bounds known for both standard multi-class and general structured prediction...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computational Linguistics
سال: 2021
ISSN: ['1530-9312', '0891-2017']
DOI: https://doi.org/10.1162/coli_a_00390