Preliminary Experiments in Polish Dependency Parsing
Abstract
The preliminary experiments presented in this paper consist of inducing and evaluating a dependency parser for Polish. We train data-driven dependency models with publicly available parser-generation systems (MaltParser and MSTParser) on a converted dependency structure bank for Polish. The induced Polish dependency parsers are evaluated against a set of gold-standard dependency structures using labelled and unlabelled accuracy metrics.
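The abstract does not define its accuracy metrics. As a minimal sketch, assuming they follow the conventional token-level definitions of unlabelled and labelled attachment score, they could be computed as below. The function name `attachment_scores`, the `(head, label)` arc representation, and the label names in the example are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# One arc per token: (index of the head token, dependency label).
# Head index 0 stands for the artificial root. This encoding is an assumption.
Arc = Tuple[int, str]


def attachment_scores(gold: List[Arc], predicted: List[Arc]) -> Tuple[float, float]:
    """Return (unlabelled, labelled) accuracy over the tokens of a parse.

    Unlabelled: fraction of tokens whose predicted head matches the gold head.
    Labelled: fraction of tokens whose predicted head AND label both match.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must cover the same tokens")
    if not gold:
        raise ValueError("cannot score an empty parse")

    total = len(gold)
    head_hits = sum(1 for (g_head, _), (p_head, _) in zip(gold, predicted)
                    if g_head == p_head)
    labelled_hits = sum(1 for g, p in zip(gold, predicted) if g == p)
    return head_hits / total, labelled_hits / total


# Toy four-token sentence with made-up labels.
gold = [(2, "subj"), (0, "root"), (2, "obj"), (3, "adjunct")]
predicted = [(2, "subj"), (0, "root"), (2, "comp"), (2, "adjunct")]

unlabelled, labelled = attachment_scores(gold, predicted)
print(f"unlabelled = {unlabelled:.2f}, labelled = {labelled:.2f}")
```

In the toy sentence, three of the four heads are correct but only two tokens have both head and label correct, so the labelled score is lower than the unlabelled one.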
Similar resources
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a form of syntactic parsing that automatically analyzes the dependency structure of natural-language sentences and produces a dependency graph for each input sentence. Part-of-speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers perform POS tagging and dependency parsing in a pipeline mode. Unfortunately, in pipel...
Online Service for Polish Dependency Parsing and Results Visualisation
The paper presents a new online service for the dependency parsing of Polish. Given raw text as input, the service processes it and visualises output dependency trees. The service applies the parsing system – MaltParser – with a parsing model for Polish trained on the Polish Dependency Bank, and some additional publicly available tools.
The Effect of Morphemes on Persian Dependency Parsing
Data-driven systems can be adapted easily to different languages and domains. This trend led to the introduction of data-driven approaches in dependency parsing. The only prerequisite for data-driven approaches is the existence of appropriate corpora containing sentences and their associated dependency trees. Despite obtaining highly accurate results for the dependency parsing task in English langu...
A Dependency Treebank for Telugu
In this paper, we describe the annotation and development of a Telugu treebank following the Universal Dependencies framework. We manually annotated 1328 sentences from a Telugu grammar textbook, and the treebank is freely available from Universal Dependencies version 2.1. In this paper, we discuss some language-specific annotation issues and decisions, and report preliminary experiments with POS...
Experiments on Semi-supervised Dependency Parsing of a Morphologically Rich Language
This paper presents a set of preliminary experiments aimed at improving dependency parsing of Basque by using a semi-supervised technique. Our approach will make use of large unannotated corpora (over 140M word forms). We will investigate the use of information induced from a large raw corpus as well as an automatically parsed version. The first results show encouraging improvement...