morphosyntactic features

نتایج جستجو برای: morphosyntactic features

تعداد نتایج: 524361 فیلتر نتایج به سال:

Trigram morphosyntactic tagger for Polish

2004

Lukasz Debowski

We introduce an implementation of a plain trigram part-of-speech tagger which appears to work well on Polish texts. At this moment the tagger achieves 9.4% error rate, which makes it signficantly better than our previous stochastic disambiguator. Since the trigram model for Polish behaves similarly to Czech, we hope to reach Czech state-of-art error rate when the quality of the training data im...

متن کامل

Morphosyntactic annotation of CHILDES transcripts.

Journal: :Journal of child language 2010

Kenji Sagae Eric Davis Alon Lavie Brian Macwhinney Shuly Wintner

Corpora of child language are essential for research in child language acquisition and psycholinguistics. Linguistic annotation of the corpora provides researchers with better means for exploring the development of grammatical constructions and their usage. We describe a project whose goal is to annotate the English section of the CHILDES database with grammatical relations in the form of label...

متن کامل

Slavic Languages in Universal Dependencies

2015

Daniel Zeman D. Zeman

Universal Dependencies (UD) is a project that is developing crosslinguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning and linguistic research from a language typology perspective. It is a merger and extension of several previous efforts aimed at finding unified approaches to parts of speech, morph...

متن کامل

The Impact of Semantic and Morphosyntactic Ambiguity on Automatic Humour Recognition

2009

Antonio Reyes Davide Buscaldi Paolo Rosso

Humour is one of the most amazing characteristics that defines us as human beings and social entities. Its study supposes a deep insight into several areas such as linguistics, psychology or philosophy. From the Natural Language Processing (NLP) perspective, recent researches have shown that humour can be automatically generated and recognized with some success. In this work we present a study ...

متن کامل

Bottom Up Tagset Design from Maximally Reduced Tagset

2000

Péter Dienes Csaba Oravecz

For highly innectional languages, where the number of morpho-syntactic descriptions (MSD) is very high, the use of a reduced tagset is crucial for reasons of implementation problems as well as the problem of sparse data. The standard procedure is to start from the large set of MSDs incorporating all morphosyntactic features and design a reduced tagset by eliminating the attributes which play no...

متن کامل

Parsing Hindi with MDParser

2013

Alexander Volokh Günter Neumann

We describe our participation in the MTPIL Hindi Parsing Shared Task-2012. Our system achieved the following results: 82.44% LAS/90.91% UAS (auto) and 85.31% LAS/92.88% UAS (gold). Our parser is based on the linear classification, which is suboptimal as far as the accuracy is concerned. The strong point of our approach is its speed. For parsing development the system requires 0.935 seconds, whi...

متن کامل

The Morphosyntax of Romanian Cliticization

1998

Paola Monachesi

In this paper, motivations are presented to argue in favor of the affixal status of Romanian pronominal clitics. It will be suggested that they should not be considered lexical items, i.e. ‘signs’, which are located in a special position by rules of syntax, but a complex of syntactic and semantic information which is provided in the lexicon for the morphophonological realization of the cliticiz...

متن کامل

Morphosyntactic Evaluation Protocol (MEP): validation of content

Journal: :CoDAS 2020

متن کامل

Towards a theory of morphosyntactic focus marking

Journal: :Natural Language and Linguistic Theory 2023

Abstract Based on six detailed case studies of languages in which focus is marked morphosyntactically, we propose a novel formal theory marking, can capture these as well the familiar English-type prosodic marking. Special attention paid to patterns syncretism, that is, when different size and/or location are indistinguishably realized by same form. The key ingredients our approach complex cons...

متن کامل

Retrieving Annotated Corpora for Corpus Annotation

2004

Kyôsuke Yoshida Taiichi Hashimoto Takenobu Tokunaga Hozumi Tanaka

This paper introduces a tool Bonsai which supports human in annotating corpora with morphosyntactic information, and in retrieving syntactic structures stored in the database. Integrating annotation and retrieval enables users to annotate a new instance while looking back at the already annotated sentences which share the similar morphosyntactic structure. We focus on the retrieval part of the ...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید