part of speech

Search results for: part of speech

Number of results: 21195534

Instance Metrics Improvement by Probabilistic Support

2000

Héctor Jiménez, Guillermo Morales

The use of distance functions in order to determine nearest instance class at Memory Based Learning methods may be crucial when there are no exact matchings. We add relative information over unknown feature values to improve the information extract on the training instances. An experiment was carried out for Spanish Part-Of-Speech tagging of unknown words finding a better performance with our mod...

Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: a Bayesian Non-Parametric Approach

2009

Benjamin Snyder, Tahira Naseem, Jacob Eisenstein, Regina Barzilay

We investigate the problem of unsupervised part-of-speech tagging when raw parallel data is available in a large number of languages. Patterns of ambiguity vary greatly across languages and therefore even unannotated multilingual data can serve as a learning signal. We propose a non-parametric Bayesian model that connects related tagging decisions across languages through the use of multilingua...

Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments

2011

Kevin Gimpel, Nathan Schneider, Brendan T. O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, Noah A. Smith

We address the problem of part-of-speech tagging for English data from the popular microblogging service Twitter. We develop a tagset, annotate data, develop features, and report tagging results nearing 90% accuracy. The data and tools have been made available to the research community with the goal of enabling richer text analysis of Twitter and related social media data sets.

Morphological and Part-of-Speech Tagging of Historical Language Data: A Comparison

Journal: JLCL 2011

Stefanie Dipper

This paper deals with morphological and part-of-speech tagging applied to manuscripts written in Middle High German. I present the results of a set of experiments that involve different levels of token normalization and dialect-specific subcorpora. As expected, tagging with “normalized”, quasi-standardized tokens performs best. Normalization improves accuracies by .–. percentage points, r...

STTS 2.0? Improving the Tagset for the Part-of-Speech-Tagging of German Spoken Data

2014

Swantje Westpfahl

Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-tagging of written and edited texts. In order to capture the features of German spoken language, a distinct tagset is needed to respond to the kinds of elements which only occur in speech. In order to create such a coherent tagset the most prominent phenomena of spoken language need to be analyze...

The Computational Complexity of Rule-Based Part-of-Speech Tagging

2003

Karel Oliva, Pavel Kveton, Roman Ondruska

Data-Driven Methods for PoS Tagging and Chunking of Swedish

2001

Beáta Megyesi

An Improved Tag Dictionary for Faster Part-of-Speech Tagging

2015

Robert Moore

Ratnaparkhi (1996) introduced a method of inferring a tag dictionary from annotated data to speed up part-of-speech tagging by limiting the set of possible tags for each word. While Ratnaparkhi’s tag dictionary makes tagging faster but less accurate, an alternative tag dictionary that we recently proposed (Moore, 2014) makes tagging as fast as with Ratnaparkhi’s tag dictionary, but with no decr...

Choosing a Spanish Part-of-Speech tagger for a lexically sensitive task

Journal: Procesamiento del Lenguaje Natural 2015

Carla Parra Escartín, Héctor Martínez Alonso

In this article, four Part-of-Speech (PoS) taggers for Spanish are compared. The evaluation has been carried out without prior training or tuning of the PoS taggers. To allow for a comparison across PoS taggers, their tagsets have been mapped to the universal PoS tagset (Petrov, Das, and McDonald, 2012). The PoS taggers have also been compared as regards the information they provide and how the...

Towards Robust Cross-Domain Domain Adaptation for Part-of-Speech Tagging

2013

Tobias Schnabel, Hinrich Schütze

We investigate the robustness of domain adaptation (DA) representations and methods across target domains using part-of-speech (POS) tagging as a case study. We find that there is no single representation and method that works equally well for all target domains. In particular, there are large differences between target domains that are more similar to the source domain and those that are less s...
