parts of speech tagging

Search results for: parts of speech tagging

Number of results: 21177608

An Empirical Examination of Challenges in Chinese Parsing

2013

Jonathan K. Kummerfeld, Daniel Tse, James R. Curran, Dan Klein

Aspects of Chinese syntax result in a distinctive mix of parsing challenges. However, the contribution of individual sources of error to overall difficulty is not well understood. We conduct a comprehensive automatic analysis of error types made by Chinese parsers, covering a broad range of error types for large sets of sentences, enabling the first empirical ranking of Chinese error types by t...

Multilingual Lexicalized Constituency Parsing with Word-Level Auxiliary Tasks

2017

Benoît Crabbé, Maximin Coavoux

We introduce a constituency parser based on a bi-LSTM encoder adapted from recent work (Cross and Huang, 2016b; Kiperwasser and Goldberg, 2016), which can incorporate a lower level character biLSTM (Ballesteros et al., 2015; Plank et al., 2016). We model two important interfaces of constituency parsing with auxiliary tasks supervised at the word level: (i) part-of-speech (POS) and morphological...

Part-of-Speech Tagging Guidelines for the Penn Treebank Project

2009

Beatrice Santorini

STTS goes Kiez - Experiments on Annotating and Tagging Urban Youth Language

Journal: JLCL 2013

Ines Rehbein, Sören Schalowski

The Stuttgart-Tübingen Tag Set (STTS) (Schiller et al., 1995) has long been established as a quasi-standard for part-of-speech (POS) tagging of German. It has been used, with minor modifications, for the annotation of three German newspaper treebanks, the NEGRA treebank (Skut et al., 1997), the TiGer treebank (Brants et al., 2002) and the TüBa-D/Z (Telljohann et al., 2004). One major drawback, ...

Feature extraction in opinion mining through Persian reviews

Journal: Journal of Artificial Intelligence and Data Mining 2015

E. Golpar-Rabooki, J. Rezaeenour, S. Zarghamifar

Opinion mining deals with an analysis of user reviews for extracting their opinions, sentiments and demands in a specific area, which can play an important role in making major decisions in such area. In general, opinion mining extracts user reviews at three levels of document, sentence and feature. Opinion mining at the feature level is taken into consideration more than the other two levels d...

A Probe into Ambiguities of Determinative-Measure Compounds

Journal: IJCLCLP 2005

Shih-Min Li, Su-Chu Lin, Keh-Jiann Chen

This paper aims to further probe into the problems of ambiguities for automatic identification of determinative-measure compounds (DMs) in Chinese and to develop sets of rules to identify DMs and their parts of speech. It is known that Chinese DMs are identifiable by regular expressions. DM rule matching helps one solve word segmentation ambiguities, and parts of speech help one improve sense r...
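The claim that DMs can be picked out with regular expressions can be illustrated with a minimal Python sketch. The character inventories and the pattern below are hypothetical and deliberately tiny; they are assumptions made for illustration, not the rule sets developed in the paper.

```python
import re

# Hypothetical, heavily reduced inventories (assumptions for illustration only).
NUMERALS = "零一二两三四五六七八九十百千万"
DEMONSTRATIVES = "这那每各"
MEASURES = "个本张只条位件次"

# A DM here is: (optional demonstrative + numeral) or (bare demonstrative),
# followed by exactly one measure word.
DM_PATTERN = re.compile(
    rf"(?:[{DEMONSTRATIVES}]?(?:[{NUMERALS}]+|\d+)|[{DEMONSTRATIVES}])[{MEASURES}]"
)


def find_dms(text: str) -> list[str]:
    """Return every substring of `text` that matches the toy DM pattern."""
    return DM_PATTERN.findall(text)


if __name__ == "__main__":
    print(find_dms("我买了三本书和这两张票"))
```

A pattern this small will over- and under-match on real text (for example, 本 is not always a measure word), which is the kind of ambiguity the abstract refers to.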

A Coherent Scrutinization on Syntactic Categories for Tagging Tamil Lexicon

2015

Ananthi Sheshasaayee

The rule-governed arrangement of words is termed syntax. Natural languages have their own syntactic rules, which reflect their latent features. Some languages allow free word order, while others place conditions on word order. As a consequence, the smallest unit in a sentence, called a word or lexicon, has a unique function which determines the nature of the sentence...

Two-level Word Class Categorization Model in Analytic Languages and Its Implications for POS Tagging in Modern Chinese Corpora

2015

Renqiang Wang, Changning Huang

The study of word classes has a history of over 4000 years, and the word class problem in over 1000 analytic languages like Modern Chinese can be seen as the Goldbach Conjecture in linguistics. This paper first outlines the existing problems in the POS tagging of Modern Chinese corpora with a case study of 自信. Then it introduces the Two-level Word Class Categorization Model in analytic language...

Introducing a Machine-Based Approach Using the Lesk Algorithm and Syntactic Tagging for Word Sense Disambiguation

Journal: Iranian Journal of Information Processing and Management 2018

Elham Alaei Abouzar

The present study introduces a machine-based approach for word sense disambiguation (WSD). In Persian, a morphologically complex language in which many homographs occur, one way to perform WSD is to assign the correct part-of-speech (POS) tags to words before WSD. Since noun and adjective homographs are frequent in Persian POS-tagged corpora, POS tag disambi...
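The idea of fixing the POS tag before disambiguating the sense can be sketched as follows. This is a simplified Lesk (gloss-overlap) variant over a hypothetical English toy sense inventory; the `Sense` class, the `SENSES` table, the stopword list, and the `lesk` function are all assumptions made for illustration, not the study's system or its Persian resources.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sense:
    name: str
    pos: str
    gloss: str


# Hypothetical toy sense inventory (assumption; not from the study).
SENSES = {
    "duck": [
        Sense("duck.n.bird", "NOUN", "a swimming bird with a broad bill that lives near water"),
        Sense("duck.n.meat", "NOUN", "the meat of a duck cooked and eaten as food"),
        Sense("duck.v.lower", "VERB", "to lower the head or body quickly to avoid being hit"),
    ]
}

STOPWORDS = {"a", "an", "the", "of", "to", "or", "and", "with", "that", "as", "it", "we"}


def tokens(text: str) -> set[str]:
    """Lowercase, split on whitespace, and drop stopwords."""
    return {w for w in text.lower().split() if w not in STOPWORDS}


def lesk(word: str, pos: str, context: str):
    """Pick the sense whose gloss shares the most words with the context.

    Senses are first filtered by the POS tag, so the overlap count only
    has to separate senses within one part of speech.
    """
    candidates = [s for s in SENSES.get(word, []) if s.pos == pos]
    if not candidates:
        return None
    ctx = tokens(context) - {word}
    # max() keeps the first candidate on ties.
    return max(candidates, key=lambda s: len(ctx & tokens(s.gloss)))


if __name__ == "__main__":
    print(lesk("duck", "NOUN", "we cooked the duck and served it as food"))
```

With the POS filter in place, the verb sense is never compared against noun contexts, which shrinks the candidate set before any gloss overlap is computed.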

Automatic Text Categorization of Mathematical Word Problems

2009

Suleyman Cetintas, Luo Si, Yan Ping Xin, Dake Zhang, Joo Young Park

This paper describes a novel application of text categorization for mathematical word problems, namely Multiplicative Compare and Equal Group problems. The empirical results and analysis show that common text processing techniques such as stopword removal and stemming should be selectively used. It is highly beneficial not to remove stopwords and not to do stemming. Part of speech tagging shoul...
