speech tagging

نتایج جستجو برای: speech tagging

تعداد نتایج: 128613 فیلتر نتایج به سال:

Unsupervised Structure Prediction with Non-Parallel Multilingual Guidance

2011

Shay B. Cohen Dipanjan Das Noah A. Smith

We describe a method for prediction of linguistic structure in a language for which only unlabeled data is available, using annotated data from a set of one or more helper languages. Our approach is based on a model that locally mixes between supervised models from the helper languages. Parallel data is not used, allowing the technique to be applied even in domains where human-translated texts ...

متن کامل

Distributional Part-of-Speech Tagging

1995

Hinrich Schitze

This paper presents an algorithm for tagging words whose part-of-speech properties are unknown. Unlike previous work, the algorithm categorizes word tokens in con$ezt instead of word ~ypes. The algorithm is evaluated on the Brown Corpus.

متن کامل

Variation in noun and pronoun frequencies in a sociohistorical corpus of English

Journal: :LLC 2011

Tanja Säily Terttu Nevalainen Harri Siirtola

Many corpus linguists make the tacit assumption that part-of-speech frequencies remain constant during the period of observation. In this article, we will consider two related issues: (1) the reliability of part-of-speech tagging in a diachronic corpus, and (2) shifts in tag ratios over time. The purpose is both to serve the users of the corpus by making them aware of potential problems, and to...

متن کامل

Adapting a Parser to Clinical Text by Simple Pre-processing Rules

2013

Maria Skeppstedt

Sentence types typical to Swedish clinical text were extracted by comparing sentence part-of-speech tag sequences in clinical and in standard Swedish text. Parsings by a syntactic dependency parser, trained on standard Swedish, were manually analysed for the 33 sentence types most typical to clinical text. This analysis resulted in the identification of eight error types, and for two of these e...

متن کامل

Opinion Sentences Extraction and Polarity Classification Using Automatically Generated Templates

2010

Wan-Chi Huang Meng-Chun Lin Shih-Hung Wu

The paper reports the approach of cyut system in NTCIR-8 MOAT subtask. We submitted the results of opinion judgment and polarity judgment in Traditional Chinese. Our study focused on automatically generated templates as the only features of classifier. The templates combining words with Part-of-speech or named-entity (POS/NE) tags are acquired from the training set. Experiment results show that...

متن کامل

Part-of-speech implications of affixes

Journal: :Mech. Translat. & Comp. Linguistics 1966

Lois L. Earl

This paper describes a systematic investigation of the extent to which the part of speech of words can be identified from their prefixes and suffixes. The results indicate that it is possible to determine, with 95 per cent accuracy, the inclusive part of speech of an affixed word from a consideration of its prefixes, suffixes, and length. By "inclusive" parts of speech we mean a string that wil...

متن کامل

Shallow Parsing as Part-of-Speech Tagging

2000

Miles Osborne

Treating shallow parsing as part-of-speech tagging yields results comparable with other, more elaborate approaches. Using the CoNLL 2000 training and testing material, our best model had an accuracy of 94.88%, with an overall FB1 score of 91.94%. The individual FB1 scores for NPs were 92.19%, VPs 92.70% and PPs 96.69%.

متن کامل

Design of Chinese Morphological Analyzer

2002

Hui-hsin Tseng Keh-Jiann Chen

This is a pilot study which aims at the design of a Chinese morphological analyzer which is in state to predict the syntactic and semantic properties of nominal, verbal and adjectival compounds. Morphological structures of compound words contain the essential information of knowing their syntactic and semantic characteristics. In particular, morphological analysis is a primary step for predicti...

متن کامل

Instance Metrics Improvement by Probabilistic Support

2000

Héctor Jiménez Guillermo Morales

The use of distance functions in order to determine nearest instance class at Memory Based Learning methods may be crucial when there are no exact matchings. We add relative information over unknown feature values to improve the information extract on the training instances. An experiment was carried out for Spanish Part-Of-Speech tagging of unknown words nding a better performance with our mod...

متن کامل

Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: a Bayesian Non-Parametric Approach

2009

Benjamin Snyder Tahira Naseem Jacob Eisenstein Regina Barzilay

We investigate the problem of unsupervised part-of-speech tagging when raw parallel data is available in a large number of languages. Patterns of ambiguity vary greatly across languages and therefore even unannotated multilingual data can serve as a learning signal. We propose a non-parametric Bayesian model that connects related tagging decisions across languages through the use of multilingua...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید