Search results for: speech tagging

Number of results: 128613

2000
James R. Curran Raymond K. Wong

Research in automatic Part of Speech (POS) tagging has been dominated by Markov Model (MM) taggers. Brill [1, 3, 6] has recently described a transformation-based system with comparable accuracy and with simpler algorithms and representation than MM taggers. We present a set-based formal model of natural language ambiguity and semantic tagging that forms a basis for the generalisation of the transf...

2013
Paul Rodrigues Sandra Kübler

This paper investigates incremental part of speech tagging for speech transcripts that contain multilingual intrasentential code-mixing, and compares the accuracy of a monolithic tagging model trained on a heterogeneous-language dataset to a model that switches between two homogeneous-language tagging models dynamically using word-by-word language identification. We find that the dynamic model,...
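
The dynamic approach described in this abstract can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the English/Spanish pair, the toy lexicons standing in for trained monolingual taggers, and the lookup-based language identifier are hypothetical and are not taken from the paper.

```python
# Hypothetical toy lexicons standing in for two trained monolingual taggers.
EN_TAGS = {"i": "PRON", "want": "VERB", "the": "DET"}
ES_TAGS = {"quiero": "VERB", "la": "DET", "casa": "NOUN"}


def identify_language(word):
    """Toy word-level language ID: lexicon lookup, defaulting to English."""
    return "es" if word.lower() in ES_TAGS else "en"


def tag_dynamic(words):
    """Tag each word with the model selected by word-by-word language ID."""
    tagged = []
    for word in words:
        lang = identify_language(word)
        lexicon = ES_TAGS if lang == "es" else EN_TAGS
        # Unknown words receive the placeholder tag "X".
        tagged.append((word, lang, lexicon.get(word.lower(), "X")))
    return tagged


result = tag_dynamic(["I", "want", "la", "casa"])
```

Because each word is routed as it arrives, the loop is incremental in the sense that it never needs to see the rest of the utterance before emitting a tag.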

2005
Saichon Jaiyen Peerasak Intarapaiboon Ponrudee Netisopakul

Thai part of speech (POS) tagging is a challenging problem in natural language processing. Many techniques, including artificial neural networks, have been suggested for POS tagging. Research on Thai POS tagging has so far focused only on assigning word types, not word features. This paper proposes a technique using a multilayer perceptron for tagging word features in Thai sentences. The f...

2005
Elliott Franco Drábek David Yarowsky

This paper presents an original approach to part-of-speech tagging of fine-grained features (such as case, aspect, and adjective person/number) in languages such as English where these properties are generally not morphologically marked. The goals of such rich lexical tagging in English are to provide additional features for word alignment models in bilingual corpora (for statistical machine tr...

2014
Lori Levin Manfred Stede

Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-tagging of written and edited texts. In order to capture the features of German spoken language, a distinct tagset is needed to respond to the kinds of elements which only occur in speech. In order to create such a coherent tagset the most prominent phenomena of spoken language need to be analyze...

Journal: CoRR 2012
Arif Nurwidyantoro Edi Winarko

In this paper, the MapReduce programming model is used to parallelize the training and tagging processes in maximum entropy part-of-speech tagging for Bahasa Indonesia. In the training process, MapReduce is used to implement dictionary, tagtoken, and feature creation. In the tagging process, MapReduce is used to tag the lines of a document in parallel. The training experiments showed that total training time u...
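
As a rough illustration of the map and reduce pattern this abstract refers to, the sketch below counts (token, tag) pairs across annotated lines. It is an assumption-laden toy: the `word/TAG` input format, the function names, and the sample sentences are hypothetical, and the code runs sequentially in one process rather than on a MapReduce cluster.

```python
from collections import Counter
from functools import reduce


def map_line(line):
    """Map step: count (token, tag) pairs in one annotated line."""
    pairs = (item.rsplit("/", 1) for item in line.split())
    return Counter((token, tag) for token, tag in pairs)


def reduce_counts(left, right):
    """Reduce step: merge two partial count tables."""
    return left + right


# Hypothetical annotated lines in an assumed word/TAG format.
lines = ["saya/PRON makan/VERB nasi/NOUN", "saya/PRON minum/VERB"]
counts = reduce(reduce_counts, map(map_line, lines), Counter())
```

Each call to `map_line` depends only on its own line, which is what would let a real framework distribute the map step across workers before merging the partial tables.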

2014
Swantje Westpfahl

Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-tagging of written and edited texts. In order to capture the features of German spoken language, a distinct tagset is needed to respond to the kinds of elements which only occur in speech. In order to create such a coherent tagset the most prominent phenomena of spoken language need to be analyze...

2007
Yuejie Zhang Zhiting Xu

We explore methods for applying Conditional Random Fields (CRFs) to Chinese part-of-speech tagging. We focus on POS tagging without pre-segmentation and propose a hierarchical Conditional Random Field that performs segmentation and POS tagging in a single step. Experiments will compare our method with existing methods on this task.

1997
Jae-Hoon Kim Chul-Su Lim Jungyun Seo

In this paper, we describe a method for assigning a part-of-speech tag in Korean to each morpheme. The method is based on a hidden Markov model which can be trained without using any tagged corpus. To reduce the amount of computation needed to process multiple observation sequences, which arise with unusual frequency in Korean part-of-speech tagging, we develop a revised Viterbi algorithm for determini...
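
For orientation, here is a minimal sketch of the standard Viterbi algorithm for a hidden Markov model. It is not the revised variant the abstract mentions, whose details are not given in this excerpt; the two-state model, the probabilities, and the English tokens are hypothetical toy values.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state sequence under an HMM (log space)."""

    def log(p):
        return math.log(p) if p > 0 else float("-inf")

    # best[t][s]: best log score of any path ending in state s at time t.
    best = [{s: log(start_p[s]) + log(emit_p[s].get(observations[0], 0))
             for s in states}]
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p] + log(trans_p[p][s])) for p in states),
                key=lambda item: item[1],
            )
            best[t][s] = score + log(emit_p[s].get(observations[t], 0))
            back[t][s] = prev
    # Follow the back-pointers from the best final state.
    path = [max(states, key=lambda s: best[-1][s])]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1]


# Hypothetical toy model with two tags.
states = ["N", "V"]
start_p = {"N": 0.6, "V": 0.4}
trans_p = {"N": {"N": 0.3, "V": 0.7}, "V": {"N": 0.8, "V": 0.2}}
emit_p = {"N": {"dogs": 0.5, "bark": 0.1}, "V": {"dogs": 0.1, "bark": 0.6}}
path = viterbi(["dogs", "bark"], states, start_p, trans_p, emit_p)
```

Working in log space avoids numeric underflow on long sequences, at the cost of treating zero probabilities as negative infinity.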

1996
Mihai Pop

Different approaches have been taken in order to solve the part-of-speech tagging problem. Several methods for unsupervised tagging have obtained good accuracies in practice. The approach taken by Brill [Bri95] obtains results comparable to the best existing taggers. In this paper we explore the details of this unsupervised part-of-speech tagger and we present a comparison to the Xerox tagger, wh...
