WSD system based on specialized Hidden Markov Model (upv-shmm-eaw)
نویسندگان
چکیده
We present a supervised approach to Word Sense Disambiguation (WSD) based on Specialized Hidden Markov Models. We used as training data the Semcor corpus and the test data set provided by Senseval 2 competition and as dictionary the Wordnet 1.6. We evaluated our system on the English all-word task of the Senseval-3 competition. 1 Description of the WSD System We consider WSD to be a tagging problem (Molina et al., 2002a). The tagging process can be formulated as a maximization problem using the Hidden Markov Model (HMM) formalism. Let O be the set of output tags considered, and I , the input vocabulary of the application. Given an input sentence, I = i1, . . . , iT , where ij ∈ I , the tagging process consists of finding the sequence of tags (O = o1, . . . , oT , where oj ∈ O) of maximum probability on the model, that is: Ô = arg max O P (O|I) = arg max O ( P (O) · P (I|O)
منابع مشابه
Selective Prediction of Financial Trends with Hidden Markov Models
Focusing on short term trend prediction in a financial context, we consider the problem of selective prediction whereby the predictor can abstain from prediction in order to improve performance. We examine two types of selective mechanisms for HMM predictors. The first is a rejection in the spirit of Chow’s well-known ambiguity principle. The second is a specialized mechanism for HMMs that iden...
متن کاملHandwritten Character Recognition Using Structural Hidden Markov Models
This paper introduces a methodology to recognize handwritten characters using “Structural Hidden Markov Models” (SHMM). The proposed approach is motivated by the need to model complex structures which are encountered in many areas such as speech/handwriting recognition, content-based information retrieval etc. The observations considered are strings that produce the structures. These observatio...
متن کاملDIC Structural HMM based IWAK-means to Enclosed Face Data
This paper identifies two novel techniques for face features extraction based on two different multi-resolution analysis tools; the first called curvelet transform while the second is waveatom transform. The resultant features are trained and tested via three improved hidden Markov Model (HMM) classifiers, such as: Structural HMM (SHMM), Deviance Information CriterionInverse Weighted Average K-...
متن کاملInformation Retrieval and Text Categorization with Semantic Indexing
In this paper, we present the effect of the semantic indexing using WordNet senses on the Information Retrieval (IR) and Text Categorization (TC) tasks. The documents have been sense-tagged using a Word Sense Disambiguation (WSD) system based on Specialized Hidden Markov Models (SHMMs). The preliminary results showed that a small improvement of the performance was obtained only in the TC task. ...
متن کاملIntrusion Detection Using Evolutionary Hidden Markov Model
Intrusion detection systems are responsible for diagnosing and detecting any unauthorized use of the system, exploitation or destruction, which is able to prevent cyber-attacks using the network package analysis. one of the major challenges in the use of these tools is lack of educational patterns of attacks on the part of the engine analysis; engine failure that caused the complete training, ...
متن کامل