Linguistic Knowledge and Word Sense Disambiguation
نویسندگان
چکیده
pd The work in this thesis has been carried out under the auspices of the Beha-vioral and Cognitive Neurosciences (BCN) research school, Groningen, and has been part of the Pionier project Algorithms for Linguistic Processing, supported by grant number 220-70-001 from the Netherlands Organisation for Scientific Research (NWO). Acknowledgements As with every PhD project, even though there is but one author on the front cover, many different people have implicitly contributed to the book you now have in front of you. They have encouraged and helped me before and during my PhD position at the University of Groningen and I would like to use this opportunity to thank them. It all started with a few of my University teachers who were involved in my MA thesis project: Georges Lüdi and Pius ten Hacken at the University of Basel, and Ulrich Heid at the Institut für Maschinelle Sprachverarbeitung in Stuttgart. Their feedback and encouragement made me believe in my talents as a researcher and helped me decide to continue my education with a PhD. I am especially grateful to my supervisors, John Nerbonne and Gertjan van Noord. Without all their support, scrupulous (and fast!) reading and critical comments this thesis would not be what it now is. I would also like to thank the members of my reading committee, for their valuable comments and suggestions. Among the people who have made me feel welcome and at home in Groningen during the last four and a half years, I would like to mention my colleagues at the Alfa Informatica corridor. Stasinos Konstantopoulos, Ivelin Stoianov and Wouter Jansen were a great " welcoming committee " , always ready to go for coffee, a beer or a movie. Special thanks go to my (current and past) office mates Leonoor van der Beek, for discussions on subjects ranging from linguistic examples, translations to various languages, and L A T E X questions to traveling, music and recipes (tested during the AiO etentjes with Leonoor, Menno and Robbert). I would also like to express my gratitude to everyone involved in the Pionier project for listening to (unfinished) ideas and giving valuable comments on presentations: Gertjan am especially indebted to Rob v vi for patiently explaining his maximum entropy package to me. I would also like to thank Leonie Bosveld-de Smet for the many moments we have spent discussing life and everything related to it, and the …
منابع مشابه
Towards High-performance Word Sense Disambiguation by Combining Rich Linguistic Knowledge and Machine Learning Approaches
متن کامل
Evaluating the Impact of Some Linguistic Information on the Performances of a Similarity-based and Translation-oriented Word-Sense Disambiguation Method
In this article, we present an experiment of linguistic parameter tuning in the representation of the semantic space of polysemous words. We evaluate quantitatively the influence of some basic linguistic knowledge (lemmas, multi-word expressions, grammatical tags and syntactic relations) on the performances of a similarity-based Word-Sense disambiguation method. The question we try to answer, b...
متن کاملCommonsense Reasoning with Verbs
KT encodes commonsense knowledge (CK) about entities and relations in a cross-classified Ontology and a database of generic (prototypical and inherent) information. The verb Ontology combines linguistic and psycholinguistic classification. The generic information for verbs encodes typical associations with actions and events. This knowledge base is useful for word-sense disambiguation, inferenc...
متن کاملWord Sense vs. Word Domain Disambiguation: A Maximum Entropy Approach
In this paper, a supervised learning system of word sense disambiguation is presented. It is based on conditional maximum entropy models. This system acquires the linguistic knowledge from an annotated corpus and this knowledge is represented in the form of features. The system were evaluated both using WordNet’s senses and domains as the sets of classes of each word. Domain labels are obtained...
متن کاملSemi-Automatic Extension of Large-Scale Linguistic Knowledge Bases
Linguistic resources are essential for the success of many AI tasks. Building a new lexical resource from scratch or combining heterogeneous resources is not only complex and time-consuming, but can also lead to knowledge inconsistency and redundancy. In this paper, we present a novel method for the large-scale semantic enrichment of a computational linguistic resource. To this end, with the ai...
متن کاملRobust and Efficient Page Rank for Word Sense Disambiguation
Graph-based methods that are en vogue in the social network analysis area, such as centrality models, have been recently applied to linguistic knowledge bases, including unsupervised Word Sense Disambiguation. Although the achievable accuracy is rather high, the main drawback of these methods is the high computational demanding whenever applied to the large scale sense repositories. In this pap...
متن کامل