Computational Models of Similarity in Lexical Ontologies

نویسنده

  • Nuno Alexandre Lopes Seco
چکیده

This thesis mainly concerns itself with the issue of semantic similarity and computational applications of it. Semantic similarity has for a long time been a subject of intense scholarship in the fields of Artificial Intelligence, Psychology and Cognitive Science. Computational models trying to imitate aspects of this cognitive ability date back to Quillian and his spreading activation algorithm. In the research presented here we propose a new computational approach to the assessment of semantic similarity between concepts. This work builds upon previous information theoretic attempts, from researchers interested in natural language processing, to mimic this ill-defined human aptitude. WordNet, an ontological lexical knowledge base, is the only source of knowledge exploited in the proposed algorithm. Hence, this is where our work differs from previous information theoretic attempts, since these have relied upon statistical information gathered from analyzing semantically tagged corpora. As a consequence, the sparse data problem that these corpus based strategies have to face is completely avoided, as is the time consuming effort to construct tagged corpora. Experimental studies presented show that this new approach not only outperforms previous formulations in the information theoretic category, but also other non-information theoretic formulations. We also outline how an analogy can be established between information theory and set theory that allows us to view most information theoretic models in terms of Tversky’s feature model. As a consequence of the correspondence we also propose a new similarity measure based on Tversky’s model. We also study a specific application of similarity that, by way of polysemy, will allow us infer novel and useful re-categorizations of existing concepts in WordNet. These creative re-categorizations are inspired by the Torrance test of creative thinking. Since our proposal depends on the quality of the lexical ontology, we will also propose an algorithm that explores the conceptual space defined by the ontology and suggests the inclusion of new concepts. These new concepts can be added at specific locations in the ontology, filling-in some of the lexical holes and asymmetries existent in WordNet. The filling of these holes provide a better conceptual grounding on which to base our model of similarity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring the Effectiveness of Lexical Ontologies for Modeling Temporal Relations with Markov Logic

Temporal analysis of events is a central problem in computational models of discourse. However, correctly recognizing temporal aspects of events poses serious challenges. This paper introduces a joint modeling framework and feature set for temporal analysis of events that utilizes Markov Logic. The feature set includes novel features derived from lexical ontologies. An evaluation suggests that ...

متن کامل

GMO: A Graph Matching for Ontologies

Ontology matching is an important task to achieve interoperation between semantic web applications using different ontologies. Structural similarity plays a central role in ontology matching. However, the existing approaches rely heavily on lexical similarity, and they mix up lexical similarity with structural similarity. In this paper, we present a graph matching approach for ontologies, calle...

متن کامل

A novel algorithm for ontology matching

Ontology matching is an essential aspect of the Semantic Web with a goal of finding alignments among the entities of given ontologies. Ontology matching is a necessary step for establishing interoperation and knowledge sharing among Semantic Web applications. In this study we present an algorithm and a tool developed based on this algorithm to find correspondences among entities of input ontolo...

متن کامل

Preliminary Results of Logical Ontology Pattern Detection using SPARQL and Lexical Heuristics

Ontology design patterns were proposed in order to assist the ontology engineering task, providing models of specific construction representing a particular form of knowledge. Various kinds of patterns have since been introduced and classes of patterns identified. Detecting these patterns in existing ontologies is needed in various scenarios, for example the detection of the the two parts of an...

متن کامل

IAMA results for OAEI 2013

This paper presents the results of IAMA on OAEI 2013. IAMA (Institute of Automation’s Matcher) is an ontology matching system with the capability to deal with large scale ontologies. IAMA is designed to find out the correspondences between two ontologies by using multiple similarity measures. Candidate filtering technique is adopted when processing ontologies at large scale. 1 Presentation of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005