Ontology Evaluation through Text Classification

نویسندگان

  • Yael Dahan Netzer
  • David Gabay
  • Meni Adler
  • Yoav Goldberg
  • Michael Elhadad
چکیده

We present a new method to evaluate a search ontology, which relies on mapping ontology instances to textual documents. On the basis of this mapping, we evaluate the adequacy of ontology relations by measuring their classification potential over the textual documents. This data-driven method provides concrete feedback to ontology maintainers and a quantitative estimation of the functional adequacy of the ontology relations towards search experience improvement. We specifically evaluate whether an ontology relation can help a semantic search engine support exploratory search. We test this ontology evaluation method on an ontology in the Movies domain, that has been acquired semi-automatically from the integration of multiple semi-structured and textual data sources (e.g., IMDb and Wikipedia). We automatically construct a domain corpus from a set of movie instances by crawling the Web for movie reviews (both professional and user reviews). The 1-1 relation between textual documents (reviews) and movie instances in the ontology enables us to translate ontology relations into text classes. We verify that the text classifiers induced by key ontology relations (genre, keywords, actors) achieve high performance and exploit the properties of the learned text classifiers to provide concrete feedback on the ontology. The proposed ontology evaluation method is general and relies on the possibility to automatically align textual documents to ontology instances.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Evaluation of Search Ontologies in the Entertainment Domain using Text Classification

Information Retrieval (IR) research has recently started addressing the information need of exploratory search. where the searcher may be unfamiliar with the domain or not have decided what is the goal of his query. A popular tool to support exploratory search is the use of faceted search. The implementation of faceted search requires that documents be annotated by metadata in the form of attri...

متن کامل

Topic Modeling and Classification of Cyberspace Papers Using Text Mining

The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...

متن کامل

English Sentence Evaluation Method using Text Clustering and Semantic Ontology

Aiming at the problem that the identification precision in intonation evaluation and analysis of English sentences is not high, this paper proposes a method of English sentences on the basis of ontology graph clustering evaluation. First, we study the ontology-graph evaluation model of English sentences, conduct objective evaluation to English sentences based on the fundamental framework of KL ...

متن کامل

Automated compound classification using a chemical ontology

UNLABELLED BACKGROUND Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that provide a hierarchical classification of compounds into general compound classes of bio...

متن کامل

Ontology learning from text: Evaluation of learned ontologies

Ontologies provide a structural organizational knowledge, they support the exchange and sharing of information. Moreover, one of the main benefits of using ontologies is the ability to infer new knowledge that allows the development of more realistic applications such as semantic search, automated reasoning, classification, query answering, among others. The need for overcoming the bottleneck, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009