The Ontologymapper Plug-in: Supporting Semantic Annotation of Text-documents by Classification

نویسندگان

  • Peter Scheir
  • Philip Hofmair
  • Michael Granitzer
  • Stefanie N. Lindstaedt
چکیده

In this contribution we present a tool for annotating documents, which are used for workintegrated learning, with concepts from an ontology. To allow for annotating directly while creating or editing an ontology, the tool was realized as a plug-in for the ontology editor Protégé. Annotating documents with semantic metadata is a laborious task, most of the time knowledge representations are created independently from the resources that should be annotated and additionally in most work environments a high number of documents exist. To increase the efficiency of the person annotating, in our tool the process of assigning concepts to text-documents is supported by automatic text-classification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PDForum A Threaded Interface for Collaborative Annotation of PDF Documents

The Portable Document Format (PDF) is currently becoming the standard for digital documents. One of the tasks regularly performed on digital documents is annotating and reviewing annotated documents. In collaborative annotation, the document is a heterogeneous collection of annotations by several persons, which at some level include annotating previous annotations done by other persons. Two dif...

متن کامل

Semantator: Semantic annotator for converting biomedical text to linked data

More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it challenging to easily browse and query the data of interest in them. One approach to facilitate browsing and querying biomedical text is to convert the plain text to a linked web of data, i.e., converting data originally in free text to structured formats with defined meta-...

متن کامل

Linguistic Annotation for the Semantic Web

Establishing the semantic web on a large scale implies the widespread annotation of web documents with ontology-based knowledge markup. For this purpose, tools have been developed that allow for semi-automatic annotation of web documents with ontology-based metadata. However, given that a large number of web documents consist either fully or at least partially of free text, language technology ...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007