Accessing Heterogeneous Linguistic Data — Generic XML-based Representation and Flexible Visualization

نویسندگان

  • Stefanie Dipper
  • Michael Götze
چکیده

Annotation of linguistic data increasingly focuses on information beyond the (morpho-)syntactic level. Moreover, annotated data of less-studied languages is growing in importance. To maximally profit from this data, straightforward and user-friendly access has to be provided. In this paper, we describe a linguistic database that is accessed via a web browser and offers flexible visualization of multiply annotated data. Data is internally represented by a generic XML-based format.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cost-Based Query Optimization in a Heterogeneous Distributed Semi-Structured Environment

How to efficiently process queries in an heterogeneous and distributed data integration environment is an interesting and unsolved topic. Our research project proposes an approach for providing a generic cost framework for query optimization in an XML-based mediation system called XLive, which integrates heterogeneous data sources. Our approach relies on cost annotation on an XQuery logical rep...

متن کامل

ANNIS3: A new architecture for generic corpus query and visualization

This paper is concerned with the data structures, properties of query languages and visualization facilities required for the generic representation of richly annotated, heterogeneous linguistic corpora. We propose that above and beyond a general graph based data-model, which is becoming increasingly popular in many complex annotation formats, a well-defined concept of multiple, potentially con...

متن کامل

Integrating Xml with Relational Databases Using Middleware Approach

Over the past few years, XML has become the undisputable lingua franca standard both for semi-structured data representation and exchange format over the Internet, and also content management in various e-business worlds, especially the B2B and B2C enterprise applications. However, most of these organisations still rely heavily on existing relational database management systems (RDBMS) to store...

متن کامل

A generic approach to software support for linguistic annotation using XML

Large-scale linguistic annotation is currently employed for a wide range of purposes, including comparing communication under different conditions, testing psycholinguistic hypotheses, and training natural language engines. Current software support for linguistic annotation is poor, with much of it written for one-off tasks using special purpose data representations and handling routines. This ...

متن کامل

A generic approach to software support for linguistic annotation using

Large-scale linguistic annotation is currently employed for a wide range of purposes, including comparing communication under different conditions, testing psycholinguistic hypotheses, and training natural language engines. Current software support for linguistic annotation is poor, with much of it written for one-off tasks using special purpose data representations and handling routines. This ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005