Towards Intelligent Written Cultural Heritage Processing - Lexical processing

نویسنده

  • Kiril Ribarov
چکیده

Through ACT (Annotated Corpora of Text) software package for lexical and corpus processing of European written cultural sources (currently used for processing of mediaeval Slavonic manuscripts) this work presents another step forward towards a contextual and intelligent heritage Information Technology framework. ACT is suitable for capturing characteristics of old written sources including rich language variability on word and sentential level. It is not the word-form, but its "understandings" that become central processing units, which can be assigned morphology distinctions, head-words (including recensional), translation equivalents, multi-word units, and correlation to other sources. The whole annotation process is automated, and individual sorting orders and morphology tags structures can be defined. ACT incorporates modules for: complex searches on one or more sources, creation of various ready-to-use documents, web text and image access, incorporation of lexical card-files into a corpus, and text-from-card-files reconstruction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Latest Prague Contributions to Written Cultural Heritage Processing

This work presents a software package ACT (Annotated Corpora of Text) for lexical and corpus processing of European written cultural sources (currently used for processing of mediaeval Slavonic manuscripts). I use ACT as a contribution towards a contextual and intelligent heritage Information Technology framework. The software is suitable for capturing characteristics of old written sources inc...

متن کامل

Voice knowledge acquisition system for the management of cultural heritage

This document presents our work on a definition and experimentation of a voice interface for cultural heritage inventory. This hybrid system includes signal processing, natural language techniques and knowledge modeling for future retrieval. We discuss the first results and give some points on future work.

متن کامل

Natural Language Processing for Cultural Heritage Domains

Museums, archives, libraries and other cultural heritage institutes maintain large collections of artefacts which are valuable knowledge sources for both experts and interested lay persons. Recently, more and more cultural heritage institutes have started to digitise their collections, for instance to make them accessible via web portals. However, while digitisation is a necessary first step to...

متن کامل

Ontology-Driven Processing and Management of Digital Rock Art Objects in IndianaMAS

This paper presents the Indiana Ontology for modeling the knowledge about Mount Bego’s rock art and its exploitation in the IndianaMAS project. Although many projects use ontologies for semantic processing of cultural heritage digital objects, we are not aware of such ontologies in the rock art domain. Also, the Indiana Ontology is fully and seamlessly integrated with the IndianaMAS framework c...

متن کامل

Towards Automated 3D Reconstruction of Defective Cultural Heritage Objects

Due to recent improvements in 3D acquisition and shape processing technology, the digitization of Cultural Heritage (CH) artifacts is gaining increased application in context of archival and archaeological research. This increasing availability of acquisition technologies also implies a need for intelligent processing methods that can cope with imperfect object scans. Specifically for Cultural ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004