Bilingual Corpus – Digital Repository for Preservation of Language Heritage

نویسندگان

  • Ludmila Dimitrova
  • Radovan Garabík
چکیده

The article briefly reviews bilingual Slovak-Bulgarian/BulgarianSlovak parallel and aligned corpus. The corpus is collected and developed as results of the collaboration in the frameworks of the joint research project between Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, and Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences. The multilingual corpora are large repositories of language data with an important role in preserving and supporting the world's cultural heritage, because the natural language is an outstanding part of the human cultural values and collective memory, and a bridge between cultures. This bilingual corpus will be widely applicable to the contrastive studies of the both Slavic languages, will also be useful resource for language engineering research and development, especially in machine translation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bilingual children and adult heritage speakers: The range of comparison

This paper compares the language of child bilinguals and adult unbalanced bilinguals (heritage speakers) against that of bilingual native speakers of their home language (baseline). We identify four major vectors of correspondence across the language spoken by these three groups. First, all varieties may represent a given linguistic property in a similar way (child bilinguals = adult heritage s...

متن کامل

Tone restoration in transcribed Kammu: Decision-list word sense disambiguation for an unwritten language

The RWAAI (Repository and Workspace for Austroasiatic Intangible heritage) project aims at building a digital archive out of existing legacy data from the Austroasiatic language family. One aspect of the project is the preservation of analogue legacy data. In this context, we have at our hands a large number of mostly-phonemic transcriptions of narrative monologues, often with accompanying soun...

متن کامل

Relational Database Preservation through XML modelling

Digital Archives are complex structures composed of human resources, state of the art technologies, policies and data. Due to the heritage keeping role that archives assume in our society, it is important to make sure that, the data that is produced by our organizations is preserved accordingly in order do document is activity and provide evidence of their activities. Information stored in an a...

متن کامل

Preservation Planning: A Comparison Between Two Implementations

This paper examines preservation planning as it is implemented within the National Library’s preservation repository (Rosetta) and compares it directly to the PLATO tool created as part of the PLANETS project. Preservation planning is both a business precondition and the systematic framework defining any preservation action. At the National Library of New Zealand Te Puna Mātauranga o Aotearoa, ...

متن کامل

Culture Heritage Digital Repositories. Research Questions

This discussion is about innovative solutions for assembling multimedia digital repositories for collaborative use in specific contexts and communities and enhancing scholarly understanding and experiences of digital cultural heritage. Several aspects are stress such as the dynamic aggregation of cross-media resources across existing institutional digital libraries and repositories. Research qu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014