Creating Tools for Morphological Analysis of Sumerian
Abstract
Sumerian is a long-extinct language documented throughout the ancient Middle East. It is arguably the first language for which we have written evidence, and it is a language isolate (i.e. no related languages have so far been identified). The Electronic Text Corpus of Sumerian Literature (ETCSL), based at the University of Oxford, aims to make accessible on the web over 350 literary works composed during the late third and early second millennia BCE. The transliterations and translations can be searched, browsed and read online using the tools of the website. In this paper we describe the creation of linguistic analysis and corpus search tools for Sumerian as part of the development of the ETCSL. These tools are designed to enable Sumerian scholars, students and interested laymen to analyse the texts online, and to further knowledge about the language.
Similar resources
Study of genetic diversities and relatedness of Iranian citrus genotypes using morphological and molecular markers
Knowledge of the genetic relationships among accessions is necessary for developing breeding strategies to produce improved cultivars. In the present study, genetic diversity and inter-relationships among 29 genotypes of citrus were comparatively analyzed using morphological and RAPD markers. Significant variability was observed among citrus genotypes for 61 quantitative and qualitative morpho...
Enhancing Sumerian Lemmatization by Unsupervised Named-Entity Recognition
Lemmatization for Sumerian is much more challenging than for modern languages: it is a long-dead language, highly skilled language experts are extremely scarce, and more and more Sumerian texts are being published. This paper describes how our unsupervised Sumerian named-entity recognition (NER) system helps to improve the lemmatization of the Cuneiform Digital Librar...
Unsupervised Sumerian Personal Name Recognition
This paper describes an unsupervised named-entity recognition (NER) system to identify personal names in Sumerian cuneiform documents from the Ur III period. We are motivated by the needs of social and economic historians of that period to identify specific persons of importance and such historically relevant facts as can be discerned by the surviving texts. The work was confronted by the chall...
Contents Modelling of Neo-Sumerian Ur III Economic Text Corpus
This paper describes a system for processing economic documents written in the ancient Sumerian language. The system is application-oriented and takes advantage of the simplicity of ancient economy. We have developed an ontology for a selected branch of economic activities. We translate the documents into a meaning representation language by means of a semantic grammar. The meaning representati...
The Cosmological Hoe
The hoe—the sound of the word is sweet...the hoe makes everything prosper, the hoe makes everything flourish. The hoe is good barley...the hoe is brick moulds, the hoe has made people exist. It is the hoe that is the strength of young manhood. The hoe and the basket are the tools for building cities. It builds the right kind of house, it cultivates the right kind of fields. It is you, hoe, that...