The Zulu locative prefix ku- revisited: a corpus-based approach
نویسنده
چکیده
This article re-examines the distribution of the class 17 locative prefix kuand its variants kwiand koin the locativisation of nouns in Zulu. To this end an electronic corpus of 5 million running Zulu words — the University of Pretoria Zulu Corpus (PZC) — is queried. We indicate how PZC can be used to highlight previously under-emphasised and overlooked aspects of a seemingly well-documented language feature such as the class 17 locative prefix ku-. Analysing the output proffered by PZC — an organic corpus that has been organised chronologically, and that consists of a number of sub-corpora stratified according to genre — enables us to reach conclusions regarding not only the frequencies with which the variants ku-, kwiand koof the class 17 locative prefix are used, but also regarding the possible changes in use that these prefixes have undergone with time. We also show how these prefixes relate to each other in the different sub-corpora.
منابع مشابه
Containing overgeneration in Zulu computational morphology1
The development of a large-coverage, computational morphological analyser for Zulu requires the modelling not only of the regular phenomena often associated with word formation, but also the idiosyncratic behaviour that may occur in Zulu morphology. This paper discusses the application of an existing rule-based, finite-state morphological analyser prototype ZulMorph in semi-automating the minin...
متن کاملIntelligent Systems Group University of Bristol Additional material for the Ukwabelana Zulu corpus
In this document we describe the scheme used for labelling the open-source Ukwabelana Zulu corpus as well as the rules employed for the Part-of-speech (POS) tagger used to assign POS to morphologically analysed words. A detailed description of the Zulu morphology, the corpus itself and its generation is given in [2]. All resources can be downloaded from http://www.cs.bris.ac.uk/Research/Machine...
متن کاملThe verbal prefix na– in Croatian: A cognitive linguistic analysis
and na– encodes reaching an abstract goal, which is related to the meaning of the preposition na that regularly combines with abstract landmarks in directional and locative contexts. Examples (4a–b) illustrate frequent usage types of the construction nado}i na + ACC: physical motion onto stands for mental activities, in which the verb implies ’find’, ’remember’: (4) (a) Stoga je FIA nado{la na ...
متن کاملUkwabelana - An open-source morphological Zulu corpus
Zulu is an indigenous language of South Africa, and one of the eleven official languages of that country. It is spoken by about 11 million speakers. Although it is similar in size to some Western languages, e.g. Swedish, it is considerably under-resourced. This paper presents a new open-source morphological corpus for Zulu named Ukwabelana corpus. We describe the agglutinating morphology of Zul...
متن کاملExperimental Bootstrapping of Morphological Analysers for Nguni Languages
This paper addresses the experimental bootstrapping of the development of broad-coverage finite-state morphological analysers for Xhosa, Swati and (Southern) Ndebele by using an existing prototype of a morphological analyser for Zulu. These languages are both morphologically complex and resource-scarce. The research question is whether bootstrapping is feasible across the language boundaries be...
متن کامل