Exploring and categorising the Arabic copula and auxiliary<i>k?na</i>through enhanced part-of-speech tagging
نویسندگان
چکیده
Arabic syntax has yet to be studied in detail from a corpus-based perspective. The copula k?na (‘be’), functions also as an auxiliary, creating periphrastic tense–aspect constructions; but the literature on these is far exhaustive. To analyse within one-million word Corpus of Contemporary Arabic, part-of-speech tagging (using novel, targeted enhancements previously described program which improves accessibility for linguistic analysis output Habash et al.’s [2012] mada disambiguator Buckwalter morphological analyser) applied disambiguate and auxiliary at high rate accuracy. Concordances both are extracted, 10 percent samples (499 instances 387 k?na) analysed manually identify surface-level grammatical patterns meanings. This raw then systematised according more general patterns’ main parameters variation; special descriptions developed specific, apparently fixed-form expressions (including two phraseologies afford expression verbal adjectival modality). Overall, we uncover substantial new detail, not mentioned existing grammars (e.g., quantitative predominance past imperfect construction over other uses k?na). There exists notable potential findings inform enhance only pedagogy first or second/foreign language.
منابع مشابه
Arabic Part of Speech Tagging
Arabic is a morphologically rich language, which presents a challenge for part of speech tagging. In this paper, we compare two novel methods for POS tagging of Arabic without the use of gold standard word segmentation but with the full POS tagset of the Penn Arabic Treebank. The first approach uses complex tags that describe full words and does not require any word segmentation. The second app...
متن کاملthe analysis of the role of the speech acts theory in translating and dubbing hollywood films
از محوری ترین اثراتی که یک فیلم سینمایی ایجاد می کند دیالوگ هایی است که هنرپیش گان فیلم میگویند. به زعم یک فیلم ساز, یک شیوه متأثر نمودن مخاطب از اثر منظوره نیروی گفتارهای گوینده, مثل نیروی عاطفی, ترس آور, غم انگیز, هیجان انگیز و غیره, است. این مطالعه به بررسی این مسأله مبادرت کرده است که آیا نیروی فراگفتاری هنرپیش گان به مثابه ی اعمال گفتاری در پنج فیلم هالیوودی در نسخه های دوبله شده باز تولید...
15 صفحه اولthe effects of speech rate,prosodic features, and blurred speech on iranian efl learners listening comprehension
کلید واژه ها به زبان انگلیسی: effect of speech rate on listening comprehension, blurred speech,segmental and suprasegmental features,authentic speech,intelligibility, discrimination, omission, assimilation چکیده: سرعت مطالب شنیداری در کلام پیوسته بطور کلی همواره کابوسی بوده برای یادگیرنده های زبان دوم و بالاخص برای شنوندگان ایرانی. علی رغم عقل سلیم که کلام با سرعت کندتری فعالیتهای درک مطلب شن...
15 صفحه اولMorphological Segmentation and Part of Speech Tagging for Religious Arabic
We annotate a small corpus of religious Arabic with morphological segmentation boundaries and fine-grained segment-based part of speech tags. Experiments on both segmentation and POS tagging show that the religious corpus-trained segmenter and POS tagger outperform the Arabic Treebak-trained ones although the latter is 21 times as big , which shows the need for building religious Arabic linguis...
متن کاملJoint Arabic Segmentation and Part-Of-Speech Tagging
Arabic has a very complex morphological system, though a very structured one. Character patterns are often indicative of word class and word segmentation. In this paper, we e xplore a novel approach to Arabic word segmentation and part-of-speech tagging relying on character information. The approach is lexicon-free and does not require any morphological analysis, eliminat ing the factor of dict...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Corpora
سال: 2021
ISSN: ['1755-1676', '1749-5032']
DOI: https://doi.org/10.3366/cor.2021.0225