Search results for: corpora

Number of results: 19685

Journal: Revista Signos 2021

The present study looks into the variations in the frequencies and pragmatic functions of metadiscourse markers known as boosters, in particular, with regard to their verb forms. Three corpora have been compiled to this end, covering the fields of engineering, medicine and linguistics. They were manually annotated for markers, boosters included, by a group of annotators. A predetermined list was used for annotation, but throug...

Journal: Discours. Revue de linguistique, psycholinguistique et informatique. A journal of linguistics, psycholinguistics and computational linguistics 2022

This paper is about two competing marking strategies for contrastive topics in a topic shift context in French: emphatic pronouns and adverbs. We analyse the choice between these markers in one specific syntactic position, i.e. when they occur between the subject and the finite verb, in different corpora consisting of formal written, informal written and spoken French. Our results indicate that the frequency of adverbs is register-depen...

Journal: Asian journal of social sciences and legal studies 2023

Corpora have been regarded as useful sources for English language teaching. As various studies are devoted to the use of corpora in teaching grammar and vocabulary, more research is needed to ascertain how learners could benefit from corpus-based data in improving their writing skills, reflective efficiency, in particular. In the present study, conducted at one of the leading international universities in Uzbekistan, ...

2013
Marek Grác

The extended usage of written corpora not only for manual querying but also for machine learning led to the creation of massive corpora. These corpora are almost solely crawled from the internet and contain texts of various quality. Corpora that contain more typos or ungrammatical texts are more difficult to use for computational linguists and are thus a major obstacle in automatic development....

2013
Mona Diab Nizar Habash Owen Rambow Ryan Roth

The Linguistic Data Consortium (LDC) has developed hundreds of data corpora for natural language processing (NLP) research. Among these are a number of annotated treebank corpora for Arabic. Typically, these corpora consist of a single collection of annotated documents. NLP research, however, usually requires multiple data sets for the purposes of training models, developing techniques, and fin...

2012
Hongsuck Seo Jonghoon Lee Seokhwan Kim Kyusong Lee Sechun Kang Gary Geunbae Lee

We introduce a novel method for grammatical error correction with a number of small corpora. To make the best use of several corpora with different characteristics, we employ meta-learning with several base classifiers trained on different corpora. This research focuses on a grammatical error correction task for article errors. A series of experiments is presented to show the effectiveness of...

2010
Lilian Lee

This paper presents a newly designed set of comparable corpora of book reviews consisting of two parts, Russian and English, representing two very different languages. The corpora are comparable in terms of domain, style and size. This set of corpora may be of use for cross-lingual experiments in document-level sentiment classification. We also present a brief description of the language- and domain-specif...

Journal: Language Resources and Evaluation 2011
Wilson Wong Wei Liu Mohammed Bennamoun

The role of the Web for text corpus construction is becoming increasingly significant. However, the contribution of the Web is largely confined to building a general virtual corpus or low quality specialised corpora. In this paper, we introduce a new technique called SPARTAN for constructing specialised corpora from the Web by systematically analysing website contents. Our evaluations show that...

2011
Stanley E. Trauth

Macroscopic and histological analyses of the ovarian cycle of Crotaphytus collaris were conducted during 1971 and 1972. The reproductive season extends from April to late June; examination of ovarian histosections revealed two distinct sets of corpora lutea, indicating that two clutches of eggs are produced per season. The average clutch size was 6.4. Corpora lutea regress into permanent ovari...

2014
Pavel A. Skrelin Nina B. Volskaya Karina Evgrafova Riikka Ullakonoja

In this paper we propose to exploit existing corpora of well-resourced languages as a basis for developing similar corpora of under-resourced ones. The construction of this type of corpora will allow finding common patterns of acoustic manifestation of similar functional states regardless of the language. The analysis of these corpora will also allow investigating universal and language-specific ...
