Self Embedded Relative Clauses in a Corpus of German Newspaper Texts
Abstract
The distribution of center self-embeddings and extrapositions in German is assumed to reflect a universal performance strategy of minimizing memory load during parsing. Self-embedded relative clauses of embedding depth 2 were analysed semi-automatically in a treebank of German newspaper texts. Clause length and, especially, extraposition distance emerge as the main parameters distinguishing center embeddings from extrapositions.
Similar resources
Information Density as a Factor for Variation in the Embedding of Relative Clauses
In German, relative clauses can be positioned in situ or extraposed. A potential factor for the variation might be information density. In this study, this hypothesis is tested with a corpus of 17th-century German funeral sermons. For each referent in the relative clauses and their matrix clauses, the attention state was determined (first calculation). In a second calculation, for each word th...
A Self-Expanding Corpus Based on Newspapers on the Web
A Unix-based system is presented which automatically collects newspaper articles from the web, converts the texts, and includes them in a newspaper corpus. This corpus can be searched from a web browser. The corpus currently contains 70 million words and grows by 4 million words each month.
Word Order in German: A Corpus Study
This paper presents a corpus study of the order between subject and object in German main and embedded clauses. Since prior studies have shown that object-subject (OS) sentences are rare in comparison to subject-object (SO) sentences, for both main and embedded clauses two corpora were assembled: One corpus containing both SO and OS sentences, and a second corpus containing only OS sentences. I...
Recent Developments in DRK
This paper gives an overview of recent developments in the German Reference Corpus DRK in terms of growth, maximising relevant corpus strata, metadata, legal issues, and its current and future research interface. Due to the recent acquisition of new licenses, DRK has grown by a factor of four in the first half of 2014, mostly in the area of newspaper text, and presently contains over 24 b...
Processing relative clauses in Chinese.
This paper reports results from a self-paced reading study in Chinese that demonstrates that object-extracted relative clause structures are less complex than corresponding subject-extracted structures. These results contrast with results from processing other Subject-Verb-Object languages like English, in which object-extracted structures are more complex than subject-extracted structures. A k...