Self Embedded Relative Clauses in a Corpus of German Newspaper Texts

نویسندگان

  • Christian Korthals
  • Thorsten Brants
  • Reinhard Köhler
چکیده

The distribution of center self-embeddings and extrapositions in German is assumed to reflect a universal performance strategy of minimizing memory load during parsing. Self-embedded relative clauses of embedding depth 2 were semi-automatically analysed in a treebank of German newspaper texts. Clause length and especially extraposition distance are found as the main distinctive parameters between center embeddings and extrapositions.1

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information Density as a Factor for Variation in the Embedding of Relative Clauses

In German, relative clauses can be positioned in-situ or extraposed. A potential factor for the variation might be information density. In this study, this hypothesis is tested with a corpus of 17 th century German funeral sermons. For each referent in the relative clauses and their matrix clauses, the attention state was determined (first calculation). In a second calculation, for each word th...

متن کامل

A Self-Expanding Corpus Based on Newspapers on the Web

A Unix-based system is presented which automatic collects newspaper articles from the web, converts the texts, and includes these texts in a newspaper corpus. This corpus can be searched from a web-browser. The corpus is currently 70 millions words and increases by 4 millions words each month.

متن کامل

Word Order in German: A Corpus Study

This paper presents a corpus study of the order between subject and object in German main and embedded clauses. Since prior studies have shown that object-subject (OS) sentences are rare in comparison to subject-object (SO) sentences, for both main and embedded clauses two corpora were assembled: One corpus containing both SO and OS sentences, and a second corpus containing only OS sentences. I...

متن کامل

Recent Developments in DRK

This paper gives an overview of recent developments in the German Reference Corpus DRK in terms of growth, maximising relevant corpus strata, metadata, legal issues, and its current and future research interface. Due to the recent acquisition of new licenses, DRK has grown by a factor of four in the first half of 2014, mostly in the area of newspaper text, and presently contains over 24 b...

متن کامل

Processing relative clauses in Chinese.

This paper reports results from a self-paced reading study in Chinese that demonstrates that object-extracted relative clause structures are less complex than corresponding subject-extracted structures. These results contrast with results from processing other Subject-Verb-Object languages like English, in which object-extracted structures are more complex than subject-extracted structures. A k...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002