Annotation and Classification of Light Verbs and Light Verb Variations in Mandarin Chinese

نویسندگان

  • Jingxia Lin
  • Hongzhi Xu
  • Menghan Jiang
  • Chu-Ren Huang
چکیده

Light verbs pose an a challenge in linguistics because of its syntactic and semantic versatility and its unique distribution different from regular verbs with higher semantic content and selectional resrictions. Due to its light grammatical content, earlier natural language processing studies typically put light verbs in a stop word list and ignore them. Recently, however, classification and identification of light verbs and light verb construction have become a focus of study in computational linguistics, especially in the context of multi-word expression, information retrieval, disambiguation, and parsing. Past linguistic and computational studies on light verbs had very different foci. Linguistic studies tend to focus on the status of light verbs and its various selectional constraints. While NLP studies have focused on light verbs in the context of either a multi-word expression (MWE) or a construction to be identified, classified, or translated, trying to overcome the apparent poverty of semantic content of light verbs. There has been nearly no work attempting to bridge these two lines of research. This paper takes this challenge by proposing a corpus-bases study which classifies and captures syntactic-semantic difference among all light verbs. In this study, we first incorporate results from past linguistic studies to create annotated light verb corpora with syntactic-semantics features. We next adopt a statistic method for automatic identification of light verbs based on this annotated corpora. Our results show that a language resource based methodology optimally incorporating linguistic information can resolve challenges posed by light verbs in NLP.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Transitivity in Light Verb Variations in Mandarin Chinese - A Comparable Corpus-based Statistical Approach

This paper adopts a comparable corpus-based approach to light verb variations in two varieties of Mandarin Chinese and proposes a transitivity (Hopper and Thompson 1980) based theoretical account. Light verbs are highly grammaticalized and lack strong collocation restrictions; hence it has been a challenge to empirical accounts. It is even more challenging to consider their variations between d...

متن کامل

The Ordering of Mandarin Chinese Light Verbs

Two or more are sometimes found to co-occur in both Taiwan and Mainland Mandarin Chinese, e.g., jiāyǐ and jìnxíng in duì xuéshēng jiāyǐ jìnxíng yǐndǎo ‘to guide students’, and a particular ordering is often preferred, e.g., jiāyǐ jìnxíng over jìnxíng jiāyǐ. This study argues that the order of the light verbs is closely associated to two kinds of information that the verbs specify, i.e. aspectua...

متن کامل

Corpus-based Study and Identification of Mandarin Chinese Light Verb Variations

When PRC was founded on mainland China and the KMT retreated to Taiwan in 1949, the relation between mainland China and Taiwan became a classical Cold War instance. Neither travel, visit, nor correspondences were allowed between the people until 1987, when government on both sides started to allow small number of Taiwan people with relatives in China to return to visit through a third location....

متن کامل

On Delexicalization Features of Light Verbs in Mandarin

Delexicalization is the tendency of meaning, reflecting the structural feature of light verbs. It emerges from collocation usages of semantic sharing. Based on the corpus and exemplified with “ ” and “ ”, the current paper addresses the delexicalizing features of Chinese light verbs, while proposing a lexicographic approach concerning the conventionality of light verb construction and endeavori...

متن کامل

L1 Transfer in L2 Acquisition of the There-Insertion Construction by Mandarin EFL Learners

This study examined the role of the native language (L1) transfer in a non-native language (L2) acquisition of the there-insertion construction at the syntax-semantics interface. Specifically, the study investigated if Mandarin EFL learners would make overgeneralization errors in the situation where an L1 argument structure constitutes a superset of its L2 counterpart. Verbs of existence and ap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014