Recent Developments in the Theor Y of Valency in the Light of the Prague Dependency Treebank*
نویسنده
چکیده
1 THE FRAMEWORK The Functional Generative Description (FGD, see Sgall, 1967, Sgall et al., 1986) was applied as a general framework for the development of the valency theory (see Panevová, 1974-75, 1980, 1994) as well as for the design of the Czech syntactically annotated corpus (PDT, see Hajič, 1998, Hajičová et al., 2001). Valency is understood as a lexico-syntactic attribute of a word – more precisely, of a particular lexical sense of the lemma, called here lexis (“lexie” in Czech terminology, see Filipec and Čermák, 1985). More precisely, we can understand a lexis as a pair formed by a lexical unit and one of its meanings.1 A valency frame (VF) is assigned to every autosemantic lexical unit (lexis). This, however, may be empty, e.g. with the Czech verb pršet [to rain], with nouns such as stůl [the table], adjectives as hezký [beautiful]. The labels used for the valency slots belong to the underlying structure (tectogrammatics) and, together with the lexical unit (lexis), they constitute a tectogrammatical representation of the lexical entry. With regard to the applied tasks, we include the morphemic counterparts of the particular valency slots as a part of the (complex) frame of the given unit. Valency is prototypically connected with verbs. We have distinguished two main classes of verbal complements: (i) inner participants, IP in the sequel (ACT(or), PAT(ient), ADDR(essee), ORIG(in) and EFF(ect)), (ii) free modifications, FM in the sequel.
منابع مشابه
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
متن کاملValency in the Prague Dependency Treebank: Building the Valency Lexicon
In this article we focus on valency, which belongs to the core phenomena being captured in the underlying level of the Prague Dependency Treebank (PDT). We present a summary of the basic principles of the applied theoretical framework including proposals for suitable refinement relevant to NLP. The current status of description of valency behavior of verbs, nouns and adjectives is outlined. We ...
متن کاملDistribution of Valency Complements in Czech Complex Predicates: Between Verb and Noun
In this paper, we focus on Czech complex predicates formed by a light verb and a predicative noun expressed as the direct object. Although Czech – as an inflectional language encoding syntactic relations via morphological cases – provides an excellent opportunity to study the distribution of valency complements in the syntactic structure with complex predicates, this distribution has not been d...
متن کاملArgument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study
متن کامل
Czech-English Bilingual Valency Lexicon Online
We describe CzEngVallex, a bilingual Czech–English valency lexicon which aligns verbal valency frames and their arguments. It is based on a parallel Czech-English corpus, the Prague Czech-English Dependency Treebank (PCEDT), where for each occurrence of a verb, a reference to the underlying Czech and English valency lexicons (PDT-Vallex and CzEngVallex, respectively) is recorded. The CzEngValle...
متن کامل