Predictability of Distributional Semantics in Derivational Word Formation
نویسندگان
چکیده
Compositional distributional semantic models (CDSMs) have successfully been applied to the task of predicting the meaning of a range of linguistic constructions. Their performance on semicompositional word formation process of (morphological) derivation, however, has been extremely variable, with no large-scale empirical investigation to date. This paper fills that gap, performing an analysis of CDSM predictions on a large dataset (over 30,000 German derivationally related word pairs). We use linear regression models to analyze CDSM performance and obtain insights into the linguistic factors that influence how predictable the distributional context of a derived word is going to be. We identify various such factors, notably part of speech, argument structure, and semantic regularity.
منابع مشابه
Derivational Smoothing for Syntactic Distributional Semantics
Syntax-based vector spaces are used widely in lexical semantics and are more versatile than word-based spaces (Baroni and Lenci, 2010). However, they are also sparse, with resulting reliability and coverage problems. We address this problem by derivational smoothing, which uses knowledge about derivationally related words (oldish→ old) to improve semantic similarity estimates. We develop a set ...
متن کاملOn the Role of Derivational Processes in the Formation of Non-Taxonomic Classes of Lexical Units in Russian
The paper is focused on classes of lexical units which arise as a result of derivational processes – word formation and semantic transfers, acting either in isolation or together, on the basis of common semantic foundations that bind targets and sources of derivation. The lexical items which constitute the classes under study vary in their denotative characteristics and due to their categ...
متن کاملTowards Semantic Validation of a Derivational Lexicon
Derivationally related lemmas like friendN – friendlyA – friendshipN are derived from a common stem. Frequently, their meanings are also systematically related. However, there are also many examples of derivationally related lemma pairs whose meanings differ substantially, e.g., objectN – objectiveN . Most broad-coverage derivational lexicons do not reflect this distinction, mixing up semantica...
متن کاملSyntactic category information and the semantics of derivational morphological rules
In standard generative approaches, word-formation rules contain, among other things, information on the semantics of the suffix and the syntactic category (or word-class) of possible bases. Based on the general assumption that word-class specification of the input is a crucial ingredient of derivational morphology, far-reaching claims have been made. For example, the unitary base hypothesis (Ar...
متن کاملAre doggies cuter than dogs? Emotional valence and concreteness in German derivational morphology
The semantic behavior of derivational processes has been investigated with compositional distributional models relating the meaning of base, affix, and derivative (e.g., anti+capitalist→ anticapitalist). While broadly successful, these approaches model how the distributional behavior generally is affected by derivation. Meanwhile, their predictions can not be interpreted at the level of linguis...
متن کامل