An Experimental Study Comparing Human with Corpus-derived Associations
نویسنده
چکیده
Collocations often comprise two components where one dominates the other in the sense that knowing the dominant one makes it almost inevitable to think of the other one as well. In contrast, knowing the non-dominant component does not necessarily evoke such a strong preference regarding the missing component. In the combination Pyrrhic victory, for example, the first component almost “demands” the second whereas the second does not suggest the first as strongly. Moreover, a range of other words that can go with victory come to mind, for example narrow, landslide or decisive. The topic of this thesis is to examine to what extent corpus-derived association measures are able to capture these asymmetric relations that can occur in collocations. To this end, four asymmetric measures that can express different strengths of association between two components of a collocation were defined. The new measures are based on well-established association measures and were applied to a large data set. In order to evaluate their performance against empirical data, an experiment with human subjects was carried out. The measures are shown to be capable of predicting the direction of association within collocations. Accuracy varies between 76% and 86% for the different measures.
منابع مشابه
Asymmetry in Corpus-Derived and Human Word Associations
We investigate asymmetry in corpus-derived and human word associations. Most prior work has studied paradigmatic relations, either derived from free association norms or from large corpora using measures of statistical association and semantic relatedness. By contrast, we investigate the syntagmatic relation between words in adjective-noun and noun-noun combinations and present a new experiment...
متن کاملThe Effect of Neurotoxicity of Mangane Chloride on Corpus Striatum of Mouse Embryos
Purpose: Manganese, are of the elements plenty & which is found in nature, is widely used in Agricalture and in dustry.For years, toxicity of magnasium and its derivatives has been proven yet the findings are not valid enough to overgeneralize the result to human.In the present study the effect of neurotoxicity of manganese chloride on the mouse cortex was investigated. Materials and Methods: ...
متن کاملCinnamaldehyde and eugenol change the expression folds of AKT1 and DKC1 genes and decrease the telomere length of human adipose-derived stem cells (hASCs): An experimental and in silico study
Objective(s): To investigate the effect of cinnamaldehyde and eugenol on the telomere-dependent senescence of stem cells. In addition, to search the probable targets of mentioned phytochemicals between human telomere interacting proteins (TIPs) using in silico studies. Materials and Methods: Human adipose derived stem cells (hASCs) were studied under treatments with 2.5 µM/ml cinnamaldehyde, 0....
متن کاملConcordance-Based Data-Driven Learning Activities and Learning English Phrasal Verbs in EFL Classrooms
In spite of the highly beneficial applications of corpus linguistics in language pedagogy, it has not found its way into mainstream EFL. The major reasons seem to be the teachers’ lack of training and the unavailability of resources, especially computers in language classes. Phrasal verbs have been shown to be a problematic area of learning English as a foreign language due to their semantic op...
متن کاملComparing predictions of lexical norm data obtained using word associations and word collocation
We compared the quality of prediction of word variables based on a Dutch word association and text corpus. We derived estimates for: valence, arousal, dominance, concreteness and age of acquisition (AoA) for 2831 words. Based on the similarity between words we: (1) used projections on a dimension identified as the variable in question in a multidimensional representation, (2) used the k-nearest...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007