A Study on Dispersion Measures for Core Vocabulary Compilation
نویسندگان
چکیده
Core vocabulary is a set of words that are stable used across different text types, theme, and application scenario. In natural language, the number of core vocabulary is relatively small, the core vocabulary, however, plays an important part in language learning because it constitutes a major part of communication content. The traditional core vocabulary selection method is mainly based on the expert knowledge and rule of experience. With the rise of corpus linguistics, word frequency and dispersion uniformity provide objective statistical data to assist the selection of core vocabulary. In this paper, we propose a formula that integrates multi-dimensional uniformity , so that the estimation of word uniformity can take different classification dimensions into account. Secondly, we also propose a method of word frequency normalization for the problem of deviation of the traditional method. For evaluation, a method of evaluating the core vocabulary with a heterogeneous corpus is proposed and it can compare the advantages, disadvantages, and characteristics of various statistical formulas. In the results, we actually compare the different core vocabulary selection formulas, analyzed the characteristics of different formulas, and verified the word frequency normalization can correct the shortcomings of the traditional formula. Finally, we also verified that the proposed method which integrates multi-dimensional uniformity can pick out the vocabulary with more core characteristics. 關鍵詞:語料庫語言學、核心詞彙、邊緣詞彙、分布均勻度。
منابع مشابه
基於詞語分布均勻度的核心詞彙選擇之研究(A Study on Dispersion Measures for Core Vocabulary Compilation )[In Chinese]
متن کامل
کاربست واژگان پایه برای دانشآموزان با نیازهای ویژه
Background: Core vocabulary is one of the appealing subjects in special education. In this study, some researches related to core vocabulary is reviewed. Then, utilization of core vocabulary for students with special needs is elaborated. Method: The research is descriptive-analytic. Researches on Farsi indexed since 1973 in Scientific Information Database, The Comprehensive Portal of Human S...
متن کاملPsycholinguistic Ambiance of Short Stories in Enhancing Students’ Reading Comprehension and Vocabulary Power
Abstract The present study was carried out to investigate the effect of short stories on students’ reading comprehension, vocabulary power and attitude towards the skill and the new instructional materials. The participants of the study were 120 grade 9 students of Dilla Secondary and preparatory school. In order to gather data for the study, pre- and posttest of reading comprehension, pre and ...
متن کاملPsycholinguistic Ambiance of Short Stories in Enhancing Students’ Reading Comprehension and Vocabulary Power
Abstract The present study was carried out to investigate the effect of short stories on students’ reading comprehension, vocabulary power and attitude towards the skill and the new instructional materials. The participants of the study were 120 grade 9 students of Dilla Secondary and preparatory school. In order to gather data for the study, pre- and posttest of reading comprehension, pre and ...
متن کاملAttempts and outcomes of liquisolid technology: An updated chronological compilation of innovative ideas and adjuvants in the field
It has been observed that most of the chemical entities have high lipophilicity and poor aqueous solubility, which result in poor bioavailability. In order to improve the bioavailability, the release behavior of such drugs should be improved. Although there are numerous techniques to handle solubility related issue, but they are expensive due to involvement of complicated equipments, advanced m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCLCLP
دوره 21 شماره
صفحات -
تاریخ انتشار 2016