Comparing the Hierarchy of Keywords in On-Line News Portals
نویسندگان
چکیده
Hierarchical organization is prevalent in networks representing a wide range of systems in nature and society. An important example is given by the tag hierarchies extracted from large on-line data repositories such as scientific publication archives, file sharing portals, blogs, on-line news portals, etc. The tagging of the stored objects with informative keywords in such repositories has become very common, and in most cases the tags on a given item are free words chosen by the authors independently. Therefore, the relations among keywords appearing in an on-line data repository are unknown in general. However, in most cases the topics and concepts described by these keywords are forming a latent hierarchy, with the more general topics and categories at the top, and more specialized ones at the bottom. There are several algorithms available for deducing this hierarchy from the statistical features of the keywords. In the present work we apply a recent, co-occurrence-based tag hierarchy extraction method to sets of keywords obtained from four different on-line news portals. The resulting hierarchies show substantial differences not just in the topics rendered as important (being at the top of the hierarchy) or of less interest (categorized low in the hierarchy), but also in the underlying network structure. This reveals discrepancies between the plausible keyword association frameworks in the studied news portals.
منابع مشابه
The Essential Components of a Comprehensive Nutrition and Dietetic Portal
Education and information delivery is an integral part of an effective strategy to control the prevalence of obesity and diabetes. Nutrition and dietetic portals provide integrated access to a wide range of information resources for nutritionists, physicians and the public due. The nutrition and dietetic portals in selected countries was reviewed to gain insight into the essential elements of s...
متن کاملArabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents
Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...
متن کاملLexicon Analysis Based Automatic News Classification Approach – A Review
The news classification approach is the primary approach for the online news portals with the news data sourced from the various portals. The various types of data is received and accepted over the news classification portals. The lexicon analysis plays the key role in the categorization of the news automatically using the automatic news category recognition by analyzing the keyword data extrac...
متن کاملIdentifying the technical requirements for designing health portals
Aim: Considering technical requirements in the design of health portals increases the validity of information. This study identified the technical and content structure required to create these portals. Methods: This was a qualitative study which was conducted in 2020. A combination of comprehensive review and interview was used. The search was performed in Elsevier, EBSCO, Scopus, Web of Scie...
متن کاملTeaching How to Break Bad News: Comparing Role-Play and Group Discussion on Practice of Medical Interns in Jahrom Medical School
Introduction: The Main challenge in training about breaking bad news is selection of appropriate educational method. This study was performed to assess the results of role-playing method versus group discussion in training about this skill. Methods: This was an interventional double blind study, performed in 2009-2010 in Jahrom University of Medical Sciences. 30 medical students were involved ...
متن کامل