Topics (Automated Content Analysis)
نویسندگان
چکیده
Topics describe the main issue discussed in an article, for example: Does article deal with politics, economics or sports? Field of application/theoretical foundation: In context “Agenda Setting”, studies analyze which issues are on public agenda. “News Values”, may why some topics more prominently covered than others. References/combination other methods data collection: Many combine manual inspection their automated detection. Quinn et al. (2010) demonstrate analyses legislative speeches how increase validity results. Similarly, Hase (2020) use content analysis to find and map similar coding is then conducted. Such combinations contribute a better detailed understanding by themselves. The datasets referred table described following paragraph: Puschmann (2019a) uses New York Times articles (1996-2006, N = 30,862) as well from Die Zeit (2011-2016, 377) identify using supervised machine learning. another tutorial, (2019b) Sherlock Holmes stories (18th century, 12), debate transcripts (1970-2017, 7,897) apply LDA structural topic modeling. her tutorials, Silge (2018a, 2018b) also 12) news corpus containing comments (2006-ongoing, 100,000). Robinson modeling Associated Press (1992, 2,246) books Dickens, Wells, Verne Austen 4). Roberts (2019) blogposts (2008, 13,248) Watanabe Müller newspaper Guardian (2016, 6,000). Van Atteveldt Welbers (2019, 2020) State Union (1981-2017, 10 1789-2017, 58) analyses. Lastly, Wiedemann Niekler (2017) same (1790-2017, 223). Table 1. Measurement “Topics” analysis. Author(s) Sample Procedure Formal check benchmark* Code (a) Newspaper (b) Supervised learning Reported http://inhaltsanalyse-mit-r.de/maschinelles_lernen.html (c) United Nations General Debate Transcripts modeling; Not reported http://inhaltsanalyse-mit-r.de/themenmodelle.html (2018a) & (2018b) News t Structural https://juliasilge.com/blog/sherlock-holmes-stm/ https://juliasilge.com/blog/evaluating-stm/ Books https://www.tidytextmining.com/topicmodeling.html Blogposts https://www.jstatsoft.org/article/view/v091i02 https://tutorials.quanteda.io/machine-learning/topicmodel/ van https://github.com/ccs-amsterdam/r-course-material/blob/master/tutorials/r_text_stm.md https://github.com/ccs-amsterdam/r-course-material/blob/master/tutorials/r_text_lda.md https://tm4ss.github.io/docs/Tutorial_6_Topic_Models.html https://tm4ss.github.io/docs/Tutorial_7_Klassifikation.html *Please note that many sources listed here tutorials conducted – therefore not focused validation Readers should simply read this column indication terms they can refer if interested References Hase, V., Engelke, K., Kieslich, K. (2020). things we fear. Combining uncover themes, threats fear-related news. Journalism Studies, 21(10), 1384-1402. Puschmann, C. (2019). Automatisierte Inhaltsanalyse mit R. Retrieved http://inhaltsanalyse-mit-r.de/index.html Quinn, M., Monroe, B. L., Colaresi, Crespin, M. H., Radev, D. (2010). How political attention minimal assumptions costs. American Journal Political Science, 54(1), 209–228. Roberts, E., Stewart, Tingley, stm: An R Package Topic Model. Statistical Software, 91(2), 1–40. Silge, J. (2018a). game afoot! stories. (2018b). Training, evaluating, interpreting models. J., Robinson, Text Mining A tidy approach. https://www.tidytextmining.com/ Atteveldt, W., Welbers, Modeling. Fitting models Watanabe, Müller, S. Quanteda tutorials. https://tutorials.quanteda.io/ Wiedemann, G., Niekler, A. (2017). Hands-on: five day text mining course humanists social scientists Proceedings 1st Workshop Teaching NLP Digital Humanities (Teach4DH@GSCL 2017), Berlin. https://tm4ss.github.io/docs/index.html
منابع مشابه
Examining the Automated Inference of Tweet Topics
The increasing volume of information exchange over online social networks (e.g. Twitter, Facebook) has led to the growing interest in technique for automated inference of the topic of individual posts/tweets in recent years. Short length, lack of a well defined set of topics, and use of acronyms in tweets are some of the reasons that make topic inference of tweets challenging. In this study, we...
متن کاملAutomated image analysis for high-content screening and analysis.
The field of high-content screening and analysis consists of a set of methodologies for automated discovery in cell biology and drug development using large amounts of image data. In most cases, imaging is carried out by automated microscopes, often assisted by automated liquid handling and cell culture. Image processing, computer vision, and machine learning are used to automatically process h...
متن کاملContent Analysis of Media Coverage of Childhood Obesity Topics in UAE Newspapers and Popular Social Media Platforms, 2014-2017
The 2017 prevalence of obesity among children (age 5–17 years) in the United Arab Emirates (UAE) is 13.68%. Childhood obesity is one of the 10 top health priorities in the UAE. This study examines the quality, frequency, sources, scope and framing of childhood obesity in popular social media and three leading UAE newspapers from 2014 to 2017. During the review period, 152 newspaper articles fro...
متن کاملContent-Based Image Retrieval: Advanced Topics
Image collections are growing at a rapid rate, motivating the need for efficient and effective tools to search through these databases. Content-based image retrieval (CBIR) techniques extract features directly from image data and use these to query image collections. In this paper, we focus on some more advanced issues in CBIR, namely the extraction of image features from compressed images, the...
متن کاملA Method of Automated Nonparametric Content Analysis for Social Science
The increasing availability of digitized text presents enormous opportunities for social scientists. Yet hand coding many blogs, speeches, government records, newspapers, or other sources of unstructured text is infeasible. Although computer scientists have methods for automated content analysis, most are optimized to classify individual documents, whereas social scientists instead want general...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: DOCA
سال: 2021
ISSN: ['2673-8597']
DOI: https://doi.org/10.34778/1e