Cross-Domain Topic Classification for Political Texts
نویسندگان
چکیده
Abstract We introduce and assess the use of supervised learning in cross-domain topic classification. In this approach, an algorithm learns to classify topics a labeled source corpus then extrapolates unlabeled target from another domain. The ability existing training data makes method significantly more efficient than within-domain learning. It also has three advantages over unsupervised models: can be specifically targeted research question resulting are easier validate interpret. demonstrate using case party platforms (source corpus) parliamentary speeches (target corpus). addition standard error metrics, we further performance by labeling subset target-corpus documents. find that classifier accurately assigns speeches, although accuracy varies substantially topic. propose tools diagnosing To illustrate usefulness method, present two studies on how electoral rules gender parliamentarians influence choice speech topics.
منابع مشابه
Cross-Lingual Classification of Topics in Political Texts
In this paper, we propose an approach for cross-lingual topical coding of sentences from electoral manifestos of political parties in different languages. To this end, we exploit continuous semantic text representations and induce a joint multilingual semantic vector spaces to enable supervised learning using manually-coded sentences across different languages. Our experimental results show tha...
متن کاملMultidimensional topic analysis in political texts
Automatic content analysis is more and more becoming an accepted research method in social science. In political science researchers are using party manifestos and transcripts of political speeches to analyze the positions of different actors. Existing approaches are limited to a single dimension, in particular, they cannot distinguish between the positions with respect to a specific topic. In ...
متن کاملA link-bridged topic model for cross-domain document classification
0306-4573/$ see front matter 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.ipm.2013.05.002 ⇑ Corresponding author at: Department of Computer Science, South China University of Technology, Guangzhou, China. Tel.: +852 39438461; f 26035505. E-mail addresses: [email protected] (P. Yang), [email protected] (W. Gao), [email protected] (Q. Tan), [email protected] (K.-F. Wong)...
متن کاملTopic Correlation Analysis for Cross-Domain Text Classification
Cross-domain text classification aims to automatically train a precise text classifier for a target domain by using labeled text data from a related source domain. To this end, the distribution gap between different domains has to be reduced. In previous works, a certain number of shared latent features (e.g., latent topics, principal components, etc.) are extracted to represent documents from ...
متن کاملTransductive Distributional Correspondence Indexing for Cross-Domain Topic Classification
Obtaining high-quality annotated data for training a classifier for a new domain is often costly. Domain Adaptation (DA) aims at leveraging the annotated data available from a different but related source domain in order to deploy a classification model for the target domain of interest, thus alleviating the aforementioned costs. To that aim, the learning model is typically given access to a se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Political Analysis
سال: 2021
ISSN: ['1047-1987', '1476-4989']
DOI: https://doi.org/10.1017/pan.2021.37