A Stylometric Inquiry into Hyperpartisan and Fake News
Authors
Abstract
This paper reports on a writing style analysis of hyperpartisan (i.e., extremely one-sided) news in connection to fake news. It presents a large corpus of 1,627 articles that were manually fact-checked by professional journalists from BuzzFeed. The articles originated from 9 well-known political publishers, 3 each from the mainstream, the hyperpartisan left-wing, and the hyperpartisan right-wing. In sum, the corpus contains 299 fake news articles, 97% of which originated from hyperpartisan publishers. We propose and demonstrate a new way of assessing style similarity between text categories via Unmasking (a meta-learning approach originally devised for authorship verification), revealing that the styles of left-wing and right-wing news have a lot more in common with each other than either has with the mainstream. Furthermore, we show that hyperpartisan news can be discriminated well by its style from the mainstream (F1 = 0.78), as can satire from both (F1 = 0.81). Unsurprisingly, style-based fake news detection is not up to scratch (F1 = 0.46). Nevertheless, the former results are important for implementing pre-screening for fake news detectors.
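The general idea behind Unmasking can be illustrated with a toy sketch: train a linear classifier to separate two text categories, record its accuracy, remove the features it relied on most, and repeat. A steep drop in accuracy suggests the two categories differ only in a few surface features, i.e. their styles are similar; a slow decline suggests deeper differences. The sketch below is not the paper's implementation. The function name `unmasking_curve`, the centroid-difference classifier (a stand-in for the linear classifier usually used), the even/odd holdout split (a stand-in for cross-validation), and the toy feature vectors are all assumptions made for illustration.

```python
def centroid(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def unmasking_curve(docs_a, docs_b, rounds=5, drop_per_round=2):
    """Return holdout accuracies while the strongest features are removed.

    docs_a, docs_b: lists of numeric feature vectors for the two categories.
    Hypothetical simplification: even-indexed documents train the model,
    odd-indexed documents test it.
    """
    active = list(range(len(docs_a[0])))  # indices of features still in use
    curve = []
    for _ in range(rounds):
        if not active:
            break
        train_a, test_a = docs_a[::2], docs_a[1::2]
        train_b, test_b = docs_b[::2], docs_b[1::2]
        ca = centroid([[d[i] for i in active] for d in train_a])
        cb = centroid([[d[i] for i in active] for d in train_b])
        # Stand-in linear model: weights are the centroid difference,
        # the decision boundary passes through the midpoint.
        w = [x - y for x, y in zip(ca, cb)]
        bias = -sum(wi * (x + y) / 2 for wi, x, y in zip(w, ca, cb))

        def score(doc):
            return sum(wi * doc[i] for wi, i in zip(w, active)) + bias

        correct = sum(score(d) > 0 for d in test_a)
        correct += sum(score(d) <= 0 for d in test_b)
        curve.append(correct / (len(test_a) + len(test_b)))
        # Remove the features with the largest absolute weights.
        ranked = sorted(range(len(active)), key=lambda k: abs(w[k]), reverse=True)
        dropped = set(ranked[:drop_per_round])
        active = [f for k, f in enumerate(active) if k not in dropped]
    return curve


# Toy data: the two categories differ only in the first two features.
toy_a = [[1.0, 1.0, 0.5, 0.5] for _ in range(4)]
toy_b = [[0.0, 0.0, 0.5, 0.5] for _ in range(4)]
print(unmasking_curve(toy_a, toy_b, rounds=2, drop_per_round=2))
```

On this toy data, accuracy falls to chance level once the two distinguishing features are removed, which is the kind of steep curve that indicates similar style.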
Similar resources
Stylometric features for emotion level classification in news related blogs
Breaking news and events are often posted in the blogosphere before they are published by any media agency. Therefore, the blogosphere is a valuable resource for news-related blog analysis. However, it is crucial to first sort out news-unrelated content like personal diaries or advertising blogs. Besides, there are different levels of emotionality or involvement which bias the news information t...
Fake news propagate differently from real news even at early stages of spreading
Social media can be a double-edged sword for modern communications, either a convenient channel for exchanging ideas or an unexpected conduit for circulating fake news through a large population. Existing studies of fake news focus their efforts on theoretical modelling of propagation or on identification methods based on black-box machine learning, neglecting the possibility of identifying fake news using o...
Automatic Detection of Fake News
The proliferation of misleading information in everyday access media outlets such as social media feeds, news blogs, and online newspapers has made it challenging to identify trustworthy news sources, thus increasing the need for computational tools able to provide insights into the reliability of online content. In this paper, we focus on the automatic identification of fake content in online...
Fake News Detection Through Multi-Perspective Speaker Profiles
Automatic fake news detection is an important, yet very challenging topic. Traditional methods using lexical features have had only very limited success. This paper proposes a novel method to incorporate speaker profiles into an attention-based LSTM model for fake news detection. Speaker profiles contribute to the model in two ways. One is to include them in the attention model. The other includes ...
Influence of fake news in Twitter during the 2016 US presidential election
We investigate the influence of fake and traditional, fact-based, news outlets on Twitter during the 2016 US presidential election. Using a comprehensive dataset of 171 million tweets covering the five months preceding election day, we identify 30 million tweets, sent by 2.2 million users, which are classified as spreading fake and extremely biased news, based on a list of news outlets curated ...
Journal title:
- CoRR
Volume: abs/1702.05638
Issue: -
Pages: -
Publication date: 2017