Operation Heron: latent topic changes in an abusive letter series
نویسندگان
چکیده
The paper presents a two-part forensic linguistic analysis of an historic collection abuse letters, sent to individuals in the public eye and individuals’ private homes between 2007 2009. We employ technique structural topic modelling (stm) identify distinctions core topics gauging value this relatively under-used methodology linguistics. Four key were identified ‘Politics A’ ‘B’, ‘Healthcare’ ‘Immigration’, their coherence, correlation shifts evaluated. Following stm, qualitative corpus was undertaken, coding concordance lines according topic, with reliability coders tested. This demonstrated that various connected statements within same tend gain or lose prevalence over time, ultimately confirmed consistency content four through stm throughout letter series. discussion conclusions reflect on findings also consider utility these methodologies for linguistics particular. study demonstrates real revisiting dataset such as test develop field.
منابع مشابه
Detecting speaker roles and topic changes in multiparty conversations using latent topic models
Accessing and browsing archives of multiparty conversations can be significantly facilitated by labeling them in terms of high level information. In this paper, we investigate automatic labeling of speaker roles and topic changes in professional meetings. Using the framework of unsupervised topic modeling we express speaker utterances as mixture of latent variables, each of which is governed by...
متن کاملLatent Topic Embedding
Topic modeling and word embedding are two important techniques for deriving latent semantics from data. General-purpose topic models typically work in coarse granularity by capturing word co-occurrence at the document/sentence level. In contrast, word embedding models usually work in fine granularity by modeling word co-occurrence within small sliding windows. With the aim of deriving latent se...
متن کاملLatent Topic Conversational Models
Despite much success in many large-scale language tasks, sequence-to-sequence (seq2seq) models have not been an ideal choice for conversational modeling as they tend to generate generic and repetitive responses. In this paper, we propose a Latent Topic Conversational Model (LTCM) that augments the seq2seq model with a neural topic component to better model human-human conversations. The neural ...
متن کاملCross-Lingual Latent Topic Extraction
Probabilistic latent topic models have recently enjoyed much success in extracting and analyzing latent topics in text in an unsupervised way. One common deficiency of existing topic models, though, is that they would not work well for extracting cross-lingual latent topics simply because words in different languages generally do not co-occur with each other. In this paper, we propose a way to ...
متن کاملBayesian latent topic clustering model
Document modeling is important for document retrieval and categorization. The probabilistic latent semantic analysis (PLSA) and latent Dirichlet allocation (LDA) are popular paradigms of document models where word/document correlations are inferred by latent topics. In PLSA and LDA, the unseen words and documents are not explicitly represented at the same time. Model generalization is constrain...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Corpora
سال: 2022
ISSN: ['1755-1676', '1749-5032']
DOI: https://doi.org/10.3366/cor.2022.0255