Building and Evaluating an Annotated Corpus for Automated Recognition of Chat-Based Social Engineering Attacks
نویسندگان
چکیده
Chat-based Social Engineering (CSE) is widely recognized as a key factor to successful cyber-attacks, especially in small and medium-sized enterprise (SME) environments. Despite the interest preventing CSE attacks, few studies have considered specific features of language used by attackers. This work contributes area early-stage automated attack recognition proposing an approach for building annotating specific-purpose corpus presenting its application domain. The resulting then evaluated training bi-directional long short-term memory (bi-LSTM) neural network purpose named entity (NER). results this study emphasize importance adding plethora metadata dataset provide critical in-context produce that broadens our understanding tactics social engineers. outcomes can be applied dedicated cyber-defence mechanisms utilized protect SME employees using Electronic Medium Communication (EMC) software.
منابع مشابه
Building an annotated corpus for Amazighe
This paper gives an overview of the morpho-syntactic features of the Amazighe language and corpus encoding, afterwards we present our experience of constructing an annotated corpus with part-of-speech (POS) information. The annotated corpora consist of 20,667 Moroccan Amazighe tokens chosen from different materials; it is to our knowledge the first one dealing with Amazighe language. The experi...
متن کاملBuilding an Annotated Corpus for Text Summarization and Question Answering
We describe ongoing work in semi-automatic annotating corpus, with the goal to answer “why” question in question answering system and give a construction of the coherent tree for text summarization. In this paper we present annotation schemas for identifying the discourse relations that hold between the parts of text as well as the particular textual of span that are related via the discourse r...
متن کاملBuilding an Annotated Textual Inference Corpus for Motion and Space
This paper presents an approach for building a corpus for the domain of motion and spatial inference using a specific class of verbs. The approach creates a distribution of inference features that maximize the discriminatory power of a system trained on the corpus. The paper addresses the issue of using an existing textual inference system for generating the examples. This enables the corpus an...
متن کاملMalToBI – Building an Annotated Corpus of Spoken Maltese
Research on the phonetics and phonology of Maltese, and in particular on different aspects of its prosody, is, thus far, rather limited. This is in part due to the lack of structured resources for use in research. One resource which, to date, has been unavailable, is a corpus of spoken Maltese. Such a corpus, could, amongst other things, be used as a ready resource for the analysis of various a...
متن کاملWeb-Based Sources for an Annotated Corpus Building and Composite Proper Name Identification
Nowadays, collections of texts with annotations on several levels are useful resources. Huge efforts are required to develop this resource for languages like Spanish. In this work, we present the initial step, lexical level annotation, for the compilation of an annotated Mexican corpus using Web-based sources. We also describe a method based on heterogeneous knowledge and simple Web-based sourc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2021
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app112210871