Email phishing: text classification using natural language processing
نویسندگان
چکیده
منابع مشابه
Multi Stage Phishing Email Classification
Phishing emails risk increases progressively, which pose a real threat to users of computers, organizations and lead to significant financial losses. Fighting zero day phishing emails using content based server side classifiers is considered as the best method to detect such attacks. This technique which is based on machine learning algorithms is trained by the set of phishing email features an...
متن کاملText Classification using Language-independent Pre-processing
A number of language-independent text pre-processing techniques, to support multi-class single-label text classification, are described and compared. A simple but effective statistical keyword identification approach is proposed, coupled with a number of phrase identification mechanisms. Experimental results are presented.
متن کاملImproving Email Response in an Email Management System Using Natural Language Processing Based Probabilistic Methods
Article history Received: 06-03-2014 Revised: 10-03-2014 Accepted: 04-08-2014 Abstract: Email based communication over the course of globalization in recent years has transformed into an all-encompassing form of interaction and requires automatic processes to control email correspondence in an environment of increasing email database. Relevance characteristics defining class of email in general...
متن کاملFeature Extraction or Feature Selection for Text Classification: A Case Study on Phishing Email Detection
Dimensionality reduction is generally performed when high dimensional data like text are classified. This can be done either by using feature extraction techniques or by using feature selection techniques. This paper analyses which dimension reduction technique is better for classifying text data like emails. Email classification is difficult due to its high dimensional sparse features that aff...
متن کاملWeb Text Corpus for Natural Language Processing
Web text has been successfully used as training data for many NLP applications. While most previous work accesses web text through search engine hit counts, we created a Web Corpus by downloading web pages to create a topic-diverse collection of 10 billion words of English. We show that for context-sensitive spelling correction the Web Corpus results are better than using a search engine. For t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Science and Information Technologies
سال: 2020
ISSN: 2722-3221,2722-323X
DOI: 10.11591/csit.v1i1.p1-12