Three-Way Decisions Solution to Filter Spam Email: An Empirical Study
نویسندگان
چکیده
A three-way decisions solution based on Bayesian decision theory for filtering spam emails is examined in this paper. Compared to existed filtering systems, the spam filtering is no longer viewed as a binary classification problem. Each incoming email is accepted as a legitimate or rejected as a spam or undecided as a further-exam email by considering the misclassification cost. The three-way decisions solution for spam filtering can reduce the error rate of classifying a legitimate email to spam, and provide a more meaningful decision procedure for users. The solution is not restricted to a specific classifier. Experimental results on several corpus show that the three-way decisions solution can get a better total cost ratio value and a lower weighted error.
منابع مشابه
2 Misleading Learners: Co-opting Your Spam Filter
Using statistical machine learning for making security decisions introduces new vulnerabilities in large scale systems. We show how an adversary can exploit statistical machine learning, as used in the SpamBayes spam filter, to render it useless—even if the adversary’s access is limited to only 1% of the spam training messages. We demonstrate three new attacks that successfully make the filter ...
متن کاملAn E-mail Server-based Spam Filtering Approach
The spam has now become a significant security issue and a massive drain on financial resources. In this paper, a spam filter is introduced, which works at the server side. The proposed filter is a combination of antispam techniques. The integrated solution create a spam filtering system which is more robust and effective than each of the comprising techniques. The task of proposed filter is to...
متن کاملA Survey on Various Classifiers Detecting Gratuitous Email Spamming
Email becomes the major source of communication these days. Most humans on the earth use email for their personal or professional use. Email is an effective, faster and cheaper way of communication. The importance and usage for the email is growing day by day. It provides a way to easily transfer information globally with the help of internet. Due to it the email spamming is increasing day by d...
متن کاملA Memory-Based Approach to Anti-Spam Filtering
This paper presents an extensive empirical evaluation of memory-based learning in the context of anti-spam filtering, a novel cost-sensitive application of text categorization. Unsolicited commercial e-mail, also known as “spam”, floods the mailboxes of users, causing frustration, wasting bandwidth and money, and exposing minors to unsuitable content. Using a recently introduced publicly availa...
متن کاملA New Approach to Spam Mail Detection
The ever increasing menace of spam is bringing down productivity. More than 70% of the email messages are spam, and it has become a challenge to separate such messages from the legitimate ones. I have developed a spam identification engine which employs naive Bayesian classifier to identify spam. A new concept-based mining model that analyzes terms on the sentence, document is introduced. . The...
متن کامل