PCA document reconstruction for email classification
نویسندگان
چکیده
منابع مشابه
A New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملQuestion Classification for Email
Question classifiers are used within Question Answering to predict the expected answer type for a given question. This paper describes the first steps towards applying a similar methodology to identifying question classes in dialogue contexts, beginning with a study of questions drawn from the Enron email corpus. Human-annotated data is used as a gold standard for assessing the output from an e...
متن کاملAutomatic email classification
The endlessly increasing volume of unsolicited emails (a.k.a. spam) has become more and more of a concern. Its hassles range from a daily loss of time for the end-user, required to keep her mailbox clean, to a financial loss for the ISPs, constantly in need of larger bandwidths and disk space. According to a recent study, MSN and AOL discard together almost five billion of such emails every day...
متن کاملAnalyzing Behavioral Features for Email Classification
Many researchers have applied statistical analysis techniques to email for classification purposes, such as identifying spam messages. Such approaches can be highly effective, however many examine incoming email exclusively — which does not provide detailed information about an individual user’s behavior. Only by analyzing outgoing messages can a user’s behavior be ascertained. Our contribution...
متن کاملA Comparative Study for Email Classification
Email has become one of the fastest and most economical forms of communication. However, the increase of email users have resulted in the dramatic increase of spam emails during the past few years. In this paper, email data was classified using four different classifiers (Neural Network, SVM classifier, Naïve Bayesian Classifier, and J48 classifier). The experiment was performed based on differ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computational Statistics & Data Analysis
سال: 2012
ISSN: 0167-9473
DOI: 10.1016/j.csda.2011.09.023