Outlier Diagnostics in Logistic Regression: A Supervised Learning Technique
نویسندگان
چکیده
The goal of supervised learning is to build a concise model of the distribution of class labels in terms of predictor features. Logistic regression is one of the most popular supervised learning technique that is used in classification. Fields like computer vision, image analysis and engineering sciences frequently encounter data with outliers (noise). Presence of outliers in the training sample may be the cause of large training time, misclassification, and to design a faulty classifier. This article provides a new method for identifying outliers in logistic regression. The significance of the measure is shown by well-referred data sets.
منابع مشابه
Finding Anomaly With Fuzzy C-means ANN Using Semi-Supervised Approach
The FC-ANN (Artificial Neural Network) is used to speed up the technique. The anomaly Outlier detection is primary in various data-mining applications. Outlier detection methods have been suggested for number of application such as, fraud detection, voting irregularity analysis, data cleansing, clinical trials, network intrusion, severe weather prediction, geographic information system, credit ...
متن کاملSublinear Algorithms for Penalized Logistic Regression in Massive Datasets
Penalized logistic regression (PLR) is a widely used supervised learning model. In this paper, we consider its applications in largescale data problems and resort to a stochastic primal-dual approach for solving PLR. In particular, we employ a random sampling technique in the primal step and a multiplicative weights method in the dual step. This technique leads to an optimization method with su...
متن کاملThe Effect of Transitive Closure on the Calibration of Logistic Regression for Entity Resolution
This paper describes a series of experiments in using logistic regression machine learning as a method for entity resolution. From these experiments the authors concluded that when a supervised ML algorithm is trained to classify a pair of entity references as linked or not linked pair, the evaluation of the model’s performance should take into account the transitive closure of its pairwise lin...
متن کاملOutlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data
Outlier detection is useful for credit card fraud detection. Due to drastic increase in digital frauds, there is a lot of financial losses and therefore various techniques are developed for fraud detection and applied to diverse business fields. In high-dimensional data, outlier detection presents some challenges because of increment of dimensionality. In this paper, the proposed model aims to ...
متن کاملEpisodic Reinforcement Learning by Logistic Reward-Weighted Regression
It has been a long-standing goal in the adaptive control community to reduce the generically difficult, general reinforcement learning (RL) problem to simpler problems solvable by supervised learning. While this approach is today’s standard for value function-based methods, fewer approaches are known that apply similar reductions to policy search methods. Recently, it has been shown that immedi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011