Cost-sensitive Classification: Status and Beyond

Author

  • Hsuan-Tien Lin
Abstract

The rows of the cost matrix represent the actual patient status, and the columns represent the diagnosis made by the doctor. For instance, on any correct diagnosis, society pays no (additional) cost. However, if an H1N1-infected patient is predicted as cold-infected or healthy, the whole of society may suffer a huge cost. On the other hand, if a cold-infected patient is predicted as healthy, society needs to pay some cost, but not one as serious as in the previous scenario. These different costs are important to a human doctor making any diagnosis. For instance, the doctor would be very careful about the slightest H1N1 symptom in order to prevent the “1000000”-level misprediction. If we were to build an automatic system (a “computer doctor”) to make the diagnosis, how can the system use the cost information appropriately? Many real-world applications that share similar needs can be found in medical decision making, target marketing, and object recognition. Those applications belong to cost-sensitive classification. In fact, cost-sensitive classification can be used to express any finite-choice and bounded-loss machine learning problem [2]. Thus, it has attracted much research attention in the past decade [3], [4], [5], [6], [7], [8], [2], [9], [10], [11], [12], [1], [13].
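One common way a system can use such cost information is the expected-cost decision rule: given estimated class probabilities for a patient, predict the diagnosis whose expected cost is lowest. The sketch below illustrates that rule only and is not necessarily the approach taken in the paper. Only the 1000000 magnitude comes from the text above; every other entry of the matrix, and the function names, are assumptions made for illustration.

```python
# Illustrative sketch of expected-cost (Bayes-optimal) prediction.
# Rows = actual status, columns = predicted status, as in the abstract.
# ASSUMPTION: only the 1000000 magnitude appears in the text; all other
# cost values below are hypothetical placeholders.
CLASSES = ["H1N1", "cold", "healthy"]
COST = [
    [0, 1000000, 1000000],  # actual H1N1
    [100, 0, 3000],         # actual cold
    [100, 30, 0],           # actual healthy
]


def expected_costs(probs, cost=COST):
    """Expected cost of each possible prediction, given P(actual class)."""
    n_pred = len(cost[0])
    return [
        sum(p * row[k] for p, row in zip(probs, cost))
        for k in range(n_pred)
    ]


def min_cost_prediction(probs, cost=COST, classes=CLASSES):
    """Return the class label whose expected cost is smallest."""
    costs = expected_costs(probs, cost)
    best = min(range(len(costs)), key=costs.__getitem__)
    return classes[best]


if __name__ == "__main__":
    # A patient who is most likely healthy, with a 1% chance of H1N1.
    probs = [0.01, 0.09, 0.90]
    print(expected_costs(probs))
    print(min_cost_prediction(probs))
```

Under these assumed costs, a patient with only a 1% estimated chance of H1N1 is still flagged as H1N1, because missing a true case dominates the expected cost. This mirrors the careful behaviour of the human doctor described above, and differs from simply predicting the most probable class.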

Similar resources

Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms

In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...

A New Formulation for Cost-Sensitive Two Group Support Vector Machine with Multiple Error Rate

Support vector machine (SVM) is a popular classification technique which classifies data using a max-margin separating hyperplane. The normal vector and bias of this hyperplane are determined by solving a quadratic model, which implies that SVM training amounts to an optimization problem. Among the extensions of SVM, the cost-sensitive scheme refers to a model with multiple costs which conside...

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

Due to the rise of technology, the possibility of fraud in different areas such as banking has increased. Credit card fraud is a crucial problem in banking and its danger is ever increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...

Comparison of Different Machine Learning Methods for Diagnosing Hypertension in Diabetic Patients with and without Considering Costs

Background and Objectives: Diabetic patients are always at risk of hypertension. In this paper, the main goal was to design a native cost-sensitive model for the diagnosis of hypertension among diabetics, considering the prior probabilities. Methods: In this paper, we tried to design a cost-sensitive model for the diagnosis of hypertension in diabetic patients, considering the distribution of...

Beyond Fano's inequality: bounds on the optimal F-score, BER, and cost-sensitive risk and their implications

Fano’s inequality lower-bounds the probability of transmission error through a communication channel. Applied to classification problems, it provides a lower bound on the Bayes error rate and motivates the widely used Infomax principle. In modern machine learning, we are often interested in more than just the error rate. In medical diagnosis, different errors incur different costs; hence, the ov...


Publication date: 2010