One-Pass AUC Optimization

نویسندگان

  • Wei Gao
  • Rong Jin
  • Shenghuo Zhu
  • Zhi-Hua Zhou
چکیده

AUC is an important performance measure and many algorithms have been devoted to AUC optimization, mostly by minimizing a surrogate convex loss on a training data set. In this work, we focus on one-pass AUC optimization that requires going through the training data only once without storing the entire training dataset, where conventional online learning algorithms cannot be applied directly because AUC is measured by a sum of losses defined over pairs of instances from different classes. We develop a regression-based algorithm which only needs to maintain the first and second-order statistics of training data in memory, resulting a storage requirement independent from the size of training data. To efficiently handle high-dimensional data, we develop a randomized algorithm that approximates the covariance matrices by low-rank matrices. We verify, both theoretically and empirically, the effectiveness of the proposed algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Sparse Nonlinear Classifier Design Using AUC Optimization

AUC (Area under the ROC curve) is an important performance measure for applications where the data is highly imbalanced. Learning to maximize AUC performance is thus an important research problem. Using a max-margin based surrogate loss function, AUC optimization problem can be approximated as a pairwise rankSVM learning problem. Batch learning methods for solving the kernelized version of this...

متن کامل

A Structural SVM Based Approach for Optimizing Partial AUC

The area under the ROC curve (AUC) is a widely used performance measure in machine learning. Increasingly, however, in several applications, ranging from ranking and biometric screening to medical diagnosis, performance is measured not in terms of the full area under the ROC curve, but instead, in terms of the partial area under the ROC curve between two specified false positive rates. In this ...

متن کامل

Support Vector Algorithms for Optimizing the Partial Area under the ROC Curve

The area under the ROC curve (AUC) is a widely used performance measure in machine learning. Increasingly, however, in several applications, ranging from ranking to biometric screening to medicine, performance is measured not in terms of the full area under the ROC curve but in terms of the partial area under the ROC curve between two false-positive rates. In this letter, we develop support vec...

متن کامل

Dose-dependent pharmacokinetics of itraconazole after intravenous or oral administration to rats: intestinal first-pass effect.

The dose-dependent pharmacokinetics of itraconazole after intravenous (10, 20, or 30 mg/kg) and oral (10, 30, or 50 mg/kg) administration and the first-pass effects of itraconazole after intravenous, intraportal, intragastric, and intraduodenal administration at a dose of 10 mg/kg were evaluated in rats. After intravenous administration at a dose of 30 mg/kg, the area under the plasma concentra...

متن کامل

Evaluation of the POSSUM, P-POSSUM and E-PASS scores in the surgical treatment of hilar cholangiocarcinoma

BACKGROUND The Physiological and Operative Severity Score for the enUmeration of Mortality and morbidity (POSSUM) model, its Portsmouth (P-POSSUM) modification and the Estimation of physiologic ability and surgical stress (E-PASS) are three surgical risk scoring systems used extensively to predict postoperative morbidity and mortality in general surgery. The aim was to undertake the first study...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Artif. Intell.

دوره 236  شماره 

صفحات  -

تاریخ انتشار 2013