Neyman-Pearson Classification under High-Dimensional Settings

Authors

  • Anqi Zhao
  • Yang Feng
  • Lie Wang
  • Xin Tong
Abstract

Most existing binary classification methods target the optimization of the overall classification risk and may fail to serve some real-world applications, such as cancer diagnosis, where users are more concerned with the risk of misclassifying one specific class than the other. The Neyman-Pearson (NP) paradigm was introduced in this context as a novel statistical framework for handling asymmetric type I/II error priorities. It seeks classifiers with a minimal type II error, subject to a type I error constrained below a user-specified level. This article is the first attempt to construct classifiers with guaranteed theoretical performance under the NP paradigm in high-dimensional settings. Based on the fundamental Neyman-Pearson Lemma, we use a plug-in approach to construct NP-type classifiers for Naive Bayes models. The proposed classifiers satisfy the NP oracle inequalities, which are natural NP paradigm counterparts of the oracle inequalities in classical binary classification. Besides their desirable theoretical properties, we also demonstrate their numerical advantages in prioritized error control via both simulation and real data studies.
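
The plug-in idea in the abstract can be illustrated with a short Python sketch. This is not the authors' construction: it assumes Gaussian per-feature densities for the Naive Bayes model, splits the class-0 sample into a fitting part and a held-out part, and sets the threshold at the empirical (1 - alpha) quantile of the held-out class-0 scores. That quantile rule only controls the type I error approximately and does not carry the theoretical guarantees the abstract refers to. All function names and the simulated data are hypothetical.

```python
import numpy as np


def fit_gaussian_nb(X):
    """Per-feature mean and variance of one class (features treated as independent)."""
    # Assumption: Gaussian marginals; the small constant avoids division by zero.
    return X.mean(axis=0), X.var(axis=0) + 1e-9


def log_density(X, mean, var):
    """Naive Bayes log density: sum of per-feature Gaussian log densities."""
    return (-0.5 * (np.log(2 * np.pi * var) + (X - mean) ** 2 / var)).sum(axis=1)


def fit_np_plugin(X0_fit, X0_thresh, X1, alpha):
    """Estimate both class densities and pick a threshold on the log density ratio.

    X0_fit    : class-0 rows used to estimate the class-0 density
    X0_thresh : held-out class-0 rows used only to choose the threshold
    X1        : class-1 rows used to estimate the class-1 density
    alpha     : target level for the type I error
    """
    p0, p1 = fit_gaussian_nb(X0_fit), fit_gaussian_nb(X1)
    scores0 = log_density(X0_thresh, *p1) - log_density(X0_thresh, *p0)
    # Simplification: empirical quantile of held-out class-0 scores.
    threshold = np.quantile(scores0, 1 - alpha)
    return p0, p1, threshold


def predict(X, p0, p1, threshold):
    """Predict class 1 when the estimated log density ratio exceeds the threshold."""
    score = log_density(X, *p1) - log_density(X, *p0)
    return (score > threshold).astype(int)


# Simulated data (hypothetical): 50 features, class 1 shifted by 0.3 per feature.
rng = np.random.default_rng(0)
d = 50
X0 = rng.normal(0.0, 1.0, size=(2000, d))
X1 = rng.normal(0.3, 1.0, size=(1000, d))

p0, p1, t = fit_np_plugin(X0[:1000], X0[1000:], X1, alpha=0.05)

# Fraction of held-out class-0 points flagged as class 1 (empirical type I error).
held_out_type1 = predict(X0[1000:], p0, p1, t).mean()
```

Raising the threshold lowers the type I error at the cost of a higher type II error; the sketch fixes the former near alpha and accepts whatever type II error results.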

Similar resources

Neyman-Pearson classification under a strict constraint

Motivated by problems of anomaly detection, this paper implements the Neyman-Pearson paradigm to deal with asymmetric errors in binary classification with a convex loss. Given a finite collection of classifiers, we combine them and obtain a new classifier that satisfies simultaneously the two following properties with high probability: (i) its probability of type I error is below a pre-specifi...

A plug-in approach to Neyman-Pearson classification

The Neyman-Pearson (NP) paradigm in binary classification treats type I and type II errors with different priorities. It seeks classifiers that minimize type II error, subject to a type I error constraint under a user-specified level α. In this paper, plug-in classifiers are developed under the NP paradigm. Based on the fundamental Neyman-Pearson Lemma, we propose two related plug-in classifier...

Detection and Classification of Heart Premature Contractions via α-Level Binary Neyman-Pearson Radius Test: A Comparative Study

The aim of this study is to introduce a new methodology for isolating ectopic rhythms in ambulatory electrocardiogram (ECG) Holter data via appropriate statistical analyses imposing reasonable computational burden. First, the events of the ECG signal are detected and delineated using a robust wavelet-based algorithm. Then, using Binary Neyman-Pearson Radius test, an appropriate classifie...

Neyman-Pearson Classification, Convexity and Stochastic Constraints

Motivated by problems of anomaly detection, this paper implements the Neyman-Pearson paradigm to deal with asymmetric errors in binary classification with a convex loss. Given a finite collection of classifiers, we combine them and obtain a new classifier that satisfies simultaneously the two following properties with high probability: (i) its probability of type I error is below a pre-specifie...

OFDM-BPSK Wireless Communication System Performance under Employment of Generalized Receiver

In this paper, we discuss the performance of orthogonal frequency division multiplexing (OFDM) wireless communication systems using binary phase shift keying (BPSK) modulation over a Rayleigh fading channel, implementing the generalized receiver (GR) constructed in accordance with the generalized approach to signal processing in noise. We compare OFDM-BPSK wireless communication systems ba...

Journal title:
  • Journal of Machine Learning Research

Volume 17

Publication date: 2016