On Wasserstein Two-Sample Testing and Related Families of Nonparametric Tests

نویسندگان

  • Aaditya Ramdas
  • Nicolás García Trillos
  • Marco Cuturi
چکیده

Nonparametric two-sample or homogeneity testing is a decision theoretic problem that involves identifying differences between two random variables without making parametric assumptions about their underlying distributions. The literature is old and rich, with a wide variety of statistics having being designed and analyzed, both for the unidimensional and the multivariate setting. In this short survey, we focus on test statistics that involve the Wasserstein distance. Using an entropic smoothing of the Wasserstein distance, we connect these to very different tests including multivariate methods involving energy statistics and kernel based maximum mean discrepancy and univariate methods like the Kolmogorov–Smirnov test, probability or quantile (PP/QQ) plots and receiver operating characteristic or ordinal dominance (ROC/ODC) curves. Some observations are implicit in the literature, while others seem to have not been noticed thus far. Given nonparametric two-sample testing’s classical and continued importance, we aim to provide useful connections for theorists and practitioners familiar with one subset of methods but not others.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nonparametric tests for the gap time distributions of serial events based on censored data.

This article deals with the problem of comparing two populations with respect to the distribution of the gap time between two successive events when each subject can experience a series of events and when the event times are potentially right censored. Several families of nonparametric tests are developed, all of which allow arbitrary distributions and dependence structures for the serial event...

متن کامل

HYPOTHESIS TESTING FOR AN EXCHANGEABLE NORMAL DISTRIBUTION

Consider an exchangeable normal vector with parameters ????2, and ?. On the basis of a vector observation some tests about these parameters are found and their properties are discussed. A simulation study for these tests and a few nonparametric tests are presented. Some advantages and disadvantages of these tests are discussed and a few applications are given.

متن کامل

سری آمار:روش‌های متداول ناپارامتری

There are situations in medical studies, wherein it is impossible to use the methods based on normal distribution (parametric methods). This paper objects to introduce common nonparametric methods and the inferences based on the methods in medical studies. Principles and method of calculations along with the software codes for common nonparametric methods and inference based on them were presen...

متن کامل

Wasserstein Identity Testing

Uniformity testing and the more general identity testing are well studied problems in distributional property testing. Most previous work focuses on testing under L1-distance. However, when the support is very large or even continuous, testing under L1-distance may require a huge (even infinite) number of samples. Motivated by such issues, we consider the identity testing in Wasserstein distanc...

متن کامل

dbEmpLikeGOF: An R Package for Nonparametric Likelihood Ratio Tests for Goodness-of-Fit and Two Sample Comparisons Based on Sample Entropy

We introduce and examine dbEmpLikeGOF, an R language for performing goodnessof-fit tests based on sample entropy. This package also performs the two sample distribution comparison test. For a given vector of data observations, the provided function dbEmpLikeGOF tests the data for the proposed null distributions, or tests for distribution equality between two vectors of observations. The propose...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Entropy

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2017