$k$-Variance: A Clustered Notion of Variance

نویسندگان

چکیده

We introduce $k$-variance, a generalization of variance built on the machinery random bipartite matchings. $K$-variance measures expected cost matching two sets $k$ samples from distribution to each other, capturing local rather than global information about measure as increases; it is easily approximated stochastically using sampling and linear programming. In addition defining $k$-variance proving its basic properties, we provide in-depth analysis this quantity in several key cases, including one-dimensional measures, clustered concentrated low-dimensional subsets $\mathbb R^n$. conclude with experiments open problems motivated by new way summarize distributional shape.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ecient Algorithms for Variance-based K-clustering

In this paper we consider the k-clustering problem for n points in the d-dimensional space, motivated from the problem of computing a color lookup table for frame bu er display, and that of compressing two-dimensional image data. Using the technique of computational geometry, this clustering problem is investigated in a uni ed manner.

متن کامل

Ordering of Order Statistics Using Variance Majorization

 In this paper, we study stochastic comparisons of order statistics of independent random variables with proportional hazard rates, using the notion of variance majorization.

متن کامل

A resampling variance estimator for the k nearest neighbours technique

Current estimators of variance for the k nearest neighbours (kNN) technique are designed for estimates of population totals. Their efficiency in small-area estimation problems can be poor. In this study, we propose a modified balanced repeated replication estimator of variance (BRR) of a kNN total that performs well in small-area estimation problems and under both simple random and cluster samp...

متن کامل

Hedging (Co)Variance Risk with Variance Swaps∗

Abstract In this paper we introduce a new criterion in order to measure the variance and covariance risks in financial markets. In an asset allocation framework with stochastic (co)variances, we consider the possibility to invest also in variance swaps, that are assets which span the volatility as well as the co-volatility risks. We provide explicit solutions for the portfolio optimization prob...

متن کامل

The Wavelet Variance , Allan Variance and Leakage

Wavelets have recently been a subject of great interest in geophysics, mathematics and signal processing. The discrete wavelet transform can be used to decompose a time series with respect to a set of basis functions, each one of which is associated with a particular scale. The properties of a time series at different scales can then be summarized by the wavelet variance, which decomposes the v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: SIAM journal on mathematics of data science

سال: 2022

ISSN: ['2577-0187']

DOI: https://doi.org/10.1137/20m1385895