Spectrum Estimation from Samples

نویسندگان

  • Weihao Kong
  • Gregory Valiant
چکیده

We consider the problem of approximating the set of eigenvalues of the covariance matrix of a multivariate distribution (equivalently, the problem of approximating the “population spectrum”), given access to samples drawn from the distribution. The eigenvalues of the covariance of a distribution contain basic information about the distribution, including the presence or lack of structure in the distribution, the effective dimensionality of the distribution, and the applicability of higher-level machine learning and multivariate statistical tools. We consider this fundamental recovery problem in the regime where the number of samples is comparable, or even sublinear in the dimensionality of the distribution in question. First, we propose a theoretically optimal and computationally efficient algorithm for recovering the moments of the eigenvalues of the population covariance matrix. We then leverage this accurate moment recovery, via a Wasserstein distance argument, to show that the vector of eigenvalues can be accurately recovered. Specifically, we show that our eigenvalue reconstruction algorithm is asymptotically consistent as the dimensionality of the distribution and sample size tend towards infinity, even in the sublinear sample regime where the ratio of the sample size to the dimensionality tends to zero. In addition to our theoretical results, we show that our approach performs well in practice for a broad range of distributions and sample sizes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parameter Estimation in Low-Rank Models from Small Sample Size and Undersampled Data: DOA and Spectrum Estimation by

In estimation theory, a set of parameters are estimated from a finite number of measurements (samples). In general, the quality of estimation degrades as the number of samples is reduced. In this thesis, the problem of parameter estimation in low-rank models from a small number of samples is studied. Specifically, we consider two related problems that fit in this system model: direction-of-arri...

متن کامل

Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic database

This paper presents a method for determining the vocaltract spectrum from the positions of xed points on the articulatory organs. The method is based on the search of a database comprised of pairs of articulatory and acoustic data representing the direct relationship between the articulator position and vocal-tract spectrum. To compile the database, the electro-magnetic articulograph (EMA) syst...

متن کامل

Bandlimited Spectrum Estimation under the Constraint of a Minimum Inter-Sample Spacing

In the matter of sampling of a continuous time stationary stochastic process for the purpose of spectrum estimation, a practical constraint that comes in the way is that of a minimum separation between successive samples. It is well known that for a given threshold of separation, uniform sampling can be used to estimate only those power spectral densities which are limited to the band implied b...

متن کامل

Spectrum Estimation by Noise-Compensated Data Extrapolation

High-resolution spectrum estimation techniques have been extensively studied in recent publications. Knowledge of the noise variance is vital for spectrum estimation from noise-corrupted observations. This paper presents the use of noise compensation and data extrapolation for spectrum estimation. We assume that the observed data sequence can be represented by a set of autoregressive parameters...

متن کامل

Application of Compressed Sampling for Spectrum Sensing and Channel Estimation in Wideband Cognitive Radio Networks

In the last few years Compressed Sampling (CS) has been well used in the area of signal processing and image compression. Recently, CS has been earning a great interest in the area of wireless communication networks. CS exploits the sparsity of the signal processed for digital acquisition to reduce the number of measurement, which leads to reductions in the size, power consumption, processing t...

متن کامل

Wheat Leaf Rust Disease Severity Estimation Using Reflectance Spectrum Coding Methods

Using spectroradiometry and remote sensing techniques is an effective and rapid method in diagnosing vegetation diseases which enforced mostly by using spectral vegetation indices and statistical methods.  The present study aimed to deploy encoding technique for the reflectance spectrum of the wheat leaves to assess the severity of the Rust disease. This is unlike to the spectral vegetation ind...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1602.00061  شماره 

صفحات  -

تاریخ انتشار 2016