On recovering missing values with minimal error

نویسنده

  • Nikolai Dokuchaev
چکیده

The paper studies frequency criterions of recoverability of a single missing value from an observed sequence. Stochastic Gaussian stationary processes that are non recoverable in this sense are called minimal processes; they have a degenerate spectral density. The paper considers this problem in a pathwise setting without probabilistic assumptions; recoverability criterions are formulated in the terms of a degeneracy of the Z-transforms. For processes with non-degenerate Z-transform, an optimal recovering algorithm is suggested. This algorithm is based on projections on a closest recoverable sequence; it leads to explicit formulae for the recovered value. Some robustness of the solution with respect to noise contamination and truncation is established.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On recovering missing values in a pathwise setting

The paper suggests sufficient conditions of error-free recoverability of a missing value for sequences, i.e., discrete time processes, in the pathwise setting, without using probabilistic assumptions on the ensemble. This setting targets situations where we deal with a single sequence that is deemed to be unique and such that we cannot rely on statistics collected from other similar samples. A ...

متن کامل

Approximation of Frame Based Missing Data Recovery

Recovering missing data from its partial samples is a fundamental problem in mathematics and it has wide range of applications in image and signal processing. While many such algorithms have been developed recently, there are very few papers available on their error estimations. This paper is to analyze the error of a frame based data recovery approach from random samples. In particular, we est...

متن کامل

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

Optimal data recovery and forecasting with dummy long-horizon forecasts

The paper suggests a method of recovering of missing values based on optimal approximation by band-limited processes for sequences, i.e. discrete time processes, that are not necessarily band-limited. The problem is considered in the pathwise setting, without using probabilistic assumptions on the ensemble. The method requires to solve a closed linear equation in the time domain connecting the ...

متن کامل

Solving Incomplete Datasets in Soft Set Using Supported Sets and Aggregate Values

The theory of soft set proposed by Molodtsovin 1999[1]is a new method for handling uncertain data and can be defined as a Boolean-valued information system. Ithas been applied to data analysis and decision support systems based on large datasets. In this paper, it is shown that calculated support value can be used to determine missing attribute value of an object. However, in cases when more th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1604.06980  شماره 

صفحات  -

تاریخ انتشار 2016