SAITS: Self-attention-based imputation for time series
نویسندگان
چکیده
Missing data in time series is a pervasive problem that puts obstacles the way of advanced analysis. A popular solution imputation, where fundamental challenge to determine what values should be filled in. This paper proposes SAITS, novel method based on self-attention mechanism for missing value imputation multivariate series. Trained by joint-optimization approach, SAITS learns from weighted combination two diagonally-masked (DMSA) blocks. DMSA explicitly captures both temporal dependencies and feature correlations between steps, which improves accuracy training speed. Meanwhile, weighted-combination design enables dynamically assign weights learned representations blocks according attention map missingness information. Extensive experiments quantitatively qualitatively demonstrate outperforms state-of-the-art methods time-series task efficiently reveal SAITS’ potential improve learning performance pattern recognition models incomplete real world.
منابع مشابه
Missing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملTraining energy-based models for time-series imputation
Imputing missing values in high dimensional time-series is a difficult problem. This paper presents a strategy for training energy-based graphical models for imputation directly, bypassing difficulties probabilistic approaches would face. The training strategy is inspired by recent work on optimization-based learning (Domke, 2012) and allows complex neural models with convolutional and recurren...
متن کاملAlgorithms for Segmenting Time Series
As with most computer science problems, representation of the data is the key to ecient and eective solutions. Piecewise linear representation has been used for the representation of the data. This representation has been used by various researchers to support clustering, classication, indexing and association rule mining of time series data. A variety of algorithms have been proposed to obtain...
متن کاملKNN-DTW Based Missing Value Imputation for Microarray Time Series Data
Microarray technology provides an opportunity for scientists to analyze thousands of gene expression profiles simultaneously. However, microarray gene expression data often contain multiple missing expression values due to many reasons. Effective methods for missing value imputation in gene expression data are needed since many algorithms for gene analysis require a complete matrix of gene arra...
متن کاملa time-series analysis of the demand for life insurance in iran
با توجه به تجزیه و تحلیل داده ها ما دریافتیم که سطح درامد و تعداد نمایندگیها باتقاضای بیمه عمر رابطه مستقیم دارند و نرخ بهره و بار تکفل با تقاضای بیمه عمر رابطه عکس دارند
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Expert Systems With Applications
سال: 2023
ISSN: ['1873-6793', '0957-4174']
DOI: https://doi.org/10.1016/j.eswa.2023.119619