MIGHT: Statistical Methodology for Missing-Data Imputation in Food Composition Databases
نویسندگان
چکیده
منابع مشابه
Missing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملHandling Missing Data in Trees: Surrogate Splits or Statistical Imputation
In many applications of data mining a sometimes considerable part of the data values is missing. This may occur because the data values were simply never entered into the operational systems from which the mining table was constructed, or because for example simple domain checks indicate that entered values are incorrect. Despite the frequent occurrence of missing data, most data mining algorit...
متن کاملMultiple Imputation for Missing Data
Multiple imputation provides a useful strategy for dealing with data sets with missing values. Instead of filling in a single value for each missing value, Rubin’s (1987) multiple imputation procedure replaces each missing value with a set of plausible values that represent the uncertainty about the right value to impute. These multiply imputed data sets are then analyzed by using standard proc...
متن کاملMissing Data Imputation within the Statistical learning Paradigm
In the framework of missing data imputation, Rubin [7] formalized three types of missing data mechanisms upon definition of a missing data indicator matrix pointing out the missing-ness in the data matrix, assigning it a random variable with a conditional probability distribution given the data matrix depending on unknown parameters. Within Rubin’s paradigm, missing data imputation can be under...
متن کاملMissing Data Imputation for Supervised Learning
This paper compares methods for imputing missing categorical data for supervised learning tasks. The ability of researchers to accurately fit a model and yield unbiased estimates may be compromised by missing data, which are prevalent in survey-based social science research. We experiment on two machine learning benchmark datasets with missing categorical data, comparing classifiers trained on ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied Sciences
سال: 2019
ISSN: 2076-3417
DOI: 10.3390/app9194111