Similar resources
EM-based stepwise regression imputation using standard and robust methods
Imputation of missing values is one of the major tasks for data pre-processing in many areas. Whenever imputation of data from official statistics comes into mind, several (additional) challenges almost always arise, like large data sets, data sets consisting of a mixture of different variable types, or data outliers. The aim of this contribution is to propose an automatic algorithm called IRMI...
Iterative stepwise regression imputation using standard and robust methods
Imputation of missing values is one of the major tasks for data pre-processing in many areas. Whenever imputation of data from official statistics comes into mind, several (additional) challenges almost always arise, like large data sets, data sets consisting of a mixture of different variable types, or data outliers. The aim is to propose an automatic algorithm called IRMI for iterative model-...
On stepwise regression
Given data y and k covariates x, one problem in linear regression is to decide which, if any, of the covariates to include when regressing y on the x. If k is small, it is possible to evaluate each subset of the x. If, however, k is large, then some other procedure must be used. Stepwise regression and the lasso are two such procedures, but they both assume a linear model with error term. A different ap...
Stepwise regression for unsupervised learning
I consider unsupervised extensions of the fast stepwise linear regression algorithm [5]. These extensions allow one to efficiently identify highly-representative feature variable subsets within a given set of jointly distributed variables. This in turn allows for the efficient dimensional reduction of large data sets via the removal of redundant features. Fast search is effected here through th...
Fast Stepwise Regression on Linked Data
The main focus of research in machine learning and statistics is on building more advanced and complex models. However, in practice it is often much more important to use the right variables. One may hope that the recent popularity of open data would allow researchers to easily find relevant variables. However, current linked data methodology is not suitable for this purpose since the number of matc...
Journal
Journal title: Journal of Applied Statistics
Year: 2002
ISSN: 0266-4763,1360-0532
DOI: 10.1080/02664760220136168