Adaptive Basis Function Construction: An Approach for Adaptive Building of Sparse Polynomial Regression Models 127 Adaptive Basis Function Construction: An Approach for Adaptive Building of Sparse Polynomial Regression Models
نویسنده
چکیده
The task of learning useful models from available data is common in virtually all fields of science, engineering, and finance. The goal of the learning task is to estimate unknown (input, output) dependency (or model) from training data (consisting of a finite number of samples) with good prediction (generalization) capabilities for future (test) data (Cherkassky & Mulier, 2007; Hastie et al., 2003). One of the specific learning tasks is regression – estimating an unknown real-valued function. The process of regression model learning is also called regression modelling or regression model building. Many practical regression modelling methods use basis function representation – these are also called dictionary methods (Friedman, 1994; Cherkassky & Mulier, 2007; Hastie et al., 2003), where a particular type of chosen basis functions constitutes a “dictionary”. Further distinction is then made between non-adaptive methods and adaptive (also called flexible) methods. The most widely used form of basis function expansions is polynomial of a fixed degree. If a model always includes a fixed (predetermined) set of basis functions (i.e. they are not adapted to training data), the modelling method is considered non-adaptive (Cherkassky & Mulier, 2007; Hastie et al., 2003). Using adaptive modelling methods however the basis functions themselves are adapted to data (by employing some kind of search mechanism). This includes methods where the restriction of fixed polynomial degree is removed and the model’s degree now becomes another parameter to fit. Adaptive methods use a very wide dictionary of candidate basis functions and can, in principle, approximate any continuous function with a pre-specified accuracy. This is also known as the universal approximation property (Kolmogorov & Fomin, 1975, Cherkassky & Mulier, 2007). However, in polynomial regression the increase in the model’s degree leads to exponential growth of the number of basis functions in the model (Cherkassky & Mulier, 2007; Hastie et al., 2003). With finite training data, the number of basis functions along with the number of model’s parameters (coefficients) quickly exceeds the number of data samples, making model’s parameter estimation impossible. Additionally the model should not be overly 8
منابع مشابه
Relevance vector machine and multivariate adaptive regression spline for modelling ultimate capacity of pile foundation
This study examines the capability of the Relevance Vector Machine (RVM) and Multivariate Adaptive Regression Spline (MARS) for prediction of ultimate capacity of driven piles and drilled shafts. RVM is a sparse method for training generalized linear models, while MARS technique is basically an adaptive piece-wise regression approach. In this paper, pile capacity prediction models are developed...
متن کاملEnsembling Adaptively Constructed Polynomial Regression Models
The approach of subset selection in polynomial regression model building assumes that the chosen fixed full set of predefined basis functions contains a subset that is sufficient to describe the target relation sufficiently well. However, in most cases the necessary set of basis functions is not known and needs to be guessed – a potentially non-trivial (and long) trial and error process. In our...
متن کاملEvaluation of moving bed biofilm reactor (MBBR) by applying adaptive neuro-fuzzy inference systeme (ANFIS), radial basis function (RBF) and Fuzzy Regression Analysis
The purpose of this study is to investigate the accuracy of predictions of aniline removal efficiency in a moving bed biofilm reactor (MBBR) by various methods, namely by RBF, ANFIS, and fuzzy regression analysis. The reactor was operated in an aerobic batch and was filled by light expanded clay aggregate (LECA) as a carrier for the treatment of Aniline synthetic wastewater. Exploratory data an...
متن کاملDynamic Modeling of the Electromyographic and Masticatory Force Relation Through Adaptive Neuro-Fuzzy Inference System Principal Dynamic Mode Analysis
Introduction: Researchers have employed surface electromyography (EMG) to study the human masticatory system and the relationship between the activity of masticatory muscles and the mechanical features of mastication. This relationship has several applications in food texture analysis, control of prosthetic limbs, rehabilitation, and teleoperated robots. Materials and Methods: In this paper, w...
متن کاملAdaptive Polynomial Chaos Techniques for Uncertainty Quantification of a Gas Cooled Fast Reactor Transient
Uncertainty quantification plays an increasingly important role in the nuclear community, especially with the rise of Best Estimate Plus Uncertainty methodologies. Sensitivity analysis, surrogate models, Monte Carlo sampling and several other techniques can be used to propagate input uncertainties. In recent years however polynomial chaos expansion has become a popular alternative providing hig...
متن کامل