Efficient Estimators for Generalized Additive Models
نویسنده
چکیده
Generalized additive models are a powerful generalization of linear and logistic regression models. In this paper we show that a natural regression graph learning algorithm efficiently learns generalized additive models. Efficiency is proven in two senses: the estimator’s future prediction accuracy approaches optimality at rate inverse polynomial in the size of the training data, and its runtime is polynomial in the size of the training data. Furthermore, the guarantees are nearly linear in terms of the dimensionality (number of regressors) of the problem, and hence the algorithm does not suffer from the “curse of dimensionality.” The algorithm is a simple generalization of Mansour and McAllester’s classification algorithm that generates decision graphs, i.e., decision trees with merges. Our analysis is also viewed as defining a natural extension of the original classification boosting theorems (Schapire, 1990) to the regression setting. Loosely speaking, we define a weak correlator to be a real-valued predictor that has a correlation coefficient with the target function that is bounded from zero. We show how to efficiently boost weak correlators to get predictions with correlation arbitrarily close to 1 (error arbitrarily close to 0). Our boosting analysis is a natural extension of the classification boosting analysis of Kearns and Mansour (1999) and Mansour and McAllester (2002).
منابع مشابه
Efficient semiparametric estimation in generalized partially linear additive models for longitudinal/clustered data
We consider efficient estimation of the Euclidean parameters in a generalized partially linear additive models for longitudinal/clustered data when multiple covariates need to be modeled nonparametrically, and propose an estimation procedure based on a spline approximation of the nonparametric part of the model and the generalized estimating equations (GEE). Although the model in consideration ...
متن کاملWeighted Local Polynomial Regression, Weighted Additive Models and Local Scoring
This article describes the asymptotic properties of local polynomial regression estimators for univariate and additive models when observation weights are included. The implications of these ndings are discussed for local scoring estimators, a widely used class of estimators for generalized additive models described in Hastie and Tibshirani (1990).
متن کاملA Note on Local Scoring and Weighted Local Polynomial Regression in Generalized Additive Models
This article describes the asymptotic properties of local polynomial regression estimators for univariate and additive models when observation weights are included. Such weighted additive models are a crucial component of local scoring, the widely used estimation algorithm for generalized additive models described in Hastie and Tibshirani (1990). The statistical properties of the univariate loc...
متن کاملGeneralized Ridge Regression Estimator in Semiparametric Regression Models
In the context of ridge regression, the estimation of ridge (shrinkage) parameter plays an important role in analyzing data. Many efforts have been put to develop skills and methods of computing shrinkage estimators for different full-parametric ridge regression approaches, using eigenvalues. However, the estimation of shrinkage parameter is neglected for semiparametric regression models. The m...
متن کاملEstimation and Variable Selection for Generalized Additive Partial Linear Models By
We study generalized additive partial linear models, proposing the use of polynomial spline smoothing for estimation of nonparametric functions, and deriving quasi-likelihood based estimators for the linear parameters. We establish asymptotic normality for the estimators of the parametric components. The procedure avoids solving large systems of equations as in kernel-based procedures and thus ...
متن کاملNon-linear Bayesian prediction of generalized order statistics for liftime models
In this paper, we obtain Bayesian prediction intervals as well as Bayes predictive estimators under square error loss for generalized order statistics when the distribution of the underlying population belongs to a family which includes several important distributions.
متن کامل