Nonparametric Regression in R An Appendix to An R Companion to Applied Regression, Second Edition

نویسنده

  • John Fox
چکیده

In traditional parametric regression models, the functional form of the model is specified before the model is fit to data, and the object is to estimate the parameters of the model. In nonparametric regression, in contrast, the object is to estimate the regression function directly without specifying its form explicitly. In this appendix to Fox and Weisberg (2011), we describe how to fit several kinds of nonparametric-regression models in R, including scatterplot smoothers, where there is a single predictor; models for multiple regression; additive regression models; and generalized nonparametric-regression models that are analogs to generalized linear models. 1 Nonparametric Regression Models The traditional nonlinear regression model (described in the Appendix on nonlinear regression) fits the model y = m(x, ) + " where is a vector of parameters to be estimated, and x is a vector of predictors; the errors " are assumed to be normally and independently distributed with mean 0 and constant variance 2. The function m(x, ), relating the average value of the response y to the predictors, is specified in advance, as it is in a linear regression model. The general nonparametric regression model is written in a similar manner, but the function m is left unspecified: y = m(x) + " = m(x1, x2, . . . , xp) + " for the p predictors x = (x1, x2, . . . , xp) ′. Moreover, the object of nonparametric regression is to estimate the regression function m(x) directly, rather than to estimate parameters. Most methods of nonparametric regression implicitly assume that m is a smooth, continuous function.1 As in nonlinear regression, it is standard to assume that "i ∼ NID(0, 2). An important special case of the general model is nonparametric simple regression, where there is only one predictor: y = m(x) + " Nonparametric simple regression is often called “scatterplot smoothing” because an important application is to tracing a smooth curve through a scatterplot of y against x. We frequently use nonparametric regression in this manner in the body of the text. An exception to the implicit assumption of smoothness is wavelet regression, not discussed in this appendix, which is implemented in R, e.g., in the wavethresh package; see Nason and Silverman (1994, 2000); Nason (2008).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time-Series Regression and Generalized Least Squares in R An Appendix to An R Companion to Applied Regression, Second Edition

Generalized least-squares (GLS ) regression extends ordinary least-squares (OLS) estimation of the normal linear model by providing for possibly unequal error variances and for correlations between different errors. A common application of GLS estimation is to time-series regression, in which it is generally implausible to assume that errors are independent. This appendix to Fox and Weisberg (2...

متن کامل

Time-Series Regression and Generalized Least Squares Appendix to An R and S-PLUS Companion to Applied Regression

with covariance matrix V (bOLS) = σ (XX) Let us, however, assume more generally that ε ∼ Nn(0,Σ), where the error-covariance matrix Σ is symmetric and positive-definite. Different diagonal entries in Σ correspond to non-constant error variances, while nonzero off-diagonal entries correspond to correlated errors. Suppose, for the time-being, that Σ is known. Then, the log-likelihood for the mode...

متن کامل

Parametric and Nonparametric Regression with Missing X’s—A Review

This paper gives a detailed overview of the problem of missing data in parametric and nonparametric regression. Theoretical basics, properties as well as simulation results may help the reader to get familiar with the common problem of incomplete data sets. Of course, not all occurences can be discussed so this paper could be seen as an introduction to missing data within regression analysis an...

متن کامل

Bootstrapping Regression Models Appendix to An R and S-PLUS Companion to Applied Regression

Bootstrapping is a general approach to statistical inference based on building a sampling distribution for a statistic by resampling from the data at hand. The term ‘bootstrapping,’ due to Efron (1979), is an allusion to the expression ‘pulling oneself up by one’s bootstraps’ – in this case, using the sample data as a population from which repeated samples are drawn. At first blush, the approac...

متن کامل

A Comparison of Thin Plate and Spherical Splines with Multiple Regression

Thin plate and spherical splines are nonparametric methods suitable for spatial data analysis. Thin plate splines acquire efficient practical and high precision solutions in spatial interpolations. Two components in the model fitting is considered: spatial deviations of data and the model roughness. On the other hand, in parametric regression, the relationship between explanatory and response v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010