Scrambled Objects for Least-Squares Regression
نویسندگان
چکیده
We consider least-squares regression using a randomly generated subspace GP ⊂ F of finite dimension P , where F is a function space of infinite dimension, e.g. L2([0, 1]). GP is defined as the span of P random features that are linear combinations of the basis functions of F weighted by random Gaussian i.i.d. coefficients. In particular, we consider multi-resolution random combinations at all scales of a given mother function, such as a hat function or a wavelet. In this latter case, the resulting Gaussian objects are called scrambled wavelets and we show that they enable to approximate functions in Sobolev spaces H([0, 1]). As a result, given N data, the least-squares estimate ĝ built from P scrambled wavelets has excess risk ||f∗ − ĝ||P = O(||f||Hs([0,1]d)(logN)/P + P (logN)/N) for target functions f∗ ∈ H([0, 1]) of smoothness order s > d/2. An interesting aspect of the resulting bounds is that they do not depend on the distributionP from which the data are generated, which is important in a statistical regression setting considered here. Randomization enables to adapt to any possible distribution. We conclude by describing an efficient numerical implementation using lazy expansions with numerical complexity Õ(2N logN + N), where d is the dimension of the input space.
منابع مشابه
Brownian Motions and Scrambled Wavelets for Least-Squares Regression
We consider ordinary (non penalized) least-squares regression where the regression function is chosen in a randomly generated sub-space GP ⊂ S of finite dimension P , where S is a function space of infinite dimension, e.g. L2([0, 1]). GP is defined as the span of P random features that are linear combinations of the basis functions of S weighted by random Gaussian i.i.d. coefficients. We charac...
متن کاملEvaluation of hybrid fuzzy regression capability based on comparison with other regression methods
In this paper, the difference between classical regression and fuzzy regression is discussed. In fuzzy regression, nonphase and fuzzy data can be used for modeling. While in classical regression only non-fuzzy data is used. The purpose of the study is to investigate the possibility of regression method, least squares regression based on regression and linear least squares linear regression met...
متن کاملDetermination of 137Ba Isotope Abundances in Water Samples by Inductively Coupled Plasma-optical Emission Spectrometry Combined with Least-squares Support Vector Machine Regression
A simple and rapid method for the determination of 137Ba isotope abundances in water samples by inductively coupled plasma-optical emission spectrometry (ICP-OES) coupled with least-squares support vector machine regression (LS-SVM) is reported. By evaluation of emission lines of barium, it was found that the emission line at 493.408 nm provides the best results for the determination...
متن کاملA robust least squares fuzzy regression model based on kernel function
In this paper, a new approach is presented to fit arobust fuzzy regression model based on some fuzzy quantities. Inthis approach, we first introduce a new distance between two fuzzynumbers using the kernel function, and then, based on the leastsquares method, the parameters of fuzzy regression model isestimated. The proposed approach has a suitable performance to<b...
متن کاملDetermination of Protein and Moisture in Fishmeal by Near-Infrared Reflectance Spectroscopy and Multivariate Regression Based on Partial Least Squares
The potential of Near Infrared Reflectance Spectroscopy (NIRS) as a fast method to predict the Crude Protein (CP) and Moisture (M) content in fishmeal by scanning spectra between 1000 and 2500 nm using multivariate regression technique based on Partial Least Squares (PLS) was evaluated. The coefficient of determination in calibration (R2C) and Standard Error of Calibra...
متن کامل