Modeling of Soil Aggregate Stability Using Support Vector Machines and Multiple Linear Regression
Authors
Abstract
Introduction: Soil aggregate stability is a key factor in soil resistance to mechanical stresses, including the impacts of rainfall and surface runoff, and thus to water erosion (Canasveras et al., 2010). Various indicators have been proposed to characterize and quantify soil aggregate stability, for example the percentage of water-stable aggregates (WSA), the mean weight diameter (MWD) and geometric mean diameter (GMD) of aggregates, and water-dispersible clay (WDC) content (Calero et al., 2008). Unfortunately, the experimental methods available to determine these indicators are laborious, time-consuming, and difficult to standardize (Canasveras et al., 2010). It would therefore be advantageous if aggregate stability could be predicted indirectly from more easily available data (Besalatpour et al., 2014). The main objective of this study is to investigate the potential of support vector machines (SVMs) for estimating soil aggregate stability (as quantified by GMD) in comparison with multiple linear regression (MLR).
Materials and methods: The study area was part of the Bazoft watershed (31° 37′ to 32° 39′ N and 49° 34′ to 50° 32′ E), located in the northern part of the Karun River basin in central Iran. A total of 160 soil samples were collected from the top 5 cm of the soil surface. Easily available topographic, vegetation, and soil properties were used as inputs. Soil organic matter (SOM) content was determined by the Walkley-Black method (Nelson & Sommers, 1986). Particle size distribution in the soil samples (clay, silt, sand, fine sand, and very fine sand) was measured using the procedure described by Gee & Bauder (1986), and calcium carbonate equivalent (CCE) content was determined by the back-titration method (Nelson, 1982). The modified Kemper & Rosenau (1986) method was used to determine wet-aggregate stability (GMD).
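For reference, the geometric mean diameter is commonly computed from wet-sieving results with the expression below. This is the standard textbook form; the abstract does not state which variant the modified method uses, so the symbols are assumptions: $w_i$ is the mass of aggregates retained in size class $i$, $\bar{x}_i$ is the mean diameter of that class, and $n$ is the number of size classes.

```latex
\mathrm{GMD} = \exp\!\left( \frac{\sum_{i=1}^{n} w_i \ln \bar{x}_i}{\sum_{i=1}^{n} w_i} \right)
```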
The topographic attributes of elevation, slope, and aspect were derived from a 20-m by 20-m digital elevation model (DEM). The data set was divided into training and testing subsets: 70% of the samples were randomly selected for training, and the remaining 30% were used for testing. The correlation coefficient (r), mean square error (MSE), and error percentage (ERROR%) between the measured and predicted GMD values were used to evaluate the performance of the models.
Results and discussion: The descriptive statistics showed that there was little variability in the sample distributions of the variables used to develop the GMD prediction models, indicating that their values were all normally distributed. The constructed SVM model performed better in predicting GMD than the traditional multiple linear regression (MLR) model. The MSE and r values obtained for the SVM model were 0.005 and 0.86, respectively. The ERROR% value for soil aggregate stability prediction was 10.7% for the SVM model and 15.7% for the regression model. The scatter plots also showed that the SVM model estimated GMD more accurately than the MLR model, since its predicted GMD values agreed more closely with the measured values for most of the samples. The poorer performance of the MLR model might be due to the larger amount of data required to develop a reliable regression model compared to intelligent systems. Furthermore, linear models can extract only the linear effects of the predictors on the dependent variable, while in many cases the effects are not linear in nature. The SVM model, by contrast, is suitable for modelling nonlinear relationships, and its major advantage is that it can be developed without knowing the exact form of the analytical function on which the model should be built.
All of this indicates that the SVM approach would be the better choice for predicting soil aggregate stability.
Conclusion: The pixel-scale soil aggregate stability predicted using the developed SVM and MLR models demonstrates the usefulness of incorporating topographic and vegetation information along with soil properties as predictors. However, the SVM model predicted soil aggregate stability more accurately than the MLR model. It therefore appears that support vector machines can be used to predict some soil physical properties, such as the geometric mean diameter of soil aggregates, in the study area. Furthermore, beyond the higher predictive accuracy of the SVM method relative to the MLR technique, which was confirmed by the results of the current study, the other advantages of the SVM method should motivate soil scientists to work on it further: its intrinsic effectiveness with respect to traditional prediction methods, the smaller effort needed to set up the control parameters for architecture design, the possibility of solving the learning problem with constrained quadratic programming methods, etc.
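The evaluation protocol described in the abstract (a random 70/30 split, then r, MSE, and ERROR% between measured and predicted GMD) can be sketched in plain Python as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, and because the abstract does not define ERROR%, it is assumed here to be the mean absolute percentage error. The GMD values in the usage lines are made up for demonstration.

```python
import math
import random


def split_70_30(samples, seed=0):
    """Randomly assign 70% of the samples to training and the rest to testing."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = (len(shuffled) * 7) // 10  # integer arithmetic avoids float rounding
    return shuffled[:n_train], shuffled[n_train:]


def pearson_r(measured, predicted):
    """Pearson correlation coefficient between measured and predicted values."""
    n = len(measured)
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    cov = sum((m - mean_m) * (p - mean_p) for m, p in zip(measured, predicted))
    var_m = sum((m - mean_m) ** 2 for m in measured)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_m * var_p)


def mse(measured, predicted):
    """Mean square error between measured and predicted values."""
    return sum((m - p) ** 2 for m, p in zip(measured, predicted)) / len(measured)


def error_percent(measured, predicted):
    """ASSUMPTION: ERROR% taken as the mean absolute percentage error."""
    total = sum(abs(m - p) / abs(m) for m, p in zip(measured, predicted))
    return 100.0 * total / len(measured)


# Usage with made-up GMD values (mm); these are not data from the study.
train_ids, test_ids = split_70_30(range(160))
measured_gmd = [0.42, 0.55, 0.61, 0.38, 0.70]
predicted_gmd = [0.45, 0.50, 0.64, 0.40, 0.66]
print(len(train_ids), len(test_ids))
print(pearson_r(measured_gmd, predicted_gmd))
print(mse(measured_gmd, predicted_gmd))
print(error_percent(measured_gmd, predicted_gmd))
```

The same three functions would be applied to the test-set predictions of each model (SVM and MLR) so that the two are compared on identical samples.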
Similar resources
STAGE-DISCHARGE MODELING USING SUPPORT VECTOR MACHINES
Establishment of rating curves is often required by hydrologists for flow estimates in streams, rivers, etc. Measurement of discharge in a river is a time-consuming, expensive, and difficult process, and the conventional approach of regression analysis of the stage-discharge relation does not provide encouraging results, especially during floods. P
Support Vector Regression Machines
A new regression technique based on Vapnik’s concept of support vectors is introduced. We compare support vector regression (SVR) with a committee regression technique (bagging) based on regression trees and ridge regression done in feature space. On the basis of these experiments, it is expected that SVR will have advantages in high dimensionality space because SVR optimization does not depend...
Deep Learning using Linear Support Vector Machines
Recently, fully-connected and convolutional neural networks have been trained to achieve state-of-the-art performance on a wide variety of tasks such as speech recognition, image classification, natural language processing, and bioinformatics. For classification tasks, most of these “deep learning” models employ the softmax activation function for prediction and minimize cross-entropy loss. In ...
Properties of Support Vector Machines for Regression
In this report we show that the $\epsilon$-tube size in Support Vector Machine (SVM) for regression is $2\epsilon/\sqrt{1 + \|w\|^2}$. By using this result we show that, in the case all the data points are inside the $\epsilon$-tube, minimizing $\|w\|^2$ in SVM for regression is equivalent to maximizing the distance between the approximating hyperplane and the farthest points in the training set. Moreover, in the most general setting...
Classification Properties of Support Vector Machines for Regression
In this report we show some consequences of the work done by Pontil et al. in [1]. In particular we show that in the same hypotheses of the theorem proved in their paper, the optimal approximating hyperplane $f_R$ found by SVM regression classifies the data. This means that $y_i f_R(x_i) > 0$ for points which live externally to the margin between the two classes or points which live internally to th...
Journal title:
آب و خاک (Water and Soil), Volume 29, Issue 2, Pages 406-0