prediction of protein thermostability by an efficient neural network approach
نویسندگان
چکیده
introduction: manipulation of protein stability is important for understanding the principles that govern protein thermostability, both in basic research and industrial applications. various data mining techniques exist for prediction of thermostable proteins. furthermore, ann methods have attracted significant attention for prediction of thermostability, because they constitute an appropriate approach to mapping the non-linear input-output relationships and massive parallel computing. method: an extreme learning machine (elm) was applied to estimate thermal behavior of 1289 proteins. in the proposed algorithm, the parameters of elm were optimized using a genetic algorithm (ga), which tuned a set of input variables, hidden layer biases, and input weights, to and enhance the prediction performance. the method was executed on a set of amino acids, yielding a total of 613 protein features. a number of feature selection algorithms were used to build subsets of the features. a total of 1289 protein samples and 613 protein features were calculated from uniprot database to understand features contributing to the enzymes’ thermostability and find out the main features that influence this valuable characteristic. results:at the primary structure level, gln, glu and polar were the features that mostly contributed to protein thermostability. at the secondary structure level, helix_s, coil, and charged_coil were the most important features affecting protein thermostability. these results suggest that the thermostability of proteins is mainly associated with primary structural features of the protein. according to the results, the influence of primary structure on the thermostabilty of a protein was more important than that of the secondary structure. it is shown that prediction accuracy of elm (mean square error) can improve dramatically using ga with error rates rmse=0.004 and mape=0.1003. conclusion: the proposed approach for forecasting problem significantly improves the accuracy of elm in prediction of thermostable enzymes. elm tends to require more neurons in the hidden-layer than conventional tuning-based learning algorithms. to overcome these, the proposed approach uses a ga which optimizes the structure and the parameters of the elm. in summary, optimization of elm with ga results in an efficient prediction method; numerical experiments proved that our approach yields excellent results.
منابع مشابه
Prediction of Cardiovascular Diseases Using an Optimized Artificial Neural Network
Introduction: It is of utmost importance to predict cardiovascular diseases correctly. Therefore, it is necessary to utilize those models with a minimum error rate and maximum reliability. This study aimed to combine an artificial neural network with the genetic algorithm to assess patients with myocardial infarction and congestive heart failure. Materials & Methods: This study utilized a m...
متن کاملLarge amplitude vibration prediction of rectangular plates by an optimal artificial neural network (ANN)
In this paper, nonlinear equations of motion for laminated composite rectangular plates based on the first order shear deformation theory were derived. Using a perturbation method, the nonlinear equation of motion was solved and analytical relations were obtained for natural and nonlinear frequencies. After proving the validity of the obtained analytical relations, as an alternative and simple ...
متن کاملDrilling Stuck Pipe Prediction in Iranian Oil Fields: An Artificial Neural Network Approach
متن کامل
Signal Prediction by Layered Feed - Forward Neural Network (RESEARCH NOTE).
In this paper a nonparametric neural network (NN) technique for prediction of future values of a signal based on its past history is presented. This approach bypasses modeling, identification, and parameter estimation phases that are required by conventional parametric techniques. A multi-layer feed forward NN is employed. It develops an internal model of the signal through a training operation...
متن کاملAn Overview of the Protein Thermostability Prediction: Databases and Tools
Environmental temperature plays an important role in the cell life [1]. There are four classes of organism in relation to their optimal growth temperature namely hyperthermophile (>80◦C), thermophile (45-80◦C), mesophile (20-45◦C) and psychrophile (<20◦C) [2]. Thermal stability is defined as the ability of material to resist changes in physical structure or chemical irreversibility, or spatial ...
متن کاملPrediction of Energy Consumption in the First Line of Tehran Metro: GMDH Neural Network Approach
Today, energy and its consumption are the main strategic plan of organizations and also the development of urban transport systems by considering a variety of economic, scientific, industrial, climate and growing urbanization is essential. Analysis of past trends in energy is the key to predict future trends, with regard to the rate of development of metro, for planning and future-oriented macr...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
journal of health management and informaticsجلد ۳، شماره ۴، صفحات ۱۰۲-۰
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023