prediction of protein thermostability by an efficient neural network approach

نویسندگان

jalal rezaeenour

mansoureh yari eili

zahra roozbahani

mansour ebrahimi

چکیده

introduction: manipulation of protein stability is important for understanding the principles that govern protein thermostability, both in basic research and industrial applications. various data mining techniques exist for prediction of thermostable proteins. furthermore, ann methods have attracted significant attention for prediction of thermostability, because they constitute an appropriate approach to mapping the non-linear input-output relationships and massive parallel computing. method: an extreme learning machine (elm) was applied to estimate thermal behavior of 1289 proteins. in the proposed algorithm, the parameters of elm were optimized using a genetic algorithm (ga), which tuned a set of input variables, hidden layer biases, and input weights, to and enhance the prediction performance. the method was executed on a set of amino acids, yielding a total of 613 protein features. a number of feature selection algorithms were used to build subsets of the features. a total of 1289 protein samples and 613 protein features were calculated from uniprot database to understand features contributing to the enzymes’ thermostability and find out the main features that influence this valuable characteristic. results:at the primary structure level, gln, glu and polar were the features that mostly contributed to protein thermostability. at the secondary structure level, helix_s, coil, and charged_coil were the most important features affecting protein thermostability. these results suggest that the thermostability of proteins is mainly associated with primary structural features of the protein. according to the results, the influence of primary structure on the thermostabilty of a protein was more important than that of the secondary structure. it is shown that prediction accuracy of elm (mean square error) can improve dramatically using ga with error rates rmse=0.004 and mape=0.1003. conclusion: the proposed approach for forecasting problem significantly improves the accuracy of elm in prediction of thermostable enzymes. elm tends to require more neurons in the hidden-layer than conventional tuning-based learning algorithms. to overcome these, the proposed approach uses a ga which optimizes the structure and the parameters of the elm. in summary, optimization of elm with ga results in an efficient prediction method; numerical experiments proved that our approach yields excellent results.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Cardiovascular Diseases Using an Optimized Artificial Neural Network

Introduction:  It is of utmost importance to predict cardiovascular diseases correctly. Therefore, it is necessary to utilize those models with a minimum error rate and maximum reliability. This study aimed to combine an artificial neural network with the genetic algorithm to assess patients with myocardial infarction and congestive heart failure.   Materials & Methods: This study utilized a m...

متن کامل

Large amplitude vibration prediction of rectangular plates by an optimal artificial neural network (ANN)

In this paper, nonlinear equations of motion for laminated composite rectangular plates based on the first order shear deformation theory were derived. Using a perturbation method, the nonlinear equation of motion was solved and analytical relations were obtained for natural and nonlinear frequencies. After proving the validity of the obtained analytical relations, as an alternative and simple ...

متن کامل

Signal Prediction by Layered Feed - Forward Neural Network (RESEARCH NOTE).

In this paper a nonparametric neural network (NN) technique for prediction of future values of a signal based on its past history is presented. This approach bypasses modeling, identification, and parameter estimation phases that are required by conventional parametric techniques. A multi-layer feed forward NN is employed. It develops an internal model of the signal through a training operation...

متن کامل

An Overview of the Protein Thermostability Prediction: Databases and Tools

Environmental temperature plays an important role in the cell life [1]. There are four classes of organism in relation to their optimal growth temperature namely hyperthermophile (>80◦C), thermophile (45-80◦C), mesophile (20-45◦C) and psychrophile (<20◦C) [2]. Thermal stability is defined as the ability of material to resist changes in physical structure or chemical irreversibility, or spatial ...

متن کامل

Prediction of Energy Consumption in the First Line of Tehran Metro: GMDH Neural Network Approach

Today, energy and its consumption are the main strategic plan of organizations and also the development of urban transport systems by considering a variety of economic, scientific, industrial, climate and growing urbanization is essential. Analysis of past trends in energy is the key to predict future trends, with regard to the rate of development of metro, for planning and future-oriented macr...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
journal of health management and informatics

جلد ۳، شماره ۴، صفحات ۱۰۲-۰

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023