Enhancing Protein Fold Prediction Accuracy Using Evolutionary and Structural Features
نویسندگان
چکیده
Protein fold recognition (PFR) is considered as an important step towards the protein structure prediction problem. It also provides crucial information about the functionality of the proteins. Despite all the efforts that have been made during the past two decades, finding an accurate and fast computational approach to solve PFR still remains a challenging problem for bioinformatics and computational biology. It has been shown that extracting features which contain significant local and global discriminatory information plays a key role in addressing this problem. In this study, we propose the concept of segmented-based feature extraction technique to provide local evolutionary information embedded in Position Specific Scoring Matrix (PSSM) and structural information embedded in the predicted secondary structure of proteins using SPINE-X. We also employ the concept of occurrence feature to extract global discriminatory information from PSSM and SPINE-X. By applying a Support Vector Machine (SVM) to our extracted features, we enhance the protein fold prediction accuracy to 7.4% over the best results reported in the literature.
منابع مشابه
Predicting Protein Solvent Accessibility with Sequence, Evolutionary Information and Context-based Features
Solvent-accessible surface areas of residues in proteins are key factors in protein folding. Predicting solvent accessibility from protein sequences is significant for modeling the structural and functional characteristics of many proteins. In this work, we introduce an approach of enhancing solvent accessibility prediction accuracy. We derive pseudo-potentials, by considering high-orderinter-r...
متن کاملPrediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...
متن کاملA Novel Fuzzy-Genetic Differential Evolutionary Algorithm for Optimization of A Fuzzy Expert Systems Applied to Heart Disease Prediction
This study presents a novel intelligent Fuzzy Genetic Differential Evolutionary model for the optimization of a fuzzy expert system applied to heart disease prediction in order to reduce the risk of heart disease. To this end, a fuzzy expert system has been proposed for the prediction of heart disease. The proposed model can be used as a tool to assist physicians. In order to: (1) tune the para...
متن کاملPFRES: protein fold classification by using evolutionary information and predicted secondary structure
MOTIVATION The number of protein families has been estimated to be as small as 1000. Recent study shows that the growth in discovery of novel structures that are deposited into PDB and the related rate of increase of SCOP categories are slowing down. This indicates that the protein structure space will be soon covered and thus we may be able to derive most of remaining structures by using the k...
متن کاملEnhancing Protein Fold Prediction Accuracy Using an Ensemble of Different Classifiers
Protein fold prediction problem is considered as a key point to protein structure recognition and structural discoveries. Recent advances in pattern recognition field brought a great interest to apply pattern classification techniques to tackle this problem. From the pattern recognition point of view, the protein fold prediction problem can be expressed as a multi-class classification task that...
متن کامل