Protein Secondary Structure Prediction Using RT-RICO: A Rule-Based Approach
Abstract
Protein structure prediction has long been an important research area in biochemistry, and the prediction of protein secondary structure in particular is a well-studied topic. The experimental methods currently used to determine protein structure are accurate but costly in both equipment and time. Despite the breakthrough of combining multiple sequence alignment information with artificial intelligence algorithms, the Q3 accuracy of computational prediction methods has rarely exceeded 75%. This paper presents a newly developed rule-based data-mining approach called RT-RICO (Relaxed Threshold Rule Induction from Coverings). The method identifies dependencies between amino acids in a protein sequence and generates rules that can be used to predict secondary structure. RT-RICO achieved a Q3 score of 81.75% on the standard test dataset RS126 and 79.19% on the standard test dataset CB396, an improvement over comparable computational methods.
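For readers unfamiliar with the metric, Q3 is the percentage of residues whose three-state label (helix H, strand E, coil C) is predicted correctly. The sketch below shows only how such a score is computed; the function name `q3_score` and the example strings are illustrative assumptions and are not taken from the paper or from RT-RICO itself.

```python
def q3_score(predicted: str, observed: str) -> float:
    """Return the Q3 accuracy as a percentage.

    Both arguments are per-residue label strings over the three states
    H (helix), E (strand) and C (coil). The name and interface are
    hypothetical, chosen only for this illustration.
    """
    if len(predicted) != len(observed):
        raise ValueError("label strings must have the same length")
    if not observed:
        raise ValueError("label strings must be non-empty")
    # Count positions where the predicted state equals the observed state.
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


# Made-up example: 13 residues, one of them mislabelled.
observed = "CCHHHHHEEEECC"
predicted = "CCHHHHCEEEECC"
print(f"Q3 = {q3_score(predicted, observed):.2f}%")  # prints "Q3 = 92.31%"
```

A dataset-level score such as the ones quoted for RS126 and CB396 would typically be computed over all residues of all test proteins; how the paper aggregates across proteins is not stated in the abstract.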
Similar resources
Protein Secondary Structure Prediction Using Parallelized Rule Induction from Coverings
Protein 3D structure prediction has always been an important research area in bioinformatics. In particular, the prediction of secondary structure has been a well-studied research topic. Despite the recent breakthrough of combining multiple sequence alignment information and artificial intelligence algorithms to predict protein secondary structure, the Q3 accuracy of various computational predi...
Protein Secondary Structure Prediction: A Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
Protein Structure Prediction and Interpretation with Support Vector Machines and Decision Trees
Prediction of protein structures from protein sequences using computers is an important step to discover proteins' 3D conformation structures and their functions and hence has profound theoretical and practical significance in areas such as protein engineering and drug design. In this talk, we will discuss our new results in protein secondary structure and Transmembrane protein prediction using...
A Fugacity Approach for Prediction of Phase Equilibria of Methane Clathrate Hydrate in Structure H
In this communication, a thermodynamic model is presented to predict the dissociation conditions of structure H (sH) clathrate hydrates with methane as help gas. This approach is an extension of the Klauda and Sandler fugacity model (2000) for prediction of phase boundaries of sI and sII clathrate hydrates. The phase behavior of the water and hydrocarbon system is modeled using the Peng-Robinso...
Prediction of Secondary Structure of Citrus Viroids Reported from Southern Iran
Viroids are the smallest, single-stranded, circular, highly structured plant pathogenic RNAs that do not code for any protein. Viroids belong to two families, the Avsunviroidae and the Pospiviroidae. Members of the Pospiviroidae family adopt a rod-like secondary structure. In this study the most stable secondary structures of citrus viroid variants that reported from Fars province wer...