Protein Secondary Structure Prediction Accuracy versus Reduction Methods
نویسندگان
چکیده
Predicting protein secondary structure is a key step in determining the 3D structure of a protein that determines its function. The Dictionary of Secondary Structure of Proteins (DSSP) uses eight classes to describe a protein. The DSSP database is a database of secondary structure assignments for all protein entries in the Protein Data Bank (PDB) with an algorithm designed to standardize these secondary structure assignments. Five methods that reduce theses eight classes into the adopted three classes: alpha helices (H) beta strands (E), and coils (C) are implemented in this research. A protein secondary structure classifier (NN-GORV-II) has been used to evaluate the five reduction methods under the same hardware, platforms, and environment to allow stringent and reliable comparison of these methods and then arrive at a clear conclusion. This paper explains and discusses the effect of these reduction methods on the prediction accuracy and quality.
منابع مشابه
Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملEvaluation and improvement of multiple sequence methods for protein secondary structure prediction.
A new dataset of 396 protein domains is developed and used to evaluate the performance of the protein secondary structure prediction algorithms DSC, PHD, NNSSP, and PREDATOR. The maximum theoretical Q3 accuracy for combination of these methods is shown to be 78%. A simple consensus prediction on the 396 domains, with automatically generated multiple sequence alignments gives an average Q3 predi...
متن کاملSecondary Structure Prediction Based on Statistical Mechanics
Recent genome science has been developing from two sides: one is biological evolution due to mutation of DNA sequence and the other is biological function of protein molecules formed according to amino acid sequences. We have devoted ourselves to elucidate the mechanism of protein folding, and to develop the method for protein structure prediction [1, 2]. Our method has successfully yielded the...
متن کاملCONCORD: a consensus method for protein secondary structure prediction via mixed integer linear optimization
Most of the protein structure prediction methods use a multi-step process, which often includes secondary structure prediction, contact prediction, fragment generation, clustering, etc. For many years, secondary structure prediction has been the workhorse for numerous methods aimed at predicting protein structure and function. This paper presents a new mixed integer linear optimization (MILP)-b...
متن کاملReview: protein secondary structure prediction continues to rise.
Methods predicting protein secondary structure improved substantially in the 1990s through the use of evolutionary information taken from the divergence of proteins in the same structural family. Recently, the evolutionary information resulting from improved searches and larger databases has again boosted prediction accuracy by more than four percentage points to its current height of around 76...
متن کامل