Protein Secondary Structure Prediction Using Parallelized Rule Induction from Coverings

نویسندگان

  • Leong Lee
  • Cyriac Kandoth
  • L. Leopold
  • Ronald L. Frank
چکیده

Protein 3D structure prediction has always been an important research area in bioinformatics. In particular, the prediction of secondary structure has been a well-studied research topic. Despite the recent breakthrough of combining multiple sequence alignment information and artificial intelligence algorithms to predict protein secondary structure, the Q3 accuracy of various computational prediction algorithms rarely has exceeded 75%. In a previous paper [1], this research team presented a rule-based method called RT-RICO (Relaxed Threshold Rule Induction from Coverings) to predict protein secondary structure. The average Q3 accuracy on the sample datasets using RT-RICO was 80.3%, an improvement over comparable computational methods. Although this demonstrated that RT-RICO might be a promising approach for predicting secondary structure, the algorithm’s computational complexity and program running time limited its use. Herein a parallelized implementation of a slightly modified RT-RICO approach is presented. This new version of the algorithm facilitated the testing of a much larger dataset of 396 protein domains [2]. Parallelized RTRICO achieved a Q3 score of 74.6%, which is higher than the consensus prediction accuracy of 72.9% that was achieved for the same test dataset by a combination of four secondary structure prediction methods [2]. Keywords—data mining, protein secondary structure prediction, parallelization.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein Secondary Structure Prediction Using RT-RICO: A Rule-Based Approach

Protein structure prediction has always been an important research area in biochemistry. In particular, the prediction of protein secondary structure has been a well-studied research topic. The experimental methods currently used to determine protein structure are accurate, yet costly both in terms of equipment and time. Despite the recent breakthrough of combining multiple sequence alignment i...

متن کامل

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

FTIR Investigation of Secondary Structure of Reteplase Inclusion Bodies Produced in Escherichia coli in Terms of Urea Concentration

Recent studies suggest that reducing the induction temperature would improve the quality of some recombinant inclusion bodies (IB) by providing a native-like secondary structure and leading to an improvement in protein recovery. This study focused on optimizing the solubilization condition of Reteplase, a recombinant protein with 9 disulfide bonds. The influence of lowering induction temperatur...

متن کامل

FTIR Investigation of Secondary Structure of Reteplase Inclusion Bodies Produced in Escherichia coli in Terms of Urea Concentration

Recent studies suggest that reducing the induction temperature would improve the quality of some recombinant inclusion bodies (IB) by providing a native-like secondary structure and leading to an improvement in protein recovery. This study focused on optimizing the solubilization condition of Reteplase, a recombinant protein with 9 disulfide bonds. The influence of lowering induction temperatur...

متن کامل

Protein Structure Prediction and Interpretation with Support Vector Machines and Decision Trees

Prediction of protein structures from protein sequences using computers is an important step to discover proteins' 3D conformation structures and their functions and hence has profound theoretical and practical significance in areas such as protein engineering and drug design. In this talk, we will discuss our new results in protein secondary structure and Transmembrane protein prediction using...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012