Transition Initiation Sites (TIS) Recognition in DNA Sequence using Machine Learning

نویسنده

  • Muhammad Hossain
چکیده

Transition Initiation Sites (TIS) prediction is a challenging problem in computational biology. In the literature TIS is predicted using various machine learning techniques such as Neural Network (NN), Support Vector Machine, etc. We have applied Principal Component Analysis (PCA) to remove highly correlated features which improves the performance in terms of time and accuracy. In this paper we have used Group Model of Data Handling (GMDH) based algorithm Abductive Network (AN) to predict TIS and got accuracy of 93%. KeywordsBioinformatics, Transition Initiation Sites (TIS), mRNA sequence, Machine Learning, Neural Network, Abductive Network, GMDH.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Eukaryotic Translation Initiation Sites Using Machine Learning

The computational identification of translation initiation sites (TIS) is a major component of every gene prediction system, and is thus of major importance in genome annotation projects. A large number of machine learning methods have been described to identify TIS in transcripts such as mRNA, EST and cDNA sequences. In this regard, most of the prediction methods have focused on recognizing TI...

متن کامل

Engineering support vector machine kernels that recognize translation initiation sites

Motivation: In order to extract protein sequences from nucleotide sequences, it is an important step to recognize points at which regions start that code for proteins. These points are called translation initiation sites (TIS). Results: The task of finding TIS can be modeled as a classification problem. We demonstrate the applicability of support vector machines for this task, and show how to i...

متن کامل

Engineering Support Vector Machine Kerneis That Recognize Translation Initialion Sites

MOTIVATION In order to extract protein sequences from nucleotide sequences, it is an important step to recognize points at which regions start that code for proteins. These points are called translation initiation sites (TIS). RESULTS The task of finding TIS can be modeled as a classification problem. We demonstrate the applicability of support vector machines for this task, and show how to i...

متن کامل

Quantitative analysis of mammalian translation initiation sites by FACS-seq

An approach combining fluorescence-activated cell sorting and high-throughput DNA sequencing (FACS-seq) was employed to determine the efficiency of start codon recognition for all possible translation initiation sites (TIS) utilizing AUG start codons. Using FACS-seq, we measured translation from a genetic reporter library representing all 65,536 possible TIS sequences spanning the -6 to +5 posi...

متن کامل

Initiation of mtDNA transcription is followed by pausing, and diverges across human cell types and during evolution.

Mitochondrial DNA (mtDNA) genes are long known to be cotranscribed in polycistrones, yet it remains impossible to study nascent mtDNA transcripts quantitatively in vivo using existing tools. To this end, we used deep sequencing (GRO-seq and PRO-seq) and analyzed nascent mtDNA-encoded RNA transcripts in diverse human cell lines and metazoan organisms. Surprisingly, accurate detection of human mt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012