A circular code in the protein coding genes of mitochondria.
Abstract
A new maximal circular code X0(MIT), together with two permuted maximal circular codes X1(MIT) and X2(MIT), is identified in the protein coding genes of mitochondria. The three subsets of 20 trinucleotides, X0(MIT)={ACA, ACC, ATA, ATC, CTA, CTC, GAA, GAC, GAT, GCA, GCC, GCT, GGA, GGC, GGT, GTA, GTC, GTT, TTA, TTC}, X1(MIT) and X2(MIT), are found in frame 0 (the reading frame), frame 1 and frame 2 respectively. X1(MIT) and X2(MIT) are deduced from X0(MIT) by one and two circular permutations respectively. The code X0(MIT) has four important properties: a minimal window of five nucleotides is enough to retrieve frame 0 automatically; its occurrence probability is 6.3 × 10^-5; misplaced trinucleotides are rare in the shifted frames (12% on average); and all four nucleotides occur in the first and second trinucleotide sites, whereas G never occurs in the third site. Several biological consequences are presented in the Discussion.
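The relationship between the three codes can be sketched in a few lines of Python. This is only an illustration built from the set listed in the abstract: the helper names `permute` and `site_letters` are hypothetical, and the direction of the circular permutation (shifting each trinucleotide left, so that ACA becomes CAA) is an assumed convention that the abstract does not spell out.

```python
# X0(MIT), copied from the abstract.
X0 = set(
    "ACA ACC ATA ATC CTA CTC GAA GAC GAT GCA "
    "GCC GCT GGA GGC GGT GTA GTC GTT TTA TTC".split()
)


def permute(trinucleotide, k=1):
    """Circularly shift a trinucleotide left by k positions.

    The left-shift direction is an assumption made for this sketch.
    """
    k %= 3
    return trinucleotide[k:] + trinucleotide[:k]


def site_letters(code, site):
    """Return the set of nucleotides seen at a given site (0, 1 or 2)."""
    return {t[site] for t in code}


# One and two circular permutations of X0 give X1 and X2.
X1 = {permute(t, 1) for t in X0}
X2 = {permute(t, 2) for t in X0}

if __name__ == "__main__":
    print("X1:", sorted(X1))
    print("X2:", sorted(X2))
    for site in range(3):
        print("site", site + 1, "of X0 uses", sorted(site_letters(X0, site)))
```

Under this convention the three sets are pairwise disjoint and together contain 60 trinucleotides, that is, every trinucleotide except AAA, CCC, GGG and TTT. The site composition printed for X0 matches the fourth property stated in the abstract: the first and second sites use all four nucleotides, and the third site contains no G.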
Similar resources
Long non-coding RNAs and their significance in human diseases
Protein-coding genes account for only a small fraction of the human genome and most of the genomic sequences are transcriptionally silent, but recent observations indicate significant functional elements, including non-coding protein transcripts in the human genome. Long non-coding RNAs (lncRNAs) have been defined as transcripts of >200 nucleotides without protein-coding capacity that perform t...
Circular RNA: features, functions and their correlation with diseases especially cancer
In early 2012, the world of science saw a fascinating discovery called circular RNA as a transcription product of thousands of genes in mice and humans. These circular RNAs have recently been grouped as the encoding RNA in an independent group that their remarkable difference with other RNAs is that these RNAs are not linear, in which two ends connect with a covalent connection creating a loop-...
Identification of protein coding genes in genomes with statistical functions based on the circular code.
A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called 'Analysis of Coding Genes' (ACG) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software ...
Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...
The Maximal C3 Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses
In 1996, a set X of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set X has an interesting mathematical property as X is a maximal C 3 self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the ci...
Journal title:
- Journal of theoretical biology
Volume 189, issue 3
Pages: -
Publication date: 1997