The ambush hypothesis: hidden stop codons prevent off-frame gene reading.
Authors
Abstract
Coding sequences lack in-frame stop codons, but many stops appear off-frame. Off-frame stops (stops in the -1 and +1 shifted reading frames, termed hidden stops) terminate frame-shifted translation, potentially decreasing the energy and resources wasted on nonfunctional proteins. Benefits may include reduced waste elimination costs and avoidance of potentially cytotoxic frame-shifted products. Our "ambush" hypothesis suggests that hidden stops are sometimes selected for. Codons of many amino acids can contribute to hidden stops, depending on the synonymous position state and the adjacent codons. In vertebrate mitochondria, 31.75% of all amino acid combinations can form hidden stops. Codons with more potential to form hidden stops have greater usage frequency and are favored among synonymous codons. Among primates, predicted mitochondrial rRNA secondary structure stability correlates negatively with the number of hidden stops in the mitochondrial genome. The taxonomic distribution of genetic codes suggests that +1 frameshifts might be more frequent than -1 frameshifts. This is confirmed by analyses of primate mitochondrial genomes: species with unstable rRNAs have more +1 stops, but the correlation is weak for -1 stops. High hidden stop density seems to be an adaptation in species with slippage-prone ribosomes (unstable rRNAs). Hidden stops may thus compensate for reduced efficiency of some parts of the biosynthetic machinery. Some experimental data confirm our hypothesis: gene expression increases with the experimentally manipulated number of stops in the promoter region of a gene, suggesting biotechnological applications.
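To make the notion of a hidden stop concrete, the following is a minimal Python sketch that counts stop codons in the +1 and -1 shifted frames of a coding sequence. It is an illustration, not the authors' method. The function name `count_hidden_stops`, the constant names, and the toy sequence are hypothetical. The sketch assumes the +1 frame starts at the second nucleotide and the -1 frame at the third, and that the vertebrate mitochondrial code uses TAA, TAG, AGA and AGG as stops (NCBI translation table 2).

```python
# Stop codons of the standard genetic code.
STANDARD_STOPS = frozenset({"TAA", "TAG", "TGA"})
# Stop codons of the vertebrate mitochondrial code (assumed: NCBI table 2).
VERT_MITO_STOPS = frozenset({"TAA", "TAG", "AGA", "AGG"})


def count_hidden_stops(cds, stops=STANDARD_STOPS):
    """Count stop codons in the +1 and -1 frames of a coding sequence.

    Assumption for this sketch: `cds` starts in frame 0, the +1 frame
    begins at the second nucleotide (offset 1) and the -1 frame at the
    third nucleotide (offset 2). Incomplete trailing codons are ignored.
    """
    seq = cds.upper().replace("U", "T")  # accept RNA or DNA input
    counts = {}
    for label, offset in (("+1", 1), ("-1", 2)):
        counts[label] = sum(
            1
            for i in range(offset, len(seq) - 2, 3)
            if seq[i:i + 3] in stops
        )
    return counts


if __name__ == "__main__":
    toy_cds = "ATGTTAGGCTAA"  # hypothetical sequence: ATG TTA GGC TAA
    print(count_hidden_stops(toy_cds))
    print(count_hidden_stops(toy_cds, VERT_MITO_STOPS))
```

In the toy sequence, the +1 frame reads TGT TAG GCT and the -1 frame reads GTT AGG CTA, so AGG counts as a hidden stop only under the mitochondrial stop set. Dividing the counts by the number of codons would give a hidden stop density that can be compared across genes or genomes.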
Similar resources
Ambush hypothesis revisited: Evidences for phylogenetic trends
Recoding events occur in competition with standard readout of the transcript, and are site-specific. Recoding is the reprogramming of mRNA translation by localized alterations in the standard translational rules. Frame-shifting is one class of recoding and is defined as protein translation that starts not at the first, but at either the second (+1 frame-shift) or the third (-1 frame-shift) nucleot...
Long non-stop reading frames on the antisense strand of heat shock protein 70 genes and prion protein (PrP) genes are conserved between species.
Several mammalian genes, including heat shock protein (Hsp70) and prion protein (PrP) genes, have been reported to have long open reading frames (ORFs) or non-stop reading frames (NRFs) in the antisense direction. A simple explanation would be that these long antisense reading frames, which are usually in the same triplet frame as the coding strand, are the fortuitous byproduct of a high overal...
Origin of noncoding DNA sequences: molecular fossils of genome evolution.
The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. We propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approximately 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a geno...
A Segment-based Dynamic Programing Algorithm for Parsing Gene Structure (Running Head: Segment-based Dynamic Programming)
Note: This version is a preliminary draft. Comments and suggestions are welcome. Abstract Predicting gene structure requires search within a combinatorially large space of possible gene structures. The search space may be narrowed by two types of computational tools: optimality criteria and consistency constraints. Consistency constraints are requirements concerning reading frame and stop codon...
Origin of eukaryotic introns: a hypothesis, based on codon distribution statistics in genes, and its implications.
A hypothesis for the origin of introns in eukaryotic genes is developed. By computer simulation it was found that the reading-frame lengths in a random nucleotide sequence are distributed in a negative exponential manner and that there exists an upper limit of about 200 codons in the length of the reading frames (RFs). These characteristics suggest that, if primordial DNA contained a random nuc...
Journal title:
- DNA and cell biology
Volume 23, Issue 10
Pages -
Publication date 2004