Profile-Statistical Periodicity of DNA Coding Regions
نویسندگان
چکیده
Novel methods for identifying a new type of DNA latent periodicity, called latent profile periodicity or latent profility, are used to search for periodic structures in genes. These methods reveal two distinct levels of organization of genetic information encoding. It is shown that latent profility in genes may correlate with specific structural features of their encoded proteins.
منابع مشابه
تخمین مکان نواحی کدکننده پروتئین در توالی عددی DNA با استفاده پنجره با طول متغیر بر مبنای منحنی سه بعدی Z
In recent years, estimation of protein-coding regions in numerical deoxyribonucleic acid (DNA) sequences using signal processing tools has been a challenging issue in bioinformatics, owing to their 3-base periodicity. Several digital signal processing (DSP) tools have been applied in order to Identify the task and concentrated on assigning numerical values to the symbolic DNA sequence, then app...
متن کاملHeteroGenome: database of genome periodicity
We present the first release of the HeteroGenome database collecting latent periodicity regions in genomes. Tandem repeats and highly divergent tandem repeats along with the regions of a new type of periodicity, known as profile periodicity, have been collected for the genomes of Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans and Drosophila melanogaster. We obtained data...
متن کاملPrediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence.
With the exponential growth of genomic sequences, there is an increasing demand to accurately identify protein coding regions (exons) from genomic sequences. Despite many progresses being made in the identification of protein coding regions by computational methods during the last two decades, the performances and efficiencies of the prediction methods still need to be improved. In addition, it...
متن کاملPeriodicity in DNA primary structure is defined by secondary structure of the coded protein.
A 10.5-base periodicity found earlier is inherent in both eu- and prokaryotic coding nucleotide sequences. In the case of noncoding eukaryotic sequences no periodicity is found, so the 10.5-base oscillation seemingly does not correlate with the nucleosomal organization of DNA. It is shown that the DNA fragments, coding the alpha-helical protein segments, manifest the pronounced 10.5-base period...
متن کاملPrediction of probable genes by Fourier analysis of genomic sequences
MOTIVATION The major signal in coding regions of genomic sequences is a three-base periodicity. Our aim is to use Fourier techniques to analyse this periodicity, and thereby to develop a tool to recognize coding regions in genomic DNA. RESULT The three-base periodicity in the nucleotide arrangement is evidenced as a sharp peak at frequency f = 1/3 in the Fourier (or power) spectrum. From exte...
متن کامل