Autoregressive Modeling of Coding Sequence Lengths in Bacterial Genome
نویسندگان
چکیده
Previous investigation of coding sequence lengths (CDS) in the bacterial circular chromosome revealed short range correlation in the series of these data. We have further analyzed the averaged periodograms of these series and we found that the organization of CDS can be well described by first order autoregressive processes. This involves interaction between the neighboring terms. The autoregressive analysis may have great potential in modeling various physical and biological processes like light emission of galaxies, protein organization, cell flickering, cognitive processes and perhaps others.
منابع مشابه
Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...
متن کاملReductive evolution of proteomes and protein structures.
The lengths of orthologous protein families in Eukarya are almost double the lengths found in Bacteria and Archaea. Here we examine protein structures in 745 genomes and show that protein length differences between superkingdoms arise as much shorter prokaryotic nondomain linker sequences. Eukaryotic, bacterial, and archaeal linkers are 250, 86, and 73 aa residues in length, respectively, where...
متن کاملAutoregressive Modeling and Feature Analysis of DNA Sequences
A parametric signal processing approach for DNA sequence analysis based on autoregressive (AR) modeling is presented. AR model residual errors and AR model parameters are used as features. The AR residual error analysis indicates a high specificity of coding DNA sequences, while AR feature-based analysis helps distinguish between coding and noncoding DNA sequences. An AR model-based string sear...
متن کاملUNIVERSITÄT AUGSBURG Predictive Modeling for Lossless Audio Compression
Autoregressive (AR) modeling by linear prediction (LP) provides the basis of a wide variety of signal processing and communication systems including parametric spectral estimation and system identification. Perhaps the greatest success of linear prediction techniques is to be found in speech analysis and audio coding. In this paper, we first reviewed the general frameworks of predictive signal ...
متن کاملBioinformatics Designing of 10-23 Deoxyribozyme against Coding Region of Beta-galactosidase Gene
Background: Deoxyribozymes (Dzs) can play a role as gene expression inhibitors at mRNA level. Among Dzs, the 10-23 deoxyribozyme has significant potentials for treatment of diseases. Designed Dz includes a catalytic core made of 15 deoxyribonucleotides and two binding arms consisted of 6-12 nucleotides for site specific binding to target RNA and hydrolysis. The enzyme has characteristic feature...
متن کامل