Sequence-structure relations of biopolymers

نویسندگان

  • Christopher L. Barrett
  • Fenix W. D. Huang
  • Christian M. Reidys
چکیده

Motivation DNA data is transcribed into single-stranded RNA, which folds into specific molecular structures. In this paper we pose the question to what extent sequence- and structure-information correlate. We view this correlation as structural semantics of sequence data that allows for a different interpretation than conventional sequence alignment. Structural semantics could enable us to identify more general embedded ‘patterns’ in DNA and RNA sequences. Results We compute the partition function of sequences with respect to a fixed structure and connect this computation to the mutual information of a sequence–structure pair for RNA secondary structures. We present a Boltzmann sampler and obtain the a priori probability of specific sequence patterns. We present a detailed analysis for the three PDB-structures, 2JXV (hairpin), 2N3R (3-branch multi-loop) and 1EHZ (tRNA). We localize specific sequence patterns, contrast the energy spectrum of the Boltzmann sampled sequences versus those sequences that refold into the same structure and derive a criterion to identify native structures. We illustrate that there are multiple sequences in the partition function of a fixed structure, each having nearly the same mutual information, that are nevertheless poorly aligned. This indicates the possibility of the existence of relevant patterns embedded in the sequences that are not discoverable using alignments. Availability and Implementation The source code is freely available at http://staff.vbi.vt.edu/fenixh/Sampler.zip Contact [email protected] Supplimentary Information Supplementary data are available at Bioinformatics online.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rhetorical Structure Analysis of EFLs’ Written Narratives of a Picture Story

This study was set to reveal how second language learners use rhetorical relations in their written narratives in terms of Rhetorical Structure Theory (RST) primarily proposed by Mann & Thompson (1987) and developed by Mann, Matthiessen & Thompson (1992). To this end, sixty written narratives based on the picture story book ‘Frog, where are you?’ were collected from EFL learners and were put to...

متن کامل

New Relations in the Edge Ideal Metrics Family for Biopolymers

Some new relations in the family of edge ideal metrics for biopolymers are proved. Also, some statistical relations among the metrics are shown using a big synthetic contact structure set; for unary contact structures, the relations are linear with a high significance.

متن کامل

Relation Between RNA Sequences, Structures, and Shapes via Variation Networks

Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...

متن کامل

Generic Properties of the Sequence-structure Relations of Biopolymers

Biological evolution is a highly sophisticated dynamical phenomenon, and its complexity is often confusing. For the purpose of analysis it may be partitioned into the four partial processes depicted in gure 1 25]: population dynamics, (population) support dynamics, genotype-phenotype mapping, and tness evaluation. These components are properly visualized as map-pings between abstract metric spa...

متن کامل

Physicochemical and Immunomodulatory Properties of Gum Exudates Obtained from Astragalus myriacanthus and Some of Its Isolated Carbohydrate Biopolymers

Plants gums are complex mixtures of different polysaccharides with a variety of biological activities and pharmaceutical applications. Few studies have focused on physicochemical and biological properties of gums obtained from different plants. This study was designed to determine potential pharmaceutical and pharmacological values of the gum exudates and its isolated biopolymers obtained from ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 33 3  شماره 

صفحات  -

تاریخ انتشار 2016