Valid Plane Trees: Combinatorial Models for RNA Secondary Structures with Watson-Crick Base Pairs

نویسندگان

  • Francis Black
  • Elizabeth Drellich
  • Julianna Tymoczko
چکیده

The combinatorics of RNA plays a central role in biology. Mathematical biologists have several commonly-used models for RNA: words in a fixed alphabet (representing the primary sequence of nucleotides) and plane trees (representing the secondary structure, or folding of the RNA sequence). This paper considers an augmented version of the standard model of plane trees, one that incorporates some observed constraints on how the folding can occur. In particular we assume the alphabet consists of complementary pairs, for instance the Watson-Crick pairs A-U and C-G of RNA. Given a word in the alphabet, a valid plane tree is a tree for which, when the word is folded around the tree, each edge matchs two complementary letters. Consider the graph whose vertices are valid plane trees for a fixed word and whose edges are given by Condon, Heitsch, and Hoos’s local moves. We prove this graph is connected. We give an explicit algorithm to construct a valid plane tree from a primary sequence, assuming that at least one valid plane tree exists. The tree produced by our algorithm has other useful characterizations, including a uniqueness condition defined by local moves. We also study enumerative properties of valid plane trees, analyzing how the number of valid plane trees depends on the choice of sequence length and alphabet size. Finally we show that words with at least one valid plane tree are unusual, in the sense that the proportion of words with at least one valid plane tree goes to zero as the word size increases. Open questions and conjectures are given throughout the document.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-Watson-Crick base pairs in RNA-protein recognition.

The cellular functions of most RNA molecules involve protein binding, and non-Watson-Crick base pairs are hallmark sites for interactions with proteins. The determination of three-dimensional structures of RNA-peptide and RNA-protein complexes reveals the molecular basis of non-Watson-Crick base-pair recognition.

متن کامل

Modified Amber Force Field Correctly Models the Conformational Preference for Tandem GA pairs in RNA

Molecular mechanics with all-atom models was used to understand the conformational preference of tandem guanine-adenine (GA) noncanonical pairs in RNA. These tandem GA pairs play important roles in determining stability, flexibility, and structural dynamics of RNA tertiary structures. Previous solution structures showed that these tandem GA pairs adopt either imino (cis Watson-Crick/Watson-Cric...

متن کامل

Geometric nomenclature and classification of RNA base pairs.

Non-Watson-Crick base pairs mediate specific interactions responsible for RNA-RNA self-assembly and RNA-protein recognition. An unambiguous and descriptive nomenclature with well-defined and nonoverlapping parameters is needed to communicate concisely structural information about RNA base pairs. The definitions should reflect underlying molecular structures and interactions and, thus, facilitat...

متن کامل

Watson-Crick pairing, the Heisenberg group and Milnor invariants.

We study the secondary structure of RNA determined by Watson-Crick pairing without pseudo-knots using Milnor invariants of links. We focus on the first non-trivial invariant, which we call the Heisenberg invariant. The Heisenberg invariant, which is an integer, can be interpreted in terms of the Heisenberg group as well as in terms of lattice paths. We show that the Heisenberg invariant gives a...

متن کامل

Breaking pseudo-twofold symmetry in the poliovirus 3'-UTR Y-stem by restoring Watson-Crick base pairs.

The previously described NMR structure of a 5'-CU-3'/5'-UU-3' motif, which is highly conserved within the 3'-UTR Y-stem of poliovirus-like enteroviruses, revealed striking regularities of the local helix geometry, thus retaining the pseudo-twofold symmetry of the RNA helix. A mutant virus with both pyrimidine base pairs changed into Watson-Crick replicated as wild type, indicating the functiona...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • SIAM J. Discrete Math.

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2017