A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.
نویسندگان
چکیده
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5' proximal-intron-minus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N1-methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N1-methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC.
منابع مشابه
Can Cenik 1 A Common Class of Transcripts with 5 ’ - Intron Depletion , Distinct Early Coding 1 Sequence Features , and N 1 - Methyladenosine Modification
1 Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA 7 2 Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical 8 School, Worcester, MA 01605, USA 9 3 Donnelly Centre and Departments of Molecular Genetics and Computer Science, University of Toronto 10 and Lunenfeld-Tanenbaum Research Institute, Mt Sinai Hospital, Toronto M...
متن کاملFunctions of the Heterologous Intron-Derived Fragments Intra and Extra Factor IX-cDNA Coding Region on the Human Factor IX Expression in HepG2 and Hek-293T Cells
Background: Human FIX (hFIX) gene transfer into hepatocytes has provided a novel approach for treatment of hemophilia B. To obtain an improved expression of hFIX, the functional hFIX-expressing plasmids with appropriate intron-derived fragments which facilitate transcription and promote an efficient 3′-end formation of mRNAs are required.Objectives: We ai...
متن کاملBase-Resolution Mapping Reveals Distinct m1A Methylome in Nuclear- and Mitochondrial-Encoded Transcripts.
Gene expression can be post-transcriptionally regulated via dynamic and reversible RNA modifications. N1-methyladenosine (m1A) is a recently identified mRNA modification; however, little is known about its precise location and biogenesis. Here, we develop a base-resolution m1A profiling method, based on m1A-induced misincorporation during reverse transcription, and report distinct classes of m1...
متن کاملStrand‐specific, high‐resolution mapping of modified RNA polymerase II
Reversible modification of the RNAPII C-terminal domain links transcription with RNA processing and surveillance activities. To better understand this, we mapped the location of RNAPII carrying the five types of CTD phosphorylation on the RNA transcript, providing strand-specific, nucleotide-resolution information, and we used a machine learning-based approach to define RNAPII states. This reve...
متن کاملPerturbation of m6A writers reveals two distinct classes of mRNA methylation at internal and 5' sites.
N6-methyladenosine (m6A) is a common modification of mRNA with potential roles in fine-tuning the RNA life cycle. Here, we identify a dense network of proteins interacting with METTL3, a component of the methyltransferase complex, and show that three of them (WTAP, METTL14, and KIAA1429) are required for methylation. Monitoring m6A levels upon WTAP depletion allowed the definition of accurate a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- RNA
دوره 23 3 شماره
صفحات -
تاریخ انتشار 2017