Reliable prediction of Drosha processing sites improves microRNA gene prediction

نویسندگان

  • Snorre A. Helvik
  • Ola R. Snøve
  • Pål Sætrom
چکیده

MOTIVATION Mature microRNAs (miRNAs) are processed from long hairpin transcripts. Even though it is only the first of several steps, the initial Drosha processing defines the mature product and is characteristic for all miRNA genes. Methods that can separate between true and false processing sites are therefore essential to miRNA gene discovery. RESULTS We present a classifier that predicts 5' Drosha processing sites in hairpins that are candidate miRNAs. The classifier, called Microprocessor SVM, correctly predicts the processing site for 50% of known human 5' miRNAs, and 90% of its predictions are within two nucleotides of the true site. Another classifier that is trained on the output from the Microprocessor SVM outperforms existing methods for prediction of unconserved miRNAs. Reanalysis of characteristics and supporting evidence for a set of newly annotated miRNAs shows that some miRNAs may be misannotated. This suggests that expressed hairpins should not be annotated as miRNAs until they are verified to be Drosha and Dicer substrates. AVAILABILITY The classifiers are publicly available at https://demo1.interagon.com/miRNA/

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MiRmat: Mature microRNA Sequence Prediction

BACKGROUND MicroRNAs are known to be generated from primary transcripts mainly through the sequential cleavages by two enzymes, Drosha and Dicer. The sequence of a mature microRNA, especially the 'seeding sequence', largely determines its binding ability and specificity to target mRNAs. Therefore, methods that predict mature microRNA sequences with high accuracy will benefit the identification ...

متن کامل

Recognition and cleavage of primary microRNA precursors by the nuclear processing enzyme Drosha.

A critical step during human microRNA maturation is the processing of the primary microRNA transcript by the nuclear RNaseIII enzyme Drosha to generate the approximately 60-nucleotide precursor microRNA hairpin. How Drosha recognizes primary RNA substrates and selects its cleavage sites has remained a mystery, especially given that the known targets for Drosha processing show no discernable seq...

متن کامل

Structure of Human DROSHA

MicroRNA maturation is initiated by RNase III DROSHA that cleaves the stem loop of primary microRNA. DROSHA functions together with its cofactor DGCR8 in a heterotrimeric complex known as Microprocessor. Here, we report the X-ray structure of DROSHA in complex with the C-terminal helix of DGCR8. We find that DROSHA contains two DGCR8-binding sites, one on each RNase III domain (RIIID), which me...

متن کامل

New support vector machine-based method for microRNA target prediction.

MicroRNA (miRNA) plays important roles in cell differentiation, proliferation, growth, mobility, and apoptosis. An accurate list of precise target genes is necessary in order to fully understand the importance of miRNAs in animal development and disease. Several computational methods have been proposed for miRNA target-gene identification. However, these methods still have limitations with resp...

متن کامل

Microprocessor dynamics and interactions at endogenous imprinted C19MC microRNA genes.

Nuclear primary microRNA (pri-miRNA) processing catalyzed by the DGCR8-Drosha (Microprocessor) complex is highly regulated. Little is known, however, about how microRNA biogenesis is spatially organized within the mammalian nucleus. Here, we image for the first time, in living cells and at the level of a single microRNA cluster, the intranuclear distribution of untagged, endogenously-expressed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 23 2  شماره 

صفحات  -

تاریخ انتشار 2007