The eukaryotic promoter database in its 30th year: focus on non-vertebrate organisms

نویسندگان

  • René Dreos
  • Giovanna Ambrosini
  • Romain Groux
  • Rouaïda Cavin Périer
  • Philipp Bucher
چکیده

We present an update of the Eukaryotic Promoter Database EPD (http://epd.vital-it.ch), more specifically on the EPDnew division, which contains comprehensive organisms-specific transcription start site (TSS) collections automatically derived from next generation sequencing (NGS) data. Thanks to the abundant release of new high-throughput transcript mapping data (CAGE, TSS-seq, GRO-cap) the database could be extended to plant and fungal species. We further report on the expansion of the mass genome annotation (MGA) repository containing promoter-relevant chromatin profiling data and on improvements for the EPD entry viewers. Finally, we present a new data access tool, ChIP-Extract, which enables computational biologists to extract diverse types of promoter-associated data in numerical table formats that are readily imported into statistical analysis platforms such as R.

منابع مشابه

EPD in its twentieth year: towards complete promoter coverage of selected model organisms

The Eukaryotic Promoter Database (EPD) is an annotated non-redundant collection of eukaryotic POL II promoters, experimentally defined by a transcription start site (TSS). Access to promoter sequences is provided by pointers to positions in the corresponding genomes. Promoter evidence comes from conventional TSS mapping experiments for individual genes, or, starting from release 73, from mass g...

متن کامل

The Eukaryotic Promoter Database: expansion of EPDnew and new promoter analysis tools

We present an update of EPDNew (http://epd.vital-it.ch), a recently introduced new part of the Eukaryotic Promoter Database (EPD) which has been described in more detail in a previous NAR Database Issue. EPD is an old database of experimentally characterized eukaryotic POL II promoters, which are conceptually defined as transcription initiation sites or regions. EPDnew is a collection of automa...

متن کامل

PromFD 1.0: a computer program that predicts eukaryotic pol II promoters using strings and IMD matrices

MOTIVATION A large number of new DNA sequences with virtually unknown functions are generated as the Human Genome Project progresses. Therefore, it is essential to develop computer algorithms that can predict the functionality of DNA segments according to their primary sequences, including algorithms that can predict promoters. Although several promoter-predicting algorithms are available, they...

متن کامل

Human Virome

Viruses are dominant entities in the biosphere and parasitize all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In marine, soil and animal-associated environments, virus particles consistently outnumber cells by one to two orders of magnitude. It is estimated that 10 quintillion (1030) viral particles ...

متن کامل

EPD and EPDnew, high-quality promoter resources in the next-generation sequencing era

The Eukaryotic Promoter Database (EPD), available online at http://epd.vital-it.ch, is a collection of experimentally defined eukaryotic POL II promoters which has been maintained for more than 25 years. A promoter is represented by a single position in the genome, typically the major transcription start site (TSS). EPD primarily serves biologists interested in analysing the motif content, chro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره 45  شماره 

صفحات  -

تاریخ انتشار 2017