MOROKOSHI: Transcriptome Database in Sorghum bicolor

نویسندگان

  • Yuko Makita
  • Setsuko Shimada
  • Mika Kawashima
  • Tomoko Kondou-Kuriyama
  • Tetsuro Toyoda
  • Minami Matsui
چکیده

In transcriptome analysis, accurate annotation of each transcriptional unit and its expression profile is essential. A full-length cDNA (FL-cDNA) collection facilitates the refinement of transcriptional annotation, and accurate transcription start sites help to unravel transcriptional regulation. We constructed a normalized FL-cDNA library from eight growth stages of aerial tissues in Sorghum bicolor and isolated 37,607 clones. These clones were Sanger sequenced from the 5' and/or 3' ends and in total 38,981 high-quality expressed sequence tags (ESTs) were obtained. About one-third of the transcripts of known genes were captured as FL-cDNA clone resources. In addition to these, we also annotated 272 novel genes, 323 antisense transcripts and 1,672 candidate isoforms. These clones are available from the RIKEN Bioresource Center. After obtaining accurate annotation of transcriptional units, we performed expression profile analysis. We carried out spikelet-, seed- and stem-specific RNA sequencing (RNA-Seq) analysis and confirmed the expression of 70.6% of the newly identified genes. We also downloaded 23 sorghum RNA-Seq samples that are publicly available and these are shown on a genome browser together with our original FL-cDNA and RNA-Seq data. Using our original and publicly available data, we made an expression profile of each gene and identified the top 20 genes with the most similar expression. In addition, we visualized their relationships in gene co-expression networks. Users can access and compare various transcriptome data from S, bicolor at http://sorghum.riken.jp.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

De novo transcriptome assembly of Sorghum bicolor variety Taejin

Sorghum (Sorghum bicolor), also known as great millet, is one of the most popular cultivated grass species in the world. Sorghum is frequently consumed as food for humans and animals as well as used for ethanol production. In this study, we conducted de novo transcriptome assembly for sorghum variety Taejin by next-generation sequencing, obtaining 8.748 GB of raw data. The raw data in this stud...

متن کامل

Functional and expression analyses of transcripts based on full-length cDNAs of Sorghum bicolor

Sorghum bicolor is one of the most important crops for food and bioethanol production. Its small diploid genome and resistance to environmental stress make sorghum an attractive model for studying the functional genomics of the Saccharinae and other C4 grasses. We analyzed the domain-based functional annotation of the cDNAs using the gene ontology (GO) categories for molecular function to chara...

متن کامل

Most photorespiratory genes are preferentially expressed in the bundle sheath cells of the C4 grass Sorghum bicolor.

One of the hallmarks of C4 plants is the division of labor between two different photosynthetic cell types, the mesophyll and the bundle sheath cells. C4 plants are of polyphyletic origin and, during the evolution of C4 photosynthesis, the expression of thousands of genes was altered and many genes acquired a cell type-specific or preferential expression pattern. Several lines of evidence, incl...

متن کامل

Transcriptome Characterization and Functional Marker Development in Sorghum Sudanense

Sudangrass, Sorghum sudanense, is an important forage in warm regions. But little is known about its genome. In this study, the transcriptomes of sudangrass S722 and sorghum Tx623B were sequenced by Illumina sequencing. More than 4Gb bases were sequenced for each library. For Tx623B and S722, 88.79% and 83.88% reads, respectively were matched to the Sorghum bicolor genome. A total of 2,397 diff...

متن کامل

DNA methylation and gene expression regulation associated with vascularization in Sorghum bicolor

Plant secondary cell walls constitute the majority of plant biomass. They are predominantly found in xylem cells, which are derived from vascular initials during vascularization. Little is known about these processes in grass species despite their emerging importance as biomass feedstocks. The targeted biofuel crop Sorghum bicolor has a sequenced and well-annotated genome, making it an ideal mo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 56  شماره 

صفحات  -

تاریخ انتشار 2015