Galaxy Integrated Omics: Web-based Standards-Compliant Workflows for Proteomics Informed by Transcriptomics*
نویسندگان
چکیده
With the recent advent of RNA-seq technology the proteomics community has begun to generate sample-specific protein databases for peptide and protein identification, an approach we call proteomics informed by transcriptomics (PIT). This approach has gained a lot of interest, particularly among researchers who work with nonmodel organisms or with particularly dynamic proteomes such as those observed in developmental biology and host-pathogen studies. PIT has been shown to improve coverage of known proteins, and to reveal potential novel gene products. However, many groups are impeded in their use of PIT by the complexity of the required data analysis. Necessarily, this analysis requires complex integration of a number of different software tools from at least two different communities, and because PIT has a range of biological applications a single software pipeline is not suitable for all use cases. To overcome these problems, we have created GIO, a software system that uses the well-established Galaxy platform to make PIT analysis available to the typical bench scientist via a simple web interface. Within GIO we provide workflows for four common use cases: a standard search against a reference proteome; PIT protein identification without a reference genome; PIT protein identification using a genome guide; and PIT genome annotation. These workflows comprise individual tools that can be reconfigured and rearranged within the web interface to create new workflows to support additional use cases.
منابع مشابه
Paintomics: a web based tool for the joint visualization of transcriptomics and metabolomics data
MOTIVATION The development of the omics technologies such as transcriptomics, proteomics and metabolomics has made possible the realization of systems biology studies where biological systems are interrogated at different levels of biochemical activity (gene expression, protein activity and/or metabolite concentration). An effective approach to the analysis of these complex datasets is the join...
متن کاملPrecision Medicine: A New Revolution in Healthcare System
Every human being is different based on genetics, lifestyle, and environmental factors. Novel medical technologies have become more precise owing to molecular information, including genomics, transcriptomics, proteomics, metabolomics, etc. The “omics” technologies have opened up new horizons for healthcare systems, enabling them to prevent and/or diagnose diseases more precisel...
متن کاملThe Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud
The Taverna workflow tool suite (http://www.taverna.org.uk) is designed to combine distributed Web Services and/or local tools into complex analysis pipelines. These pipelines can be executed on local desktop machines or through larger infrastructure (such as supercomputers, Grids or cloud environments), using the Taverna Server. In bioinformatics, Taverna workflows are typically used in the ar...
متن کاملEstablishing reporting standards for metabolomic and metabonomic studies: a call for participation.
Metabolite concentrations in cellular systems are very much dependent on the physiological, environmental, and genetic status of an organism and are regarded as the ultimate result of cellular regulation, resulting in the visible phenotypes. Therefore, the comprehensive analysis of metabolite levels and fluxes renders a suitable tool for assessing the degree of perturbation in biological system...
متن کاملIntegrated pathway-level analysis of transcriptomics and metabolomics data with IMPaLA
SUMMARY Pathway-level analysis is a powerful approach enabling interpretation of post-genomic data at a higher level than that of individual biomolecules. Yet, it is currently hard to integrate more than one type of omics data in such an approach. Here, we present a web tool 'IMPaLA' for the joint pathway analysis of transcriptomics or proteomics and metabolomics data. It performs over-represen...
متن کامل