Global probabilistic annotation of metabolic networks enables enzyme discovery
نویسندگان
چکیده
Annotation of organism-specific metabolic networks is one of the main challenges of systems biology. Importantly, owing to inherent uncertainty of computational annotations, predictions of biochemical function need to be treated probabilistically. We present a global probabilistic approach to annotate genome-scale metabolic networks that integrates sequence homology and context-based correlations under a single principled framework. The developed method for global biochemical reconstruction using sampling (GLOBUS) not only provides annotation probabilities for each functional assignment but also suggests likely alternative functions. GLOBUS is based on statistical Gibbs sampling of probable metabolic annotations and is able to make accurate functional assignments even in cases of remote sequence identity to known enzymes. We apply GLOBUS to genomes of Bacillus subtilis and Staphylococcus aureus and validate the method predictions by experimentally demonstrating the 6-phosphogluconolactonase activity of YkgB and the role of the Sps pathway for rhamnose biosynthesis in B. subtilis.
منابع مشابه
Metabolomic strategies for the identification of new enzyme functions and metabolic pathways
Recent technological advances in accurate mass spectrometry and data analysis have revolutionized metabolomics experimentation. Activity-based and global metabolomic profiling methods allow simultaneous and rapid screening of hundreds of metabolites from a variety of chemical classes, making them useful tools for the discovery of novel enzymatic activities and metabolic pathways. By using the m...
متن کاملDealing with Uncertainty in Lexical Annotation
We present ALA, a tool for the automatic lexical annotation (i.e. annotation w.r.t. a thesaurus/lexical resource) of structured and semi-structured data sources and the discovery of probabilistic lexical relationships in a data integration environment. ALA performs automatic lexical annotation through the use of probabilistic annotations, i.e. an annotation is associated to a probability value....
متن کاملReconstruction of metabolic pathways by combining probabilistic graphical model-based and knowledge-based methods
Automatic reconstruction of metabolic pathways for an organism from genomics and transcriptomics data has been a challenging and important problem in bioinformatics. Traditionally, known reference pathways can be mapped into an organism-specific ones based on its genome annotation and protein homology. However, this simple knowledge-based mapping method might produce incomplete pathways and gen...
متن کاملUse of a global metabolic network to curate organismal metabolic networks
The difficulty in annotating the vast amounts of biological information poses one of the greatest current challenges in biological research. The number of genomic, proteomic, and metabolomic datasets has increased dramatically over the last two decades, far outstripping the pace of curation efforts. Here, we tackle the challenge of curating metabolic network reconstructions. We predict organism...
متن کاملGlobal reconstruction of the human metabolic network based on genomic and bibliomic data.
Metabolism is a vital cellular process, and its malfunction is a major contributor to human disease. Metabolic networks are complex and highly interconnected, and thus systems-level computational approaches are required to elucidate and understand metabolic genotype-phenotype relationships. We have manually reconstructed the global human metabolic network based on Build 35 of the genome annotat...
متن کامل