Subproject 4: Bioinformatics of Glycan Expression
Abstract
Glycomics is an emerging discipline that still lags behind proteomics in the development of the bioinformatic tools required to track and process vast amounts of raw data, make the data accessible to scientists with diverse backgrounds, and deduce important but non-obvious data relationships that can be interpreted in the context of developmental and pathological states of human cells. Our approach to this challenge is to develop an integrated data system that includes workflow protocols and tools for tracking experimental samples and processes, data processing tools that extract relevant information from the raw data, database schemas for storing the resulting data, and ontological tools that facilitate access to the information and reveal systematic relationships within the data collected here, among diverse data distributed in databases throughout the world, and within the domain knowledge of the ontology itself. The basic design of this system has been developed, placing the highest priority on the interoperability of its component parts. To this end, we are currently developing two ontologies, GlycO and ProPreO. GlycO incorporates knowledge of glycan structure, function, biosynthesis, and metabolism. ProPreO incorporates knowledge of proteomic analysis and the resulting experimental data. These ontologies thus describe fundamental relationships between glycomics concepts and their association with experimental data, allowing individual elements of the data to be classified and viewed in the overall context of the biological/biochemical system. These ontologies will serve as the glue that ties the components of our bioinformatics system together and as the semantic basis for a portal that we will develop to facilitate data access and to reveal relationships within the data. A key component of this portal will be a graphical browsing and querying interface, which we are developing.
The highly integrated nature of our bioinformatics system for glycomics is a prerequisite for its optimal functionality, with each component being designed such that its format and content are consistent with the GlycO and ProPreO ontologies.