PICKLE 2.0: A human protein-protein interaction meta-database employing data integration via genetic information ontology
Abstract
Source databases that record experimentally supported human protein-protein interactions (PPIs) are known to exhibit limited overlap. Thus, reconstructing a comprehensive PPI network requires appropriate integration of multiple heterogeneous primary datasets, which present the PPIs at various genetic reference levels. Existing PPI meta-databases perform integration via normalization; namely, PPIs are merged after being converted to a certain target level. Hence, the node set of the integrated network depends each time on the number and type of the combined datasets. Moreover, the irreversible a priori normalization process hinders the identification of normalization artifacts in the integrated network, which originate from the nonlinearity characterizing the genetic information flow. PICKLE (Protein InteraCtion KnowLedgebasE) 2.0 implements a new architecture for this recently introduced human PPI meta-database. Its main novel feature over the existing meta-databases is its approach to primary PPI dataset integration via genetic information ontology. PICKLE 2.0 builds upon the PICKLE principles of using the reviewed human complete proteome (RHCP) of UniProtKB/Swiss-Prot as the reference protein interactor set and of filtering out protein interactions that have a low probability of being direct based on the available evidence. It first assembles the RHCP genetic information ontology network by connecting the corresponding genes, nucleotide sequences (mRNAs) and proteins (UniProt entries), and then integrates PPI datasets by superimposing them on the ontology network without any a priori transformations. Importantly, this process allows the resulting heterogeneous integrated network to be reversibly normalized to any level of genetic reference without loss of the original information, which is used to identify normalization biases, and it enables the appraisal of potential false positive interactions through PPI source database cross-checking.
The PICKLE web-based interface (www.pickle.gr) allows for the simultaneous query of multiple entities and provides integrated human PPI networks at either the protein (UniProt) or the gene level, in three PPI filtering modes.
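The integration idea described in the abstract can be illustrated with a minimal Python sketch. This is not PICKLE's implementation: the identifiers (`geneA`, `mrnaB1`, `protB2`, `db1`, and so on), the data layout, and the function names `project` and `normalize` are all invented for illustration. The sketch stores each primary PPI record exactly as reported, at whatever genetic level the source used, and derives a protein-level or gene-level network on demand while keeping the supporting records attached to every normalized pair.

```python
from itertools import product

# Toy genetic information ontology. All identifiers are invented.
GENE_TO_MRNAS = {
    "geneA": ["mrnaA1"],
    "geneB": ["mrnaB1", "mrnaB2"],  # one gene linked to two protein products
    "geneC": ["mrnaC1"],
}
MRNA_TO_PROTEIN = {
    "mrnaA1": "protA",
    "mrnaB1": "protB1",
    "mrnaB2": "protB2",
    "mrnaC1": "protC",
}

# Reverse lookups derived from the two maps above.
MRNA_TO_GENE = {m: g for g, ms in GENE_TO_MRNAS.items() for m in ms}
PROTEIN_TO_GENE = {p: MRNA_TO_GENE[m] for m, p in MRNA_TO_PROTEIN.items()}


def project(entity, level):
    """Return the set of ontology nodes at `level` linked to `entity`."""
    if level not in ("gene", "protein"):
        raise ValueError(f"unsupported level: {level}")
    if entity in GENE_TO_MRNAS:
        gene = entity
    elif entity in MRNA_TO_GENE:
        gene = MRNA_TO_GENE[entity]
    else:
        gene = PROTEIN_TO_GENE[entity]  # KeyError for unknown entities
    if level == "gene":
        return {gene}
    if entity in PROTEIN_TO_GENE:
        return {entity}
    if entity in MRNA_TO_PROTEIN:
        return {MRNA_TO_PROTEIN[entity]}
    # A gene may expand to several proteins (one-to-many mapping).
    return {MRNA_TO_PROTEIN[m] for m in GENE_TO_MRNAS[entity]}


def normalize(records, level):
    """Map each pair at `level` to the primary records that support it."""
    network = {}
    for record in records:
        a, b, _source = record
        for x, y in product(project(a, level), project(b, level)):
            network.setdefault(frozenset((x, y)), []).append(record)
    return network


# Primary records kept as reported: (interactor, interactor, source database).
RECORDS = [
    ("protA", "protB1", "db1"),  # reported at the protein level
    ("geneA", "geneB", "db2"),   # reported at the gene level
    ("mrnaB2", "protC", "db3"),  # reported at mixed levels
]

protein_net = normalize(RECORDS, "protein")
gene_net = normalize(RECORDS, "gene")

for pair, support in protein_net.items():
    print(sorted(pair), [source for _, _, source in support])
```

Because the primary records are never overwritten, the same data can be normalized to either level at any time. In this toy case the pair `protA`/`protB2` is supported only by the gene-level record from `db2`, which expanded to two proteins, so it can be flagged as a possible normalization artifact, while the pair `protA`/`protB1` is supported by two independent sources.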
Related articles
Identification and prioritization genes related to Hypercholesterolemia QTLs using gene ontology and protein interaction networks
Gene identification represents the first step to a better understanding of the physiological role of the underlying protein and disease pathways, which in turn serves as a starting point for developing therapeutic interventions. Familial hypercholesterolemia is a hereditary metabolic disorder characterized by high low-density lipoprotein cholesterol levels. Hypercholesterolemia is a quantitativ...
Using the Protein-protein Interaction Network to Identifying the Biomarkers in Evolution of the Oocyte
Background Oocyte maturity includes nuclear and cytoplasmic maturity, both of which are important for embryo fertilization. The development of oocyte is not limited to the period of follicular growth, and starts from the embryonic period and continues throughout life. In this study, for the purpose of evaluating the effect of the FSH hormone on the expression of genes, GEO access codes for this...
Human Protein Meta-Interaction Database (HPMD) Potentiates Integration for Meta-Analysis
Difficulties in integration of biological databases have been a long-standing issue [1]. Problems in combining databases include different identification schemes, redundancies, and varying levels of information cataloged within each database. This work seeks to address these concerns and design a corresponding implementation. Analysis via Monte-Carlo samples the shortest distances between pairs ...
Study of Human Albumin Protein Interaction with Fluorouracil Anticancer Drug Using Molecular Docking Method
Introduction: Drugs are mainly delivered to the target tissues by plasma proteins, such as human serum albumin, in the human body. Practical information about the thermodynamic parameters of drugs and their stability can be obtained using simulation methods, such as molecular docking. Material & Methods: This study investigated the molecular docking of human serum albumin with fluorouracil an...
HINO: BFO-aligned ontology representation of human molecular interactions and pathways
Reactome is a manually-curated, peer-reviewed knowledgebase of human pathways and processes. As a standard pathway data exchange format, BioPAX represents individual interactors (e.g., a protein), interactions, and pathways as instances instead of classes. This study aims to represent various human interaction pathways and networks as classes via a formal ontology aligned with the Basic Formal ...