Transmembrane proteins in Protein Data Bank: identification and classification
نویسندگان
چکیده
Motivation: Integral membrane proteins play important roles in living cells. Although these proteins are estimated to constitute around 25% of proteins at a genomic scale, the Protein Data Bank (PDB) contains only a few hundred membrane proteins due to the difficulties with experimental techniques. The presence of transmembrane proteins in the structure data bank, however, is quite invisible, as the annotation of these entries is rather poor. Even if a protein is identified as a transmembrane one, the possible location of the lipid bilayer is not indicated in the PDB because these proteins are crystallized without their natural lipid bilayer, and currently no method is publicly available to detect the possible membrane plane using the atomic coordinates of membrane proteins. Results: Here we present a new geometrical approach to distinguish between transmembrane and globular proteins using structural information only and to locate the most likely position of the lipid bilayer. An automated algorithm (TMDET) is given to determine the membrane planes relative to the position of atomic coordinates, together with a discrimination function which is able to separate transmembrane and globular proteins even in cases of low resolution or incomplete structures such as fragments or parts of large multi chain complexes. This method can be used for the proper annotation of protein structures containing transmembrane segments and paves the way to an up-to-date database containing the structure of all known transmembrane proteins and fragments (PDB_TM) which can be automatically updated. The algorithm is equally important for the purpose of constructing databases purely of globular proteins. Availability: The PDB_TM database is available for academic users on {{http: //www.enzim.hu/PDB_TM}}. Contact: [email protected], [email protected], [email protected] Supplementary Information: Data files used in this article can be found under the PDB_TM homepage ({{http://www.enzim.hu/PDB_TM/index.php?method=
منابع مشابه
Transmembrane proteins in the Protein Data Bank: identification and classification
MOTIVATION Integral membrane proteins play important roles in living cells. Although these proteins are estimated to constitute 25% of proteins at a genomic scale, the Protein Data Bank (PDB) contains only a few hundred membrane proteins due to the difficulties with experimental techniques. The presence of transmembrane proteins in the structure data bank, however, is quite invisible, as the an...
متن کاملOPM: Orientations of Proteins in Membranes database
SUMMARY The Orientations of Proteins in Membranes (OPM) database provides a collection of transmembrane, monotopic and peripheral proteins from the Protein Data Bank whose spatial arrangements in the lipid bilayer have been calculated theoretically and compared with experimental data. The database allows analysis, sorting and searching of membrane proteins based on their structural classificati...
متن کاملPropensity based classification: Dehalogenase and non-dehalogenase enzymes
The present work was designed to classify and differentiate between the dehalogenase enzyme to non–dehalogenases (other hydrolases) by taking the amino acid propensity at the core, surface and both the parts. The data sets were made on an individual basis by selecting the 3D structures of protein available in the PDB (Protein Data Bank). The prediction of the core amino acid were predicted by I...
متن کاملO-5: Identification of Novel ImmunodominantEpididymal Sperm Proteins Using CombinatorialApproach
Background: Alteration in the protein signatures of functionally immature testicular spermatozoa occurs during their journey through the epididymis. This leads to acquisition of sperm domain specific functions essential for successful fertilization. Epididymal sperm proteins are preferred targets for immunocontraception as well as in elucidating the causes of infertility. The Background of the ...
متن کاملTOPDB: topology data bank of transmembrane proteins
The Topology Data Bank of Transmembrane Proteins (TOPDB) is the most complete and comprehensive collection of transmembrane protein datasets containing experimentally derived topology information currently available. It contains information gathered from the literature and from public databases available on the internet for more than a thousand transmembrane proteins. TOPDB collects details of ...
متن کامل