High-Throughput 3D Homology Detection via NMR Resonance Assignment
نویسندگان
چکیده
One goal of the structural genomics initiative is the identification of new protein folds. Sequencebased structural homology prediction methods are an important means for prioritizing unknown proteins for structure determination. However, an important challenge remains: two highly dissimilar sequences can have similar folds — how can we detect this rapidly, in the context of structural genomics? High-throughput NMR experiments, coupled with novel algorithms for data analysis, can address this challenge. We report an automated procedure, called HD, for detecting 3D structural homologies from sparse, unassigned protein NMR data. Our method identifies 3D models in a protein structural database whose geometries best fit the unassigned experimental NMR data. HD does not use, and is thus not limited by sequence homology. The method can also be used to confirm or refute structural predictions made by other techniques such as protein threading or homology modelling. The algorithm runs in O(pn log (cn) + p log p) time, where p is the number of proteins in the database, n is the number of residues in the target protein and c is the maximum edge weight in an integer-weighted bipartite graph. Our experiments on real NMR data from 3 different proteins against a database of 4,500 representative folds demonstrate that the method identifies closely related protein folds, including sub-domains of larger proteins, with as little as 10-30% sequence homology between the target protein (or sub-domain) and the computed model. In particular, we report no false-negatives or false-positives despite significant percentages of missing experimental data. Dartmouth Computer Science Technical Report TR2004-487 Abbreviations used: NMR, nuclear magnetic resonance; RDC, residual dipolar coupling; 3D, three-dimensional; HSQC, heteronuclear single-quantum coherence; HN, amide proton; SAR, structure activity relation; SO(3), special orthogonal (rotation) group in 3D.
منابع مشابه
High-Throughput 3D Structural Homology Detection via NMR Resonance Assignment
One goal of the structural genomics initiative is the identification of new protein folds. Sequence-based structural homology prediction methods are an important means for prioritizing unknown proteins for structure determination. However, an important challenge remains: two highly dissimilar sequences can have similar folds — how can we detect this rapidly, in the context of structural genomic...
متن کاملA Polynomial-Time Nuclear Vector Replacement Algorithm for Automated NMR Resonance Assignments
High-throughput NMR structural biology can play an important role in structural genomics. We report an automated procedure for high-throughput NMR resonance assignment for a protein of known structure, or of an homologous structure. These assignments are a prerequisite for probing protein-protein interactions, protein-ligand binding, and dynamics by NMR. Assignments are also the starting point ...
متن کامل3D Structural Homology Detection via Unassigned Residual Dipolar Couplings
Recognition of a protein's fold provides valuable information about its function. While many sequence-based homology prediction methods exist, an important challenge remains: two highly dissimilar sequences can have similar folds-- how can we detect this rapidly, in the context of structural genomics? High-throughput NMR experiments, coupled with novel algorithms for data analysis, can address ...
متن کاملNOEnet–Use of NOE networks for NMR resonance assignment of proteins with known 3D structure
MOTIVATION A prerequisite for any protein study by NMR is the assignment of the resonances from the (15)N-(1)H HSQC spectrum to their corresponding atoms of the protein backbone. Usually, this assignment is obtained by analyzing triple resonance NMR experiments. An alternative assignment strategy exploits the information given by an already available 3D structure of the same or a homologous pro...
متن کاملStructural Proteomics by NMR
Nuclear magnetic resonance (NMR) is a powerful spectroscopic technique that permits the detailed study at atomic resolution of the three-dimensional structure and dynamics of macromolecules and their complexes in solution. In this brief chapter, I discuss various aspects of NMR that are pertinent to structural proteomics, that is, the high-throughput study of protein–protein complexes at the at...
متن کامل