A Turn-Key Approach for Large-Scale Identification of Complex Posttranslational Modifications
Authors
Abstract
The conjugation of complex post-translational modifications (PTMs), such as glycosylation and modification by the small ubiquitin-like modifier (SUMOylation), to a substrate protein can substantially change the resulting peptide fragmentation pattern compared to that of its unmodified counterpart, making current database search methods poorly suited to identifying tandem mass (MS/MS) spectra from such modified peptides. Traditionally, it has been difficult to develop new algorithms to identify these atypical peptides because no large set of annotated spectra was available from which to learn the altered fragmentation pattern. Using SUMOylation as an example, we propose a novel approach to generate large MS/MS training data sets from modified peptides and derive an algorithm that learns the properties of PTM-specific fragmentation from such training data. Benchmark tests on data sets of varying complexity show that our method is 80-300% more sensitive than current state-of-the-art approaches. The core concepts of our method are readily applicable to developing algorithms for the identification of peptides with other complex PTMs.
Similar resources
Proteomics: Challenges, Techniques and Possibilities to Overcome Biological Sample Complexity
Proteomics is the large-scale study of the structure and function of proteins in complex biological samples. Such an approach has the potential to help in understanding the complex nature of the organism. Current proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of the proteome. Advances in protein fractionation and labeling techni...
InsPecT: identification of posttranslationally modified peptides from tandem mass spectra.
Reliable identification of posttranslational modifications is key to understanding various cellular regulatory processes. We describe a tool, InsPecT, to identify posttranslational modifications using tandem mass spectrometry data. InsPecT constructs database filters that proved to be very successful in genomics searches. Given an MS/MS spectrum S and a database D, a database filter selects a s...
Large Scale Experiments Data Analysis for Estimation of Hydrodynamic Force Coefficients
This paper describes the various frequency-domain methods that may be used to analyze experimental data on the force experienced by a circular cylinder in waves and current, in order to estimate drag and inertia coefficients for use in Morison’s equation. An additional approach, system identification techniques (SIT), is also introduced. A set of data obtained from experiments on heavily roughened circular...
A Comparative Analysis and Review of lysyl Residues Affected by Posttranslational Modifications
Post-translational modification is the most common mechanism of regulating protein function. If phosphorylation is considered a key event in many signal transduction pathways, other modifications must be considered as well. In particular the side chain of lysine residues is a target of different modifications; notably acetylation, methylation, ubiquitylation, sumoylation, neddylation, etc. Mass...
Identification and characterization of posttranslational modifications of proteins by MALDI ion trap mass spectrometry.
Matrix-assisted laser desorption/ionization (MALDI) ion trap mass spectrometry is shown to be a powerful tool for the elucidation of protein modifications. Low-energy covalent bonds that originate from certain posttranslational modifications dissociate preferentially to produce characteristic mass spectrometric signatures that prove useful for the accurate, confident identification and characte...
Journal title:
Volume 13, Issue
Pages -
Publication date: 2014