Non-parametric estimation of posterior error probabilities associated with peptides identified by tandem mass spectrometry
نویسندگان
چکیده
MOTIVATION A mass spectrum produced via tandem mass spectrometry can be tentatively matched to a peptide sequence via database search. Here, we address the problem of assigning a posterior error probability (PEP) to a given peptide-spectrum match (PSM). This problem is considerably more dif.cult than the related problem of estimating the error rate associated with a large collection of PSMs. Existing methods for estimating PEPs rely on a parametric or semiparametric model of the underlying score distribution. RESULTS We demonstrate how to apply non-parametric logistic regression to this problem. The method makes no explicit assumptions about the form of the underlying score distribution; instead, the method relies upon decoy PSMs, produced by searching the spectra against a decoy sequence database, to provide a model of the null score distribution. We show that our non-parametric logistic regression method produces accurate PEP estimates for six different commonly used PSM score functions. In particular, the estimates produced by our method are comparable in accuracy to those of PeptideProphet, which uses a parametric or semiparametric model designed speci.cally to work with SEQUEST. The advantage of the non-parametric approach is applicability and robustness to new score functions and new types of data. AVAILABILITY C++ code implementing the method as well as supplementary information is available at http://noble.gs. washington.edu/proj/qvality
منابع مشابه
Posterior error probabilities and false discovery rates: two sides of the same coin.
A variety of methods have been described in the literature for assigning statistical significance to peptides identified via tandem mass spectrometry. Here, we explain how two types of scores, the q-value and the posterior error probability, are related and complementary to one another.
متن کاملEfficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.
The problem of identifying proteins from a shotgun proteomics experiment has not been definitively solved. Identifying the proteins in a sample requires ranking them, ideally with interpretable scores. In particular, "degenerate" peptides, which map to multiple proteins, have made such a ranking difficult to compute. The problem of computing posterior probabilities for the proteins, which can b...
متن کاملQuantification of Melittin in Iranian Honey Bee (Apis mellifera meda) Venom by Liquid Chromatography-electrospray Ionization-ion Trap Tandem Mass Spectrometry (LC-ESI-IT-MS/MS)
The current research aimed to quantify melittin (MEL) in Iranian honey bee (Apis mellifera meda) venom. To this end, a liquid chromatography-electrospray ionization-ion trap tandem mass spectrometry (LC-ESI-IT-MS/MS) approach was employed. Melittin is the main toxic peptide of honey bee venom with various biological and pharmacological activities. It was extracted with...
متن کاملDevelopment and Validation of Bioanalytical Method for Simultaneous Estimation of Nebivolol Enantiomers in Human Plasma Using Liquid Chromatography-tandem Mass Spectrometry
The present study describes a liquid chromatography-tandem mass spectrometry (LC-MS/MS) method for the simultaneous determination of S-RRR and R-SSS nebivolol (nebivolol enantiomers) in human plasma using solid phase extraction technique. Method of both S-RRR and R-SSS nebivolol (nebivolol enantiomers) has been developed and validated using racemic nebivolol D4 as an internal standard. Analytes...
متن کاملProteome analysis of Cryptosporidium parvum and C. hominis using two-dimentional electrophoresis, image analysis and tandem mass spectrometry
Until recently, Cryptosporidium was thought to be a single species genus. Molecular studies now showthat there are at least 10 valid species of this parasite. Among them, two morphologically identical species, C.hominis and C. parvum are the most pathogenic identified to date and share 97% of identical genomes.Post-genomic analyses is therefore necessary to explore further the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 24 16 شماره
صفحات -
تاریخ انتشار 2008