Non-parametric estimation of posterior error probabilities associated with peptides identified by tandem mass spectrometry

نویسندگان

  • Lukas Käll
  • John D. Storey
  • William Stafford Noble
چکیده

MOTIVATION A mass spectrum produced via tandem mass spectrometry can be tentatively matched to a peptide sequence via database search. Here, we address the problem of assigning a posterior error probability (PEP) to a given peptide-spectrum match (PSM). This problem is considerably more dif.cult than the related problem of estimating the error rate associated with a large collection of PSMs. Existing methods for estimating PEPs rely on a parametric or semiparametric model of the underlying score distribution. RESULTS We demonstrate how to apply non-parametric logistic regression to this problem. The method makes no explicit assumptions about the form of the underlying score distribution; instead, the method relies upon decoy PSMs, produced by searching the spectra against a decoy sequence database, to provide a model of the null score distribution. We show that our non-parametric logistic regression method produces accurate PEP estimates for six different commonly used PSM score functions. In particular, the estimates produced by our method are comparable in accuracy to those of PeptideProphet, which uses a parametric or semiparametric model designed speci.cally to work with SEQUEST. The advantage of the non-parametric approach is applicability and robustness to new score functions and new types of data. AVAILABILITY C++ code implementing the method as well as supplementary information is available at http://noble.gs. washington.edu/proj/qvality

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Posterior error probabilities and false discovery rates: two sides of the same coin.

A variety of methods have been described in the literature for assigning statistical significance to peptides identified via tandem mass spectrometry. Here, we explain how two types of scores, the q-value and the posterior error probability, are related and complementary to one another.

متن کامل

Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.

The problem of identifying proteins from a shotgun proteomics experiment has not been definitively solved. Identifying the proteins in a sample requires ranking them, ideally with interpretable scores. In particular, "degenerate" peptides, which map to multiple proteins, have made such a ranking difficult to compute. The problem of computing posterior probabilities for the proteins, which can b...

متن کامل

Quantification of Melittin in Iranian Honey Bee (Apis mellifera meda) Venom by Liquid Chromatography-electrospray Ionization-ion Trap Tandem Mass Spectrometry (LC-ESI-IT-MS/MS)

The current research aimed to quantify melittin (MEL) in Iranian honey bee (Apis mellifera meda) venom. To this end, a liquid chromatography-electrospray ionization-ion trap tandem mass spectrometry (LC-ESI-IT-MS/MS) approach was employed. Melittin is the main toxic peptide of honey bee venom with various biological and pharmacological activities. It was extracted with...

متن کامل

Development and Validation of Bioanalytical Method for Simultaneous Estimation of Nebivolol Enantiomers in Human Plasma Using Liquid Chromatography-tandem Mass Spectrometry

The present study describes a liquid chromatography-tandem mass spectrometry (LC-MS/MS) method for the simultaneous determination of S-RRR and R-SSS nebivolol (nebivolol enantiomers) in human plasma using solid phase extraction technique. Method of both S-RRR and R-SSS nebivolol (nebivolol enantiomers) has been developed and validated using racemic nebivolol D4 as an internal standard. Analytes...

متن کامل

Proteome analysis of Cryptosporidium parvum and C. hominis using two-dimentional electrophoresis, image analysis and tandem mass spectrometry

Until recently, Cryptosporidium was thought to be a single species genus. Molecular studies now showthat there are at least 10 valid species of this parasite. Among them, two morphologically identical species, C.hominis and C. parvum are the most pathogenic identified to date and share 97% of identical genomes.Post-genomic analyses is therefore necessary to explore further the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 24 16  شماره 

صفحات  -

تاریخ انتشار 2008