De Novo Peptide Identification Via Mixed-Integer Linear Optimization And Tandem Mass Spectrometry
نویسندگان
چکیده
A novel methodology for the de novo identification of peptides via mixedinteger linear optimization (MILP) and tandem mass spectrometry is presented. The overall mathematical model is presented and the key concepts of the proposed approach are described. A pre-processing algorithm is utilized to identify important m/z values in the tandem mass spectrum. Missing peaks, due to residue-dependent fragmentation characteristics, are dealt with using a twostage algorithmic framework. A cross-correlation approach is used to resolve missing amino acid assignments and to select the most probable peptide by comparing the theoretical spectra of the candidate sequences that were generated from the MILP sequencing stages with the experimental tandem mass spectrum. The novel proposed de novo method, denoted as PILOT, is compared to existing popular methods such as Lutefisk, PEAKS, PepNovo, EigenMS and NovoHMM for a set of spectra resulting from QTOF instruments.
منابع مشابه
A Mixed-Integer Optimization Framework for De Novo Peptide Identification.
A novel methodology for the de novo identification of peptides by mixed-integer optimization and tandem mass spectrometry is presented in this article. The various features of the mathematical model are presented and examples are used to illustrate the key concepts of the proposed approach. Several problems are examined to illustrate the proposed method's ability to address (1) residue-dependen...
متن کاملA Mixed-Integer Linear Optimization Framework for the Identification and Quantification of Targeted Post-translational Modifications of Highly Modified Proteins using Multiplexed ETD Tandem Mass Spectrometry
1 Abbreviations:-MILP Mixed-integer linear optimization-LP linear programming-ETD Electron transfer dissociation-ECD Electron capture dissociation-CID Collision induced dissociation-MS Mass spectrometry-LC Liquid chromatography-PTM Post-translational modification-HILIC Hydrophilic interaction liquid chromatography 2 Summary In this article, we present a novel methodology for the identification ...
متن کاملA Two-way Parallel Searching for Peptide Identification via Tandem Mass Spectrometry
De novo peptide sequencing that determines the amino acid sequence of a peptide via tandem mass spectrometry (MS/MS) has been increasingly used nowadays in proteomics for protein identification. Current de novo methods generally employ a graph theory, which usually produces a large number of candidate sequences and causes heavy computational cost while trying to determine a sequence with less a...
متن کاملA Suboptimal Algorithm for De Novo Peptide Sequencing via Tandem Mass Spectrometry
Tandem mass spectrometry has emerged to be one of the most powerful high-throughput techniques for protein identification. Tandem mass spectrometry selects and fragments peptides of interest into N-terminal ions and C-terminal ions, and it measures the mass/charge ratios of these ions. The de novo peptide sequencing problem is to derive the peptide sequences from given tandem mass spectral data...
متن کاملAuDeNS: A Tool for Automatic De Novo Peptide Sequencing
We have developed and implemented a framework for de novo sequencing of peptides using tandem mass spectrometry data. It first cleans the input spectrum with a number of data cleaning algorithms (“grass mowers”), followed by a sequencing algorithm that is a modification of a dynamic programming algorithm introduced in [CKT00]. In first experiments, our prototype performs well (but not better) i...
متن کامل