Inter- and intra-chain disulfide bond prediction based on optimal feature selection.

نویسندگان

  • Shen Niu
  • Tao Huang
  • Kai-Yan Feng
  • Zhisong He
  • Weiren Cui
  • Lei Gu
  • Haipeng Li
  • Yu-Dong Cai
  • Yixue Li
چکیده

Protein disulfide bond is formed during post-translational modifications, and has been implicated in various physiological and pathological processes. Proper localization of disulfide bonds also facilitates the prediction of protein three-dimensional (3D) structure. However, it is both time-consuming and labor-intensive using conventional experimental approaches to determine disulfide bonds, especially for large-scale data sets. Since there are also some limitations for disulfide bond prediction based on 3D structure features, developing sequence-based, convenient and fast-speed computational methods for both inter- and intra-chain disulfide bond prediction is necessary. In this study, we developed a computational method for both types of disulfide bond prediction based on maximum relevance and minimum redundancy (mRMR) method followed by incremental feature selection (IFS), with nearest neighbor algorithm as its prediction model. Features of sequence conservation, residual disorder, and amino acid factor are used for inter-chain disulfide bond prediction. And in addition to these features, sequential distance between a pair of cysteines is also used for intra-chain disulfide bond prediction. Our approach achieves a prediction accuracy of 0.8702 for inter-chain disulfide bond prediction using 128 features and 0.9219 for intra-chain disulfide bond prediction using 261 features. Analysis of optimal feature set indicated key features and key sites for the disulfide bond formation. Interestingly, comparison of top features between interand intra-chain disulfide bonds revealed the similarities and differences of the mechanisms of forming these two types of disulfide bonds, which might help understand more of the mechanisms and provide clues to further experimental studies in this research field.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Prediction Method of Protein Disulfide Bond Based on Hybrid Strategy

A prediction method of protein disulfide bond based on support vector machine and sample selection is proposed in this paper. First, the protein sequences selected are encoded according to a certain encoding, input data for the prediction model of protein disulfide bond is generated; Then sample selection technique is used to select a portion of input data as training samples of support vector ...

متن کامل

Prediction Method of Protein Disulfide Bond Based on Pattern Selection

The effect of the different training samples is different for the classifier when pattern recognition system is established. The training samples were selected randomly in the past protein disulfide bond prediction methods, therefore the prediction accuracy of protein contact was reduced. In order to improve the influence of training samples, a prediction method of protein disulfide bond on the...

متن کامل

Improving the accuracy of predicting disulfide connectivity by feature selection

Disulfide bonds are primary covalent cross-links formed between two cysteine residues in the same or different protein polypeptide chains, which play important roles in the folding and stability of proteins. However, computational prediction of disulfide connectivity directly from protein primary sequences is challenging due to the nonlocal nature of disulfide bonds in the context of sequences,...

متن کامل

Removal of a C-terminal serine residue proximal to the inter-chain disulfide bond of a human IgG1 lambda light chain mediates enhanced antibody stability and antibody dependent cell-mediated cytotoxicity

Optimization of biophysical properties is a critical success factor for the developability of monoclonal antibodies with potential therapeutic applications. The inter-domain disulfide bond between light chain (Lc) and heavy chain (Hc) in human IgG1 lends structural support for antibody scaffold stability, optimal antigen binding, and normal Fc function. Recently, human IgG1λ has been suggested ...

متن کامل

Intra-A chain disulphide bond forms first during insulin precursor folding.

In this study, we investigated the folding pathway of insulin precursor and compared it with that of insulin-like growth factor I (IGF-I). The intra-A chain disulphide bond was found to form early in insulin precursor folding, whereas the corresponding disulphide bond in IGF-I formed late. Intra-A chain disulphide-bond deleted [A6, A11-Ser] proteins, including proinsulin, insulin, and A chain, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Protein and peptide letters

دوره 20 3  شماره 

صفحات  -

تاریخ انتشار 2013