Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone

Authors

  • Jason Trigg
  • Karl Gutwin
  • Amy E. Keating
  • Bonnie Berger
Abstract

The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequences. Multicoil2 combines the pairwise correlations of the previous Multicoil method with the flexibility of Hidden Markov Models (HMMs) in a Markov Random Field (MRF). The resulting algorithm integrates sequence features, including pairwise interactions, through multinomial logistic regression to devise an optimized scoring function for distinguishing dimer, trimer and non-coiled-coil oligomerization states; this scoring function is used to produce Markov Random Field potentials that incorporate pairwise correlations localized in sequence. Multicoil2 significantly improves both coiled-coil detection and dimer versus trimer state prediction over the original Multicoil algorithm retrained on a newly constructed database of coiled-coil sequences. The new database, comprising 2,105 sequences containing 124,088 residues, includes reliable structural annotations based on experimental data in the literature. Notably, the enhanced performance of Multicoil2 is evident when tested in stringent leave-family-out cross-validation on the new database, reflecting expected performance on challenging new prediction targets that have minimal sequence similarity to known coiled-coil families. The Multicoil2 program and training database are available for download from http://multicoil2.csail.mit.edu.
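Two ideas named in the abstract can be illustrated with a small sketch: a multinomial logistic regression (softmax) score over the three states, and leave-family-out cross-validation splits. This is not the Multicoil2 implementation, and it omits the Markov Random Field step entirely. The feature values, weights, and family labels below are hypothetical placeholders chosen only to make the example runnable.

```python
import math
from typing import Dict, Iterator, List, Sequence, Tuple

# The three states distinguished in the abstract.
STATES = ("non-coiled-coil", "dimer", "trimer")


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw per-class scores into probabilities that sum to 1."""
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def state_probabilities(
    features: Sequence[float],
    weights: Sequence[Sequence[float]],
    biases: Sequence[float],
) -> Dict[str, float]:
    """Multinomial logistic regression: one linear score per state, then softmax."""
    scores = [
        biases[k] + sum(w * x for w, x in zip(weights[k], features))
        for k in range(len(STATES))
    ]
    return dict(zip(STATES, softmax(scores)))


def leave_family_out(
    families: Sequence[str],
) -> Iterator[Tuple[str, List[int], List[int]]]:
    """Yield (held_out_family, train_indices, test_indices), one fold per family.

    Every sequence of the held-out family goes to the test set, so the model
    is never evaluated on a family it saw during training.
    """
    for held_out in sorted(set(families)):
        train = [i for i, fam in enumerate(families) if fam != held_out]
        test = [i for i, fam in enumerate(families) if fam == held_out]
        yield held_out, train, test


# Hypothetical weights: two made-up features, one weight row per state.
WEIGHTS = [[0.0, 0.0], [1.5, -0.5], [-0.5, 1.5]]
BIASES = [0.0, 0.0, 0.0]

probs = state_probabilities([2.0, 0.0], WEIGHTS, BIASES)
best = max(probs, key=probs.get)

# Hypothetical family labels for four sequences.
splits = list(leave_family_out(["A", "B", "A", "C"]))
```

In a real setting the weights would be fitted to annotated training sequences, and each fold from `leave_family_out` would refit them on the training indices before scoring the held-out family.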

Similar resources

Computational characterization of parallel dimeric and trimeric coiled-coils using effective amino acid indices.

The coiled-coil, which consists of two or more α-helices winding around each other, is a ubiquitous and the most frequently observed protein-protein interaction motif in nature. The coiled-coil is known for its straightforward heptad repeat pattern and can be readily recognized based on protein primary sequences, exhibiting a variety of oligomer states and topologies. Due to the stable interact...

Molecular basis of coiled-coil oligomerization-state specificity.

Coiled coils are extensively and successfully used nowadays to rationally design multistranded structures for applications, including basic research, biotechnology, nanotechnology, materials science, and medicine. The wide range of applications as well as the important functions these structures play in almost all biological processes highlight the need for a detailed understanding of the facto...

MultiCoil: a program for predicting two- and three-stranded coiled coils.

A new multidimensional scoring approach for identifying and distinguishing trimeric and dimeric coiled coils is implemented in the MultiCoil program. The program extends the two-stranded coiled-coil prediction program PairCoil to the identification of three-stranded coiled coils. The computations are based upon data gathered from a three-stranded coiled-coil database comprising 6,319 amino acid...

An autonomous folding unit mediates the assembly of two-stranded coiled coils.

Subunit oligomerization of many proteins is mediated by coiled-coil domains. Although the basic features contributing to the thermodynamic stability of coiled coils are well understood, the mechanistic details of their assembly have not yet been dissected. Here we report a 13-residue sequence pattern that occurs with limited sequence variations in many two-stranded coiled coils and that is abso...

Complex Networks Govern Coiled-Coil Oligomerization – Predicting and Profiling by Means of a Machine Learning Approach

Understanding the relationship between protein sequence and structure is one of the great challenges in biology. In the case of the ubiquitous coiled-coil motif, structure and occurrence have been described in extensive detail, but there is a lack of insight into the rules that govern oligomerization, i.e. how many α-helices form a given coiled coil. To shed new light on the formation of two- a...

Journal title:

Volume 6, Issue

Pages -

Publication date: 2011