Sequence-structure specificity--how does an inverse folding approach work?
نویسندگان
چکیده
The inverse folding approach is a powerful tool in protein structure prediction when the native state of a sequence adopts one of the known protein folds. This is because some proteins show strong sequence-structure specificity in inverse folding experiments that allow gaps and insertions in the sequence-structure alignment. In those cases when structures similar to their native folds are included in the structure database, the z-scores (which measure the sequence-structure specificity) of these folds are well separated from those of other alternative structures. In this paper, we seek to understand the origin of this sequence-structure specificity and to identify how the specificity arises on passing from a short peptide chain to the entire protein sequence. To accomplish this objective, a simplified version of inverse folding, gapless inverse folding, is performed using sequence fragments of different sizes from 53 proteins. The results indicate that usually a significant portion of the entire protein sequence is necessary to show sequence-structure specificity, but there are regions in the sequence that begin to show this specificity at relatively short fragment size (15-20 residues). An island picture, in which the regions in the sequence that recognize their own native structure grow from some seed fragments, is observed as the fragment size increases. Usually, more similar structures to the native states are found in the top-scoring structural fragments in these high-specificity regions.
منابع مشابه
Inverse folding of RNA II
In the Oxford Summer School in Computational Biology 2011, one of the projects was aimed at creating an improved method for inverse RNA folding based on a genetic algorithm (GA) approach. This turned out to be sufficiently successful [16] and inspiring, that we will extend on it in this year’s summer school. The inverse RNA folding problem consists of finding an RNA sequence that, as a molecule...
متن کاملXylanase homology modeling using the inverse protein folding approach.
Xylanase has been used in wood pulp bleaching in an effort to reduce chlorine release into the environment and pollution associated with paper production. The three-dimensional structure of xylanase is important to enable better understanding of the enzyme mechanism and to help design a more thermostable xylanase mutant. At the time this work was begun, there was no sequence homologous protein ...
متن کاملStructural Characteristics of Stable Folding Intermediates of Yeast Iso-1-Cytochrome-c
Cytochrome-c (cyt-c) is an electron transport protein, and it is present throughout the evolution. More than 280 sequences have been reported in the protein sequence database (www.uniprot.org). Though sequentially diverse, cyt-c has essentially retained its tertiary structure or fold. Thus a vast data set of varied sequences with retention of similar structure and fun...
متن کاملOn the Computational Complexity of Sequence Design
Inverse protein folding concerns the identification of an amino acid sequence that folds to a given structure. Sequence design problems attempt to avoid the apparant difficulty of inverse protein folding by defining an energy that can be minimized to find protein-like sequences. We evaluate the practical relevance of two sequence design problems by analyzing their computational complexity. Vc' ...
متن کاملRelation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Protein engineering
دوره 10 4 شماره
صفحات -
تاریخ انتشار 1997