Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences
نویسندگان
چکیده
How is it possible to find good traits for phylogenetic reconstructions? Here, we present a new phyloproteomic criterion that is an occurrence of simple motifs which can be imprints of evolution history. We studied the occurrences of 11780 six-residue-long motifs consisting of two randomly located amino acids in 97 eukaryotic and 25 bacterial proteomes. For all eukaryotic proteomes, with the exception of the Amoebozoa, Stramenopiles, and Diplomonadida kingdoms, the number of proteins containing the motifs from the first group (one of the two amino acids occurs once at the terminal position) made about 20%; in the case of motifs from the second (one of two amino acids occurs one time within the pattern) and third (the two amino acids occur randomly) groups, 30% and 50%, respectively. For bacterial proteomes, this relationship was 10%, 27%, and 63%, respectively. The matrices of correlation coefficients between numbers of proteins where a motif from the set of 11780 motifs appears at least once in 9 kingdoms and 5 phyla of bacteria were calculated. Among the correlation coefficients for eukaryotic proteomes, the correlation between the animal and fungi kingdoms (0.62) is higher than between fungi and plants (0.54). Our study provides support that animals and fungi are sibling kingdoms. Comparison of the frequencies of six-residue-long motifs in different proteomes allows obtaining phylogenetic relationships based on similarities between these frequencies: the Diplomonadida kingdoms are more close to Bacteria than to Eukaryota; Stramenopiles and Amoebozoa are more close to each other than to other kingdoms of Eukaryota.
منابع مشابه
Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development
Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name ...
متن کاملBeta-hairpins in proteins revisited: lessons for de novo design.
Beta-Hairpins with short connecting loops (1-5 residues) have been identified from a data set of 250 non-homologous, high resolution (< or =2.0 A) protein crystal structures. The conformational preferences of the loop segments have been analyzed with the specific aim of identifying frequently occurring motifs. Type I' and II' beta-turns were found to have a high propensity for occurrence in two...
متن کاملHeavy Metals Residue in Cultivated Mango Samples from Iran
Background: Heavy metals contaminations are recognized as the serious risk to our environment. The aim of the present study was to analyze heavy metals residue in cultivated mango samples from Iran. Methods: Totally, 72 mango samples were randomly collected among six different mango genotypes cultivated in Southern Iran from June to July 2015. Lead, chromium, cadmium, and arsenic were determin...
متن کاملThe SLiMDisc server: short, linear motif discovery in proteins
Short, linear motifs (SLiMs) play a critical role in many biological processes, particularly in protein-protein interactions. Overrepresentation of convergent occurrences of motifs in proteins with a common attribute (such as similar subcellular location or a shared interaction partner) provides a feasible means to discover novel occurrences computationally. The SLiMDisc (Short, Linear Motif Di...
متن کاملStatistical detection of cooperative transcription factors with similarity adjustment
MOTIVATION Statistical assessment of cis-regulatory modules (CRMs) is a crucial task in computational biology. Usually, one concludes from exceptional co-occurrences of DNA motifs that the corresponding transcription factors (TFs) are cooperative. However, similar DNA motifs tend to co-occur in random sequences due to high probability of overlapping occurrences. Therefore, it is important to co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
دوره 2015 شماره
صفحات -
تاریخ انتشار 2015