Pairwise similarity scores using functional annotations: review and comparison
Abstract
Previous studies have examined how a number of functional similarity scores relate to different types of biological information: sequences, protein families, expression profiles, pathways, literature, etc. This work reviews and critically compares previously proposed similarity scores, to help in choosing the appropriate one for the problem at hand, considering all the information that can be integrated. We compare the scores in detail using data from one of the most completely GO-annotated genomes, Saccharomyces cerevisiae.