Top-Down Clustering for Protein Subfamily Identification
Similar resources
Top-Down Clustering for Protein Subfamily Identification
We propose a novel method for the task of protein subfamily identification; that is, finding subgroups of functionally closely related sequences within a protein family. In line with phylogenomic analysis, the method first builds a hierarchical tree using as input a multiple alignment of the protein sequences, then uses a post-pruning procedure to extract clusters from the tree. Differently fro...
Top-Down Induction of Clustering Trees
An approach to clustering is presented that adapts the basic top-down induction of decision trees method towards clustering. To this aim, it employs the principles of instance based learning. The resulting methodology is implemented in the TIC (Top down Induction of Clustering trees) system for first order clustering. The TIC system employs the first order logical decision tree representation o...
Top-down Clustering Using Multidimensional Indexes
Clustering on large databases has been studied actively as an increasing number of applications involve huge amounts of data. In this paper, we propose a novel top-down clustering method based on region density using a multidimensional index. Generally, multidimensional indexes have the clustering property of storing similar objects in the same or adjacent data pages. By taking advantage of this...
Automated Protein Subfamily Identification and Classification
Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to...
ProSight PTM: an integrated environment for protein identification and characterization by top-down mass spectrometry
ProSight PTM (https://prosightptm.scs.uiuc.edu/) is a web application for identification and characterization of proteins using mass spectra data from 'top-down' fragmentation of intact protein ions (i.e. without any tryptic digestion). ProSight PTM has many tools and graphical features to facilitate analysis of single proteins, proteins in mixtures and proteins fragmented in parallel. Sequence...
Journal
Journal title: Evolutionary Bioinformatics
Year: 2013
ISSN: 1176-9343
DOI: 10.4137/ebo.s11609