HIDDEN MARKOV MODELS AND LARGE - SCALE GENOMEANALYSISSean
نویسنده
چکیده
PFAM is a database of multiple alignments and hidden Markov models (HMMs) of common, conserved protein domains. PFAM HMMs complement BLAST analysis in the annotation of the C. elegans and human genome sequencing projects at Washington University and the Sanger Centre. PFAM2, based on full, gapped multiple alignments of structural and/or functional protein domains, currently contains 527 models. PFAM/HMM analysis hits at least one domain in 24% of the predicted proteins in the C. elegans genome project. 8% of C. elegans proteins are annotated as multidomain proteins by PFAM, with up to 5 diierent kinds of recognized domains per protein and up to 44 total recognized domains per protein.
منابع مشابه
MAN-MACHINE INTERACTION SYSTEM FOR SUBJECT INDEPENDENT SIGN LANGUAGE RECOGNITION USING FUZZY HIDDEN MARKOV MODEL
Sign language recognition has spawned more and more interest in human–computer interaction society. The major challenge that SLR recognition faces now is developing methods that will scale well with increasing vocabulary size with a limited set of training data for the signer independent application. The automatic SLR based on hidden Markov models (HMMs) is very sensitive to gesture's shape inf...
متن کاملProbability Distance Based Compression of Hidden Markov Models
Abstract. Large-scale stochastic models are relevant in many different fields such as computational biology, finance, social sciences, communication and traffic networks. In order to both efficiently simulate and analyze such models and to understand the essential properties of the system, it is desirable to have model reduction techniques that much reduce the dimensionality of the model while ...
متن کاملHidden Markov models in biological sequence
The vast increase of data in biology has meant that many aspects of computational science have been drawn into the field. Two areas of crucial importance are large-scale data management and machine learning. The field between computational science and biology is varyingly described as “computational biology” or “bioinformatics.” This paper reviews machine learning techniques based on the use of...
متن کاملOnline Spectral Identification of Dynamical Systems
Recently, a number of researchers have proposed spectral algorithms for learning models of nonlinear dynamical systems—for example, Hidden Markov Models (HMMs) [1, 2], Partially Observable Markov Decision Processes (POMDPs) [3], and Predictive State Representations (PSRs) [4, 3, 5]. These algorithms are attractive since they are statistically consistent and not subject to local optima. However,...
متن کاملScaling Factorial Hidden Markov Models: Stochastic Variational Inference without Messages
Factorial Hidden Markov Models (FHMMs) are powerful models for sequential data but they do not scale well with long sequences. We propose a scalable inference and learning algorithm for FHMMs that draws on ideas from the stochastic variational inference, neural network and copula literatures. Unlike existing approaches, the proposed algorithm requires no message passing procedure among latent v...
متن کامل