Separating Mixtures Using Megapriors

Author

  • TIMOTHY L. BAILEY
Abstract

Fitting the parameters of a discrete finite mixture distribution to a set of data using the EM algorithm can be extremely difficult when the likelihood surface has many local maxima, the form of the components is unknown, and the number of components is unknown. The exponential explosion in the number of different models and different starting points for EM which must be tested can be reduced by finding the individual components of the mixture distribution one at a time. This can be done by fitting a succession of two-component models to the data. Each two-component model has one component constrained to be the uniform distribution and, ideally, when EM has converged, the other component of the model is a single component of the sampled distribution. Unfortunately, the second component of the fitted model is often a convex combination of two or more components of the distribution. This difficulty can be overcome by using an extremely low variance prior on the parameters of the components of the distribution and maximizing the posterior probability of the data rather than its likelihood using a modified version of EM. In particular, a "megaprior" (a prior whose variance is a function of the sample size) ensures that the convex combination problem is avoided no matter how large the sample. We have incorporated these ideas into our MEME algorithm for discovering patterns (motifs) in biological sequences. Experimentally determined priors are available for the parameters of protein motifs. We decrease the variance of these priors to further reduce the algorithm's tendency to find convex combinations of motifs. Experiments with protein sequence data show that by using megapriors the problem of convex combinations is avoided and our algorithm discovers biologically significant patterns.
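The two-component fit described in the abstract can be illustrated with a minimal Python sketch. It is not the MEME implementation. It assumes that each sample is a fixed-width string, that the free component is a product of per-position categorical distributions, that the prior is a single Dirichlet applied through a pseudocount update, and that the "megaprior" is obtained by making the total pseudocount proportional to the sample size. The function name `fit_one_component` and the parameters `prior_mean` and `prior_weight` are hypothetical names chosen for this illustration.

```python
import math
import random


def fit_one_component(seqs, alphabet, prior_mean, prior_weight, iters=50, seed=0):
    """EM for a mixture of a uniform component and one free component.

    prior_weight is the total Dirichlet pseudocount per position (assumed
    form of the prior); a larger weight means a lower-variance prior.
    """
    rng = random.Random(seed)
    width, k, n = len(seqs[0]), len(alphabet), len(seqs)
    idx = {a: i for i, a in enumerate(alphabet)}

    # Random starting point for the free component, normalized per position.
    theta = []
    for _ in range(width):
        row = [rng.random() + 0.5 for _ in range(k)]
        total = sum(row)
        theta.append([v / total for v in row])

    lam = 0.5                             # mixing weight of the free component
    log_uniform = -width * math.log(k)    # log-probability under the uniform component

    for _ in range(iters):
        # E-step: posterior probability that each sample came from the free component.
        z = []
        for s in seqs:
            log_p = sum(math.log(max(theta[j][idx[c]], 1e-300))
                        for j, c in enumerate(s))
            a = math.log(lam) + log_p
            b = math.log(1.0 - lam) + log_uniform
            m = max(a, b)
            ea, eb = math.exp(a - m), math.exp(b - m)
            z.append(ea / (ea + eb))

        # M-step: expected counts plus prior pseudocounts.
        z_total = sum(z)
        lam = min(max(z_total / n, 1e-6), 1.0 - 1e-6)
        for j in range(width):
            counts = [0.0] * k
            for s, zi in zip(seqs, z):
                counts[idx[s[j]]] += zi
            denom = z_total + prior_weight
            theta[j] = [(counts[a] + prior_weight * prior_mean[a]) / denom
                        for a in range(k)]
    return lam, theta


# Toy usage: the pseudocount grows with the sample size (the assumed megaprior).
toy = ["ACGT", "ACGT", "ACGA", "TTTT", "GGCA", "ACGT"]
lam, theta = fit_one_component(toy, "ACGT", [0.25] * 4, prior_weight=10 * len(toy))
```

Because the pseudocount scales with the number of samples, the prior keeps the same relative influence on the estimate however large the data set becomes, which is the property the abstract attributes to a megaprior. A fixed pseudocount would instead be overwhelmed by the data as the sample grows.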


Similar resources

Unison Source Separation

In this work we present a new scenario of analyzing and separating linear mixtures of musical instrument signals. When instruments are playing in unison, traditional source separation methods are not performing well. Although the sources share the same pitch, they often still differ in their modulation frequency caused by vibrato and/or tremolo effects. In this paper we propose source separatio...


Nonlinear Blind Source Separation Using a Genetic Algorithm

Demixing independent source signals from their nonlinear mixtures is a very important issue in many scenarios. This paper presents a novel method for blindly separating unobservable independent source signals from their nonlinear mixtures. The demixing system is modeled using a parameterized neural network whose parameters can be determined under the criterion of independence of its outputs. Co...


Blind Separating Convolutive Post Non-linear Mixtures

This paper addresses blind source separation in convolutive post nonlinear (CPNL) mixtures. In these mixtures, the sources are mixed convolutively, and then measured by nonlinear (e.g. saturated) sensors. The algorithm is based on minimizing the mutual information by using multivariate score functions.


Recurrent networks for separating extractable-target nonlinear mixtures. Part II. Blind configurations

While most reported blind source separation methods concern linear mixtures, we here address the nonlinear case. In the first part of this paper, we introduced a general class of nonlinear mixtures which can be inverted using recurrent networks. That part was focused on separating structures themselves and therefore on the non-blind configuration, whereas the current paper addresses the estimat...


A Natural Gradient Convolutive Blind Source Separation Algorithm for Speech Mixtures

In this paper, a novel algorithm for separating mixtures of multiple speech signals measured by multiple microphones in a room environment is proposed. The algorithm is a modification of an existing approach for density-based multichannel blind deconvolution using natural gradient adaptation. It employs linear predictors within the coefficient updates and produces separated speech signals whose...




Publication date: 1996