The Discrete Infinite Logistic Normal Distribution
نویسندگان
چکیده
We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparametric prior for mixed membership models. DILN generalizes the hierarchical Dirichlet process (HDP) to model correlation structure between the weights of the atoms at the group level. We derive a representation of DILN as a normalized collection of gamma-distributed random variables and study its statistical properties. We derive a variational inference algorithm for approximate posterior inference. We apply DILN to topic modeling of documents and study its empirical performance on four corpora, comparing performance with the HDP and the correlated topic model (CTM). To compute with large-scale data, we develop a stochastic variational inference algorithm for DILN and compare with similar algorithms for HDP and latent Dirichlet allocation (LDA) on a collection of 350, 000 articles from Nature.
منابع مشابه
Discussion of "The Discrete Infinite Logistic Normal Distribution for Mixed-Membership Modeling"
Mixed-membership models (e.g. “topic models”) are inarguably popular; especially latent Dirichlet allocation (LDA) [Blei et al., 2003] and its variants. Such models have become a fundamental tool in the analysis and exploration of many types of data. Originally designed to model text documents as per-word draws from a document-specific weighting of a finite collection of “topics” (distributions...
متن کاملThe Discrete Infinite Logistic Normal Distribution for Mixed-Membership Modeling
We present the discrete infinite logistic normal distribution (DILN, “Dylan”), a Bayesian nonparametric prior for mixed membership models. DILN is a generalization of the hierarchical Dirichlet process (HDP) that models correlation structure between the weights of the atoms at the group level. We derive a representation of DILN as a normalized collection of gamma-distributed random variables, a...
متن کاملLogistic–Normal Distribution: Properties and Application
In this paper some properties of logistics - x family are discussed and a member of the family, the logistic–normal distribution, is studied in detail. Average deviations, risk function and fashion for logistic–normal distribution is obtained. The method of maximum likelihood estimation is proposed for estimating the parameters of the logistic–normal distribution and a data set is ...
متن کاملThe Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
متن کاملA Discrete Kumaraswamy Marshall-Olkin Exponential Distribution
Finding new families of distributions has become a popular tool in statistical research. In this article, we introduce a new flexible four-parameter discrete model based on the Marshall-Olkin approach, namely, the discrete Kumaraswamy Marshall-Olkin exponential distribution. The proposed distribution can be viewed as another generalization of the geometric distribution and enfolds some importan...
متن کامل