Dependent nonparametric trees for dynamic hierarchical clustering: Supplementary material
نویسندگان
چکیده
There exist a wide variety of distributions over trees with infinitely many nodes, including the nested Chinese restaurant process (Blei et al., 2004), the Dirichlet diffusion tree (Neal, 2003), and Kingman’s coalescent (Kingman, 1982). These models differ from the TSSBP in that data can only be associated with a leaf node, or equivalently a full path from root to leaf. We chose to base our clustering model on the TSSBP because, in many applications, it makes sense to associate data with internal nodes. For example, a document may be narrowly about Physics or Biology, or may be a more broad article on the sciences in general.
منابع مشابه
Dependent nonparametric trees for dynamic hierarchical clustering
Hierarchical clustering methods offer an intuitive and powerful way to model a wide variety of data sets. However, the assumption of a fixed hierarchy is often overly restrictive when working with data generated over a period of time: We expect both the structure of our hierarchy, and the parameters of the clusters, to evolve with time. In this paper, we present a distribution over collections ...
متن کاملNonparametric K-Sample Tests via Dynamic Slicing
ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Nonparametric K-Sample Tests via Dynamic Slicing Bo Jiang, Chao Ye & Jun S. Liu To cite this article: Bo Jiang, Chao Ye & Jun S. Liu (2015) Nonparametric K-Sample Tests via Dynamic Slicing, Journal of the American Statistical Association, 110:510, 642-653, DOI: 10.1080/01621459.2014.920257 To link...
متن کاملHow networks change with time
MOTIVATION Biological networks change in response to genetic and environmental cues. Changes are reflected in the abundances of biomolecules, the composition of protein complexes and other descriptors of the biological state. Methods to infer the dynamic state of a cell would have great value for understanding how cells change over time to accomplish biological goals. RESULTS A new method pre...
متن کاملNonparametric Bayesian Data Analysis
We review the current state of nonparametric Bayesian inference. The discussion follows a list of important statistical inference problems, including density estimation, regression, survival analysis, hierarchical models and model validation. For each inference problem we review relevant nonparametric Bayesian models and approaches including Dirichlet process (DP) models and variations, Polya t...
متن کاملBinary to Bushy: Bayesian Hierarchical Clustering with the Beta Coalescent
Discovering hierarchical regularities in data is a key problem in interacting with large datasets, modeling cognition, and encoding knowledge. A previous Bayesian solution—Kingman’s coalescent—provides a probabilistic model for data represented as a binary tree. Unfortunately, this is inappropriate for data better described by bushier trees. We generalize an existing belief propagation framewor...
متن کامل