Bayesian Density Regression and Predictor-dependent Clustering
نویسندگان
چکیده
JU-HYUN PARK: Bayesian Density Regression and Predictor-Dependent Clustering. (Under the direction of Dr. David Dunson.) Mixture models are widely used in many application areas, with finite mixtures of Gaussian distributions applied routinely in clustering and density estimation. With the increasing need for a flexible model for predictor-dependent clustering and conditional density estimation, mixture models are generalized to incorporate predictors with infinitely many components in the semiparametric Bayesian perspective. Much of the recent work in the nonparametric Bayes literature focuses on introducing predictor-dependence into the probability weights. In this dissertation we propose three semiparametric Bayesian methods, with a focus on the applications of predictor-dependent clustering and condition density estimation. We first derive a generalized product partition model (GPPM), starting with a Dirichlet process (DP) mixture model. The GPPM results in a generalized Pólya urn scheme. Next, we consider the problem of density estimation in cases where predictors are not directly measured. We propose a model that relies on Bayesian approaches to modeling of the unknown distribution of latent predictors and of the conditional distribution of responses given latent predictors. Finally, we develop a semiparametric Bayesian model for density regression in cases with many predictors. To reduce dimensionality of data, our model is based on factor analysis models with the number of latent variables unknown. A nonparametric prior for infinite factors is defined.
منابع مشابه
Bayesian Density Regression
This article considers Bayesian methods for density regression, allowing a random probability distribution to change flexibly with multiple predictors. The conditional response distribution is expressed as a nonparametric mixture of regression models, with the mixture distribution changing with predictors. A class of weighted mixture of Dirichlet process (WMDP) priors is proposed for the uncoun...
متن کاملBayesian bandwidth estimation for a nonparametric functional regression model with unknown error density
Error density estimation in a nonparametric functional regression model with functional predictor and scalar response is considered. The unknown error density is approximated by a mixture of Gaussian densities with means being the individual residuals, and variance as a constant parameter. This proposed mixture error density has a form of a kernel density estimator of residuals, where the regre...
متن کاملBayesian isotonic density regression.
Density regression models allow the conditional distribution of the response given predictors to change flexibly over the predictor space. Such models are much more flexible than nonparametric mean regression models with nonparametric residual distributions, and are well supported in many applications. A rich variety of Bayesian methods have been proposed for density regression, but it is not c...
متن کاملImproved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملمقایسه روش های مختلف آماری در انتخاب ژنومی گاوهای هلشتاین
Genomic selection combines statistical methods with genomic data to predict genetic values for complex traits. The accuracy of prediction of genetic values in selected population has a great effect on the success of this selection method. Accuracy of genomic prediction is highly dependent on the statistical model used to estimate marker effects in reference population. Various factors such a...
متن کامل