Multiple Hypothesis Testing by Clustering Treatment Effects
Abstract
Multiple hypothesis testing and clustering have been the subject of extensive research in high-dimensional inference, yet the two problems have usually been treated separately. By defining true clusters in terms of shared parameter values, we can improve the sensitivity of individual tests, because more data bearing on the same parameter values are available. We develop and evaluate a hybrid methodology that uses clustering information to increase testing sensitivity and accommodates uncertainty in the true clustering. To investigate the potential efficacy of the hybrid approach, we first study a stylized example in which each object is evaluated with a standard z score but different objects are connected by shared parameter values. We show that testing power increases when the clustering is estimated sufficiently well. We next develop a model-based analysis using a conjugate Dirichlet process mixture model. The method is general, but for specificity we focus on microarray gene expression data, to which both clustering and multiple testing methods are actively applied. Clusters provide the means for sharing information among genes, and the hybrid methodology averages over uncertainty in these clusters through Markov chain sampling. Simulations show that the hybrid method performs substantially better than other methods when clustering is heavy or moderate and performs well even under weak clustering. The proposed method is illustrated on microarray data from a study of the effects of aging on gene expression in heart tissue.
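The stylized z-score example can be illustrated with a small simulation. The sketch below is not the paper's method: it assumes the true clustering is known (an oracle), whereas the abstract describes averaging over clustering uncertainty with a Dirichlet process mixture model. The function name `simulate_power` and all parameter values (cluster size, effect size, null fraction) are hypothetical choices made only for illustration. Each object gets a z score drawn around its cluster's shared mean; the per-object test uses that z score alone, while the pooled test combines the z scores of all objects in the same cluster.

```python
import math
import random
from statistics import NormalDist


def simulate_power(n_clusters=200, cluster_size=5, effect=1.0,
                   null_fraction=0.5, alpha=0.05, seed=0):
    """Compare per-object z tests with z tests pooled over KNOWN clusters.

    Assumption (not from the paper): cluster membership is given, and every
    object in a cluster shares the same mean (0 for null clusters, `effect`
    otherwise). Returns (power of single tests, power of pooled tests).
    """
    rng = random.Random(seed)
    crit = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value

    hits_single = 0
    hits_pooled = 0
    n_alt = 0  # number of objects whose true mean is non-zero
    for _ in range(n_clusters):
        theta = 0.0 if rng.random() < null_fraction else effect
        z = [rng.gauss(theta, 1.0) for _ in range(cluster_size)]
        if theta == 0.0:
            continue  # only non-null objects count toward power
        n_alt += cluster_size
        # Per-object test: each z score is judged on its own.
        hits_single += sum(abs(zi) > crit for zi in z)
        # Pooled test: sum(z) / sqrt(m) is N(sqrt(m) * theta, 1).
        z_pooled = sum(z) / math.sqrt(cluster_size)
        if abs(z_pooled) > crit:
            hits_pooled += cluster_size  # all members are rejected together
    if n_alt == 0:
        raise ValueError("no non-null clusters were generated")
    return hits_single / n_alt, hits_pooled / n_alt


if __name__ == "__main__":
    single, pooled = simulate_power()
    print(f"power, per-object tests: {single:.2f}")
    print(f"power, cluster-pooled tests: {pooled:.2f}")
```

Pooling m objects that share a mean θ shifts the test statistic from θ to √m·θ while keeping unit variance, which is the source of the power gain in this idealized setting. With an estimated rather than known clustering, misassigned objects would dilute that gain, consistent with the abstract's condition that the clustering be estimated sufficiently well.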
Related articles

Spiked Dirichlet Process Prior for Bayesian Multiple Hypothesis Testing in Random Effects Models.
We propose a Bayesian method for multiple hypothesis testing in random effects models that uses Dirichlet process (DP) priors for a nonparametric treatment of the random effects distribution. We consider a general model formulation which accommodates a variety of multiple treatment conditions. A key feature of our method is the use of a product of spiked distributions, i.e., mixtures of a point...

Simultaneous inference for multiple testing and clustering via a Dirichlet process mixture model
We propose a Bayesian nonparametric regression model that exploits clustering for increased sensitivity in multiple hypothesis testing. We build on the recently proposed BEMMA (Bayesian Effects Models for Microarrays) method which is able to model dependence among objects through clustering and then estimates hypothesis-testing parameters averaged over clustering uncertainty. We propose several...

P 4: The Hypothesis Detect Multiple Sclerosis in Early Stage with Saliva Testing
Introduction: Recent studies point to the clinical and research efficacy of saliva as a respected diagnostic aid for observing Multiple Sclerosis. The objectives of this Hypothesis are to identify novel biomarkers recognized to Multiple Sclerosis in early stage in saliva and to determine if the levels of these markers correlate with level of these Cerebrospinal fluid and blood assays and urine ...

A New Method for Sperm Detection in Infertility Cure: Hypothesis Testing Based on Fuzzy Entropy Decision
In this paper, a new method is introduced for sperm detection in microscopic images for infertility treatment. In this method, firstly a hypothesis testing function is defined to separate sperms from plasma, non-sperm semen particles and noise. Then, some primary candidates are selected for sperms by watershed-based segmentation algorithm. Finally, candidates are either confirmed or rejected us...

Hierarchical Bayesian Methods in Ecology
Ecosystems are dynamic in both space and time, hence involve multiple spatial and temporal scales, and are often heterogeneous in both of those dimensions, leading to spatial and temporal clustering. Accommodating this complexity in the context of scientific (statistical) hypothesis testing necessitates more advanced methods than those available within the classical null hypothesis testing para...