BayesSUR: An R Package for High-Dimensional Multivariate Bayesian Variable and Covariance Selection in Linear Regression
نویسندگان
چکیده
In molecular biology, advances in high-throughput technologies have made it possible to study complex multivariate phenotypes and their simultaneous associations with high-dimensional genomic other omics data, a problem that can be studied multi-response regression, where the response variables are potentially highly correlated. To this purpose, we recently introduced several Bayesian variable covariance selection models, e.g., estimation methods for sparse seemingly unrelated regression selection. Several priors been implemented context, particular hotspot detection prior latent inclusion indicators, which results between predictors multiple phenotypes. We also propose an alternative, uses Markov random field (MRF) incorporating knowledge about dependence structure of indicators. Inference (SUR) by chain Monte Carlo is computationally feasible factorization matrix amongst variables. paper present BayesSUR, R package, allows user easily specify run range different SUR C++ computational efficiency. The package specification models modular way, chooses separately. demonstrate performance spike-and-slab MRF on synthetic real data sets representing eQTL or mQTL studies vitro anti-cancer drug screening as examples typical applications.
منابع مشابه
FWDselect: An R Package for Variable Selection in Regression Models
In multiple regression models, when there are a large number (p) of explanatory variables which may or may not be relevant for predicting the response, it is useful to be able to reduce the model. To this end, it is necessary to determine the best subset of q (q ≤ p) predictors which will establish the model with the best prediction capacity. FWDselect package introduces a new forward stepwiseb...
متن کاملVariable Clustering in High-Dimensional Linear Regression: The R Package clere
Dimension reduction is one of the biggest challenges in high-dimensional regression models. We recently introduced a new methodology based on variable clustering as a means to reduce dimensionality. We present here the R package clere that implements some refinements of this methodology. An overview of the package functionalities as well as examples to run an analysis are described. Numerical e...
متن کاملHigh-Dimensional Bayesian Clustering with Variable Selection: The R Package bclust
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a parametric spike-and-slab Bayesian model to downweight the effect of noise variables and to quantify the importance of each variable in agglomerative clustering. We take advantage of the existence of closed-form marginal distributions to estimate the model hyper-parameters using empirical Bayes, t...
متن کاملVariable Selection for High Dimensional Multivariate Outcomes.
We consider variable selection for high-dimensional multivariate regression using penalized likelihoods when the number of outcomes and the number of covariates might be large. To account for within-subject correlation, we consider variable selection when a working precision matrix is used and when the precision matrix is jointly estimated using a two-stage procedure. We show that under suitabl...
متن کاملAn R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation
This paper describes an R package named flare, which implements a family of new high dimensional regression methods (LAD Lasso, SQRT Lasso, `q Lasso, and Dantzig selector) and their extensions to sparse precision matrix estimation (TIGER and CLIME). These methods exploit different nonsmooth loss functions to gain modeling flexibility, estimation robustness, and tuning insensitiveness. The devel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Statistical Software
سال: 2021
ISSN: ['1548-7660']
DOI: https://doi.org/10.18637/jss.v100.i11