How To Use CORREP to Estimate Multivariate Correlation and Statistical Inference Procedures
نویسنده
چکیده
OMICS data are increasingly available to biomedical researchers, and (biological) replications are more and more affordable for gene microarray experiments or proteomics experiments. The functional relationship between a pair of genes or proteins are often inferred by calculating correlation coefficient between their expression profiles. Classical correlation estimation techniques, such as Pearson correlation coefficient, do not explicitly take replicated data into account. As a result, biological replicates are often averaged before correlations are calculated. The averaging is not justified if there is poor concordance between samples and the variance in each sample is not similar. Based on our recently proposed multivariate correlation estimator, CORREP implements functions for estimating multivariate correlation for replicated OMICS data and statistical inference procedures. In this vignette I demo an non-trivial task accomplished using CORREP. First let’s look at examples of replicated OMICS data and non-replicated OMICS data. x11, x12, . . . , x1n
منابع مشابه
Multivariate correlation estimator for inferring functional relationships from replicated genome-wide data
UNLABELLED Estimating pairwise correlation from replicated genome-scale (a.k.a. OMICS) data is fundamental to cluster functionally relevant biomolecules to a cellular pathway. The popular Pearson correlation coefficient estimates bivariate correlation by averaging over replicates. It is not completely satisfactory since it introduces strong bias while reducing variance. We propose a new multiva...
متن کاملMultivariate process capability indices on the presence of priority for quality characteristics
Multivariate Process Capability Indices (MPCI) show how well a manufacturing process can meet specifica-tion limits when quality characteristics enclose a relative correlation. Process capability is an important and commonly used metric for assessing and improving the quality of a production process. When quality charac-teristics of a product are correlated then an attractive comes close to MPC...
متن کاملW. NIEMIRO (Warszawa) ESTIMATION OF NUISANCE PARAMETERS FOR INFERENCE BASED ON LEAST ABSOLUTE DEVIATIONS
Statistical inference procedures based on least absolute deviations involve estimates of a matrix which plays the role of a multivariate nuisance parameter. To estimate this matrix, we use kernel smoothing. We show consistency and obtain bounds on the rate of convergence.
متن کاملKullback-Leibler distance as a measure of the information filtered from multivariate data.
We show that the Kullback-Leibler distance is a good measure of the statistical uncertainty of correlation matrices estimated by using a finite set of data. For correlation matrices of multivariate Gaussian variables we analytically determine the expected values of the Kullback-Leibler distance of a sample correlation matrix from a reference model and we show that the expected values are known ...
متن کاملSample size determination for logistic regression
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...
متن کامل