Iterative bicluster-based least square framework for estimation of missing values in microarray gene expression data

نویسندگان
چکیده

منابع مشابه

Iterative bicluster-based least square framework for estimation of missing values in microarray gene expression data

DNA microarray experiment inevitably generates gene expression data with missing values. An important and necessary pre-processing step is thus to impute these missing values. Existing imputation methods exploit gene correlation among all experimental conditions for estimating the missing values. However, related genes coexpress in subsets of experimental conditions only. In this paper, we prop...

متن کامل

Missing value estimation for DNA microarray gene expression data: local least squares imputation

MOTIVATION Gene expression data often contain missing expression values. Effective missing value estimation methods are needed since many algorithms for gene expression data analysis require a complete matrix of gene array values. In this paper, imputation methods based on the least squares formulation are proposed to estimate missing values in the gene expression data, which exploit local simi...

متن کامل

LSimpute: accurate estimation of missing values in microarray data with least squares methods.

Microarray experiments generate data sets with information on the expression levels of thousands of genes in a set of biological samples. Unfortunately, such experiments often produce multiple missing expression values, normally due to various experimental problems. As many algorithms for gene expression analysis require a complete data matrix as input, the missing values have to be estimated i...

متن کامل

Missing Values Estimation in Microarray Data with Partial Least Squares Regression

Microarray data usually contain missing values, thus estimating these missing values is an important preprocessing step. This paper proposes an estimation method of missing values based on Partial Least Squares (PLS) regression. The method is feasible for microarray data, because of the characteristics of PLS regression. We compared our method with three methods, including ROWaverage, KNNimpute...

متن کامل

A Bicluster-Based Missing Value Imputation Method for Gene Expression Data

Missing values are often encountered in gene expression data sets. Several imputation methods have been proposed to estimate these missing values. In this paper, a new impute approach based on bicluster is introduced. By using the method of minimization the coherence of subset of genes expression matrix, the missing value is estimated with a more accurate result to improve the overall coherence...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition

سال: 2012

ISSN: 0031-3203

DOI: 10.1016/j.patcog.2011.10.012