A Unified Framework for Correlation Mining in Ultra-High Dimension

نویسندگان

چکیده

Many applications benefit from theory relevant to the identification of variables having large correlations or partial in high dimension. Recently there has been progress ultra-high dimensional setting when sample size $n$ is fixed and dimension notation="LaTeX">$p$ tends infinity. Despite these advances, correlation screening framework suffers practical, methodological theoretical deficiencies. For instance, previous requires that population covariance matrix be sparse block diagonal. This sparsity assumption however restrictive practical applications. As a second example, estimation dependence measures, which can computationally prohibitive. In this paper, we propose unifying approach mining not restricted diagonal structure, thus yielding methodology suitable for modern By making connections random geometric graphs, number highly correlated are shown have compound Poisson finite-sample characterizations, hold both finite case The also demonstrates duality between with consequences.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Unified Theoretical Framework for Data Mining

The pattern extraction and discovery of useful information from a dataset are the foremost purposes of data mining; the multiple attempts and strong beliefs in the development and the formulation of the unified data mining frameworks that would answer to the fundamental versions related to the discovery of knowledge. In this paper we are presenting a novel unified framework for data mining conc...

متن کامل

A Unified Framework for Dimension Reduction in Forecasting

Factor models are widely used in summarizing large datasets with few underlying latent factors and in building time series forecasting models for economic variables. In these models, the reduction of the predictors and the modeling and forecasting of the response y are carried out in two separate and independent phases. We introduce a potentially more attractive alternative, Sufficient Dimensio...

متن کامل

a framework for identifying and prioritizing factors affecting customers’ online shopping behavior in iran

the purpose of this study is identifying effective factors which make customers shop online in iran and investigating the importance of discovered factors in online customers’ decision. in the identifying phase, to discover the factors affecting online shopping behavior of customers in iran, the derived reference model summarizing antecedents of online shopping proposed by change et al. was us...

15 صفحه اول

A Unified Framework for Utility Based Measures for Mining Itemsets

A pattern is of utility to a person if its use by that person contributes to reaching a goal. Utility based measures use the utilities of the patterns to reflect the user’s goals. In this paper, we first review utility based measures for itemset mining. Then, we present a unified framework for incorporating several utility based measures into the data mining process by defining a unified utilit...

متن کامل

Optimized Rule Mining Through a Unified Framework for Interestingness Measures

The large amount of association rules resulting from a KDD process makes the exploitation of the patterns embedded in the database difficult even impossible. In order to address this problem, various interestingness measures were proposed for selecting the most relevant rules. Nevertheless, the choice of an appropriate measure remains a hard task and the use of several measures may lead to conf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Information Theory

سال: 2023

ISSN: ['0018-9448', '1557-9654']

DOI: https://doi.org/10.1109/tit.2022.3200577