A Unified Framework for Correlation Mining in Ultra-High Dimension
نویسندگان
چکیده
Many applications benefit from theory relevant to the identification of variables having large correlations or partial in high dimension. Recently there has been progress ultra-high dimensional setting when sample size $n$ is fixed and dimension notation="LaTeX">$p$ tends infinity. Despite these advances, correlation screening framework suffers practical, methodological theoretical deficiencies. For instance, previous requires that population covariance matrix be sparse block diagonal. This sparsity assumption however restrictive practical applications. As a second example, estimation dependence measures, which can computationally prohibitive. In this paper, we propose unifying approach mining not restricted diagonal structure, thus yielding methodology suitable for modern By making connections random geometric graphs, number highly correlated are shown have compound Poisson finite-sample characterizations, hold both finite case The also demonstrates duality between with consequences.
منابع مشابه
A Unified Theoretical Framework for Data Mining
The pattern extraction and discovery of useful information from a dataset are the foremost purposes of data mining; the multiple attempts and strong beliefs in the development and the formulation of the unified data mining frameworks that would answer to the fundamental versions related to the discovery of knowledge. In this paper we are presenting a novel unified framework for data mining conc...
متن کاملA Unified Framework for Dimension Reduction in Forecasting
Factor models are widely used in summarizing large datasets with few underlying latent factors and in building time series forecasting models for economic variables. In these models, the reduction of the predictors and the modeling and forecasting of the response y are carried out in two separate and independent phases. We introduce a potentially more attractive alternative, Sufficient Dimensio...
متن کاملa framework for identifying and prioritizing factors affecting customers’ online shopping behavior in iran
the purpose of this study is identifying effective factors which make customers shop online in iran and investigating the importance of discovered factors in online customers’ decision. in the identifying phase, to discover the factors affecting online shopping behavior of customers in iran, the derived reference model summarizing antecedents of online shopping proposed by change et al. was us...
15 صفحه اولA Unified Framework for Utility Based Measures for Mining Itemsets
A pattern is of utility to a person if its use by that person contributes to reaching a goal. Utility based measures use the utilities of the patterns to reflect the user’s goals. In this paper, we first review utility based measures for itemset mining. Then, we present a unified framework for incorporating several utility based measures into the data mining process by defining a unified utilit...
متن کاملOptimized Rule Mining Through a Unified Framework for Interestingness Measures
The large amount of association rules resulting from a KDD process makes the exploitation of the patterns embedded in the database difficult even impossible. In order to address this problem, various interestingness measures were proposed for selecting the most relevant rules. Nevertheless, the choice of an appropriate measure remains a hard task and the use of several measures may lead to conf...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Theory
سال: 2023
ISSN: ['0018-9448', '1557-9654']
DOI: https://doi.org/10.1109/tit.2022.3200577