Principal Cumulant Component Analysis
نویسنده
چکیده
Multivariate Gaussian data is completely characterized by its mean and covariance, yet modern non-Gaussian data makes higher-order statistics such as cumulants inevitable. For univariate data, the third and fourth scalar-valued cumulants are relatively well-studied as skewness and kurtosis. For multivariate data, these cumulants are tensor-valued, higher-order analogs of the covariance matrix capturing higher-order dependence in the data. In addition to their relative obscurity, there are few effective methods for analyzing these cumulant tensors. We propose a technique along the lines of Principal Component Analysis and Independent Component Analysis to analyze multivariate, non-Gaussian data motivated by the multilinear algebraic properties of cumulants. Our method relies on finding principal cumulant components that account for most of the variation in all higher-order cumulants, just as PCA obtains varimax components. An efficient algorithm based on limited-memory quasi-Newton maximization over a Grassmannian, using only standard matrix operations, may be used to find the principal cumulant components. Numerical experiments include forecasting higher portfolio moments and image dimension reduction.
منابع مشابه
Detecting spatial patterns with the cumulant function – Part 1: The theory
In climate studies, detecting spatial patterns that largely deviate from the sample mean still remains a statistical challenge. Although a Principal Component Analysis (PCA), or equivalently a Empirical Orthogonal Functions (EOF) decomposition, is often applied for this purpose, it provides meaningful results only if the underlying multivariate distribution is Gaussian. Indeed, PCA is based on ...
متن کاملDetecting spatial patterns with the cumulant function. Part I: The theory
In climate studies, detecting spatial patterns that largely deviate from the sample mean still remains a statistical challenge. Although a Principal Component Analysis (PCA), or equivalently a Empirical Orthogonal Functions (EOF) decomposition, is often applied on this purpose, it can only provide meaningful results if the underlying multivariate distribution is Gaussian. Indeed, PCA is based o...
متن کاملA Cumulant-Based Method for Gait Identification Using Accelerometer Data with Principal Component Analysis and Support Vector Machine
In this paper a cumulant-based method for identification of gait using accelerometer data is presented. Acceleration data of three different walking speeds (slow, normal and fast) for each subject was acquired by the accelerometer embedded in cell phone which was attached to the person's hip. Data analysis was based on gait cycles that were detected first. Cumulants of order from 1 to 4 with di...
متن کاملPrincipal Independent Component Analysis with Multi-Reference
Conventional Blind Signal Separation algorithms do not adopt any asymmetric information of the input sources, thus the convergence point of a single output is always unpredictable. In this paper, a new Principal Independent Component Analysis concept is proposed, we try to extract the objective Independent Component directly without separating all the signals. A cumulant-based globally converge...
متن کاملPrincipal Component, Independent Component and Parallel Factor Analysis
This talk is an introduction to Independent Component Analysis (ICA) and Parallel Factor Analysis (PARAFAC), the way they are related and their links with Principal Component Analysis (PCA). PCA is now a standard technique for the analysis of two-way multivariate data, i.e., data available in matrix format. However, principal components are subject to rotational in-variance. By imposing statist...
متن کامل