A unified dimensionality reduction framework for semi-paired and semi-supervised multi-view data

نویسندگان

  • Xiaohong Chen
  • Songcan Chen
  • Hui Xue
  • Xudong Zhou
چکیده

semi-supervised multi-view data Xiaohong Chen, Songcan Chen, Hui Xue , Xudong Zhou 1 Department of Mathematics, Nanjing University of Aeronautics & Astronautics, Nanjing, 210016, China 2 Department of Computer Science and Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing, 210016, China 3 State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210093, China 4 School of Computer Science and Engineering, Southeast University, Nanjing, 210096, China Abstract—Canonical correlation analysis (CCA) is a popular and powerful dimensionality reduction method to analyze paired multi-view data. However, when facing semi-paired and semi-supervised multi-view data which widely exist in real-world problems, CCA usually performs poorly due to its requirement of data pairing between different views and un-supervision in nature. Recently, several extensions of CCA have been proposed, however, they just handle the semi-paired scenario by utilizing structure information in each view or just deal with semi-supervised scenario by incorporating the discriminant information. In this paper, we present a general dimensionality reduction framework for semi-paired and semi-supervised multi-view data which naturally generalizes existing related works by using different kinds of prior information. Based on the framework, we develop a novel dimensionality reduction method, termed as semi-paired and semi-supervised generalized correlation analysis (SGCA). SGCA exploits a small amount of paired data to perform CCA and at the same time, utilizes both the global structural information captured from the unlabeled data and the local discriminative information captured from the limited labeled data to compensate the limited pairedness. Consequently, SGCA can find the directions which make not only maximal correlation between the paired data but also maximal separability of the labeled data. Experimental results on artificial and four real-world datasets show its effectiveness compared to the existing related dimensionality reduction methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A unified semi-supervised dimensionality reduction framework for manifold learning

We present a general framework of semi-supervised dimensionality reduction for manifold learning which naturally generalizes existing supervised and unsupervised learning frameworks which apply the spectral decomposition. Algorithms derived under our framework are able to employ both labeled and unlabeled examples and are able to handle complex problems where data form separate clusters of mani...

متن کامل

Semi Supervised Multi Kernel (SeSMiK) Graph Embedding: Identifying Aggressive Prostate Cancer via Magnetic Resonance Imaging and Spectroscopy

With the wide array of multi scale, multi-modal data now available for disease characterization, the major challenge in integrated disease diagnostics is to able to represent the different data streams in a common framework while overcoming differences in scale and dimensionality. This common knowledge representation framework is an important pre-requisite to develop integrated meta-classifiers...

متن کامل

SUPERVISED SEMI−DEFINITE EMBEDING FOR IMAGE MANIFOLDS (WedPmPO1)

Semi−definite Embedding (SDE) is a recently proposed to maximize the sum of pair wise squared distances between outputs while the input data and outputs are locally isometric, i.e. it pulls the outputs as far apart as possible, subject to unfolding a manifold without any furling or fold for unsupervised nonlinear dimensionality reduction. The extensions of SDE to supervised feature extraction, ...

متن کامل

An Information Theoretic Framework for Multi-view Learning

In the multi-view learning paradigm, the input variable is partitioned into two different views X1 and X2 and there is a target variable Y of interest. The underlying assumption is that either view alone is sufficient to predict the target Y accurately. This provides a natural semi-supervised learning setting in which unlabeled data can be used to eliminate hypothesis from either view, whose pr...

متن کامل

Multi-view hypergraph learning by patch alignment framework

Graph-based methods are currently popular for dimensionality reduction. However, most of them suffer from over-simplified assumption of pairwise relationships among data. Especially for multi-view data, different relationships from different views are hard to be integrated into a single graph. In this paper, we propose a novel semi-supervised dimensionality reduction method for multi-view data....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition

دوره 45  شماره 

صفحات  -

تاریخ انتشار 2012