Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation
نویسندگان
چکیده
In recent years, semi-supervised graph learning with data augmentation (DA) has been the most commonly used and best-performing method to improve model robustness in sparse scenarios few labeled samples. However, of existing DA methods are based on homogeneous while none specific for heterogeneous graph. Differing from graph, faces greater challenges: heterogeneity information requires strategies effectively handle relations, which considers contribution different types neighbors edges target nodes. Furthermore, over-squashing is caused by negative curvature that formed non-uniformity distribution strong clustering complex To address these challenges, this paper presents a novel named Semi-Supervised Heterogeneous Graph Learning Multi-level Data Augmentation (HG-MDA). For problem DA, node topology proposed characteristics And meta-relation-based attention applied as one indexes selecting augmented nodes edges. information, triangle edge adding removing designed alleviate bring gain topology. Finally, loss function consists cross-entropy consistency regularization unlabeled data. order fuse prediction results various strategies, sharpening used. Existing experiments public datasets, i.e., ACM, DBLP, OGB, industry dataset MB show HG-MDA outperforms current SOTA models. Additionly, user identification internet finance scenarios, helping business add 30% key users, increase loans balances 3.6%, 11.1%, 9.8%.
منابع مشابه
Semi - supervised Learning Methods for Data Augmentation
The original goal of this project was to investigate the extent to which data augmentation schemes based on semi-supervised learning algorithms can improve classification accuracy in supervised learning problems. The objectives included determining the appropriate algorithms, customising them for the purposes of this project and providing their Matlab implementations. These algorithms were to b...
متن کاملRevisiting Semi-Supervised Learning with Graph Embeddings
We present a semi-supervised learning framework based on graph embeddings. Given a graph between instances, we train an embedding for each instance to jointly predict the class label and the neighborhood context in the graph. We develop both transductive and inductive variants of our method. In the transductive variant of our method, the class labels are determined by both the learned embedding...
متن کاملSemi-supervised Learning with Spectral Graph Wavelets
We consider the transductive learning problem when the labels belong to a continuous space. Through the use of spectral graph wavelets, we explore the benefits of multiresolution analysis on a graph constructed from the labeled and unlabeled data. The spectral graph wavelets behave like discrete multiscale differential operators on graphs, and thus can sparsely approximate piecewise smooth sign...
متن کاملGraph-Based Semi-Supervised Learning
While labeled data is expensive to prepare, ever increasing amounts of unlabeled data is becoming widely available. In order to adapt to this phenomenon, several semi-supervised learning (SSL) algorithms, which learn from labeled as well as unlabeled data, have been developed. In a separate line of work, researchers have started to realize that graphs provide a natural way to represent data in ...
متن کاملGraph Based Multi-class Semi-supervised Learning Using Gaussian Process
This paper proposes a multi-class semi-supervised learning algorithm of the graph based method. We make use of the Bayesian framework of Gaussian process to solve this problem. We propose the prior based on the normalized graph Laplacian, and introduce a new likelihood based on softmax function model. Both the transductive and inductive problems are regarded as MAP (Maximum A Posterior) problem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Knowledge Discovery From Data
سال: 2023
ISSN: ['1556-472X', '1556-4681']
DOI: https://doi.org/10.1145/3608953