Efficient Processing Node Proximity via Random Walk with Restart
نویسندگان
چکیده
Graph is a useful tool to model complicated data structures. One important task in graph analysis is assessing node proximity based on graph topology. Recently, RandomWalk with Restart (RWR) tends to pop up as a promising measure of node proximity, due to its proliferative applications in e.g. recommender systems, and image segmentation. However, the best-known algorithm for computing RWR resorts to a large LU matrix factorization on an entire graph, which is cost-inhibitive. In this paper, we propose hybrid techniques to efficiently compute RWR. First, a novel divide-and-conquer paradigm is designed, aiming to convert the large LU decomposition into small triangular matrix operations recursively on several partitioned subgraphs. Then, on every subgraph, a “sparse accelerator” is devised to further reduce the time of RWR without any sacrifice in accuracy. Our experimental results on real and synthetic datasets show that our approach outperforms the baseline algorithms by at least one constant factor without loss of exactness.
منابع مشابه
Reverse Top-k Search using Random Walk with Restart
With the increasing popularity of social networks, large volumes of graph data are becoming available. Large graphs are also derived by structure extraction from relational, text, or scientific data (e.g., relational tuple networks, citation graphs, ontology networks, protein-protein interaction graphs). Node-to-node proximity is the key building block for many graph-based applications that sea...
متن کاملEfficient collection of sensor data via a new accelerated random walk
Motivated by the problem of efficiently collecting data from wireless sensor networks via a mobile sink, we present an accelerated random walk on Random Geometric Graphs. Random walks in wireless sensor networks can serve as fully local, lightweight strategies for sink motion that significantly reduce energy dissipation but introduce higher latency in the data collection process. In most cases ...
متن کاملFast and Exact Top-k Search for Random Walk with Restart
Graphs are fundamental data structures and have been em-ployed for centuries to model real-world systems and phe-nomena. Random walk with restart (RWR) provides a goodproximity score between two nodes in a graph, and it hasbeen successfully used in many applications such as auto-matic image captioning, recommender systems, and link pre-diction. The goal of this work is t...
متن کاملScaling Random Walk with Restart over Dynamic Networks
Random Walk with Restart (RWR) is an appealing measure of proximity between nodes based on network topologies. As real graphs are becoming larger and subject to minor changes, it is rather cost-inhibitive to recompute proximities from scratch. Existing method utilizes LU decomposition and degree reordering heuristics. As a result, it yields O(|V |) time and O(|V |) memory to compute all (|V |) ...
متن کاملSupervised and Extended Restart in Random Walks for Ranking and Link Prediction in Networks
Given a real-world graph, how can we measure relevance scores for ranking and link prediction? Random walk with restart (RWR) provides an excellent measure for this and has been applied to various applications such as friend recommendation, community detection, anomaly detection, etc. However, RWR suffers from two problems: 1) using the same restart probability for all the nodes limits the expr...
متن کامل