Metric recovery from directed unweighted graphs
نویسندگان
چکیده
We analyze directed, unweighted graphs obtained from xi ∈ R by connecting vertex i to j iff |xi − xj | < ε(xi). Examples of such graphs include k-nearest neighbor graphs, where ε(xi) varies from point to point, and, arguably, many real-world graphs such as copurchasing graphs. We ask whether we can recover the underlying Euclidean metric ε(xi) and the associated density p(xi) given only the directed graph and d. We show that consistent recovery is possible up to isometric scaling when the vertex degree is at least ω(n log(n)). Our estimator is based on a careful characterization of a random walk over the directed graph and the associated continuum limit. As an algorithm, it resembles the PageRank centrality metric. We demonstrate empirically that the estimator performs well on simulated examples as well as on real-world co-purchasing graphs even with a small number of points and degree scaling as low as log(n).
منابع مشابه
From random walks to distances on unweighted graphs
Large unweighted directed graphs are commonly used to capture relations between entities. A fundamental problem in the analysis of such networks is to properly define the similarity or dissimilarity between any two vertices. Despite the significance of this problem, statistical characterization of the proposed metrics has been limited. We introduce and develop a class of techniques for analyzin...
متن کاملNew Parameterized Algorithms for APSP in Directed Graphs
All Pairs Shortest Path (APSP) is a classic problem in graph theory. While for general weighted graphs there is no algorithm that computes APSP in O(n3−ε) time (ε > 0), by using fast matrix multiplication algorithms, we can compute APSP in O(n logn) time (ω < 2.373) for undirected unweighted graphs, and in O(n2.5302) time for directed unweighted graphs. In the current state of matters, there is...
متن کاملA Constant-Factor Approximation Algorithm for Embedding Unweighted Graphs into Trees
We present a constant-factor approximation algorithm for computing an embedding of the shortest path metric of an unweighted graph into a tree, that minimizes the multiplicative distortion.
متن کاملOn the complexity of the multicut problem in bounded tree-width graphs and digraphs
Given an edgeor vertex-weighted graph or digraph and a list of source-sink pairs, the minimum multicut problem consists in selecting a minimum weight set of edges or vertices whose removal leaves no path from each source to the corresponding sink. This is a classical NPhard problem, and we show that the edge version becomes tractable in bounded tree-width graphs if the number of source-sink pai...
متن کاملThe Directed Minimum Latency Problem
We study the directed minimum latency problem: given an n-vertex asymmetric metric (V, d) with a root vertex r ∈ V , find a spanning path originating at r that minimizes the sum of latencies at all vertices (the latency of any vertex v ∈ V is the distance from r to v along the path). This problem has been well-studied on symmetric metrics, and the best known approximation guarantee is 3.59 [3]....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1411.5720 شماره
صفحات -
تاریخ انتشار 2015