EMD-L1: An Efficient and Robust Algorithm for Comparing Histogram-Based Descriptors
نویسندگان
چکیده
We propose a fast algorithm, EMD-L1, for computing the Earth Mover’s Distance (EMD) between a pair of histograms. Compared to the original formulation, EMD-L1 has a largely simplified structure. The number of unknown variables in EMD-L1 is O(N) that is significantly less than O(N) of the original EMD for a histogram with N bins. In addition, the number of constraints is reduced by half and the objective function is also simplified. We prove that the EMD-L1 is formally equivalent to the original EMD with L1 ground distance without approximation. Exploiting the L1 metric structure, an efficient tree-based algorithm is designed to solve the EMD-L1 computation. An empirical study demonstrates that the new algorithm has the time complexity of O(N), which is much faster than previously reported algorithms with super-cubic complexities. The proposed algorithm thus allows the EMD to be applied for comparing histogram-based features, which is practically impossible with previous algorithms. We conducted experiments for shape recognition and interest point matching. EMD-L1 is applied to compare shape contexts on the widely tested MPEG7 shape dataset and SIFT image descriptors on a set of images with large deformation, illumination change and heavy noise. The results show that our EMD-L1based solutions outperform previously reported state-of-the-art features and distance measures in solving the two tasks.
منابع مشابه
A Linear Time Histogram Metric for Improved SIFT Matching
We present a new metric between histograms such as SIFT descriptors and a linear time algorithm for its computation. It is common practice to use the L2 metric for comparing SIFT descriptors. This practice assumes that SIFT bins are aligned, an assumption which is often not correct due to quantization, distortion, occlusion etc. In this paper we present a new Earth Mover’s Distance (EMD) varian...
متن کاملOffline Language-free Writer Identification based on Speeded-up Robust Features
This article proposes offline language-free writer identification based on speeded-up robust features (SURF), goes through training, enrollment, and identification stages. In all stages, an isotropic Box filter is first used to segment the handwritten text image into word regions (WRs). Then, the SURF descriptors (SUDs) of word region and the corresponding scales and orientations (SOs) are extr...
متن کاملRobust color histogram descriptors for video segment retrieval and identification
Effective and efficient representation of color features of multiple video frames or pictures is an important yet challenging task for visual information management systems. Key frame-based methods to represent the color features of a group of frames (GoF) are highly dependent on the selection criterion of the representative frame(s), and may lead to unreliable results. We present various histo...
متن کاملDistance between Distributions with Special Topologies of Cost Matrices
Abstract Comparing two distributions plays important role in many problems. The traditional minimum cost flow problem has been utilized as a distance measure between two distributions (transportation problem) such as the earth mover’ distance (EMD). If the distributions have b number of bins, the cost matrix is b × b square matrix. While generic algorithms such as Simplex method to compute the ...
متن کاملEfficient image retrieval using MPEG-7 descriptors
In this paper, a new method to calculate the similarity among images using Dominant Color Descriptor is discussed. Using Earth Mover’s Distance (EMD), better retrieval results can be obtained compared with those obtained from the original MPEG-7 reference software (XM) [1]. To further improve the retrieval accuracy, texture information from Edge Histogram Descriptor is added. In order to reduce...
متن کامل