Gaussian mixture distance for information retrieval
نویسندگان
چکیده
In most Information Retrieval (IR) applications, Euclidean distance is used for similarity measurement. It is adequate in many cases but this distance metric is not very accurate when there exist some different local data distributions in the database. We propose a Gaussian mixture distance for performing accurate nearest-neighbor search for Information Retrieval (IR). Under an established Gaussian finite mixture model for the distribution of the data in the database, the Gaussian mixture distance is formulated based on minimizing the Kullback-Leibler (KL) divergence between the distribution of the retrieval data and the data in database. We compared the performance of the Gaussian mixture distance with the well-known Euclidean and Mahalanobis distance based on a precision performance measurement. Experimental results demonstrate that the Gaussian mixture distance function is superior in the others for different types of testing data.
منابع مشابه
An Analytic Distance Metric for Gaussian Mixture Models with Application in Image Retrieval
In this paper we propose a new distance metric for probability density functions (PDF). The main advantage of this metric is that unlike the popular Kullback-Liebler (KL) divergence it can be computed in closed form when the PDFs are modeled as Gaussian Mixtures (GM). The application in mind for this metric is histogram based image retrieval. We experimentally show that in an image retrieval sc...
متن کاملProbabilistic relevance feedback approach for content-based image retrieval based on gaussian mixture models
A new relevance feedback (RF) approach for content-based image retrieval is presented. This approach uses Gaussian mixture (GM) models of the image features and a query that is updated in a probabilistic manner. This update reflects the preferences of the user and is based on the models of both the positive and negative feedback images. The retrieval is based on a recently proposed distance mea...
متن کاملPerceptual Normalized Information Distance for Image Distortion Analysis Based on Kolmogorov Complexity
Image distortion analysis is a fundamental issue in many image processing problems, including compression, restoration, recognition, classification, and retrieval. In this work, we investigate the problem of image distortion measurement based on the theories of Kolmogorov complexity and normalized information distance (NID), which have rarely been studied in the context of image processing. Bas...
متن کاملImage retrieval with embedded sub-class information using Gaussian mixture models
This paper describes content-based image retrieval techniques within the relevance feedback framework. The Gaussian mixture model (GMM) is used to characterize sub-class information to increase retrieval accuracy and reduce number of interactions during a query session. The implementation of GMM is based on the radial basis function using a new learning algorithm that can cope with small traini...
متن کاملSinging Voice Classification in Commercial Music Productions
Instead of the expansion of the information retrieval systems, the music information retrieval domain is still an open one. In this context, the singing voice classification is a promised trend. In this paper, we shall present our experiments concerning the classification of singers according to their voice type, and their voice quality. Some experiments were carried in which two sets of parame...
متن کامل