On exploring the similarity and fusion of i-vector and sparse representation based speaker verification systems
Abstract
The total variability based i-vector has become one of the dominant approaches to speaker verification. Recently, sparse representation (SR) based speaker verification approaches have also been proposed and found to give comparable performance. In the SR based approach, the dictionary used for sparse representation is either exemplar based or learned from data using the K-SVD algorithm and its variants. The use of the total variability matrix of the i-vector system as the dictionary for the SR based approach has also been reported recently. Motivated by these, in this work we first highlight the similarity between the i-vector and the learned-dictionary SR based approaches to speaker verification. We then explore various kinds of learned dictionaries, their sizes, and the sparsity constraint in the context of SR based speaker verification. Finally, we explore feature-level as well as score-level fusion of the two approaches.
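As a rough illustration of the SR side of the abstract, the sketch below sparse-codes a vector over a dictionary with a hand-written orthogonal matching pursuit, scores two sparse codes by cosine similarity, and fuses that score with an i-vector score by a weighted sum. None of this is taken from the paper: the function names `omp`, `cosine_score` and `fuse_scores`, the choice of OMP as the sparse solver, the cosine scoring, the fusion weight and the random data are all assumptions made for illustration, and the i-vector score is simply passed in as a number.

```python
import numpy as np


def omp(D, x, k):
    """Greedy orthogonal matching pursuit (illustrative, not the paper's solver).

    Approximates x using at most k columns (atoms) of the dictionary D and
    returns the coefficient vector, which has at most k non-zero entries.
    """
    residual = x.astype(float).copy()
    support = []
    sol = np.zeros(0)
    coef = np.zeros(D.shape[1])
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:
            break  # no new atom helps; stop early
        support.append(j)
        # Re-fit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    if support:
        coef[support] = sol
    return coef


def cosine_score(a, b):
    """Cosine similarity between two representation vectors (0.0 if either is zero)."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def fuse_scores(score_ivector, score_sr, weight=0.5):
    """Score-level fusion as a weighted sum; the weight is a hypothetical tuning knob."""
    return weight * score_ivector + (1.0 - weight) * score_sr


# Toy usage with random data standing in for supervectors and a learned dictionary.
rng = np.random.default_rng(0)
D = rng.standard_normal((60, 120))
D /= np.linalg.norm(D, axis=0)           # unit-norm atoms
enrol = rng.standard_normal(60)
test = enrol + 0.1 * rng.standard_normal(60)

sr_score = cosine_score(omp(D, enrol, k=5), omp(D, test, k=5))
fused = fuse_scores(score_ivector=0.7, score_sr=sr_score, weight=0.5)
```

Here `k` plays the role of the sparsity constraint and the number of columns of `D` is the dictionary size, the two quantities the abstract says are explored. Feature-level fusion would instead combine the two representations before scoring and is not shown.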
Similar resources
Fusion of Thermal Infrared and Visible Images Based on Multi-scale Transform and Sparse Representation
Due to the differences between visible and thermal infrared images, combining these two types of images is essential for better understanding the characteristics of targets and the environment. Thermal infrared images are most important for distinguishing targets from the background based on radiation differences, which work well in all-weather and day/night conditions also in land s...
Hyperspectral Image Classification Based on the Fusion of the Features Generated by Sparse Representation Methods, Linear and Non-linear Transformations
The ability to record the high-resolution spectral signature of the earth's surface is the most important feature of hyperspectral sensors. Classification of hyperspectral imagery, in turn, is one of the methods for extracting information from these remote sensing data sources. Despite the high potential of hyperspectral images from the information content point of view, there...
Speaker Verification Using Sparse Representations on Total Variability i-vectors
In this paper, the sparse representation computed by ℓ1-minimization with quadratic constraints is employed to model the i-vectors in the low dimensional total variability space after performing the Within-Class Covariance Normalization and Linear Discriminant Analysis channel compensation. First, we propose the background normalized l residual as a scoring criterion. Second, we demonstrate that ...
Modeling the potential of Sand and Dust Storm sources formation using time series of remote sensing data, fuzzy logic and artificial neural network (A Case study of Euphrates basin)
Due to the differences between visible and thermal infrared images, the combination of these two types of images leads to a better understanding of the characteristics of targets and the environment. Thermal infrared images are valuable for distinguishing targets from the background based on radiation differences and for land surface temperature (LST) calculation. However, their spatial resolu...
Speaker Verification using Lasso based Sparse Total Variability Supervector and Probabilistic Linear Discriminant Analysis
In this paper, we propose a Lasso based framework to generate the sparse total variability supervectors (s-vectors). Rather than the factor analysis framework, which uses a low dimensional Eigenvoice subspace to represent the mean supervector, the proposed Lasso approach utilizes ℓ1-norm regularized least-squares estimation to project the mean supervector onto a pre-defined dictionary. The numb...