Large margin Gaussian mixture models for speaker identification
Abstract
Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decade. However, they are generally trained using the generative criterion of maximum likelihood estimation. In this paper, we propose a simple and efficient discriminative approach that learns GMMs with a large margin criterion to solve the classification problem. Our approach builds on recent work on the Large Margin GMM (LM-GMM), in which each class is modeled by a mixture of ellipsoids and which has shown good results in speech recognition. We propose a simplification of the original algorithm and carry out preliminary experiments on a speaker identification task using NIST-SRE’2006 data. We compare the traditional generative GMM approach, the original LM-GMM algorithm, and our own version. The results suggest that our algorithm outperforms the other two.
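The contrast the abstract draws between a generative criterion and a large margin criterion can be illustrated with a small sketch. This is not the LM-GMM algorithm of the paper or its simplified variant: it assumes diagonal-covariance GMMs, uses average log-likelihood as the score, and applies a plain hinge penalty with a unit margin. All names (`speaker_models`, `margin_hinge_loss`, the toy 2-D features) are hypothetical.

```python
import math

def diag_gauss_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at frame x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def logsumexp(values):
    """Numerically stable log(sum(exp(v)))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))

def gmm_frame_loglik(x, gmm):
    """Log-likelihood of one frame; gmm is a list of (weight, mean, var)."""
    return logsumexp(
        [math.log(w) + diag_gauss_logpdf(x, m, v) for w, m, v in gmm]
    )

def utterance_score(frames, gmm):
    """Average per-frame log-likelihood of an utterance under one model."""
    return sum(gmm_frame_loglik(x, gmm) for x in frames) / len(frames)

def identify(frames, speaker_models):
    """Generative decision rule: pick the highest-scoring speaker model."""
    return max(
        speaker_models,
        key=lambda name: utterance_score(frames, speaker_models[name]),
    )

def margin_hinge_loss(frames, true_speaker, speaker_models, margin=1.0):
    """Penalty when a competitor scores within `margin` of the true speaker."""
    true_score = utterance_score(frames, speaker_models[true_speaker])
    return sum(
        max(0.0, margin + utterance_score(frames, gmm) - true_score)
        for name, gmm in speaker_models.items()
        if name != true_speaker
    )

# Toy two-speaker setup with hypothetical 2-D features.
speaker_models = {
    "spk_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "spk_b": [(0.5, [4.0, 4.0], [1.0, 1.0]), (0.5, [5.0, 5.0], [1.0, 1.0])],
}
frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]

print(identify(frames, speaker_models))
print(margin_hinge_loss(frames, "spk_a", speaker_models))
```

Maximum likelihood training fits each speaker model to its own data only, whereas a margin-based criterion of this kind also looks at competing models: the loss is zero only when the true speaker beats every other speaker by at least the margin.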
Similar resources
Robust text-independent speaker identification using Gaussian mixture speaker models
This paper introduces and motivates the use of Gaussian mixture models (GMM) for robust text-independent speaker identification. The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity. The focus of this work is on applications which require high identification rates using short utterance ...
Speaker Identification Using Discriminative Learning of Large Margin GMM
Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decades. They are generally trained using the generative criterion of maximum likelihood estimation. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we present a new version of this al...
A combination approach of Gaussian mixture models and support vector machines for speaker identification
Gaussian mixture models are commonly used in speaker identification and verification systems. However, owing to their non-discriminative nature, Gaussian mixture models still produce higher identification error rates in evaluation. Partitioning the speaker database into clusters based on some proximity criterion, where only the Gaussian mixture models of a single cluster are run in each test, has been su...
Text-independent speaker identification using Gaussian mixture bigram models
In this paper, a novel speaker modeling technique based on the Gaussian mixture bigram model (GMBM) is introduced and evaluated for text-independent speaker identification (speaker-ID). GMBM is a stochastic framework that explores the context or time dependency of continuous observations from an information source. In view of the fact that speech features are correlated between successive frames, w...
Speaker Identification Using Gaussian Mixture Models
In this paper, the performance of Perceptual Linear Prediction (PLP) features is compared with that of Linear Prediction Coefficient (LPC) features for speaker identification. Two classification techniques, Gaussian Mixture Models (GMM) and Vector Quantization (VQ) with Dynamic Time Warping (DTW), are used to classify speakers based on their speech samples into respec...