Modelling output probability distributions for enhancing speaker recognition

نویسندگان

  • Jason W. Pelecanos
  • Sridha Sridharan
چکیده

This paper discusses the use of a secondary likelihood classifier scheme for improving speaker recognition performance. The system models the output likelihoods of a typical Gaussian Mixture Model system across multiple speakers. The Output Probability Distributions (OPD) of the primary classifiers contain information on inter-speaker relationships, and are modelled by secondary classifiers to improve recognition accuracies. A comparison of the OPD system with the traditional likelihood ratio and maximum likelihood scoring schemes for verification and identification is performed. Fusion of traditional measures with OPDs is shown to enhance overall recognition performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using output probability distribution for improving speech recognition in adverse environment

This paper proposed a method to improve the accuracy of small vocabulary isolated word speaker-independent speech recognition in adverse environment. The proposed approach is implemented by using Output Probability Distributions (OPDs) and Support Vector Machine (SVM). OPDs improve the system performance by modeling inter-word relationships; then SVM classifiers are used to discriminate the dif...

متن کامل

Fast algorithm for speech recognition using speaker cluster HMM

This paper describes a high speed algorithm for a speech recognizer based on speaker cluster HMM. The speaker cluster HMM, which enables to deal with variety among speakers, have been reported to show good performance. However, the computation amount grows in proportion to the number of clusters, when the speaker cluster HMM is used in speaker independent recognition, where the recognition proc...

متن کامل

Auditory-instrumental forensic speaker recognition

The most prominent part in forensic speech and audio processing is speaker recognition. In the world a number of approaches to forensic speaker recognition (FSR) have been developed, that are different in terms of technical procedures, methodology, instrumentation and also in terms of the probability scale on which the final conclusion is based. The BKA‘s approach to speaker recognition is a co...

متن کامل

Speaker Recognition System Based on GMM Multivariate Probability Distributions built-in a Digital Watermarking Token

The article describes a speaker recognition system based on continuous speech using GMM multivariate probability distributions. A theoretical model of the system including the extraction of distinctive features and statistical modeling is described. The efficiency of the system implemented in the Linux operating system was determined. The system is designed to support the functionality of the P...

متن کامل

Speech recognition using a strong correlation assumption for the instantaneous spectra

The conventional independence assumption made for the evolving speech spectra is replaced by a strong correlation assumption, which then leads to a new stochastic model. This model implements a nonlinear interpolation between the lower and upper bounds of the joint probability distributions. The advantage of the new model over other correlation-based modelling approaches is that it has a low pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999