Sub-band based text-dependent speaker verification

نویسندگان

  • Perasiriyan Sivakumaran
  • Aladdin M. Ariyaeeinia
  • Martin Loomes
چکیده

, (p s c S p th cepstral coefficient of the s th sub-bands { c 1 (1,p) = c(p) is the p th full-band cepstral parameter} S number of sub-bands Y(k) k th log spectral magnitude K number of log spectral magnitudes) (k Y ′ ′ k th log-energy outputs of the mel-scale filterbank K ′ ′ number of log-energy outputs of the mel-scale filterbank h t weight associated with the t th segment U number of competing speakers (1−η s) average speaker verification error rate in the s th sub-band R, q number of sub-band-systems Superscript s implies the association of the s th sub-band Superscript u implies the association of the u th competing speaker Superscript r implies the association of the r th sub-band-system ′ implies the association of the complementary full-band cepstral parameters SB-Sub-band based DBSW dynamic band-limited segmental weights MFBO mel-scale filterbank output SNRW signal-to-noise ratio-based weighting factors CFBCC complementary full-band cepstral coefficient SSFCS sub-band system with full-band cepstral supplements MSBSA multiple sub-band-systems analysis FWN filtered white noise RNT real noise type Abstract This paper addresses various issues involved in sub-band based text-dependent speaker verification. The first part of the discussions is concerned with the classification methods. An important issue addressed in this part is the determination of a set of weights which emphasises the sub-bands that are specific to the target speaker while de-emphasising or removing the contaminated ones. In particular, techniques for determining these weights dynamically according to the level of contamination in the sub-bands are described. Furthermore, the effectiveness of these methods is experimentally analysed through a set of comparative studies. The second part of the discussions focuses on the feature extraction process. Analytically, it is shown that for a sub-band system of S bands, the cepstral coefficients with the quefrency of p have a strong linear relationship to the (S×p) th full-band cepstral parameter. With the aid of a set of experimental results, it is demonstrated that this means the conventional classification methods adapted to work with sub-band cepstral parameters may not be able to capture all the useful spectral information contained in the full-band cepstral parameters. In order to tackle this problem, two methods are described and their relative effectiveness is experimentally examined. The experimental investigations also include an examination of speaker discrimination abilities of different sub-bands and an analysis of different possible recombination levels.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The use of sub-band cepstrum in speaker verification

This paper focuses on the spectral representation of the sub-band cepstrum in relation to that of the full-band cepstrum. Through theoretical analysis it is shown that the net spectral information content of the cepstral coefficients with the same index in different sub-bands is only comparable to that of a full-band cepstral parameter whose quefrency is given by the product of that specific in...

متن کامل

Parallel Speaker and Content Modelling for Text-Dependent Speaker Verification

Text-dependent short duration speaker verification involves two challenges. The primary challenge of interest is the verification of the speaker’s identity, and often a secondary challenge of interest is the verification of the lexical content of the pass-phrase. In this paper, we propose the use of two systems to handle these two tasks in parallel with one subsystem modelling speaker identity ...

متن کامل

Improved Data Modeling for Text-Dependent Speaker Recognition Using Sub-Band Processing

A growing body of recent work documents the potential benefits of sub-band processing over wideband processing in automatic speech recognition and, less usually, speaker recognition. It is often found that the subband approach delivers performance improvements (especially in the presence of noise), but not always so. This raises the question of precisely when and how sub-band processing might b...

متن کامل

Sub-band based speaker verification using dynamic recombination weights

This paper describes a new method for generating the recombination weights in sub-band based speaker verification. The approach, which is based on the use of background speaker models, attempts to reduce the effect of any mismatch between the band-limited segments of the test utterance and the corresponding sections in the target speaker model. The discussion also includes an analysis of other ...

متن کامل

Transition-oriented hidden Markov models for speaker verification

In this article, we present a novel mechanism by which more precise voiceprints can be constructed in a typical text-dependent speaker veri cation system based on a continuous density hidden Markov model (HMM). Typical voiceprints (speaker-dependent HMMs) are rst trained using a subscriber's enrollment data. The resulting models are then restructured to permit a modeling of sub-state behavior. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 41  شماره 

صفحات  -

تاریخ انتشار 2003