Sub-band based text-dependent speaker verification
نویسندگان
چکیده
, (p s c S p th cepstral coefficient of the s th sub-bands { c 1 (1,p) = c(p) is the p th full-band cepstral parameter} S number of sub-bands Y(k) k th log spectral magnitude K number of log spectral magnitudes) (k Y ′ ′ k th log-energy outputs of the mel-scale filterbank K ′ ′ number of log-energy outputs of the mel-scale filterbank h t weight associated with the t th segment U number of competing speakers (1−η s) average speaker verification error rate in the s th sub-band R, q number of sub-band-systems Superscript s implies the association of the s th sub-band Superscript u implies the association of the u th competing speaker Superscript r implies the association of the r th sub-band-system ′ implies the association of the complementary full-band cepstral parameters SB-Sub-band based DBSW dynamic band-limited segmental weights MFBO mel-scale filterbank output SNRW signal-to-noise ratio-based weighting factors CFBCC complementary full-band cepstral coefficient SSFCS sub-band system with full-band cepstral supplements MSBSA multiple sub-band-systems analysis FWN filtered white noise RNT real noise type Abstract This paper addresses various issues involved in sub-band based text-dependent speaker verification. The first part of the discussions is concerned with the classification methods. An important issue addressed in this part is the determination of a set of weights which emphasises the sub-bands that are specific to the target speaker while de-emphasising or removing the contaminated ones. In particular, techniques for determining these weights dynamically according to the level of contamination in the sub-bands are described. Furthermore, the effectiveness of these methods is experimentally analysed through a set of comparative studies. The second part of the discussions focuses on the feature extraction process. Analytically, it is shown that for a sub-band system of S bands, the cepstral coefficients with the quefrency of p have a strong linear relationship to the (S×p) th full-band cepstral parameter. With the aid of a set of experimental results, it is demonstrated that this means the conventional classification methods adapted to work with sub-band cepstral parameters may not be able to capture all the useful spectral information contained in the full-band cepstral parameters. In order to tackle this problem, two methods are described and their relative effectiveness is experimentally examined. The experimental investigations also include an examination of speaker discrimination abilities of different sub-bands and an analysis of different possible recombination levels.
منابع مشابه
The use of sub-band cepstrum in speaker verification
This paper focuses on the spectral representation of the sub-band cepstrum in relation to that of the full-band cepstrum. Through theoretical analysis it is shown that the net spectral information content of the cepstral coefficients with the same index in different sub-bands is only comparable to that of a full-band cepstral parameter whose quefrency is given by the product of that specific in...
متن کاملParallel Speaker and Content Modelling for Text-Dependent Speaker Verification
Text-dependent short duration speaker verification involves two challenges. The primary challenge of interest is the verification of the speaker’s identity, and often a secondary challenge of interest is the verification of the lexical content of the pass-phrase. In this paper, we propose the use of two systems to handle these two tasks in parallel with one subsystem modelling speaker identity ...
متن کاملImproved Data Modeling for Text-Dependent Speaker Recognition Using Sub-Band Processing
A growing body of recent work documents the potential benefits of sub-band processing over wideband processing in automatic speech recognition and, less usually, speaker recognition. It is often found that the subband approach delivers performance improvements (especially in the presence of noise), but not always so. This raises the question of precisely when and how sub-band processing might b...
متن کاملSub-band based speaker verification using dynamic recombination weights
This paper describes a new method for generating the recombination weights in sub-band based speaker verification. The approach, which is based on the use of background speaker models, attempts to reduce the effect of any mismatch between the band-limited segments of the test utterance and the corresponding sections in the target speaker model. The discussion also includes an analysis of other ...
متن کاملTransition-oriented hidden Markov models for speaker verification
In this article, we present a novel mechanism by which more precise voiceprints can be constructed in a typical text-dependent speaker veri cation system based on a continuous density hidden Markov model (HMM). Typical voiceprints (speaker-dependent HMMs) are rst trained using a subscriber's enrollment data. The resulting models are then restructured to permit a modeling of sub-state behavior. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Speech Communication
دوره 41 شماره
صفحات -
تاریخ انتشار 2003