Prosodic features for speaker verification
نویسندگان
چکیده
In this paper we study the effectiveness of prosodic features for speaker verification. We hypothesize that prosody is linked to linguistic units such as syllables and prosodic features can be better represented with reference to the syllabic sequence. For extracting prosodic features, speech is segmented into syllablelike regions using the knowledge of vowel onset points (VOP). We use a technique based on excitation source information to detect VOPs automatically. The location of VOPs serve as reference for extracting prosodic features directly from speech signal. Various parameters are used to represent the pitch and energy dynamics of the region between two consecutive VOPs. The effectiveness of the derived prosodic features for speaker verification is demonstrated on NIST SRE 2003 extended data. The complementary nature of prosodic features and spectral features help to improve the accuracy of the combined speaker verification system.
منابع مشابه
Comparing prosodic models for speaker recognition
Recently, speaker verification systems using different kinds of prosodic features have been proposed. Although it has been shown that most of these speaker verification systems can improve system performance using score-level fusion with stateof-the-art cepstral-based systems, a systematic comparison of the prosodic modelling algorithms used in these prosodic systems has not yet been performed....
متن کاملProsodic features based on wavelet analysis for speaker verification
Most conventional speaker recognition systems rely on short-term spectral information. But they ignore the long-term information such as prosody which also conveys speaker information. In this paper, we propose an approach that extracts prosodic features based on long-term information. First, by making wavelet analysis, we can reveal the trends of the f0 and energy contour. Subsequently, the pr...
متن کاملSpeaker Verification with Shifted Delta Cepstral Features: Its Pseudo-Prosodic Behavior
This paper examines the linear relation between Shifted Delta Cepstral (SDC) features and the dynamic of prosodic features. SDC features were reported to produce superior performance to ∆ features in Language Identification and speaker recognition systems. A selection of more correlated SDC features is used in speaker verification to evaluate its robustness to channel/handset mismatch. The expe...
متن کاملPertinent Prosodic Features for Speaker Identification by Voice
Most existing systems of speaker recognition use “state of the art” acoustic features. However, many times one can only recognize a speaker by his or her prosodic features, especially by the accent. For this reason, the authors investigate some pertinent prosodic features that can be associated with other classic acoustic features, in order to improve the recognition accuracy. The authors have ...
متن کاملEffectiveness of Short-term Prosodic Features for Speaker Verification
In this work a traditional MFCC based speaker verification system is combined with a prosody based one to determine whether simple short-term prosodic information is useful for improving current state-of-theart ASV. The traditional speaker verification system based in spectral information has an EER of 3.85% when using 1024 mixtures. The prosody based system uses short-term intonation and energ...
متن کامل