Speaker Discrimination Ability of Glottal Waveform Features
نویسندگان
چکیده
To measure the extent to which individual glottal features obtained via inverse filtering can be used to discriminate between the speech of different speakers, we test a set of 16 glottal parameters on a 50-speaker subset of the TIMIT corpus in a frame-level pairwise speaker discrimination task. In addition, we compare a vector of glottal parameters to a set of spectral envelope features in the same task. The results showed higher discrimination ability between pairs of male speakers, and an overall inconsistency in feature rankings between genders. Glottal feature vectors were able to discriminate speakers at a much higher rate, and showed some ability to complement spectral envelope features.
منابع مشابه
Multimodal Speaker Verification Based on Electroglottograph Signal and Glottal Activity Detection
To achieve robust speaker verification, we propose a multimodal method which includes additional nonaudio features and glottal activity detector. As a nonaudio sensor an electroglottograph (EGG) is applied. Parameters of EGG signal are used to augment conventional audio feature vector. Algorithm for EGG parameterization is based on the shape of the idealized waveform and glottal activity detect...
متن کاملGlottal Waveform Analysis of Physical Task Stress Speech
Physical task stress affects the acoustic speech wave in various ways. Motivated by observations that fundamental frequency and open quotient are affected by physical task stress, this study examines the effects of physical task stress on a set of glottal features. It is shown that a set of six glottal features can be used for physical stress detection, implying that physical task stress affect...
متن کاملSpeaker Identification Using Glottal-Source Waveforms and Support-Vector-Machine Modelling
Speaker identification experiments are performed with novel features representative of the glottal source waveform. These are derived from closed-phase analysis and inverse filtering. Source waveforms are segmented into two consecutive periods and normalised in prosody, forming so called source-frame feature vectors. Support-vector-machines are used to construct speaker discriminative hyperplan...
متن کاملModeling of the glottal flow derivative waveform with application to speaker identification
Speech production has long been viewed as a linear filtering process, as described by Fant in the late 1950's [10]. The vocal tract, which acts as the filter, is the primary focus of most speech work. This thesis develops a method for estimating the source of speech, the glottal flow derivative. Models are proposed for the coarse and fine structure of the glottal flow derivative, accounting for...
متن کاملImproving Glottal Waveform Rank-based Glottal Qua
Information on the glottal waveform is an important part of many speech applications. However, glottal waveform estimation remains one of the more inexact sciences of speech processing. The work presented here describes an enhancement to a recently presented algorithm by a new technique involving Rank-Based Glottal Quality Assessment (RB-GQA). The basic premise is to investigate potential measu...
متن کامل