Assessment of disordered voices based on an optimized glottal source model
نویسندگان
چکیده
In this paper, a method for the assessment of disordered voices is proposed. A feature named mean opening quotient (MOQ) obtained from the glottal source estimation is used as an acoustic cue to summarize the degree of severity of the voice disorder. The analysis method uses the empirical mode decomposition algorithm to estimate the glottal source excitation signal from the speech signal. The logarithm of the magnitude spectrum of the speech signal is decomposed into oscillatory modes, called intrinsic mode functions, that are clustered into two classes, the spectral envelope and the harmonic component. The exploitation of the phase information jointly with the estimated harmonic component enables the estimation of the glottal source signal. An appropriate parametric model is fitted to the estimated glottal source excitation signal. The optimal parameters of the glottal source excitation model from which the MOQ is defined are obtained by using a genetic algorithm. The presented method is tested on a corpus of natural speech including the vowel [a] uttered by 22 normophonic speakers and 229 speakers with different degrees of dysphonia. Experimental results show that the proposed method is very effective for assessing the degree of severity of the voice disorder.
منابع مشابه
Preliminary glottal source modeling for pathologic voices
A first attempt at implementing a flexible model for the glottal source waveform of pathologic voices is described. The LF (Liljencrants & Fant) model is the source model used. We also add various noise types, shimmer and jitter to the excitation source in order to replicate more closely the pathologic glottal waveform. Various vocal characteristics are then modeled in order to evaluate the per...
متن کاملAutomatic Topology Generation of Glottal Source HMM
We previously proposed the Auto-Regressive Hidden Markov Model (AR-HMM) for speech signal analysis, where the HMM was introduced as a non-stationary glottal source model. In this paper, we propose a novel method that can automatically generate the topology of the Glottal Source Hidden Markov Model (GS-HMM), as well as estimate the AR-HMM parameter obtained by combining the AR-HMM parameter esti...
متن کاملSynthesis of breathy and rough voices with a view to validating perceptual and automatic glottal cycle pattern recognition
The framework of the presentation is the assessment of the ability of human raters or speechprocessing software to detect glottal cycles in speech sounds and measure their lengths in synthetic breathy and rough voices. The synthesis of hoarse voices designates the generation of speech sounds the timbre of which simulates the voice quality of dysphonic speakers. The added value of synthetically ...
متن کاملPhase distortion statistics as a representation of the glottal source: application to the classification of voice qualities
The representation of the glottal source is of paramount importance for describing para-linguistic information carried through the voice quality (e.g., emotions, mood, attitude). However, some existing representations of the glottal source are based on analytical glottal models, which assume strong a priori constraints on the shape of the glottal pulses. Thus, these representations are restrict...
متن کاملAcoustic model and evaluation of pathological voice production
An acoustic model of pathological voice production is presented. It describes the non-linear effects occurring in the acoustic waveform of disordered voices. The noise components such as fundamental frequency and amplitude irregularities and variations, sub-harmonic components, turbulent noise and voice breaks are formally expressed as a result of random time function influences on the excitati...
متن کامل