Multi-band dysperiodicity analyses of disordered connected speech
Abstract
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the speaker. Previously, this method has been evaluated on small corpora only. In this article, analyses have been carried out on two corpora comprising over 250 and 700 speakers, respectively. This has made it possible to carry out multi-frequency-band and multi-cue analyses without risking over-fitting. The analysis results are compared to the cepstral peak prominence, which is a popular cue that indirectly summarizes vocal dysperiodicities frame-wise. A perceptual rating has been available for
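The idea of a variogram-based dysperiodicity trace and its summary as a signal-to-dysperiodicity ratio can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a plain variogram (no average or energy equalization), a forward lag only, non-overlapping frames, and lag bounds given in samples; the function names `dysperiodicity_trace` and `signal_to_dysperiodicity_ratio` are hypothetical.

```python
import math


def dysperiodicity_trace(x, frame_len, lag_min, lag_max):
    """Estimate a per-sample dysperiodicity trace for the signal x.

    For each non-overlapping frame, search the lag in [lag_min, lag_max]
    that minimizes the summed squared difference between the frame and
    its lagged copy, then keep that difference as the dysperiodicity.
    Simplifying assumption: forward lags only, no gain equalization.
    """
    trace = []
    last_start = len(x) - frame_len - lag_max
    for start in range(0, last_start + 1, frame_len):
        frame = x[start:start + frame_len]
        best_err, best_lag = None, lag_min
        for lag in range(lag_min, lag_max + 1):
            lagged = x[start + lag:start + lag + frame_len]
            err = sum((a - b) ** 2 for a, b in zip(frame, lagged))
            if best_err is None or err < best_err:
                best_err, best_lag = err, lag
        lagged = x[start + best_lag:start + best_lag + frame_len]
        trace.extend(a - b for a, b in zip(frame, lagged))
    return trace


def signal_to_dysperiodicity_ratio(x, trace):
    """Return 10*log10(signal energy / dysperiodicity energy) in dB.

    Only the part of x covered by the trace is used. A trace with zero
    energy (perfectly periodic input) yields infinity.
    """
    signal_energy = sum(v * v for v in x[:len(trace)])
    dys_energy = sum(e * e for e in trace)
    if dys_energy == 0.0:
        return math.inf
    return 10.0 * math.log10(signal_energy / dys_energy)
```

Under these assumptions, a stronger cycle-to-cycle perturbation gives a lower ratio. A multi-band variant would apply the same ratio to band-filtered copies of the signal and of the trace.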
Similar resources
Multi-band and multi-cue analyses of disordered connected speech
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a speech variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the spea...
Multi-band Segmental Signal-to-dysperiodicity Ratios in Connected Speech Produced by Normophonic and Dysphonic Speakers
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the speaker. P...
Multivariate analysis of frame-based acoustic cues of dysperiodicities in connected speech
A generalized variogram is used to extract vocal dysperiodicities in disordered speech produced by dysphonic speakers. Both the signal and the dysperiodicity are passed through a filter bank, and a segmental signal-to-dysperiodicity ratio is defined in each frequency band. Multivariate analysis is carried out to summarize the degree of perceived hoarseness. The predictor variables are the segmental signal-t...
Assessment of vocal dysperiodicities in connected disordered speech
The aim of the presentation is to investigate acoustic analysis of connected speech by means of an average-equalized and energy-equalized variogram to extract vocal dysperiodicities. The variogram enables positioning a current and a lagged analysis frame in adjacent speech cycles to track inter-cycle dysperiodicities. Average and energy equalization of the analysis frames are options that make ...
Combining temporal and cepstral features for the automatic perceptual categorization of disordered connected speech
The objective of the presentation is to report experiments involving the automatic classification of disordered connected speech into multiple (modal, moderately hoarse, severely hoarse) categories. Support vector machines, used for the classification, have been fed with temporal signal-to-dysperiodicity ratios, the first rahmonic amplitude as well as mel-frequency cepstral coefficients. The si...
Journal title:
- Speech Communication
Volume 53 Issue
Pages -
Publication date 2011