Measuring speech rhythm variation in a model-based framework

نویسنده

  • Plínio A. Barbosa
چکیده

A coupled-oscillators-model-based method for measuring speech rhythm is presented. This model explains crosslinguistic differences in rhythm as deriving from varying degrees of coupling strength between a syllable oscillator and a phrase stress oscillator. The method was applied to three texts read aloud in French, in Brazilian and European Portuguese by seven speakers. The results reproduce the early findings on rhythm typology for these languages/varieties with the following advantages: it successfully accounts for speech rate variation, related to the syllabic oscillator frequency in the model; it takes only syllable-sized units into account, not spliting syllables into vowels and consonants; the consequences of phrase stress magnitude on stress group duration are directly considered; both universal and language-specific aspects of speech rhythm are captured by the model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Statistical analysis of filled pauses’ rhythm for disfluent speech synthesis

Given that state of the art speech synthesis systems have already reached a high naturalness level, it is time to move to talking speech from the actual read speech framework. For this purpose it is thus necessary to investigate how disfluencies can be included in speech synthesis and even increase its naturalness. This paper builds on a previously presented work and focuses on finding a local ...

متن کامل

Statistical analysis of filled pauses2 rhythm for disfluent speech synthesis

Given that state of the art speech synthesis systems have already reached a high naturalness level, it is time to move to talking speech from the actual read speech framework. For this purpose it is thus necessary to investigate how disfluencies can be included in speech synthesis and even increase its naturalness. This paper builds on a previously presented work and focuses on finding a local ...

متن کامل

MARK TATHAM and KATHERINE MORTON COMPUTATIONAL MODELLING OF SPEECH PRODUCTION: ENGLISH RHYTHM

In this paper we examine the treatment of English rhythm from both a theoretical and an experimental perspective. There are major shortcomings in the way not just rhythm but also prosodics in general is modelled; this is all too clear in various applications of the theory, particularly in computational areas such as speech synthesis (Keller and Keller 2002). Our objective is to begin characteri...

متن کامل

Rhythm and Speech Rate: A variation coefficient for C

The percentage of vocalic intervals (%V) and the standard deviation of consonantal intervals (deltaC) in a speech signal are two dimensions according to which languages of different rhythm classes (e.g. stress-timed, syllable-timed) seem to be differentiable on an acoustic level (Ramus et al., 1999). In this context it has been found that especially deltaC varies considerably as a function of s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009