Comparing two different principles of parametric F0 modeling

نویسنده

  • Gregor Möhler
چکیده

A number of data-based approaches to intonation modeling represent F0 movements using continuous parameters. This is contradictory to most intonation theories, which suggest that intonation can be modeled with a set of distinct phonological entities that are phonetically realized as F0 movements. This principle has rarely been incorporated into data-based intonation modeling. In this study we compare two data-based intonation models following the two different principles described above. The first approach uses an F0 parametrization with 6 continuous parametric intonation event (PaIntE) parameters. These parameters are derived by approximating the F0 curve with an appropriate model function. In the second model we apply vector quantization (VQ) to the PaIntE parametrization, resulting in a number of distinct F0 shapes. We found that the VQ model has advantages over the PaIntE model, because the intonation events as a whole can be predicted, rather than individual parameters. Furthermore, phonetic analysis can be performed on the basic shapes represented in the codebook.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Asynchronous F0 and spectrum modeling for HMM-based speech synthesis

This paper proposes an asynchronous model structure for fundamental frequency(F0) and spectrum modeling in HMMbased parametric speech synthesis to improve the performance of F0 prediction. F0 and spectrum features are considered to be synchronous in the conventional system. Considering that the production of these two features is decided by the movement of different speech organs, an explicitly...

متن کامل

Cross-language F0 modeling for under-resourced tonal languages: a case study on Thai-Mandarin

This paper proposed a novel method for F0 modeling in under-resourced tonal languages. Conventional statistical models require large training data which are deficient in many languages. In tonal languages, different syllabic tones are represented by different F0 shapes, some of them are similar across languages. With cross-language F0 contour mapping, we can augment the F0 model of one under-re...

متن کامل

Long-Term F0 Modeling for Text-Independent Speaker Recognition

Long-term F0 modeling for text-independent speaker recognition is considered using both parametric and nonparametric approaches. In the parametric case, mean, variance, skewness, and kurtosis are computed and the parameter vectors are compared using weighted Euclidean distance. In the nonparametric case, F0 distribution is represented by a histogram, and KullbackLeibler distance is used in addi...

متن کامل

Expressive Control of Singing Voice Synthesis Using Musical Contexts and a Parametric F0 Model

Expressive singing voice synthesis requires an appropriate control of both prosodic and timbral aspects. While it is desirable to have an intuitive control over the expressive parameters, synthesis systems should be able to produce convincing results directly from a score. As countless interpretations of a same score are possible, the system should also target a particular singing style, which ...

متن کامل

Discontinuous Observation HMM for Prosodic-Event-Based F0 Generation

This paper examines F0 modeling and generation techniques for spontaneous speech synthesis. In the previous study, we proposed a prosodic-unit HMM where the synthesis unit is defined as a segment between two prosodic events represented by a ToBI label framework. To take the advantage of the prosodicunit HMM, continuous F0 sequences must be modeled from discontinuous F0 data including unvoiced r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999