Estimating speaker-specific intonation patterns using the linear alignment model
نویسندگان
چکیده
Modeling speaker-specific intonation is important in several areas, including speaker identification, verification, and imitation using text-to-speech synthesis. However the choice of the intonation model and the estimation of its parameters from spontaneous speech remains a challenge. We propose a way to estimate speaker-specific intonation parameters for a particular superpositional model, the Simplified Linear Alignment Model [1], using robust per-utterance and overall statistics of spontaneous speech. We used this method to compare the intonation of children with autism or language impairment, who often have atypical speech prosody, with that of typically developing children. We found significant differences between the groups, which demonstrates the effectiveness of the proposed method.
منابع مشابه
Personality prediction based on intonation stylization
This study’s aim is to predict speaker personality from intonation patterns in spoken dialogs. Intonation patterns were extracted by a parametric superpositional stylization approach that allows for pattern description on a parametric as well as on a categorical level. Based on features derived from these representations we trained support vector machines and fitted generalized linear regressio...
متن کاملEstimating phrase curves in the general superpositional intonation model
Superpositional intonation models posit that the pitch contour, , can be quasi-additively decomposed into component curves such as phrase curves, accent curves, and segmental perturbation curves. Currently, these component curves can only be estimated if one assumes a specific superpositional model, such as the Fujisaki model. A method is proposed for estimating phrase curves that is model-inde...
متن کاملThe Role of Noticing in L2 Learners’ Production of Intonation Patterns
This study was an attempt to explore the role that the increased perceptual saliency of L2 input features or output flaws and hereby promoting L2 learners’ noticing (through planned instructional activities) can play in the learners’ use of correct English intonation patterns. The participants were 80 Iranian EFL students attending four intact classes, two low-intermediate and two upper-interme...
متن کاملModeling dynamic prosodic variation for speaker verification
Statistics of frame-level pitch have recently been used in speaker recognition systems with good results [1, 2, 3]. Although they convey useful long-term information about a speaker’s distribution of f0 values, such statistics fail to capture information about local dynamics in intonation that characterize an individual’s speaking style. In this work, we take a first step toward capturing such ...
متن کاملJapanese intonation synthesis using superposition and linear alignment models
This paper outlines a new approach to Tokyo Japanese intonation synthesis, in which the F0 contour of an utterance is generated using the superposition of multi-level phrase curves and lexical accent curves, coupled with linear alignment models which determine the precise alignment of the curves with the segmental material. We first discuss the construction of a phrase curve used to model the p...
متن کامل