A text-free approach to assessing nonnative intonation
نویسندگان
چکیده
To compensate for the variability in native English intonation and the unpredictability of nonnative speech, we propose a new method of assessing nonnative intonation without any prior knowledge of the target text or phonetics. After recognition of tone events with HMMs and a bigram model of intonation, we define an utterance’s automatic intonation score as the mean of the posterior probabilities for all recognized tone segments. On the ISLE corpus of learners’ English, we find intonation scores generated by this technique have a 0.331 correlation with general pronunciation scores determined by native listeners. In comparison, the SRI Eduspeak system’s proposal for pronunciation scoring based on suprasegmental features derived from prior knowledge of the target text yields a 0.247 correlation with listener scores on a similar corpus. Because it is text-free, our approach could be used to assess intonation outside of a strictly educational application.
منابع مشابه
Testing Suprasegmental English through Parroting
Parroting exercises in a foreign language are designed to make a student’s speech more native-like through imitation of specific native speech templates. In this paper we describe novel template-based methods for automatically estimating subjective scores for both intonation and rhythm in nonnative English. In terms of accuracy when automatically classifying a parroting speaker as a native or a...
متن کاملBetter nonnative intonation scores through prosodic theory
Pronunciation scoring is one important task for software designed to give feedback to students practicing a second language. English intonation can convey information about a speaker’s nativeness, so previous studies have proposed using intonation-based models to score nonnative pronunciation. One past approach trained models for a set of pronunciation scores using ad hoc features derived from ...
متن کاملTowards an intonation module for a portuguese TTS system
In this paper, a correlation between the linguistic structure of the written text and the real intonation behavior of the read speech in European Portuguese language (EP) is presented. It is our belief that intonation behavior in EP can be strongly predicted from two main coordinates: the syntactic structure of the sentence and its pragmatic communicative function, in one way, combined with the...
متن کاملStudy on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملUniversal and Linguistic Features of Expressing Emotional Information: Differentiation in the Perception Level
The emotion in speech is expressed both by universal and specifically linguistic means. Cases of miscommunication between native and nonnative speakers in terms of expressing and interpretation this emotional information occurs mostly due these cultural and linguistic peculiarities. In addition to lexical means, the role of intonation in such cases is enormous. The present paper describes an ex...
متن کامل