Automatic Prosody Quality Evaluation of Mandarin Speech

نویسندگان

  • Huibin Jia
  • Jianhua Tao
چکیده

Prosody evaluation is an essential part of computer-aided language learning system. In the paper, we investigate an automatic prosody evaluation method for Mandarin speech. The method is based on prosody comparison between the tested and standard utterance. The prosodic similarities are calculated from three aspects: tone, intonation and rhythm. Based on these similarities, a ranking algorithm named SVOR is proposed and investigated to predict the prosody quality score. The algorithm is evaluated on the collected database and it shows better performance than other well-known algorithms. In addition, detailed analyses of human scoring at the sentence-level are given.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An automatic prosody labeling method for Mandarin speech

A new model-based automatic prosody labeling method for Mandarin speech is proposed. It first introduces four models to describe the relationships of the prosody tags to be labeled, the prosodic features of the speech signals, and the linguistic features of the associated texts. It then employs a sequential optimization procedure to estimate parameters of these four models and find all prosody ...

متن کامل

Prosody Variation: Application to Automatic Prosody Evaluation of Mandarin Speech

Prosody evaluation is an essential part of computer-aided language learning system. In the paper, prosodic variability among inter-speakers is investigated based on a database containing eight repetitions of 200 sentences. For Mandarin of reading style, its variability can be analyzed from rhythm, intonation and tone. Experimental results show that the mean correlation of tone between inter-spe...

متن کامل

Using prosody to improve Mandarin automatic speech recognition

In this paper, these problems of how to model and train Mandarin prosody dependent acoustic model and how to decode input speech based on prosody dependent speech recognition system will be discussed. We use automatic prosody labeling methods to annotate syllable prosodic break type and stress type on continuous speech corpus, and utilize our proposed methods to train prosody dependent tonal sy...

متن کامل

Lexical Tones Learning with Automatic Music Composition System Considering Prosody of Mandarin Chinese

Recent research has found that there is an overlap in the processing of music and speech in certain aspects. This research focuses on the relationship between the pitch of tones in language and the melody of songs. We present an automatic music composition system based on the prosody rules of Mandarin and we hypothesize that songs generated with our proposed system can help non-native Mandarin ...

متن کامل

The VUB Blizzard Challenge 2010 Entry: Towards Automatic Voice Building

In this paper we describe the voices we submitted to the 2010 Blizzard Challenge, a yearly challenge to evaluate auditory speech synthesis on common data. One of the goals of a datadriven synthesizer, such as ours, is to generalize the speech database in such a way that it allows a realistic rendition of unseen input text. The two main changes to our system, compared to previous submissions, ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007