Seed models combination and state level mappings of cross-lingual transfer for rapid HMM development: from English to Mandarin

نویسندگان

  • Xufang Zhao
  • Douglas D. O'Shaughnessy
چکیده

The practice of cross-lingual transfer of speech technology is of increasing concern as the demand for recognition systems in multiple languages grows. Previous work has proved that: firstly, if there is a big difference between the source language and the target language, cross-lingual adaptation may not outperform scratch training even if the training set is limited; secondly, cross-lingual seed models achieve lower word error rates than flat starts or random models. Based on these two points, this paper improved the approach of generating cross-lingual seed models through two ways: (1) using a combination of cross-lingual seed models and “flatstart” models, and (2) using phoneme mappings on an HMM state level in a new language to reduce mismatched coarticulation. All simulations are carried out with limited available training data, and the effectiveness of the approach was proved using English-Mandarin as a test case.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-lingual Speaker Adaptation for Hmm-based Speech Synthesis

This paper explores a cross-lingual speaker adaptation technique for HMM-based speech synthesis, where a source voice model for English is transformed into a target speaker model using Mandarin Chinese speech data from the target speaker. A phone mappingbased method is adopted to map Chinese Initial/Finals into English phonemes and two types of mapping rules, including one-to-one and one-to-seq...

متن کامل

Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

In the EMIME project, we developed a mobile device that performs personalized speech-to-speech translation such that a user’s spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user’s voice. We integrated two techniques into a single architecture: unsupervised adaptation for HMM-based TTS using word-based large-vocabulary contin...

متن کامل

Cross-Linguistic Transfer Revisited: The Case of English and Persian

The present study sought to investigate the evidence for cross-linguistic transfer in a partial English immersion and non-immersion educational setting. To this end, a total of 145 first, third and fifth graders in a partial English immersion program and 95 students from the same grade levels in a non-immersion program were chosen. Six different English and Persian tests were administered: the ...

متن کامل

L1 Transfer in L2 Acquisition of the There-Insertion Construction by Mandarin EFL Learners

This study examined the role of the native language (L1) transfer in a non-native language (L2) acquisition of the there-insertion construction at the syntax-semantics interface. Specifically, the study investigated if Mandarin EFL learners would make overgeneralization errors in the situation where an L1 argument structure constitutes a superset of its L2 counterpart. Verbs of existence and ap...

متن کامل

Coarse-grained Cross-lingual Alignment of Comparable Texts with Topic Models and Encyclopedic Knowledge

We present a method for coarse-grained cross-lingual alignment of comparable texts: segments consisting of contiguous paragraphs that discuss the same theme (e.g. history, economy) are aligned based on induced multilingual topics. The method combines three ideas: a two level LDA model that filters out words that do not convey themes, an HMM that models the ordering of themes in the collection o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008