Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Authors

  • Harold L. Somers
  • David Gareth Evans
  • Zeinab Mohamed
Abstract

Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world’s major languages, but for thousands of other, unsupported languages no such technology is available. While awaiting the development of such technology, we propose using an existing TTS system for a major language (the base language, BL) to “fake” TTS for an unsupported language (the target language, TL). This paper describes the factors that determine the choice of a suitable BL for a given TL, and describes an experiment with a fake Somali TTS system evaluated in the real-life situation of a doctor–patient dialogue. Twenty-eight Somali participants were asked to judge the comprehensibility of 25 short Somali sentences recorded with a German TTS system. Results suggest that “faking it” provides reasonable stop-gap TTS for unsupported languages.

Similar resources

Home-made speech synthesis for non-English-speaking patients

This poster concerns the development of computer-based support for health-care seekers with limited English, focusing on speech synthesis and on languages for which such technology has not been developed. Speech synthesis (or text-to-speech – TTS) systems are available only for the world’s major languages. In the absence of such technology, we want as a stop-gap solution to use an existing TTS ...

Faking it: Synthetic Text-to-speech Synthesis for Under-resourced Languages - Experimental Design

Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world’s major languages, but for thousands of the world’s ‘minor’ languages no such technology is available. While awaiting the development of such technology, we would like to try the stop-gap solution of using an existing TTS system for a major language (the base language) to ‘fake’ TTS for a minor la...

Automatic Speech Recognition for Under-Resourced Languages:

Speech processing for under-resourced languages is an active field of research, which has experienced significant progress during the past decade. We propose, in this paper, a survey that focuses on automatic speech recognition (ASR) for these languages. The definition of under-resourced languages and the challenges associated to them are first defined. The main part of the paper is a literatur...

Towards automatic cross-lingual acoustic modelling applied to HMM-based speech synthesis for under-resourced languages

Nowadays Human Computer Interaction (HCI) can also be achieved with voice user interfaces (VUIs). To enable devices to communicate with humans by speech in the user’s own language, low-cost language portability is often discussed and analysed. One of the most time-consuming parts for the language-adaptation process of VUI-capable applications is the target-language speech-data acquisition. Such ...

The development of new corpora for under-resourced languages using data available for well-resourced ones

In the paper we propose to exploit existing corpora of well-resourced languages as a basis for developing similar corpora of under-resourced ones. The construction of this type of corpora will allow finding common patterns of acoustic manifestation of similar functional states regardless of the language. The analysis of these corpora will also allow investigating universal and language-specific ...

Publication date: 2006