WhiteboardVCR: a Web Lecture Production Tool for Combining Human Narration and Text-to-Speech Synthesis

نویسندگان

  • Ng S. T. Chong
  • Panrit Tosukhowong
  • Masao Sakauchi
چکیده

With rapid advances in computer speech technology, TTS (text -to-speech) synthesis is becoming increasingly attractive as a supplement or even as an alternative to the human narration. This paper explores the potential of TTS synthesis in Web lectures. We propose WhiteboardVCR, a new approach that we have developed for producing and presenting Web lectures. The system supports synchronization of slide markups and slide switching with a narration. The narration is a combination of video, human voice, and speech synthesis. Users can create a presentation on the fly or at leisure time, and edit it before publishing to the Web. Preliminary experimental results of using WhiteboadVCR from the perspective of the audience are discussed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Combining Source, Content, Presentation, Narration, and Relational Representation

In this paper, we try to bridge the gap between different dimensions/incarnations of mathematical knowledge: MKM representation formats (content), their human-oriented languages (source, presentation), their narrative linearizations (narration), and relational presentations used in the semantic web. The central idea is to transport solutions from software engineering to MKM regarding the parall...

متن کامل

مراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی

Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...

متن کامل

AT&T VoiceBuilder: A Cloud-Based Text-to-Speech Voice Builder Tool

The AT&T VOICEBUILDER provides a new tool to researchers and practitioners who want to have their voices synthesized by a high–quality, commercial–grade text-to-speech (TTS) system without the need to install, configure, or manage speech processing software and equipment. It is implemented as a web service on the AT&T Speech Mashup Portal. The proposed system records, processes, and validates u...

متن کامل

Designing a massively multiplayer online role-playing game around text-to-speech

CircumReality is an experimental massively multiplayer online role-playing game (MMORPG) that relies on text-tospeech for both narration and non-player character speech. A game-oriented text-to-speech engine differs significantly from a text-to-speech engine targeted at telephony or mobile devices. This paper discusses some of the differences, such memory and download-size requirements, voicetr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Educational Technology & Society

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2002