نتایج جستجو برای: speech synthesis

تعداد نتایج: 514429  

2015
Nobutaka Ono Zafar Rafii Daichi Kitamura Nobutaka Ito Antoine Liutkus

In this paper, we report the 2015 community-based Signal Separation Evaluation Campaign (SiSEC 2015). This SiSEC consists of four speech and music datasets including two new datasets: “Professionally produced music recordings” and “Asynchronous recordings of speech mixtures”. Focusing on them, we overview the campaign specifications such as the tasks, datasets and evaluation criteria. We also s...

1996
Chilin Shih

Trill is one of the most difficult sound for speech synthesis due to the complexity of the speech signal. The problem need to be addressed since it is a popular sound in world's languages. Several languages in the multi-language text-to-speech system of Bell Laboratories have this sound in the inventory. This paper reports a simple method that greatly improve the quality of trill for the Italia...

2016
Antonio Sorgente Antonio Calabrese Gianluca Coda Paolo Vanacore Francesco Mele

In this paper, we present our ongoing research about the composition of syncretic text for artificial museum guides. During a museum visit, the visitors receive information about the cultural assets and responses to their questions. The aim is to reuse existing texts(for example those already published on the web) to compose responses for visitors that take into account the time at their dispos...

2016
Hüseyin Çakmak Bernard Gosselin Catherine Pelachaud Olivier Debeir Olivier Deroo

1998
Albert Febrer Jaume Padrell Antonio Bonafonte

There are many exhaustive works that deal with the use of models for segmental duration. The aim of this paper is to evaluate some of the properties mentioned in literature and evaluate factorial and sum-of-products models in front of a listlike approach for Catalan language as a base for a most exhaustive study on duration in this language. Sum-of-products models for vowels and subsystems of c...

2007
Peter Birkholz Ingmar Steiner Stefan Breuer

We present two concepts for the generation of gestural scores to control an articulatory speech synthesizer. Gestural scores are the common input to the synthesizer and constitute an organized pattern of articulatory gestures. The first concept generates the gestures for an utteranceusing the phonetic transcriptions, phone durations, and intonation commands predicted by the Bonn Open Synthesis ...

2015
Hans Rutger Bosker Eva Reinisch

Speech perception involves a number of processes that deal with variation in the speech signal. One such process is normalization for speechrate: local temporal cues are perceived relative to the rate in the surrounding context. It is as yet unclear whether and how this perceptual effect interacts with higher level impressions of rate, such as a speaker’s nonnative identity. Nonnative speakers ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید