نتایج جستجو برای: speech synthesis
تعداد نتایج: 514429 فیلتر نتایج به سال:
In this paper, we report the 2015 community-based Signal Separation Evaluation Campaign (SiSEC 2015). This SiSEC consists of four speech and music datasets including two new datasets: “Professionally produced music recordings” and “Asynchronous recordings of speech mixtures”. Focusing on them, we overview the campaign specifications such as the tasks, datasets and evaluation criteria. We also s...
Trill is one of the most difficult sound for speech synthesis due to the complexity of the speech signal. The problem need to be addressed since it is a popular sound in world's languages. Several languages in the multi-language text-to-speech system of Bell Laboratories have this sound in the inventory. This paper reports a simple method that greatly improve the quality of trill for the Italia...
In this paper, we present our ongoing research about the composition of syncretic text for artificial museum guides. During a museum visit, the visitors receive information about the cultural assets and responses to their questions. The aim is to reuse existing texts(for example those already published on the web) to compose responses for visitors that take into account the time at their dispos...
There are many exhaustive works that deal with the use of models for segmental duration. The aim of this paper is to evaluate some of the properties mentioned in literature and evaluate factorial and sum-of-products models in front of a listlike approach for Catalan language as a base for a most exhaustive study on duration in this language. Sum-of-products models for vowels and subsystems of c...
We present two concepts for the generation of gestural scores to control an articulatory speech synthesizer. Gestural scores are the common input to the synthesizer and constitute an organized pattern of articulatory gestures. The first concept generates the gestures for an utteranceusing the phonetic transcriptions, phone durations, and intonation commands predicted by the Bonn Open Synthesis ...
Speech perception involves a number of processes that deal with variation in the speech signal. One such process is normalization for speechrate: local temporal cues are perceived relative to the rate in the surrounding context. It is as yet unclear whether and how this perceptual effect interacts with higher level impressions of rate, such as a speaker’s nonnative identity. Nonnative speakers ...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید