Robust Methodology for TTS Enhancement Evaluation
نویسندگان
چکیده
The paper points to problematic and usually neglected aspects of using listening tests for TTS evaluation. It shows that simple random selection of phrases to be listened to may not cover those cases which are relevant to the evaluated TTS system. Also, it shows that a reliable phrase set cannot be chosen without a deeper knowledge of the distribution of differences in synthetic speech, which are obtained by comparing the output generated by an evaluated TTS system to what stands as a baseline system. Having such knowledge, the method able to evaluate the reliability of listening tests, as related to the estimation of possible invalidity of listening results-derived conclusion, is proposed here and demonstrated on real examples.
منابع مشابه
Explorer Investigating RNN - based speech enhancement methods for noise - robust Text - to - Speech
The quality of text-to-speech (TTS) voices built from noisy speech is compromised. Enhancing the speech data before training has been shown to improve quality but voices built with clean speech are still preferred. In this paper we investigate two different approaches for speech enhancement to train TTS systems. In both approaches we train a recursive neural network (RNN) to map acoustic featur...
متن کاملInvestigating RNN-based speech enhancement methods for noise-robust Text-to-Speech
The quality of text-to-speech (TTS) voices built from noisy speech is compromised. Enhancing the speech data before training has been shown to improve quality but voices built with clean speech are still preferred. In this paper we investigate two different approaches for speech enhancement to train TTS systems. In both approaches we train a recursive neural network (RNN) to map acoustic featur...
متن کاملEnhancement of Robust Tracking Performance via Switching Supervisory Adaptive Control
When the process is highly uncertain, even linear minimum phase systems must sacrifice desirable feedback control benefits to avoid an excessive ‘cost of feedback’, while preserving the robust stability. In this paper, the problem of supervisory based switching Quantitative Feedback Theory (QFT) control is proposed for the control of highly uncertain plants. According to this strategy, the unce...
متن کاملVoicesetting: Voice Authoring UIs for Improved Expressivity in Augmentative Communication
Alternative and augmentative communication (AAC) systems used by people with speech disabilities rely on textto-speech (TTS) engines for synthesizing speech. Advances in TTS systems allowing for the rendering of speech with a range of emotions have yet to be incorporated into AAC systems, leaving AAC users with speech that is mostly devoid of emotion and expressivity. In this work, we describe ...
متن کاملThreshold shifts and enhancement of cortical evoked responses after noise exposure in rats.
The effect of exposure to various types of noise (broadband, high-frequency or low-frequency) was studied in adult pigmented rats. Thresholds and amplitudes of middle latency responses (MLR) recorded from electrodes implanted on the surface of the auditory cortex were analyzed before and after noise exposure. Exposure to noise with intensities ranging from 105 to 120 dB for 1 h produced only te...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013