Overlap-add methods for time-scaling of speech
نویسنده
چکیده
In this tutorial on time scaling we follow one particular line of thought towards computationally efficient high quality methods. We favor time scaling based on time-frequency representations over model based approaches, and proceed to review an iterative phase reconstruction method for time-scaled magnitude spectrograms. The search for a good initial phase estimate leads us to consider synchronized overlap-add methods which are further optimized to eventually arrive at WSOLA, a technique based on a waveform similarity criterion. Dans cet exposé sur la modification de la structure temporelle du signal de parole, nous opterons pour l’utilisation des représentations temps-fréquence du signal, plutôt que pour des représentations par modèles. Nous examinerons une méthode itérative permettant de reconstruire une fonction de phase pour spectrogrammes d’ amplitude modifiés. La recherche d’une bonne condition initiale pour démarrer l’iteration nous ammènera aux méthodes de recouvrementaddition synchronisées et notamment à WSOLA, une technique basée sur un critère de ressemblance entre formes d’ondes. Nr. of pages: 23 + 2 title pages Nr. of figures: 7
منابع مشابه
Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals
Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...
متن کاملCHAPTER 15 Time - Domain and Frequency - Domain Techniques for Prosodic Modification of Speech
1. Introductjon 2 General consjderatjons on tjrne-scaling and pjtch-scaling 2.1. Asjrnplemodelforvojcedspeech 2 Tjrne-scalernodificatjon 3 Pjtchl r ifi tj 4 ossjble approaches to prosodic modificatjon 3. The short tjrne Fourjer transforrn and overlap-add synthesjs 3.1. naly js 2 Modifi tjo . 3 Sy th sjs 4. im -scalingtechniques 4 OLAt m -scaling 4.2. y chroniz dOLA rne-scaling 3 WSOLA: An overl...
متن کاملAdjusting the Frame: Biphasic Performative Control of Speech Rhythm
Performative time and pitch scaling is a new research paradigm for prosodic analysis by synthesis. In this paper, a system for real-time recorded speech time and pitch scaling by the means of hands or feet gestures is designed and evaluated. Pitch is controlled with the preferred hand, using a stylus on a graphic tablet. Time is controlled using rhythmic frames, or constriction gestures, define...
متن کاملHigh-Quality Speech Modification Based on Pitch- Synchronous Harmonic and Non-harmonic Modeling of Speech
In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and nonharmonic parts are synthesized ...
متن کاملNon-parametric techniques for pitch-scale and time-scale modification of speech
Time-scale and, to a lesser extent, pitch-scale modifications of speech and audio signals are the subject of major theoretical and practical interest. Applications are numerous, including, to name but a few, text-to-speech synthesis (based on acoustical unit concatenation), transformation of voice characteristics, foreign language learning but also audio monitoring or film/soundtrack post-synch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Speech Communication
دوره 30 شماره
صفحات -
تاریخ انتشار 2000