Practical high-quality speech and voice synthesis using fixed frame rate ABS/OLA sinusoidal modeling

نویسنده

  • E. Bryan George
چکیده

This paper describes algorithms developed to apply the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal modeling system to real-time speech and singing voice synthesis. As originally proposed, the ABS/OLA system is limited to unidirectional timescaling, and relies on variable frame length to accomplish time-scale modification. For speech and voice synthesis applications, unidirectional time scaling makes effective looping to produce sustained vocal sounds difficult, and variable frame length makes real-time polyphonic synthesis problematic. This paper presents a reformulation of the basic ABS/OLA system to deal with these issues, which is termed Fixed-Rate ABS/OLA (ABS/OLA-FR).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech quality improvement in TTS system using ABS/OLA sinusoidal model

In this paper, we propose a novel unit concatenation and synthesis method using ABS/OLA sinusoidal model. Phase succession is used in the unit synthesis assuming that the pitch onset time of the rst frame in a given unit is the frame center. In the unit concatenation, the phase succession and interpolation of the sinusoid amplitudes via several frames around the concatenation point is utilized....

متن کامل

Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis

In this paper, a glottal event synchronous sinusoidal model is proposed. A glottal event corresponds to the glottal closure instant (GCI), which is accurately estimated using group delay and fixed point analysis in the time domain using energy centroids. The GCI synchronous sinusoidal model allows adequate processing according to the inherent local properties of speech, resulting in phase match...

متن کامل

An enhanced ABS/OLA sinusoidal model for waveform synthesis in TTS

This paper describes a method for text-to-speech waveform synthesis based on the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal model. This model has been shown in previous work to be a useful framework for pitch and time-scale modi cation of both speech and music signals. This paper explores some extensions of the original ABS/OLA formulation that attempt to overcome speci c artifacts,...

متن کامل

Speech concatenation and synthesis using an overlap-add sinusoidal model

In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the Analysis-bySynthesis/Overlap-Add (ABS/OLA) sinusoidal model [1, 2, 3], which is capable of performing high quality pitchand time-scale modi cation of both speech and music signals. With the incorporation of concatenation and smoothing techniques,...

متن کامل

Implementing Real - Time MIDI Music Synthesis Algorithms , ABS / OLA , and SMS for the TMS 320 C 32 DSP

This application report describes a real-time MIDI music synthesis system using a low cost digital signal processor (DSP) such as the Texas Instruments (TITM) TMS320C32 in a PC environment. The system consists of a MIDI device with a MIDI interface, an IBM compatible personal computer, and a TMS320C32 development board where the core of the music synthesis engine resides. The MIDI device genera...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998