Perturbation in Gci's and Speech Quality for Pitch Synchronous Synthesis
نویسندگان
چکیده
In pitch synchronous speech synthesis the analysis/synthesis of the speech is done at each glottal closure instant (GCI). The errors in estimation of GCI's affect the quality of the synthesized speech. The effect of random perturbations in the GCI's, obtained from the speech and from glottal signal from an impedance electroglottograph using Childers and Hu's algorithm on the quality of speech synthesized using harmonic plus noise model (HNM), is investigated in this paper. Investigations show that the speech quality is very sensitive to positions of the GCI's. A small perturbation with maximum of 4 % of the local fundamental frequency considerably degrades the synthesized speech. Perturbations above 8 % severely affect quality of the out put speech. GCI's obtained from the glottal signal can afford slightly more perturbation as compared to the GCI's calculated from the speech signal.
منابع مشابه
Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals
Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...
متن کاملDct Based Pitch Modification
In this paper, we propose a novel algorithm for pitch modification. The linear prediction residual is obtained from pitch synchronous frames by inverse filtering the speech signal. Then Discrete Cosine Transform (DCT) is applied on these pitch synchronous frames. Based on the desired factor of pitch modification, the dimension of the DCT vector is changed by truncation or zero padding, and then...
متن کاملHigh-Quality Speech Modification Based on Pitch- Synchronous Harmonic and Non-harmonic Modeling of Speech
In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and nonharmonic parts are synthesized ...
متن کاملSpeech Synthesis in Indian Languages
This paper presents the study of phonemes in the Indian languages for developing good quality speech synthesis. Harmonic plus noise model (HNM) which divides the speech signal in two sub bands: harmonic and noise, is implemented with the objective of studying its capabilities and to investigate the adaptation needed. Childers and Hu's algorithms are used for voicing and pitch detection. As the ...
متن کاملUnit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis
We describe a corpus-based approach to improving synthesized speech quality and present two useful cost functions for unit selection. One is pitch-synchronous cross correlation for concatenation costs to reduce the noise caused by phase mismatch at concatenation points. The other is a discontinuous cost function for internal and concatenation costs to eliminate unnecessary cost calculation. An ...
متن کامل