ProZed : A speech prosody analysis - by - synthesis tool for linguists
نویسنده
چکیده
. This paper describes a tool designed to allow linguists to manipulate the prosody of an utterance via a symbolic representation in order to evaluate linguistic models. Prosody is manipulated via a Praat TextGrid which allows the user to modify the rhythm and melody. Rhythm is manipulated by factoring segmental duration into three components: (i) intrinsic duration determined by phonemic identity (ii) local modifications encoded on the rhythm tier and (iii) global variations of speech rate encoded on the intonation tier. Melody is similarly determined by tonal segments on the tonal tier (= pitch accents) and on the intonation tier (= boundary tones) together with global parameters of key and span determining changes of pitch register. The TextGrid is used to generate a Manipulation object which can be used either for immediate interactive assessment of the prosody determined by the annotation, or to generate synthesised stimuli for more formal perceptual experiments.
منابع مشابه
Prozed: a Multilingual Prosody Editor for Speech Synthesis
1. Introduction It is generally agreed today that the single most important progress which needs to be made to improve the quality and naturalness of synthetic speech is towards a better understanding and control of prosody. This is true even for those languages which have been the object of considerable research (e.g. English, French, German, Japanese, …)-it is obviously still more true for th...
متن کاملAnalysis by synthesis of speech prosody: the Prozed environment
This paper presents ProZed, an environment for the multilingual analysis by synthesis of speech prosody. The analysis is based on the symbolic representation of prosodic form without reference to prosodic function. The parameters of the model are at present limited to fundamental frequency and duration but the same framework could be extended to accomodate other parameters such as spectral tilt...
متن کاملSPeech Phonetization Alignment and Syllabification (SPPAS): a tool for the automatic analysis of speech prosody
SPASS, SPeech Phonetization Alignment and Syllabification, is a tool to automatically produce annotations which include utterance, word, syllable and phoneme segmentations from a recorded speech sound and its transcription. SPPAS is currently implemented for French, English, Italian and Chinese and there is a very simple procedure to add other languages. The tool is developed for Unix based pla...
متن کاملAn Analysis-by-Synthesis Study of Mandarin Speech Prosody
In the present paper an analysis by synthesis study of mandarin speech prosody is carried out. The mandarin prosodic features are discussed from two salient perspectives, specifically: the function of prosody and the form of prosody. The symbolic representation of prosodic form with the INTSINT (INternational Transcription System for INTonation) system [1] reduces the surface complexity of a pr...
متن کاملProviding linguists with better tools: Daniel Hirst’s contribution to prosodic annotation
Among Daniel Hirst’s contributions to speech prosody, this article addresses those concerned with the development of automatic tools for prosodic annotation. One of Daniel’s concerns has been to facilitate the task of phoneticians and linguists involved in developing efficient rules for the analysis and synthesis of speech prosody. This presentation describes some of his innovative and fruitful...
متن کامل