A new Italian dataset of parallel acoustic and articulatory data
Abstract
In this paper we introduce a new Italian dataset consisting of simultaneous recordings of continuous speech and trajectories of important vocal tract articulators (i.e. tongue, lips, incisors) tracked by Electromagnetic Articulography (EMA). It includes more than 500 sentences uttered in citation condition by three speakers, one male (cnz) and two female (lls, olm), for approximately 2 hours of speech material. The dataset has been designed to be large enough and phonetically balanced so that it can be used in speech applications (e.g. speech recognition systems). We then test our speaker-dependent articulatory Deep-Neural-Network Hidden-Markov-Model (DNN-HMM) phone recognizer on the data recorded from the cnz speaker. We show that phone recognition results are comparable to those we previously obtained using two well-known British-English datasets with EMA data of equivalent vocal tract articulators. This suggests that the new dataset is an equally useful and coherent resource. The dataset is session 1 of a larger Italian corpus, the Multi-SPeaKing-style-Articulatory (MSPKA) corpus, which includes parallel audio and articulatory data in diverse speaking styles (e.g. read, hyperarticulated and hypoarticulated speech). It is freely available at http://www.mspkacorpus.it for research purposes. The whole corpus will be released in the immediate future.
Similar resources
A New Bidirectional Neural Network Model for the Acoustic-Articulatory Inversion Mapping For Speech Recognition
In this paper, a new bidirectional neural network for better acoustic-articulatory inversion mapping is proposed. The model is motivated by the parallel structure of the human brain, which processes information through forward-inverse connections. In other words, there would be a feedback from articulatory system to the acoustic signals emitted from that organ. Inspired by this mechanism, a new bidire...
Acoustic feature learning using cross-domain articulatory measurements
Previous work has shown that it is possible to improve speech recognition by learning acoustic features from paired acoustic-articulatory data, for example by using canonical correlation analysis (CCA) or its deep extensions. One limitation of this prior work is that the learned feature models are difficult to port to new datasets or domains, and articulatory data is not available for most spee...
Analysis of Acoustic-to-Articulatory Speech Inversion Across Different Accents and Languages
The focus of this paper is estimating articulatory movements of the tongue and lips from acoustic speech data. While there are several potential applications of such a method in speech therapy and pronunciation training, performance of such acoustic-to-articulatory inversion systems is not very high due to limited availability of simultaneous acoustic and articulatory data, substantial speaker ...
Relations between acoustic and articulatory measurements of /l/
Variation in the production of English /l/ has received significant study. It has been characterized in terms of categorical allophones, in terms of acoustic properties, and in terms of articulatory timing. Using a parallel corpus of acoustic-articulatory data from two speakers of American English, this study looks at the relations between acoustic and articulatory measurements of /l/ across wo...
Comparing Technical and Economic Efficiency among Organic and Conventional Italian Olive Farms
In many European states such as Spain and Italy there has been a significant growth of organic utilizable surface as a consequence of both a change in the model of agricultural production and also in order to satisfy arising demand of organic food. The purpose of this research was to investigate the level of technical, allocative and economic efficiency in Italian olive farms with two different...