Articulatory Inversion of Spanish Speech Signals by means of Machine Learning techniques
نویسندگان
چکیده
The speech inverse problem focuses on retrieving the best articulatory vectors for the task of synthesizing a target voice signal. In this research the problem is approached from a neuromotor level, and by means of some machine learning techniques. Specifically, speech articulators are represented on the midsagittal plane, and controlled by muscular activations grouped in the articulatory vector. The movement of the tongue, by effect of muscular contraction, is derived from a TSK Fuzzy Inference System. On its side, an Echo State Network is used to model the glottal airflow. Later, Continuous Genetic Algorithms evolve populations of articulatory configurations in order to approximate acoustic features of target spanish vowels, and of consonants /m/, /n/, /f/ and /s/. The recovered midsagittal configurations along with subjective tests performed by a group of evaluators, positively verify the effectiveness of these techniques. Key–Words: Speech inverse problem, neuromotor inversion, midsagittal model, fuzzy logic, echo state networks, genetic algorithms.
منابع مشابه
Phonological Awareness Impact on Articulatory Accuracy of the Spanish Liquid [r] in Japanese FL Learners of Spanish
Foreign language learners tend to avoid phonological difficulties and simply transfer sounds whether from their L1 or any pre-existing L2. Phonological awareness (PA) gives students an active role in understanding their own potential in improving pronunciation through several methods. However, such methods are likely to be restricted to only passive learning methods, such as repetition, reading...
متن کاملPhonological Awareness Impact on Articulatory Accuracy of the Spanish Liquid [r] in Japanese FL Learners of Spanish
Foreign language learners tend to avoid phonological difficulties and simply transfer sounds whether from their L1 or any pre-existing L2. Phonological awareness (PA) gives students an active role in understanding their own potential in improving pronunciation through several methods. However, such methods are likely to be restricted to only passive learning methods, such as repetition, reading...
متن کاملMachine learning based Visual Evoked Potential (VEP) Signals Recognition
Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...
متن کاملA Rough Guide to the Acoustic-to-articulatory Inversion of Speech
| This article reviews a speci c speech research area called acoustic-to-articulatory inversion of speech, or speech inversion, which refers to the problem of mapping the acoustic speech signal onto a space describing the conguration of the human vocal tract that actually produced this signal. This space may be modeled in a variety of ways, such as with trajectories of the movement of the artic...
متن کاملP65: Speech Recognition Based on Bbrain Signals by the Quantum Support Vector Machine for Inflammatory Patient ALS
People communicate with each other by exchanging verbal and visual expressions. However, paralyzed patients with various neurological diseases such as amyotrophic lateral sclerosis and cerebral ischemia have difficulties in daily communications because they cannot control their body voluntarily. In this context, brain-computer interface (BCI) has been studied as a tool of communication for thes...
متن کامل