Towards Articulatory Speech Synthesis with a Dynamic 3D Finite Element Tongue Model
نویسندگان
چکیده
We describe work towards articulatory speech synthesis driven by realistic 3D tissue and bone models. The vocal tract shape is modeled using a fast 3D finite element method (FEM) of a muscle-activated human tongue in conjunction with fixed rigid models of jaw, hyoid and palate connected to a deformable mesh representing the airway. Actuation of the tissue model deforms the airway providing a time-varying acoustic tube which is used for the synthesis of sound. We describe our initial validation of our models geometrically using magnetic resonance images and acoustically using articulatory configurations.
منابع مشابه
Artisynth: an extensible, cross-platform 3d articulatory speech synthesizer
We describe our progress on the construction of a combined 3D face and vocal tract simulator for articulatory speech synthesis called ArtiSynth. The architecture provides six main modules: (1) a simulator engine and synthesis framework, (2) a two and three-dimensional model development component, (3) a numerics engine, (4) a graphical renderer, (5) an audio synthesis engine and (6) a graphical ...
متن کاملArtiSynth: A Biomechanical Simulation Platform for the Vocal Tract and Upper Airway
We describe ArtiSynth, a 3D biomechanical simulation platform directed toward modeling the vocal tract and upper airway. It provides an open-source environment in which researchers can create and interconnect various kinds of dynamic and parametric models to form a complete integrated biomechanical system which is capable of articulatory speech synthesis. An interactive graphical Timeline runs ...
متن کاملTowards an articulatory tongue model using 3D EMA
Within the framework of an acoustic-visual (AV) speech synthesizer, we describe a preliminary tongue model that is both simple and flexible, and which is controlled by 3D electromagnetic articulography (EMA) data through an animation interface, providing realistic tongue movements for improved visual intelligibility. Data from a pilot study is discussed and deemed encouraging, and the integrati...
متن کاملDeveloping Physically-Based, Dynamic Vocal Tract Models using ArtiSynth
We describe the process of using ArtiSynth, a 3D biomechanical simulation platform, to build models of the vocal tract and upper airway which are capable of simulating speech sounds. ArtiSynth allows mass-spring, finite element, and rigid body models of anatomical components (such as the face, jaw, tongue, and pharyngeal wall) to be connected to various acoustical models (including source filte...
متن کاملConstruction and control of a physiological articulatory model.
A physiological articulatory model has been constructed using a fast computation method, which replicates midsagittal regions of the speech organs to simulate articulatory movements during speech. This study aims to improve the accuracy of modeling by using the displacement-based finite-element method and to develop a new approach for controlling the model. A "semicontinuum" tongue tissue model...
متن کامل