A Concurrent, Distributed, and Incremental Spoken Dialogue Architecture with a First Application to Prosody
نویسنده
چکیده
Current-day spoken dialogue systems are tedious to interact with (Ward et al. 2005). eir naturalness and (measurable) quality of interaction can be improved through incremental (step-by-step) processing schemes that enable dialogue systems to interact continuously (Baumann 2013). However, incremental models have not yet adequately addressed the challenge of joint decision making and optimization of hypotheses across the multitude of components within a modularized system in real-time, mostly because their data-ows follow simple pipeline approaches. Ad-hoc integration of modules fails completely for distributed systems which are preferred in robotics, for research systems, and in mobile applications. is shortcoming impedes incremental spoken dialogue systems to leverage their full potential. is project proposes to design and implement an architecture for concurrent, distributed incremental processing and knowledge representation for spoken dialogue in which components share their understanding and collaborate on the emergence of desirable dialogue behaviour. e architecture will be applied to (limited) spoken dialogue domains. Prosody and timing are key issues to successful interaction and control dialogue ow, regardless of its content. us, the project will focus on the interaction between speakers on the prosodic level.
منابع مشابه
Integrating prosodic modelling with incremental speech recognition
We describe ongoing and proposed work concerning incremental prosody extraction and classification for a spoken dialogue system. The system described will be tightly integrated with the SDS’s speech recogntion which also works incrementally. The proposed architecture should allow for more control over the user interaction experience, for example allowing more precise and timely end-ofutterance ...
متن کاملCoupling dialogue and prosody computation in spoken dialogue generation
We introduce a concept-to-speech (CTS) system that generates prosodic structure compositionally, in a spoken dialogue agent architecture . Representations from the semantic interpretation, task modeling, and dialogue strategy selection components drive the computation of accentuation, pitch accent type selection, and choice of melodic contour, respectively. These principled couplings of dialogu...
متن کاملA Distributed Architecture for Cooperative Spoken Dialogue Agents with Coherent Dialogue State and History
It has been very difficult to develop spoken dialogue systems with high domain extensibility. Not only the system complexity inevitably increases with the number of topics and domains, but the concurrent topics need to be handled persistently and consistently across different domains. This paper presents a distributed architecture for cooperative spoken dialogue agents with high domain extensib...
متن کاملOn-Line Learning of a Persian Spoken Dialogue System Using Real Training Data
The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...
متن کاملOn-Line Learning of a Persian Spoken Dialogue System Using Real Training Data
The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...
متن کامل