Learning of stochastic d through a dialog simulati
نویسندگان
چکیده
We present an approach for the learning of stochastic dialog models using a technique of automatic generation of dialogs. We have applied it to achieve a better performance in our dialog system, which answers telephone queries about train timetables in Spanish. Besides interacting with real users, the stochastic dialog manager can now interact with other module, in the role of the user, developing a large number of dialogs at a very low cost. From this interaction, the dialog manager is able to dynamically adapt its stochastic model, adding new transitions or modifying their probabilities, when a simulation ends satisfactorily. We expect that the modified model provides the dialog manager with a better strategy for answering real users than the strategy given by the initial model estimated from real dialogs.
منابع مشابه
Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction
In this work we propose a procedure model for rapid automatic strategy learning in multimodal dialogs. Our approach is tailored for typical task-oriented human-robot dialog interactions, with no prior knowledge about the expected user and system dynamics being present. For such scenarios, we propose the use of stochastic dialog simulation for strategy learning, where the user and system error m...
متن کاملUsing Wizard-of-Oz simulations to bootstrap Reinforcement - Learning based dialog management systems
This paper describes a method for “bootstrapping” a Reinforcement Learningbased dialog manager using a Wizard-ofOz trial. The state space and action set are discovered through the annotation, and an initial policy is generated using a Supervised Learning algorithm. The method is tested and shown to create an initial policy which performs significantly better and with less effort than a handcraf...
متن کاملSpectral decomposition method of dialog state tracking via collective matrix factorization
The task of dialog management is commonly decomposed into two sequential subtasks: dialog state tracking and dialog policy learning. In an end-to-end dialog system, the aim of dialog state tracking is to accurately estimate the true dialog state from noisy observations produced by the speech recognition and the natural language understanding modules. The state tracking task is primarily meant t...
متن کاملThe Effect of Explicit and Implicit Instruction through Plays on EFL Learners’ Speech Act Production
Despite the general findings that address the positive contribution of teaching pragmatic features to interlanguage pragmatic development, the question as to the most effective method is far from being resolved. Moreover, the potential of literature as a means of introducing learners into the social practices and norms of the target culture, which underlie the pragmatic competence, has not been...
متن کاملA stochastic model of human-machine interaction for learning dialog strategies
In this paper, we propose a quantitative model for dialog systems that can be used for learning the dialog strategy. We claim that the problem of dialog design can be formalized as an optimization problem with an objective function reflecting different dialog dimensions relevant for a given application. We also show that any dialog system can be formally described as a sequential decision proce...
متن کامل