Recent advances in WFST-based dialog system
نویسندگان
چکیده
To construct an expandable and adaptable dialog system which handles multiple tasks, we proposed a dialog system using a weighted finite-state transducer (WFST) in which users concept and system action tags are input and output of the transducer, respectively. To test the potential of the WFSTbased dialog management (DM) platform using statistical DM models, we constructed a dialog system using a human-tohuman spoken dialog corpus for hotel reservation, which is annotated with Interchange Format (IF). A scenario WFST and a spoken language understanding (SLU) WFST were obtained from the corpus and then composed together and optimized. We evaluated the detection accuracy of the system next actions. In this paper, we focus on how WFST optimization operations contribute to the performance of the system. In addition, we have constructed a full WFST-based dialog system by composing SLU, scenario and sentence generation (SG) WFSTs. We show an example of a hotel reservation dialog with the fully composed system and discuss future work.
منابع مشابه
Expansion of WFST-Based Dialog Management for Handling Multiple ASR Hypotheses
We proposed a weighted finite-state transducer-based dialog manager (WFSTDM) which was a platform for expandable and adaptable dialog systems. In this platform, all rules and/or models for dialog management (DM) are expressed in WFST form, and the WFSTs are used to accomplish various tasks via multiple modalities. With this framework, we constructed a statistical dialog system using the user co...
متن کاملDialog management using weighted finite-state transducers
We are aiming to construct an expandable and adaptable dialog system which handles multiple tasks and senses users’ intention via multiple modalities. A flexible platform to integrate different dialog strategies and modalities is indispensable for this purpose. In this paper, we propose an efficient approach to manage a dialog system using a weighted finitestate transducer (WFST) in which users...
متن کاملLVCSR System on a Hybrid GPU-CPU Embedded Platform for Real-Time Dialog Applications
We present the implementation of a largevocabulary continuous speech recognition (LVCSR) system on NVIDIA’s Tegra K1 hyprid GPU-CPU embedded platform. The system is trained on a standard 1000hour corpus, LibriSpeech, features a trigram WFST-based language model, and achieves state-of-the-art recognition accuracy. The fact that the system is realtime-able and consumes less than 7.5 watts peak ma...
متن کاملWFSTDM Builder – Network-based Spoken Dialogue System Builder for Easy Prototyping
This paper introduces a network-based spoken dialog system development tool kit: WFSTDM Builder developed by NICT. WFSTDM Builder provides functions to share and edit SLU and scenario so that developers can create a wfst-based spoken dialogue system instantly with this tool. One can test the scenario by accessing to the servers connected such as ASR, TTS and WFSTDM server via not only the tool’...
متن کاملRapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems
Language understanding (LU) modules for spoken dialogue systems in the early phases of their development need to be (i) easy to construct and (ii) robust against various expressions. Conventional methods of LU are not suitable for new domains, because they take a great deal of effort to make rules or transcribe and annotate a sufficient corpus for training. In our method, the weightings of the ...
متن کامل