Multi-lingual Spoken Dialog Translation System Using Transfer-driven Machine Translation
نویسندگان
چکیده
This paper describes a Transfer-Driven Machine Translation (TDMT) system as a prototype for efficient multi-lingual spoken-dialog translation. Currently, the TDMT system deals with dialogues in the travel domain, such as travel scheduling, hotel reservation, and trouble-shooting, and covers almost all expressions presented in commercially-available travel conversation guides. In addition, to put a speech dialog translation system into practical use, it is necessary to develop a mechanism that can handle the speech recognition errors. In TDMT, robust translation can be achieved by using an example-based correct parts extraction (CPE) technique to translate the plausible parts from speech recognition results even if the results have several recognition errors. We have applied TDMT to three language pairs, i.e., Japanese-English, Japanese-Korean, Japanese-German. Simulations of dialog communication between different language speakers can be provided via a TCP/IP network. In our performance evaluation for the translation of TDMT utilizing 69-87 unseen dialogs, we achieved about 70% acceptability in the JE, KJ translations, almost 60% acceptability in the EJ and JG translations, and about 90% acceptability in the JK translations. In the case of handling erroneous sentences caused by speech recognition errors, although almost all translation results end up as unacceptable translation in conventional methods, 69% of the speech translation results are improved by the CPE technique.
منابع مشابه
An Example-Based Method for Transfer-Driven Machine Translation
This paper presents a method called Transfer-Driven Machine Translation (TDMT), which utilizes an example-based framework for various process and combines multi-level knowledge. An example-based framework can achieve quick processing and consistently describe knowledge. It is useful for spoken-language translation, which needs robust and efficient translation. TDMT strengthens the example-based...
متن کاملImproving Performance of Transfer-Driven Machine Translation with Extra-Linguistic Informatioon from Context, Situation and Environment
This paper describes an improvement in the performance of Transfer-Driven Machine Translation ( T D M T ) by the use of extralinguistic information. In evaluating what constitutes natural speech, particular attention was paid to word usage that depended on the extra-linguistic information from the context, situation, environment and so on. We discuss what types of extra-linguistic information a...
متن کاملCross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation
This paper investigates several approaches to bootstrapping a new spoken language understanding (SLU) component in a target language given a large dataset of semantically-annotated utterances in some other source language. The aim is to reduce the cost associated with porting a spoken dialogue system from one language to another by minimising the amount of data required in the target language. ...
متن کاملSpeech{language Integration in a Multi{lingual Speech Translation System
In this paper we report on our e orts to combine speech and language processing toward multi-lingual spontaneous speech translation. The ongoing work extends our JANUS system e ort toward handling spontaneous spoken discourse and multiple languages. A major objective of this project is to maximize the number of modules, methods and data structures that are language-independent and extensible to...
متن کاملUsing Prosodic Cues in Spoken Dialog Systems
In this paper we show how prosody can be used in spoken dialog systems. First, we describe the phenomena that prosodic analysis is concerned with and give examples why prosody is relevant in the context of spoken dialog processing. Then we examine prosody in the light of pattern classification. We show how prosodic events can be categorized. We detail how those prosodic events manifest themselv...
متن کامل