Crowdsourcing for Spoken Dialog Systems Evaluation
نویسندگان
چکیده
A spoken dialog system (SDS) is a computer system which supports human-computer conversations in specific knowledge domains. It integrates technologies including speech recognition, natural language understanding, dialog modeling, language generation, and textto-speech synthesis. Advances in speech and language technologies have made SDS an important research area and have brought about systems in a wide variety of applications, such as flight information (Hirschman et al. (1993)), bus schedule inquiries (Raux et al. (2005)), stock market information delivery (Meng et al. (2004a)), tourist information (Wu et al. (2006)) and student tutoring (Litman et al. (2006)). To facilitate the development of SDS and compare the performance of different systems, it is necessary to conduct SDS evaluation with appropriate methodologies. A typical SDS architecture is illustrated in Figure 1.1. This implements a dialog interaction between two interlocutors, which consists of a series of dialog turns. A spoken dialog turn is a process in which one participant A utters something to the other B, and B interprets A’s utterance and then responds accordingly. As illustrated in Figure 1.1, the first step for the system is to recognize the user’s speech with automatic speech recognition and interpret the underlying meaning with natural language understanding technologies. This involves extracting the user’s communicative (and informational) goal and inferring the appropriate follow up actions and responses. Language understanding involves a variety of methods, such as the use of parsers and grammars (Seneff (1992); Ward and Issar (1994)), belief networks (Meng et al. (1999, 2004b)), etc. The dialog model is the principal component of a dialog system which maintains the history of the dialog, decides which action is appropriate based on language understanding, and controls the dialog flow. A dialog model typically incorporates dialog states, state transitions, and a dialog policy. A dialog state represent the results of performing system actions in previous states; state transitions allow dialogs to move forward; and the dialog policy determines how
منابع مشابه
Crowdsourcing for situated dialog systems in a moving car
In this paper, we address issues that arise when crowdsourcing data collection of user queries to situated dialog systems in a moving car. Compared to unimodal spoken dialog systems such as systems for smartphones, collecting dialog data for situated dialog systems is more costly because a clear awareness of the physical surroundings is required for the user to make realistic queries. We consid...
متن کاملUsing a Spoken Dialogue System for Crowdsourcing Street-level Geographic Information
We present a novel scheme for enriching geographic database with street-level geographic information that could be useful for pedestrian navigation. A spoken dialogue system for crowdsourcing street-level geographic details was developed and tested in an in-lab experimentation. The system obtained 96.4% of concept values correctly after interacting with the first six of the fifteen users. This ...
متن کاملCrowdsourcing Street-level Geographic Information Using a Spoken Dialogue System
We present a technique for crowdsourcing street-level geographic information using spoken natural language. In particular, we are interested in obtaining first-person-view information about what can be seen from different positions in the city. This information can then for example be used for pedestrian routing services. The approach has been tested in the lab using a fully implemented spoken ...
متن کاملEvaluation of Crowdsourced User Input Data for Spoken Dialog Systems
Using the Internet for the collection of data is quite common these days. This process is called crowdsourcing and enables the collection of large amounts of data at reasonable costs. While being an inexpensive method, this data typically is of lower quality. Filtering data sets is therefore required. The occurring errors can be classified into different groups. There are technical issues and h...
متن کاملReal User Evaluation of Spoken Dialogue Systems Using Amazon Mechanical Turk
This paper describes a framework for evaluation of spoken dialogue systems. Typically, evaluation of dialogue systems is performed in a controlled test environment with carefully selected and instructed users. However, this approach is very demanding. An alternative is to recruit a large group of users who evaluate the dialogue systems in a remote setting under virtually no supervision. Crowdso...
متن کامل