Evaluating Spoken Language Systems
Abstract
Spoken language systems (SLSs) for accessing information sources or services through the telephone network and the Internet are currently being trialed and deployed for a variety of tasks. Evaluating the usability of different interface designs requires a method for comparing the performance of different versions of an SLS. Recently, Walker et al. (1997) proposed PARADISE (PARAdigm for DIalogue System Evaluation) as a general methodology for evaluating SLSs. The PARADISE framework models user satisfaction with an SLS as a linear combination of measures reflecting both task success and dialogue costs. As a test of this methodology, we applied PARADISE to dialogues collected with three SLSs. This paper describes the salient measures identified using PARADISE within and across the three SLSs, and discusses the generalizability of PARADISE performance models.
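To make the "linear combination" idea concrete, the sketch below fits user satisfaction as a weighted sum of one task-success measure and one dialogue-cost measure. It is an illustration only, not the authors' implementation: the measure names (`task_success`, `turns`), the z-score normalization, the ordinary least-squares fit, and all data values are assumptions chosen for the example.

```python
import numpy as np


def z_normalize(column):
    """Scale one measure to zero mean and unit variance across dialogues."""
    column = np.asarray(column, dtype=float)
    return (column - column.mean()) / column.std()


def fit_performance_model(measures, satisfaction):
    """Least-squares fit of satisfaction on z-normalized measures.

    measures: dict mapping a measure name to its per-dialogue values.
    Returns a dict mapping each measure name (plus "intercept") to a weight.
    """
    names = list(measures)
    X = np.column_stack([z_normalize(measures[n]) for n in names])
    X = np.column_stack([X, np.ones(len(X))])  # constant column for the intercept
    y = np.asarray(satisfaction, dtype=float)
    weights, *_ = np.linalg.lstsq(X, y, rcond=None)
    return dict(zip(names + ["intercept"], weights))


# Hypothetical per-dialogue measures (made up for illustration).
measures = {
    "task_success": [1, 0, 1, 1, 0, 1],   # 1 = task completed
    "turns": [8, 15, 10, 7, 20, 12],      # dialogue cost: number of turns
}

# Synthetic satisfaction scores built from known weights, so the fit
# can be checked: success raises satisfaction, extra turns lower it.
satisfaction = (
    3.0
    + 0.5 * z_normalize(measures["task_success"])
    - 0.3 * z_normalize(measures["turns"])
)

model = fit_performance_model(measures, satisfaction)
for name, weight in model.items():
    print(f"{name}: {weight:.3f}")
```

Because every measure is normalized before the fit, the magnitudes of the fitted weights can be compared directly, which is one way to judge which measures are most salient for a given system.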
Similar resources
Usability Evaluation In Spoken Language Dialogue Systems
The paper first addresses a series of issues basic to evaluating the usability of spoken language dialogue systems, including types and purpose of evaluation, when to evaluate and which methods to use, user involvement, how to evaluate and what to evaluate. We then go on to present and discuss a comprehensive set of usability evaluation criteria for spoken language dialogue systems.
Design and Evaluation of Spoken Dialog Systems
Interactive spoken dialog systems extend the range of automated telecommunication services beyond simple limited-choice form-filling applications to goal-directed tasks covering richer, more complex domains. Creating effective and efficient dialog systems requires not only accurate and robust speech recognition and language modeling, but also iterative, principled design of the user interface ...
Evaluating responsiveness in spoken dialog systems
Ratings of user satisfaction, although fairly easy to elicit for today’s spoken language systems, can be more elusive for systems which operate at near-human levels of performance. This problem can be alleviated by adding a ‘relistening’ phase before eliciting judgements: in this phase the user listens to a recording of himself interacting with the system while consulting a transcript of that i...
Considerations in the design and evaluation of spoken language dialog systems
In this paper we summarize our experience at LIMSI in the design, development and evaluation of spoken language dialog systems for information retrieval tasks. This work has been for the most part carried out in the context of several European and international projects. Evaluation plays an integral role in the development of spoken language dialog systems. While there are commonly used measure...
Spoken language variation over time and state in a natural spoken dialog system
We are interested in adaptive spoken dialog systems for automated services. People’s spoken language usage varies over time for a fixed task, and furthermore varies depending on the state of the dialog. We will characterize and quantify this variation based on a database of 20K user-transactions with AT&T’s experimental ‘How May I Help You?’ spoken dialog system. We then report on a language ad...
Designing and Evaluating Conversational Interfaces with Animated Characters
During the past decade, due largely to progress inspired by the DARPA Speech Grand Challenge project and similar international efforts (Martin et al. 1997; Cole et al. 1997), significant progress has occurred in the development of spoken language technology (SLT). Spoken language systems now are implemented extensively for telephony applications (Spiegel and Kamm 1997), as well as on workstatio...