Speech communication and multimodal interfaces
Abstract
Within the area of advanced man-machine interaction, speech communication has played a major role for several decades. The idea of replacing conventional input devices such as buttons and keyboards with voice control, and thus considerably increasing comfort and input speed, seems so attractive that even the rather slow progress of speech technology during those decades could not discourage people from pursuing that goal. Nowadays, however, this area is in a different situation than in those earlier times, and this book section takes the following facts into account. First, speech technology has reached a much higher degree of maturity, mainly through the technique of stochastic modeling, which is briefly introduced in this chapter. Second, other interaction techniques have matured as well, and in the course of that development speech has become one of the preferred modalities of multimodal interaction, e.g. as an ideal complementary mode to pointing or gesture. This is reflected in the subsection on multimodal interaction. Another relatively recent development is the recognition that speech is a carrier not only of linguistic information but also of emotional information, and emotions have become another important aspect of today's advanced man-machine interaction. This is considered in a subsection on affective computing, where the topic is also investigated from a multimodal point of view, taking into account the possibilities for extracting emotional cues from the speech signal as well as from visual information. We believe that such an integrated approach to all of the above-mentioned aspects is appropriate to reflect the newest developments in the field.
Similar resources
Towards a Multimodal Interface for In-Car Communication Systems
The number of cars equipped with an in-car communication system has increased considerably during the past few years. Using a mobile phone whilst driving is a safety-critical task and can cause usability issues. The speech modality has been incorporated in order to allocate hands and eyes solely to the driving task. This paper discusses an investigation into in-car communication systems and ...
Auditory/visual speech in multimodal human interfaces
Program in Experimental Psychology, University of California, Santa Cruz, CA 95064. Abstract: It has long been a hope, expectation, and prediction that speech would be the primary medium of communication between humans and machines. To date, this dream has not been realized. We predict that exploiting the multimodal nature of spoken language will facilitate the use of this medium. We begin our pape...
Computational Simulations of Mediated Face-to-Face Multimodal Communication
Melanie A. Baljko, Doctor of Philosophy, November 2004, Graduate Department of Computer Science, University of Toronto. Individuals who have little or no functional speech due to underlying physical disorder may instead use a computational device, called a Voice Output Communication Aid (VOCA), to produce synthesized speec...
Multimodal Adaptive Interfaces
Our group is interested in creating human machine interfaces which use natural modalities such as vision and speech to sense and interpret a user's actions [6]. In this paper we describe recent work on multimodal adaptive interfaces which combine automatic speech recognition, computer vision for gesture tracking, and machine learning techniques. Speech is the primary mode of communication betwe...
Multimodal Human-Computer Interfaces Editors
The goal of multimodal interfaces is to extend the sensory-motor capabilities of computer systems to better match the natural communication means of human beings. Multimodal interfaces represent a very active interdisciplinary research area which has expanded rapidly. Since the seminal “Put that there” demonstrator by R. Bolt (1980) that combines speech and gesture, significant achievements hav...
10 Gestural Interfaces for Hearing-Impaired Communication
Recent research in Human-Computer Interaction (HCI) has focused on equipping machines with means of communication that are used between humans, such as speech and accompanying gestures. For the hearing impaired, the visual components of speech, such as lip movements, or gestural languages such as sign language are available means of communication. This has led researchers to focus on lip read...