Sensitive Talking Heads
Abstract
A spoken language user interface can dramatically speed up computer use. Unfortunately, if the speech interface gets in the way too often, the user turns it off. Users are unforgiving: a technology that impairs productivity just once may never get a second chance. To give the interface a fighting chance, why not give it a certain amount of emotional sensitivity? Users respond better to an avatar that displays appropriate emotional nuance; conversely, if the avatar detects extreme frustration on the part of the user, it can hide in the corner of the monitor until the frustration has passed. A hidden avatar is still present and can continue to serve the user on request. Audio-only speech synthesis and recognition are now sufficiently accurate to be the foundation for a host of application technologies (see, e.g., the May 2008 issue of Signal Processing Magazine). Automatic recognition and synthesis of emotionally nuanced speech, on the other hand, are still topics of active research. This column describes experiments in emotive spoken language user interfaces. We find that both recognition accuracy and synthesis quality improve when one takes advantage of multimodal information, synthesizing and recognizing information in both the audio and video modalities.
Similar resources
Talking Robots: a Fully Autonomous Implementation of the Talking Heads
The “Talking Robots” experiment, inspired by the “Talking Heads” experiment from Sony, explores how symbols can be grounded in perception through language, using two autonomous Aibo robots in an unconstrained environment. We present here the first results of this experiment and, in the conclusion, outline a planned extension to the grounding of social behaviors.
Talking Heads
This paper describes an interactive presentation that introduces the Talking Heads website, which was originally proposed at the AVSP'97 meeting in Rhodes, Greece. Talking Heads is an effort to bring together information from a wide range of sources. The site provides interactive access to multimodal material in both its original form and as summarized by us. In addition, the authors have provi...
A comparison of German talking heads in a smart home environment
The authors describe a newly developed German Text-To-audiovisual-Speech (TTavS) synthesis system based on the English-speaking HeadZero. Targets for the control parameters of the talking head are generated by mapping German phonemes to the originally English visemic blend-shape controls. The resulting German version of HeadZero and the German talking head MASSY were extended to generate audi...
Cartoon talking heads
We discuss CharToon, an interactive system to design and animate 2D cartoon talking heads. We give illustrations of the expressive and artistic effects which can be produced. It has been used with success by different types of users.
Talking heads and pronunciation training: a review
This special session will include talks describing the use of talking heads in pronunciation training programs for second-language learners and clinical populations. This introductory talk will provide a brief review of the field.
Exploring the Uncanny Valley Effect with talking heads
Here the “Uncanny Valley”, where falling just short of perfection in creating synthetic humans exacts a large negative reaction, is explored with talking head animations focusing on naturalness in speech, face model and face motion. We discuss possible techniques to manipulate naturalness for each of these aspects. Outcomes of this method will provide insights for the choice of the degree of re...