Speech Enabled Avatar from a Single Photograph

Authors

  • Dmitri Bitouk
  • Shree K. Nayar
Abstract

This paper presents a complete framework for creating speech-enabled 2D and 3D avatars from a single image of a person. Our approach uses a generic facial motion model which represents deformations of the prototype face during speech. We have developed an HMM-based facial animation algorithm which takes into account both lexical stress and coarticulation. This algorithm produces realistic animations of the prototype facial surface from either text or speech. The generic facial motion model is transformed to a novel face geometry using a set of corresponding points between the generic mesh and the novel face. In the case of a 2D avatar, a single photograph of the person is used as input. We manually select a small number of features on the photograph and these are used to deform the prototype surface. The deformed surface is then used to animate the photograph. In the case of a 3D avatar, we use a single stereo image of the person as input. The sparse geometry of the face is computed from this image and used to warp the prototype surface to obtain the complete 3D surface of the person’s face. This surface is etched into a glass cube using sub-surface laser engraving (SSLE) technology. Synthesized facial animation videos are then projected onto the etched glass cube. Even though the etched surface is static, the projection of facial animation onto it results in a compelling experience for the viewer. We show several examples of 2D and 3D avatars that are driven by text and speech inputs.
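
The abstract says the prototype surface is deformed using a small set of corresponding points, but it does not say which warping method is used. As a rough illustration only, the sketch below moves every prototype vertex by an inverse-distance-weighted blend of the landmark displacements (Shepard interpolation). This technique is a stand-in chosen for brevity, not the authors' method, and the names `warp_mesh`, `src_landmarks`, `dst_landmarks`, and `power` are hypothetical.

```python
import math


def warp_mesh(vertices, src_landmarks, dst_landmarks, power=2.0):
    """Warp prototype vertices toward a novel face from sparse correspondences.

    Assumption: inverse-distance weighting stands in for the (unspecified)
    warping method. Each landmark pair defines a displacement, and every
    vertex receives a weighted average of those displacements.
    """
    # Displacement that carries each prototype landmark onto its target.
    displacements = [
        tuple(d - s for s, d in zip(src, dst))
        for src, dst in zip(src_landmarks, dst_landmarks)
    ]

    warped = []
    for v in vertices:
        weights = []
        exact = None
        for i, src in enumerate(src_landmarks):
            dist = math.dist(v, src)
            if dist == 0.0:
                # The vertex coincides with a landmark: use its displacement as is.
                exact = i
                break
            weights.append(1.0 / dist ** power)

        if exact is not None:
            offset = displacements[exact]
        else:
            total = sum(weights)
            offset = tuple(
                sum(w * disp[k] for w, disp in zip(weights, displacements)) / total
                for k in range(len(v))
            )
        warped.append(tuple(c + o for c, o in zip(v, offset)))
    return warped


# Toy data: two landmarks and one vertex halfway between them.
prototype = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.5, 0.0, 0.0)]
src = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
dst = [(0.0, 0.0, 1.0), (1.0, 0.0, 3.0)]
novel = warp_mesh(prototype, src, dst)
```

Landmark vertices land exactly on their targets, and the midpoint vertex receives the average of the two displacements. A production system would more likely use a smoother interpolant, but the input and output shapes would be the same: a dense prototype mesh in, a dense person-specific mesh out.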

Similar resources

A Text to Visual Speech Instant Messaging System

This paper describes the implementation of a text-to-visual-speech instant messaging system using the Remote Method Invocation (RMI) and graphics functionality of Java, together with synthetic speech via the Microsoft Speech API. Our system allows users to communicate over a low-bandwidth network connection using text that is converted into a realistic talking face. The avatar of each user consis...

Web-enabled 3D talking avatars based on WebGL and HTML5

We describe a system for plugin-free deployment of 3D talking characters on the web. The system employs the WebGL capabilities of modern web browsers in order to produce real-time animation of speech movements, in synchrony with text-to-speech synthesis, played back using HTML5 audio functionality. The implementation is divided into a client and a server part, where the server delivers the audio ...

Input and output modalities used in a sign-language-enabled information kiosk

This paper presents a description and evaluation of the input and output modalities used in a sign-language-enabled information kiosk. The kiosk was developed for experiments on interaction between computers and deaf users. The input modalities are automatic computer-vision-based sign language recognition, automatic speech recognition (ASR) and a touchscreen. The output modalities are presented on a ...

Cartoon-like Avatar Generation Using Facial Component Matching

Avatars are now widely used in games and Internet environments. In particular, video game consoles such as the Wii (Nintendo) use avatars to represent the user's alter ego. There are several ways to generate avatars. Most existing games or Internet services provide manual systems for generating avatars. Many researchers have suggested automatic avatar generation methods, most of which genera...

Exigent: An Automatic Avatar Generation System

Avatars are pervasive in video games and virtual worlds. The automatic generation of these avatars promises to reduce player effort and provide system-defined mappings between “real” (physical) player characteristics and virtual identities. We present an avatar generation system called Exigent; given a photograph of a human face, Exigent creates an avatar. Exigent leverages two recent computer v...

Publication date: 2007