audio visual sign

Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition

2013

Peng Shen Satoshi Tamura Satoru Hayamizu

In this paper, we investigate audio-visual interaction in sparse representation to obtain robust features for audio-visual speech recognition. Firstly, we introduce our system which uses sparse representation method for noise robust audio-visual speech recognition. Then, we introduce the dictionary matrix used in this paper, and consider the construction of audio-visual dictionary. Finally, we ...

متن کامل

A Research on the Fragmented Culture of Iran’s Youth in the Content of Pop Music Lyrics

Journal: مجله مطالعات جامعه شناختی جوانان 2011

Ensiye Sadati Hossein Mirzaii Leila Fathi Mehdi Akbariyan Rahele Kardavani*

This research was done in order to describe the content of proposed lyrics’ concepts in permissible pop music based on published audio works’ list of music publication office in 2009 among 200 albums of music office published audio works, the 10 number of pop singers were randomly selected and the lyrics’ concept of 70 music were also investigated. In the process of data analysis in the first ...

متن کامل

Audio-visual feature selection and reduction for emotion classification

2008

Sanaul Haq Philip J. B. Jackson James D. Edge

Recognition of expressed emotion from speech and facial gestures was investigated in experiments on an audio-visual emotional database. A total of 106 audio and 240 visual features were extracted and then features were selected with Plus l-Take Away r algorithm based on Bhattacharyya distance criterion. In the second step, linear transformation methods, principal component analysis (PCA) and li...

متن کامل

Learning Bimodal Structure in Audio-Visual Data

Journal: :IEEE transactions on neural networks 2009

Gianluca Monaci Pierre Vandergheynst Friedrich T. Sommer

A novel model is presented to learn bimodally informative structures from audio-visual signals. The signal is represented as a sparse sum of audio-visual kernels. Each kernel is a bimodal function consisting of synchronous snippets of an audio waveform and a spatio-temporal visual basis function. To represent an audio-visual signal, the kernels can be positioned independently and arbitrarily in...

متن کامل

Audio, visual, and audio-visual egocentric distance perception by moving participants in virtual environments

2013

Marc Rébillat Xavier Boutillon Étienne Corteel Brian F.G. Katz

A study on audio, visual, and audio-visual egocentric distance perception by moving participants in virtual environments is presented. Audio-visual rendering is provided using tracked passive visual stereoscopy and acoustic wave eld synthesis (WFS). Distances are estimated using indirect blind-walking (triangulation) under each rendering condition. Experimental results show that distances perce...

متن کامل

Neural systems underlying British Sign Language and audio-visual English processing in native users.

Journal: :Brain : a journal of neurology 2002

Mairéad MacSweeney Bencie Woll Ruth Campbell Philip K McGuire Anthony S David Steven C R Williams John Suckling Gemma A Calvert Michael J Brammer

In order to understand the evolution of human language, it is necessary to explore the neural systems that support language processing in its many forms. In particular, it is informative to separate those mechanisms that may have evolved for sensory processing (hearing) from those that have evolved to represent events and actions symbolically (language). To what extent are the brain systems tha...

متن کامل

Audio-visual Person Veriication Audio-visual Person Veriication

1998

Jiri Matas Souheil Ben Yacoub Kenneth Jonsson Josef Kittler

In this paper we investigate bene ts of classi er combination fusion for a multimodal system for personal identity veri cation The system uses frontal face images and speech We show that a sophisticated fusion strategy enables the system to outperform its facial and vocal modules when taken seperately We show that both trained linear weighted schemes and fusion by Support Vector Machine classi ...

متن کامل

A system for audio-visual speech recognition

2005

Islam Shdaifat Rolf-Rainer Grigat

In this work, a system of audio visual speech recognition will be presented. A new hybrid visual feature combination, which is suitable for audio -visual speech recognition was implemented. The features comprise both the shape and the appearance of lips, the dimensional reduction is applied using discrete cosine transform (DCT). A large visual speech database of the German language has been ass...

متن کامل

Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli

Journal: :EURASIP Journal on Advances in Signal Processing 2002

متن کامل

Using audio and visual information for single channel speaker separation

2015

Faheem Khan Ben P. Milner

This work proposes a method to exploit both audio and visual speech information to extract a target speaker from a mixture of competing speakers. The work begins by taking an effective audio-only method of speaker separation, namely the soft mask method, and modifying its operation to allow visual speech information to improve the separation process. The audio input is taken from a single chann...

متن کامل