Search results for: audio visual sign

Number of results: 469891

2013
Peng Shen Satoshi Tamura Satoru Hayamizu

In this paper, we investigate audio-visual interaction in sparse representation to obtain robust features for audio-visual speech recognition. Firstly, we introduce our system, which uses a sparse representation method for noise-robust audio-visual speech recognition. Then, we introduce the dictionary matrix used in this paper, and consider the construction of an audio-visual dictionary. Finally, we ...

Ensiye Sadati Hossein Mirzaii Leila Fathi Mehdi Akbariyan Rahele Kardavani*

This research describes the concepts expressed in the lyrics of permissible pop music, based on the music publication office's list of audio works published in 2009. From the 200 albums on that list, 10 pop singers were randomly selected, and the lyrical concepts of 70 songs were investigated. In the process of data analysis in the first ...

2008
Sanaul Haq Philip J. B. Jackson James D. Edge

Recognition of expressed emotion from speech and facial gestures was investigated in experiments on an audio-visual emotional database. A total of 106 audio and 240 visual features were extracted, and features were then selected with the Plus l-Take Away r algorithm based on the Bhattacharyya distance criterion. In the second step, linear transformation methods, principal component analysis (PCA) and li...

Journal: IEEE Transactions on Neural Networks 2009
Gianluca Monaci Pierre Vandergheynst Friedrich T. Sommer

A novel model is presented to learn bimodally informative structures from audio-visual signals. The signal is represented as a sparse sum of audio-visual kernels. Each kernel is a bimodal function consisting of synchronous snippets of an audio waveform and a spatio-temporal visual basis function. To represent an audio-visual signal, the kernels can be positioned independently and arbitrarily in...

2013
Marc Rébillat Xavier Boutillon Étienne Corteel Brian F.G. Katz

A study on audio, visual, and audio-visual egocentric distance perception by moving participants in virtual environments is presented. Audio-visual rendering is provided using tracked passive visual stereoscopy and acoustic wave field synthesis (WFS). Distances are estimated using indirect blind-walking (triangulation) under each rendering condition. Experimental results show that distances perce...

Journal: Brain: a journal of neurology 2002
Mairéad MacSweeney Bencie Woll Ruth Campbell Philip K McGuire Anthony S David Steven C R Williams John Suckling Gemma A Calvert Michael J Brammer

In order to understand the evolution of human language, it is necessary to explore the neural systems that support language processing in its many forms. In particular, it is informative to separate those mechanisms that may have evolved for sensory processing (hearing) from those that have evolved to represent events and actions symbolically (language). To what extent are the brain systems tha...

1998
Jiri Matas Souheil Ben Yacoub Kenneth Jonsson Josef Kittler

In this paper we investigate the benefits of classifier combination (fusion) for a multimodal system for personal identity verification. The system uses frontal face images and speech. We show that a sophisticated fusion strategy enables the system to outperform its facial and vocal modules when taken separately. We show that both trained linear weighted schemes and fusion by Support Vector Machine classi ...

2005
Islam Shdaifat Rolf-Rainer Grigat

In this work, a system for audio-visual speech recognition is presented. A new hybrid visual feature combination, which is suitable for audio-visual speech recognition, was implemented. The features comprise both the shape and the appearance of the lips; dimensionality reduction is applied using the discrete cosine transform (DCT). A large visual speech database of the German language has been ass...

2015
Faheem Khan Ben P. Milner

This work proposes a method to exploit both audio and visual speech information to extract a target speaker from a mixture of competing speakers. The work begins by taking an effective audio-only method of speaker separation, namely the soft mask method, and modifying its operation to allow visual speech information to improve the separation process. The audio input is taken from a single chann...
